JPH09212311A

JPH09212311A - Disk array device

Info

Publication number: JPH09212311A
Application number: JP8015646A
Authority: JP
Inventors: Fumitoshi Uno; 文敏宇野
Original assignee: EKUSHINGU KK; Brother Industries Ltd; Xing Inc
Current assignee: EKUSHINGU KK; Brother Industries Ltd; Xing Inc
Priority date: 1996-01-31
Filing date: 1996-01-31
Publication date: 1997-08-15
Also published as: TW436681B

Abstract

PROBLEM TO BE SOLVED: To prevent the speed of a disk array device from being influenced as a whole by the delay of operation end of a disk device due to error correction. SOLUTION: A disk array controller 35 divides data transferred from a main computer 10 into prescribed units in parallel, successively writes the divided data in the same sectors of respective disk devices A31, B32, C33, generates parities corresponding to their data, and writes the parities in a disk device D 34. When a respone from one of the disk devices 31 to 33 is delayed at the time of reading out data, data from the disk device delaying the response are generated based upon the data read out from the other two disk devices and the parity data read out from the disk device 34.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、複数のディスク装
置を並列に同時動作させることで、メインコンピュータ
からの要求に対して見かけ上１台のディスク装置に見せ
かけて応答するようにしたディスクアレイ装置に関する
ものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a disk array device in which a plurality of disk devices are simultaneously operated in parallel so as to respond to a request from a main computer by apparently responding to one disk device. It is about.

【０００２】[0002]

【従来の技術】従来、ディスクアレイ装置では、ディス
クアレイ装置内の複数のディスク装置の動作が全て終了
するのを待って、メインコンピュータ側にデータを送出
していた。ここで、エラー訂正機能があるディスクアレ
イ装置の場合は、何台かのディスク装置がエラーを起こ
した場合、エラー訂正が可能であればエラー訂正をして
からメインコンピュータにデータを転送する。この場
合、メインコンピュータはディスクアレイ装置内のエラ
ーを関知することなく動作を続けられる。2. Description of the Related Art Conventionally, in a disk array device, data is sent to the main computer side after all the operations of a plurality of disk devices in the disk array device are completed. Here, in the case of a disk array device having an error correction function, when an error occurs in some of the disk devices, if error correction is possible, error correction is performed and then data is transferred to the main computer. In this case, the main computer can continue to operate without being aware of an error in the disk array device.

【０００３】このようなディスクアレイ装置の利点は、
主に次の３点である。まず１番目として大容量化の実現
である。ディスク装置の限界容量はその時々の技術進歩
によって常に変化していくが、いつの時代にも１台のデ
ィスク装置についての限界容量がある。例えば本願の出
願時点での限界容量は４Ｇ（ギガ）バイト程度であり、
さらに量産品となるとその容量は２Ｇバイト程度であ
る。上述した４Ｇバイトの限界容量のディスク装置は量
産できないので、一般的に高価でかつ信頼性が低い。The advantage of such a disk array device is that
There are three main points. First is the realization of large capacity. Although the limit capacity of a disk device is constantly changing with technological progress, there is a limit capacity for one disk device at any given time. For example, the limit capacity at the time of filing of this application is about 4 G (giga) bytes,
Further, when it becomes a mass-produced product, its capacity is about 2 GB. Since the above-mentioned disk device having a limit capacity of 4 GB cannot be mass-produced, it is generally expensive and low in reliability.

【０００４】それに対して量産品は製品が安定している
ので、一般的に安価でかつ信頼性が高い。例えば３台の
ディスク装置を備えるディスクアレイ装置において、各
ディスク装置に量産品の２Ｇバイトのディスク装置を使
用すると、見かけ上６Ｇバイトのハードディスクができ
上がる。量産品のディスク装置を寄せ集めることで限界
容量よりも大きいディスク装置を作り出せることになる
ため、信頼性が高く、かつ大容量のものが得られる。ま
た、限界容量の４Ｇバイトのディスク装置を使用する
と、その時点では製造不可能と考えられる１２Ｇバイト
のディスク装置を作ることが可能になる。On the other hand, since mass-produced products are stable, they are generally inexpensive and highly reliable. For example, in a disk array device including three disk devices, if a mass-produced 2 GB disk device is used for each disk device, an apparent 6 GB hard disk is completed. Since mass-produced disk devices can be collected to create a disk device having a capacity larger than the limit capacity, a highly reliable and large-capacity disk device can be obtained. Further, if a disk device of 4 Gbytes having a limit capacity is used, it becomes possible to make a disk device of 12 Gbytes which cannot be manufactured at that time.

【０００５】２番目としては処理速度の向上が挙げられ
る。ディスク装置は物理的な動作部分を多数持つ装置で
あるため、その動作スピードを向上させるのは容易では
ない。これは、ディスクの回転スピード、読み書きヘッ
ドの移動スピード、ヘッド移動後ヘッド位置が安定する
までに必要な時間等の物理的要因が多数あるためであ
る。ところが、ディスクアレイ装置は、本来１台のディ
スク装置に書き込まれるはずのデータが複数台のディス
ク装置に分散して書き込まれるため、トータルで考えた
場合、１台のディスク装置に対する読み書きの頻度や読
み出すデータ量が低減されるため、全体として見れば処
理速度の向上が図られるのである。The second reason is improvement in processing speed. Since the disk device is a device having many physical operation parts, it is not easy to improve the operation speed. This is because there are many physical factors such as the rotation speed of the disk, the moving speed of the read / write head, and the time required for the head position to stabilize after moving the head. However, in the disk array device, since data that should originally be written in one disk device is distributed and written in a plurality of disk devices, when considered in total, the frequency of reading and writing to one disk device and reading Since the amount of data is reduced, the processing speed can be improved as a whole.

【０００６】[0006]

【発明が解決しようとする課題】ところで、従来のディ
スクアレイ装置では、メインコンピュータから読み込み
等の要求が来た場合には、アクセス対象となっている複
数のディスク装置の動作が全て終了するのを待って、そ
れらから得たデータをメインコンピュータ側に送出して
いた。ここで、エラー訂正機能があるディスクアレイ装
置の場合は、何台かのディスク装置がエラーを起こした
場合、エラー訂正が可能であればエラー訂正をしてから
メインコンピュータにデータを転送していた。By the way, in the conventional disk array apparatus, when a request for reading or the like is received from the main computer, the operation of the plurality of disk apparatuses to be accessed is completed. Waiting, I sent the data from them to the main computer. Here, in the case of a disk array device having an error correction function, if an error occurs in some of the disk devices, if the error correction is possible, the error is corrected and then the data is transferred to the main computer. .

【０００７】しかしながら、エラー発生時には、次のよ
うな手順でそのエラーが判ることとなる。つまり、各デ
ィスク装置でそれぞれ読出指令に応じてデータを読み出
そうとするのであるが、読み出せないと自動的にリトラ
イリード処理を実行した後、結局はリトライオーバーと
なってエラーとなることが多い。このためエラーが発生
するディスク装置は、正常終了するディスク装置より
も、動作終了までの時間が余分にかかることが多い。However, when an error occurs, the error will be known by the following procedure. In other words, each disk device tries to read data in response to a read command, but if it cannot read the data, it may automatically execute retry read processing, and eventually a retry over may result in an error. Many. For this reason, a disk device in which an error occurs often takes more time to complete its operation than a disk device that normally ends.

【０００８】そのような理由から、従来のディスクアレ
イ装置の場合、装置内のディスク装置のうち、１台でも
エラーが発生した場合、結果的にエラー訂正してデータ
修復ができたとしても、ディスクアレイ装置全体の動作
スピードに影響を与えることになってしまう。特にメイ
ンコンピュータからの指令に対するレスポンスを低下さ
せてしまうという不都合が存在している。For this reason, in the case of the conventional disk array device, even if one of the disk devices in the device has an error, even if the error is corrected and the data can be restored as a result, the disk can be recovered. This will affect the operation speed of the entire array device. In particular, there is an inconvenience that the response to the command from the main computer is lowered.

【０００９】本発明は、上述した問題点を解決するため
になされたものであり、エラー発生等の理由で動作終了
が相対的に遅れるディスク装置がエラー訂正の面での許
容範囲内数であれば、そのディスク装置の動作終了の遅
れを、ディスクアレイ装置全体のスピードには影響させ
ないようにすることを目的とする。The present invention has been made in order to solve the above-mentioned problems, and the number of disk devices whose operation is relatively delayed due to the occurrence of an error may be within a permissible range in terms of error correction. For example, it is an object of the present invention to prevent the delay of the operation end of the disk device from affecting the speed of the entire disk array device.

【００１０】[0010]

【課題を解決するための手段】この目的を達成するため
になされた請求項１記載の発明は、ディスクアレイコン
トローラによって複数台のディスク装置を制御し、メイ
ンコンピュータからのアクセスに対しては見かけ上１台
のディスク装置に見せかけて応答するよう構成され、さ
らに、前記ディスク装置のデータエラー発生に対するエ
ラー訂正機能を備えたディスクアレイ装置において、前
記ディスクアレイコントローラは、前記メインコンピュ
ータからの要求に応じて複数台のディスク装置に読出指
令を出した際、それら複数台のディスク装置の内の一部
からの応答が前記指令を出してから所定時間経過しても
ない場合には、その応答がないディスク装置に対応する
データを前記エラー訂正機能によって作成し、前記メイ
ンコンピュータに、要求されたデータを送出するよう構
成されていることを特徴とするディスクアレイ装置であ
る。SUMMARY OF THE INVENTION In order to achieve this object, the invention according to claim 1 controls a plurality of disk devices by a disk array controller and apparently makes an access from a main computer. In a disk array device that is configured to respond by pretending to be one disk device, and further has an error correction function for data error occurrence of the disk device, the disk array controller responds to a request from the main computer. When a read command is issued to a plurality of disk devices, if a response from a part of the plurality of disk devices does not reach a predetermined time after issuing the command, the disk that does not respond Data corresponding to the device is created by the error correction function, and stored in the main computer. It is a disk array device according to claim that is configured to deliver the requested data.

【００１１】本発明のディスクアレイ装置は、ディスク
アレイコントローラによって複数台のディスク装置を制
御し、メインコンピュータからのアクセスに対しては見
かけ上１台のディスク装置に見せかけて応答することが
できるため、上記従来技術の説明においてディスクアレ
イ装置の利点として述べたように、大容量化の実現や処
理速度の向上を図ることができる。そして、ディスク装
置のデータエラー発生に対するエラー訂正を行ってデー
タを修復することも可能であるため、信頼性の向上も図
ることができる。In the disk array device of the present invention, a plurality of disk devices can be controlled by the disk array controller, and an access from the main computer can be made to appear to be one disk device and can respond. As described above as an advantage of the disk array device in the description of the prior art, it is possible to realize a large capacity and an improvement in processing speed. Further, since it is possible to correct the data by correcting the occurrence of the data error in the disk device, it is possible to improve the reliability.

【００１２】このような基本的な作用・効果に加えて、
本発明のディスクアレイ装置では、ディスクアレイコン
トローラが、メインコンピュータからの要求に応じて複
数台のディスク装置に読出指令を出した際、それら複数
台のディスク装置の内の一部からの応答が指令を出して
から所定時間経過してもない場合には、そのディスク装
置からの応答を待たずに、そのディスク装置に対応する
データをエラー訂正機能によって作成し、メインコンピ
ュータに、要求されたデータを送出する。In addition to such basic functions and effects,
In the disk array device of the present invention, when the disk array controller issues a read command to a plurality of disk devices in response to a request from the main computer, a response from some of the plurality of disk devices is a command. If the specified time has not passed since the message was issued, the data corresponding to the disk device is created by the error correction function without waiting for the response from the disk device, and the requested data is sent to the main computer. Send out.

【００１３】これは、所定時間を経過しても応答がない
場合には、そのディスク装置は、例えば最初のリード時
にはエラーにより読み出せなかったためリードリトライ
処理を実行しているため動作終了が遅れているものとみ
なし、そのディスク装置の動作終了を待たずに、その他
のディスク装置から得たデータを基にした所定のエラー
訂正によって該当するディスク装置に対応するデータを
作成するようにしたのである。こうすることで、エラー
リトライ等を実行している等の理由で動作終了が遅れて
いるディスク装置があったとしても、それを無視してエ
ラー訂正機能でメインコンピュータへ転送するデータを
作り出すことができ、ディスクアレイ装置全体の動作ス
ピードには影響を与えずに動作を継続することができ
る。This is because if there is no response even after the lapse of a predetermined time, the disk drive is executing read retry processing because it could not be read due to an error during the first read, so the operation end is delayed. Therefore, the data corresponding to the corresponding disk device is created by a predetermined error correction based on the data obtained from the other disk device without waiting for the end of the operation of the disk device. By doing so, even if there is a disk device whose operation is delayed due to execution of error retry, etc., it is possible to ignore it and create data to be transferred to the main computer by the error correction function. Therefore, the operation can be continued without affecting the operation speed of the entire disk array device.

【００１４】つまり、従来のディスクアレイ装置の場
合、複数のディスク装置の内の１台でもエラーが発生す
ると、そのディスク装置だけリードリトライ等の処理に
よって正常終了するディスク装置よりも動作終了までに
時間が余分にかかってしまっていた。そのため、結果的
にエラー訂正してデータ修復をしたとしても、ディスク
アレイ装置全体の動作スピードに影響を与えることにな
ってしまい、特にメインコンピュータからの指令に対す
るレスポンスを低下させていた。In other words, in the case of the conventional disk array device, when an error occurs even in one of the plurality of disk devices, it takes longer than the disk device that normally ends by the processing such as read retry, etc. until the operation ends. Was overloaded. Therefore, even if the error is corrected and the data is repaired as a result, the operation speed of the entire disk array device is affected, and particularly the response to the command from the main computer is lowered.

【００１５】それに対して、本発明のディスクアレイ装
置によれば、そのようなレスポンスを低下させるディス
ク装置からの応答を待たないで処理してしまうので、デ
ィスクアレイ装置全体のレスポンスを低下させないので
ある。なお、上述したように、応答がないディスク装置
に対応するデータをエラー訂正機能によって作成し、メ
インコンピュータに要求されたデータを送出した後、再
度メインコンピュータから次の読み出し要求が来た場合
に、上述のリードリトライ等を実行しているディスク装
置の動作がまだ終了していない場合も考えられる。その
ような場合には、当該ディスク装置に対しては読出指令
自体を出すことができないので、請求項２に示すよう
に、残った（正常と考えられる）ディスク装置のみに読
出指令を出し、やはりエラー訂正機能で、上記読出指令
が出せなかったディスク装置に対応するデータを作り出
し、メインコンピュータに要求されたデータを送出する
ようにすることが考えられる。On the other hand, according to the disk array apparatus of the present invention, the response from the disk apparatus which lowers the response is processed without waiting, so that the response of the entire disk array apparatus is not lowered. . As described above, when the data corresponding to the disk device that does not respond is created by the error correction function, the requested data is sent to the main computer, and then the next read request comes from the main computer again, It is also conceivable that the operation of the disk device that is executing the above-mentioned read retry or the like has not been completed. In such a case, the read command itself cannot be issued to the disk device, so as shown in claim 2, the read command is issued only to the remaining (presumably normal) disk device. It is conceivable to generate data corresponding to the disk device for which the read command could not be issued by the error correction function and to send the requested data to the main computer.

【００１６】なお、このようなエラー訂正機能の一つと
しては、パリティチェック方式によるものがある。そし
て、その場合には、請求項３に示すように、並列に動作
する所定数台のデータ記憶用ディスク装置と、それら各
データ記憶用ディスク装置の所定の同一記憶位置に記録
された並列データ毎に作成されたパリティが記録された
パリティ記憶用ディスク装置とからなるディスク装置の
組を読出指令備えており、前記ディスクコントローラ
は、前記ディスク装置の組単位でパリティチェック方式
によるエラー訂正を実行可能に構成することが考えられ
る。As one of such error correction functions, there is a parity check method. In that case, as described in claim 3, a predetermined number of data storage disk devices that operate in parallel and each parallel data recorded in a predetermined same storage position of each of the data storage disk devices. The disk controller is provided with a read command for a set of disk storages for storing parity stored in the disk drive, and the disk controller can execute error correction by the parity check method in units of the set of disk devices. It is possible to configure.

【００１７】ここで、「所定数台のデータ記憶用ディス
ク装置とパリティ記憶用ディスク装置とからなるディス
ク装置の組を読出指令備える」としたのは、ディスクア
レイ装置を、例えば３台のデータ記憶用ディスク装置と
１台のパリティ記憶用ディスク装置の計４台だけを備え
る構成としてもよいが、この計４台を一組とし、それを
複数組備えるようにすれば、大容量化の実現や処理速度
の向上を図る上でさらに有効だからである。Here, the phrase "provides a read command for a set of disk devices consisting of a predetermined number of disk devices for data storage and disk devices for parity storage" means that the disk array device is, for example, three data storage devices. It is also possible to have a configuration in which only a total of four disk devices for storage and one disk device for parity storage are provided, but if these four devices are made into a set and a plurality of sets are provided, realization of large capacity and This is because it is more effective in improving the processing speed.

【００１８】そして、この場合、例えば９台のデータ記
憶用ディスク装置と１台のパリティ記憶用ディスク装置
の計１０台の構成としてもよいが、次のような問題があ
る。それは、計１０台のディスク装置の内のいずれか１
台について応答が遅い場合には、パリティチェック方式
によるエラー訂正機能によってデータ作成ができるが、
２台以上について応答が遅くなった場合、それら２台に
ついてのデータをエラー訂正機能によって作成すること
ができなくなる。すると、計１０台のディスク装置によ
る構成が全て利用不可となってしまい、リスクが大き
い。したがって、例えばそれらを３台のデータ記憶用デ
ィスク装置と１台のパリティ記憶用ディスク装置の計４
台の組を３組で構成することにより、４台中の２台の応
答が遅くなる確率は上記１０台中の２台の場合と比べて
非常に少なくなるので、信頼性向上の上でも有効であ
る。In this case, for example, a total of ten data storage disk devices and one parity storage disk device may be used, but there are the following problems. It is one of a total of 10 disk devices
If the response is slow for the machine, data can be created by the error correction function using the parity check method.
When the response is delayed for two or more units, the data for those two units cannot be created by the error correction function. As a result, the entire configuration of 10 disk devices cannot be used, which poses a large risk. Therefore, for example, a total of four disk storage devices including three data storage disk devices and one parity storage disk device are used.
By constructing a set of 3 units, the probability that the response of 2 units out of 4 units will be much slower than that of 2 units out of 10 units, which is also effective in improving reliability. .

【００１９】なお、請求項３では、エラー訂正機能の一
つとしてはパリティチェック方式によるものを例に挙げ
たが、それ以外の方式でのエラー訂正機能を備えている
ものであっても構わない。In claim 3, the parity check method is used as an example of the error correction function, but the error correction function may be provided by other methods. .

【００２０】[0020]

【発明の実施の形態】次に、本発明の実施形態について
説明する。図１に示す本発明の一実施形態であるディス
クアレイ装置３０は、４台のディスク装置３１，３２，
３３，３４と、それら各ディスク装置３１〜３４を個別
に制御可能なディスクアレイコントローラ３５とを備え
ている。なお、以下の説明では、４台のディスク装置を
区別する場合ために、Ａディスク装置３１、Ｂディスク
装置３２、Ｃディスク装置３３及びＤディスク装置３４
と記載することにする。また、ディスク装置３１〜３４
は、いわゆる物理的なハードディスクドライブとそれを
制御するコントロールボードとが一体化されたものであ
る。Next, an embodiment of the present invention will be described. A disk array device 30 according to an embodiment of the present invention shown in FIG. 1 includes four disk devices 31, 32,
33 and 34, and a disk array controller 35 capable of individually controlling the respective disk devices 31 to 34. In the following description, in order to distinguish four disk devices, the A disk device 31, the B disk device 32, the C disk device 33, and the D disk device 34.
Will be described. In addition, the disk devices 31 to 34
Is a so-called physical hard disk drive integrated with a control board for controlling it.

【００２１】ディスクアレイ装置３０はメインコンピュ
ータ１０とＳＣＳＩ（Small Computer System Interfac
e ）規格のバス２０を介して接続されており、メインコ
ンピュータ１０からは見かけ上１台のディスク装置のよ
うに見える。なお、メインコンピュータ１０には入力手
段としてのキーボード１１及び表示手段としてのＣＲＴ
１２が接続されている。The disk array device 30 includes a main computer 10 and a SCSI (Small Computer System Interfac).
e) Connected via a standard bus 20, and the main computer 10 looks like a single disk device. The main computer 10 has a keyboard 11 as an input means and a CRT as a display means.
12 are connected.

【００２２】本ディスクアレイ装置３０においては、デ
ィスクアレイコントローラ３５が４台のディスク装置３
１〜３４を並列に同時動作させことができる。そして、
ディスクアレイコントローラ３５は、メインコンピュー
タ１０から転送されたデータを所定の単位で並列に分割
し、この場合はＡディスク装置３１、Ｂディスク装置３
２、Ｃディスク装置３３の３台の同一セクタ上に順次書
き込むと共に、それらのデータに対応するパリティを生
成してＤディスク装置３４に書き込むように構成されて
いる。In this disk array device 30, the disk array controller 35 has four disk devices 3
1 to 34 can be simultaneously operated in parallel. And
The disk array controller 35 divides the data transferred from the main computer 10 in parallel in a predetermined unit. In this case, the A disk device 31 and the B disk device 3 are divided.
2, the C disk device 33 is sequentially written on the same sector, and the parity corresponding to the data is generated and written to the D disk device 34.

【００２３】したがって、Ａディスク装置３１、Ｂディ
スク装置３２、Ｃディスク装置３３の３台の内のいずれ
か一台からのデータが読み出せなくても、その他２台か
らのデータとＤディスク装置３４に書き込まれたパリテ
ィとに基づき、所定のエラー訂正機能を発揮すること
で、上記データを読み出せなかったディスク装置からの
データを作成（復元）することができる。Therefore, even if the data from any one of the A disk device 31, the B disk device 32, and the C disk device 33 cannot be read, the data from the other two devices and the D disk device 34. By exerting a predetermined error correction function based on the parity written in, the data from the disk device that could not read the data can be created (restored).

【００２４】このような構成の本ディスクアレイ装置３
０の基本的な利点は主に次の３点である。まず１番目と
して大容量化の実現である。ディスク装置３１〜３４の
限界容量はその時々の技術進歩によって常に変化してい
くが、いつの時代にもディスク装置３１〜３４の限界容
量がある。例えば本願の出願時点での限界容量は４Ｇ
（ギガ）バイト程度であり、さらに量産品となるとその
容量は２Ｇバイト程度である。上述した４Ｇバイトの限
界容量のディスク装置３１〜３４は量産できないので、
一般的に高価でかつ信頼性が低い。The present disk array device 3 having such a configuration
The basic advantages of 0 are mainly the following three points. First is the realization of large capacity. Although the limit capacities of the disk devices 31 to 34 are constantly changing with technological progress, the limit capacities of the disk devices 31 to 34 are always present. For example, the limit capacity at the time of filing this application is 4G
It is about (giga) bytes, and when it becomes a mass-produced product, its capacity is about 2 GB. Since the above-mentioned disk devices 31 to 34 having a limit capacity of 4 GB cannot be mass-produced,
Generally expensive and unreliable.

【００２５】それに対して量産品は製品が安定している
ので、一般的に安価でかつ信頼性が高い。例えば本例の
構成のようにデータ記憶用としてＡディスク装置３１、
Ｂディスク装置３２及びＣディスク装置３３の３台を備
えるディスクアレイ装置３０において、各ディスク装置
３１〜３３に量産品の２Ｇバイトのものを使用すると、
見かけ上６Ｇバイトのディスク装置ができ上がる。量産
品のディスク装置を寄せ集めることで限界容量よりも大
きいディスク装置を作り出せることになるため、信頼性
が高く、かつ大容量のものが得られる。また、限界容量
の４Ｇバイトのディスク装置３１〜３３を使用すると、
その時点では製造不可能と考えられる１２Ｇバイトのデ
ィスク装置を作ることが可能になる。On the other hand, mass-produced products are stable and generally inexpensive and highly reliable. For example, as in the configuration of this example, an A disk device 31 for storing data,
In a disk array device 30 including three B disk devices 32 and C disk devices 33, if mass-produced 2 Gbyte products are used for the respective disk devices 31 to 33,
Apparently a 6GB disk device is completed. Since mass-produced disk devices can be collected to create a disk device having a capacity larger than the limit capacity, a highly reliable and large-capacity disk device can be obtained. In addition, when the disk devices 31 to 33 each having a limit capacity of 4 GB are used,
At that point, it is possible to make a 12 Gbyte disk device that is considered unmanufacturable.

【００２６】２番目としては処理速度の向上が挙げられ
る。ディスク装置３１〜３４は物理的な動作部分を多数
持つ装置であるため、その動作スピードを向上させるの
は容易ではない。これは、ディスクの回転スピード、読
み書きヘッドの移動スピード、ヘッド移動後にヘッド位
置が安定するまでに必要な時間等の物理的要因が多数あ
るためである。ところが、ディスクアレイ装置３０は、
本来１台のディスク装置に書き込まれるはずのデータが
複数台、本例ではＡ〜Ｃのディスク装置３１〜３３に分
散して書き込まれるため、トータルで考えた場合、１台
のディスク装置に対する読み書きの頻度や読み出すデー
タ量が低減されるため、全体として見れば処理速度の向
上が図られるのである。Secondly, improvement in processing speed can be mentioned. Since the disk devices 31 to 34 are devices having many physical operation parts, it is not easy to improve the operation speed. This is because there are many physical factors such as the rotation speed of the disk, the moving speed of the read / write head, and the time required for the head position to stabilize after moving the head. However, the disk array device 30
Since a plurality of data, which should originally be written in one disk device, are distributed and written in the disk devices 31 to 33 of A to C in this example, in a total consideration, reading and writing to one disk device is performed. Since the frequency and the amount of data to be read are reduced, the processing speed can be improved as a whole.

【００２７】そして、本例のディスクアレイ装置３０の
場合、データ記憶用としてＡ〜Ｃの３台のディスク装置
３１〜３３とパリティ記憶用のＤディスク装置３４を持
っているため、このうちのどれか１台が丸ごと壊れても
データは失われない。その仕組みに付いて、図２を参照
して説明する。Since the disk array device 30 of this example has three disk devices 31 to 33 for data storage and D disk device 34 for parity storage, which one of them is used? Data is not lost even if one unit is totally broken. The mechanism will be described with reference to FIG.

【００２８】まず、データ書込時にディスクアレイコン
トローラ３５は、データ記憶用のＡディスク装置３１、
Ｂディスク装置３２、Ｃディスク装置３３の３台の同一
セクタ上にデータを順次書き込むと共に、それらライト
データを排他的論理和演算処理をし、その演算結果に基
づいて対応するパリティデータを生成する。そして、そ
の作成したパリティデータをＤディスク装置３４の同一
セクタに書き込む。First, when writing data, the disk array controller 35 uses the A disk device 31 for data storage,
Data is sequentially written on three identical sectors of the B disk device 32 and the C disk device 33, the write data is subjected to exclusive OR operation processing, and corresponding parity data is generated based on the operation result. Then, the created parity data is written in the same sector of the D disk device 34.

【００２９】具体例としては、図２（ａ）に示すように
データ「１０１」を書き込む場合には、データ記憶用の
Ａディスク装置３１には「１」、Ｂディスク装置３２に
は「０」、Ｃディスク装置３３には「１」がそれぞれ書
き込まれる。そして、それらのライトデータを排他的論
理和演算処理すると、本実施形態の場合は偶数パリティ
としてパリティデータ「０」が得られるので、それをＤ
ディスク装置３４に書き込む。As a concrete example, when writing data "101" as shown in FIG. 2A, "1" is stored in the A disk device 31 for storing data, and "0" is stored in the B disk device 32. , 1 is written in the C disk device 33, respectively. Then, when the write data is subjected to the exclusive OR operation, the parity data “0” is obtained as the even parity in the case of the present embodiment, so that it is D
Write to the disk device 34.

【００３０】一方、データ読出時にディスク装置３１〜
３４の故障によって読み出しエラーが発生すると、ディ
スクアレイコントローラ３５は次のように制御する。な
お、本例では４台のディスク装置３１〜３４の内の２台
以上でエラーが発生した場合には対応不可能なので、い
ずれか１台においてエラーが発生した場合を想定して以
下の説明を進める。データ記憶用のＡ〜Ｃのディスク装
置３１〜３３の内の１台でエラーが発生した場合には、
残りの正常なディスク装置からリードしたデータと、Ｄ
ディスク装置３４からリードしたパリティデータに基づ
き、エラー発生した１台のディスク装置に対応するデー
タをエラー訂正機能によって作成（復元）する。On the other hand, at the time of reading data, the disk devices 31 to 31
When a read error occurs due to a failure of 34, the disk array controller 35 controls as follows. It should be noted that in this example, it is not possible to cope with the case where an error occurs in two or more of the four disk devices 31 to 34, so the following description will be given assuming the case where an error occurs in any one of them. Proceed. When an error occurs in one of the disk devices 31 to 33 for data storage A to C,
Data read from the remaining normal disk unit and D
Based on the parity data read from the disk device 34, data corresponding to one disk device in which an error has occurred is created (restored) by the error correction function.

【００３１】具体例を図２（ｂ）を参照して説明する。
図２（ｂ）中のの場合には、Ａディスク装置３１がエ
ラーであるため、Ｂディスク装置３２から「０」、Ｃデ
ィスク装置３３から「１」、そしてＤディスク装置３４
からパリティ「０」がそれぞれリードデータとして得ら
れる。これらのデータを基に、上述した排他的論理和演
算処理をすると、本実施形態の場合には偶数パリティで
あるので、Ａディスク装置３１からリードすべきであっ
たデータは「１」であることが判り、その作成したデー
タを加えて「１０１」を正常なリードデータとして、メ
インコンピュータ１０に送出することとなる。A specific example will be described with reference to FIG.
In the case of FIG. 2B, since the A disk device 31 has an error, the B disk device 32 gives “0”, the C disk device 33 gives “1”, and the D disk device 34.
From this, parity “0” is obtained as read data. When the above-described exclusive OR operation processing is performed based on these data, the data that should have been read from the A disk device 31 is "1" because the parity is even parity in this embodiment. Then, the created data is added and "101" is sent to the main computer 10 as normal read data.

【００３２】同様に、図２（ｂ）中のの場合には、Ｂ
ディスク装置３２がエラーであるため、Ａディスク装置
３１から「１」、Ｃディスク装置３３から「１」、そし
てＤディスク装置３４からパリティ「０」がそれぞれリ
ードデータとして得られる。これらのデータを基にエラ
ー訂正機能によりＢディスク装置３２からリードすべき
であったデータを作成すると「０」となるため、その作
成したデータを加えた「１０１」が正常なリードデータ
となる。Similarly, in the case of FIG. 2B, B
Since the disk device 32 has an error, the A disk device 31 obtains "1", the C disk device 33 produces "1", and the D disk device 34 obtains parity "0" as read data. When the data that should have been read from the B disk device 32 by the error correction function is created based on these data, it becomes "0", so that "101" including the created data becomes normal read data.

【００３３】また、図２（ｂ）中のの場合には、Ｃデ
ィスク装置３３がエラーであるため、Ａディスク装置３
１から「１」、Ｂディスク装置３２から「０」、そして
Ｄディスク装置３４からパリティ「０」がそれぞれリー
ドデータとして得られる。これらのデータを基にエラー
訂正機能によりＣディスク装置３３からリードすべきで
あったデータを作成すると「１」となるため、その作成
したデータを加えた「１０１」が正常なリードデータと
なる。In the case of FIG. 2B, since the C disk device 33 has an error, the A disk device 3
1 to “1”, B disk device 32 to “0”, and D disk device 34 to obtain parity “0”, respectively, as read data. When the data that should have been read from the C disk device 33 is created based on these data by the error correction function, it becomes "1", so that "101" including the created data becomes normal read data.

【００３４】なお、図２（ｂ）中のの場合には、Ｄデ
ィスク装置３４がエラーであるが、これはその他のディ
スク装置３１〜３３でエラーが発生した場合のためのパ
リティ記憶用であるので、実質的には問題ない。つま
り、Ａディスク装置３１から「１」、Ｂディスク装置３
２から「０」、そしてＣディスク装置３３から「１」が
それぞれリードデータとして転送されるため、Ｄディス
ク装置３４からのパリティデータがなくても、正常なリ
ードデータ「１０１」が得られる。In the case of FIG. 2B, the D disk device 34 has an error, but this is for parity storage in case an error occurs in the other disk devices 31 to 33. Therefore, there is practically no problem. That is, A disk device 31 to "1", B disk device 3
Since “2” to “0” and “1” from the C disk device 33 are transferred as read data, normal read data “101” can be obtained even if there is no parity data from the D disk device 34.

【００３５】以上説明した構成、そしてその構成に基づ
く作用・効果は、従来より知られていることであるが、
それを前提としても次のような問題がある。すなわち、
メインコンピュータ１０からディスクアレイ装置３０に
対して読み出しの要求が来た場合には、ディスクアレイ
コントローラ３５は、アクセス対象であるＡ〜Ｃのディ
スク装置３１〜３３に対して読出指令を出し、それら全
てのディスク装置３１〜３３の動作が全て終了するのを
待って、それらから得たデータをメインコンピュータ１
０側に転送していた。そして、それらの内の１台におい
てデータが読み出しができないエラーが起きた場合には
Ｄディスク装置３４からパリティデータを読み出し、そ
れ以外の２台からのデータとＤディスク装置３４のパリ
ティデータを基にエラー訂正機能によって故障したディ
スク装置からのデータを作成する。この作成したデータ
を要求してきたメインコンピュータ１０に送出すること
となる。The configuration described above, and the operation and effect based on the configuration are known from the prior art.
Even assuming that, there are the following problems. That is,
When a read request is issued from the main computer 10 to the disk array device 30, the disk array controller 35 issues a read command to the disk devices 31 to 33 of A to C to be accessed, and all of them are read. Wait for all the operations of the disk devices 31 to 33 of the main computer 1 to finish, and then obtain the data obtained from them by the main computer 1
It was transferred to the 0 side. When an error that data cannot be read occurs in one of them, the parity data is read from the D disk device 34, and based on the data from the other two devices and the parity data of the D disk device 34. Creates data from a failed disk device using the error correction function. This created data is sent to the requesting main computer 10.

【００３６】しかしながら、通常エラー発生時には、デ
ィスク装置３１〜３４内で何回か自動的にリトライリー
ドした後で、結局はリトライオーバーとなってエラーと
なることが多い。例えば図２（ｂ）に示すの場合に
は、エラーが発生したＡディスク装置３１は、正常終了
するＢ及びＣディスク装置３２，３３よりも、動作終了
までの時間が余分にかかる。However, when a normal error occurs, after retry reading is automatically performed several times in the disk devices 31 to 34, eventually a retry over occurs and an error often occurs. For example, in the case shown in FIG. 2B, the A disk device 31 in which the error has occurred takes more time than the B and C disk devices 32 and 33 that normally end, until the operation ends.

【００３７】したがって、Ｂ及びＣディスク装置３２，
３３で動作終了していても、Ａディスク装置３１におい
て動作終了し、そこで初めてエラーであることが判って
からでないと、Ｄディスク装置３４のパリティデータを
用いたエラー訂正の処理に移行できない。そのため、デ
ィスクアレイ装置３０内のデータ記憶用のＡ〜Ｃのディ
スク装置３１〜３３の内で１台でもエラーが発生した場
合は、結果的にエラー訂正してデータ修復をしたとして
も、ディスクアレイ装置３０全体の動作スピードに影響
を与えることになってしまう。特にメインコンピュータ
１０からの要求に対するレスポンスを低下させてしまう
こととなる。Therefore, the B and C disk devices 32,
Even if the operation is completed in 33, the operation is completed in the A disk device 31, and it is not until the error is known for the first time that the process of error correction using the parity data of the D disk device 34 cannot be started. Therefore, if an error occurs in even one of the disk devices 31 to 33 for data storage in the disk array device 30, even if the error is corrected and the data is restored, the disk array This will affect the operation speed of the entire device 30. Especially, the response to the request from the main computer 10 is deteriorated.

【００３８】それに対して本ディスクアレイ装置３０に
おいては、このようなレスポンスの低下を防止するため
の工夫もなされている。それはメインコンピュータ１０
からデータの読み出し要求がなされた場合の制御であ
る。その制御について、図３，４を参照して説明する。
図３は、メインコンピュータ１０からデータの読み出し
要求がなされた場合に実行するデータ読出制御処理のフ
ローチャートであり、図４は、その制御処理によるディ
スクアレイコントローラ３５及び各ディスク装置３１〜
３４の動作の概略を示す説明図である。On the other hand, the present disk array device 30 is also devised to prevent such a decrease in response. It is the main computer 10
This is the control when a data read request is issued from. The control will be described with reference to FIGS.
FIG. 3 is a flowchart of a data read control process executed when a data read request is issued from the main computer 10, and FIG. 4 is a disk array controller 35 and each disk device 31 to 31 by the control process.
It is explanatory drawing which shows the outline of operation | movement of 34.

【００３９】まず図３の最初のステップＳ１０において
は、メインコンピュータ１０からのデータ読出要求に応
じて、Ａ〜Ｄの各ディスク装置３１〜３４に対して読出
指令を出力する。Ｓ１０で読出指令を出力した後は、Ｓ
２０にて各ディスク装置３１〜３４からデータ転送があ
ればディスクアレイコントローラ３５内の図示しないバ
ッファメモリに格納しておく。First, in the first step S10 of FIG. 3, a read command is output to each of the disk devices 31 to 34 of A to D in response to a data read request from the main computer 10. After outputting the read command in S10,
If data is transferred from each of the disk devices 31 to 34 at 20, the data is stored in a buffer memory (not shown) in the disk array controller 35.

【００４０】そして、続くＳ２５では、全ディスク装置
３１〜３５からの応答が終了しているかどうかを判断
し、もしも終了していれば（Ｓ２５：ＹＥＳ）、そのま
まＳ５０へ移行する。Ｓ５０の処理については後述す
る。一方、終了していなければ（Ｓ２５：ＮＯ）、Ｓ３
０へ移行する。Then, in the following S25, it is judged whether or not the responses from all the disk devices 31 to 35 have been completed, and if the responses have been completed (S25: YES), the process directly proceeds to S50. The process of S50 will be described later. On the other hand, if not completed (S25: NO), S3
Move to 0.

【００４１】Ｓ３０では、Ｓ１０の処理で各ディスク装
置３１〜３４に読出指令を出力してから所定時間が経過
したかどうか（タイムアウトかどうか）を判断する。タ
イムアウトでない場合には（Ｓ３０：ＮＯ）、Ｓ２０の
処理を繰り返し、タイムアウトとなった場合には（Ｓ３
０：ＹＥＳ）、Ｓ４０へ移行する。In S30, it is determined whether or not a predetermined time has elapsed (whether a timeout has occurred) since the read command was output to each of the disk devices 31 to 34 in the process of S10. If not timed out (S30: NO), the process of S20 is repeated. If timed out (S3:
0: YES), and the process proceeds to S40.

【００４２】Ｓ４０では、Ｓ１０で読出指令を出力した
Ａ〜Ｄの各ディスク装置３１〜３４の中で応答がないも
のがあるかどうかを判断する。Ｓ３０で設定されている
所定時間は、正常に読み出しが実行される場合には、読
出指令を出力してからデータ転送が終了するのに十分な
時間である。但し、うまく読めなかったために、リード
リトライをした場合には、この所定時間を超えてしまう
場合がある。In S40, it is determined whether or not there is no response among the disk devices 31 to 34 of A to D that output the read command in S10. The predetermined time set in S30 is a time sufficient to finish the data transfer after outputting the read command when the reading is normally executed. However, if the read retry is performed because the reading was not successful, this predetermined time may be exceeded.

【００４３】したがって、各ディスク装置３１〜３４に
て正常に動作が実行されていれば、タイムアウト時（Ｓ
３０：ＹＥＳ）において応答がないディスク装置はない
ので、Ｓ４０にて否定判断となり、Ｓ５０へ移行する。
もちろん、上述したＳ２５にて肯定判断の場合には、タ
イムアウトを待たなくてもすでに全てのディスク装置に
て応答終了しているので、Ｓ５０へ移行する。Therefore, if the disk devices 31 to 34 are normally operating, a timeout occurs (S
Since there is no disk device that does not respond in 30: YES), a negative determination is made in S40, and the process proceeds to S50.
Of course, in the case of affirmative determination in S25 described above, since the response has already been completed in all the disk devices without waiting for timeout, the process proceeds to S50.

【００４４】Ｓ５０では、リードデータをメインコンピ
ュータ１０に送出する。Ｓ２５０で肯定判断あるいはＳ
４０で否定判断されてＳ５０に移行する場合には、各デ
ィスク装置３１〜３４からのデータが存在するため、そ
の内のＡ〜Ｃのディスク装置３１〜３３のデータによっ
てメインコンピュータ１０に送出すべきデータが得られ
る。At S50, the read data is sent to the main computer 10. Affirmative judgment in S250 or S
When a negative determination is made in step 40 and the processing shifts to S50, data from each of the disk devices 31 to 34 exists, so the data of the disk devices 31 to 33 of A to C among them should be sent to the main computer 10. Data is obtained.

【００４５】一方、Ｓ４０で肯定判断である場合には、
そのままではメインコンピュータ１０に送出すべきデー
タが得られない可能性があるので、Ｓ６０以降の所定の
処理に移行する。まず、Ｓ６０では、応答なしのディス
ク装置が１台かどうかを判断する。もしも１台であれば
（Ｓ６０：ＹＥＳ）、続くＳ７０にて、その応答無しの
ディスク装置がＤディスク装置３４であるかどうかを判
断する。もしも、Ｄディスク装置３４であった場合には
（Ｓ７０：ＹＥＳ）、そのままＳ５０へ移行する。これ
はＤディスク装置３４はパリティを記憶しているもので
あるため、それ以外のＡ〜Ｃのディスク装置３１〜３３
のデータが得られているのであれば、Ｓ５０での処理に
おいて何等支障がないからである。つまり、Ａ〜Ｃのデ
ィスク装置３１〜３３のデータによってメインコンピュ
ータ１０に送出すべきデータが得られるからである。On the other hand, if the determination in S40 is affirmative,
Since there is a possibility that the data to be sent to the main computer 10 may not be obtained as it is, the process proceeds to a predetermined process after S60. First, in S60, it is determined whether or not there is one disk device that does not respond. If the number is one (S60: YES), it is determined at S70 whether the unresponsive disk device is the D disk device 34. If it is the D disk device 34 (S70: YES), the process directly proceeds to S50. Since the D disk device 34 stores the parity, the other disk devices 31 to 33 of A to C are used.
This is because if the data of 1 is obtained, there is no problem in the processing of S50. That is, data to be sent to the main computer 10 can be obtained from the data of the disk devices 31 to 33 of A to C.

【００４６】しかしながら、Ｓ７０で否定判断、つまり
応答の無い１台のディスク装置がＡ〜Ｃのディスク装置
３１〜３３の内のいずれか１台である場合には、そのま
まではメインコンピュータ１０に送出すべきデータが得
られない。したがって、Ｓ８０へ移行して、エラー訂正
により該当するデータを作成する。However, a negative determination is made in S70, that is, if one disk device that does not respond is one of the disk devices 31 to 33 of A to C, the disk device is sent to the main computer 10 as it is. Data is not available. Therefore, the process proceeds to S80 and the corresponding data is created by error correction.

【００４７】これは、上記図２を参照して説明したよう
に、データ記憶用のＡ〜Ｃのディスク装置３１〜３３の
内のエラーが発生していない残りの正常なディスク装置
からリードした２つのデータと、Ｄディスク装置３４か
らリードしたパリティデータとに基づき、エラー発生し
たディスク装置に対応するデータをエラー訂正機能によ
って作成（復元）するのである。As described above with reference to FIG. 2, this is because the data is read from the remaining normal disk devices of the disk devices 31 to 33 for storing data which have no error. Based on one data and the parity data read from the D disk device 34, the data corresponding to the disk device in which the error has occurred is created (restored) by the error correction function.

【００４８】Ｓ８０でデータを作成した後は、Ｓ５０へ
移行する。正常なディスク装置からリードした２つのデ
ータと、Ｓ８０で作成したデータとでメインコンピュー
タ１０に送出すべきデータが得られるため、それをメイ
ンコンピュータ１０に送出する。After the data is created in S80, the process proceeds to S50. Since the data to be sent to the main computer 10 is obtained from the two data read from the normal disk device and the data created in S80, the data is sent to the main computer 10.

【００４９】なお、Ｓ６０で否定判断、すなわちタイム
アウト時に応答が無いディスク装置が２台以上あった場
合には、上述したＳ８０でのエラー訂正では対応できな
いので、その場合にはＳ６１へ移行する。Ｓ６１では、
上記Ｓ２０と同様、各ディスク装置３１〜３４からデー
タ転送があればディスクアレイコントローラ３５内の図
示しないバッファメモリに格納しておき、続くＳ６２に
おいて、応答なしのディスク装置が１台以下となったか
どうかを判断する。１台以下となった場合には（Ｓ６
２：ＹＥＳ）、Ｓ７０へ移行するが、１台以下でない場
合（つまり２台以上の場合）には（Ｓ６２：ＮＯ）、Ｓ
６３へ移行して所定時間が経過したかどうか（タイムア
ウトかどうか）を判断する。タイムアウトでない場合に
は（Ｓ６３：ＮＯ）、Ｓ６１へ戻ってＳ６１以下の処理
を繰り返すが、タイムアウトとなった場合には（Ｓ６
３：ＹＥＳ）、Ｓ９０へ移行し、メインコンピュータ１
０に所定のエラー通知を送り、本処理を終了することと
なる。Incidentally, if a negative determination is made in S60, that is, if there are two or more disk devices that do not respond at the time-out, the error correction in S80 described above cannot deal with it. In that case, the process proceeds to S61. In S61,
Similar to S20, if data is transferred from each of the disk devices 31 to 34, it is stored in a buffer memory (not shown) in the disk array controller 35, and in subsequent S62, whether or not there is one or less disk device with no response. To judge. If the number is less than 1 (S6
2: YES), the process proceeds to S70, but if it is not one or less (that is, two or more) (S62: NO), S
It is judged whether or not a predetermined time has elapsed after shifting to 63 (whether or not it has timed out). If it has not timed out (S63: NO), the process returns to S61 to repeat the processing from S61 onward, but if it has timed out (S6
3: YES), shift to S90, main computer 1
A predetermined error notification is sent to 0, and this processing ends.

【００５０】つまり、２台以上のディスク装置からの応
答が遅れている場合には、応答がないディスク装置が１
台以下になるまで応答終了を待つ。これは、リードリト
ライによって応答が遅れている場合でも、必ずエラーに
なるとは限らないので、所定時間は応答終了を待つので
ある。その所定時間内に応答がないディスク装置が１台
以下になればよいが、そうでない場合にはＳ９０にてエ
ラー通知を送ることとなる。That is, when the responses from two or more disk devices are delayed, the number of disk devices that do not respond is 1
Wait for the end of the response until it is below the limit. This is because an error does not always occur even if the response is delayed due to the read retry, so that the response is waited for a predetermined time. It is sufficient that the number of disk devices that do not respond within the predetermined time is one or less. If not, an error notification will be sent in S90.

【００５１】なお、Ｓ３０でのタイムアウトは比較的短
い所定時間が設定され、Ｓ６３でのタイムアウトは比較
的長い所定時間が設定されている。このように、図３で
説明した処理を実行すると、各ディスク装置３１〜３４
にて正常な読取動作が実行される場合には従来と何等変
わらないが、エラーが発生する状況においては、レスポ
ンスの向上が実現される。つまり、図３のＳ８０の処理
を経由する場合がそれである。The timeout in S30 is set to a relatively short predetermined time, and the timeout in S63 is set to a relatively long predetermined time. In this way, when the processing described with reference to FIG. 3 is executed, the respective disk devices 31 to 34 are
When the normal reading operation is executed in, there is no difference from the conventional method, but in the situation where an error occurs, the improvement of the response is realized. That is, this is the case where the processing goes through the processing of S80 in FIG.

【００５２】この点を図４を参照して図解する。メイン
コンピュータ１０からのデータ読出要求（図４（ａ）中
の参照）に応じて、ディスクアレイコントローラ３５
がＡ〜Ｄの各ディスク装置３１〜３４に読出指令を出す
（図４（ａ）中の参照）。その読出指令を出してから
所定時間後の状態例を図４（ｂ）に示す。この場合に
は、Ａ，Ｂ，Ｄの３台のディスク装置３１，３２，３４
は正常な読取動作がなされたためデータ転送が終了して
いるが、Ｃのディスク装置３３においては、正常な読取
動作がなされず、例えばリードリトライ処理を実行して
いるため、まだデータ転送が終了せず応答がない状態で
ある（図４（ｂ）中の参照）。This point will be illustrated with reference to FIG. In response to a data read request from the main computer 10 (see FIG. 4A), the disk array controller 35
Issues a read command to the disk devices 31 to 34 of A to D (see FIG. 4A). FIG. 4B shows an example of a state after a predetermined time has elapsed from issuing the read command. In this case, the three disk devices 31, 32, 34 of A, B, D
, The data transfer is completed because a normal read operation has been performed, but the disk device 33 of C does not perform a normal read operation and, for example, the read retry process is being executed, so the data transfer is not completed yet. There is no response (see FIG. 4 (b)).

【００５３】このような状況のときに、従来は、Ｃディ
スク装置３３の動作が終了するのを待ち、Ｃディスク装
置３３からエラーのため読み出せないとの通知を得てか
らでないと、Ａ，Ｂ２台のディスク装置３１，３２から
得たデータとＤディスク装置３４からのパリティデータ
を基にしたエラー訂正機能によるデータ作成が実行でき
なかった。つまり、Ａ，Ｂディスク装置３１，３２で動
作終了されていても（データ転送がされていても）、Ｃ
ディスク装置３３において動作終了し、そこで初めてエ
ラーであることが判ってからでないと、Ｄディスク装置
３４のパリティデータを用いたエラー訂正の処理に移行
できないため、正常に読み出した場合と比較して、メイ
ンコンピュータ１０からの要求に対するレスポンスがか
なり低下させてしまうこととなる。In such a situation, conventionally, until the operation of the C disk device 33 is completed and the C disk device 33 is notified that the data cannot be read due to an error, A, Data could not be created by the error correction function based on the data obtained from the B2 disk devices 31 and 32 and the parity data from the D disk device 34. That is, even if the operation is completed in the A and B disk devices 31 and 32 (even if data is transferred), C
The operation is terminated in the disk device 33, and it is necessary to know that there is an error there for the first time, so that the process cannot be shifted to the error correction process using the parity data of the D disk device 34. The response to the request from the main computer 10 will be considerably lowered.

【００５４】それに対して、本ディスクアレイ装置３０
の場合には、最終的にそのディスク装置３１〜３４から
データが得られるかどうかは関係なく、ディスクアレイ
コントローラ３５が読出指令を出力してから所定時間経
過した時点で応答がないディスク装置については読出エ
ラーの発生によりリードリトライ等の処理を実行してい
るものと見なし、それが１台である場合には、その動作
終了を待たずに他のディスク装置からのデータによって
メインコンピュータ１０へ送出すべきデータを作成して
送出してしまうのである。On the other hand, the present disk array device 30
In this case, regardless of whether data is finally obtained from the disk device 31 to 34, for a disk device that does not respond at the time when a predetermined time has elapsed after the disk array controller 35 issued the read command, It is considered that a read error or the like is being executed due to the occurrence of a read error, and when there is only one device, the data is sent to the main computer 10 by the data from another disk device without waiting for the end of the operation. It creates and sends the correct data.

【００５５】つまり、図４（ｂ）に示すように、タイム
アウト時にＣディスク装置３３からの応答がなくても、
図４（ｃ）に示すように、Ａ，Ｂ２台のディスク装置３
１，３２からリードした２つのデータと、Ｄディスク装
置３４からリードしたパリティデータとに基づき、エラ
ー発生したＣディスク装置３３に対応するデータをエラ
ー訂正機能によって作成（復元）するのである。そし
て、その作成データとＡ，Ｂ２台のディスク装置３１，
３２からリードした２つのデータとでメインコンピュー
タ１０への送出用のデータが得られるので、それをメイ
ンコンピュータ１０へ送出するのである。That is, as shown in FIG. 4B, even if there is no response from the C disk device 33 at the time-out,
As shown in FIG. 4C, two disk devices 3 of A and B are provided.
The data corresponding to the C disk device 33 in which the error has occurred is created (restored) by the error correction function based on the two data read from the first and the second data 32 and the parity data read from the D disk device 34. Then, the created data and the disk devices 31, A and B,
Since the data to be sent to the main computer 10 is obtained from the two data read from 32, the data is sent to the main computer 10.

【００５６】なお、Ｄディスク装置３４において動作終
了が遅れている場合には、図３についての説明でも述べ
たように、Ａ〜Ｃのディスク装置３１〜３３だけからリ
ードしたデータだけで十分なので、そのだけでメインコ
ンピュータ１０へ送出すべきデータが得られる。When the operation of the D disk device 34 is delayed, the data read from only the disk devices 31 to 33 of A to C is sufficient, as described in the description of FIG. Only then, the data to be sent to the main computer 10 can be obtained.

【００５７】また、タイムアウト時に応答が無かった１
台のディスク装置の動作終了を待たずにメインコンピュ
ータ１０にデータを送出した場合には、次にメインコン
ピュータ１０から読出要求が来た時点でも、まだ上記特
定の１台のディスク装置の動作終了が終了していない場
合も考えられる。この場合はそのディスク装置への読出
指令自体が出せないので、残る３台にだけ読出指令を出
力する。そして、その３台からのデータ転送があれば、
その３つのデータに基づき、必要ならエラー訂正で足り
ないデータを作成し、メインコンピュータ１０に要求さ
れたデータを送出すればよい。Also, there was no response at the time of timeout 1
When data is sent to the main computer 10 without waiting for the operation of one disk device to be completed, the operation of one specific disk device is not completed at the next read request from the main computer 10. It is possible that it has not ended. In this case, since the read command itself cannot be issued to the disk device, the read command is output only to the remaining three units. And if there is data transfer from the three units,
On the basis of the three data, if necessary, error correction may be performed to create insufficient data, and the requested data may be sent to the main computer 10.

【００５８】このように、本ディスクアレイ装置３０に
よれば、大容量化の実現や処理速度の向上、そしてディ
スク装置３１〜３４におけるデータエラー発生に対する
エラー訂正ができるため、信頼性の向上も図ることがで
きるという基本的な作用・効果を発揮する。As described above, according to the disk array device 30, the capacity can be increased, the processing speed can be improved, and the error correction for the data error occurrence in the disk devices 31 to 34 can be performed, so that the reliability can be improved. The basic action and effect of being able to do is demonstrated.

【００５９】そして、それらに加えて、メインコンピュ
ータ１０からの要求に応じてディスク装置３１〜３４に
読出指令を出した際、ディスク装置３１〜３４の内の１
台からの応答が指令を出してから所定時間経過してもな
い場合には、そのディスク装置からの応答を待たずに、
その他のディスク装置から転送されたデータに基づき、
必要であればエラー訂正機能によって不足のデータを作
成し、メインコンピュータ１０に要求されたデータを送
出することができる。In addition to them, when a read command is issued to the disk devices 31 to 34 in response to a request from the main computer 10, one of the disk devices 31 to 34 is read.
If the response from the stand does not elapse a predetermined time after issuing the command, do not wait for the response from the disk device,
Based on the data transferred from other disk devices,
If necessary, the error correction function can be used to create insufficient data and send the requested data to the main computer 10.

【００６０】これによって、読出エラーのためリードリ
トライ処理を実行している等の理由で、正常終了するデ
ィスク装置よりも動作終了までに時間が余分にかかって
しまうディスク装置があったとしても、それを無視して
エラー訂正機能でメインコンピュータ１０へ転送すべき
データを作り出すことができ、ディスクアレイ装置３０
全体の動作スピードには影響を与えずに動作を継続する
ことができる。As a result, even if there is a disk device that takes more time to complete its operation than a normally completed disk device due to a read retry process being executed due to a read error, etc. Data to be transferred to the main computer 10 can be created by the error correction function by ignoring
The operation can be continued without affecting the overall operation speed.

【００６１】つまり、従来は、上記応答が遅れているデ
ィスク装置からの応答を待ち、その時点でエラーのため
必要なデータが得られないと判ってからエラー訂正によ
りデータ修復していたため、ディスクアレイ装置３０全
体の動作スピードに影響を与えることになってしまい、
メインコンピュータ１０からの指令に対するレスポンス
が低下していた。それに対して、本案のディスクアレイ
装置３０によれば、そのようなレスポンスを低下させる
ディスク装置からの応答を待たないで処理してしまうの
で、ディスクアレイ装置全体のレスポンスを低下させな
いのである。That is, in the past, since the response was delayed from the disk device, the data was restored by error correction after it was determined that the necessary data could not be obtained due to an error at that time, so the disk array It will affect the operation speed of the entire device 30,
The response to the command from the main computer 10 was degraded. On the other hand, according to the disk array device 30 of the present invention, since the processing is performed without waiting for the response from the disk device that deteriorates such a response, the response of the entire disk array device is not deteriorated.

【００６２】以上、本発明はこのような実施形態に何等
限定されるものではなく、本発明の主旨を逸脱しない範
囲において種々なる形態で実施し得る。図１に示すディ
スクアレイ装置３０の場合には、データ記憶用のＡ〜Ｃ
の３台のディスク装置３１〜３３とパリティ記憶用のＤ
ディスク装置３４とを一組として備えていたが、このよ
うなディスク装置の組を２組以上備えていてもよい。例
えば、図５に示す別の実施形態のディスクアレイ装置
は、データ記憶用のＡ１〜Ｃ１の３台のディスク装置１
０１〜１０３とパリティ記憶用のＤ１ディスク装置１０
４とで構成される第１組、データ記憶用のＡ２〜Ｃ２の
３台のディスク装置１１１〜１１３とパリティ記憶用の
Ｄ２ディスク装置１１４とで構成される第２組、データ
記憶用のＡ３〜Ｃ３の３台のディスク装置１２１〜１２
３とパリティ記憶用のＤ３ディスク装置１２４とで構成
される第３組、……というように複数のディスク装置の
組を備えるようにしてもよい。この場合、図５に示すデ
ィスクコントローラは、前記ディスク装置の組単位でパ
リティチェック方式によるエラー訂正を実行するように
すれば、大容量化の実現や処理速度の向上を図る上でさ
らに有効である。また、接続されるメインコンピュータ
からマルチタスク処理が要求された場合でも有効であ
る。As described above, the present invention is not limited to such an embodiment, and can be implemented in various forms without departing from the gist of the present invention. In the case of the disk array device 30 shown in FIG. 1, A to C for data storage
3 disk units 31 to 33 and D for parity storage
Although the disk device 34 and the disk device 34 are provided as one set, two or more sets of such disk devices may be provided. For example, a disk array device according to another embodiment shown in FIG. 5 has three disk devices A1 to C1 for data storage.
01-103 and D1 disk device 10 for parity storage
4 and a second set including three disk devices 111 to 113 A2 to C2 for data storage and a D2 disk device 114 for parity storage, A3 to data storage. Three C3 disk devices 121 to 12
3 and a D3 disk device 124 for parity storage may be provided as a third set, .. In this case, if the disk controller shown in FIG. 5 executes the error correction by the parity check method for each set of the disk devices, it is more effective in realizing a large capacity and improving the processing speed. . It is also effective when multitask processing is requested from the connected main computer.

【００６３】なお、図５の場合には、３台のデータ記憶
用ディスク装置と１台のパリティ記憶用ディスク装置の
計４台で１組を構成しているが、この場合、例えば９台
のデータ記憶用ディスク装置と１台のパリティ記憶用デ
ィスク装置の計１０台の構成とすることも考えられる。
しかし、計１０台のディスク装置の内のいずれか１台に
ついて応答が遅い場合には、パリティチェック方式によ
るエラー訂正機能によってデータ作成ができるが、２台
以上について応答が遅くなった場合、それら２台につい
てのデータをエラー訂正機能によって作成することがで
きなくなる。すると、計１０台のディスク装置による構
成が全て利用不可となってしまい、リスクが大きい。し
たがって、例えばそれらを３台のデータ記憶用ディスク
装置と１台のパリティ記憶用ディスク装置の計４台の組
を３組で構成することにより、４台中の２台の応答が遅
くなる確率は上記１０台中の２台の場合と比べて非常に
少なくなるので、信頼性向上の上でも有効である。In the case of FIG. 5, a total of four data storage disk devices and one parity storage disk device make up one set, but in this case, for example, nine data storage disk devices. It is also conceivable to configure a total of 10 data storage disk devices and one parity storage disk device.
However, if the response is slow for any one of the 10 disk devices in total, data can be created by the error correction function by the parity check method. It becomes impossible to create the data about the stand by the error correction function. As a result, the entire configuration of 10 disk devices cannot be used, which poses a large risk. Therefore, for example, by configuring them with a total of four sets of three data storage disk devices and one parity storage disk device, the probability that the response of two of the four devices will be delayed is Since it is much smaller than the case of 2 units out of 10, it is effective in improving reliability.

【００６４】なお、上記説明では、エラー訂正機能の一
つとしてはパリティチェック方式によるものを例に挙げ
たが、それ以外の方式でのエラー訂正機能を備えている
ものであっても構わない。In the above description, one of the error correction functions is based on the parity check method, but an error correction function of any other method may be used.

[Brief description of drawings]

【図１】ディスクアレイ装置及びそれと接続されるメ
インコンピュータのの概略構成を示す説明図である。FIG. 1 is an explanatory diagram showing a schematic configuration of a disk array device and a main computer connected thereto.

【図２】パリティチェック方式によるエラー訂正の説
明図である。FIG. 2 is an explanatory diagram of error correction by a parity check method.

【図３】ディスクアレイコントローラにおいて実行さ
れる読出制御処理を示すフローチャートである。FIG. 3 is a flowchart showing a read control process executed in the disk array controller.

【図４】ディスクアレイ装置内のディスク装置の内の
１台の動作終了が遅れている場合の処理の説明図であ
る。FIG. 4 is an explanatory diagram of processing when the end of operation of one of the disk devices in the disk array device is delayed.

【図５】別の実施形態を示す説明図である。FIG. 5 is an explanatory diagram showing another embodiment.

[Explanation of symbols]

１０…メインコンピュータ２０…バス３０…ディスクアレイ装置３１…Ａディスク装
置３２…Ｂディスク装置３３…Ｃディスク装
置３４…Ｄディスク装置３５…ディスクアレ
イコントローラ１０１〜１０４，１１１〜１１４，１２１〜１２４…デ
ィスク装置10 ... Main computer 20 ... Bus 30 ... Disk array device 31 ... A disk device 32 ... B disk device 33 ... C disk device 34 ... D disk device 35 ... Disk array controller 101-104, 111-114, 121-124 ... Disk apparatus

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｇ１１Ｂ 20/18 ５２２Ｇ１１Ｂ 20/18 ５２２Ｄ５７０５７０Ｚ５７２５７２Ｆ ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification number Office reference number FI Technical display location G11B 20/18 522 G11B 20/18 522D 570 570Z 572 572F

Claims

[Claims]

1. A disk array controller is configured to control a plurality of disk devices and respond to an access from a main computer by masquerading as one disk device. In a disk array device having an error correction function for error occurrence, when the disk array controller issues a read command to a plurality of disk devices in response to a request from the main computer, among the plurality of disk devices If a response from a part of the above does not occur within a predetermined time after issuing the command, the data corresponding to the disk device which does not respond is created by the error correction function and requested by the main computer. Disk, which is characterized in that it is configured to send Array device.

2. When the disk array controller issues a read command to a plurality of disk devices in response to a request from the main computer, the current read command is issued because the operation for the previous read command has not been completed yet. When there is a disk device that cannot be issued, data corresponding to the disk device for which the read command cannot be issued is created by the error correction function, and the requested data is sent to the main computer. The disk array device according to claim 1, wherein:

3. A predetermined number of data storage disk devices operating in parallel, and a parity in which a parity created for each parallel data recorded in a predetermined same storage position of each data storage disk device is recorded. A plurality of sets of disk devices each including a storage disk device are provided, and the disk controller is capable of performing error correction by a parity check method for each set of the disk devices. 1. The disk array device according to 1 or 2.