JPH09231014A

JPH09231014A - Defect processor for storage medium

Info

Publication number: JPH09231014A
Application number: JP8036710A
Authority: JP
Inventors: Shigeru Yato; 茂矢頭
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1996-02-23
Filing date: 1996-02-23
Publication date: 1997-09-05

Abstract

PROBLEM TO BE SOLVED: To make possible defect processing of a storage medium without damaging a real time property by restoring data for which an error is generated from the data of another storage medium other than the storage medium for which the error is generated. SOLUTION: A retrial control part writes the data to plural hard disks 9 for the data, writes the parity of the data to the other hard disk 10 for the parity, and when a write error is generated, retries by the minimum number of times of retrials. An error registration part stores information relating to the data unrestorable by the minimum number of times of the retrials in an error table 5 as the error. An alternate sector control part reads error information, restores the data for which the error is generated from the data of the plural hard disks 9 other than the hard disk for which the error is generated and the other hard disk 10, retries by the maximum number of times of the retrials and allocates an alternate sector to the data unrestorable by the maximum number of times of the retrials.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、例えば、ビデオサ
ーバーに用いるハードディスクの書き込み及び読み出し
のエラー回復を効率よくする記憶媒体の欠陥処理装置に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a storage medium defect processing device for efficiently performing writing and reading error recovery of a hard disk used in, for example, a video server.

【０００２】[0002]

【従来の技術】従来、多数の利用者の要求に基づいて、
個々の利用者に対して同時に種々の映像を供給するビデ
オサーバー装置があった。ビデオサーバー装置は、映像
ストリームの送出処理を行うビデオストリームサーバー
と、ナビゲーション機能など非リアルタイム処理を行う
アプリケーションサーバーとで構成される。2. Description of the Related Art Conventionally, based on the demands of many users,
There has been a video server device that simultaneously supplies various images to individual users. The video server device includes a video stream server that performs a process of transmitting a video stream, and an application server that performs non-real-time processing such as a navigation function.

【０００３】一方の、アプリケーションサーバーは、情
報をやりとりする上で中心的な役割を担うもので、情報
の受け渡しに必要なネットワークプロトコルの処理機能
を備えている。また、アプリケーションサーバーは、管
理データベースを有し、管理データベースに、アプリケ
ーションや管理情報、顧客情報などを管理するようにし
ている。[0003] On the other hand, an application server plays a central role in exchanging information, and has a network protocol processing function necessary for exchanging information. The application server has a management database, and manages applications, management information, customer information, and the like in the management database.

【０００４】他方の、ビデオストリームサーバーは、複
数のデータ記憶用のハードディスクと、１つのパリティ
記憶用のハードディスクと、これらのハードディスクに
対するデータの書き込みや読み出しの処理を行うＲＡＩ
Ｄ（ＲｅｄｕｎｄａｎｔＡｒｒａｙｓＩｎｅｘｐｅ
ｎｓｉｖｅＤｉｓｃｓ）コントローラと、データの変
換や並び替えを行ってデータ送出を行うプロセッサーと
を備えている。このビデオストリームサーバーをＲＡＩ
Ｄ装置と呼んでいる。このビデオストリームサーバーに
要求される条件として、時間連続性、同一ビデオの同時
平行提供、特殊再生処理の３つがあげられる。On the other hand, the video stream server has a plurality of hard disks for storing data, one hard disk for storing parity, and RAI for writing and reading data to and from these hard disks.
D (Redundant Arrays Inexpe
It is provided with a (native Discs) controller and a processor for converting and rearranging data and transmitting the data. RAI this video stream server
It is called D device. There are three requirements for this video stream server: time continuity, simultaneous provision of the same video in parallel, and special playback processing.

【０００５】ここでいう時間連続性とは、以下のことを
いう。つまり、ビデオストリームサーバーに蓄積される
データが、一度出力されると２時間連続して読み出しを
続けて、途切れないことをいう。また、同一ビデオの同
時平行提供とは、以下のことをいう。つまり、ビデオス
トリームサーバーには多くの利用者が平行して１つの映
像または種々の映像の要求を出してくる。この要求は完
全に非同期である。同じ映像を要求してもその要求の時
点は微妙にずれている。すなわち、同時に数百、数千の
映像ストリームを異なる頭出しで出力できなければなら
ない。従って、送出という点でリアルタイム性が要求さ
れる。また、特殊再生処理とは、以下のことをいう。つ
まり、一度映像をスタートしても利用者は途中で一時停
止したり、巻き戻し、スロー再生など、映像ストリーム
の制御は多様でなければならない。The term "time continuity" as used herein means the following. That is, it means that once the data accumulated in the video stream server is output, the data is continuously read for 2 hours without interruption. Further, the simultaneous provision of the same video in parallel means the following. That is, many users request one image or various images in parallel to the video stream server. This request is completely asynchronous. Even if the same image is requested, the time point of the request is slightly deviated. That is, hundreds or thousands of video streams must be output at different cue points at the same time. Therefore, real-time performance is required in terms of transmission. The special reproduction process means the following. That is, even if the video is started once, the user has to perform various control of the video stream, such as pause, rewind, and slow playback.

【０００６】このようなビデオストリームサーバーの動
作を以下に説明する。映像の記録時には、予めＭＰＥＧ
２などの高能率符号化された映像信号は、６４Ｋ〜２５
６Ｋバイト程度のセグメントデータに分割される。分割
されたセグメントデータは複数のデータ記憶用のハード
ディスクに蓄積される。複数のハードディスクに分散蓄
積することにより、同一映像に対する同時使用ユーザー
数を大きくすることができるからである。この分散蓄積
する方法として、ランダムレイアウトとストライピング
レイアウトがある。ランダムレイアウトはマルチレート
の映像の蓄積に適していて、ストライピングレイアウト
はディスク使用効率が高いという特徴がある。ストライ
ピングレイアウトで分散蓄積されたセグメントデータを
ストライピングユニットという。The operation of such a video stream server will be described below. Before recording the video, MPEG
A video signal encoded with high efficiency such as 2 is 64K to 25K.
It is divided into segment data of about 6 Kbytes. The divided segment data is stored in a plurality of data storage hard disks. This is because the number of simultaneous users for the same video can be increased by distributing and storing the data in a plurality of hard disks. There are a random layout and a striping layout as methods of this distributed storage. The random layout is suitable for storing multi-rate video, and the striping layout is characterized by high disk usage efficiency. The segment data distributed and accumulated in the striping layout is called a striping unit.

【０００７】映像の再生時には、ＲＡＩＤコントローラ
はストライピングユニットがどのディスクに蓄積されて
いるかというデータ配列を示すアロケーションテーブル
に従い、ストライピングユニットをハードディスクから
読み出す。そして、プロセッサーは読み出したストライ
ピングユニットを再生順序に並び替え、揺らぎを吸収し
てネットワークに送出する。また、スロー再生、早送
り、ポーズ、スキップなどの特殊再生機能は、ＲＡＩＤ
コントローラがストライピングユニットを読み出す速度
を調整することにより実現される。At the time of reproducing an image, the RAID controller reads the striping unit from the hard disk according to an allocation table showing a data array indicating on which disk the striping unit is stored. Then, the processor rearranges the read striping units in the reproduction order, absorbs fluctuations, and sends them to the network. In addition, special playback functions such as slow playback, fast forward, pause, skip, etc.
It is realized by adjusting the speed at which the controller reads the striping unit.

【０００８】また、ＲＡＩＤコントローラは、データ書
き込み時にはストライピングユニットを複数のハードデ
ィスクに分散させ、データ読み出し時には反対に分散さ
れたストライピングユニットを１つのデータストリーム
に集結させる。Further, the RAID controller distributes the striping units to a plurality of hard disks at the time of writing data, and condenses the striping units, which are distributed oppositely at the time of reading data, into one data stream.

【０００９】このような、ＲＡＩＤ装置では、複数のデ
ータ記憶用のハードディスクに通常のデータを記憶する
のとは別に、パリティを書き込むハードディスクを設け
ることによって、故障が発生した場合に、パリティ用の
ハードディスクのデータをもとにエラーが発生したデー
タを復旧するようにしている。また、個々のハードディ
スクで発生するアクセスエラーに対しては、エラー回復
に十分な回数のリトライを行うようにしていた。そし
て、このリトライでもエラー回復が不能の場合にセクタ
ー交替処理を行うようにしていた。In such a RAID device, a hard disk for writing parity is provided in addition to storing normal data in a plurality of hard disks for storing data, so that when a failure occurs, a hard disk for parity is used. The error data is recovered based on the data of. In addition, for an access error that occurs in each hard disk, a retry is performed a sufficient number of times to recover the error. Then, even in this retry, if the error recovery is impossible, the sector replacement process is performed.

【００１０】[0010]

【発明が解決しようとする課題】しかし、このような従
来のＲＡＩＤ装置では、アクセスエラーに対するエラー
回復処理にかかる時間が長くなって予測できない場合が
生じることがあり、このような場合には、上述したリア
ルタイム性が損なわれるという不都合があった。また、
ハードディスクの読み出しエラーが発生した場合に、リ
トライ回数を最小限に抑えて、規定回数以上になったら
リトライによる回復をせずに、パリティ用のハードディ
スクのデータをもとにエラーが発生したデータを復旧す
ることによって、読み出し時間の増加を防ぐことも考え
られる。しかし、読み出しエラーの原因が媒体の欠陥に
よるものなのか、偶発的な原因によるものなのかを見分
けることができないので、エラーの原因が媒体の欠陥に
よるものである場合であっても交替セクターを割り当て
てデータを再書き込みすることができないという不都合
があった。However, in such a conventional RAID device, there are cases in which the time required for error recovery processing for an access error becomes long and unpredictable. In such a case, the above-mentioned problem may occur. However, there was an inconvenience that the real-time property was lost. Also,
When a hard disk read error occurs, the number of retries is minimized, and when the number of retries exceeds the specified number, the error data is recovered based on the parity hard disk data without recovery by retry. By doing so, it is possible to prevent an increase in read time. However, since it is not possible to tell whether the read error is due to a media defect or an accidental cause, the replacement sector is allocated even if the error cause is due to a media defect. However, there is an inconvenience that the data cannot be rewritten.

【００１１】本発明は、かかる点を考慮してなされたも
のであり、リアルタイム性が損なわれることなく、記憶
媒体の欠陥処理を行うことができる記憶媒体の欠陥処理
装置の提供を目的とする。The present invention has been made in view of the above points, and an object of the present invention is to provide a storage medium defect processing apparatus capable of performing a defect processing of a storage medium without deteriorating the real-time property.

【００１２】[0012]

【課題を解決するための手段】この発明の記憶媒体の欠
陥処理装置は、複数の記憶媒体にデータを書き込むと共
に、他の記憶媒体に上記データのパリティを書き込み、
上記複数の記憶媒体及び上記他の記憶媒体にデータを書
き込むときに書き込みエラーが発生したら、最小のリト
ライ回数でリトライをするリトライ制御部と、上記最小
のリトライ回数で回復不可能なデータに関する情報をエ
ラーとしてエラーテーブルに記憶するエラー登録制御部
と、上記エラーテーブルに登録されたエラー情報を読み
出し、エラーの発生した記憶媒体以外の上記複数の記憶
媒体及び上記他の記憶媒体のデータからエラーの発生し
たデータを復元し、最大のリトライ回数でリトライを
し、上記最大のリトライ回数で回復不可能なデータに対
して交替セクターを割り当てる交替セクター制御部とを
備えたものである。A storage medium defect processing apparatus according to the present invention writes data to a plurality of storage media and at the same time writes the parity of the data to another storage medium.
If a write error occurs when writing data to the plurality of storage media and the other storage medium, a retry control unit that performs a retry with a minimum number of retries, and information regarding unrecoverable data with the minimum number of retries are provided. An error registration control unit that stores an error in the error table and error information registered in the error table is read, and an error occurs from the data of the plurality of storage media other than the storage medium in which the error has occurred and the data of the other storage media. The data is restored, the retry is performed with the maximum number of retries, and the alternate sector control unit assigns the alternate sector to the data that cannot be recovered with the maximum number of retries.

【００１３】また、この発明の記憶媒体の欠陥処理装置
は、複数の記憶媒体にデータが書き込まれると共に、他
の記憶媒体に上記データのパリティが書き込まれ、上記
複数の記憶媒体及び上記他の記憶媒体に書き込まれたデ
ータを読み出すときに読み出しエラーが発生したら、最
小のリトライ回数でリトライをするリトライ制御部と、
上記最小のリトライ回数で回復不可能なデータに関する
情報をエラーとしてエラーテーブルに記憶するエラー登
録制御部と、上記エラーテーブルに登録されたエラー情
報を読み出し、エラーの発生した記憶媒体及び上記他の
記憶媒体のデータを読み出すように、最大のリトライ回
数でリトライをし、上記最大のリトライ回数で回復不可
能なデータに対してエラーの発生した記憶媒体以外の上
記複数の記憶媒体及び上記他の記憶媒体のデータからエ
ラーの発生したデータを復元し、交替セクターを割り当
てる交替セクター制御部とを備えたものである。Further, in the storage medium defect processing apparatus of the present invention, data is written in a plurality of storage media and the parity of the data is written in another storage medium, and the plurality of storage media and the other storages are stored. If a read error occurs when reading the data written to the medium, a retry control unit that retries with the minimum number of retries,
An error registration control unit that stores information on unrecoverable data as an error in the error table with the minimum number of retries, and the error information registered in the error table to read the storage medium in which the error occurred and the other storages. A plurality of storage media other than the storage medium in which an error occurs with respect to the data which cannot be recovered by the maximum number of retries by performing a retry with the maximum number of retries so as to read the data of the medium and the other storage medium. And a replacement sector control unit for allocating replacement sectors by restoring the data in which an error has occurred from the above data.

【００１４】この発明の記憶媒体の欠陥処理装置によれ
ば、以下の作用をする。まず、書き込み動作を説明す
る。書き込み要求があると、書き込み動作を開始して、
リトライ制御部のデータ分割手段はデータをストライピ
ングユニットに分割する。リトライ制御部のストライピ
ングユニット書き込み手段はストライピングユニットを
データ記憶用のハードディスクに書く。この場合、複数
のデータ記憶用のハードディスクにストライピングユニ
ットを分散蓄積するように書き込み動作を行う。According to the defect processing apparatus for a storage medium of the present invention, the following effects are obtained. First, the write operation will be described. When there is a write request, the write operation is started,
The data dividing means of the retry control unit divides the data into striping units. The striping unit writing means of the retry controller writes the striping unit on the hard disk for data storage. In this case, the write operation is performed so that the striping units are dispersedly stored in a plurality of data storage hard disks.

【００１５】リトライ制御部の書き込みエラー検出手段
はエラーがあるか否かを判断する。エラーがあるときは
リトライ制御部の最小リトライ回数判定手段はリトライ
回数がオーバーしたか否かを判断する。この最小の回数
とは、許される実行時間の範囲内での最小の回数であ
る。リトライ回数がオーバーしたらエラー登録制御部の
エラーテーブル登録手段はエラー情報をエラーテーブル
に登録する。この場合、エラーテーブル登録手段は、ス
テータス信号からエラーの発生したハードディスクの番
号およびエラーの発生したセクターアドレスをエラーテ
ーブルに記憶する。The write error detection means of the retry controller determines whether or not there is an error. When there is an error, the minimum retry count determination means of the retry control unit determines whether the retry count has exceeded. The minimum number of times is the minimum number of times within the allowable execution time range. If the number of retries is exceeded, the error table registration means of the error registration control unit registers the error information in the error table. In this case, the error table registration means stores the number of the hard disk in which the error occurred and the sector address in which the error occurred in the error table from the status signal.

【００１６】エラーがないときはパリティ書き込み手段
はパリティ用ハードディスクにデータ用ハードディスク
に書き込んだデータの排他的論理和であるパリティを書
く。リトライ制御部の書き込みエラー検出手段はエラー
があるか否かを判断する。エラーがあるときはリトライ
制御部の最小リトライ回数判定手段はリトライ回数がオ
ーバーしたか否かを判断する。リトライ回数がオーバー
したらエラー登録制御部のエラーテーブル登録手段はエ
ラー情報をエラーテーブルに登録する。エラーテーブル
に登録したら、書き込み動作が終了か否かを判断する。
データの書き込みの処理とパリティの書き込みの処理は
並行して実行される。これで一連の番組等の素材書き込
みが終了する。When there is no error, the parity writing means writes the parity, which is the exclusive OR of the data written in the data hard disk, in the parity hard disk. The write error detection means of the retry control unit determines whether there is an error. When there is an error, the minimum retry count determination means of the retry control unit determines whether the retry count has exceeded. If the number of retries is exceeded, the error table registration means of the error registration control unit registers the error information in the error table. After registering in the error table, it is determined whether or not the writing operation is completed.
The data writing process and the parity writing process are executed in parallel. This completes the writing of materials such as a series of programs.

【００１７】以下に述べる処理は、空き時間や通常の処
理のバックグランド処理で実行される。交替セクター制
御部のエラーテーブル読み出し手段はエラーテーブルか
らエラーの発生したハードディスクのエラーの発生した
セクターアドレスを読む。交替セクター制御部のエラー
テーブル読み出し手段はエラーテーブルからエラーの発
生したハードディスクのドライブ番号を読む。交替セク
ター制御部のストライピングユニット復元手段はエラー
の発生したストライピングユニットをパリティ演算に基
づいて復元する。この場合、パリティ用ハードディスク
には、データ用ハードディスクに書き込んだデータの排
他的論理和が計算されて書き込まれているので、エラー
の発生した以外のデータ用ハードディスクに書き込んだ
データとパリティ用ハードディスクに書き込んだパリテ
ィとによりデータを復元することができる。このように
して、パリティ用のハードディスクも含み、エラーの発
生したハードディスク以外のハードディスクのデータか
らデータを再構築する。The processing described below is executed in the idle time or in the background processing of normal processing. The error table reading means of the alternate sector control unit reads the error sector address of the hard disk in which the error occurred from the error table. The error table reading means of the alternate sector control unit reads the drive number of the hard disk in which the error occurred from the error table. The striping unit restoring means of the alternate sector control unit restores the striping unit in which the error has occurred based on the parity operation. In this case, since the exclusive OR of the data written to the data hard disk is calculated and written to the parity hard disk, the data written to the data hard disk other than the one in which the error occurred and the parity hard disk are written. Data can be restored by using the parity. In this way, the data is reconstructed from the data of the hard disks other than the hard disk in which the error has occurred, including the hard disk for parity.

【００１８】交替セクター制御部の復元ストライピング
ユニット書き込み手段は復元したストライピングユニッ
トをエラーの発生したハードディスクのエラーの発生し
たセクターアドレスに書く。交替セクター制御部の書き
込みエラー検出手段はエラーの発生したハードディスク
に対してデータ書き込みの再度のエラーがあるか否かを
判断する。エラーがあるときは交替セクター制御部の最
大リトライ回数判定手段は書き込みエラーに対して最大
に設定されたリトライ回数がオーバーしたか否かを判断
する。この最大の回数とは、ハードディスクの特性に応
じて決められる書込みエラーを回復するのに充分な回数
である。The restoration striping unit writing means of the alternate sector control unit writes the restored striping unit to the sector address where the error occurred in the hard disk where the error occurred. The write error detection means of the alternate sector control unit determines whether or not there is another data write error in the hard disk in which the error has occurred. When there is an error, the maximum retry count determination means of the alternate sector control unit determines whether or not the maximum retry count for the write error has been exceeded. The maximum number of times is a sufficient number of times for recovering the write error determined according to the characteristics of the hard disk.

【００１９】リトライ回数がオーバーしたら交替セクタ
ー制御部の交替セクター割り当て手段は書き込みエラー
の発生したセクターを欠陥セクターとして処理して交替
セクターを割り当てる。このような交替処理はハードデ
ィスクにエラー回復のために十分なリトライ回数を指定
し、自動交替処理を許可したモードで、ライトコマンド
を発行することにより実現される。エラーがないときは
エラーテーブルのすべてのセクターについてエラー回復
処理が終了か否かを判断する。エラーテーブルが終了の
ときは書き込み動作を終了する。When the number of retries is exceeded, the replacement sector allocating means of the replacement sector control unit processes the sector in which the write error has occurred as a defective sector and allocates the replacement sector. Such a replacement process is realized by designating a sufficient number of retries for error recovery on the hard disk and issuing a write command in a mode in which the automatic replacement process is permitted. When there is no error, it is determined whether or not the error recovery processing is completed for all the sectors in the error table. When the error table is finished, the write operation is finished.

【００２０】次に、読み出し動作を説明する。読み出し
要求があると、読み出し動作を開始して、リトライ制御
部のストライピングユニット読み出し手段は複数のデー
タ記憶用のハードディスクに分散蓄積されたストライピ
ングユニットの読み出し動作を行う。リトライ制御部の
読み出しエラー検出手段は複数のデータ記憶用のハード
ディスクに対するデータ読み出しにエラーがあるか否か
を判断する。リトライ制御部の最小リトライ回数判定手
段は、読み出しエラーに対して最小に設定されたリトラ
イ回数をオーバーしたか否かを判断する。この最小の回
数とは、許される実行時間の範囲内での最大の回数であ
る。Next, the read operation will be described. When a read request is issued, the read operation is started, and the striping unit read means of the retry control unit performs the read operation of the striping units distributed and accumulated in the plurality of hard disks for storing data. The read error detection means of the retry control unit determines whether or not there is an error in reading data from a plurality of hard disks for storing data. The minimum retry number determination means of the retry control unit determines whether or not the retry number set to the minimum for the read error is exceeded. The minimum number of times is the maximum number of times within the allowable execution time range.

【００２１】最小リトライ回数をオーバーしているとき
は、エラー登録制御部のエラーテーブル登録手段はステ
ータス信号に基づいてエラーの発生したハードディスク
の番号およびエラーの発生したセクターアドレスをエラ
ーテーブルに記憶する。エラー登録制御部のストライピ
ングユニット復元手段はエラーの発生したストライピン
グユニットをパリティ演算に基づいて復元する。エラー
登録制御部のデータ合成手段はストライピングユニット
に分割されていたデータから元のデータを合成する。こ
の映像データはデータ供給源に供給される。これによ
り、合成された映像データは送出される。これで一連の
番組等の素材読み出しが終了する。If the minimum number of retries has been exceeded, the error table registration means of the error registration control unit stores the number of the hard disk in which the error occurred and the sector address in which the error occurred in the error table based on the status signal. The striping unit restoring means of the error registration control unit restores the striping unit in which the error has occurred based on the parity operation. The data synthesizing means of the error registration control unit synthesizes the original data from the data divided into the striping units. This video data is supplied to the data supply source. As a result, the combined video data is transmitted. This completes the reading of materials such as a series of programs.

【００２２】以下に述べる処理は、空き時間や通常の処
理のバックグランド処理で実行される。交替セクター制
御部のエラーテーブル読み出し手段は、エラーテーブル
からエラーの発生したハードディスクのエラーの発生し
たセクターアドレスを読み出す。エラーテーブル読み出
し手段は、エラーテーブルからエラーの発生したハード
ディスクの番号を読み出す。ストライピングユニット読
み出し手段は、エラーの発生したハードディスクに対す
るデータの再読み出しを行う。交替セクター制御部の読
み出しエラー検出手段は、エラーの発生したハードディ
スクに対するデータ読み出しにエラーがあるか否かを判
断する。The processing described below is executed in the idle time or in the background processing of normal processing. The error table reading means of the alternate sector control unit reads the errored sector address of the hard disk where the error occurred from the error table. The error table reading means reads the number of the hard disk in which the error occurred from the error table. The striping unit reading means rereads the data from the hard disk in which the error has occurred. The read error detection means of the alternate sector control unit determines whether or not there is an error in reading data from the hard disk in which the error has occurred.

【００２３】交替セクター制御部の最大リトライ回数判
定手段は、読み出しエラーに対して最大に設定されたリ
トライ回数をオーバーしたか否かを判断する。このよう
な処理および判断は、パリティ用ハードディスクも含み
エラーの起きたハードディスクのセクタに対して、エラ
ー回復に十分なリトライ回数の指定してリードコマンド
を発行することにより実現される。最大リトライ回数を
オーバーしたときは交替セクター制御部のストライピン
グユニット復元手段は、エラーの発生したストライピン
グユニットをパリティ演算に基づいて復元する。この場
合、パリティ用ハードディスクも含み、エラーの発生し
たハードディスク以外のハードディスクのデータから、
ストライピングユニットを復元する。The maximum retry count judging means of the replacement sector control unit judges whether or not the maximum retry count has been set for a read error. Such processing and determination are realized by designating a sufficient number of retries for error recovery and issuing a read command to a sector of a hard disk in which an error has occurred, including a hard disk for parity. When the maximum number of retries is exceeded, the striping unit restoring means of the alternate sector control unit restores the striping unit in which the error has occurred based on the parity operation. In this case, including the hard disk for parity, from the data of the hard disk other than the hard disk where the error occurred,
Restore the striping unit.

【００２４】交替セクター制御部の交替セクター割り当
て手段は、書き込みエラーの発生したセクターを欠陥セ
クターとして処理し、交替セクターを割り当てる。交替
セクター制御部の復元ストライピングユニット書き込み
手段は、復元したストライピングユニットをエラーの発
生したハードディスクのエラーの発生したセクターアド
レスに書き込む。このような処理は、リードエラーの発
生したハードディスクにエラー回復に十分なリトライ回
数を指定し、自動交替処理を許可したモードで、ライト
コマンドを発行することにより実行される。エラーテー
ブルのすべてのセクターについてエラー回復処理が終了
か否かを判断する。エラーテーブルが終了のときは読み
出し動作を終了する。The replacement sector allocation means of the replacement sector control unit processes the sector in which the write error has occurred as a defective sector and allocates the replacement sector. The restoration striping unit writing means of the alternate sector control unit writes the restored striping unit to the sector address where the error occurred on the hard disk where the error occurred. Such processing is executed by designating a hard disk in which a read error has occurred with a sufficient number of retries for error recovery and issuing a write command in a mode in which automatic replacement processing is permitted. It is determined whether or not the error recovery processing is completed for all the sectors in the error table. When the error table is finished, the read operation is finished.

【００２５】[0025]

【発明の実施の形態】以下、本実施例について説明す
る。まず、本実施例の構成を説明する。図１に示す本実
施例の記憶媒体の欠陥処理装置はビデオサーバー装置に
おけるビデオストリームサーバーに相当する。この記憶
媒体の欠陥処理装置は、複数のデータ用のハードディス
ク９と、１つのパリティ用のハードディスク１０と、こ
れらのハードディスクに対するデータの書き込みや読み
出しの処理を行うＲＡＩＤ（ＲｅｄｕｎｄａｎｔＡｒ
ｒａｙｓＩｎｅｘｐｅｎｓｉｖｅＤｉｓｃｓ）コン
トローラ２と、データの変換や並び替えを行ってデータ
送出を行うプロセッサー１とを備えている。このビデオ
ストリームサーバーをＲＡＩＤ装置と呼んでいる。この
ビデオストリームサーバーに要求される条件として、時
間連続性、同一ビデオの同時平行提供、特殊再生処理の
３つがあげられる。この３つの条件については、従来の
技術において説明した通りである。BEST MODE FOR CARRYING OUT THE INVENTION This embodiment will be described below. First, the configuration of the present embodiment will be described. The storage medium defect processing apparatus of this embodiment shown in FIG. 1 corresponds to a video stream server in a video server apparatus. This storage medium defect processing apparatus includes a plurality of hard disks 9 for data, a single hard disk 10 for parity, and a RAID (Redundant Ar) for writing and reading data to and from these hard disks.
It is provided with a ray Inexpensive Discs) controller 2 and a processor 1 for transmitting data by converting and rearranging data. This video stream server is called a RAID device. There are three requirements for this video stream server: time continuity, simultaneous provision of the same video in parallel, and special playback processing. The three conditions are as described in the related art.

【００２６】ここで、データ用ハードディスク９は記憶
媒体、パリティ用ハードディスク１０は他の記憶媒体を
構成する。ＲＡＩＤコントローラ２は、制御を司るＣＰ
Ｕ３と、制御プログラムを記憶するＲＯＭ４と、制御デ
ータおよびエラーテーブル５を記憶するＲＡＭ６とを有
する。ＲＡＩＤコントローラ２は、機能ブロックで示す
と、書き込み制御部７および読み出し制御部８とを有す
る。書き込み制御部７はハードディスクに対するデータ
の書き込みを制御する。読み出し制御部８はハードディ
スクに対するデータの読み出しを制御する。The data hard disk 9 constitutes a storage medium, and the parity hard disk 10 constitutes another storage medium. The RAID controller 2 is a CP that controls the control.
It has a U3, a ROM 4 for storing a control program, and a RAM 6 for storing control data and an error table 5. The RAID controller 2 has a write control unit 7 and a read control unit 8 in functional blocks. The write controller 7 controls writing of data to the hard disk. The read control unit 8 controls the reading of data from the hard disk.

【００２７】次に書き込み制御部の構成を図２に基づい
て説明する。書き込み制御部７は、機能ブロックで示す
と、リトライ制御部１１と、エラー登録制御部１２と、
交替セクター制御部１３とを有する。リトライ制御部１
１は、複数のデータ用ハードディスク９にデータを書き
込むと共に、他のパリティ用ハードディスク１０にこの
データのパリティを書き込み、複数のデータ用ハードデ
ィスク９および他のパリティ用ハードディスク１０にデ
ータを書き込むときに書き込みエラーが発生したら、最
小のリトライ回数でリトライをする制御を行う。エラー
登録制御部１２は、最小のリトライ回数で回復不可能な
データに関する情報をエラーとしてエラーテーブル５に
記憶する制御を行う。交替セクター制御部１３は、エラ
ーテーブル５に登録されたエラー情報を読み出し、エラ
ーの発生したハードディスク以外の複数のハードディス
ク９及び他のハードディスク１０のデータからエラーの
発生したデータを復元し、最大のリトライ回数でリトラ
イをし、最大のリトライ回数で回復不可能なデータに対
して交替セクターを割り当てる制御を行う。Next, the structure of the write controller will be described with reference to FIG. The write control unit 7 is, in terms of functional blocks, a retry control unit 11, an error registration control unit 12,
The replacement sector control unit 13 is included. Retry controller 1
1 is a write error when writing data to a plurality of data hard disks 9 and at the same time writing the parity of this data to another parity hard disk 10 and writing data to the plurality of data hard disks 9 and another parity hard disk 10. When occurs, control is performed to retry with the minimum number of retries. The error registration control unit 12 performs control to store information regarding unrecoverable data as an error in the error table 5 with the minimum number of retries. The alternate sector control unit 13 reads the error information registered in the error table 5, restores the error data from the data of the plurality of hard disks 9 and other hard disks 10 other than the hard disk in which the error occurred, and performs the maximum retry. The control is performed by retrying the number of times and assigning the replacement sector to the data that cannot be recovered by the maximum number of times of retrying.

【００２８】リトライ制御部１１は、データ分割手段１
４と、ストライピングユニット書き込み手段１５と、パ
リティ書き込み手段１６と、書き込みエラー検出手段１
７と、最小リトライ回数判定手段１８とを有する。デー
タ分割手段１４は、予めＭＰＥＧ２などの高能率符号化
されたデータを所定の単位のストライピングユニットに
分割する。ストライピングユニットは複数のデータ用の
ハードディスク９に分散蓄積するセグメントデータの一
群である。分散蓄積する理由は、複数のデータ用ハード
ディスク９に分散蓄積することにより、同一映像に対す
る同時使用ユーザー数を大きくすることができるからで
ある。この分散蓄積する方法として、ランダムレイアウ
トとストライピングレイアウトがある。ランダムレイア
ウトはマルチレートの映像の蓄積に適していて、ストラ
イピングレイアウトはディスク使用効率が高いという特
徴がある。ストライピングレイアウトで分散蓄積された
セグメントデータをストライピングユニットという。The retry control section 11 includes the data dividing means 1
4, striping unit write means 15, parity write means 16, and write error detection means 1
7 and a minimum retry count determination means 18. The data dividing unit 14 divides the data, which has been encoded with high efficiency such as MPEG2 in advance, into striping units of a predetermined unit. The striping unit is a group of segment data which is distributed and accumulated in the hard disk 9 for a plurality of data. The reason for the distributed storage is that the number of simultaneous users for the same video can be increased by the distributed storage in the plurality of data hard disks 9. There are a random layout and a striping layout as methods of this distributed storage. The random layout is suitable for storing multi-rate video, and the striping layout is characterized by high disk usage efficiency. The segment data distributed and accumulated in the striping layout is called a striping unit.

【００２９】ストライピングユニット書き込み手段１５
は、複数のデータ用のハードディスク９にストライピン
グユニットを分散蓄積するように書き込み動作を行う。
この書き込み動作は、プロセッサー１から供給されるＳ
ＣＳＩコマンドに基づいて行われる。パリティ書き込み
手段１６は、パリティ用ハードディスク１０に、データ
用ハードディスク９に書き込んだデータの排他的論理和
を書き込む。この書き込み動作も、プロセッサー１から
供給されるＳＣＳＩコマンドに基づいて行われる。Striping unit writing means 15
Performs a write operation so as to disperse and store striping units in a plurality of hard disks 9 for data.
This write operation is performed by S supplied from the processor 1.
It is performed based on the CSI command. The parity writing means 16 writes the exclusive OR of the data written in the data hard disk 9 to the parity hard disk 10. This write operation is also performed based on the SCSI command supplied from the processor 1.

【００３０】書き込みエラー検出手段１７は、複数のデ
ータ用のハードディスク９に対するデータ書き込みのエ
ラーを検出する。このエラー検出動作は、ＳＣＳＩコマ
ンドに対するステータス信号を検出することにより行わ
れる。最小リトライ回数判定手段１８は、書き込みエラ
ーに対して最小に設定されたリトライ回数の書き込みリ
トライを実行したことを判定する。最小に設定されたリ
トライ回数は、ＳＣＳＩコマンドに対するモードパラメ
ータに基づいて設定される。またこのリトライ回数をオ
ーバーしたことの判定はモードパラメータおよびステー
タス信号を検出することにより行われる。この最小の回
数とは、許される実行時間の範囲内での最大の回数であ
る。The write error detection means 17 detects a data write error in the hard disk 9 for a plurality of data. This error detection operation is performed by detecting the status signal for the SCSI command. The minimum retry count determination means 18 determines that the write retry has been executed with the retry count set to the minimum with respect to the write error. The minimum number of retries is set based on the mode parameter for the SCSI command. The determination that the number of retries has been exceeded is made by detecting the mode parameter and the status signal. The minimum number of times is the maximum number of times within the allowable execution time range.

【００３１】エラー登録制御部１２はエラーテーブル登
録手段１９を有する。エラーテーブル登録手段１９は、
ステータス信号からエラーの発生したハードディスクの
番号およびエラーの発生したセクターアドレスをエラー
テーブルに記憶する。このように最小のリトライ後にエ
ラーテーブルに登録する理由は、リトライ回数を多くす
ると書き込み実行時間が増えるからであり、少なくする
とリトライによって回復できるセクターも欠陥セクター
と判断されるため交替セクターの数が増えるからであ
る。The error registration control section 12 has an error table registration means 19. The error table registration means 19
The number of the hard disk where the error occurred and the sector address where the error occurred are stored in the error table from the status signal. The reason for registering in the error table after the minimum number of retries is that the write execution time increases as the number of retries increases. If the number of retries increases, the number of replacement sectors increases because it is determined that the sectors that can be recovered by retries are defective. Because.

【００３２】交替セクター制御部１３は、エラーテーブ
ル読み出し手段２０と、ストライピングユニット復元手
段２１と、復元ストライピングユニット書き込み手段２
２と、書き込みエラー検出手段２３と、最大リトライ回
数判定手段２４と、交替セクター割り当て手段２５とを
有する。エラーテーブル読み出し手段２０は、エラーテ
ーブル５からエラーの発生したハードディスクの番号お
よびエラーの発生したセクターアドレスを読み出す。ス
トライピングユニット復元手段２１は、エラーの発生し
たストライピングユニットをパリティ演算に基づいて復
元する。つまり、パリティ用ハードディスク１０には、
データ用ハードディスク９に書き込んだデータの排他的
論理和が計算されて書き込まれている。従って、エラー
の発生した以外のデータ用ハードディスク９に書き込ん
だデータとパリティ用ハードディスク１０に書き込んだ
パリティとによりデータを復元することができる。The alternate sector control unit 13 includes an error table reading means 20, a striping unit restoring means 21, and a restoring striping unit writing means 2.
2, a write error detection means 23, a maximum retry count determination means 24, and an alternate sector allocation means 25. The error table reading means 20 reads the number of the hard disk in which the error occurred and the sector address in which the error occurred from the error table 5. The striping unit restoring means 21 restores the striping unit in which the error has occurred based on the parity calculation. That is, in the parity hard disk 10,
The exclusive OR of the data written in the data hard disk 9 is calculated and written. Therefore, the data can be restored by the data written in the data hard disk 9 and the parity written in the parity hard disk 10 except for the error.

【００３３】復元ストライピングユニット書き込み手段
２２は、復元したストライピングユニットをエラーの発
生したハードディスクのエラーの発生したセクターアド
レスに書き込む。この書き込み動作は、プロセッサーか
ら供給されるＳＣＳＩコマンドに基づいて行われる。書
き込みエラー検出手段２３は、エラーの発生したハード
ディスクに対するデータ書き込みのエラーを検出する。
このエラー検出動作は、ＳＣＳＩコマンドに対するステ
ータス信号を検出することにより行われる。The restoration striping unit writing means 22 writes the restored striping unit to the sector address where the error occurred in the hard disk where the error occurred. This write operation is performed based on the SCSI command supplied from the processor. The write error detection means 23 detects an error in writing data to the hard disk in which the error has occurred.
This error detection operation is performed by detecting the status signal for the SCSI command.

【００３４】最大リトライ回数判定手段２４は、書き込
みエラーに対して最大に設定されたリトライ回数の書き
込みリトライを実行したことを判定する。最大に設定さ
れたリトライ回数は、ＳＣＳＩコマンドに対するモード
パラメータに基づいて設定される。またこのリトライ回
数をオーバーしたことの判定はステータス信号を検出す
ることにより行われる。この最大の回数とは、ハードデ
ィスクのエラー回復に充分なリトライ回数である。交替
セクター割り当て手段は、書き込みエラーの発生したセ
クターを欠陥セクターとして処理し、交替セクターを割
り当てる。The maximum retry count judging means 24 judges that the write retry of the maximum retry count has been executed for the write error. The maximum number of retries is set based on the mode parameter for the SCSI command. The determination that the number of retries has been exceeded is made by detecting the status signal. This maximum number is the number of retries sufficient for error recovery of the hard disk. The replacement sector allocation means processes a sector in which a write error has occurred as a defective sector and allocates a replacement sector.

【００３５】このようにして、交替セクター制御部１３
は、空き時間にエラーテーブル５に登録されたセクター
アドレスで示されるセクターのデータを読み出して正し
いデータを復元し、エラーの起きたハードディスクに十
分なリトライ回数で再書き込みを行う。そこで、再書き
込みで回復できなかった場合に欠陥セクターと判断して
交替セクターを割り当てて再書き込みを行う。In this way, the replacement sector control unit 13
Reads the data of the sector indicated by the sector address registered in the error table 5 in the idle time, restores the correct data, and rewrites the hard disk in which the error occurred with a sufficient number of retries. Therefore, when the recovery cannot be performed by the rewriting, the defective sector is determined and the replacement sector is assigned to perform the rewriting.

【００３６】次に読み出し制御部の構成を図３に基づい
て説明する。読み出し制御部８は、機能ブロックで示す
と、リトライ制御部２６と、エラー登録制御部２７と、
交替セクター制御部２８とを有する。リトライ制御部２
６は、複数のデータ用ハードディスク９にデータが書き
込まれると共に、他のパリティ用ハードディスク１０に
このデータのパリティが書き込まれ、複数のデータ用ハ
ードディスク９および他のパリティ用ハードディスク１
０に書き込まれたデータを読み出すときに読み出しエラ
ーが発生したら、最小のリトライ回数でリトライをする
制御を行う。Next, the structure of the read control unit will be described with reference to FIG. The read control unit 8 is, in terms of functional blocks, a retry control unit 26, an error registration control unit 27,
The replacement sector control unit 28. Retry control unit 2
6, the data is written to the plurality of data hard disks 9 and the parity of this data is written to the other parity hard disks 10, and the plurality of data hard disks 9 and the other parity hard disks 1 are written.
If a read error occurs when reading the data written in 0, control is performed to retry with the minimum number of retries.

【００３７】エラー登録制御部２７は、最小のリトライ
回数で回復不可能なデータに関する情報をエラーとして
エラーテーブル５に記憶する制御を行う。交替セクター
制御部２８は、エラーテーブル５に登録されたエラー情
報を読み出し、エラーの発生したデータ用ハードディス
ク９及び他のパリティ用ハードディスク１０のデータを
読み出すように、最大のリトライ回数でリトライをし、
最大のリトライ回数で回復不可能なデータに対してエラ
ーの発生したハードディスク以外の複数のデータ用ハー
ドディスク９及び他のパリティ用ハードディスク１０の
データからエラーの発生したデータを復元し、交替セク
ターを割り当てる制御を行う。The error registration control section 27 performs control to store information regarding unrecoverable data in the error table 5 as an error with the minimum number of retries. The alternate sector control unit 28 reads the error information registered in the error table 5 and retries with the maximum number of retries so as to read the data of the data hard disk 9 and the other parity hard disk 10 in which the error has occurred,
Control for allocating an alternate sector by restoring the error data from the data of a plurality of data hard disks 9 other than the error hard disk and the other parity hard disk 10 for the unrecoverable data with the maximum number of retries I do.

【００３８】リトライ制御部２６は、ストライピングユ
ニット読み出し手段２９と、読み出しエラー検出手段３
０と、最小リトライ回数判定手段３１とを有する。スト
ライピングユニット読み出し手段２９は、複数のデータ
用のハードディスク９に分散蓄積されたストライピング
ユニットの読み出し動作を行う。この読み出し動作は、
プロセッサー１から供給されるＳＣＳＩコマンドに基づ
いて行われる。読み出しエラー検出手段３０は、複数の
データ用のハードディスク９に対するデータ読み出しの
エラーを検出する。このエラー検出動作は、ＳＣＳＩコ
マンドに対するステータス信号を検出することにより行
われる。最小リトライ回数判定手段３１は、読み出しエ
ラーに対して最小に設定されたリトライ回数の読み出し
リトライを実行したことを判定する。最小に設定された
リトライ回数は、ＳＣＳＩコマンドに対するモードパラ
メータに基づいて設定される。またこのリトライ回数を
オーバーしたことの判定はステータス信号を検出するこ
とにより行われる。The retry controller 26 includes a striping unit reading means 29 and a read error detecting means 3.
It has 0 and the minimum retry count judging means 31. The striping unit reading means 29 performs the reading operation of the striping units that are distributed and accumulated in the plurality of data hard disks 9. This read operation is
This is performed based on the SCSI command supplied from the processor 1. The read error detection means 30 detects a data read error with respect to the plurality of data hard disks 9. This error detection operation is performed by detecting the status signal for the SCSI command. The minimum retry count determination means 31 determines that the read retry with the retry count set to the minimum for the read error has been executed. The minimum number of retries is set based on the mode parameter for the SCSI command. The determination that the number of retries has been exceeded is made by detecting the status signal.

【００３９】エラー登録制御部２７はエラーテーブル登
録手段３２と、ストライピングユニット復元手段３３
と、データ合成手段３４とを有する。エラーテーブル登
録手段３２は、ステータス信号からエラーの発生したハ
ードディスクの番号およびエラーの発生したセクターア
ドレスをエラーテーブルに記憶する。ストライピングユ
ニット復元手段３３は、エラーの発生したストライピン
グユニットをパリティ演算に基づいて復元する。データ
合成手段３４は、ストライピングユニットに分割されて
いたデータから元のデータを合成する。これにより、合
成されたデータが送出される。The error registration control section 27 includes an error table registration means 32 and a striping unit restoration means 33.
And data synthesizing means 34. The error table registration means 32 stores the number of the hard disk where the error occurred and the sector address where the error occurred from the status signal in the error table. The striping unit restoring means 33 restores the striping unit in which the error has occurred based on the parity calculation. The data synthesizing means 34 synthesizes original data from the data divided into striping units. As a result, the combined data is transmitted.

【００４０】このようにして、リトライ制御部２６は、
プロセッサー１からの読み出し要求に対して、ストライ
ピングユニットを読み出して、本来のデータに構築し直
してプロセッサー１に返す。そして、エラーが発生した
場合はパリティ演算によりストライピングユニットを復
元してから、本来のデータに構築し直してプロセッサー
１に返す。In this way, the retry controller 26
In response to a read request from the processor 1, the striping unit is read, reconstructed into the original data and returned to the processor 1. When an error occurs, the striping unit is restored by the parity operation, the original data is reconstructed, and the data is returned to the processor 1.

【００４１】交替セクター制御部２８は、エラーテーブ
ル読み出し手段３５と、ストライピングユニット読み出
し手段３６と、読み出しエラー検出手段３７と、最大リ
トライ回数判定手段３８と、ストライピングユニット復
元手段３９と、交替セクター割り当て手段４０と、復元
ストライピングユニット書き込み手段４１とを有する。
エラーテーブル読み出し手段３５は、エラーテーブル５
からエラーの発生したハードディスクの番号およびエラ
ーの発生したセクターアドレスを読み出す。ストライピ
ングユニット読み出し手段３６は、エラーの発生したハ
ードディスクに対するデータの再読み出しを行う。読み
出しエラー検出手段３７は、エラーの発生したハードデ
ィスクに対するデータ読み出しのエラーを検出する。こ
のエラー検出動作は、ＳＣＳＩコマンドに対するステー
タス信号を検出することにより行われる。The replacement sector control unit 28 includes an error table reading unit 35, a striping unit reading unit 36, a reading error detecting unit 37, a maximum retry number judging unit 38, a striping unit restoring unit 39, and a replacement sector allocating unit. 40 and restoration striping unit writing means 41.
The error table reading means 35 uses the error table 5
Read the hard disk number where the error occurred and the sector address where the error occurred. The striping unit reading means 36 rereads the data from the hard disk in which the error has occurred. The read error detection means 37 detects an error in reading data from the hard disk in which the error has occurred. This error detection operation is performed by detecting the status signal for the SCSI command.

【００４２】最大リトライ回数判定手段３８は、読み出
しエラーに対して最大に設定されたリトライ回数の読み
出しリトライを実行したことを判定する。最大に設定さ
れたリトライ回数は、ＳＣＳＩコマンドに対するモード
パラメータに基づいて設定される。またこのリトライ回
数をオーバーしたことの判定はステータス信号を検出す
ることにより行われる。ストライピングユニット復元手
段３９は、エラーの発生したストライピングユニットを
パリティ演算に基づいて復元する。The maximum retry count judging means 38 judges that the read retry with the maximum retry count for the read error has been executed. The maximum number of retries is set based on the mode parameter for the SCSI command. The determination that the number of retries has been exceeded is made by detecting the status signal. The striping unit restoring means 39 restores the striping unit in which the error has occurred based on the parity calculation.

【００４３】交替セクター割り当て手段４０は、書き込
みエラーの発生したセクターを欠陥セクターとして処理
し、交替セクターを割り当てる。復元ストライピングユ
ニット書き込み手段４１は、復元したストライピングユ
ニットをエラーの発生したハードディスクのエラーの発
生したセクターアドレスに書き込む。この書き込み動作
は、プロセッサー１から供給されるＳＣＳＩコマンドに
基づいて行われる。The replacement sector allocation means 40 processes the sector in which the write error has occurred as a defective sector and allocates a replacement sector. The restoration striping unit writing means 41 writes the restored striping unit to the sector address where the error occurred in the hard disk where the error occurred. This write operation is performed based on the SCSI command supplied from the processor 1.

【００４４】このようにして、交替セクター制御部２８
は、空き時間にエラーテーブル５に登録されたハードデ
ィスクのセクターアドレスで示されるデータを読み出
す。このとき規定のリトライ回数で回復できる場合は、
正常なセクターとして判断する。そして、回復できない
場合は欠陥セクターと判断して交替セクターを割り当
て、正しいデータを復元して交替セクターに書き込む。In this way, the replacement sector control unit 28
Reads out the data indicated by the sector address of the hard disk registered in the error table 5 in the free time. At this time, if you can recover with the specified number of retries,
Judge as a normal sector. If it cannot be recovered, it is determined as a defective sector, a replacement sector is assigned, and correct data is restored and written in the replacement sector.

【００４５】このように構成された本実施例の記憶媒体
の欠陥処理装置の動作を図１のブロック図、図６乃至図
９のフローチャートを参照しながら説明する。まず、図
６および図７を参照しながら書き込み動作を説明する。
図６および図７に示したフローチャートは図２に示した
書き込み制御部の動作に対応する。ホストコンピュータ
から書き込み要求があると、図１に示したプロセッサー
にコマンドおよび映像データが供給される。プロセッサ
ー１は供給されたコマンドをＳＣＳＩコマンドに変換す
る。映像データはディジタルでＤ１あるいはＤ２フォー
マットでフォーマッティングされている。映像データは
予め圧縮処理されている場合と圧縮処理がされていない
場合があり、圧縮処理がされていない場合にはプロセッ
サー１は映像データをＭＰＥＧ２により圧縮処理する。
プロセッサー１は映像データを書き込むためのファイル
システムを生成する。つまり、どの映像データのどのブ
ロックサイズをどのブロックアドレスに転送するかとい
うファイルシステムを作る。The operation of the storage medium defect processing apparatus of the present embodiment thus constructed will be described with reference to the block diagram of FIG. 1 and the flow charts of FIGS. First, the write operation will be described with reference to FIGS. 6 and 7.
The flowcharts shown in FIGS. 6 and 7 correspond to the operation of the write control unit shown in FIG. When a write request is issued from the host computer, the command and video data are supplied to the processor shown in FIG. The processor 1 converts the supplied command into a SCSI command. The video data is digitally formatted in the D1 or D2 format. The video data may be compressed in advance or may not be compressed in advance. When the compression process is not performed, the processor 1 compresses the video data by MPEG2.
The processor 1 creates a file system for writing video data. In other words, a file system is created for which block size of which video data is transferred to which block address.

【００４６】プロセッサー１はこのように変換処理され
たＳＣＳＩコマンドと映像データとをそれぞれフェーズ
を分けてＲＡＩＤコントローラ２に供給する。ＲＡＩＤ
コントローラ２の書き込み動作を図２に示した書き込み
制御部の機能ブロック図に対応させながら説明する。The processor 1 supplies the converted SCSI command and video data to the RAID controller 2 in each phase. RAID
The write operation of the controller 2 will be described with reference to the functional block diagram of the write controller shown in FIG.

【００４７】図６において、書き込み動作を開始して、
ステップＳ１でリトライ制御部１１のデータ分割手段１
４はデータをストライピングユニットに分割する。スト
ライピングユニットは、所定単位としたが、バイト単
位、ビット単位、セクター単位でも良く、これが書き込
みをする単位となる。ステップＳ２でストライピングユ
ニット書き込み手段１５はストライピングユニットをデ
ータ用のハードディスク９に書く。この場合、複数のデ
ータ用のハードディスク９にストライピングユニットを
分散蓄積するように書き込み動作を行う。この書き込み
動作は、プロセッサー１から供給されるＳＣＳＩコマン
ドに基づいて行われる。In FIG. 6, after starting the write operation,
In step S1, the data dividing means 1 of the retry control unit 11
4 divides the data into striping units. The striping unit is a predetermined unit, but may be a byte unit, a bit unit, or a sector unit, and this is a unit for writing. In step S2, the striping unit writing means 15 writes the striping unit on the data hard disk 9. In this case, the write operation is performed so as to disperse and store the striping units in the plurality of data hard disks 9. This write operation is performed based on the SCSI command supplied from the processor 1.

【００４８】ステップＳ３で書き込みエラー検出手段１
７はエラーがあるか否かを判断する。このエラー検出動
作は、複数のデータ用のハードディスク９についてＳＣ
ＳＩコマンドに対するステータス信号を検出することに
より行われる。ステップＳ３でエラーがあるときはステ
ップＳ４で最小リトライ回数判定手段１８はリトライ回
数がオーバーしたか否かを判断する。この場合、最小に
設定されたリトライ回数は、ＳＣＳＩコマンドに対する
モードパラメータに基づいて設定される。またこのリト
ライ回数をオーバーしたことの判定はステータス信号を
検出することにより行われる。この最小の回数とは、許
される実行時間の範囲内での最大の回数である。つま
り、許される実行時間を１回のリトライ実行時間で割っ
た値で例えば１回である。In step S3, the write error detecting means 1
7 determines whether or not there is an error. This error detection operation is performed on the hard disk 9 for a plurality of data by SC
This is done by detecting the status signal for the SI command. If there is an error in step S3, the minimum retry count determination means 18 determines in step S4 whether or not the retry count has exceeded. In this case, the minimum number of retries is set based on the mode parameter for the SCSI command. The determination that the number of retries has been exceeded is made by detecting the status signal. The minimum number of times is the maximum number of times within the allowable execution time range. In other words, it is a value obtained by dividing the allowable execution time by one retry execution time, for example, once.

【００４９】ステップＳ４でリトライ回数がオーバーし
たらステップＳ５でエラー登録制御部１２のエラーテー
ブル登録手段１９はエラー情報をエラーテーブル５に登
録する。この場合、エラーテーブル登録手段１９は、ス
テータス信号からエラーの発生したハードディスクの番
号およびエラーの発生したセクターアドレスをエラーテ
ーブルに記憶する。例えば、図５に示すように、データ
用ハードディスク９のＨＤＤ３のアドレス２がエラーで
あることを記録する。If the number of retries is exceeded in step S4, the error table registration means 19 of the error registration control unit 12 registers the error information in the error table 5 in step S5. In this case, the error table registration means 19 stores the number of the hard disk where the error occurred and the sector address where the error occurred from the status signal in the error table. For example, as shown in FIG. 5, it is recorded that the address 2 of the HDD 3 of the data hard disk 9 is in error.

【００５０】ステップＳ４でリトライ回数がオーバーし
ていないときはステップＳ２に戻って、ステップＳ２か
らステップＳ４までの処理および判断を繰り返す。ステ
ップＳ３でエラーがないときはステップＳ６でパリティ
書き込み手段１６はパリティ用ハードディスク１０にデ
ータ用ハードディスク９に書き込んだデータの排他的論
理和であるパリティを書く。この書き込み動作も、プロ
セッサー１から供給されるＳＣＳＩコマンドに基づいて
行われる。When the number of retries has not exceeded in step S4, the process returns to step S2, and the processes and determinations from step S2 to step S4 are repeated. If there is no error in step S3, the parity writing means 16 writes the parity which is the exclusive OR of the data written in the data hard disk 9 to the parity hard disk 10 in step S6. This write operation is also performed based on the SCSI command supplied from the processor 1.

【００５１】ステップＳ７で書き込みエラー検出手段１
７はエラーがあるか否かを判断する。このエラー検出動
作はステップＳ３と同様である。ステップＳ７でエラー
があるときはステップＳ８で最小リトライ回数判定手段
１８はリトライ回数がオーバーしたか否かを判断する。
最小リトライ回数およびリトライ回数をオーバーしたこ
との判定はステップＳ３と同様である。In step S7, the write error detecting means 1
7 determines whether or not there is an error. This error detection operation is the same as step S3. If there is an error in step S7, the minimum retry count determination means 18 determines in step S8 whether the retry count has exceeded.
The minimum retry count and the determination that the retry count has been exceeded are the same as in step S3.

【００５２】ステップＳ８でリトライ回数がオーバーし
たらステップＳ９でエラー登録制御部１２のエラーテー
ブル登録手段１９はエラー情報をエラーテーブル５に登
録する。エラー情報の内容はステップＳ４と同様であ
る。例えば、図５に示すように、パリティ用ハードディ
スク１０のＨＤＤＰのアドレス４がエラーであることを
記録する。ステップＳ８でリトライ回数がオーバーして
いないときはステップＳ６に戻って、ステップＳ６から
ステップＳ８までの処理および判断を繰り返す。そし
て、ステップＳ５およびステップＳ９でエラーテーブル
５に登録したら、ステップＳ１０で書ききみ動作が終了
か否かを判断する。ステップＳ６からステップＳ９はス
テップＳ２からステップＳ５と並行して実行される。ス
テップＳ１０で終了のときはへ進む。ステップＳ１０
で終了でないときはステップＳ１に戻ってステップＳ１
からステップＳ１０までの処理および判断を繰り返す。
ステップＳ１０の終了で一連の番組等の素材書き込みが
終了する。If the number of retries is exceeded in step S8, the error table registration means 19 of the error registration control unit 12 registers the error information in the error table 5 in step S9. The content of the error information is the same as in step S4. For example, as shown in FIG. 5, it is recorded that the address 4 of the HDDP of the parity hard disk 10 is in error. When the number of retries has not exceeded in step S8, the process returns to step S6, and the processes and determinations from step S6 to step S8 are repeated. Then, after registration in the error table 5 in steps S5 and S9, it is determined in step S10 whether or not the writing operation is completed. Steps S6 to S9 are executed in parallel with steps S2 to S5. When the process ends in step S10, the process proceeds to. Step S10
If not completed, the procedure returns to step S1 and returns to step S1.
The processes and determinations from to S10 are repeated.
At the end of step S10, the writing of materials such as a series of programs ends.

【００５３】以下に述べるステップＳ１１以降の処理
は、空き時間や通常の処理のバックグランド処理で実行
される。図７において、へ進むとステップＳ１１で交
替セクター制御部１３のエラーテーブル読み出し手段２
０はエラーテーブル５からエラーの発生したハードディ
スクのエラーの発生したセクターアドレスを読む。ステ
ップＳ１２で交替セクター制御部１３のエラーテーブル
読み出し手段２０はエラーテーブル５からエラーの発生
したハードディスクのドライブ番号を読む。The processing after step S11 described below is executed in the idle time or in the background processing of normal processing. Referring to FIG. 7, in step S11, the error table reading means 2 of the replacement sector control unit 13 is proceeded to.
0 reads the errored sector address of the hard disk where the error occurred from the error table 5. In step S12, the error table reading means 20 of the replacement sector control unit 13 reads the drive number of the hard disk in which the error occurred from the error table 5.

【００５４】ステップＳ１３でストライピングユニット
復元手段２１はエラーの発生したストライピングユニッ
トをパリティ演算に基づいて復元する。この場合、パリ
ティ用ハードディスク１０には、データ用ハードディス
ク９に書き込んだデータの排他的論理和が計算されて書
き込まれているので、エラーの発生した以外のデータ用
ハードディスク９に書き込んだデータとパリティ用ハー
ドディスク１０に書き込んだパリティとによりデータを
復元することができる。このようにして、パリティ用の
ハードディスク１０も含み、エラーの発生したハードデ
ィスク以外のハードディスクのデータからデータを再構
築する。In step S13, the striping unit restoring means 21 restores the striping unit in which the error has occurred based on the parity calculation. In this case, since the exclusive OR of the data written in the data hard disk 9 is calculated and written in the parity hard disk 10, the data written in the data hard disk 9 other than the one in which the error has occurred and the parity Data can be restored by the parity written in the hard disk 10. In this way, data is reconstructed from the data of the hard disks other than the hard disk in which the error has occurred, including the hard disk 10 for parity.

【００５５】ステップＳ１４で復元ストライピングユニ
ット書き込み手段２２は復元したストライピングユニッ
トをエラーの発生したハードディスクのエラーの発生し
たセクターアドレスに書く。この書き込み動作は、プロ
セッサー１から供給されるＳＣＳＩコマンドに基づいて
行われる。ステップＳ１５で書き込みエラー検出手段２
３はエラーの発生したハードディスクに対してデータ書
き込みの再度のエラーがあるか否かを判断する。このエ
ラー検出動作は、ＳＣＳＩコマンドに対するステータス
信号を検出することにより行われる。In step S14, the restoring striping unit writing means 22 writes the restored striping unit in the sector address where the error occurred in the hard disk where the error occurred. This write operation is performed based on the SCSI command supplied from the processor 1. In step S15, the write error detection means 2
3 determines whether or not there is another data writing error in the hard disk in which the error has occurred. This error detection operation is performed by detecting the status signal for the SCSI command.

【００５６】ステップＳ１５でエラーがあるときはステ
ップＳ１６で最大リトライ回数判定手段２４は書き込み
エラーに対して最大に設定されたリトライ回数がオーバ
ーしたか否かを判断する。リトライ回数の設定およびリ
トライ回数をオーバーしたことの判定はステップＳ４お
よびステップＳ８と同様であるが、ステップＳ４および
ステップＳ８と異なる点はリトライ回数を最大に設定し
た点である。If there is an error in step S15, the maximum retry count judging means 24 judges in step S16 whether or not the maximum retry count for the write error has been exceeded. The setting of the number of retries and the determination that the number of retries has been exceeded are the same as in steps S4 and S8, but the difference from steps S4 and S8 is that the number of retries is set to the maximum.

【００５７】ステップＳ１６でリトライ回数がオーバー
したらステップＳ１７で交替セクター割り当て手段２５
は書き込みエラーの発生したセクターを欠陥セクターと
して処理して交替セクターを割り当てる。例えば、図４
に示すように、データ用ハードディスク９のＨＤＤ３の
アドレス２を欠陥セクター４２として処理し、アドレス
ｎを交替セクター４３として割り当てる。また、パリテ
ィ用ハードディスク１０のＨＤＤＰのアドレス４を欠陥
セクター４４として処理し、アドレスｋを交替セクター
４５として割り当てる。If the number of retries is exceeded in step S16, the replacement sector allocation means 25 is operated in step S17.
Treats a sector in which a write error has occurred as a defective sector and assigns a replacement sector. For example, FIG.
As shown in, the address 2 of the HDD 3 of the data hard disk 9 is processed as the defective sector 42, and the address n is assigned as the replacement sector 43. Further, the address 4 of the HDDP of the parity hard disk 10 is processed as the defective sector 44, and the address k is allocated as the replacement sector 45.

【００５８】そして、ステップＳ１４に戻って、ステッ
プＳ１４からステップＳ１７までの判断および処理を繰
り返す。ステップＳ１４からステップＳ１７の処理はハ
ードディスクにエラー回復のために十分なリトライ回数
を指定し、自動交替処理を許可したモードで、ライトコ
マンドを発行することにより実現される。ステップＳ１
５でエラーがないときはステップＳ１８でエラーテーブ
ル５のすべてのセクターについてエラー回復処理が終了
か否かを判断する。ステップＳ１８でエラーテーブルが
終了のときは書き込み動作を終了する。ステップＳ１８
でエラーテーブル５が終了でないときはステップＳ１１
に戻って、ステップＳ１１からステップＳ１８までの判
断および処理を繰り返す。Then, returning to step S14, the determination and processing from step S14 to step S17 are repeated. The processing from step S14 to step S17 is realized by designating the hard disk with a sufficient number of retries for error recovery and issuing a write command in a mode in which the automatic replacement processing is permitted. Step S1
If there is no error in step 5, it is determined in step S18 whether the error recovery process is completed for all sectors in the error table 5. When the error table is finished in step S18, the write operation is finished. Step S18
If the error table 5 is not finished in step S11, step S11
Returning to step S11, the judgment and processing from step S11 to step S18 are repeated.

【００５９】次に、図８および図９を参照しながら読み
出し動作を説明する。図８および図９に示したフローチ
ャートは図３に示した読み出し制御部の動作に対応す
る。ホストコンピュータから読み出し要求があると、図
１に示したプロセッサー１にコマンドが供給される。プ
ロセッサー１は供給されたコマンドをＳＣＳＩコマンド
に変換する。プロセッサー１はこのように変換処理され
たＳＣＳＩコマンドをＲＡＩＤコントローラ２に供給す
る。Next, the read operation will be described with reference to FIGS. The flowcharts shown in FIGS. 8 and 9 correspond to the operation of the read control unit shown in FIG. When a read request is issued from the host computer, a command is supplied to the processor 1 shown in FIG. The processor 1 converts the supplied command into a SCSI command. The processor 1 supplies the SCSI command thus converted to the RAID controller 2.

【００６０】ＲＡＩＤコントローラ２は映像データを読
み出すためのファイルシステムを生成する。つまり、ど
の映像データのどのブロックサイズをどのブロックアド
レスから転送するかというファイルシステムを作る。こ
のようなＲＡＩＤコントローラ２の読み出し動作を図３
に示した読み出し制御部の機能ブロック図に対応させな
がら説明する。The RAID controller 2 creates a file system for reading video data. In other words, a file system is created for which block size of which video data is transferred from which block address. Such a read operation of the RAID controller 2 is shown in FIG.
The description will be made in correspondence with the functional block diagram of the read control unit shown in FIG.

【００６１】図８において、読み出し動作を開始して、
ステップＳ１９でリトライ制御部２６のストライピング
ユニット読み出し手段２９は複数のデータ用のハードデ
ィスク９に分散蓄積されたストライピングユニットの読
み出し動作を行う。この読み出し動作は、プロセッサー
１から供給されるＳＣＳＩコマンドに基づいて行われ
る。ステップＳ２０で読み出しエラー検出手段３０は複
数のデータ用のハードディスク９に対するデータ読み出
しにエラーがあるか否かを判断する。このエラー検出動
作は、ＳＣＳＩコマンドに対するステータス信号を検出
することにより行われる。ステップＳ２１で最小リトラ
イ回数判定手段３１は、読み出しエラーに対して最小に
設定されたリトライ回数をオーバーしたか否かを判断す
る。最小に設定されたリトライ回数は、ＳＣＳＩコマン
ドに対するモードパラメータに基づいて設定される。こ
の最小の回数とは、許される実行時間の範囲内での最大
の回数である。またこのリトライ回数をオーバーしたこ
との判定はモードパラメータおよびステータス信号を検
出することにより行われる。In FIG. 8, after starting the read operation,
In step S19, the striping unit reading means 29 of the retry control unit 26 performs the reading operation of the striping units dispersedly accumulated in the plurality of data hard disks 9. This read operation is performed based on the SCSI command supplied from the processor 1. In step S20, the read error detection means 30 determines whether or not there is an error in reading data from the hard disks 9 for a plurality of data. This error detection operation is performed by detecting the status signal for the SCSI command. In step S21, the minimum retry count determination means 31 determines whether or not the read error has exceeded the minimum retry count. The minimum number of retries is set based on the mode parameter for the SCSI command. The minimum number of times is the maximum number of times within the allowable execution time range. The determination that the number of retries has been exceeded is made by detecting the mode parameter and the status signal.

【００６２】ステップＳ２１で最小リトライ回数をオー
バーしていないときは、ステップＳ１９に戻って、ステ
ップＳ１９からステップＳ２１までの処理および判断を
繰り返す。ステップＳ２１で最小リトライ回数をオーバ
ーしているときは、ステップＳ２２でエラー登録制御部
２７のエラーテーブル登録手段３２はステータス信号に
基づいてエラーの発生したハードディスクの番号および
エラーの発生したセクターアドレスをエラーテーブル５
に記憶する。ステップＳ２３でストライピングユニット
復元手段３３はエラーの発生したストライピングユニッ
トをパリティ演算に基づいて復元する。If the minimum number of retries has not been exceeded in step S21, the process returns to step S19, and the processes and determinations from step S19 to step S21 are repeated. If the minimum number of retries has been exceeded in step S21, the error table registration means 32 of the error registration control unit 27 determines the hard disk number in which the error occurred and the sector address in which the error occurred in step S22 based on the status signal. Table 5
To memorize. In step S23, the striping unit restoring means 33 restores the striping unit in which the error has occurred based on the parity calculation.

【００６３】ステップＳ２４でデータ合成手段３４はス
トライピングユニットに分割されていたデータから元の
データを合成する。この映像データはプロセッサー１に
供給される。この映像データはディジタルでＤ１あるい
はＤ２フォーマットでフォーマッティングされている。
また、プロセッサー１が映像データをＭＰＥＧ２により
圧縮処理している場合には、プロセッサー１は映像デー
タを伸張処理する。これにより、合成された映像データ
はプロセッサー１を介して送出される。In step S24, the data synthesizing means 34 synthesizes the original data from the data divided into striping units. This video data is supplied to the processor 1. This video data is digitally formatted in the D1 or D2 format.
Further, when the processor 1 compresses the video data by MPEG2, the processor 1 expands the video data. As a result, the combined video data is sent out via the processor 1.

【００６４】ステップＳ２０でエラーがなかったときは
リトライを行わないのでステップＳ２４に直接進む。ス
テップＳ２５で読み出し動作が終了したか否かを判断す
る。ステップＳ２５で読み出し動作が終了していないと
きはステップＳ１９に戻って、ステップＳ１９からステ
ップＳ２５までの判断および処理を繰り返す。ステップ
Ｓ２５で読み出し動作が終了したときはへ進む。ステ
ップＳ２５の終了で一連の番組等の素材読み出しが終了
する。If there is no error in step S20, the retry is not performed, so the process directly proceeds to step S24. In step S25, it is determined whether the read operation is completed. When the read operation is not completed in step S25, the process returns to step S19 to repeat the determination and processing from step S19 to step S25. When the read operation is completed in step S25, the process proceeds to. At the end of step S25, the material reading of a series of programs is completed.

【００６５】以下に述べるステップＳ２６以降の処理
は、空き時間や通常の処理のバックグランド処理で実行
される。ステップＳ２６で交替セクター制御部２８のエ
ラーテーブル読み出し手段３５は、エラーテーブル５か
らエラーの発生したハードディスクのエラーの発生した
セクターアドレスを読み出す。ステップＳ２７でエラー
テーブル読み出し手段３５６は、エラーテーブル５から
エラーの発生したハードディスクの番号を読み出す。ス
テップＳ２８でストライピングユニット読み出し手段３
６は、エラーの発生したハードディスクに対するデータ
の再読み出しを行う。ステップＳ２９で読み出しエラー
検出手段３７は、エラーの発生したハードディスクに対
するデータ読み出しにエラーがあるか否かを判断する。
このエラー検出動作は、ＳＣＳＩコマンドに対するステ
ータス信号を検出することにより行われる。The processing after step S26 described below is executed in the idle time or in the background processing of normal processing. In step S26, the error table reading means 35 of the alternate sector control unit 28 reads the errored sector address of the errored hard disk from the error table 5. In step S27, the error table reading means 356 reads from the error table 5 the number of the hard disk in which the error occurred. In step S28, the striping unit reading means 3
6 rereads the data from the hard disk in which the error occurred. In step S29, the read error detection means 37 determines whether or not there is an error in reading data from the hard disk in which the error has occurred.
This error detection operation is performed by detecting the status signal for the SCSI command.

【００６６】ステップＳ３０で最大リトライ回数判定手
段３８は、読み出しエラーに対して最大に設定されたリ
トライ回数をオーバーしたか否かを判断する。ステップ
Ｓ３０で最大リトライ回数をオーバーしていないときは
ステップＳ２８に戻って、ステップＳ２８からステップ
Ｓ３０までの処理および判断を繰り返す。このようなス
テップＳ２８からステップＳ３０までの処理および判断
は、パリティ用ハードディスク１０も含みエラーの起き
たハードディスクのセクタに対して、エラー回復に十分
なリトライ回数を指定してリードコマンドを発行するこ
とにより実現される。In step S30, the maximum retry count determination means 38 determines whether or not the maximum retry count for a read error has been exceeded. When the maximum number of retries has not been exceeded in step S30, the process returns to step S28, and the processes and determinations from step S28 to step S30 are repeated. The processes and determinations from step S28 to step S30 are performed by issuing a read command to the sector of the hard disk including the parity hard disk 10 in which an error has occurred by designating a sufficient number of retries for error recovery. Will be realized.

【００６７】リトライ回数の設定およびリトライ回数を
オーバーしたことの判定はステップＳ２１と同様である
が、ステップＳ２１と異なる点はリトライ回数を最大に
設定した点である。The setting of the number of retries and the determination that the number of retries has been exceeded are the same as in step S21, but the point different from step S21 is that the number of retries is set to the maximum.

【００６８】ステップＳ３０で最大リトライ回数をオー
バーしたときはステップＳ３１でストライピングユニッ
ト復元手段３９は、エラーの発生したストライピングユ
ニットをパリティ演算に基づいて復元する。この場合、
パリティ用ハードディスク１０も含み、エラーの発生し
たハードディスク以外のハードディスクのデータから、
ストライピングユニットを復元する。When the maximum number of retries is exceeded in step S30, the striping unit restoring means 39 restores the striping unit in which the error has occurred based on the parity operation in step S31. in this case,
From the data of the hard disk other than the hard disk where the error occurred, including the parity hard disk 10,
Restore the striping unit.

【００６９】ステップＳ３２で交替セクター割り当て手
段４０は、書き込みエラーの発生したセクターを欠陥セ
クターとして処理し、交替セクターを割り当てる。ステ
ップＳ３３で復元ストライピングユニット書き込み手段
４１は、復元したストライピングユニットをエラーの発
生したハードディスクのエラーの発生したセクターアド
レスに書き込む。この書き込み動作は、プロセッサー１
から供給されるＳＣＳＩコマンドに基づいて行われる。
このようなステップＳ３２およびステップＳ３３の処理
は、リードエラーの発生したハードディスクにエラー回
復に十分なリトライ回数を指定し、自動交替処理を許可
したモードで、ライトコマンドを発行することにより実
行される。In step S32, the replacement sector allocation means 40 processes the sector in which the write error has occurred as a defective sector and allocates a replacement sector. In step S33, the restoration striping unit writing means 41 writes the restored striping unit in the sector address where the error occurred in the hard disk where the error occurred. This writing operation is performed by the processor 1
It is performed based on the SCSI command supplied from.
The processes of steps S32 and S33 are executed by designating the hard disk in which the read error has occurred with a sufficient number of retries for error recovery and issuing a write command in a mode in which the automatic replacement process is permitted.

【００７０】このあと、ステップＳ２８に戻って、ステ
ップＳ２８からステップＳ３３までの処理および判断を
繰り返す。そして、ステップＳ２９でエラーがないとき
にステップＳ３４に進み、エラーテーブル５のすべての
セクターについてエラー回復処理が終了か否かを判断す
る。ステップＳ３４でエラーテーブル５が終了のときは
読み出し動作を終了する。ステップ３４でエラーテーブ
ル５が終了でないときはステップＳ２６に戻って、ステ
ップＳ２６からステップＳ３４までの判断および処理を
繰り返す。After that, the process returns to step S28, and the processes and judgments from step S28 to step S33 are repeated. Then, when there is no error in step S29, the process proceeds to step S34, and it is determined whether the error recovery process is completed for all the sectors of the error table 5. When the error table 5 is finished in step S34, the read operation is finished. When the error table 5 is not completed in step 34, the process returns to step S26, and the determination and processing from step S26 to step S34 are repeated.

【００７１】上例によれば、この記憶媒体の欠陥処理装
置は、複数のデータ用ハードディスク９にデータを書き
込むと共に、他のパリティ用ハードディスク１０に上記
データのパリティを書き込み、上記複数のデータ用ハー
ドディスク９及び上記他のパリティ用ハードディスク１
０にデータを書き込むときに書き込みエラーが発生した
ら、最小のリトライ回数でリトライをするリトライ制御
部１１と、上記最小のリトライ回数で回復不可能なデー
タに関する情報をエラーとしてエラーテーブル５に記憶
するエラー登録制御部１２と、上記エラーテーブル５に
登録されたエラー情報を読み出し、エラーの発生したハ
ードディスク以外の上記複数のデータ用ハードディスク
９及び上記他のパリティ用ハードディスク１０のデータ
からエラーの発生したデータを復元し、最大のリトライ
回数でリトライをし、上記最大のリトライ回数で回復不
可能なデータに対して交替セクターを割り当てる交替セ
クター制御部１３とを備えたので、最小のリトライ回数
で回復不可能なデータに関する情報をエラーとしてエラ
ーテーブル５に記憶しておいて、別の余裕のある時間に
エラーの発生したデータを復元して最大のリトライ回数
でリトライをしても回復不可能なデータに対してのみ交
替セクターを割り当てるように処理を分けることがで
き、これにより、データ書き込み時において、映像デー
タや音声データなどのリアルタイム性が要求されるデー
タのアクセス中に書き込みエラーが発生しても、エラー
回復のための時間を最小限に抑えることにより、リアル
タイム性を損なうこと無くアクセスをすることができる
という効果を奏する。According to the above example, the defect processing apparatus for this storage medium writes the data to the plurality of data hard disks 9 and at the same time writes the parity of the above data to the other parity hard disks 10 to write the above plurality of data hard disks. 9 and other parity hard disk 1
If a write error occurs when writing data to 0, a retry control unit 11 that retries with the minimum number of retries, and an error that stores information about unrecoverable data with the above minimum number of retries in the error table 5 as an error The error information registered in the error table 5 is read out from the registration control unit 12, and the error data is extracted from the data of the plurality of data hard disks 9 other than the error hard disk and the other parity hard disk 10. Since it is provided with the replacement sector control unit 13 that restores data, retries with the maximum number of retries, and allocates replacement sectors to data that cannot be recovered with the maximum number of retries, recovery cannot be performed with the minimum number of retries. Information related to data is recorded in error table 5 as an error. Then, divide the processing so that the data in which the error occurred is restored in another spare time and the replacement sector is assigned only to the data that cannot be recovered even if it is retried with the maximum number of retries. By doing this, even if a write error occurs during access of data such as video data and audio data that requires real-time characteristics during data writing, the time for error recovery can be minimized. Thus, it is possible to access without deteriorating the real-time property.

【００７２】また、上例によれば、この記憶媒体の欠陥
処理装置は、複数のデータ用ハードディスク９にデータ
が書き込まれると共に、他のパリティ用ハードディスク
１０に上記データのパリティが書き込まれ、上記複数の
データ用ハードディスク９及び上記他のパリティ用ハー
ドディスク１０に書き込まれたデータを読み出すときに
読み出しエラーが発生したら、最小のリトライ回数でリ
トライをするリトライ制御部２６と、上記最小のリトラ
イ回数で回復不可能なデータに関する情報をエラーとし
てエラーテーブル５に記憶するエラー登録制御部２７
と、上記エラーテーブル５に登録されたエラー情報を読
み出し、エラーの発生したデータ用ハードディスク９及
び上記他のパリティ用ハードディスク１０のデータを読
み出すように、最大のリトライ回数でリトライをし、上
記最大のリトライ回数で回復不可能なデータに対してエ
ラーの発生したハードディスク以外の上記複数のデータ
用ハードディスク９及び上記他のパリティ用ハードディ
スク１０のデータからエラーの発生したデータを復元
し、交替セクターを割り当てる交替セクター制御部２８
とを備えたので、最小のリトライ回数で回復不可能なデ
ータに関する情報をエラーとしてエラーテーブル５に記
憶しておいて、別の余裕のある時間に最大のリトライ回
数でリトライをしても回復不可能なデータに対してのみ
エラーの発生したデータを復元して交替セクターを割り
当てるように処理を分けることができ、これにより、デ
ータ読み出し時において、映像データや音声データなど
のリアルタイム性が要求されるデータのアクセス中に読
み出しエラーが発生しても、エラー回復のための時間を
最小限に抑えることにより、リアルタイム性を損なうこ
と無くアクセスをすることができるという効果を奏す
る。Further, according to the above example, in the defect processing apparatus for this storage medium, the data is written to the plurality of data hard disks 9 and the parity of the data is written to the other parity hard disks 10, and the plurality of data hard disks 9 are written. If a read error occurs when reading the data written in the data hard disk 9 and the other parity hard disk 10, the retry control unit 26 that retries with the minimum number of retries, and the recovery failure with the minimum number of retries. An error registration control unit 27 that stores information about possible data in the error table 5 as an error.
Then, the error information registered in the error table 5 is read out, and the data is read from the data hard disk 9 and the other parity hard disk 10 in which the error has occurred. Replacement of allocating a replacement sector by restoring error data from the data of the plurality of data hard disks 9 and the other parity hard disk 10 other than the hard disk in which an error has occurred for data that cannot be recovered by the number of retries Sector control unit 28
Therefore, even if information about unrecoverable data is stored in the error table 5 as an error with the minimum number of retries, and recovery is performed with the maximum number of retries at another spare time, recovery is not possible. The processing can be divided so that the data in which an error has occurred is restored only to the possible data and the replacement sector is assigned, which requires real-time processing of video data, audio data, etc. when reading data. Even if a read error occurs during data access, by minimizing the time for error recovery, it is possible to access without deteriorating the real-time property.

【００７３】上例では、読み出し動作のステップＳ１９
からステップＳ２３において、読み出しエラーが発生し
た後に、パリティを演算する例を示したが、読み出しな
がらパリティを演算しても良い。また、上例の書き込み
動作のステップＳ１１からステップＳ１８、読み出し動
作のステップＳ２６からステップＳ３４の処理および判
断は、予め、サービス中断時や夜間等にタイムスロット
を設けて、この時間に実行しても良い。In the above example, step S19 of the read operation is performed.
From step S23 to S23, the example in which the parity is calculated after the read error occurs is described, but the parity may be calculated while reading. Further, the processes and judgments of steps S11 to S18 of the write operation and steps S26 to S34 of the read operation in the above example may be executed at this time by providing a time slot at the time of service interruption or at night. good.

【００７４】[0074]

【発明の効果】この発明の記憶媒体の欠陥処理装置は、
複数の記憶媒体にデータを書き込むと共に、他の記憶媒
体に上記データのパリティを書き込み、上記複数の記憶
媒体及び上記他の記憶媒体にデータを書き込むときに書
き込みエラーが発生したら、最小のリトライ回数でリト
ライをするリトライ制御部と、上記最小のリトライ回数
で回復不可能なデータに関する情報をエラーとしてエラ
ーテーブルに記憶するエラー登録制御部と、上記エラー
テーブルに登録されたエラー情報を読み出し、エラーの
発生した記憶媒体以外の上記複数の記憶媒体及び上記他
の記憶媒体のデータからエラーの発生したデータを復元
し、最大のリトライ回数でリトライをし、上記最大のリ
トライ回数で回復不可能なデータに対して交替セクター
を割り当てる交替セクター制御部とを備えたので、最小
のリトライ回数で回復不可能なデータに関する情報をエ
ラーとしてエラーテーブルに記憶しておいて、別の余裕
のある時間にエラーの発生したデータを復元して最大の
リトライ回数でリトライをしても回復不可能なデータに
対してのみ交替セクターを割り当てるように処理を分け
ることができ、これにより、データ書き込み時におい
て、映像データや音声データなどのリアルタイム性が要
求されるデータのアクセス中に書き込みエラーが発生し
ても、エラー回復のための時間を最小限に抑えることに
より、リアルタイム性を損なうこと無くアクセスをする
ことができるという効果を奏する。The storage medium defect processing apparatus of the present invention is
When writing data to multiple storage media, writing parity of the above data to other storage media, and writing error when writing data to the above multiple storage media and the above other storage media, the minimum retry count A retry controller that performs retries, an error registration controller that stores information about unrecoverable data in the error table as an error in the minimum number of retries, and error information registered in the error table, and an error occurs The data in which the error occurred is restored from the data of the above-mentioned multiple storage media other than the above-mentioned storage media and the data of the above-mentioned other storage media, and the retry is performed with the maximum number of retries. It has a replacement sector control unit that allocates replacement sectors with a minimum number of retries. Data that cannot be recovered even if information about unrecoverable data is stored as an error in the error table, and the data in which the error occurred is restored at another spare time and retries are performed with the maximum number of retries. It is possible to divide the processing so that the replacement sector is assigned only to, even if a write error occurs during the access of data such as video data and audio data that requires real-time processing, when writing data. By minimizing the time for error recovery, it is possible to access without deteriorating the real-time property.

【００７５】また、この発明の記憶媒体の欠陥処理装置
は、複数の記憶媒体にデータが書き込まれると共に、他
の記憶媒体に上記データのパリティが書き込まれ、上記
複数の記憶媒体及び上記他の記憶媒体に書き込まれたデ
ータを読み出すときに読み出しエラーが発生したら、最
小のリトライ回数でリトライをするリトライ制御部と、
上記最小のリトライ回数で回復不可能なデータに関する
情報をエラーとしてエラーテーブルに記憶するエラー登
録制御部と、上記エラーテーブルに登録されたエラー情
報を読み出し、エラーの発生した記憶媒体及び上記他の
記憶媒体のデータを読み出すように、最大のリトライ回
数でリトライをし、上記最大のリトライ回数で回復不可
能なデータに対してエラーの発生した記憶媒体以外の上
記複数の記憶媒体及び上記他の記憶媒体のデータからエ
ラーの発生したデータを復元し、交替セクターを割り当
てる交替セクター制御部とを備えたので、最小のリトラ
イ回数で回復不可能なデータに関する情報をエラーとし
てエラーテーブルに記憶しておいて、別の余裕のある時
間に最大のリトライ回数でリトライをしても回復不可能
なデータに対してのみエラーの発生したデータを復元し
て交替セクターを割り当てるように処理を分けることが
でき、これにより、データ読み出し時において、映像デ
ータや音声データなどのリアルタイム性が要求されるデ
ータのアクセス中に読み出しエラーが発生しても、エラ
ー回復のための時間を最小限に抑えることにより、リア
ルタイム性を損なうこと無くアクセスをすることができ
るという効果を奏する。Further, in the storage medium defect processing apparatus of the present invention, the data is written in a plurality of storage media and the parity of the data is written in another storage medium, and the plurality of storage media and the other storages are stored. If a read error occurs when reading the data written to the medium, a retry control unit that retries with the minimum number of retries,
An error registration control unit that stores information on unrecoverable data as an error in the error table with the minimum number of retries, and the error information registered in the error table to read the storage medium in which the error occurred and the other storages. A plurality of storage media other than the storage medium in which an error occurs with respect to the data which cannot be recovered by the maximum number of retries by performing a retry with the maximum number of retries so as to read the data of the medium and the other storage medium. Since it has an alternate sector control unit that restores the data in which an error has occurred from the data of No. 1 and assigns the alternate sector, information about unrecoverable data with the minimum number of retries is stored as an error in the error table, For data that cannot be recovered even if you retry with the maximum number of retries in another spare time It is possible to divide the processing so that the data in which the error occurred is restored and the replacement sector is allocated. This allows the data to be read during the access of data such as video data and audio data that requires real-time processing. Even if an error occurs, by minimizing the time for error recovery, it is possible to access without deteriorating the real-time property.

[Brief description of drawings]

【図１】本実施例の記憶媒体の欠陥処理装置の構成を示
すブロック図である。FIG. 1 is a block diagram showing the configuration of a defect processing apparatus for a storage medium according to the present embodiment.

【図２】本実施例の記憶媒体の欠陥処理装置の書き込み
制御部の機能ブロック図である。FIG. 2 is a functional block diagram of a write control unit of the defect processing device for a storage medium according to the present embodiment.

【図３】本実施例の記憶媒体の欠陥処理装置の読み出し
制御部の機能ブロック図である。FIG. 3 is a functional block diagram of a read control unit of the storage medium defect processing apparatus of this embodiment.

【図４】本実施例の記憶媒体の欠陥処理装置の交替セク
ターを説明する図である。FIG. 4 is a diagram illustrating a replacement sector of the defect processing apparatus for the storage medium according to the present embodiment.

【図５】本実施例の記憶媒体の欠陥処理装置のエラーテ
ーブルを説明する図である。FIG. 5 is a diagram illustrating an error table of the storage medium defect processing apparatus according to the present embodiment.

【図６】本実施例の記憶媒体の欠陥処理装置の書き込み
動作を示すフローチャートである。FIG. 6 is a flowchart showing the write operation of the defect processing apparatus for a storage medium of the present embodiment.

【図７】本実施例の記憶媒体の欠陥処理装置の書き込み
動作を示すフローチャートである。FIG. 7 is a flowchart showing a write operation of the storage medium defect processing apparatus of the present embodiment.

【図８】本実施例の記憶媒体の欠陥処理装置の読み出し
動作を示すフローチャートである。FIG. 8 is a flowchart showing a read operation of the defect processing apparatus for a storage medium according to the present embodiment.

【図９】本実施例の記憶媒体の欠陥処理装置の読み出し
動作を示すフローチャートである。FIG. 9 is a flowchart showing a read operation of the defect processing device for a storage medium according to the present embodiment.

[Explanation of symbols]

１プロセッサー、２ＲＡＩＤコントローラ、３Ｃ
ＰＵ、４ＲＯＭ、５エラーテーブル、６ＲＡＭ、７
書き込み制御部、８読み出し制御部、９データ用ハ
ードディスク、１０パリティ用ハードディスク、１１
リトライ制御部、１２エラー登録制御部、１３交
替セクター制御部、１４データ分割手段、１５スト
ライピングユニット書き込み手段、１６パリティ書き
込み手段、１７書き込みエラー検出手段、１８最小
リトライ回数判定手段、１９エラーテーブル登録手段、
２０エラーテーブル読み出し手段、２１ストライピ
ングユニット復元手段、２２復元ストライピングユニ
ット書き込み手段、２３書き込みエラー検出手段、２
４最大リトライ回数判定手段、２５交替セクター割
り当て手段、２６リトライ制御部、２７エラー登録
制御部、２８交替セクター制御部、２９ストライピン
グユニット読み出し手段、３０読み出しエラー検出手
段、３１最小リトライ回数判定手段、３２エラーテ
ーブル登録手段、３３ストライピングユニット復元手
段、３４データ合成手段、３５エラーテーブル読み
出し手段、３６ストライピングユニット読み出し手
段、３７読み出しエラー検出手段、３８最大リトラ
イ回数判定手段、３９ストライピングユニット復元手
段、４０交替セクター割り当て手段、４１復元スト
ライピングユニット書き込み手段、４２欠陥セクタ
ー、４３交替セクター、４４欠陥セクター、４５
交替セクター1 processor, 2 RAID controller, 3 C
PU, 4 ROM, 5 error table, 6 RAM, 7
Write controller, 8 read controller, 9 data hard disk, 10 parity hard disk, 11
Retry control section, 12 error registration control section, 13 alternate sector control section, 14 data dividing section, 15 striping unit writing section, 16 parity writing section, 17 write error detecting section, 18 minimum retry count determining section, 19 error table registering section ,
20 error table reading means, 21 striping unit restoring means, 22 restoring striping unit writing means, 23 write error detecting means, 2
4 maximum retry count determination means, 25 replacement sector allocation means, 26 retry control section, 27 error registration control section, 28 replacement sector control section, 29 striping unit read means, 30 read error detection means, 31 minimum retry count determination means, 32 Error table registering means, 33 striping unit restoring means, 34 data synthesizing means, 35 error table reading means, 36 striping unit reading means, 37 read error detecting means, 38 maximum retry number judging means, 39 striping unit restoring means, 40 replacement sector Assigning means, 41 restoring striping unit writing means, 42 defective sectors, 43 replacement sectors, 44 defective sectors, 45
Replacement sector

Claims

[Claims]

1. Writing data to a plurality of storage media and writing parity of the data to other storage media,
If a write error occurs when writing data to the plurality of storage media and the other storage media, a retry control unit that performs a retry with the minimum number of retries, and information regarding unrecoverable data with the minimum number of retries are provided. An error registration control unit that stores an error in the error table, and error information registered in the error table is read, and an error occurs from the data of the plurality of storage media other than the storage medium in which the error has occurred and the data of the other storage medium. Restore the data you have made, retry with the maximum number of retries,
A defect processing apparatus for a storage medium, comprising: a replacement sector control unit that allocates a replacement sector to data that cannot be recovered with the maximum number of retries.

2. Data is written to a plurality of storage media, parity of the data is written to another storage medium, and read when reading the data written to the plurality of storage media and the other storage medium. When an error occurs, a retry control unit that retries with the minimum number of retries, an error registration control unit that stores information about unrecoverable data with the minimum number of retries in an error table as an error, and register with the above error table The error information is read and the data in the storage medium in which the error has occurred and the data in the other storage medium is read, and the retry is performed with the maximum number of retries. Error from the data of the plurality of storage media other than the storage media in which the No data restores, and a replacement sector controller for allocating a replacement sector, the defect processing unit of the storage medium, characterized in that.