JPH0354625A

JPH0354625A - Cluster type magnetic disk device

Info

Publication number: JPH0354625A
Application number: JP1188772A
Authority: JP
Inventors: Noriyuki Kaneoka; 則幸兼岡; Mitsuo Oyama; 大山　光男; Kanji Kato; 加藤　寛次; Hisamitsu Kawaguchi; 川口　久光; Hiromichi Fujisawa; 藤沢　浩道
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1989-07-24
Filing date: 1989-07-24
Publication date: 1991-03-08

Abstract

PURPOSE:To input/output plural files successively by satisfying fast transfer speed by requested by host equipment, and comprising the device of the minimum number of magnetic disk devices corresponding to the capacity of a data base to store data. CONSTITUTION:The device is comprised of (n) data memory devices 15 having the magnetic disk device 1, an input/output buffer 3 with the capacity of one cylinder of the magnetic disk device 1 connected to each of the memory devices, respectively, and a multi-disk controller 4 which controls them. Assuming data transfer speed from the magnetic disk device to the input/output buffer as tByte/sec, and the rotating speed of the magnetic disk device as Rrps, the device is comprised of the minimum number of data memory devices which satisfies conditions I and II as equation II in the case of being less than equation I when the minimum seek time of the magnetic disk device exceeds a time M/ Tsec to transfer the data of MByte of one input/output buffer to the host equipment. In such a way, it is possible to input/output the plural files large in memory capacity successively and at high speed in spite of the size of the file.

Description

[Detailed description of the invention] [Industrial application field]

本発明は外部記憶装置において、記憶容量が大きく，短
時間の書き込み，読み出しが可能な集合型磁気ディスク
装置、ならびに、複数件のファイルの連続書き込み，読
み出しに適した集合型磁気ディスク装置、および，集合
型磁気ディスク装置のデータ格納構造に関する。The present invention relates to an external storage device, including a collective magnetic disk device with a large storage capacity and capable of writing and reading in a short time, a collective magnetic disk device suitable for continuous writing and reading of multiple files, and This invention relates to a data storage structure of a collective magnetic disk device.

[Conventional technology]

近年、文献情報や特許情報などの２次情報（書誌情報）
のみならず、１次情報（本文）をも含む大規模データベ
ース・サービスの重要性が増している．このようなデー
タベースの情報検索では、従来、キーワードや分類コー
ドによる方法が用いられてきている．しかし、この方法
では数十件から数百件までにしか絞り込めないため、検
索者が最終段階で直接本文を読んで内容を確認しなけれ
ばならないという効率上の問題がある。また、分類体系
自体が年月と共に変化するため、常にキーワードや分類
コードを更新しなければならないという問題も生じてく
る。更に、キーワード付け（インデキシングと言う）に
は時間がかかるため新たな文書はバッチ処理によりかな
りの量をまとめて登録する．そのため、検索できる登録
された情報は常に一定期間の遅れを持つという問題があ
った。これらの問題に対処する一つの方法として、検索者が自
由なキーワードに基づいて文書の本文を直接参照して内
容を検索できる全文検索システムが考えられている。一方、このような全文検索システムを目指した文字列検
索装置がいくつか提案されている。その代表的な構戊を
第２図に示し、まず、その内容について説明する。文字列検索装置１０１において、検索制御手段１０２は
検索装置全体の制御と、ホストコンピュータから送られ
てくる検索要求を受け付け、これを解析し、文字列照合
手段１０５と複合条件判別千段１０４へ検索情報として
送出する。また、検索制御手段１−０２はディスク制御
手段１０３を制御して、文字列記憶手段１０６に格納さ
れる文字列データを文字列照合手段へ送り込む。文字列照合手段１０５は入力文字データの中に検索要求
に合致するものがあるかどうかを調べ、もし該当するも
のがあれば、文字列を識別する情報を複合条件判別手段
１．　０　４へ出力する。複合条件判別手段１０４は該
文字列識別情報に基づいて検索要求中に指定された相互
の位置関係などの複合条件が満足するか否かを調べる。複合条件が潤足する場合には、該当する文書へのポイン
タ情報や文書内容のテキストデータを検索結果としてホ
ストコンピュータへ返送する。文字列検索装置１．　０　１の構戊要素である文字列記
憶手段１０６として大規模なデータの記憶ができる磁気
ディスク装置が必要となる。一般の磁気ディスク装置は
データの入出力が高速にできない問題があり、また、デ
ータの入出力が高速にできるマルチヘッド型の磁気ディ
スク装置は非常に高価であるという問題があった。そこで、安価な一般の小型磁気ディスクを複数台接続し
てデータの入出力の速度を高速化する集合型の磁気ディ
スク装置が考えられてきた。そのひとつとして特開昭６
　０　−　１．　１　７　３　２　６号公報記載の「画
像データ分割記憶装置Ｊがある。この装置は複数台の磁気ディスク装置を有し、磁気ディ
スク装置と同数の磁気ディスクコントローラ，入出力バ
ッファと外部装置との間のデータ転送を制御するマスク
コントローラによって構威し，外部装置から入力したデ
ータをマスタコントローラにおいて、入出力バソファの
容量以下に分割し、その分割したデータを各磁気ディス
クコントローラに順次転送し、該磁気ディスクコントロ
ーラは対応する磁気ディスク装置に書き込む。マスクコ
ントローラは書き込みを行なっていない磁気ディスク装
置の磁気ディスクコントローラに対し、シーク動作を行
なわせることによって、データを格納する複数の磁気デ
ィスク装置の２台目以降の、シーク時間を見掛け上なく
し、データの書き込み，読み出し時間を短縮しようとす
るものである。（発明が解決しようとする課題）文字列検索装置の文字列記憶手段で重嬰となる要素は、
記憶容量が大きいこと，ファイルのサイズにかかわらず
、複数のファイルを連続的に高速で入出力できること、
安価であることの３点であり、これらの要素を満足する
集合型磁気ディスク装置の開発が必要であった。従来技術では、ただシーク時間のアクセス時間を見掛け
」二なくすことにより、データの書き込み，読み出し時
間を短縮しようとするもので、外部機器の要求するデー
タ転送速度に対して何台の磁気ディスク装置を用いて構
成すれば良いかについて配慮されておらずコストパフォ
ーマンスの点で問題があった。また、従来技術は画像データのようにデータサイズの大
きなファイルが複数の磁気ディスク装置にまたがるよう
な場合にはアクセス時間を削減できる効果があるが、複
数の磁気ディスク装置にまたがらないデータサイズの小
さなファイルの書き込み，読み出しを行なう場合には、
シーク時間を隠すことができず，１介の磁気ディスク’
ｊＡｒｌと同じアクセス時間となってしまう問題があっ
た。また、従来技術は複数のファイルの連続的な書き込み，
読み出しを行なう点に配慮がされておらず、上位機器か
らの書き込み，読み出し命令を１件のファイルについて
のみ処理可能で，複数のファイルをアクセスする場合に
は、１件の処理を繰返し行なう必要があり、それに要す
るオーバヘッド時間が長くなってしまう問題があった。また、オーバヘッド時間のひとつとして、上位機器から
アクセス対象となるファイルを指定するためのファイル
識別コードから磁気ディスク装置の格納位置情報を検索
する処理がある。従来の一般的な磁気ディスク装置では
、ファイル識別コードとしてＡＳＣＩＩコード等の文字
コード列で構戒されるファイル名称で表現されており、
このファイル名称により，磁気ディスク装置のファイル
管理情報エリアに格納されているファイル管理情報を検
索して物理的な格納位置を求めなければならず、それに
要する処理時間が大きい問題があった。本発明の目的は、記憶容量が大きい、ファイルのサイズ
にかかわらず複数のファイルを連続的に高速に入出力で
きる，安価な集合型磁気ディスク装置を提供するもので
ある。［発明が解決するための手段】上記目的を達成するために以下の手段を採用した集合型磁気ディスク装置を、磁気ディスク装置を有する
複数台のデータ記憶装置と、データ記憶装置へ入出力す
るデータを一時格納する入出力バッファと、データ記憶
装置と入出力バッファの制御を行なうマルチディスクコ
ントローラとによって構威したものである。さらに、データ記憶装置を、磁気ディスクコントローラ
を有するｌ台の磁気ディスク装置によって構成するか、
または、磁気ディスクコントローラを有する複数台の磁
気ディスク装置と、磁気ディスク装置を選択するマルチ
プレクサとによって構成したものである。さらに，入出力バッファは、上記データ記憶装置装置と
、複数台につき、磁気ディスク装置の少なくとも１シリ
ンダ分の容量を持ち、１面、また、２面の半導体メモリ
によって構成する。なお、メモリは半導体記憶素子以外の光メモリ等の高速
記憶素子を用いて実現することもできる。データ記憶装置と入出力バッファの制御を行うマルチデ
ィスクコントローラは、上位機器からの要求を格納する
半導体記憶素子を用いた通信メモリと，データ転送の制
御を行なうマルチプレクスコントローラと、磁気ディス
ク装置内の物理的格納位置を検索するための半導体記憶
素子を用いた物理情報テーブルと、それらを制御するマ
スタコントローラとによって構成している．なお、通信
メモリ，物理情報テーブルは半導体記憶素子以外の光メ
モリ等の高速記憶素子を用いて実現することもできる．なお、マスクコントローラは、マイクロコンピュータを
使用し、各構成要素を制御するものである．さらにマルチディスクコントローラに，ファイル識別子
として、階層的なグループに分類する論理分類を行なっ
たファイルの該論理分類固有の識別コードである論理分
類よりと、該論理分類内の固有の番号とによって構戒す
るファイルＩＤを用いるようにしたものである。また、マルチディスクコントローラでは、ファイルＩＤ
内の論理分類ＩＤに従い、ファイルの磁気ディスク装置
への物理的格納位置を決定する管理情報を格納した構造
定義テーブルをマスクコントローラのメモリ内に持つよ
うにしたものである。上位機器と入出力バッファ間のデータ転送の制御を行な
うマルチプレクスコントローラは、入出力バッフアのデ
ータバスを選択するマルチプレクサと、マスクコントロ
ーラの介在なしにデータ転送を行なうＤＭＡコントロー
ラと、データ転送が必要な範囲の入出力パッファの先頭
アドレスを格納する先頭アドレス登録テーブルと、終了
アドレスを格納する終了アドレス登録テーブルとによっ
て構成したものである。In recent years, secondary information such as literature information and patent information (bibliographic information)
Large-scale database services that include not only primary information (text) but also primary information (text) are becoming increasingly important. Conventionally, methods using keywords and classification codes have been used to search for information in such databases. However, this method only narrows down the results from a few dozen to a few hundred, and there is an efficiency problem in that the searcher must directly read the text to confirm the content at the final stage. Furthermore, since the classification system itself changes over time, the problem arises that keywords and classification codes must be constantly updated. Furthermore, since adding keywords (referred to as indexing) takes time, new documents are registered in batches in large quantities. Therefore, there has been a problem that registered information that can be searched always has a certain period of delay. As one method for dealing with these problems, a full-text search system is being considered that allows a searcher to directly refer to the text of a document and search for content based on free keywords. On the other hand, several character string search devices aiming at such a full-text search system have been proposed. A typical structure thereof is shown in FIG. 2, and its contents will be explained first. In the string search device 101, the search control means 102 controls the entire search device, receives search requests sent from the host computer, analyzes them, and sends the search to the string matching means 105 and the compound condition discrimination stage 104. Send as information. The search control means 1-02 also controls the disk control means 103 to send the character string data stored in the character string storage means 106 to the character string collation means. The character string matching means 105 checks whether there is anything in the input character data that matches the search request, and if there is any matching, the character string matching means 105 passes information identifying the character string to the complex condition determining means 1. Output to 04. The compound condition determining means 104 checks whether compound conditions such as mutual positional relationship specified in the search request are satisfied based on the character string identification information. If the compound conditions are satisfied, pointer information to the corresponding document and text data of the document content are returned to the host computer as search results. Character string search device 1. A magnetic disk device capable of storing a large amount of data is required as the character string storage means 106, which is a component of 0.01. General magnetic disk drives have the problem of not being able to input and output data at high speeds, and multi-head magnetic disk drives that can input and output data at high speeds are extremely expensive. Therefore, a collective type magnetic disk device has been considered in which a plurality of inexpensive general small magnetic disks are connected to increase the data input/output speed. One of them is the Japanese Patent Publication No. 6
0-1. There is an "Image data division storage device J" described in Publication No. 17326. This device has multiple magnetic disk devices, and has the same number of magnetic disk controllers, input/output buffers, and external devices as the magnetic disk devices. The master controller divides data input from an external device into pieces that are less than the capacity of the input/output bus sofa, and sequentially transfers the divided data to each magnetic disk controller. The magnetic disk controller writes to the corresponding magnetic disk device.The mask controller causes the magnetic disk controller of the magnetic disk device that is not writing to perform a seek operation, so that two of the plurality of magnetic disk devices that store data are stored. This is an attempt to apparently eliminate the seek time after the second one and shorten the time for writing and reading data. (Problem to be solved by the invention) An important element in the string storage means of a string search device. teeth,
Large storage capacity, ability to input and output multiple files continuously at high speed regardless of file size,
There were three points: low cost, and it was necessary to develop a collective magnetic disk device that satisfied these factors. Conventional technology attempts to shorten data writing and reading time by simply eliminating the access time of seek time, and it is difficult to reduce the number of magnetic disk drives needed to meet the data transfer speed required by external equipment. There was a problem in terms of cost performance as there was no consideration given to whether it should be used in the configuration. In addition, the conventional technology has the effect of reducing access time when a file with a large data size such as image data spans multiple magnetic disk drives, but when the data size does not span multiple magnetic disk drives, When writing or reading small files,
Unable to hide seek time, only one magnetic disk'
There was a problem that the access time was the same as jArl. In addition, the conventional technology continuously writes multiple files,
No consideration is given to the point of reading, and write and read commands from the host device can only be processed for one file, and when accessing multiple files, it is necessary to repeatedly perform one process. However, there is a problem in that the overhead time required for this becomes long. Further, as one of the overhead times, there is a process of searching storage location information of the magnetic disk device from a file identification code for specifying a file to be accessed from a host device. In conventional general magnetic disk drives, the file identification code is expressed as a file name that is a string of character codes such as ASCII code.
This file name requires searching the file management information stored in the file management information area of the magnetic disk device to find the physical storage location, which poses a problem in that the processing time required is large. SUMMARY OF THE INVENTION An object of the present invention is to provide an inexpensive collective magnetic disk device that has a large storage capacity and can input and output multiple files continuously at high speed regardless of the file size. [Means for Solving the Invention] In order to achieve the above object, a collective magnetic disk device employing the following means is provided with a plurality of data storage devices each having a magnetic disk device, and data to be input/output to the data storage device. It consists of an input/output buffer that temporarily stores data, and a multi-disk controller that controls the data storage device and the input/output buffer. Furthermore, the data storage device is configured by l magnetic disk devices each having a magnetic disk controller, or
Alternatively, it is configured by a plurality of magnetic disk devices each having a magnetic disk controller and a multiplexer for selecting the magnetic disk devices. Furthermore, the input/output buffer has a capacity equivalent to at least one cylinder of a magnetic disk device for each of the data storage devices, and is constituted by semiconductor memory on one or two sides. Note that the memory can also be realized using a high-speed storage element such as an optical memory other than a semiconductor storage element. The multi-disk controller, which controls data storage devices and input/output buffers, consists of a communication memory using semiconductor storage elements that stores requests from higher-level devices, a multiplex controller that controls data transfer, and a multi-disk controller that controls data transfer. It consists of a physical information table using semiconductor memory elements to search for physical storage locations, and a master controller to control them. Note that the communication memory and physical information table can also be realized using high-speed storage elements such as optical memory other than semiconductor storage elements. Note that the mask controller uses a microcomputer to control each component. Furthermore, the multi-disk controller uses the logical classification, which is an identification code unique to the logical classification of files that have been logically classified into hierarchical groups, as well as a unique number within the logical classification, as a file identifier. In this case, the file ID is used. In addition, in a multi-disk controller, the file ID
The mask controller has a structure definition table in its memory that stores management information for determining the physical storage location of a file in the magnetic disk device according to the logical classification ID in the file. The multiplex controller controls data transfer between the host device and the input/output buffer, and includes a multiplexer that selects the data bus for the input/output buffer, a DMA controller that transfers data without the intervention of a mask controller, and a DMA controller that controls data transfer between the host device and the input/output buffer. It consists of a start address registration table that stores the start address of the input/output puffer in the range, and an end address registration table that stores the end address of the range.

[Effect]

上記技術手段の働きを以下に述べる６データ記憶装置がｎ台、データ記憶装置内の上記磁気デ
ィスク装置の転送データがトラック間にまたがらずシー
ク動作を行なわないときの磁気ディスク装置から入出力
バッファヘのデータ転送速度をを　［　Ｂ　ｙｔｅ／　ｓ
ｅｃ］、ディスク装置の１シリンダ分の容量をＭ　［Ｂ
ｙｔｅｌ，磁気ディスク装置の最小シーク時間をｓ　［
ｓｅｅｌ　．磁気ディスク装置の回転速度をＲ　［ｒｐ
ｓ］　、出力バッファの容量を上記磁気ディスク装置の
１シリンダ分の容量Ｍ［Ｂｙｔｅｌと同一とした場合に
、集合型磁気ディスク装置から上位機器へのデータ転送
速度Ｔ　［　Ｂ　ｙｔｅ／　ｓｅｃｌは以下の条件を満
足する必要がある。磁気ディスク装置の最小シーク時間ｓ　［ｓｅｃｌが１
つの上記入出力バッファのＭ［Ｂｙｔｅ］のデータを上
位機器に転送する時間（Ｍ／Ｔ）　［ｓｅｃｌより大き
い場合、データ記憶装置から出力バッファ八のデータ転
送時間は、磁気ディスク装置の最小シーク時間ｓ　［ｓ
ｅｃｌと，磁気ディスク装置の最大回転待ち時間（　１
　／　Ｒ　）　　［ｓｅｅ］と、データ記憶装置から入
出力バッファへのデータ転送時間（Ｍ／ｔ）　　［ｓｅ
ｃ］の合計時間で、これが、全ての入出力バッファのデ
ータを上位機器に転送する時間（ｎＭ／Ｔ）　　［ｓｅ
ｃ］以内で動作すればよい。これを数式で表すとのようになり、データ記憶装置の台数ｎは次式のように
書き表すことができる。また、磁気ディスク装置の最小シーク時間５　［ｓｅｃ
］が１つの入出力バツファのＭ［Ｂｙｔｅコのデータを
上位機器に転送する時間（Ｍ／Ｔ）［ｓｅＣ］以下の場
合には、磁気ディスク装置がシーク動作を終了しても磁
気ディスク装置からデー夕転送を行なおうとする入出力
バッファが、上記機器へのデータ転送を行っているため
にデータ記憶装置から入出力バッフアへのデータ転送が
できない。そのため、入出力バッファから上位機器への
データ転送が終了するまで待つ必要がある。そこで、デ
ータ記憶装置から入出力バッファへのデータ転送時間は
、１つの入出力バッファから上位機器へのデータ転送時
間（Ｍ　／　Ｔ　）　　［ｓｅｃｌと、磁気ディスク装
置の最大回転待ち時間（１／Ｒ）［ｓｅｃ］とデータ記
憶装置から入出力バッファへのデータ転送時間（Ｍ　／
　ｔ　）　　［ｓｅｃｌの合計時間が、全ての入出力バ
ッファのデータを上位機器に転送する時間（ｎＭ／Ｔ）
　　［ｓｅｃ］以内で動作すればよい．これを数式で表
すとのようになり、データ記憶装置の台数ｎは次式のように
書き表すことができる。これらの条件式を満足する最小台数のデータ記憶装置で
集合型磁気ディスク装置を構或することで、上位機器の
要求するデータ転送速度を満足するコストパフォーマン
スの良い磁気ディスクｉｔを提供することができる。データ記憶装置は、データファイルの記憶を行なう。デ
ータ記憶装置を磁気ディスクコントローラを有する磁気
ディスク装置で構或することによって、磁気ディスクへ
のデータの書き込み，読み出し制御を該磁気ディスクコ
ントローラが行ない、マルチディスクコントローラの処
理が軽減される。また、データ記憶装置を複数台の磁気ディスク装置と、
磁気ディスク装置のデータパスを上記入出力バッフアの
データバスに選択して接続するマルチプレクサにより構
或することにより、記憶容量を大きくすることができる
。入出力バッファはデータ記憶装置に入出力するデータの
一時格納を行なう。書き込みの場合、データ記憶装置内の磁気ディスク装置
の書き込み速度より早い速度で、上位機器から入出力バ
ッファに次々とデータの転送を行ない、データ転送が終
了した入出力バッファは磁気ディスク装置へ磁気ディス
ク装置の書き込み速度でデータの書き込みを行なう。読
み出しの場合、それぞれの磁気ディスク装置は磁気ディ
スク装置の読み出し速度で入出力バッファへのデータの
読み出しを行ない，読み出しが終了した入出力バッファ
は、磁気ディスク装置の読み出し速度よりも早い速度で
、上位機器へのデータの転送を行なう．これにより、上
位機器へのデータの入出力を磁気ディスク装置の書き込
み，読み出し速度よりも早い速度で行なうことができる
。さらに入出力バッファをデータ記憶装置装置と、複数台
につき２面持つことにより，第ｌ面の入出力バッファが
上位機器とデータ転送している間に、第２面の入出力バ
ッファがデータ記憶装置との書き込み，読み出しを行な
える。これにより上位機器とのデ一夕転送が終了するま
で磁気ディスク装置がデータの転送動作を待つ時間を削
減でき、書き込み，読み出しが短時間に行なえる。この
ときの上位機器の要求するデータ転送速度を満足するコ
ストパフォーマンスの良い磁気ディスク装置を提供する
条件式は、第１式で表現される。マルチディスクコントローラは上位機器からのデータフ
ァイルの書き込み，読み出し要求に対し、データ記憶装
置と入出力バッファの制御を行なうものである．書き込
み，読み出しの対象となるファイルのファイルＩＤを複
数件格納できる半導体記憶素子を用いた通信メモリは、
上位機器からの命令の受理，処理の終了報告処理におけ
るオーバヘッド時間が削減され、データファイルの連続
書き込み，読み出しが短時間に行なえる．短時間にアク
セス可能な半導体記憶素子を用いた物理情報テーブルは
、論理的なファイルＩＤから磁気ディスク装置の物理的
格納位置を短時間に求めることができ、このため、デー
タファイルの読み出しにかかるオーバヘッド時間が短時
間になる。また、磁気ディスク装置に格納するファイルの識別を行
なうものは、従来、可変長の文字コード列で構成される
ファイル名称であったのに対し、固定長の数値コードで
構或されるファイルＩＤは、小さなサイズのコードで表
現が可能で，書き込み，読み出しを行なうデータファイ
ルの指定や物理的格納位置の検索処理が単純化され、そ
れに要するオーバヘッド時間も短縮できる。さらに，データファイルを格納する場合にも、論理的に
関係するファイルの物理的格納位置を近接させることに
より、シーク時間を短くすることができアクセス時間を
短縮できる．マルチブレクスコントローラ内のマルチプレクサは、上
記入出力バッファのデータパスを選択する．先頭アドレ
ス登録テーブルと終了アドレス登録テーブルは，入出力
バッファに格納されているデータの内、必要なデータが
格納されている範囲を指定する先頭アドレスと終了アド
レスをいくつか格納する．ＤＭＡコントローラは、先頭
アドレス登録テーブルと終了アドレス登録テーブルで指
定した範囲の入出力パッファのデータを上位機器にマス
クコントローラの介在なしで高速に転送する。磁気ディスク装置の同一シリンダ上に読み出すファイル
が複数件ある場合に、読み出すファイルのサイズをｆ　
１　［Ｂｙｔｅｌ　ｖ　ｆ　２　［Ｂｙｔｅ］　、その
間の読み出し不要のファイルのサイズをｋ　［Ｂｙｔｅ
］　、磁気ディスク装置の読み出し速度をを　［　Ｂ　
ｙｔｅ／　ｓｅｃ］　．磁気ディスク装置の回転速度を
Ｒ　［ｒｐｓ］　、磁気ディスク装置の平均シーク時間
をＳ　［ｓｅｃ］とするとき、平均回転待ち時間は（１
／２Ｒ）　　［ｓｅｃコであり、一度に読み出す時間が
一つづつ読み出す時間よりも短かくなる条件は、のように表すことができる。この数式は容易に次式のように書き表すことができる。ｆこの条件式を満足する時、マルチプレクスコントローラ
は、読み出し不要のファイルも一旦入出力バッファに読
み出し、上位機器に転送する際に不要なファイルの部分
を除いて必要部分のみを転送する。これにより，磁気デ
ィスクが一度の読み出し処理で複数のファイルを読み出
すことができ、読み出し処理で発生するアクセス時間を
短くすることができる。The operation of the above technical means will be described below.6 When there are n data storage devices and the transfer data of the above magnetic disk devices in the data storage devices does not span between tracks and does not perform a seek operation, the transfer data is transferred from the magnetic disk device to the input/output buffer. The data transfer rate is [Byte/s]
ec], the capacity of one cylinder of the disk device is M [B
ytel, the minimum seek time of the magnetic disk device is s [
seel. The rotational speed of the magnetic disk device is R [rp
s], the output buffer capacity is the same as the capacity of one cylinder of the magnetic disk device M [Byte], the data transfer rate from the collective magnetic disk device to the host device T [Byte/sec is as follows. conditions must be met. Minimum seek time s of magnetic disk device [secl is 1
The time (M/T) to transfer M [Bytes] of data from the above input/output buffers to the host device [If larger than secl, the data transfer time from the data storage device to the output buffer 8 is the minimum seek time of the magnetic disk device s [s
ecl and the maximum rotational waiting time of the magnetic disk device (1
/ R ) [see] and the data transfer time from the data storage device to the input/output buffer (M/t) [se
c], which is the time (nM/T) to transfer the data of all input/output buffers to the upper device [se
c]. This can be expressed as a mathematical formula, and the number n of data storage devices can be expressed as in the following formula. In addition, the minimum seek time of the magnetic disk device is 5 [sec
] is less than the time (M/T) [seC] for transferring 1 Byte of data to the host device of one input/output buffer, even if the magnetic disk device finishes the seek operation, the magnetic disk device Data cannot be transferred from the data storage device to the input/output buffer because the input/output buffer that is attempting to transfer data is currently transferring data to the device mentioned above. Therefore, it is necessary to wait until the data transfer from the input/output buffer to the higher-level device is completed. Therefore, the data transfer time from the data storage device to the input/output buffer is the data transfer time from one input/output buffer to the host device (M/T) [secl] and the maximum rotational waiting time (1/R) of the magnetic disk device. ) [sec] and the data transfer time from the data storage device to the input/output buffer (M/
t) [The total time of secl is the time to transfer data of all input/output buffers to the upper device (nM/T)
It only needs to operate within [sec]. This can be expressed as a mathematical formula, and the number n of data storage devices can be expressed as in the following formula. By constructing a collective magnetic disk device with the minimum number of data storage devices that satisfy these conditional expressions, it is possible to provide a magnetic disk IT with good cost performance that satisfies the data transfer speed required by host equipment. . The data storage device stores data files. By configuring the data storage device as a magnetic disk device having a magnetic disk controller, the magnetic disk controller controls writing and reading data to and from the magnetic disk, and the processing of the multi-disk controller is reduced. In addition, the data storage device can be a plurality of magnetic disk devices,
The storage capacity can be increased by configuring a multiplexer that selectively connects the data path of the magnetic disk device to the data bus of the input/output buffer. The input/output buffer temporarily stores data to be input/output to the data storage device. In the case of writing, data is transferred one after another from the host device to the input/output buffer at a speed faster than the writing speed of the magnetic disk device in the data storage device, and after the data transfer is completed, the input/output buffer is transferred to the magnetic disk device. Data is written at the writing speed of the device. In the case of reading, each magnetic disk device reads data to the input/output buffer at the read speed of the magnetic disk device, and the input/output buffer after reading is transferred to the upper level at a speed faster than the read speed of the magnetic disk device. Transfers data to the device. Thereby, data can be input/output to the host device at a faster speed than the writing/reading speed of the magnetic disk device. Furthermore, by having two input/output buffers for each data storage device, while the input/output buffer on the first side is transferring data with the host device, the input/output buffer on the second side is connected to the data storage device. You can write to and read from. This can reduce the time the magnetic disk device waits for the data transfer operation until the data transfer with the host device is completed, and writing and reading can be performed in a short time. The conditional expression for providing a magnetic disk device with good cost performance that satisfies the data transfer speed required by the host device at this time is expressed by the first expression. The multi-disk controller controls data storage devices and input/output buffers in response to data file write and read requests from higher-level devices. A communication memory using a semiconductor memory element that can store multiple file IDs of files to be written or read is
Overhead time in receiving commands from host devices and processing completion reports is reduced, allowing continuous writing and reading of data files in a short time. A physical information table using a semiconductor memory element that can be accessed in a short time can quickly determine the physical storage location of a magnetic disk device from a logical file ID, and therefore reduces the overhead required to read data files. Time becomes short. In addition, whereas files stored in magnetic disk devices have traditionally been identified by file names consisting of variable-length character code strings, file IDs are made up of fixed-length numerical codes. , can be expressed using small-sized code, simplifying the process of specifying data files to be written or read and searching for physical storage locations, and reducing the overhead time required. Furthermore, when storing data files, by physically storing logically related files close to each other, seek time and access time can be shortened. A multiplexer in the multiplex controller selects the data path of the input/output buffer. The start address registration table and end address registration table store several start addresses and end addresses that specify the range where the necessary data is stored among the data stored in the input/output buffer. The DMA controller transfers input/output buffer data in the range specified by the start address registration table and the end address registration table to the host device at high speed without the intervention of a mask controller. If there are multiple files to be read on the same cylinder of the magnetic disk device, set the size of the file to be read to f.
1 [Byte v f 2 [Byte], the size of the file that does not need to be read during that time is k [Byte]
] , the read speed of the magnetic disk device [B
yte/sec]. When the rotational speed of the magnetic disk device is R [rps] and the average seek time of the magnetic disk device is S [sec], the average rotational waiting time is (1
/2R) [sec], and the condition that the time to read at a time is shorter than the time to read one by one can be expressed as follows. This formula can be easily written as the following formula. f When this conditional expression is satisfied, the multiplex controller temporarily reads files that do not need to be read into the input/output buffer, and transfers only the necessary portions, excluding unnecessary file portions, when transferring the files to the host device. This allows the magnetic disk to read a plurality of files in one read process, thereby shortening the access time that occurs in the read process.

【実施例１以下、本発明を文字列検索装置に適用した実施例である
実施例１を説明する。第３図は本発明を用いた集合型磁気ディスク装置の構成
を示すもので、磁気ディスク装置１を有するｎ台のデー
タ記憶装置１５と、データ記憶装置１５それぞれに接続
する磁気ディスク装置１の１シリンダ分の容量を持つ入
出力バッファ３と、データ記憶装１１５と入出力バッフ
ァ３の制御を行なうマルチディスクコントローラ４によ
って構或している。ここではデータ記憶装Ｗ１５は装置と、複数台の磁気デ
ィスク装置１で構威し，入出力バッファ３は上記磁気デ
ィスク装置１の１シリンダの容量を持つメモリ１面で構
威している。マルチディスクコントローラ４は、アクセスの対象とな
るファイルのファイルＩＤを上位機器７から直接設定で
きる通信メモリ５と高速データバス１０の制御を行なう
マルチプレクスコントローラ８とファイルＩＤから磁気
ディスク装置の格納先物理情報を求めるための変換テー
ブルである物理情報テーブル６および、それらを制御す
るマスクコントローラ９によって構成している。上位機器７は集合型磁気ディスク装置に命令を与えるホ
ストコントローラと入力されるデータの中から指定した
文字列を検出し、その検出情報を出力する文字列検索装
置により構威している。本集合型磁気ディスク装置にデータファイルを構或する
データベースの構築を行なう前には、データベースの構
造定義処理を行なう。本集合型磁気ディスク装置では論理的に関連するファイ
ルを物理的格納位置が近接するように配置する手段とし
て、最初に物理シリンダを階層構造を持つ論理分類ＩＤ
に従い割り振っている。複数件のファイルを一度にアク
セスする場合、論理的に関連するファイルを対象にする
ことが多い。そこで、格納位置を近接させることにより、磁気ディス
ク装置のシリンダ間を磁気ヘッドが移動する距離を短く
し、アクセス時間の一部であるシーク時間を短縮させる
．階層構造を持つ論理分類ＩＤに従った物理シリンダの割
り振りは、上位機器７が論理分類ＩＤと該ファイル分類
が必要とする記憶容量の組が集まって構成されるデータ
ベース構造定義情報を通信メモリ５に格納した後、マル
チディスクコントローラ４に対しデータベースの構造定
義命令を発行する。構造定義命令を受けたマルチディス
クコントローラ４内のマスクコントローラ９は、通信メ
モリ５にセットされたデータベースの構造定義情報に基
づいて、論理分類に物理位置がどう対応するかをマスク
コントローラ９内のメモリ上に第４図で示すような構造
の構造定義テーブルを作或する。第４図は２階層でそれ
ぞれの階層で２つの分類を持つ例で、磁気ディスク装置
全体を一台の磁気ディスク装置としてまとめて、各分類
ごとの格納位置をシリンダの位置で，記憶容量をシリン
ダ数で示したものである。また、データベースの構造定義処理では，マルチディス
クコントローラ４内のマスクコントローラ９は論理分類
毎に、書き込むファイルの格納先の物理位置を保持する
ために、マスクコントローラ９内のメモリ上に第５図に
示すような、書き込むファイルの格納先の物理位置を差
し示す格納位置ポインタテーブルを作戒する。構造定義
が終了した時点では、格納位置ポインタテーブルは構造
定義で設定した各論理分類の先頭シリンダ，先頭トラッ
ク，先頭セクタ，セクタ内先頭位置を示すことになる。第５図では第４図で示した例の分類でファイルを格納し
た場合の格納位置ポインタ情報を格納している。次にデータベースの構築について説明する。本集合型磁
気ディスク装置ではアクセスの対象となるファイルをフ
ァイルＩＤ（論理分類ＩＤと論理分類内の個有の番号で
構成）により指定する手段として、ファイルＩＤを用い
た管理情報を作成している。上位機器７は通信メモリ５に書込み対象となるファイル
のファイルＩＤとファイルサイズの組が複数件分集まっ
て構成されるファイル情報を格納した後、マルチディス
クコントローラ４に対し書き込み命令を発行する。書き
込み命令を受けたマルチディスクコントローラ４は，第
７図に示すフローで処理を実行する。マルチディスクコ
ントローラ４内のマスクコントローラ９は，通信メモリ
５からファイル情報の中のファイルＩＤを読み出し，該
ファイルＩＤが示すファイルを格納する格納位置を格納
位置ポインタテーブルから読み出す．格納位置が求まる
とその物理シリンダに書き込める残り容量が求まる。そ
の残り容量よりもファイル情報のファイルサイズで与え
られるファイルのサイズが小さければ第６図に示すよう
なファイルＩＤをエントリとする物理情報テーブル６に
その格納位！！（ディスク番号，シリンダ番号，トラッ
ク番号，セクタ番号，セクタ内位置）、ファイルサイズ
，ディスクまたがり数を書き込む．ディスクまたがり数
は，ファイルが何台の磁気ディスク装置１にまたがって
いるかを表わすもので、処理対象となっているファイル
が、装置と、複数台の磁気ディスク装置の１つのシリン
ダに書き切れなかった場合はファイルを分割して書き残
したファイルを次のディスクに書き込むことになる。こ
のファイル分割した書き残しファイルであれば，この値
をカウントアップする。物理情報テーブル６のエントリ
はファイル情報で与えられるファイルＩＤで示される。物理情報テーブルへの書込みの後、格納位置ポインタを
ファイルサイズ分進める。ファイルサイズと残り容量が等しい場合は，ｌ台の磁気
ディスク装置１のシリンダがいっぱいになった時で，そ
の磁気ディスク装置１への書き込み処理を行なう。残り容量よりもファイルサイズが大きい場合には、磁り
容量と分割基準サイズを比べる．分割基準サイズは構造
定義処理で設定する値で，シリンダの残り容量が非常に
小さいにもかかわらずファイルを磁気ディスク装置１の
間にまたがるように格納すると、そのファイルを読み出
すためには２台の磁気ディスク装置１を制御しなければ
ならず、その処理分オーバヘッドが大きくなる．そこで
、ある基準を設定してその基準値よりも残り容量が小さ
い場合には次の磁気ディスク装ｆｌｌのシリンダの先頭
から書き込むようにするものである。残り容量が分割基準サイズ以上の場合には、物理情報テ
ーブル６に格納位置、ファイルサイズを格納した後，残
り容量に書き込める分のファイルと書き残した分の書き
残しファイルとに分割する．物理情報テーブル６には格
納物理位置とファイルサイズを書込む．ｌシリンダがいっぱいとなる物理情報を作威した磁気デ
ィスク装置１は書き込み処理を行なう。書き残しファイルはループを戻り、次の処理対象ファイ
ルとなる．残り容量が分割基準サイズよりも小さい場合には，格納
位置ポインタテーブルを次のシリンダの先頭に進めた後
、処理対象ファイルをそのまま次の処理対象ファイルと
してループを戻り処理を続ける．この時、１シリンダが
いっぱいとなる物理情報を作威した磁気ディスク装置は
書き込み処理を行なう．書き込み処理は、マスクコントローラ９がシーク命令を
磁気ディスク装置１に発行し、シーク動作を開始する．
次に，上位機器７にファイルの転送要求を発行し，マス
クコントローラ９は上位機器７にファイルの転送を要求
するとともに，マルチプレクスコントローラ８を制御し
てデータパスを切り換え、転送されてくるファイルを物
理情報で指定する入出力バッファ３へのファイルの転送
を行なう。シーク動作が終了し、ファイルの転送が終了
するとマスタコントローラ９は書込み命令を磁気ディス
ク装置１に発行し、該磁気ディスク装置１は書き込み動
作を実行する。上記の動作を繰返しデータベースの構築を行なう。第８図は書き込み処理の時間的な関係を示すもので、上
位機器７から図に示すように１−１，２−１，・・・・
・・　Ｈ　　１．１−２＋　２　　２ｔ・・・・・・と
次々と転送されてくるデータは、マルチディスクコント
ローラ４内のマルチプレクスコントローラ８により、入
出力バッファ３−１．３−２，・・・・・・３−ｎ，　
３−１．　３−２，・・・・・・に格納される。このと
き、例えば磁気ディスク装置１−１は、データ転送１−
１を開始する直前にマスクコントローラ９の指令により
シークを開始している。データ転送１−１が終了した時
点で、マスクコントローラ９は磁気ディスク装置１−１
に書き込み命令を発行する。磁気ディスク装置１−１は
指定の書き込み位置に達するまで回転待ちを行なった後
、入出力バッファ３−１のデータ１−１を所定のシリン
ダ，トラック，セクタへ書き込み始める。この間、他の磁気ディスク装置も図に示すように同様の
処理を行なうことになる。第８図とこれに関する以上の説明から明らかなように、
各磁気ディスク装置はそれぞれ並行して、連続でファイ
ルの書き込みができ、短時間でデータベースの構築がで
きる。次に、ファイルの読み出し処理について説明する。また
、同一磁気ディスク装置の同一シリンダ上に読み出すフ
ァイルが複数件ある場合に、読み出すファイルの間にあ
る読み出し不要のファイルも入出力バッファに一旦読み
出し、上位機器に転送する際に読み出し不要のファイル
を削除する手段について説明する。上位機器７は読み出すファイルのファイルＩＤが複数件
分集まって構成するファイル情報を通信メモリ５に格納
した後、マルチディスクコントローラ４に対して読み出
し命令を発行する。読み出し命令を受けたマルチディスクコントロ一ラ４は
、第９図に示すフローで処理を実行する．マルチディス
クコントローラ４内のマスクコントローラ９は、通信メ
モリ５から最初に読み出すべきファイルのファイルＩＤ
を読み出し、該ファイルＩＤから該ファイルが格納され
ている物理情報を物理情報テーブル６により検索する。このファイルを先ファイル、物理情報を先ファイルの物
理情報とする。次に、通信メモリ５から次に読み出すべ
きファイルのファイルＩＤを読み出し、該ファイルＩＤ
から該ファイルが格納されている物理情報を物理情報テ
ーブル６により検索する。このファイルを後ファイル、
物理情報を後ファイルの物理情報とする。求めた物理情報から先ファイルと後ファイルが同一シリ
ンダに存在するかを調べ、同一シリンダに存在するかを
調べ，同一シリンダに存在すれば先ファイルと後ファイ
ルの間に、指定していない読み出し不要のファイル群が
あるか調べ、あれば、そのファイル群の総サイズを求め
る。読み出し不要の一ファイルのサイズが小さい場合に
は、先ファイルと後ファイルを一度の読み出し命令で読
出せるように、物理情報を合成する。次に合成した物理
情報を先ファイルの物理情報としてループを戻り，通信
メモリ５から次のファイルＩＤを読み出し、そのファイ
ルを後ファイルとして同様な処理を行なう。先ファイルと後ファイルが同一シリンダに存在しない場
合と読み出し不要ファイルのサイズが大きい場合には、
先ファイルの磁気ディスク装置からの読み出し処理を実
行する。後ファイルの物理情報は先ファイルの物理情報
としてループを戻り、通信メモリ５から次のファイルＩ
Ｄを読出し、それを後ファイルとして同様な処理を行な
う。このような動作を指定したファイルすべてを読み出すま
で繰り返す。先ファイルの磁気ディスク装置からの読み出し処理は、
まず、マスクコントローラ９は先ファイルの物理情報が
示す磁気ディスク装置１−ｉの磁気ディスクコントロー
ラ２−ｉに物理情報が示す物現位置へ磁気ヘッドを移動
させるシーク命令を発行し、磁気ディスク装ｉｉ　１　
−　ｉはシーク動作を開始する。シーク動作が終了する
と、入出力バツファ３−ｉがデータを書き込んでも良い
状態であれば、マスクコントローラ９は読み出し命令を
磁気ディスクコントローラ２−ｉに発行し、入出力バッ
ファ３−ｉに磁気ディスク装置１−ｉから読み出したフ
ァイルの格納を開始する。格納が終了すると、マスクコ
ントローラ９はマルチプレクスコントローラ８を制御し
て入出力バッフア３−ｉから上位機器７へのデータの転
送を開始させる。マルチプレクスコントローラ８は第１０図に示すように
、上位機器７のデータパスに入出力バツファ３−１から
３−ｎのデータパスを選択して接続するマルチプレクサ
２０１と選択したｉ番目の入出力バッファ３−ｉから上
位機器７にマスクコントローラ９の介在なしにデータを
出力するＤＭＡコントローラ２０２と該ＤＭＡコントロ
ーラ２０２に入出力バッファ３−ｉの転送範囲を指定す
るための先頭アドレスと終了アドレスを格納する先頭ア
ドレス登録テーブル２０３と終了アドレス登録テーブル
２０４により構或している。マスクコントローラ９は人出カバツファ３−ｉの転送す
べきファイルが存在する先頭アドレスを先頭アドレス登
録テーブル２０３に、終了アドレスを終了アドレス登録
テーブル２０４に設定した後、他の入出力バッファ３か
ら上位機器７へのデータの転送が行なわれていなければ
ＤＭＡコントローラ２０２に起動命令を発行する。ＤＭ
Ａコントローラ２０２は先頭アドレス登録テーブル２０
３と終了アドレス登録テーブル２０４を参照しながら指
定した範囲のデータのみを上位機器７の要求する転送速
度でマスクコントローラ９の介在なしに転送を行なう。先ファイルと後ファイルを一度の読出し命令で読み出せ
るように、物理情報を合成する処理を行ない入出力バッ
ファ３−ｉに読み出した場合には、先頭アドレス登録テ
ーブル２０３と終了アドレス登録テーブル２０４に必要
なファイルすべてが転送されるようにアドレスを複数件
分設定し、同様な処理を行なう。先ファイルと後ファイルを一度の読出し命令で読み出せ
るように、物理情報を合成する処理は次の条件を満足す
る場合に行なう．先ファイルのサイズをｆ　１　［Ｂｙｔｅ］　、後ファ
イルのサイズをｆ　２　［Ｂｙｔｅ］　＋読み出し不要
のファイル群の総サイズをｋ　［Ｂｙｔｅ］　、磁気デ
ィスク装置１から入出力パツファ３へのシーク動作を含
まない実効的な転送速度をを　［　Ｂ　ｙｔｅ／　ｓｅ
ｅ］、回転速度をＲ　［ｒｐｓ］　．平均シーク時間を
ｓ　［ｓｅｃ］とするとき、平均回転待ち時間は（１／
２Ｒ）であり、一度に読み出す時間が一つずつ読み出す
時間よりも短かくなる条件は，のようになり，第３式で示すように書き表すことができ
る．ｔｋ ≦ （３）２Ｒファイルの読み出し処理の時間的な関係は、上位機器７
が要求する転送速度をＴ　［Ｂｙｔｅ／ｓｅＣｌ、各磁
気ディスク装置１の１シリンダ分の容量がＭ　［Ｂｙｔ
ｅ］　、各磁気ディスク装置１から入出力バッファ３へ
の転送速度をを　［Ｂｙｔｅ／ｓｅｃ］　．各磁気ディ
スク装置ｌの最小シーク時間をｓ　［ｓｅｃ］，回転速
度をＲ　［ｒｐｓ］とすると、最少シーク時間ｓ　［ｓ
ｅｃ］がｉ番目の入出力バッフア３−ｉ上のファイルを
上位機器７に転送する時間（Ｍ／Ｔ）より大きい場合に
は、第１１図に示すようになる。上位機器７の要求する転送速度を満足するには、ｉ台目
の磁気ディスク装１１−ｉが入出力バッファ３−ｉにフ
ァイルを読み出す時間（　ｓ　＋　１　／　Ｒ＋Ｍ／　
ｔ　）が、全ての入出力バッファ３上のファイルを上位
機器７に転送する時間（　ｎ　Ｍ　／　Ｔ　）以内であ
れば良いことになる。ここでは、連続したシリンダを読
み出すためシーク時間を最少シーク時間とした。また、
磁気ディスク装Ｗ１に読み出し命令を発行した時点の磁
気ヘッドの位置がいかなる場合でも、上位機器７の要求
する転送速度を満足するように、回転待ちの時間を最大
値である（１／Ｒ）とした。この関係を数式で表わすと
のようになり、第１式で示すように書き表わすことがで
きる。また、最少シーク時間５　［ｓｅｃ］がｉ番目の入出力
バッフア３−ｉ上のファイルを上位機器７に転送する時
間（Ｍ／Ｔ）以下の場合のファイルの読み出し処理の時
間的な関係は、第１２図に示すようになる。この場合は
、シーク動作が終了しても入出力バッファ３−ｉはファ
イルを上位機器７に転送中であるため、読み出し命令を
ｉ台目の磁気ディスク装１ｉ　１　−　ｉに発行するこ
とができない．そこで、入出力バッファ３−ｉのファイ
ルが上位機器７に転送が終了した時点に読み出し命令を
ｉ台目の磁気ディスク装置１　−　ｉに発行することに
なる。従って、上位機器７の要求する転送速度を満足す
るには，ｉ台目の磁気ディスク装Ｍ　１　−　ｉが入出
力バッファ３−ｉにファイルを読み出す時間（Ｍ／Ｔ　
＋　１　／Ｒ　＋Ｍ／　ｔ　）が、全ての入出力バッフ
ァ３上のファイルを上位機器７に転送する時間（ｎＭ／
Ｔ）以内であれば良いことになる．この関係を数式で表
わすとのようになり、第２式で示すように書き表わすことがで
きる。ｔＭＲ速度を満足するには磁気ディスク装置ｌを何台組み合わ
せればよいかを求めることができ、第工式を満足する最
少の台数の磁気ディスク装置１で集合型磁気ディスク装
置を構或すれば最もコストパフォーマンスの良いものと
なる。例えば、１トラックの容量が２０ｋ　（キロ）［　Ｂ　
ｙｔｅ］の６トラックからなる，１シリンダ分の容量が
１　２　０　ｋ　［Ｂｙｔｅｌの磁気ディスク装置１に
より構威し、上位機器７が要求する転送速度を２Ｍ（メ
ガ）　　［　Ｂ　ｙｔｅ／　ｓｅｃ］　．各磁気ディス
ク装置１から入出力バッファ３へのシーク動作を含まな
い実効的な転送速度をＩ　Ｍ　［　Ｂ　ｙｔｅ／　ｓｅ
ｃ］、各磁気ディスク装置１の最小シーク時間を１０ｍ
（ミリ）　　［ｓｅｃコ、回転速度を５０［ｒｐｓ］と
すると、第１式は次のようになり，この式を満足する最少のｎは４となる。第１３図に３台の磁気ディスク装Ｗ１で構成した集合型
磁気ディスク装置の読み出し中の時間関係で、第１４図
に４台の磁気ディスク装置工で構成した集合型磁気ディ
スク装置の読み出し中の時間関係、第１５図に５台の磁
気ディスク装置１で構成した集合型磁気ディスク装置の
読み出し中の時間関係を示す。第１３図の３台の磁気ディスク装置１で構或した場合に
は、図からもわかるように磁気ディスク装置１から入出
力バッファ３にデータを読み出す時間が入出力バッファ
３から上位機器７への転送時間に間に合わず、入出力バ
ッファ３から上位機器７にデータの転送ができない時間
ａが発生し、入出力バッファ３から上位機器７への転送
速度が約１　．　６　Ｍ　［　Ｂ　ｙｔｅ／　ｓｅｅコ
となり上位機器が要求する転送速度を満足できない。また、第１５図の５台の磁気ディスク装置１で構成した
場合には、上位機器７が要求する転送速度を満足はする
ものの，第１４図の４台の磁気ディスク装ｉ！ｌで構成
した場合に比べ、王台の磁気ディスク装Ｎ１が処理をし
ない時間ｂが長く磁気ディスク装置の使用効率が悪い。従って，第１式を満足する最少のｎに一致する４台の磁
気ディスク装置１で構或した場合が、最もコストパフォ
ーマンスの良い集合型磁気ディスク装置と言える。本発明を文字列検索装置に適用したもう１つの実施例で
ある実施例２について第ｌ図を用いて説明する．実施例１で説明した集合型磁気ディスク装置は、指定し
たファイルのみを読み出す場合、指定したファイルが磁
気ディスク装Ｗ１−１から１−ｎに平均して存在すれば
、実施例１で述べたような動作を実施して、上位機器７
八のデータ転送速度を高めることができる。しかし、装
置と、複数台の磁気ディスク装置１−ｉにだけ指定した
ファイルが存在する場合、装置と、複数台の磁気ディス
ク装置１　−　ｉの読み出しが連続して行われることに
なる。この場合、上位機器７八のデータ転送は、一旦磁
気ディスク装置１−ｉから入出力バッファ３−ｉに読み
出した後、入出力バッファ３−ｉから上位機器７へ転送
する２段読み出しを行なわねばならないため、データ転
送が低下してしまうという状況が発生する。このように
、指定したファイルが偏って磁気ディスク装置１に存在
すると上位機器７八のデータ転送速度を効果的に高める
ことができない状況が発生し得る。そこで、実施例２は
、ファイルが偏って格納されないようにすることで，常
に全磁気ディスク装置１を読み出し動作させ、上位機器
７へのデータ転送速度を高めるものである。また，本実施例では記憶容量をさらに高めるために、磁
気ディスク装置の台数を増やしている。第１図は本発明を用いた集合型磁気ディスク装置の構或
を示すもので、第３図との相違点は磁気ディスク装置１
の１シリンダ分と同じ容量の入出力バッフア３を２面持
ち、第１面の入出力バッフア３ａのデータを上位機器７
に転送している間に、第２面の入出力バッファ３ｂに磁
気ディスク装置１からの読み出したファイルを格納する
ことができることである．また、一つのデータ記憶装Ｍ１５をｍ台の磁気ディスク
装１ｉ　１　−　ｉ　−　１〜１−ｉ−ｍとマルチプレ
クサ１４によって構威し、集合型磁気ディスク装置の総
記憶容量を装置と、複数台の磁気ディスク装置の記憶容
量の（ｎＸｍ）倍にしている．動作を説明すると、まず
、実施例１と同様にデータベースの構造定義処理を行な
うが、入出力バッファ３にマルチプレクサ１４を介して
接続するｍ台の磁気ディスク装Ｎ１を識別する情報を構
造定義情報に追加する。データベースの構築は実施例１と同様に行なうが、いく
つかの相違点がある。実施例１との相違点は、ファイル
情報で与えられるファイルを構戒する磁気ディスク装置
の台数分に分割して、全磁気ディスク装置に分散して格
納することである。また，入出力バッフア３のデータを格納物理情報で与え
られるｍ台の内の装置と、複数台の磁気ディスク装置１
−ｉ−ｊにマルチプレクサ〜ｌ４を制御して格納するこ
とである．ファイルの分割方法としては、ファイルサイズを台数で
割った分割サイズを求め、ファイルの先頭から分割サイ
ズごとに装置と、複数台目の磁気ディスク装置１−１−
ｊから１−２−ｊ，１−３−ｊと順番に格納していくも
のと，ファイルの先頭から１バイトずつと言ったように
，決められたサイズごとに装置と、複数台目の磁気ディ
スク装置１−１−ｊから１−２−ｊ，１−３−ｊと順番
に格納していくものがある．ファイルサイズが磁気ディスク装置の台数で割り切れな
い場合は，ファイルサイズが磁気ディスクの倍数となる
ように無効データを末尾に付加して、常に装置と、複数
台目の磁気ディスク装ｉｌ１−１−ｊにファイルの先頭
がくるように格納する。次にファイルの読出しについて説明する。これも実施例
１と同様に行なうが、本構成では入出力バッファ３を２
面（３ａ及び３ｂ）持うているため，それぞれの磁気デ
ィスク装置ｌから入出力バッファ３に読出したファイル
を格納した時点で、次のファイルの読出しの処理を開始
することができる。ファイルの読み出し処理の時間的な関係は第上６図のよ
うになり、実施例１に比べると入出力バッファ３にデー
タを書き込んでも良い状態になるまでの待ち時間がなく
なり、より高速の転送が可能になる．実施例１と同じ条
件で上位機器７の要求する転送速度を満足する関係は，
装置と、複数台の磁気ディスク装ｉｌｌ　−　ｉ　−　
ｊから２面ある入出力バッファ３−ｉの一方の入出力バ
ッファ３ａ−ｉにファイルを読み出す時間（　ｓ　＋　
１　／　Ｒ　＋　Ｍ　／　ｔ　）が、もう一方の全ての
入出力バッファ３ｂ−１から３ｂ−ｎまでのファイルを
上位機器７に転送する時間（　ｎ　Ｍ．　／　Ｔ　）以
内であればよく、これを数式で表すとのようになり、この数式は容易に次式のように書き表す
ことができる。この条件により、実施例１と同様に上位機器が要求する
転送速度を満足するためのデータ記憶装置１５の台数を
求めることができる。また、大きな記憶容量が求められる場合には、データ記
憶装置１５をｍ台の磁気ディスク装置」−とマルチプレ
クサ１４によって構威し、記憶容量をｍ倍化することが
できる６これらのことから決定される最少台数の磁気ディスク装
置ｌで集合型磁気ディスク装置をａ或すれば、最もコス
トパフォーマンスの良いものとなる。第１６図の実施例では各磁気ディスク装置のシーク動作
の起動を上位機器への入出力バッファ３−１〜３−ｎの
データ転送が終了した時点で行なっているが、それぞれ
読み出しが終了した時点で行なっても良いことは明らか
である。以上の２つの実施例では磁気ディスク装置を用いた場合
について説明したが、磁気ディスク装置以外の光ディス
ク装置等の記憶媒体が回転する記憶装置についても同様
なことは明確である。【発明の効果】本発明によれば、上位機器が要求する高速の転送速度を
満足すると共に、格納するデータベースの容量に応じて
集合型磁気ディスク装置を最少台数の磁気ディスク装置
で構或することができ、ファイルのサイズにかかわらず
複数のファイルを連続的に高速に入出力できる、安価な
集合型磁気ディスクを提供できる効果がある。[Embodiment 1] Embodiment 1, which is an embodiment in which the present invention is applied to a character string search device, will be described below. FIG. 3 shows the configuration of a collective magnetic disk device using the present invention, which includes n data storage devices 15 each having a magnetic disk device 1, and one of the magnetic disk devices 1 connected to each of the data storage devices 15. It consists of an input/output buffer 3 having a capacity equivalent to a cylinder, and a multi-disk controller 4 that controls the data storage device 115 and the input/output buffer 3. Here, the data storage device W15 consists of a device and a plurality of magnetic disk drives 1, and the input/output buffer 3 consists of one memory surface of the magnetic disk drive 1 having a capacity of one cylinder. The multi-disk controller 4 includes a communication memory 5 that can directly set the file ID of the file to be accessed from the host device 7, a multiplex controller 8 that controls the high-speed data bus 10, and a physical storage destination of the magnetic disk device based on the file ID. It consists of a physical information table 6, which is a conversion table for obtaining information, and a mask controller 9, which controls them. The host device 7 consists of a host controller that gives commands to the collective magnetic disk device, and a character string search device that detects a specified character string from input data and outputs the detected information. Before constructing a database constituting data files in this collective magnetic disk device, a database structure definition process is performed. In this collective magnetic disk device, as a means of arranging logically related files so that their physical storage locations are close to each other, we first assign physical cylinders to logical classification IDs with a hierarchical structure.
It is allocated according to the following. When accessing multiple files at once, logically related files are often targeted. Therefore, by placing the storage locations close together, the distance that the magnetic head moves between the cylinders of the magnetic disk drive is shortened, and the seek time, which is a part of the access time, is shortened. In order to allocate physical cylinders according to logical classification IDs that have a hierarchical structure, the higher-level device 7 stores database structure definition information in the communication memory 5, which is composed of a set of logical classification IDs and storage capacity required by the file classification. After storing, a database structure definition command is issued to the multi-disk controller 4. Upon receiving the structure definition command, the mask controller 9 in the multi-disk controller 4 stores in the memory in the mask controller 9 how the physical position corresponds to the logical classification based on the structure definition information of the database set in the communication memory 5. A structure definition table having the structure shown in FIG. 4 above is created. Figure 4 is an example of a two-layer structure with two categories in each layer.The entire magnetic disk drive is combined into one magnetic disk drive, and the storage location for each category is determined by the cylinder position, and the storage capacity is determined by the cylinder position. It is shown in numbers. In addition, in the database structure definition process, the mask controller 9 in the multi-disk controller 4 stores the physical location of the storage destination of the file to be written for each logical classification on the memory in the mask controller 9 as shown in FIG. Create a storage location pointer table that indicates the physical location of the storage location of the file to be written, as shown. When the structure definition is completed, the storage position pointer table indicates the first cylinder, first track, first sector, and first position within the sector of each logical classification set in the structure definition. In FIG. 5, storage position pointer information when files are stored according to the classification shown in FIG. 4 is stored. Next, the construction of the database will be explained. This collective magnetic disk device creates management information using file IDs as a means of specifying files to be accessed using file IDs (consisting of a logical classification ID and a unique number within the logical classification). . After storing, in the communication memory 5, file information consisting of a plurality of sets of file IDs and file sizes of files to be written, the host device 7 issues a write command to the multi-disk controller 4. The multi-disk controller 4 that has received the write command executes processing according to the flow shown in FIG. The mask controller 9 in the multi-disk controller 4 reads the file ID in the file information from the communication memory 5, and reads the storage location where the file indicated by the file ID is to be stored from the storage location pointer table. Once the storage location is determined, the remaining capacity that can be written to that physical cylinder is determined. If the file size given by the file size of the file information is smaller than the remaining capacity, the file size is stored in the physical information table 6 whose entry is the file ID as shown in FIG. ! Write the (disk number, cylinder number, track number, sector number, position within the sector), file size, and number of disks. The number of disk spans indicates how many magnetic disk devices 1 the file spans, and the file being processed could not be written to one cylinder of the device or multiple magnetic disk devices. In this case, the file will be divided and the remaining files will be written to the next disc. If this file is an unwritten file after being divided, this value is counted up. Entries in the physical information table 6 are indicated by file IDs given in file information. After writing to the physical information table, advance the storage location pointer by the file size. If the file size and remaining capacity are equal, when the cylinders of l magnetic disk devices 1 are full, the write process to that magnetic disk device 1 is performed. If the file size is larger than the remaining capacity, compare the magnetic capacity and the division standard size. The division standard size is a value that is set in the structure definition process. Even though the remaining capacity of the cylinder is very small, if a file is stored across magnetic disk units 1, it will take two drives to read the file. The magnetic disk device 1 must be controlled, and the processing overhead increases. Therefore, a certain standard is set, and if the remaining capacity is smaller than the standard value, data is written from the beginning of the cylinder of the next magnetic disk device fll. If the remaining capacity is equal to or larger than the division reference size, the storage location and file size are stored in the physical information table 6, and then the files are divided into files that can be written in the remaining capacity and files that are left unwritten. The storage physical location and file size are written in the physical information table 6. The magnetic disk device 1 that has created the physical information that fills the l cylinder performs a write process. The unwritten file returns through the loop and becomes the next file to be processed. If the remaining capacity is smaller than the division standard size, advance the storage position pointer table to the beginning of the next cylinder, then return to the loop and continue processing with the file to be processed as the next file to be processed. At this time, the magnetic disk device that has created the physical information that fills one cylinder performs a write process. In the write process, the mask controller 9 issues a seek command to the magnetic disk device 1 and starts a seek operation.
Next, a file transfer request is issued to the higher-level device 7, and the mask controller 9 requests the higher-level device 7 to transfer the file, and also controls the multiplex controller 8 to switch the data path and process the transferred file. The file is transferred to the input/output buffer 3 specified by the physical information. When the seek operation is completed and the file transfer is completed, the master controller 9 issues a write command to the magnetic disk device 1, and the magnetic disk device 1 executes the write operation. The above operations are repeated to construct the database. FIG. 8 shows the temporal relationship of write processing, from the host device 7 to 1-1, 2-1, . . . as shown in the figure.
... H 1.1-2+ 2 2t... The data that is transferred one after another is sent to the input/output buffers 3-1.3-2, . . . by the multiplex controller 8 in the multi-disk controller 4. ...3-n,
3-1. 3-2, . . . At this time, for example, the magnetic disk device 1-1 transfers data 1-
1, the seek is started by a command from the mask controller 9. When the data transfer 1-1 is completed, the mask controller 9 transfers the data to the magnetic disk device 1-1.
Issue a write command to. After the magnetic disk device 1-1 waits for rotation until it reaches a designated write position, it begins writing data 1-1 in the input/output buffer 3-1 to a predetermined cylinder, track, or sector. During this time, other magnetic disk devices will also perform similar processing as shown in the figure. As is clear from Figure 8 and the above explanation,
Each magnetic disk device can write files in parallel and continuously, making it possible to build a database in a short time. Next, file read processing will be explained. In addition, when there are multiple files to be read on the same cylinder of the same magnetic disk device, the files that do not need to be read between the files to be read are also read into the input/output buffer, and the files that do not need to be read are saved when transferring to the host device. The method for deleting will be explained. After storing file information consisting of a plurality of file IDs of files to be read in the communication memory 5, the host device 7 issues a read command to the multi-disk controller 4. The multi-disk controller 4 that receives the read command executes processing according to the flow shown in FIG. The mask controller 9 in the multi-disk controller 4 determines the file ID of the file to be read first from the communication memory 5.
is read out, and the physical information table 6 is searched for the physical information in which the file is stored based on the file ID. Let this file be the destination file and the physical information be the physical information of the destination file. Next, the file ID of the file to be read next is read from the communication memory 5, and the file ID
The physical information table 6 is searched for the physical information in which the file is stored. After this file,
Set the physical information as the physical information of the subsequent file. Check whether the first file and the next file exist in the same cylinder from the obtained physical information, check whether they exist in the same cylinder, and if they exist in the same cylinder, there is no need to read unspecified files between the first file and the next file. Find out if there is a group of files, and if so, find the total size of the group of files. When the size of one file that does not need to be read is small, the physical information is combined so that the first file and the next file can be read with a single read command. Next, the process returns to the loop using the synthesized physical information as the physical information of the previous file, reads the next file ID from the communication memory 5, and performs similar processing using that file as the next file. If the destination file and subsequent file do not exist in the same cylinder, or if the size of the file that does not need to be read is large,
Executes the process of reading the destination file from the magnetic disk device. The physical information of the next file returns through the loop as the physical information of the previous file, and is transferred from the communication memory 5 to the next file I.
D is read and the same process is performed using it as a subsequent file. This operation is repeated until all specified files are read. The process of reading the destination file from the magnetic disk device is as follows:
First, the mask controller 9 issues a seek command to the magnetic disk controller 2-i of the magnetic disk device 1-i indicated by the physical information of the previous file to move the magnetic head to the physical position indicated by the physical information, and 1
- i starts a seek operation. When the seek operation is completed, if the input/output buffer 3-i is in a state where data can be written, the mask controller 9 issues a read command to the magnetic disk controller 2-i, and writes the input/output buffer 3-i to the magnetic disk device. Start storing the file read from 1-i. When the storage is completed, the mask controller 9 controls the multiplex controller 8 to start transferring data from the input/output buffer 3-i to the host device 7. As shown in FIG. 10, the multiplex controller 8 connects a multiplexer 201 that selects and connects the data paths of the input/output buffers 3-1 to 3-n to the data path of the host device 7, and the selected i-th input/output buffer. A DMA controller 202 that outputs data from 3-i to the host device 7 without the intervention of the mask controller 9, and a start address and an end address for specifying the transfer range of the input/output buffer 3-i are stored in the DMA controller 202. It consists of a start address registration table 203 and an end address registration table 204. The mask controller 9 sets the start address where the file to be transferred of the crowd buffer 3-i exists in the start address registration table 203 and the end address in the end address registration table 204, and then transfers the data from other input/output buffers 3 to the host device. If data has not been transferred to DMA controller 202, a start command is issued to DMA controller 202. DM
The A controller 202 has the start address registration table 20
3 and the end address registration table 204, only the data in the specified range is transferred at the transfer speed requested by the host device 7 without the intervention of the mask controller 9. When the physical information is combined and read into the input/output buffer 3-i so that the first file and the next file can be read with a single read command, the first address registration table 203 and the end address registration table 204 are required. Set multiple addresses so that all files are transferred, and perform the same process. In order to read the first file and the next file with a single read command, the process of combining physical information is performed when the following conditions are satisfied. The size of the first file is f 1 [Byte], the size of the next file is f 2 [Byte] + the total size of the file group that does not need to be read is k [Byte], and the seek operation from the magnetic disk device 1 to the input/output buffer 3 is The effective transfer rate does not include [Byte/se
e], and the rotational speed is R [rps]. When the average seek time is s [sec], the average rotation waiting time is (1/
2R), and the condition that the time to read at a time is shorter than the time to read one by one is as follows, and can be written as shown in the third equation. t k ≦ (3) The temporal relationship of the 2R file read processing is as follows:
The transfer rate required by
e], the transfer rate from each magnetic disk device 1 to the input/output buffer 3 is [Byte/sec]. If the minimum seek time of each magnetic disk device l is s [sec] and the rotation speed is R [rps], then the minimum seek time s [s]
ec] is longer than the time (M/T) for transferring the file on the i-th input/output buffer 3-i to the higher-level device 7, as shown in FIG. 11. In order to satisfy the transfer speed required by the host device 7, the time required for the i-th magnetic disk unit 11-i to read the file to the input/output buffer 3-i (s + 1 / R + M /
t) is within the time (nM/T) for transferring all the files on the input/output buffers 3 to the host device 7. Here, the seek time was set as the minimum seek time in order to read consecutive cylinders. Also,
Regardless of the position of the magnetic head at the time a read command is issued to the magnetic disk unit W1, the rotation waiting time is set to the maximum value (1/R) so as to satisfy the transfer speed required by the host device 7. did. This relationship can be expressed mathematically as shown in the first equation. Furthermore, the temporal relationship of file read processing when the minimum seek time 5 [sec] is less than the time (M/T) for transferring the file on the i-th input/output buffer 3-i to the host device 7 is as follows: The result is as shown in FIG. In this case, even after the seek operation is completed, the input/output buffer 3-i is still transferring the file to the host device 7, so a read command cannot be issued to the i-th magnetic disk device 1i1-i. ．． Therefore, when the transfer of the file in the input/output buffer 3-i to the host device 7 is completed, a read command is issued to the i-th magnetic disk device 1-i. Therefore, in order to satisfy the transfer speed required by the host device 7, the time required for the i-th magnetic disk device M1-i to read the file to the input/output buffer 3-i (M/T
+ 1 /R +M/t) is the time (nM/t) to transfer the files on all input/output buffers 3 to the host device
It is good if it is within T). This relationship can be expressed mathematically as shown in the second equation. It is possible to determine how many magnetic disk drives 1 should be combined in order to satisfy the tMR speed, and to construct a collective magnetic disk drive with the minimum number of magnetic disk drives 1 that satisfy the first formula. It is the most cost effective option. For example, the capacity of one truck is 20k (kg) [B
The magnetic disk device 1 is composed of 6 tracks of 120K [Bytel] and has a capacity of 1 cylinder, and the transfer rate required by the host device 7 is 2M (mega) [Byte/sec]. The effective transfer rate from each magnetic disk device 1 to the input/output buffer 3, excluding seek operations, is I M [Byte/se
c], the minimum seek time of each magnetic disk device 1 is 10 m
(millimeter) If the rotational speed is 50 [rps], the first equation becomes as follows, and the minimum n that satisfies this equation is 4. Figure 13 shows the time relationship during reading from a collective magnetic disk drive configured with three magnetic disk units W1, and Figure 14 shows the time relationship during readout from a collective magnetic disk drive consisting of four magnetic disk units W1. Time Relationship: FIG. 15 shows the time relationship during reading in a collective magnetic disk device composed of five magnetic disk devices 1. When the three magnetic disk devices 1 shown in FIG. A time a occurs during which data cannot be transferred from the input/output buffer 3 to the higher-level device 7 because the transfer time is not met, and the transfer speed from the input-output buffer 3 to the higher-level device 7 is approximately 1. 6 M[Byte/see], and the transfer speed required by the upper-level device cannot be satisfied. Furthermore, when configured with the five magnetic disk devices 1 shown in FIG. 15, although the transfer speed required by the host device 7 is satisfied, the four magnetic disk devices i! shown in FIG. Compared to the case where the main magnetic disk unit N1 does not perform processing, the time b is longer when the main magnetic disk unit N1 does not perform processing, and the usage efficiency of the magnetic disk unit is poor. Therefore, it can be said that the case where the four magnetic disk drives 1 that meet the minimum value n that satisfies the first equation are used is the most cost-effective collective magnetic disk drive. Embodiment 2, which is another embodiment in which the present invention is applied to a character string search device, will be explained using FIG. When reading only a specified file, the collective magnetic disk device described in Embodiment 1 can read only a specified file, as long as the specified file exists on average in magnetic disk devices W1-1 to 1-n. The host device 7
Eight data transfer speeds can be increased. However, if the specified file exists only in the device and the plurality of magnetic disk devices 1-i, reading from the device and the plurality of magnetic disk devices 1-i will be performed continuously. In this case, data transfer from the host device 78 must be performed in two stages, in which data is read from the magnetic disk device 1-i to the input/output buffer 3-i and then transferred from the input/output buffer 3-i to the host device 7. As a result, a situation occurs in which data transfer is degraded. In this way, if the designated files exist unevenly in the magnetic disk device 1, a situation may arise in which the data transfer speed of the host device 78 cannot be effectively increased. Therefore, in the second embodiment, by preventing files from being stored unbalanced, all the magnetic disk devices 1 are always operated for reading, and the data transfer speed to the host device 7 is increased. Furthermore, in this embodiment, the number of magnetic disk devices is increased in order to further increase the storage capacity. FIG. 1 shows the configuration of a collective magnetic disk device using the present invention, and the difference from FIG. 3 is that the magnetic disk device 1
It has two sides of input/output buffer 3 with the same capacity as one cylinder of
The file read from the magnetic disk device 1 can be stored in the input/output buffer 3b on the second side while the file is being transferred to the magnetic disk device 1. In addition, one data storage device M15 is configured by m magnetic disk devices 1i1-i-1 to 1-i-m and a multiplexer 14, and the total storage capacity of the collective magnetic disk device is divided by the devices and multiple devices. It is (nXm) times the storage capacity of the magnetic disk device. To explain the operation, first, similar to the first embodiment, the database structure definition process is performed, but information identifying the m magnetic disk drives N1 connected to the input/output buffer 3 via the multiplexer 14 is added to the structure definition information. to add. The database is constructed in the same manner as in Example 1, but there are some differences. The difference from the first embodiment is that the file given by the file information is divided into the number of magnetic disk drives to be monitored, and is distributed and stored in all the magnetic disk drives. In addition, the data of the input/output buffer 3 is stored in the m devices given by the physical information and the plurality of magnetic disk devices 1.
-i-j to control and store multiplexer ~l4. The file division method is to calculate the division size by dividing the file size by the number of devices, and from the beginning of the file to the device for each division size and to the multiple magnetic disk devices 1-1-
In some cases, data is stored in order from j to 1-2-j, then 1-3-j, and in other cases, data is stored in a device of a predetermined size, such as 1-2-j, 1-3-j, etc. from the beginning of the file. There are disk devices that store data in order from disk devices 1-1-j to 1-2-j and 1-3-j. If the file size is not divisible by the number of magnetic disk units, invalid data is added to the end so that the file size is a multiple of the number of magnetic disks. Store the file so that the beginning of the file is located at . Next, file reading will be explained. This is also done in the same way as in the first embodiment, but in this configuration, the input/output buffer 3 is
3a and 3b, it is possible to start the process of reading the next file at the time when the file read from each magnetic disk device l is stored in the input/output buffer 3. The time relationship of file read processing is as shown in Figure 6 above, and compared to the first embodiment, there is no waiting time until the state is ready for writing data to the input/output buffer 3, and faster transfer is possible. It becomes possible. The relationship that satisfies the transfer speed required by the host device 7 under the same conditions as in Example 1 is as follows:
device and multiple magnetic disk drives ill-i-
The time to read a file from j to one input/output buffer 3a-i of two input/output buffers 3-i (s +
1/R + M/t) is within the time (n M./T) for transferring files from all other input/output buffers 3b-1 to 3b-n to the host device 7, and this is expressed as a mathematical formula, and this mathematical formula can be easily written as the following formula. Based on this condition, it is possible to determine the number of data storage devices 15 to satisfy the transfer speed required by the host device, as in the first embodiment. Furthermore, when a large storage capacity is required, the data storage device 15 can be configured with m magnetic disk devices and the multiplexer 14, and the storage capacity can be multiplied by m. If a collective magnetic disk device is constructed using the minimum number of magnetic disk devices l, it will have the best cost performance. In the embodiment shown in FIG. 16, the seek operation of each magnetic disk device is started at the time when the data transfer from the input/output buffers 3-1 to 3-n to the host device is completed, but each time the read operation is completed. It is clear that you can do this with Although the above two embodiments have been described using a magnetic disk device, it is clear that the same applies to a storage device in which a storage medium rotates, such as an optical disk device other than a magnetic disk device. [Effects of the Invention] According to the present invention, it is possible to satisfy the high transfer speed required by host equipment, and to configure a collective magnetic disk device with a minimum number of magnetic disk devices according to the capacity of the database to be stored. This has the effect of providing an inexpensive collective magnetic disk that can input and output multiple files continuously at high speed regardless of the file size.

[Brief explanation of drawings]

第１図は本発明を文字列検索装置に適用した一実施例で
ある実施例２を示す構成図、第２図は文字列検索装置の
一例を示す構成図、第３図は本発明を文字列検索装置に
適用した一実施例である実施例１を示す構或図、第４図
は構造定義テーブルの構造を示す図、第５図は格納位置
ポインタテーブルの構造を示す図、第６図は物理情報テ
ーブルの構造を示す図、第７図は実施例１のファイルの
書き込み処理のフローチャート図、第８図は実施例１の
集合型磁気ディスク装置におけるファイルの書き込み処
理のタイムチャート図、第９図は実施例ｌのファイルの
読み出し処理のフローチャート図、第１０［ｉ１はマル
チプレクスコントローラの構威を示す図、第１１図は実
施例１の集合型磁気ディスク装置におけるファイルの読
み出し処理のタイムチャート図，第ｌ２＠は実施例ｌの
集合型磁気ディスク装置におけるファイルの読み出し処
理のタイムチャート図，第１３図は実施例工において３
台の磁気ディスク装置で構威した集合型磁気ディスク装
置におけるファイルの読出し処理のタイムチャート図、
第１４図は実施例１において４台の磁気ディスク装置で
構成した集合型磁気ディスク装置におけるファイルの読
出し処理のタイムチャート図、第１５図は実施例１にお
いて５台の磁気ディスク装置で構威した集合型磁気ディ
スク装置におけるファイルの読出し処理のタイムチャー
ト図、第１６図は実施例２の集合型磁気ディスク装置に
おけるファイルの読み出し処理のタイムチャート図であ
る。符号の説明１・・・磁気ディスク装置２・・・磁気ディスクコントローラ３・・・入出力バッファ４・・・マルチディスクコントローラ５・・・通信メモリ６・・・物理情報テーブル７・・・上位機器８・・・マルチプレクスコントローラ９・・・マスクコントローラ１０．１２・・・データバス１１．１３・・・制御バス１４・・・マルチプレクサ１５・・・データ記憶装置工６・・・集合型磁気ディスク装置１０１・・・文字列検索装置１０２・・・検索制御手段１０３・・・ディスク制御手段１０４・・・複合条件判別手段１０５・・・文字照合手段１０６・・・文字列記憶手段１０７〜１１１・・・制御手段２０１・・・マルチプレクサ２０２・・・ＤＭＡコントローラ２０３・・・先頭アドレス登録テーブル２０４・・・終
了アドレス登録テーブル１３図わー竿ζぎ不５ぺ１りイ立夏　〆イ〉７テー１１レ鼾圓１句賛Ｌ・ｊ嘴宅反デーフ゛１レ茅ど図５社 ’ｉ−ｖ−ｍｋｉ．＋３園算？φ園７−　ｔデ図FIG. 1 is a block diagram showing a second embodiment, which is an embodiment in which the present invention is applied to a character string search device. FIG. 2 is a block diagram showing an example of a string search device. FIG. FIG. 4 is a diagram showing the structure of a structure definition table, FIG. 5 is a diagram showing the structure of a storage position pointer table, and FIG. 7 is a flowchart of the file writing process in the first embodiment. FIG. 8 is a time chart of the file writing process in the collective magnetic disk device of the first embodiment. FIG. 9 is a flowchart of file read processing in Embodiment 1, 10 [i1 is a diagram showing the structure of the multiplex controller, and FIG. 11 is a time chart of file read processing in the collective magnetic disk device of Embodiment 1. Chart diagram, No. 12@ is a time chart diagram of file read processing in the collective magnetic disk device of Example I, and FIG.
A time chart diagram of file read processing in a collective magnetic disk device configured with one magnetic disk device,
FIG. 14 is a time chart of file read processing in a collective magnetic disk device configured with four magnetic disk devices in Example 1, and FIG. 15 is a time chart of file read processing in a collective magnetic disk device configured with five magnetic disk devices in Example 1. FIG. 16 is a time chart of file read processing in the collective magnetic disk device of the second embodiment. Explanation of symbols 1...Magnetic disk device 2...Magnetic disk controller 3...I/O buffer 4...Multi-disk controller 5...Communication memory 6...Physical information table 7...Upper device 8...Multiplex controller 9...Mask controller 10.12...Data bus 11.13...Control bus 14...Multiplexer 15...Data storage device 6...Collective magnetic disk Device 101...Character string search device 102...Search control means 103...Disk control means 104...Compound condition determination means 105...Character matching means 106...Character string storage means 107-111. ...Control means 201...Multiplexer 202...DMA controller 203...Start address registration table 204...End address registration table 13 11th level snoring round 1 poem praise L・j beak defense 1st level chido diagram 5 company'i-v-m ki. +3 garden calculation? φ garden 7-t de figure

Claims

[Claims] 1. A plurality of data storage devices each having a magnetic disk device, an input/output buffer for temporarily storing data to be input/output to the data storage device, and control of the data storage device and the input/output buffer. In a magnetic disk system that has a collective magnetic disk device consisting of a multi-disk controller that performs magnetic disk device 1
The capacity of the cylinder is M [Bytes], and the data transfer rate from the data storage device to the input/output buffer is t [B
yte/sec], the minimum seek time of the magnetic disk device is s [sec], the rotational speed of the magnetic disk device is R [rps], and the capacity of the input/output buffer is the capacity M of the magnetic disk device for the cylinder. [Byte], when the minimum seek time s [sec] of the magnetic disk device is the time (M
/T)[sec] when n≧T{1/t+1/M[s+(1/R)]} Also, the minimum seek time s[sec] of the magnetic disk device is longer than the data M[sec] of the input/output buffer. When the time (M/T) [sec] for transferring [Byte] to the above-mentioned host device is less than n≧1+T(1/t+1/MR)}. type magnetic disk device. 2. In the collective magnetic disk device according to claim 1,
A collective type characterized in that the data storage device is constituted by a plurality of magnetic disk devices each having a magnetic disk controller and a multiplexer that selects one of the plurality of magnetic disk devices. Magnetic disk device. 3. In the collective magnetic disk device according to claim 1,
2 of the above input/output buffers per data storage device.
While the data in the input/output buffer on the first side is being transferred to the host device, the data read from the data storage device is stored in the input/output buffer on the second side, and n≧T{1/t+1/M[s+(1/K)]} is satisfied when the seek time s[sec] is less than or equal to the time to transfer data M[Byte] of the input/output buffer to the higher-level device. A collective magnetic disk device characterized by having n. 4. In the collective magnetic disk device according to claim 1,
A collective magnetic disk device characterized in that the multi-disk controller includes communication memory means for connecting the host device and the multi-disk controller. 5. In the collective magnetic disk device according to claim 4,
A collective magnetic disk device comprising a semiconductor storage element as the communication memory means. 6. The collective magnetic disk device according to claim 1,
A physical controller that interprets a file ID consisting of a logical classification, which is a unique identification code of a logical classification transferred from a higher-level device, and a number unique to a file within the logical classification, and makes it correspond to the physical location of the magnetic disk device. A collective magnetic disk device comprising information table means in the multi-disk controller. 7. The collective magnetic disk device according to claim 6,
A collective magnetic disk device comprising a semiconductor storage element as the physical information table means. 8. The collective magnetic disk device according to claim 1,
When there are multiple files to be read on the same cylinder of the above magnetic disk device, the total capacity of the group of files that will not be read between the first file to be read and the next file to be read is set in [Bytes], and the total capacity of the files to be read from the above data storage device is If the transfer rate to the input/output buffer is [Byte/sec] and the rotation speed of the magnetic disk device is R [rps], then if k≦t/2R is satisfied [Byte], read first. Files that do not need to be read between the file and the next file to be read are also read into the input/output buffer, and when transferred from the input/output buffer to the host device, the unnecessary files are removed. A collective magnetic disk device characterized by having a multi-disk controller having means for transferring data.