JP3515581B2

JP3515581B2 - Data search method

Info

Publication number: JP3515581B2
Application number: JP06076192A
Authority: JP
Inventors: 英治石坂
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1992-03-18
Filing date: 1992-03-18
Publication date: 2004-04-05
Anticipated expiration: 2019-04-05
Also published as: JPH05266078A

Description

【発明の詳細な説明】【０００１】【産業上の利用分野】本発明は、外部記憶装置に設けら
れたファイルに格納されているデータを検索する情報処
理システムにおけるデータ検索方式に関する。【０００２】【従来の技術】図７は従来ある情報処理システムの一例
を示す図であり、図８は図７における順次検索用ファイ
ル構造の一例を示す図であり、図９は図７における順次
検索処理の一例を示す図であり、図10は図７における二
進検索用ファイル構造の一例を示す図であり、図11は図
７における二進検索処理の一例を示す図であり、図12は
図７におけるハッシュ検索用ファイル構造の一例を示す
図であり、図13は図７におけるハッシュ検索処理の一例
を示す図である。【０００３】図７において、情報処理システム１０は、
処理装置（ＣＰＵ）１、記憶装置（ＭＭ）２、磁気ディ
スク装置（ＤＫ）３、磁気ディスク制御装置（ＤＫＣ）
５、入出力装置（ＩＯ）６および入出力制御装置（ＩＯ
Ｃ）７を具備しており、処理装置（ＣＰＵ）１は、例え
ば入出力装置（ＩＯ）６から入力されるデータ検索要求
に従い、磁気ディスク装置（ＤＫ）３に構築されている
ファイル４から指定されたデータを検索し、また例えば
入出力装置（ＩＯ）６から入力されるデータ登録要求に
従い、指定されたデータをファイル４に登録する。【０００４】従来ある情報処理システム１０において
は、ファイル４内のデータ検索処理として、順次検索処
理、二進検索処理およびハッシュ検索処理を採用してい
た。最初に、従来ある順次検索処理の一例を、図８およ
び図９により説明する。【０００５】図８に示されるファイル４には、各データ
Ａ（個々のデータをＡ_i、但しｉは登録順を示す）が登
録された順に配列されており、データＡ₁が最も早く登
録されたデータであり、データＡ_nが最後に登録された
データである。【０００６】またファイル４の先頭位置には、現在ファ
イル４に登録済のデータ数ｎが格納されている。かかる
状態で、入出力装置（ＩＯ）６から任意のデータＡ_Xの
検索要求が入力されると、処理装置（ＣＰＵ）１はファ
イル４に登録済のデータＡ_iを登録順に抽出しては入力
された検索対象データＡ_Xと比較し（図９ステップＳ９
１乃至Ｓ９３）、検索対象データＡ_Xと一致するデータ
Ａ_iが検出される迄、抽出および比較を繰返し（ステッ
プＳ９５）、一致するデータＡ_iが検出されると（ステ
ップＳ９３）、検出したデータＡ_iに対応する所要の情
報Ｄ_iを参照し（ステップＳ９６）、総てのデータＡ_i
が不一致であれば（ステップＳ９４）、検索対象データ
Ａ_Xは未登録と判定する（ステップＳ９７）。【０００７】また新たなデータＡ_Yをファイル４に登録
する場合には、ファイル４の先頭に格納されているデー
タ数ｎを参照することにより、最後のデータＡ_nが登録
されているｎ番目の領域を識別し、登録対象データＡ_Y
を次の（ｎ＋１）番目の領域にデータＡ_n+1として格納
すると共に、データ数ｎを（ｎ＋１）に更新する。【０００８】以上の説明から明らかな如く、従来ある順
次検索処理においては、新たなデータＡ_Yの登録処理
は、常にデータ数ｎを参照してデータＡ_Yの登録領域を
認識し、認識した登録領域にデータＡ_Yを格納する為、
登録時間が一定、且つ短時間で済むが、データＡ_Xの検
索処理は、検索対象データＡ_Xと一致するデータＡ_iが
検出される迄、第一領域から順次検索する為、検索時間
を決定するファイル４のアクセス回数は検索対象データ
Ａ_Xの登録順により変化し、且つ平均アクセス回数もデ
ータ数ｎに比例して長くなる。【０００９】なお処理装置（ＣＰＵ）１は、ファイル４
から一回のアクセスで予め定められた数のデータを抽出
または格納するが、アクセス回数がデータ数ｎに比例す
る傾向は変わらない。【００１０】次に、従来ある二進検索処理の一例を、図
10および図11により説明する。図10に示されるファイル
４には、各データＢ（個々のデータをＢ_j、但しｊは昇
順を示す）が一定の大小順（昇順または降順）に分類さ
れており、例えばデータＢが昇順に分類されているとす
ると、データＢ₁が現在登録済の最小データＢであり、
データＢ_nが現在登録済の最大データＢである。【００１１】またファイル４の先頭位置には、現在ファ
イル４に登録済のデータ数ｎが格納されている。かかる
状態で、入出力装置（ＩＯ）６から任意のデータＢ_Xの
検索要求が入力されると、処理装置（ＣＰＵ）１は先ず
データ数ｎを参照し、ファイル４に登録済の全データＢ
の中間値Ｂ_m（例えばデータ数ｎ＝７の場合はデータＢ
₄、またデータ数ｎ＝８の場合はデータＢ₄等）を抽出
して検索対象データＢ_Xと比較し（図11ステップＳ１１
１およびＳ１１２）、検索対象データＢ_Xが中間値Ｂ_m
より小さければ、中間値Ｂ_mにより二分された下半領域
を対象として更に中間値Ｂ _mを抽出して検索対象データ
Ｂ_Xと比較し（ステップＳ１１３およびＳ１１２）、ま
た検索対象データＢ_Xが中間値Ｂ_mより大きければ、中
間値Ｂ_mにより二分された上半領域を対象として更に中
間値Ｂ_mを抽出して検索対象データＢ_Xと比較し（ステ
ップＳ１１４およびＳ１１２）、検索対象データＢ_Xと
一致する中間値Ｂ_mが抽出される迄、以上の過程を繰返
し、検索対象データＢ_Xと一致する中間値Ｂ_mが抽出さ
れると（ステップＳ１１２）、抽出した中間値Ｂ_mに対
応する所要の情報Ｄ_iを参照する（ステップＳ１１
５）。【００１２】また新たなデータＢ_Yをファイル４に登録
する場合には、前述の検索処理と同様の過程で登録対象
データＢ_Yの登録領域を決定し、決定した登録領域以降
に格納されている総てのデータＢを一データＢ宛後へ移
動させて決定した登録領域を空き領域とし、登録対象デ
ータＢ_Yを登録領域に格納すると共に、データ数ｎを
（ｎ＋１）に更新する。【００１３】以上の説明から明らかな如く、従来ある二
進検索処理においては、データＢ_Xの検索処理は、登録
済の全データＢを二分し乍ら検索を行う為、検索時間を
決定するファイル４へのアクセス回数も登録順および昇
順に拘らず略一定となり、またデータ数ｎの増加に直接
比例して増加することも無く、比較的平均した検索時間
が得られるが、新たなデータＢ_Yの登録処理は、検索処
理と同様の過程で登録領域を決定した後、登録領域以降
に格納済の総てのデータＢを一データ宛後へ移動させる
必要があり、登録領域が先頭に近い程、登録時間が増大
する。【００１４】次に、従来あるハッシュ検索処理の一例
を、図12および図13を用いて説明する。図12に示される
ファイル４には、ハッシュ値検索領域Ａ_hと、データ領
域Ａ_chと、チェーンデータ領域Ａ_chxとが設けられてい
る。【００１５】ハッシュ値検索領域Ａ_hは、検索対象デー
タＣ（個々のデータをＣ_jと称する）を予め定められた
ハッシュ関数に代入することにより、対応するハッシュ
値ｈを求める領域あり、例えば各検索対象データＣ
_jと、求められたハッシュ値ｈ_jとを対応させて索引表
を構成することが考慮される。【００１６】またデータ領域Ａ_chは、各データＣを対応
するハッシュ値ｈにより指定される領域に格納した領域
であり、ハッシュ値ｈ_jに対応するデータをＣ_hjと表
す。またチェーンデータ領域Ａ_chxは、例えばデータ配
列の長さに対してデータ数が多い場合等に、複数データ
から求められるハッシュ値が同一となる確率が高くな
り、複数のデータＣから同一ハッシュ値ｈが求められた
場合に、ハッシュ値ｈに対応する複数のデータＣを連繋
して格納する領域であり、データ領域Ａ_ch内には、この
種のハッシュ値ｈ_jに対応する複数のデータＣ_hj1、Ｃ
_hj2、…の連繋の格納領域を示すポインタＰ_chjが格納
されている。【００１７】かかる状態で、入出力装置（ＩＯ）６から
任意のデータＣ_Xの検索要求が入力されると、処理装置
（ＣＰＵ）１は先ずファイル４のハッシュ値検索領域Ａ
_hを検索して検索対象データＣ_Xのハッシュ値ｈ_Xを求
め（図13ステップＳ１３１）、次にデータ領域Ａ_chの、
ハッシュ値ｈ_Xにより指定される領域を参照し（ステッ
プＳ１３２）、該領域に唯一個のデータＣ_hXが格納され
ている場合には（ステップＳ１３３）、抽出したデータ
Ｃ_hXに対応する所要の情報Ｄ_Xを参照する（ステップＳ
１３６）。【００１８】一方、データ領域Ａ_chの、ハッシュ値ｈ_X
により指定される領域に、ポインタＰ_chXが格納されて
いる場合には（ステップＳ１３３）、更にチェーンデー
タ領域Ａ_chxを参照し、ポインタＰ_chXにより指定され
る領域に格納されている複数のデータＣ_hXを一個宛抽出
して検索対象データＣ_Xと比較し（ステップＳ１３４お
よびＳ１３５）、一致したデータＣ_hXが検出された場合
に、抽出したデータＣ _hXに対応する所要の情報Ｄ_Xを参
照する（ステップＳ１３６）。【００１９】また新たなデータＣ_Yをファイル４に登録
する場合には、前述の検索処理と同様の過程で登録対象
データＣ_Yに対応するハッシュ値ｈ_Yを求め、データ領
域Ａ _ch内のハッシュ値ｈ_Yに対応する領域に、登録対象
データＣ_Yを格納する。なおハッシュ値ｈ_Yが他の登録
済データＣ_hjと一致する場合には、データ領域Ａ_chには
ポインタＰ_chを格納し、また登録対象データＣ_Yはチェ
ーンデータ領域Ａ_chxに格納する。【００２０】以上の説明から明らかな如く、従来あるハ
ッシュ検索処理においては、データＣ_Xの検索処理は、
ハッシュ値ｈを求めて検索領域を決定する為、検索時間
を決定するファイル４へのアクセス回数も少なくなり、
検索時間も短縮されるが、複数データから求められるハ
ッシュ値が同一となる場合に、検索回数が増大し、検索
時間が長くなる恐れがある。また新たなデータＣ_Yを登
録する場合も、同様の問題がある。【００２１】【発明が解決しようとする課題】以上の説明から明らか
な如く、従来ある情報処理システムにおいては、順次検
索を用いる場合には、新データの登録時間は一律に短時
間で済むが、検索時間はデータ数および登録順により大
きく変動し、また二進検索を用いる場合には、検索時間
は略均一に短時間で済むが、新データの登録時間は大小
順により大きく変動し、更にハッシュ検索を用いる場合
には、データの構成により検索時間および登録時間が長
くなる可能性があり、検索時間および登録時間が共に短
時間で済むには不充分であった。【００２２】本発明は、データ数および構成に拘らず検
索時間および登録時間を短縮可能とすることを目的とす
る。【００２３】【課題を解決するための手段】図１は本発明の原理を示
す図である。図１において、１００は本発明の対象とな
る情報処理システム、２００は情報処理システム１００
に設けられた検索の対象となるファイルである。【００２４】２１０は、本発明によりファイル２００内
に複数設けられた複数のデータ領域である。２２０は、
本発明によりファイル２００内に各データ領域２１０に
対応して設けられた予備領域である。【００２５】２０１は、本発明によりファイル２００内
に各データ領域２１０および予備領域２２０に共通に設
けられた先頭データ領域である。３００は、本発明によ
り情報処理システム１００に設けられた検索手段であ
る。【００２６】４００は、本発明により情報処理システム
１００に設けられた登録手段である。５００は、本発明
により情報処理システム１００に設けられた再編集手段
である。【００２７】【作用】各データ領域２１０は、予め定められた数のデ
ータ格納容量を有し、互いに大小順に分類された複数の
データをそれぞれ格納する。【００２８】予備領域２２０は、データ領域２１０と同
一のデータ格納容量を有し、データ領域２１０の格納領
域が不足した場合に、該データ領域２１０に対応する予
備領域２２０に、データ領域２１０および予備領域２２
０を通して所定の大小順に分類して格納する。【００２９】先頭データ領域２０１は、各データ領域２
１０に格納されているデータの大小順配列上で先頭に位
置する先頭データのみを格納し、大小順に配列してい
る。検索手段３００は、所要のデータの検索要求が情報
処理システム１００に入力された場合に、先頭データ領
域２０１を検索し、検索対象データが格納されているデ
ータ領域２１０および予備領域２２０の対を検出し、検
出したデータ領域２１０または予備領域２２０から検索
対象データを検索する。【００３０】登録手段４００は、所要のデータの登録要
求が前記情報処理システム１００に入力された場合に、
先頭データ領域２０１を検索し、登録対象データを格納
すべきデータ領域２１０および予備領域２２０の対を検
出し、検出したデータ領域２１０または予備領域２２０
内に大小順に従って格納する。【００３１】再編集手段５００は、任意のデータ領域２
１０および予備領域２２０の対に更にデータを追加登録
する余裕が無くなった場合に、余裕の無くなったデータ
領域２１０および予備領域２２０の対の予備領域２２０
に格納されているデータを、新たなデータ領域２１０お
よび予備領域２２０の対のデータ領域２１０に移動した
後、新たなデータ領域２１０および予備領域２２０の対
を含めた総てのデータ領域２１０および予備領域２２０
の対に格納されているデータを大小順に再編集し、且つ
先頭データ領域２０１に新たなデータ領域２１０の先頭
データを追加して大小順に分類する。【００３２】なおデータ領域２１０および予備領域２２
０は、情報処理システム１００がファイル２００に対
し、一回のアクセスで格納および抽出可能なデータ数を
格納し得る如く格納容量を定めることが考慮される。【００３３】また検索手段３００、登録手段４００およ
び再編集手段５００は、各データ領域２１０、予備領域
２２０および先頭データ領域２０１を検索する際に、二
進検索方法により検索することが考慮される。【００３４】従って、データの検索範囲が一対のデータ
領域および予備領域に限定される為、検索時間が大幅に
短縮され、また新たなデータを登録する為の空き領域を
生成する為に移動させるデータ数も限定される為、登録
時間も大幅に短縮されることとなり、当該情報処理シス
テムのデータ検索および登録効率が大幅に向上する。【００３５】【実施例】以下、本発明の一実施例を図面により説明す
る。図２は本発明の一実施例による情報処理システムを
示す図であり、図３は図２におけるファイル構造の一例
を示す図であり、図４は図２におけるデータ検索処理の
一例を示す図であり、図５は図２における新データ登録
処理の一例を示す図であり、図６は図２におけるファイ
ル再編集処理の一例を示す図である。なお、全図を通じ
て同一符号は同一対象物を示す。【００３６】図２においては、図１における情報処理シ
ステム１００として、図７に示される情報処理システム
と同様の構成を有する情報処理システム１０が示され、
また図１におけるファイル２００として、図３に示され
る如きファイル４０が磁気ディスク装置（ＤＫ）３内に
設けられ、また図１における検索手段３００として、図
４に示される如きデータ検索処理を実行する検索部１１
が処理装置（ＣＰＵ）１内に設けられ、また図１におけ
る登録手段４００として、図５に示される如き新データ
登録処理を実行する登録部１２が処理装置（ＣＰＵ）１
内に設けられ、更に図１における再編集手段５００とし
て、図６に示される如きファイル再編集処理を実行する
再編集部１３が処理装置（ＣＰＵ）１内に設けられてい
る。【００３７】図３に示されるファイル４０には、予め定
められたデータ格納容量Ｎを有するｋ個の基本データ領
域Ａ_B（個々の基本データ領域をＡ_Bfと称する、但しｆ
は１乃至ｋ）が設けられ、また各基本データ領域Ａ_Bに
対応して、基本データ領域Ａ _Bと同一のデータ格納容量
Ｎを有する予備領域Ａ_Sが一個宛設けられ、また各基本
データ領域Ａ_Bおよび予備領域Ａ_Sに共通に、先頭デー
タ領域Ａ_Hおよび最終データ格納領域Ａ_Tがそれぞれ一
個宛設けられている。【００３８】なお各基本データ領域Ａ_Ｂおよび予備領域
Ａ_Ｓのデータ格納容量Ｎは、処理装置（ＣＰＵ）１がフ
ァイル４０に一回アクセスする際に、格納および抽出可
能なデータ数に等しく定められているものとする。【００３９】ファイル４０に格納される総てのデータＥ
は昇順に分類された後、ｋ組以下のデータ群Ｇに区分さ
れ、それぞれ基本データ領域Ａ_Bに格納され、基本デー
タ領域Ａ_Bに格納し切れなかった場合には、対応する予
備領域Ａ_Sに格納する。【００４０】従って、各データ群Ｇ₁、Ｇ₂、…、Ｇ_k
から抽出された任意のデータは、常に昇順に配列され
る。また各データ群Ｇに含まれる各データＥは、それぞ
れ昇順に分類され、それぞれ対応する基本データ領域Ａ
_Bに、また必要により予備領域Ａ_Sにも格納される。な
おデータ群Ｇの内、基本データ領域Ａ_Bに格納される部
分を基本データ群Ｇ _Bと称し、予備領域Ａ_Sに格納され
る部分を予備データ群Ｇ_Sと称する。【００４１】また各基本データ領域Ａ_Bには、各基本デ
ータ群Ｇ_Bに含まれるデータ数ｎ_Bが格納され、各予備
領域Ａ_Sには、各予備データ群Ｇ_Sに含まれるデータ数
ｎ_Sが格納されている。【００４２】一方先頭データ領域Ａ_Hには、各基本デー
タ領域Ａ_Bfに格納されている各基本データ群Ｇ_Bfの中
で、昇順に先頭に分類される各先頭データＥ_BfH（即ち
各基本データ群Ｇ_B内で最小のデータ）が格納され、互
いに昇順に配列されている。【００４３】更に最終データ格納領域Ａ_Tには、ファイ
ル４０に格納される総てのデータＥを昇順に配列した場
合に、最終に分類される最終データＥ_T（即ち最大のデ
ータ）が格納されている。【００４４】最初に、本発明によるファイル検索処理
を、図２乃至図４により説明する。図２乃至図４におい
て、入出力装置（ＩＯ）６から任意のデータＥ_Xの検索
要求が入力されると、処理装置（ＣＰＵ）１は検索部１
１を起動する。【００４５】起動された検索部１１は、先ずファイル４
０の先頭データ領域Ａ_Hにアスセスし、条件Ｅ_BfH≦Ｅ
_X＜Ｅ_B(f+1)Hが成立する先頭データＥ_BfHを検索する
ことにより、検索対象データＥ_Xが格納されている可能
性の有る基本データ領域Ａ_Bfおよび予備領域Ａ_Sfを決定
する（図４ステップＳ４１）。【００４６】次に検索部１１は、決定した予備領域Ａ_Sf
にアクセスし、格納されている予備データ群Ｇ_Sfの先頭
データＥ_SfHを抽出し、検索対象データＥ_Xと大小比較
する（ステップＳ４２）。【００４７】比較の結果、条件Ｅ_X≧Ｅ_SfHが成立すれ
ば、検索対象データＥ_Xは予備領域Ａ_Sfに格納されてい
ると判定し（ステップＳ４３）、予備データ群Ｇ_Sfから
二進検索により検索対象データＥ_Xを抽出し（ステップ
Ｓ４４）、抽出した検索対象データＥ_Xに対応する所要
の情報Ｄ_Xを参照する（ステップＳ４６）。【００４８】また比較の結果、条件Ｅ_X＜Ｅ_SfHが成立
すれば、検索対象データＥ_Xは基本データ領域Ａ_Sfに格
納されていると判定し（ステップＳ４３）、基本データ
領域Ａ_Bfにアクセスして基本データ群Ｇ_Bfを抽出し、抽
出した基本データ群Ｇ_Bfから二進検索により検索対象デ
ータＥ_Xを抽出し（ステップＳ４５）、抽出した検索対
象データＥ_Xに対応する所要の情報Ｄ_Xを参照する（ス
テップＳ４６）。【００４９】以上の説明から明らかな如く、本実施例に
よれば、任意のデータＥ_Xを検索する検索部１１は、最
初にファイル４０の先頭データ領域Ａ_Hにアクセスして
検索対象データＥ_Xが格納されている基本データ領域Ａ
_Bfおよび予備領域Ａ_Sfを決定し、続いて予備領域Ａ_Sfに
アクセスして検索対象データＥ_Xが基本データ領域Ａ _Bf
および予備領域Ａ_Sfの何れに格納されているかを判定
し、予備領域Ａ_Sfに格納されていると判定された場合に
は直ちに予備データ群Ｇ_Sfから検索対象データＥ _Xを抽
出し、基本データ領域Ａ_Bfに格納されていると判定され
た場合には基本データ領域Ａ_Bfにアクセスして基本デー
タ群Ｇ_Bfから検索対象データＥ_Xを抽出する為、ファイ
ル４０に対するアクセス回数は二回または三回で済み、
検索時間も短縮される。【００５０】次に、本発明による新データ登録処理を、
図２、図３および図５により説明する。図２、図３およ
び図５において、入出力装置（ＩＯ）６から新たなデー
タＥ_Yの登録要求が入力されると、処理装置（ＣＰＵ）
１は登録部１２を起動する。【００５１】起動された登録部１２は、先ずファイル４
０の最終データ格納領域Ａ_Tにアスセスして格納されて
いる最終データＥ_Tを抽出し、登録対象データＥ_Yと大
小比較する（図５ステップＳ５１）。【００５２】比較の結果、条件Ｅ_Y＜Ｅ_Tが成立する場
合には（ステップＳ５２）、登録部１２は次に先頭デー
タ領域Ａ_Hにアクセスして各先頭データＥ_BfHを抽出
し、条件Ｅ_B1H＞Ｅ_Yが成立する場合には登録対象デー
タＥ_Yを格納する基本データ領域Ａ_B1を決定し、また条
件Ｅ_BfH≦Ｅ_Y＜Ｅ_B(f+1)Hが成立する先頭データＥ_Bf
_Hを検索した場合には、登録対象データＥ_Yを格納する
基本データ領域Ａ_Bfおよび予備領域Ａ_Sfを決定する（ス
テップＳ５３）。【００５３】次に登録部１２は、決定した予備領域Ａ_Sf
にアクセスし、格納されている予備データ群Ｇ_Sfの先頭
データＥ_SfHを抽出して検索対象データＥ_Xと大小比較
し、比較の結果、条件Ｅ_Y＞Ｅ_SfHが成立すれば（ステ
ップＳ５４）、検索対象データＥ_Xは予備領域Ａ_Sfに格
納すべきと判定し、予備領域Ａ_Sfから抽出済のデータ数
ｎ_Sfおよび予備データ群Ｇ_Sfを二進検索により登録対象
データＥ_Yを格納する登録領域を決定し（ステップＳ５
５）、登録領域以降に格納済の総ての登録済データＥ_Sf
を、一データＥ分宛後に移動して登録領域を空けた後、
登録対象データＥ_Yを登録領域に格納すると共に（ステ
ップＳ５６）、データ数ｎ_Sfを（ｎ_Sf＋１）に更新し
（ステップＳ５７）、更新済のデータ数ｎ_Sfおよび予備
データ群Ｇ _Sfを予備領域Ａ_Sfに格納する。【００５４】また登録部１２は、更新済のデータ数ｎ_Sf
を予備領域Ａ_Sfのデータ格納容量Ｎと比較することによ
り、予備領域Ａ_Sfに更に新たなデータＥ_Yを登録する余
裕が存在するか否かを判定し（ステップＳ５８）、デー
タ数ｎ_Sfがデータ格納容量Ｎ未満で余裕が存在すれば特
に再編集処理を実行すること無く、登録処理を終了する
が、データ数ｎ_Sfがデータ格納容量Ｎと等しくて余裕が
存在しなければ、再編集部１３を起動し、図６に示され
る如き再編集処理を実行させる（ステップＳ５９）。【００５５】一方、登録部１２が予備領域Ａ_Sfから抽出
した先頭データＥ_SfHと登録対象データＥ_Yとを比較し
た結果、条件Ｅ_Y＜Ｅ_SfHが成立すれば（ステップＳ５
４）、検索対象データＥ_Xは基本データ領域Ａ_Bfに格納
すべきと判定し、基本データ領域Ａ_Bfにアクセスしてデ
ータ数ｎ_Sfおよび基本データ群Ｇ_Bfを抽出し、二進検索
により登録対象データＥ_Yを格納する登録領域を決定す
る（ステップＳ５１０）。【００５６】なお条件Ｅ_B1H＞Ｅ_Yが成立した場合には
（ステップＳ５１１）、登録対象データＥ_Yは基本デー
タ群Ｇ_B1の先頭データＥ_B1Hとして、基本データ領域Ａ
_B1の先頭領域に格納される為、登録対象データＥ_Yの登
録領域を決定した後、更に先頭データ領域Ａ_Hにアクセ
スし、先頭データＥ_B1Hを登録対象データＥ_Yに更新す
る（ステップＳ５１２）。【００５７】登録対象データＥ_Yの登録領域が決定され
ると、登録部１２は登録領域以降に格納済の総ての登録
済データＥ_Bfを、一データＥ分宛後に移動して登録領域
を空けた後、登録対象データＥ_Yを登録領域に格納する
（ステップＳ５１３）。【００５８】なお登録部１２は、登録済データＥ_Bfを一
データＥ分宛後に移動させた結果、登録済データＥ_Bfが
基本データ領域Ａ_Bfから溢れたか否かを確認し（ステッ
プＳ５１４）、データ数ｎ_Bfがデータ格納容量Ｎ未満で
溢れた登録済データＥ_Bfが出なければ、抽出済のデータ
数ｎ_Bfを（ｎ_Bf＋１）に更新した後、更新済のデータ数
ｎ_Bfおよび基本データ群Ｇ_Bfを基本データ領域Ａ_Bfに格
納して登録処理を終了するが、データ数ｎ_Bfがデータ格
納容量Ｎに等しく、溢れた登録済データＥ_Bfが出た場合
には、更に予備領域Ａ_Sfにアクセスして格納済のデータ
数ｎ_Sfおよび予備データ群Ｇ_Sfを抽出し、予備データ群
Ｇ_Sfに格納済の総ての登録済データＥ_Sfを、一データＥ
分宛後に移動して先頭領域を空けた後、基本データ領域
Ａ_Bfから溢れたデータＥ_Bfを先頭領域に格納すると共に
（ステップＳ５１６）、データ数ｎ_Sfを（ｎ_Sf＋１）に
更新し（ステップＳ５７）、更新済のデータ数ｎ_Sfおよ
び予備データ群Ｇ_Sfを予備領域Ａ_Sfに格納する。【００５９】また登録部１２は、前述と同様に、更新済
のデータ数ｎ_Sfを予備領域Ａ_Sfのデータ格納容量Ｎと比
較することにより、予備領域Ａ_Sfに更に新たなデータＥ
_Yを登録する余裕が存在するか否かを判定し（ステップ
Ｓ５８）、データ数ｎ_Sfがデータ格納容量Ｎ未満で余裕
が存在すれば、特に再編集処理を実行すること無く登録
処理を終了するが、データ数ｎ_Sfがデータ格納容量Ｎと
等しくて余裕が存在しなければ、再編集部１３を起動
し、図６に示される如き再編集処理を実行させる（ステ
ップＳ５９）。【００６０】一方、登録部１２が最終データ格納領域Ａ
_Tから抽出した最終データＥ_Tと登録対象データＥ_Yと
を大小比較した結果（ステップＳ５１）、条件Ｅ_Y＞Ｅ
_Tが成立する場合には（ステップＳ５２）、登録部１２
は先頭データ領域Ａ_Hにアクセスして最終先頭データＥ
_BfHを抽出し、最終データＥ_Tが格納されている基本デ
ータ領域Ａ_Bfおよび予備領域Ａ_Sfを求め（ステップＳ５
１７）、登録対象データＥ_Yを格納する基本データ領域
Ａ_Bfおよび予備領域Ａ_Sfを決定する。【００６１】次に、登録部１２は基本データ領域Ａ_Bfに
アクセスし、格納されているデータ数ｎ_Bfおよび基本デ
ータ群Ｇ_Bfを抽出し、データ数ｎ_Bfとデータ格納容量Ｎ
とを比較することにより、基本データ領域Ａ_Bfに登録対
象データＥ_Yを登録し得る余裕が存在するか否かを分析
し（ステップＳ５１８）、データ数ｎ_Bfがデータ格納容
量Ｎ未満で余裕が存在することが確認された場合には、
登録対象データＥ_Yを基本データ群Ｇ_Bfの末尾に登録す
ると共に（Ｓ５１９）、データ数ｎ_Bfを（ｎ_Bf＋１）に
更新した後（ステップＳ５２０）、更新済のデータ数ｎ
_Bfおよび基本データ群Ｇ_Bfを基本データ領域Ａ_Bfに格納
して登録処理を終了するが、データ数ｎ _Bfがデータ格納
容量Ｎと等しくて基本データ領域Ａ_Bfに登録対象データ
Ｅ_Yを登録し得る余裕が存在しないことを確認した場合
には（ステップＳ５１８）、次に対応する予備領域Ａ_Sf
にアクセスし、格納されているデータ数ｎ_Sfおよび予備
データ群Ｇ_Sfを抽出し、登録対象データＥ_Yを予備デー
タ群Ｇ_Sfの末尾に登録すると共に（Ｓ５２１）、データ
数ｎ_Sfを（ｎ_Sf＋１）に更新した後（ステップＳ５
７）、更新済のデータ数ｎ_Sfおよび予備データ群Ｇ_Sfを
予備領域Ａ_Sfに格納する。【００６２】また登録部１２は、更新済のデータ数ｎ_Sf
を予備領域Ａ_Sfの格納容量と比較することにより、予備
領域Ａ_Sfに更に新たなデータＥ_Yを登録する余裕が存在
するか否かを判定し（ステップＳ５８）、データ数ｎ_Sf
がデータ格納容量Ｎ未満で余裕が存在すれば特に再編集
処理を実行すること無く、登録処理を終了するが、デー
タ数ｎ_Sfがデータ格納容量Ｎと等しくて余裕が存在しな
ければ、再編集部１３を起動し、図６に示される如き再
編集処理を実行させる（ステップＳ５９）。【００６３】以上の説明から明らかな如く、本実施例に
よれば、新たなデータＥ_Yを登録する登録部１２は、最
初にファイル４０の最終データ格納領域Ａ_Tおよび先頭
データ領域Ａ_Hにアクセスして登録対象データＥ_Yを登
録すべき基本データ領域Ａ_Bfおよび予備領域Ａ_Sfを決定
し、続いて予備領域Ａ_Sfにアクセスして登録対象データ
Ｅ_Yを基本データ領域Ａ_Bfおよび予備領域Ａ_Sfの何れに
登録すべきかを判定し、予備領域Ａ_Sfに登録すべきと判
定した場合には直ちに予備データ群Ｇ_Sfに登録対象デー
タＥ_Yを昇順に配列した後、予備領域Ａ_Sfに格納し、基
本データ領域Ａ _Bfに格納すべきと判定した場合には基本
データ領域Ａ_Bfにアクセスして基本データ群Ｇ_Bfに登録
対象データＥ_Yを昇順に配列した後基本データ領域Ａ_Bf
に格納し、その結果基本データ領域Ａ_Bfを溢れたデータ
Ｅ_Bfが発生した場合には、再び予備領域Ａ_Sfにアクセス
して溢れたデータＥ_Bfを予備データ群Ｇ_Sfに昇順に配列
した後、予備領域Ａ_Sfに格納し、更に必要に応じて最終
データ格納領域Ａ_Tにアクセスして最終データＥ_Tを更
新し、或いは先頭データ領域Ａ_Hにアクセスして先頭デ
ータＥ_B1Hを更新することとなり、ファイル４０に対す
るアクセス回数は四回乃至七回で済み、検索時間も短縮
される。【００６４】次に、本発明によるファイル再編集処理
を、図２、図３および図６により説明する。図２、図３
および図６において、登録部１２が新たなデータＥ_Yを
予備領域Ａ _Sfに登録した結果、データ数ｎ_Sfがデータ格
納容量Ｎと等しくなり、予備領域Ａ _Sfに更に新たなデー
タＥ_Yを登録する余裕が存在存在しないと判定した場合
に、前述の如く再編集部１３を起動する。【００６５】起動された再編集部１３は、データ数ｎ_Sf
がデータ格納容量Ｎと等しくなった予備領域Ａ_Sfに対応
する基本データ領域Ａ_Bfの直後の基本データ領域Ａ_Bg以
降の総ての基本データ領域Ａ_Bにアクセスし、それぞれ
格納されている基本データ群Ｇ_Bg（但しｇ＝ｆ＋１）以
降の総ての基本データ群Ｇ_Bを、一基本データ領域Ａ _B
分宛後へ移動して格納し、基本データ領域Ａ_Bgを空ける
（図６ステップＳ６１）。【００６６】次に再編集部１３は、予備領域Ａ_Sfにアク
セスし、格納されている予備データ群Ｇ_Sfを抽出し、新
たな基本データ群Ｇ_Bgとして空き領域となった基本デー
タ領域Ａ_Bgに移動し、予備領域Ａ_Sfを空ける（ステップ
Ｓ６２）。【００６７】次に再編集部１３は、予備領域Ａ_Sf直後の
予備領域Ａ_Sg以降の総ての予備領域Ａ_Sにアクセスし、
それぞれ格納されている予備データ群Ｇ_Sg以降の総ての
予備データ群Ｇ_Sを、一予備領域Ａ_S分宛後へ移動し、
予備領域Ａ_Sgを空ける（ステップＳ６３）。【００６８】次に再編集部１３は、先頭データ領域Ａ_H
にアクセスし、格納されている総ての先頭データＥ_BfH
を抽出し、ステップＳ６１乃至Ｓ６２において移動した
基本データ群Ｇ_Bg以降に対応する先頭データＥ_BgH以降
を更新する（ステップＳ６４）。以上により、基本デー
タ領域Ａ_Bfには、データ数ｎ_Bfがデータ格納容量Ｎと等
しい基本データ群Ｇ_Bfが格納されているが、対応する予
備領域Ａ_Sfは総て空き領域（ｎ_Sf＝０）となり、以後新
たなデータＥを登録する場合には予備領域Ａ_Sfに登録可
能となり、また基本データ領域Ａ_Bgには、此迄予備領域
Ａ_Sfに格納されていたデータ数ｎ_Sfがデータ格納容量Ｎ
と等しい予備データ群Ｇ_Sfが、データ数ｎ _Bgがデータ格
納容量Ｎと等しい基本データ群Ｇ_Bgとして格納されてい
るが、対応する予備領域Ａ_Sgは総て空き領域（ｎ_Sg＝
０）となり、以後新たなデータＥを登録する場合には予
備領域Ａ_Sgに登録可能となる。【００６９】以上の説明から明らかな如く、本実施例に
よれば、登録部１２がデータＥ_Yを予備領域Ａ_Sfに登録
した結果、データ数ｎ_Sfがデータ格納容量Ｎと等しくな
り、更に新たなデータＥ_Yを登録する余裕が無くなった
場合には、再編集部１３が基本データ領域Ａ_Bおよび予
備領域Ａ_Sに登録済の基本データ群Ｇ_Bおよび予備デー
タ群Ｇ_Sを一基本データ領域Ａ_Bおよび一予備領域Ａ_S
宛移動させることにより、各予備領域Ａ_Sに新たなデー
タＥ_Yを登録可能な余裕を持たせることが可能となる。【００７０】なお、図２乃至図６はあく迄本発明の一実
施例に過ぎず、例えばデータ群Ｇ内の各データＥ、或い
は先頭データ領域Ａ_H内の各先頭データＥ_BfHは昇順に
分類されるものに限定されることは無く、降順に分類す
る等、他に幾多の変形が考慮されるが、何れの場合にも
本発明の効果は変わらない。またファイル２００の構造
は図示されるファイル４０に限定されることは無く、他
に幾多の変形が考慮されるが、何れの場合にも本発明の
効果は変わらない。また本発明の対象となる情報処理シ
ステム１００は、図示される情報処理システム１０に限
定されることは無く、例えばファイル４０を磁気ディス
ク装置（ＤＫ）３以外に光ディスク装置、或いはフロッ
ピイディスク装置に設ける等、他に幾多の変形が考慮さ
れるが、何れの場合にも本発明の効果は変わらない。【００７１】【発明の効果】以上、本発明によれば、前記情報処理シ
ステムにおいて、データの検索範囲が一対のデータ領域
および予備領域に限定される為、検索時間が大幅に短縮
され、また新たなデータを登録する為の空き領域を生成
する為に移動させるデータ数も限定される為、登録時間
も大幅に短縮されることとなり、当該情報処理システム
のデータ検索および登録効率が大幅に向上する。DETAILED DESCRIPTION OF THE INVENTION [0001] BACKGROUND OF THE INVENTION The present invention relates to an external storage device.
Information processing to search for data stored in
Data retrieval method in a physical system. [0002] 2. Description of the Related Art FIG. 7 shows an example of a conventional information processing system.
FIG. 8 is a diagram showing the sequential search file in FIG.
FIG. 9 is a diagram showing an example of a file structure, and FIG.
FIG. 10 is a diagram illustrating an example of a search process. FIG.
FIG. 11 is a diagram showing an example of a hexadecimal search file structure, and FIG.
FIG. 12 is a diagram showing an example of a binary search process in FIG.
8 shows an example of a hash search file structure in FIG.
FIG. 13 is an example of the hash search process in FIG.
FIG. In FIG. 7, an information processing system 10 comprises:
Processing device (CPU) 1, storage device (MM) 2, magnetic disk
Disk unit (DK) 3, magnetic disk controller (DKC)
5, input / output device (IO) 6 and input / output control device (IO)
C) 7 and the processing device (CPU) 1
Data search request input from the input / output device (IO) 6
In the magnetic disk drive (DK) 3
Retrieve the specified data from file 4 and, for example,
In response to a data registration request input from the input / output device (IO) 6
Accordingly, the designated data is registered in the file 4. In a conventional information processing system 10,
Is a sequential search process as a data search process in the file 4.
, Binary search processing and hash search processing
Was. First, an example of a conventional sequential search process is shown in FIGS.
And FIG. [0005] File 4 shown in FIG.
A (Each data is A_i, Where i indicates the order of registration)
It is arranged in the order of recording, and data A₁Is the fastest climb
It is recorded data and data A_nWas last registered
Data. At the beginning of file 4, the current file
The number n of registered data is stored in the file 4. Take
In the state, any data A from the input / output device (IO) 6_Xof
When a search request is input, the processing device (CPU) 1
Data A registered in file 4_iExtract and enter in the order of registration
Search target data A_X(Step S9 in FIG. 9)
1 to S93), search target data A_XData that matches
A_iThe extraction and comparison are repeated until
S95), matching data A_iIs detected (step
S93), detected data A_iRequired information corresponding to
Report D_i(Step S96), and all data A_i
Does not match (step S94), the search target data
A_XIs not registered (step S97). [0007] New data A_YRegister to file 4
The data stored at the beginning of file 4
By referring to the data number n, the last data A_nIs registered
The n-th area that has been registered is identified, and the registration target data A_Y
In the next (n + 1) th area_{n + 1}Stored as
At the same time, the number of data n is updated to (n + 1). As is clear from the above description, the conventional order
In the next search process, new data A_YRegistration process
Always refers to the number of data n and the data A_YRegistration area
Recognizes and stores data A in the recognized registration area._YTo store
Although the registration time is constant and short, data A_XInspection
The search processing is performed on the search target data A._XData A that matches_iBut
Search time to search sequentially from the first area until detected
The number of accesses to file 4 that determines
A_XAnd the average number of access
It becomes longer in proportion to the data number n. The processing device (CPU) 1 has a file 4
Extract a predetermined number of data from a single access
Or store, but the number of accesses is proportional to the number n of data
The tendency is not to change. Next, an example of a conventional binary search process is shown in FIG.
This will be described with reference to FIG. 10 and FIG. Files shown in Figure 10
4, each data B (individual data B_jWhere j is ascending
Order) are classified into a certain order (ascending or descending).
For example, assume that data B is sorted in ascending order.
Then, data B₁Is the currently registered minimum data B,
Data B_nIs the currently registered maximum data B. At the beginning of file 4, the current file
The number n of registered data is stored in the file 4. Take
In the state, any data B from the input / output device (IO) 6_Xof
When a search request is input, the processing device (CPU) 1 first
Referring to the number of data n, all data B registered in file 4
Intermediate value B of_m(For example, when the number of data n = 7, the data B
_Four, And when the number of data n = 8, the data B_FourEtc.)
And search target data B_X(Step S11 in FIG. 11)
1 and S112), search target data B_XIs the intermediate value B_m
If smaller, intermediate value B_mLower area bisected by
Intermediate value B _mExtract and search target data
B_X(Steps S113 and S112).
Search target data B_XIs the intermediate value B_mMedium if larger
Intermediate value B_mIs further medium for the upper half area divided by
Intermediate value B_mTo extract the search target data B_XAnd compare
S114 and S112), search target data B_XWhen
Matching intermediate value B_mRepeat the above process until is extracted
And search target data B_XIntermediate value B that matches_mIs extracted
(Step S112), the extracted intermediate value B_mTo
Required required information D_i(Step S11)
5). Further, new data B_YRegister to file 4
If you want to register
Data B_YDetermine the registration area of the
All the data B stored in the
The registration area determined by moving
Data B_YIs stored in the registration area, and the number of data n is
Update to (n + 1). As is apparent from the above description, the conventional two
In binary search processing, data B_XSearch processing of the registration
Search time while searching for all data B
The number of accesses to file 4 to be determined is also in the order of registration and ascending
It is almost constant regardless of the order, and increases directly with the number of data n.
Relatively average search time without any proportional increase
, But new data B_YRegistration process
After the registration area is determined in the same process as the
Move all data B stored in the destination to one data destination
Required, the longer the registration area is near the top, the longer the registration time
I do. Next, an example of a conventional hash search process
This will be described with reference to FIGS. 12 and 13. Shown in Figure 12
File 4 has a hash value search area A_hAnd the data area
Area A_chAnd the chain data area A_chxAnd is provided
You. Hash value search area A_hIs the search target
Data C (each data is_j) Is predetermined
By assigning to the hash function, the corresponding hash
There is an area for calculating the value h, for example, each search target data C
_jAnd the calculated hash value h_jAnd an index table corresponding to
Is considered. Data area A_chCorresponds to each data C
Area stored in the area specified by the hash value h
And the hash value h_jData corresponding to_hjAnd table
You. Chain data area A_chxIs the data distribution
Multiple data, such as when the number of data is large relative to the length of the column
Is likely to be the same
Thus, the same hash value h was obtained from a plurality of data C.
In this case, a plurality of data C corresponding to the hash value h are linked.
Data area A_chWithin this
Seed hash value h_jData C corresponding to_hj1, C
_hj2,... A pointer P indicating the storage area of the concatenation of_chjIs stored
Have been. In this state, the input / output device (IO) 6
Arbitrary data C_XWhen a search request is input, the processing unit
(CPU) 1 first of all, hash value search area A of file 4
_hTo retrieve the search target data C_XHash value h of_XSeeking
(Step S131 in FIG. 13), then the data area A_chof,
Hash value h_XRefers to the area specified by
Step S132), the only data C in the area_hXIs stored
If there is (step S133), the extracted data
C_hXRequired information D corresponding to_X(Step S
136). On the other hand, the data area A_chThe hash value h_X
Pointer P to the area specified by_chXIs stored
If there is (step S133),
Area A_chxAnd pointer P_chXSpecified by
Data C stored in the area_hXExtract one to
Search target data C_X(Step S134
And S135), matching data C_hXIs detected
And the extracted data C _hXRequired information D corresponding to_XSee
(Step S136). Further, new data C_YRegister to file 4
If you want to register
Data C_YHash value h corresponding to_YAnd the data area
Area A _chHash value h in_YIn the area corresponding to
Data C_YIs stored. Note that the hash value h_YBut other registration
Data C_hjIf they match, the data area A_chTo
Pointer P_chAnd the registration target data C_YIs Choi
Data area A_chxTo be stored. As is clear from the above description, the conventional c
In the hash search process, the data C_XThe search process of
Search time to determine the search area by finding the hash value h
The number of accesses to file 4 that determines
Although search time is reduced, c
When the hash value is the same, the number of searches increases,
The time may be long. Also new data C_YClimb
Recording has the same problem. [0021] The problem to be solved by the present invention is apparent from the above description.
As described above, in a conventional information processing system,
When using a search, the registration time for new data is uniformly short
Search time is longer depending on the number of data and the order of registration.
Fluctuate, and when using a binary search, the search time
Is almost uniform in a short time, but the registration time of new data is large or small.
When it fluctuates greatly in order and further uses hash search
Has a long search time and registration time depending on the data structure.
Search time and registration time are both short
Time was not enough. According to the present invention, regardless of the number and configuration of data,
The aim is to reduce search and registration time
You. [0023] FIG. 1 shows the principle of the present invention.
FIG. In FIG. 1, reference numeral 100 denotes an object of the present invention.
Information processing system, 200 is an information processing system 100
Is a file to be searched which is provided in. Reference numeral 210 denotes a file in the file 200 according to the present invention.
Are a plurality of data areas provided. 220 is
According to the present invention, each data area 210 is stored in the file 200.
This is a spare area provided correspondingly. Reference numeral 201 denotes a file in the file 200 according to the present invention.
Common to each data area 210 and spare area 220
This is the first data area that has been shifted. 300 according to the invention
Search means provided in the information processing system 100.
You. Reference numeral 400 denotes an information processing system according to the present invention.
100 is a registration means. 500 is the present invention
Re-editing means provided in information processing system 100
It is. [0027] Each data area 210 has a predetermined number of data.
Data storage capacity, and multiple
Store the data respectively. The spare area 220 is the same as the data area 210.
It has a single data storage capacity and the storage area of the data area 210
When the data area becomes insufficient, the
Storage area 220, data area 210 and spare area 22
The data is classified and stored in a predetermined order from 0 through 0. The first data area 201 is composed of each data area 2
At the top of the array of data stored in
Only the first data to be stored is stored and arranged in
You. The search means 300 determines that a search request for required data is information
When input to the processing system 100, the first data area
Searches the area 201 and retrieves the data in which the search target data is stored.
Data area 210 and the spare area 220 are detected.
Search from the issued data area 210 or spare area 220
Search for the target data. The registration means 400 is used to register required data.
RequestSaidWhen input to the information processing system 100,
Searches the first data area 201 and stores the registration target data
Search for a pair of data area 210 and spare area 220 to be
Data area 210 or spare area 220
Are stored in descending order. The re-editing means 500 is provided for any data area 2
Additional data is registered in the pair of 10 and spare area 220
When there is no room left
Spare area 220 of a pair of area 210 and spare area 220
The data stored in the new data area 210 and
And moved to the data area 210 of the pair of the spare area 220
Later, a pair of a new data area 210 and a spare area 220
All data areas 210 and spare areas 220 including
Re-edit the data stored in the pair in descending order, and
Start of new data area 210 in start data area 201
Add data and sort by size. The data area 210 and the spare area 22
0 indicates that the information processing system 100
And the number of data that can be stored and extracted in one access
Determining the storage capacity so that it can be stored is considered. The search means 300, registration means 400 and
And the reediting means 500, each data area 210, the spare area
When searching for the first data area 220 and the first data area 201,
Searching by a binary search method is considered. Therefore, the data search range is a pair of data
Search time is significantly longer because it is limited to the area and spare area
It is shortened and free space for registering new data
Since the number of data to be moved for generation is also limited, register
The time will be greatly reduced, and the information processing system will be
System data retrieval and registration efficiency is greatly improved. [0035] An embodiment of the present invention will be described below with reference to the drawings.
You. FIG. 2 shows an information processing system according to an embodiment of the present invention.
FIG. 3 is an example of a file structure in FIG.
FIG. 4 is a diagram showing the data search process in FIG.
FIG. 5 is a diagram showing an example, and FIG. 5 shows a new data registration in FIG.
FIG. 6 is a diagram showing an example of processing, and FIG.
FIG. 14 is a diagram illustrating an example of a file reediting process. In addition, through all figures
The same reference numerals indicate the same objects. FIG. 2 shows the information processing system shown in FIG.
Information processing system shown in FIG.
An information processing system 10 having the same configuration as that of FIG.
FIG. 3 shows a file 200 shown in FIG.
File 40 in the magnetic disk drive (DK) 3
Provided, and as a search means 300 in FIG.
Search unit 11 for executing a data search process as shown in FIG.
Is provided in the processing unit (CPU) 1 and is also shown in FIG.
The new data as shown in FIG.
The registration unit 12 that executes the registration process is a processing device (CPU) 1
And the re-editing means 500 in FIG.
To execute a file re-editing process as shown in FIG.
The reediting unit 13 is provided in the processing device (CPU) 1
You. The file 40 shown in FIG.
K basic data areas having the specified data storage capacity N
Area A_B(Individual basic data area is A_BfWhere f
Are provided with 1 to k), and each basic data area A_BTo
Correspondingly, the basic data area A _BSame data storage capacity as
Spare area A with N_SIs provided for each one, and each basic
Data area A_BAnd spare area A_SCommon to the first data
Area A_HAnd final data storage area A_TEach one
Are provided. Each basic data area A_BAnd spare area
A_SThe data storage capacity N of the processing device (CPU) 1 is
File40Can be stored and extracted once access to
It is assumed that the number is set equal to the number of usable data. All data E stored in the file 40
Are sorted in ascending order and then divided into k or less data groups G.
And the basic data area A_BIs stored in the
Area A_BIf it cannot be stored in the
Area A_STo be stored. Therefore, each data group G₁, G_Two, ..., G_k
Any data extracted from is always arranged in ascending order
You. Each data E included in each data group G is
Are sorted in ascending order, and the corresponding basic data areas A
_BAnd, if necessary, spare area A_SIs also stored. What
Basic data area A in data group G_BPart stored in
The basic data group G _BAnd reserved area A_SStored in
Part is the spare data group G_SCalled. Each basic data area A_BEach basic data
Data group G_BNumber of data contained in_BIs stored in each spare
Area A_SContains each preliminary data group G_SNumber of data included in
n_SIs stored. On the other hand, head data area A_HContains the basic data
Area A_BfBasic data group G stored in_Bfin
, And each head data E sorted at the top in ascending order_BfH(Ie
Each basic data group G_BIs the smallest data in the
They are arranged in ascending order. Further, the final data storage area A_TIn the file
When all the data E stored in the file 40 are arranged in ascending order,
The final data E that is finally classified_T(Ie the largest de
Data) is stored. First, a file search process according to the present invention
This will be described with reference to FIGS. 2 to 4
And any data E from the input / output device (IO) 6_XSearch for
When a request is input, the processing device (CPU) 1
Start 1 The started search unit 11 first stores the file 4
0 leading data area A_HTo condition E_BfH≦ E
_X<E_{B (f + 1) H}Data E where_BfHSearch for
As a result, the search target data E_XMay be stored
Basic data area A_BfAnd spare area A_{Science fiction}Decide
(Step S41 in FIG. 4). Next, the search unit 11 determines the determined preliminary area A_{Science fiction}
And the stored spare data group G_{Science fiction}At the beginning
Data E_SfHAnd extract the search target data E_XAnd size comparison
(Step S42). As a result of the comparison, the condition E_X≧ E_SfHHolds
For example, search target data E_XIs the spare area A_{Science fiction}Stored in
(Step S43), the preliminary data group G_{Science fiction}From
Search target data E by binary search_XExtract (Step
S44), extracted search target data E_XRequired for
Information D_XIs referred to (step S46). As a result of the comparison, the condition E_X<E_SfHIs established
Then, search target data E_XIs the basic data area A_{Science fiction}Case
Is determined (step S43), and the basic data
Area A_BfTo access the basic data group G_BfExtract and extract
Released basic data group G_BfSearch target data by binary search from
Data E_XIs extracted (step S45), and the extracted search pair
Elephant data E_XRequired information D corresponding to_XRefer to
Step S46). As is clear from the above description, the present embodiment
According to the data E_XThe search unit 11 for searching for
First, the head data area A of the file 40_HAccess
Search target data E_XBasic data area A in which is stored
_BfAnd spare area A_{Science fiction}And then reserve area A_{Science fiction}To
Access and search target data E_XIs the basic data area A _Bf
And spare area A_{Science fiction}Judge which is stored in
And spare area A_{Science fiction}Is determined to be stored in
Is immediately the spare data group G_{Science fiction}Search target data E from _XExtract
Out, basic data area A_BfIs determined to be stored in
Data area A_BfAccess to basic data
Group G_BfSearch target data E from_XTo extract
The number of accesses to the file 40 may be two or three times,
Search time is also reduced. Next, a new data registration process according to the present invention will be described.
This will be described with reference to FIGS. 2, 3 and 5. Figures 2, 3 and
In FIG. 5, new data is input from the input / output device (IO) 6.
TA E_YWhen a registration request is input, the processing device (CPU)
1 activates the registration unit 12. The started registration unit 12 first stores the file 4
0 final data storage area A_TAssessed and stored
Last data E_TAnd the registration target data E_YAnd large
A small comparison is made (step S51 in FIG. 5). As a result of the comparison, the condition E_Y<E_TWhere is established
In this case (step S52), the registration unit 12
Area A_HTo access each head data E_BfHExtract
And condition E_B1H> E_YIf the condition is satisfied,
TA E_YBasic data area A for storing_B1And determine the article
Case E_BfH≦ E_Y<E_{B (f + 1) H}Data E where_Bf
_HIs searched, the registration target data E_YStore
Basic data area A_BfAnd spare area A_{Science fiction}Determine (S
Step S53). Next, the registration unit 12 determines the reserved area A_{Science fiction}
And the stored spare data group G_{Science fiction}At the beginning
Data E_SfHTo extract the search target data E_XAnd size comparison
And, as a result of the comparison, the condition E_Y> E_SfHIs satisfied,
S54), search target data E_XIs the spare area A_{Science fiction}Case
Is determined to be stored, and the spare area A_{Science fiction}Number of data extracted from
n_{Science fiction}And preliminary data group G_{Science fiction}To be registered by binary search
Data E_YIs determined (step S5).
5), all registered data E stored after the registration area_{Science fiction}
Is moved to the destination of one data E to make the registration area empty,
Registration target data E_YIs stored in the registration area and
Step S56), number of data n_{Science fiction}To (n_{Science fiction}+1)
(Step S57), number of updated data n_{Science fiction}And spare
Data group G _{Science fiction}To the spare area A_{Science fiction}To be stored. The registration unit 12 stores the number of updated data n_{Science fiction}
To spare area A_{Science fiction}By comparing with the data storage capacity N of
And spare area A_{Science fiction}New data E_YTo register
It is determined whether or not there is room (step S58).
Number n_{Science fiction}Is smaller than the data storage capacity N and there is a margin.
Finishes the registration process without executing the re-editing process
Is the number of data n_{Science fiction}Is equal to the data storage capacity N,
If it does not exist, the re-editing unit 13 is started, and
Then, a reediting process is executed (step S59). On the other hand, the registration unit 12_{Science fiction}Extract from
Top data E_SfHAnd registration target data E_YAnd compare
As a result, condition E_Y<E_SfHIs satisfied (step S5
4), search target data E_XIs the basic data area A_BfStored in
And the basic data area A_BfTo access
Data number n_{Science fiction}And basic data group G_BfExtract and binary search
The registration target data E_YDetermine the registration area for storing
(Step S510). Condition E_B1H> E_YIf
(Step S511), registration target data E_YIs Basic Day
Group G_B1Start data E_B1HAs the basic data area A
_B1Of the registration target data E_YClimbing
After determining the recording area, the head data area A_HAccess
And start data E_B1HTo be registered E_YUpdate to
(Step S512). Registration target data E_YRegistration area is determined
Then, the registration unit 12 registers all the registrations stored after the registration area.
Data E_BfTo the registration area by moving
After opening, the registration target data E_YIs stored in the registration area
(Step S513). The registration unit 12 stores the registered data E_BfOne
As a result of moving to the destination of the data E, the registered data E_BfBut
Basic data area A_BfTo see if it overflowed (step
Step S514), the number of data n_BfIs less than the data storage capacity N
Overflowing registered data E_BfIf not, the extracted data
Number n_BfTo (n_BfNumber of updated data after updating to +1)
n_BfAnd basic data group G_BfTo the basic data area A_BfCase
To complete the registration process, but the number of data n_BfIs the data case
Overfilled registered data E equal to the storage capacity N_BfWhen comes out
Also has a spare area A_{Science fiction}Access to stored data
Number n_{Science fiction}And preliminary data group G_{Science fiction}And extract the preliminary data
G_{Science fiction}All registered data E stored in_{Science fiction}To one data E
After moving to the destination and leaving the first area, the basic data area
A_BfData E overflowing from_BfIs stored in the first area and
(Step S516), number of data n_{Science fiction}To (n_{Science fiction}+1)
Updated (step S57), the number of updated data n_{Science fiction}And
And preliminary data group G_{Science fiction}To the spare area A_{Science fiction}To be stored. The registration unit 12 checks the updated state as described above.
Number of data n_{Science fiction}To spare area A_{Science fiction}And data storage capacity N
By comparison, the spare area A_{Science fiction}New data E
_YTo determine whether there is room to register
S58), number of data n_{Science fiction}Is less than the data storage capacity N
If exists, register without executing re-editing process
The processing ends, but the number of data n_{Science fiction}Is the data storage capacity N
If they are equal and there is no room, start the reediting unit 13
Then, a reediting process is executed as shown in FIG.
Step S59). On the other hand, the registration unit 12 stores the final data storage area A
_TFinal data E extracted from_TAnd registration target data E_YWhen
(Step S51), the condition E_Y> E
_TIs satisfied (step S52), the registration unit 12
Is the first data area A_HTo access the last head data E
_BfHAnd extract the final data E_TIs the basic data
Data area A_BfAnd spare area A_{Science fiction}(Step S5)
17), registration target data E_YBasic data area for storing
A_BfAnd spare area A_{Science fiction}To determine. Next, the registration unit 12 stores the basic data area A_BfTo
Number of data accessed and stored n_BfAnd basic data
Data group G_BfIs extracted and the number of data n_BfAnd data storage capacity N
Is compared with the basic data area A_BfRegistered vs
Elephant data E_YAnalyze whether there is room to register
(Step S518), the number of data n_BfIs the data storage capacity
If it is confirmed that there is a margin below the amount N,
Registration target data E_YTo the basic data group G_BfRegister at the end of
(S519), and the number of data n_BfTo (n_Bf+1)
After updating (step S520), the number of updated data n
_BfAnd basic data group G_BfTo the basic data area A_BfStored in
To end the registration process, but the number of data n _BfIs data storage
Basic data area A equal to capacity N_BfData to be registered in
E_YIf you confirm that there is no room to register
(Step S518), the next corresponding spare area A_{Science fiction}
And the number of stored data n_{Science fiction}And spare
Data group G_{Science fiction}And the registration target data E_YThe preliminary day
Group G_{Science fiction}(S521) and the data
Number n_{Science fiction}To (n_{Science fiction}+1) (step S5).
7), number of updated data n_{Science fiction}And preliminary data group G_{Science fiction}To
Reserved area A_{Science fiction}To be stored. The registration unit 12 stores the number of updated data n_{Science fiction}
To the spare area A_{Science fiction}By comparing with the storage capacity of
Area A_{Science fiction}New data E_YThere is room to register
It is determined whether or not to perform (step S58), and the number of data n_{Science fiction}
Is re-edited especially if the data storage capacity is less than N and there is room
The registration process ends without executing the process.
Number n_{Science fiction}Is equal to the data storage capacity N and there is no room
If so, the reediting unit 13 is started, and the
An editing process is executed (step S59). As is clear from the above description, the present embodiment
According to the new data E_YThe registration unit 12 for registering
First, the final data storage area A of the file 40_TAnd the beginning
Data area A_HTo access the registration target data E_YClimb
Basic data area A to be recorded_BfAnd spare area A_{Science fiction}Decide
And then the spare area A_{Science fiction}Access to registration data
E_YTo the basic data area A_BfAnd spare area A_{Science fiction}Any of
It is determined whether to register, and the spare area A_{Science fiction}Should be registered with
If specified, the preliminary data group G_{Science fiction}Data to be registered in
TA E_YAre arranged in ascending order, and the spare area A_{Science fiction}Stored in the base
This data area A _BfIf it is determined that it should be stored in
Data area A_BfTo access the basic data group G_BfRegister with
Target data E_YAre arranged in ascending order and then the basic data area A_Bf
In the basic data area A_BfData overflowing
E_BfOccurs again, the spare area A_{Science fiction}Access
And overflowing data E_BfTo the preliminary data group G_{Science fiction}Array in ascending order
After that, the spare area A_{Science fiction}At the end and, if necessary, final
Data storage area A_TTo access the final data E_TUpdate
New or top data area A_HTo access the first
Data E_B1HWill be updated, and the file 40
The number of access times is only 4 to 7 times, and the search time is reduced
Is done. Next, the file re-editing process according to the present invention
Will be described with reference to FIGS. 2, 3, and 6. FIG. 2 and 3
In FIG. 6 and FIG._YTo
Reserved area A _{Science fiction}As a result, the number of data n_{Science fiction}Is the data case
Spare area A _{Science fiction}New day to
TA E_YWhen it is determined that there is no room to register
Then, the reediting unit 13 is started as described above. When the reediting unit 13 is started, the data number n_{Science fiction}
Is equal to the data storage capacity N._{Science fiction}Compatible with
Basic data area A_BfBasic data area A immediately after_BgLess than
All descending basic data areas A_BAccess to each
Basic data group G stored_Bg(However, g = f + 1) or less
All basic data group G of descending_BTo one basic data area A _B
Move to the destination and store it in the basic data area A_BgEmpty
(Step S61 in FIG. 6). Next, the reediting unit 13 sets the spare area A_{Science fiction}Access
Spare data group G accessed and stored_{Science fiction}Extract the new
Tana basic data group G_BgBasic data that became free space as
Area A_BgTo the spare area A_{Science fiction}Empty (step
S62). Next, the reediting section 13 sets the spare area A_{Science fiction}Immediately after
Reserved area A_SgAll subsequent spare areas A_SAccess to
Spare data group G stored respectively_SgAll subsequent
Spare data group G_STo one spare area A_SMove to
Reserved area A_SgIs empty (step S63). Next, the reediting unit 13 sets the start data area A_H
, And all the stored leading data E_BfH
And moved in steps S61 to S62.
Basic data group G_BgStart data E corresponding to the following_BgHOr later
Is updated (step S64). Thus, the basic data
Area A_BfContains the number of data n_BfIs equal to the data storage capacity N
New basic data group G_BfIs stored, but the corresponding
Area A_{Science fiction}Are all free areas (n_{Science fiction}= 0) and new
When registering data E, the spare area A_{Science fiction}Can be registered to
And basic data area A_BgSo far, the reserve area
A_{Science fiction}Number of data stored in n_{Science fiction}Is the data storage capacity N
Spare data group G equal to_{Science fiction}Is the number of data n _BgIs the data case
Basic data group G equal to storage capacity N_BgStored as
But the corresponding spare area A_SgAre all free areas (n_Sg=
0), and when registering new data E thereafter,
Area A_SgCan be registered. As is clear from the above description, the present embodiment
According to the registration unit 12, the data E_YTo the spare area A_{Science fiction}Register with
As a result, the number of data n_{Science fiction}Is equal to the data storage capacity N.
And new data E_YI can't afford to register
In this case, the reediting unit 13_BAnd forecast
Area A_SBasic data group G registered in_BAnd preliminary data
Group G_STo one basic data area A_BAnd one spare area A_S
Each spare area A_SNew day in
TA E_YCan be registered. FIGS. 2 to 6 show only one embodiment of the present invention.
This is only an example. For example, each data E in the data group G, or
Is the first data area A_HEach head data E in_BfHAre in ascending order
It is not limited to what is classified, it is classified in descending order
Many other variations are considered, such as
The effect of the present invention does not change. The structure of the file 200
Is not limited to the file 40 shown in FIG.
Many variations are taken into account, but in each case
The effect remains the same. In addition, the information processing system
The stem 100 is limited to the illustrated information processing system 10.
The file 40 is not specified.
Optical disk drive or floppy disk drive other than the disk drive (DK) 3
Considering many other deformations such as mounting on a pi disc device
However, the effect of the present invention does not change in any case. [0071] As described above, according to the present invention, the information processing system
In the system, the data search range is a pair of data areas
Search time is greatly reduced because it is limited to
And create free space to register new data
Registration time because the number of data to be moved is limited.
Is greatly reduced, and the information processing system
Data retrieval and registration efficiency is greatly improved.

【図面の簡単な説明】【図１】本発明の原理を示す図【図２】本発明の一実施例による情報処理システムを
示す図【図３】図２におけるファイル構造の一例を示す図【図４】図２におけるデータ検索処理の一例を示す図【図５】図２における新データ登録処理の一例を示す
図【図６】図２におけるファイル再編集処理の一例を示
す図【図７】従来ある情報処理システムの一例を示す図【図８】図７における順次検索用ファイル構造の一例
を示す図【図９】図７における順次検索処理の一例を示す図【図10】図７における二進検索用ファイル構造の一例
を示す図【図11】図７における二進検索処理の一例を示す図【図12】図７におけるハッシュ検索用ファイル構造の
一例を示す図【図13】図７におけるハッシュ検索処理の一例を示す
図【符号の説明】１処理装置（ＣＰＵ）２記憶装置（ＭＭ）３磁気ディスク装置（ＤＫ）４、４０、２００ファイル５磁気ディスク制御装置（ＤＫＣ）６入出力装置（ＩＯ）７入出力制御装置（ＩＯＣ）１０、１００情報処理システム１１検索部１２登録部１３再編集部２０１先頭データ領域２１０データ領域２２０予備領域３００検索手段４００登録手段５００再編集手段BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 illustrates the principle of the present invention. FIG. 2 illustrates an information processing system according to an embodiment of the present invention. FIG. 3 illustrates an example of a file structure in FIG. FIG. 4 is a diagram showing an example of a data search process in FIG. 2 FIG. 5 is a diagram showing an example of a new data registration process in FIG. 2 FIG. 6 is a diagram showing an example of a file re-editing process in FIG. FIG. 8 shows an example of a conventional information processing system. FIG. 8 shows an example of a sequential search file structure in FIG. 7. FIG. 9 shows an example of a sequential search process in FIG. FIG. 11 shows an example of a binary search file structure. FIG. 11 shows an example of a binary search process in FIG. 7. FIG. 12 shows an example of a hash search file structure in FIG. Diagram showing an example of hash search processing [ DESCRIPTION OF SYMBOLS 1 Processing unit (CPU) 2 Storage unit (MM) 3 Magnetic disk unit (DK) 4, 40, 200 File 5 Magnetic disk control unit (DKC) 6 Input / output unit (IO) 7 Input / output control unit ( IOC) 10, 100 Information processing system 11 Search unit 12 Registration unit 13 Reediting unit 201 Top data area 210 Data area 220 Spare area 300 Search unit 400 Registration unit 500 Reediting unit

フロントページの続き (56)参考文献特開昭63−276639（ＪＰ，Ａ) 特開平３−263138（ＪＰ，Ａ) 山谷正己，ファイル編成入門，日本, 株式会社オーム社，1980年７月25日, 第１版，ｐ．65−ｐ．83 Ａ．Ｖ．エイホ・Ｊ．Ｅ．ホップクロフト・Ｊ．Ｄ．ウルマン，データ構造とアルゴリズム，日本，株式会社培風館, 1987年３月10日，初版，ｐ．323−ｐ. 336 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 12/00 G06F 17/30 Continuation of the front page (56) References JP-A-62-276639 (JP, A) JP-A-3-263138 (JP, A) Masami Yamatani, Introduction to File Organization, Ohmsha, Japan, July 25, 1980 Sun, 1st edition, p. 65-p. 83 A. V. Eiho J. E. FIG. Hop Croft J. D. Ullman, Data Structures and Algorithms, Japan, Baifukan Co., Ltd., March 10, 1987, first edition, p. 323-p. 336 (58) Fields investigated (Int. Cl. ⁷ , DB name) G06F 12/00 G06F 17/30

Claims

(57) [Claim 1] In an information processing system for retrieving data stored in a file provided in an external storage device, the file can be stored and extracted with a single access.
A plurality of data areas having a data storage capacity of a large number of data are provided, and one spare area having the same data storage capacity as the data area is provided for each of the data areas, and in each of the data areas, A plurality of data classified in the order of magnitude are stored, and when the storage area of each data area is insufficient, a spare area corresponding to the data area is classified in a predetermined magnitude through the data area and the spare area. Only the top data located at the top of the data stored in each data area in the descending order of magnitude is stored in common with each of the data areas and the spare area, and the top data sorted in descending order of magnitude. An area is provided, and when a search request for required data is input to the information processing system, the head data area is searched, and search target data is stored. Search means for detecting a pair of a data area and a spare area that have been searched, and searching for the search target data from the detected data area or the spare area; and a request for registering required data is input to the information processing system. Registration means for searching for the head data area, detecting a pair of a data area and a spare area in which data to be registered is to be stored, and storing the pair in the detected data area or the spare area in the descending order; When there is no room to further register additional data in the pair of the area and the spare area, the data stored in the spare area of the pair of the data area and the spare area which has no room is replaced with a new data area and a spare area. After moving to the data area of the area pair, all of the data area and the spare area including the pair of the new data area and the spare area are moved. Data re-editing means for re-editing the data stored in the data area in the descending order of the size and adding head data of a new data area to the head data area and classifying the data in the order of the magnitude. .