JP5010958B2

JP5010958B2 - Data management method, program and apparatus

Info

Publication number: JP5010958B2
Application number: JP2007091712A
Authority: JP
Inventors: 鐘龍崔; 昌樹白桃
Original assignee: Fujitsu Broad Solution and Consulting Inc
Current assignee: Fujitsu Broad Solution and Consulting Inc
Priority date: 2007-03-30
Filing date: 2007-03-30
Publication date: 2012-08-29
Anticipated expiration: 2027-03-30
Also published as: JP2008250727A

Description

本発明は、コンピュータを用いた大規模なデータベースにおけるデータ管理方法、プログラム及び装置に関し、特に、表形式のデータを読み込んで保存する際の手法に関する。 The present invention relates to a data management method, program, and apparatus for a large-scale database using a computer, and more particularly to a technique for reading and storing tabular data.

大規模なデータベースには、多数のデータを表形式で管理し、複数の表を関連付けて運用するリレーショナルデータベース(ＲＤＢ)が従来から用いられている。ＲＤＢはトランザクション処理に向いているため、各種の基幹システムに用いられているが、全件のソートや更新、検索といった処理には時間がかかる。 Conventionally, a large-scale database uses a relational database (RDB) that manages a large number of data in a table format and associates and operates a plurality of tables. Since RDB is suitable for transaction processing, it is used in various backbone systems, but it takes time to sort, update, and search all cases.

このようなＲＤＢの弱点を克服するため、特許文献１には、成分分解法によるＦＡＳＴ(Filter Array Structure)構造が開示されている。ＦＡＳＴ構造では、表形式のレコードの配列であるデータを順序、位置、値の成分に分解(コンパイル)して管理することにより、全件を対象とした処理の高速化を可能としている。 In order to overcome such weaknesses of RDB, Patent Document 1 discloses a FAST (Filter Array Structure) structure based on a component decomposition method. In the FAST structure, data that is an array of records in a tabular format is managed by being decomposed (compiled) into order, position, and value components, thereby enabling high-speed processing for all cases.

特許第３５８１８３１号公報Japanese Patent No. 3581831

しかしながら、上記のＦＡＳＴ構造を採用したデータ管理方法では、表形式の元データに含まれる全ての項目についてＦＡＳＴ構造へのコンパイルが必要となるため、データの登録コストが高くなる。また、データの追加や更新、集計の際にもコンパイルが必要となるため、これらの処理に時間がかかるという問題がある。 However, in the data management method adopting the FAST structure described above, since all items included in the tabular original data need to be compiled into the FAST structure, the data registration cost increases. In addition, since compilation is also required when adding, updating, and tabulating data, there is a problem that these processes take time.

本発明は、上述した従来技術の問題点に鑑みてなされたものであり、高速検索が可能なＦＡＳＴ構造を利用しつつ、データの登録、追加、更新等の際の処理時間を短縮することができるデータ管理方法、プログラム及び装置を提供することを目的(課題)とする。 The present invention has been made in view of the above-described problems of the prior art, and can reduce the processing time for data registration, addition, update, etc., while using a FAST structure capable of high-speed search. It is an object (problem) to provide a data management method, program, and apparatus that can be used.

本発明にかかるデータ管理方法は、コンピュータが、複数の項目についてそれぞれ項目値を持つレコードを複数含む表形式の元データの入力を受け付ける入力受付手順、入力された元データについて、データ処理のキーとして使用する項目については、ユニークな項目値をソートした配列である項目値テーブルと、レコードの配列順に当該レコードの項目値が格納されている前記項目値テーブル内の位置を記録した配列であるインデックス値テーブルとに分解して保存するコンパイル手順、入力された元データについて、データ処理のキーとして使用しない項目については、１レコードに複数の項目を含む表形式のデータとして保存する表形式保存手順を実行することを特徴とする。 In the data management method according to the present invention, the computer receives an input reception procedure for receiving input of original data in a table format including a plurality of records each having an item value for a plurality of items, and the input original data is used as a key for data processing. For items to be used, an item value table that is an array in which unique item values are sorted, and an index value that is an array that records the positions in the item value table in which the item values of the records are stored in the order in which the records are arranged Compile procedure for disassembling and saving to table, for input source data, for items not used as data processing key, execute tabular format save procedure to save as tabular data including multiple items in one record It is characterized by doing.

コンピュータが、さらに、表形式保存手段により保存されたデータについて、表形式で保存された全項目を１つの項目と見立て、ユニークな項目値をソートした配列である項目値テーブルと、レコードの配列順に当該レコードの項目値が格納されている項目値テーブル内の位置を記録した配列であるインデックス値テーブルとに分解して保存する第２のコンパイル手順を実行するようにしてもよい。 The computer further regards the data saved by the tabular saving means, assuming that all items saved in the tabular format are regarded as one item, an item value table that is an array in which unique item values are sorted, and the order in which the records are arranged You may make it perform the 2nd compilation procedure which decomposes | disassembles into the index value table which is the arrangement | sequence which recorded the position in the item value table in which the item value of the said record is stored.

なお、本発明のデータ管理プログラムは、上記の方法の各手順に相当する手段としてコンピュータを機能させることを特徴とし、本発明のデータ管理装置は、そのように機能するコンピュータと等価である。 The data management program of the present invention is characterized by causing a computer to function as a means corresponding to each procedure of the above method, and the data management apparatus of the present invention is equivalent to a computer that functions as such.

本発明によれば、入力された表形式の元データのうち、キーとして利用する項目についてのみＦＡＳＴ構造へのコンパイルを実行し、他の項目については表形式のまま保存するようにしたため、検索等に利用する項目についてはＦＡＳＴ構造に変換して高速検索を可能としつつ、キーとして利用しない項目についてはコンパイルをしないで表形式で保存することにより、全項目をコンパイルする従来の方法と比較してデータ登録の際、あるいは、レコードの追加、更新の際の処理時間を短縮することができる。 According to the present invention, the FAST structure is compiled only for the items used as keys in the input table format original data, and the other items are stored in the table format. Compared to the conventional method of compiling all items by compiling all the items by saving them in tabular form without compiling them, converting items used in the FAST structure to enable high-speed search Processing time for data registration or record addition / update can be shortened.

以下、本発明にかかるデータ管理方法の実施形態を説明する。最初に、図１に基づいて本実施形態のデータ管理方法が適用されるシステムの概要を説明する。このシステム１は、単独のコンピュータ１０と、周辺機器とから構成されている。コンピュータ１０は、ＣＰＵ１１、並びにこのＣＰＵ１１に接続されたハードディスク(ＨＤ)２０、メモリ(ＲＡＭ)１２及びインターフェイス１３を備えている。 Embodiments of a data management method according to the present invention will be described below. First, an outline of a system to which the data management method of the present embodiment is applied will be described based on FIG. The system 1 includes a single computer 10 and peripheral devices. The computer 10 includes a CPU 11, a hard disk (HD) 20, a memory (RAM) 12, and an interface 13 connected to the CPU 11.

なお、コンピュータ１０のＣＰＵ１１には、周辺機器としてディスプレイ３０、キーボード３１がインターフェイス１３を介して接続されている。 Note that a display 30 and a keyboard 31 are connected to the CPU 11 of the computer 10 via the interface 13 as peripheral devices.

ＨＤ２０には、図示せぬオペレーティングシステムの他、データ管理プログラム２１がインストールされると共に、データベース(ＤＢ)２２が構築されている。ＣＰＵ１１は、起動するとＨＤ２０からオペレーティングシステムをＲＡＭ１３上に読み出して実行し、このオペレーティングシステム上でデータ管理プログラム２１を実行する。 In addition to an operating system (not shown), a data management program 21 is installed in the HD 20 and a database (DB) 22 is constructed. When the CPU 11 is activated, it reads the operating system from the HD 20 onto the RAM 13 and executes it, and executes the data management program 21 on this operating system.

データ管理プログラム２１には、複数の項目についてそれぞれ項目値を持つレコードを複数含む表形式の元データの入力を受け付ける入力受付機能２１ａと、入力された元データについて、データ処理のキーとして使用する項目については、ユニークな項目値をソートした配列である項目値テーブルと、レコードの配列順に当該レコードの項目値が格納されている項目値テーブル内の位置を記録した配列であるインデックス値テーブルとに分解して保存するコンパイル機能２１ｂと、入力された元データについて、データ処理のキーとして使用しない項目については、１レコードに複数の項目を含む表形式のデータとして保存する表形式保存機能２１ｃとが含まれている。 The data management program 21 includes an input reception function 21a that accepts input of tabular source data including a plurality of records each having item values for a plurality of items, and items used as data processing keys for the input source data. Is broken down into an item value table that is an array in which unique item values are sorted, and an index value table that is an array that records the positions in the item value table in which the item values of the records are stored in the order in which the records are arranged And a table format saving function 21c that saves the input original data as items of a table format including a plurality of items in one record for items that are not used as data processing keys. It is.

ＤＢ２２は、キーとなる項目についてコンパイルして保存されたＦＡＳＴ構造データ２２ａと、キーとならない項目についてコンパイルせずに保存された表形式データ２２ｂとが格納されている。 The DB 22 stores FAST structure data 22a that is compiled and saved for items that are keys, and tabular data 22b that is saved without being compiled for items that are not keys.

なお、図１には、作業管理システムが単独のコンピュータにより構成される例を示したが、このコンピュータをサーバとしてネットワークを介して複数の端末を接続し、各端末から入力、閲覧ができるようにしてもよい。 FIG. 1 shows an example in which the work management system is configured by a single computer, but a plurality of terminals are connected via a network using this computer as a server so that input and browsing can be performed from each terminal. May be.

次に、上記のシステム１において実行されるデータ管理処理の内容を、図２に示すフローチャートに基づいて説明する。図２のフローチャートは、入力されたテキストデータを所定の形式で保存した後、入力されたコマンドに従ってデータを処理する手順を示している。処理が開始すると、ＣＰＵ１１は、ステップS001においてテキストデータの入力を受け付ける。テキストデータは、CSV形式、あいるはタブ区切りのテキスト等の表形式のデータであり、外部のリレーショナルデータベースから出力され、あるいは、ユーザにより入力される。 Next, the contents of the data management process executed in the system 1 will be described based on the flowchart shown in FIG. The flowchart of FIG. 2 shows a procedure for processing the data according to the input command after storing the input text data in a predetermined format. When the process starts, the CPU 11 accepts input of text data in step S001. Text data is CSV format or tabular data such as tab-delimited text and is output from an external relational database or input by a user.

入力時には、例えば図３に示すような選択画面がディスプレイ３０上に表示される。ユーザは、この画面上でテキストファイルを選択し、キーとする項目名をチェックボックスを使ってチェックする。例えば、この例では、master.txtというテキストファイルが入力ファイルとして選択されている。このファイルの内容は、図４に示されている。項目としてID，名称、性別が含まれている。図３の選択画面では、その中の項目「性別」がキーとして利用する項目として指定されている。 At the time of input, a selection screen as shown in FIG. 3 is displayed on the display 30, for example. The user selects a text file on this screen and checks the item name as a key using a check box. For example, in this example, a text file called master.txt is selected as the input file. The contents of this file are shown in FIG. Items include ID, name, and gender. In the selection screen of FIG. 3, the item “sex” is designated as an item to be used as a key.

ステップS001でテキストデータが入力されると、ＣＰＵ１１は、ステップS002において、テキストデータの構造から各項目を識別し、項目毎にキーとして利用される項目か否かを判断する。キーとして利用する項目については、ステップS003でＦＡＳＴ構造へコンパイルされ、キーとして利用しない項目については、ステップS004で表形式の中継データ項目の配列に追加される。ステップS005で他の項目があると判断される間は、ステップS002〜S004の処理を繰り返し、全ての項目の処理が終了すると、ステップS006に処理が進められる。 When text data is input in step S001, the CPU 11 identifies each item from the structure of the text data in step S002, and determines whether the item is used as a key for each item. Items used as keys are compiled into a FAST structure in step S003, and items not used as keys are added to the table-format relay data item array in step S004. While it is determined in step S005 that there are other items, the processes in steps S002 to S004 are repeated. When the processes for all the items are completed, the process proceeds to step S006.

処理されたテキストデータは、図５に示すような形式で保存される。すなわち、キーとして利用する項目「性別」については、ＦＡＳＴ構造にコンパイルされ、図５(A)に示すように、ユニークな項目値をソートした配列である項目値テーブルValueListと、レコードの配列順OrderSetと、当該レコードの項目値が格納されている項目値テーブル内の位置を記録した配列であるインデックス値テーブルValueNoとに分解して保存される。 The processed text data is stored in a format as shown in FIG. That is, the item “gender” used as a key is compiled into a FAST structure, and as shown in FIG. 5A, an item value table ValueList which is an array in which unique item values are sorted, and the order of records in the order set And an index value table ValueNo that is an array in which the position in the item value table in which the item value of the record is stored is recorded and saved.

一方、キーとして利用されない項目「ＩＤ」、「名称」は、コンパイルされず、図５(B)に示すような１つのレコードに複数の項目値を含む表形式の中継データ項目として保存される。ただし、レコードの位置を示すインデックス値テーブルValueListは生成される。 On the other hand, items “ID” and “name” that are not used as keys are not compiled, but are stored as relay data items in a table format including a plurality of item values in one record as shown in FIG. However, an index value table ValueList indicating the position of the record is generated.

次に、ステップS006でコマンドが入力されると、ＣＰＵ１１はステップS007〜S009でコマンドの種類に応じて項目をチェックする。すなわち、コマンドがコンパイルされた項目を対象とするものである場合には、対象となる項目が全てコンパイル済みか否かを判断し、１つでもコンパイル済みでない項目があると異常終了する。全ての対象項目がコンパイル済みである場合、あるいは、コマンドの対象がコンパイルされた項目を対象としない場合に、ステップS010でコマンドを実行する。コマンドが複数ある場合には、ステップS011からステップS007に戻って全てのコマンドについて上記のS007〜S010の処理を繰り返し実行する。 Next, when a command is input in step S006, the CPU 11 checks items according to the type of command in steps S007 to S009. That is, if the command is for a compiled item, it is determined whether all the items to be compiled have been compiled. If there is even one item that has not been compiled, the command terminates abnormally. If all the target items have been compiled, or if the target of the command does not target the compiled item, the command is executed in step S010. When there are a plurality of commands, the process returns from step S011 to step S007, and the processes of S007 to S010 are repeatedly executed for all commands.

全てのコマンドの実行が終了すると、ＣＰＵ１１はステップS012で処理結果としてテキストデータを出力する。ここでは、コンパイル済みの項目については、テキスト形式の文字列に変換し、表形式の中継データ項目と結合して１行ずつ出力する。全ての行の出力が終了すると、処理を終了する。 When the execution of all the commands is completed, the CPU 11 outputs text data as a processing result in step S012. Here, the compiled items are converted into text format character strings, combined with the tabular relay data items, and output line by line. When all the lines have been output, the process ends.

例えば、コマンドとして先に登録された図５に示すデータを性別順にソートして出力するよう入力した場合、出力のオプションを選択するために図６に示すような選択画面がディスプレイ３０に表示される。ここでは、出力ファイル名をmaster_new.txtと指定し、タブ区切りのデータとして、キーでない項目も出力するよう選択している。 For example, when the data shown in FIG. 5 previously registered as a command is input to be sorted and output in the order of gender, a selection screen as shown in FIG. 6 is displayed on the display 30 in order to select an output option. . Here, the output file name is specified as master_new.txt, and items that are not keys are output as tab-delimited data.

性別によるソートを指定しているため、ステップS007ではコンパイル対象のコマンドであると判断され、ステップS008でこの項目がコンパイル済みであることが確認され、他の処理対象はないためステップS009をNoで抜けてステップS010で性別によるソートが実行される。ソートの結果は図７に示すとおりである。すなわち、項目「性別」については、図７(A)に示すように、インデックス値テーブルValueNoの値が昇順となるように、レコードの配列順OrderSetが並べ替えられる。ここではOderSetの「２」と「３」とが入れ替えられている。キーとして利用されない項目は、操作されず、図７(B)に示した状態で保存されている。 Since sorting by gender is specified, it is determined in step S007 that it is a command to be compiled. In step S008, it is confirmed that this item has been compiled. Since there is no other processing target, step S009 is set to No. In step S010, sorting by gender is executed. The result of sorting is as shown in FIG. That is, for the item “gender”, as shown in FIG. 7A, the array order OrderSet is rearranged so that the values in the index value table ValueNo are in ascending order. Here, “2” and “3” of OderSet are interchanged. Items that are not used as keys are not operated and are stored in the state shown in FIG.

他のコマンドがなくステップS011をNoで抜けると、ステップS012でソートされた結果が１行ずつ出力される。このとき、図７(A)に示されるコンパイルされた項目「性別」は、コンパイル前の文字列に変換されて図７(B)に示される中継データ項目と結合されて出力される。図８は、出力フィルの内容を示す。この図に示されるように、ＩＤ、名称、性別の各項目を含むレコードが性別順にソートされて出力される。 If there is no other command and step S011 is skipped with No, the results sorted in step S012 are output line by line. At this time, the compiled item “gender” shown in FIG. 7A is converted into a pre-compiled character string, combined with the relay data item shown in FIG. 7B, and output. FIG. 8 shows the contents of the output fill. As shown in this figure, records including ID, name, and gender items are sorted and output in the order of gender.

なお、上記の実施形態では、キー項目とならない項目については、ＦＡＳＴ構造にコンパイルせずに、入力されたテキストデータと同様の表形式で保存している。ただし、これをさらにＦＡＳＴ構造にコンパイルすることも可能である。すなわち、表形式で保存されたデータについて、表形式で保存された全項目を１つの項目と見立て、ユニークな項目値をソートした配列である項目値テーブルと、レコードの配列順に当該レコードの項目値が格納されている項目値テーブル内の位置を記録した配列であるインデックス値テーブルとに分解して保存してもよい。 In the above embodiment, items that are not key items are stored in the same table format as the input text data without being compiled into the FAST structure. However, it can be further compiled into a FAST structure. That is, for data saved in tabular format, all items saved in tabular format are regarded as one item, an item value table that is an array in which unique item values are sorted, and the item value of the record in the order in which the records are arranged May be decomposed into an index value table that is an array in which positions in the item value table in which are stored are recorded.

図９は、中継項目についてもコンパイルする場合の入力時の選択画面の例である。図３で示した選択画面に加え、「中継項目もコンパイルする」のチェックボックスが加えられている。ここで、図１０に示すようなテキストデータが入力された場合を例に説明する。図４に示したデータと比較すると、ＩＤ「０００１」の名称「山田太郎」のレコード(行)が４行になっている。 FIG. 9 is an example of a selection screen at the time of input when compiling relay items. In addition to the selection screen shown in FIG. 3, a check box “Compile relay items” is also added. Here, a case where text data as shown in FIG. 10 is input will be described as an example. Compared with the data shown in FIG. 4, the record (row) of the name “Taro Yamada” with ID “0001” has four rows.

このテキストデータを入力し、上記の実施形態と同様性別をキー項目としてコンパイルすると、図１１(Ａ)に示すようなテーブルとなる。中継項目にはＩＤと名称とが含まれるが、これをさらにＦＡＳＴ構造にコンパイルする。 When this text data is input and compiled using gender as a key item as in the above embodiment, a table as shown in FIG. 11A is obtained. The relay item includes an ID and a name, which are further compiled into a FAST structure.

すなわち、中継項目に含まれる全項目を１つの項目と見立て、ユニークな項目値をソートする。この例では、ＩＤと名称との２つの項目を１つの項目と見立て、これらの値が共に一致するレコードは、同一と判断して項目値テーブルでは１つのレコードとする。すなわち、図１１(Ｂ)に示されるように、「０００１山田太郎」という項目値を持つ４つのレコードは、１つのレコードにまとめられる。 That is, all items included in the relay items are regarded as one item, and unique item values are sorted. In this example, two items of ID and name are regarded as one item, and records whose values match each other are determined to be the same and are regarded as one record in the item value table. That is, as shown in FIG. 11B, four records having the item value “0001 Taro Yamada” are combined into one record.

この変形例のように中継項目も圧縮(FAST構造へのコンパイル)した場合には、中継項目をコンパイルしない実施形態の方法よりコンパイルのための時間はかかるが、複数の項目について一回のみコンパイルすればよいため、全ての項目について１つずつコンパイルする場合と比較すれば時間を短縮でき、しかも、コンパイル後はデータ量が圧縮されるため、ハードディスクやメモリの消費量を削減することができる。 When relay items are also compressed (compiled to FAST structure) as in this modification, it takes more time to compile than the method of the embodiment in which relay items are not compiled, but a plurality of items are compiled only once. Therefore, the time can be shortened as compared with the case where all items are compiled one by one, and the amount of data is compressed after the compilation, so that the consumption of the hard disk and the memory can be reduced.

本発明の実施形態に係るデータ管理装置を含むコンピュータシステムを示すブロック図である。1 is a block diagram showing a computer system including a data management apparatus according to an embodiment of the present invention. 図１のデータ管理装置による処理の内容を示すフローチャートである。It is a flowchart which shows the content of the process by the data management apparatus of FIG. 図１のデータ管理装置のファイル入力時の選択画面を示す説明図である。It is explanatory drawing which shows the selection screen at the time of the file input of the data management apparatus of FIG. 図１のデータ管理装置に入力されるテキストファイルの内容を示す説明図である。It is explanatory drawing which shows the content of the text file input into the data management apparatus of FIG. 図１のデータ管理装置により保存されたデータの構造を示す説明図である。It is explanatory drawing which shows the structure of the data preserve | saved by the data management apparatus of FIG. 図１のデータ管理装置のファイル出力時の選択画面を示す説明図である。It is explanatory drawing which shows the selection screen at the time of the file output of the data management apparatus of FIG. 図１のデータ管理装置により操作されたデータの構造を示す説明図である。It is explanatory drawing which shows the structure of the data operated by the data management apparatus of FIG. 図１のデータ管理装置から出力されるテキストファイルの内容を示す説明図である。It is explanatory drawing which shows the content of the text file output from the data management apparatus of FIG. 本発明の実施形態の変形例におけるデータ管理装置のファイル入力時の選択画面を示す説明図である。It is explanatory drawing which shows the selection screen at the time of the file input of the data management apparatus in the modification of embodiment of this invention. 本発明の実施形態の変形例におけるデータ管理装置に入力されるテキストファイルの内容を示す説明図である。It is explanatory drawing which shows the content of the text file input into the data management apparatus in the modification of embodiment of this invention. 本発明の実施形態の変形例におけるデータ管理装置により保存されたデータの構造を示す説明図である。It is explanatory drawing which shows the structure of the data preserve | saved by the data management apparatus in the modification of embodiment of this invention.

Explanation of symbols

１システム
１０コンピュータ
１１ＣＰＵ
１２ＲＡＭ
１３インターフェイス
２０ＨＤ
２１データ管理プログラム
２１ａ入力受付機能
２１ｂコンパイル機能
２１ｃ表形式保存機能
２２ＤＢ
２２ａＦＡＳＴ構造データ
２２ｂ表形式データ
３０ディスプレイ
３１キーボード 1 System 10 Computer 11 CPU
12 RAM
13 Interface 20 HD
21 Data management program 21a Input reception function 21b Compile function 21c Tabular format storage function 22 DB
22a FAST structure data 22b tabular data 30 display 31 keyboard

Claims

Computer
An input acceptance procedure for accepting input of original data in a tabular format including multiple records each having an item value for multiple items,
For items used as data processing keys for the input original data, an item value table that is an array in which unique item values are sorted, and the item value in which the item values of the record are stored in the order in which the records are arranged Compile procedure to decompose and save the index value table which is an array that records the position in the table,
For items that are not used as data processing keys for the input original data, a table format storage procedure for storing data as a table format including a plurality of items in one record, and
Regarding the data saved in the table format saving procedure, all items saved in the table format are regarded as one item, an item value table that is an array in which unique item values are sorted, and the items of the record in the order in which the records are arranged. A data management method comprising: executing a second compile procedure that decomposes and stores an index value table that is an array that records positions in the item value table in which values are stored .

Computer
Input accepting means for accepting input of original data in a table format including a plurality of records each having an item value for a plurality of items;
For items used as data processing keys for the input original data, an item value table that is an array in which unique item values are sorted, and the item value in which the item values of the record are stored in the order in which the records are arranged A compiling means for disassembling and saving an index value table that is an array in which positions in the table are recorded;
With respect to items that are not used as data processing keys for the input original data, table format storage means for storing the data as tabular data including a plurality of items in one record, and
For the data saved by the tabular format saving means, all items saved in the tabular format are regarded as one item, an item value table that is an array in which unique item values are sorted, and the items of the record in the order in which the records are arranged A data management program that functions as a second compiling unit that decomposes and stores an index value table that is an array in which positions in the item value table in which values are stored are recorded .

An input receiving means for receiving input of original data in a tabular format including a plurality of records each having an item value for a plurality of items;
For items used as data processing keys for the input original data, an item value table that is an array in which unique item values are sorted, and the item value in which the item values of the record are stored in the order in which the records are arranged A compiling means for disassembling and storing an index value table that is an array in which positions in the table are recorded;
With respect to items that are not used as data processing keys for the input original data, a table format storage unit that stores data as a table format including a plurality of items in one record;
For the data saved by the tabular format saving means, all items saved in the tabular format are regarded as one item, an item value table that is an array in which unique item values are sorted, and the items of the record in the order in which the records are arranged A data management apparatus comprising: a second compiling unit that decomposes and stores an index value table that is an array in which positions in the item value table in which values are stored are recorded .