JPH06337762A

JPH06337762A - How to compress and restore database records

Info

Publication number: JPH06337762A
Application number: JP5126734A
Authority: JP
Inventors: Fumio Gomi; 文男五味; Yoshifumi Kawasaki; 良文川崎; Masahisa Horie; 正久堀江; Yuji Toyama; 雄司外山; Takenori Iwato; 丈典岩戸; Yoshifumi Nogami; 敬文野上
Original assignee: Hitachi Software Engineering Co Ltd; Hitachi Ltd
Current assignee: Hitachi Software Engineering Co Ltd; Hitachi Ltd
Priority date: 1993-05-28
Filing date: 1993-05-28
Publication date: 1994-12-06
Anticipated expiration: 2013-08-27
Also published as: JP2790594B2

Abstract

(57) [Abstract]
[Purpose] To raise the compression efficiency of database records, and to shorten the record length stored in the database file, even when runs of identical characters in the record data are interrupted.
[Constitution] A mask table file 22, in which initial values matching each data format are set for each record format, is created in advance. The record passing area 16 holding the record to be stored is XORed with the mask table resident area 18, so that every item whose value matches becomes zero ("00"). This greatly lengthens the runs of identical characters. The runs of identical characters are then replaced with compression control information, and the result is stored in the database file 19.

Description

Detailed Description of the Invention

[0001]

[Industrial Field of Application] The present invention relates mainly to a method of compressing and a method of restoring database records in a database management system.

[0002]

[Prior Art] In a conventional data compression method, when the same character occupies 3 or more consecutive bytes in a database record, the number of repetitions is counted. The run is then replaced, at the position where it occurred in the record, with 2 bytes of compression information: 1 byte for the character type and 1 byte for the repetition count. The database record is compressed in this way.

[0003] Related art of this kind includes, for example, Japanese Unexamined Patent Publication No. H4-348617.

[0004]

[Problem to Be Solved by the Invention] In the above prior art, however, data is compressed only where the same character occupies 3 or more consecutive bytes in the database record. When a run of identical characters is broken, for example because a 4-bit sign code is attached within an initial-value data item in the record, the compression efficiency does not improve.
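The problem can be seen in a minimal sketch. It assumes, purely for illustration, a record made of four 3-byte packed-decimal items that all hold zero with sign nibble C (bytes 00 00 0C); the helper name `longest_run` is hypothetical and does not come from the publication.

```python
def longest_run(data: bytes) -> int:
    """Return the length of the longest run of identical bytes in data."""
    best = current = 0
    previous = None
    for byte in data:
        current = current + 1 if byte == previous else 1
        best = max(best, current)
        previous = byte
    return best


# Four 3-byte packed-decimal items, each holding zero with sign nibble C
# (an assumed layout, chosen only to illustrate the broken runs).
record = bytes.fromhex("00000C" * 4)

# The sign byte interrupts every run, so no run reaches the 3-byte threshold.
print(longest_run(record))  # 2
```

Although the record consists entirely of initial values, the conventional 3-byte threshold finds nothing to compress.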

[0005] An object of the present invention is to raise the compression efficiency of database records, and to shorten the length of the database records stored in the database file, even when runs of identical characters in the record data are interrupted.

[0006]

[Means for Solving the Problem] To achieve the above object, the first means of the present invention is characterized by: creating a mask table set with initial values according to the data formats in the database record; taking the exclusive OR of the mask table and the data of each database record to be stored, one record at a time, thereby turning off the bits corresponding to the initial-value items in the record and producing runs of identical characters; replacing each run of identical characters in the record data, at its position, with compression information describing that run; and thereby compressing the record data into data consisting of the compression information and the remaining record data whose bits were not turned off.

[0007] The second means of the present invention is characterized by: reading a compressed database record stored in a database file on an external storage device; expanding each part of the read record that was replaced with compression information to the length of the run of identical characters; taking the exclusive OR with the same mask table that was used when the record data was compressed, thereby turning the corresponding bits back on; and restoring the database record as it was before compression.
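Restoration with the same mask works because exclusive OR with a fixed value is its own inverse. Writing $r$ for the record data and $m$ for the mask table (symbols introduced here only for illustration), the following holds bit by bit:

```latex
\begin{align*}
(r \oplus m) \oplus m &= r \oplus (m \oplus m) = r \oplus 0 = r, \\
r_i \oplus m_i = 0 &\iff r_i = m_i .
\end{align*}
```

The second line is why items that still hold their initial value turn into zeros, while any other item keeps enough information to be recovered exactly.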

[0008] The mask table is created by using, as is, the source program of the record-format definition part that is used in writing the application program that processes the database records, on the basis of initial-value data appropriate to the language.

[0009]

[Operation] According to the above means, a mask table is created that is set with initial values according to the data formats in the database record, or with arbitrary character strings that occur frequently. The data of each database record to be stored, one record at a time, is XORed with the mask table, which turns off the bits corresponding to the initial-value items in the record and produces runs of identical characters. Each run of identical characters in the record data is replaced, at its position, with compression information describing that run, and the record data is compressed into data consisting of the compression information and the remaining record data whose bits were not turned off. The compressible range is therefore wider than before, the compression efficiency of the record data improves, and the length of the database records stored in the database file can be shortened.

[0010] To restore the database record as it was before compression, the compressed database record stored in the database file on the external storage device is read, and each part of the read record that was replaced with compression information is expanded to the length of the run of identical characters. The result is XORed with the same mask table that was used at compression, turning the corresponding bits back on. The database record as it was before compression can thus be restored easily.

[0011] Furthermore, the mask table is created by using, as is, the source program of the record-format definition part that is used in writing the application program that processes the database records, on the basis of initial-value data appropriate to the language. Mismatches between that application program and the database file can therefore be prevented.

[0012]

[Embodiment] An embodiment of the present invention is described in detail below with reference to the drawings.

[0013] FIG. 1 is a block diagram showing the overall configuration of a database management system that implements the present invention, and FIG. 2 is a block diagram showing the hardware configuration of an apparatus for implementing the database management system of FIG. 1.

[0014] In FIGS. 1 and 2, 10 is a database management system, 11 is an application program, 19 is a database file, 20 is a record definition management file, 21 is a compiler, 22 is a mask table file, 31 is an input/output device, 32 is a central processing unit (CPU), 33 is a main storage device, and 34 is an external storage device.

[0015] As shown in FIG. 1, the database management system 10, which executes the data compression method of this embodiment, operates in response to input/output requests from the application program 11 for database records of the database file 19. It consists of a controller 12 that controls the inside of the database management system, a data compression unit 13 that compresses database records, a data restoration unit 14 that restores database records, and a mask table loading unit 15 that reads in the mask table file 22.

[0016] The mask table file 22 is created by the compiler 21 from the record definition management file 20. When the database management system 10 starts, the mask table loading unit 15 loads the contents of the file into the mask table resident area 18, where they remain resident.

[0017] FIG. 3 uses an example of a record format set in the record definition management file 20, which defines the record format and data formats of database records, to show the procedure by which the record format is input to the compiler 21, converted into object-format data, and used to create the mask table 23 in the mask table file 22.

[0018] In the example of FIG. 3, the compiler sets the mask table to the form 00...0C for the internal decimal format (packed decimal data) 20a, F0F0... for the external decimal format (zoned decimal data) 20b, 4040... for the character format (character string data) 20c, and A1A1... for the kanji format (Japanese string data) 20d.
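The mapping from data formats to mask bytes can be sketched as follows. The fill patterns come from the example of FIG. 3; the field-list representation, the format names, and the function `build_mask` are assumptions made for illustration and are not the compiler of the embodiment.

```python
# Fill bytes per data format, taken from the example of FIG. 3.
FILL = {
    "zoned": 0xF0,  # external decimal format: F0F0...
    "char": 0x40,   # character format: 4040...
    "kanji": 0xA1,  # kanji format: A1A1...
}


def build_mask(fields: list[tuple[str, int]]) -> bytes:
    """Concatenate the initial-value pattern of each (format, length) item."""
    mask = bytearray()
    for fmt, length in fields:
        if fmt == "packed":
            # Internal decimal format: zero digits, sign nibble C in the last byte.
            mask += bytes(length - 1) + b"\x0c"
        else:
            mask += bytes([FILL[fmt]]) * length
    return bytes(mask)


# Hypothetical record layout: one item of each format.
layout = [("packed", 3), ("zoned", 4), ("char", 5), ("kanji", 4)]
print(build_mask(layout).hex())  # 00000cf0f0f0f04040404040a1a1a1a1
```

Because the mask is derived from the same field list that describes the record, the mask and the record layout cannot drift apart, which is the point made about reusing the record-format definition.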

[0019] In this way, a mask table can be created easily by inputting to the compiler the record format set in the record definition management file 20, which defines the record format and data formats of database records.

[0020] In addition, by using, as is, the source program of the record-format definition part that is used in writing the application program that processes the database records, and by setting initial values appropriate to the language, mismatches between that application program and the database file can be prevented.

[0021] FIG. 4 illustrates the data compression process, using an example of a database record (hereinafter abbreviated as "record") that the application program 11 has requested to be stored.

[0022] When the record data 16a set in the record passing area 16, which is allocated in the application program 11, is XORed with the mask table 23, every data item that has the same value in the record data 16a and in the mask table 23 cancels to zero. The result is the masked record data 16b in the record passing area 16.
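A minimal sketch of this masking step follows. The record layout, the sample values, and the name `xor_with_mask` are hypothetical; the text bytes assume EBCDIC (C1 and C2 for "A" and "B", 40 for a blank), which matches the fill patterns of FIG. 3 but is not stated in the publication.

```python
def xor_with_mask(record: bytes, mask: bytes) -> bytes:
    """XOR each record byte with the mask byte at the same offset."""
    if len(record) != len(mask):
        raise ValueError("record and mask must have the same length")
    return bytes(r ^ m for r, m in zip(record, mask))


# Hypothetical layout: 3-byte packed decimal, 4-byte zoned decimal, 4-byte text.
mask = bytes.fromhex("00000c" "f0f0f0f0" "40404040")
# The two numeric items hold their initial values; the text item holds "AB"
# followed by two blanks (EBCDIC assumed).
record = bytes.fromhex("00000c" "f0f0f0f0" "c1c24040")

masked = xor_with_mask(record, mask)
print(masked.hex())  # 0000000000000081820000
```

The seven bytes of the two numeric items, which contained no run of 3 or more identical bytes before masking, become a single run of zeros, and the trailing blanks of the text item become zeros as well.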

[0023] Then, based on the masked record data 16b, each run of identical characters is replaced with compression control information (character and length), and non-compression control information (length) is placed at the head of each non-repeating part. The resulting compressed record data 17a is set in the record input/output area 17 and stored in the database file 19.
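The replacement step can be sketched as below. The publication states only that compression control information carries a character and a length and that non-compression control information carries a length; the 1-byte header with a high-bit flag, the 127-byte segment limit, and the name `encode` are assumptions introduced here so that the two kinds of control information can be told apart.

```python
RUN_FLAG = 0x80  # assumed: high bit of the header marks compression control information
MAX_LEN = 127    # assumed: the low 7 bits of the header hold the segment length


def encode(masked: bytes, min_run: int = 2) -> bytes:
    """Replace runs of identical bytes with (header, character) pairs."""
    out = bytearray()
    literal = bytearray()

    def flush_literal() -> None:
        # Emit pending non-repeating bytes as (length header, data) segments.
        for start in range(0, len(literal), MAX_LEN):
            chunk = literal[start:start + MAX_LEN]
            out.append(len(chunk))
            out.extend(chunk)
        literal.clear()

    i = 0
    while i < len(masked):
        # Measure the run of identical bytes starting at i (capped at MAX_LEN).
        j = i
        while j < len(masked) and masked[j] == masked[i] and j - i < MAX_LEN:
            j += 1
        if j - i >= min_run:
            flush_literal()
            out.append(RUN_FLAG | (j - i))  # compression control: length ...
            out.append(masked[i])           # ... and character
        else:
            literal.extend(masked[i:j])
        i = j
    flush_literal()
    return bytes(out)


masked = bytes.fromhex("0000000000000081820000")
print(encode(masked).hex())  # 87000281828200
```

In this example an 11-byte masked record shrinks to 7 bytes. The default `min_run` of 2 follows the 2-byte threshold given for step 130 of the flowchart.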

[0024] Next, FIG. 5 uses the example record from the description of FIG. 4 to illustrate the process of reading the compressed record data 17a stored in the database file 19, restoring the compressed parts to the contents of the original record data 16a, and passing the result to the application program 11.

[0025] The compressed record data 17a read from the database file 19 into the record input/output area 17 is processed according to its compression control information and non-compression control information. For a compressed run of identical characters, the character in the compression control information is expanded to the stated length. Otherwise, the uncompressed data of the length given in the non-compression control information is copied as is. In both cases the output is set in the record passing area 16.

[0026] That is, the compressed record data 17a is converted according to the compression control information and the non-compression control information, and is set in the record passing area 16 as the converted record data 16c. The converted record data 16c at this point has the same contents as the masked record data 16b.
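The expansion can be sketched as follows. The header layout (high bit set for compression control information, low 7 bits for the length) is an assumption made for illustration, since the publication does not give the byte layout of the control information; the name `expand` is likewise hypothetical.

```python
RUN_FLAG = 0x80  # assumed: high bit of the header marks compression control information
LEN_MASK = 0x7F  # assumed: the low 7 bits of the header hold the length


def expand(packed: bytes) -> bytes:
    """Rebuild the masked record from control information and literal data."""
    out = bytearray()
    i = 0
    while i < len(packed):
        header = packed[i]
        length = header & LEN_MASK
        if header & RUN_FLAG:
            # Compression control information: repeat the character `length` times.
            out.extend(packed[i + 1:i + 2] * length)
            i += 2
        else:
            # Non-compression control information: copy `length` bytes unchanged.
            out.extend(packed[i + 1:i + 1 + length])
            i += 1 + length
    return bytes(out)


packed = bytes.fromhex("87000281828200")
print(expand(packed).hex())  # 0000000000000081820000
```

The output still contains zeros wherever the record held its initial values; a final exclusive OR with the mask table is needed to recover the original record data.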

[0027] When the converted record data 16c is then XORed with the mask table 23 that was used at storage time, the mask table values are put back into every zero-valued part of the converted record data 16c. As shown by the restored record data 16d in the record passing area 16, the contents of the record data 16a before compression are recovered, and the restored record data 16d is passed to the application program 11.

[0028] FIG. 6 is a flowchart showing the data compression procedure of this embodiment, which is described with reference to FIG. 4.

[0029] The corresponding mask table 23 is determined from the contents of the record data 16a in the record passing area 16 that the application program 11 has requested to be stored (step 100). Using the mask table 23 determined here, an exclusive OR is executed on the contents of the record data 16a in the record passing area 16, 256 bytes per instruction (step 110).
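Step 110 can be sketched as a loop over 256-byte blocks. In this sketch one wide-integer XOR per block stands in for the single machine instruction of the embodiment; the function name `xor_in_chunks` and the requirement that the mask be at least as long as the record are assumptions.

```python
CHUNK = 256  # the embodiment XORs 256 bytes per instruction


def xor_in_chunks(record: bytes, mask: bytes) -> bytes:
    """XOR record with mask, one block of up to 256 bytes at a time."""
    if len(mask) < len(record):
        raise ValueError("mask must cover the whole record")
    out = bytearray(len(record))
    for start in range(0, len(record), CHUNK):
        end = min(start + CHUNK, len(record))
        # One wide XOR per block stands in for the single instruction.
        a = int.from_bytes(record[start:end], "big")
        b = int.from_bytes(mask[start:end], "big")
        out[start:end] = (a ^ b).to_bytes(end - start, "big")
    return bytes(out)
```

A record longer than 256 bytes therefore needs several such operations, and the final block may be shorter than 256 bytes.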

[0030] Based on the record data 16b in the record passing area 16 that results from this operation, runs of identical characters are searched for, 256 bytes per instruction (step 120). When a run of 2 or more identical bytes is detected (step 130), the length of the run is obtained (step 140), compression control information is created from the character and the length, and it is set in the record input/output area 17 (step 150).

[0031] When there is no run of identical characters (step 130), non-compression control information is created from the length of the non-repeating part, and the uncompressed data is set in the record input/output area 17 (step 160).

[0032] Steps 120 through 160 are then repeated until the search of the masked record data 16b is complete. When the search ends, the compressed record data 17a is set in the record input/output area, its contents are stored in the database file 19 (step 170), and control returns to the requester.

[0033] FIG. 7 is a flowchart showing the data restoration procedure of this embodiment, which is described with reference to FIG. 5.

[0034] In response to a database record read request from the application program 11, the compressed record data 17a is read from the database file 19 into the record input/output area 17 (step 200), and the corresponding mask table is determined from its contents (step 210). The compressed record data 17a in the record input/output area 17 is then scanned from the beginning for compression control information or non-compression control information (step 220). In the case of compression control information (step 230), the character in the compression control information is expanded to the stated length and set in the record passing area 16 (step 240).

[0035] In the case of non-compression control information (step 230), the uncompressed data of the length given in the non-compression control information is set in the record passing area 16 as is.

[0036] Steps 220 through 250 are repeated until the end of the compressed record data 17a in the record input/output area 17 is reached. When the scan ends, an exclusive OR is executed, 256 bytes per instruction, on the converted record data 16c in the record passing area 16, using the mask table 23 determined earlier (step 260). The result is the restored record data 16d in the record passing area 16.

[0037] The restored record data 16d is identical to the original record data 16a, and it is passed to the requester when control returns.

[0038] As described above, according to the present invention, in a large-scale database system with many records and many data items per record, when a database record contains many initial-value data items such as internal decimal items, all of those data items are compressed. The database record length is therefore greatly shortened, and the capacity of the external storage device that stores the database file is greatly reduced.

[0039] As a result, the data transfer time when storing and reading database records is also greatly reduced.

[0040] Restoring a compressed database record to its form before compression is also easy: the processing steps performed for record compression are simply carried out in reverse.

[0041] In addition, by using, as is, the source program of the record-format definition part that is used in writing the application program that processes the database records, and by setting initial values appropriate to the language, mismatches between that application program and the database file can be prevented.

[0042]

[Effects of the Invention] As described above, according to the present invention, in a large-scale database system with many records and many data items per record, when a database record contains many initial-value data items such as internal decimal items, all of those data items are compressed. The database record length is therefore greatly shortened, and the capacity of the external storage device that stores the database file is greatly reduced.

[0043] Furthermore, the data transfer time when storing and reading database records is also greatly reduced.

[Brief description of drawings]

[FIG. 1] A block diagram showing the overall configuration of a database management system that implements the present invention.

[FIG. 2] A block diagram showing the configuration of the hardware that implements the present invention.

[FIG. 3] A diagram illustrating the mask table creation procedure of this embodiment.

[FIG. 4] A diagram illustrating the data compression process of this embodiment.

[FIG. 5] A diagram illustrating the data restoration process of this embodiment.

[FIG. 6] A flowchart showing the data compression procedure of this embodiment.

[FIG. 7] A flowchart showing the data restoration procedure of this embodiment.

[Explanation of symbols]

10: database management system, 11: application program, 12: controller, 13: data compression unit, 14: data restoration unit, 15: mask table loading unit, 16: record passing area, 16a: record data, 16b: masked record data, 16c: converted record data, 16d: restored record data, 17: record input/output area, 17a: compressed record data, 18: mask table resident area, 19: database file, 20: record definition management file, 21: compiler, 22: mask table file, 23: mask table, 31: input/output device, 32: central processing unit, 33: main storage device, 34: external storage device.

Continuation of front page
(72) Inventor: Yoshifumi Kawasaki, c/o Hitachi Software Engineering Co., Ltd., 6-81 Onoe-cho, Naka-ku, Yokohama-shi, Kanagawa
(72) Inventor: Masahisa Horie, c/o Hitachi Software Engineering Co., Ltd., 6-81 Onoe-cho, Naka-ku, Yokohama-shi, Kanagawa
(72) Inventor: Yuji Toyama, c/o Hitachi Software Engineering Co., Ltd., 6-81 Onoe-cho, Naka-ku, Yokohama-shi, Kanagawa
(72) Inventor: Takenori Iwato, c/o Software Development Headquarters, Hitachi, Ltd., 5030 Totsuka-cho, Totsuka-ku, Yokohama-shi, Kanagawa
(72) Inventor: Takafumi Nogami, c/o Software Development Headquarters, Hitachi, Ltd., 5030 Totsuka-cho, Totsuka-ku, Yokohama-shi, Kanagawa

Claims

[Claims]

1. A method of compressing a database record in a database management system, characterized by: creating, for database records having a fixed record format, a mask table set with initial values according to the data formats in the database record; taking the exclusive OR of the mask table and the data of each database record to be stored, one database record at a time; turning off the bits corresponding to the initial-value items in the database record to produce runs of identical characters; replacing each run of identical characters in the data of the database record, at its position, with compression information describing that run; and compressing the data of the database record into database record data consisting of the compression information and the remaining database record data whose bits were not turned off.

2. A method of restoring a database record compressed by the method of compressing a database record according to claim 1, characterized by: reading the compressed database record from a database file on an external storage device in which the compressed database record is stored; expanding the identical characters, based on the compression information, at each position in the read database record that was replaced with compression information; taking the exclusive OR with the same mask table as was used when the data of the database record was compressed, thereby turning the corresponding bits on; and restoring the database record as it was before data compression.

3. The method of compressing or restoring a database record according to claim 1 or 2, characterized in that the mask table is created by using, as is, the source program of the record-format definition part that is used in writing the application program that processes the database record, on the basis of initial-value data appropriate to the language.