JP3186530B2

JP3186530B2 - Method for compressing and decompressing computer data

Info

Publication number: JP3186530B2
Application number: JP21069495A
Authority: JP
Inventors: 毅山本
Original assignee: Sumitomo Metal Industries Ltd
Current assignee: Nippon Steel Corp
Priority date: 1995-08-18
Filing date: 1995-08-18
Publication date: 2001-07-11
Anticipated expiration: 2015-08-18
Also published as: JPH0964752A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to a method for storing computer data on an external storage medium such as a magnetic disk or magnetic tape, or to a method for transferring data between computers, and in particular to a data compression and decompression method for improving storage efficiency and transfer efficiency.

[0002]

[Problems to be solved by the invention] Conventional data compression and decompression methods, as disclosed for example in Japanese Patent Application Laid-Open Nos. 5-260097 and 3-162134, perform continuous character compression. When the same 1-byte or 2-byte code appears repeatedly in the communication data to be compressed during data transfer between computers, the repeated portion is represented by a control code indicating compression, a count byte giving the number of repetitions, and the repeated code itself. On decompression, when the compression control code is detected, the code is repeated the number of times given by the count byte to restore the original data.

[0003] In addition, because the check for repeated codes searches the data string sequentially in 1-byte or 2-byte units, the utilization of the computer's central processing unit rises, which offsets the benefit of data compression.

[0004] As described above, conventional data compression and decompression methods focus on continuous character compression within a record. Depending on the type of data, they achieve only a low compression ratio, and because they detect runs of the same code by a sequential search of the data, they increase the load on the computer's CPU. Both factors narrow their range of application.

[0005] In general, data compression and decompression incur high CPU utilization because they search the data sequentially, character by character. Unless the compression ratio is raised substantially, computer resources are therefore wasted, and the goal of reducing computer running costs cannot be achieved.

[0006] The present invention solves these problems. Its object is to provide a data compression and decompression method that, in order to raise the data compression ratio, has compression means that takes into account the fact that data is also redundant between records, efficient character search means that performs the continuous character determination and the inter-record identical character string determination in two stages, means for preventing double compression of compressed data, and the like.

[0007] That is, the object of the computer data compression method according to claim 1 of the present application is to provide a data compression method for computer data that, in order to improve the data compression ratio, takes into account the fact that data is redundant not only within the same record but also between different records.

[0008] A further object of the computer data compression method according to claim 1 is to provide a data compression method for computer data that, in order to improve the efficiency of the compression processing, includes efficient character search means that performs the continuous character determination and the inter-record identical character string determination in two stages.

[0009] The object of the computer data compression methods according to claims 2, 3, and 4 is to provide a data compression method for computer data that includes means for preventing double compression of compressed data.

[0010]

[0011]

[Means for solving the problems] The computer data compression method according to claim 1 of the present application is a data compression method that compresses data to be stored on an external storage medium of a computer, in accordance with an external instruction parameter, before the data is stored on the external storage medium, and a data compression method that compresses communication data in data transfer processing between computers, in accordance with an external instruction parameter, before the data is transferred. The method includes a step of performing first compression processing, which compresses identical character strings between different records, and a step of performing second compression processing, which compresses continuous character strings within the same record.

[0012] The step of performing the first compression processing includes a step of determining, as a first stage, identical character strings between records in units of a plurality of bytes; a step of determining, as a second stage, identical character strings between records in units of one byte if the result of the first determination is a mismatch; and a step of compressing the identical character string if the determination finds that one exists. The step of performing the second compression processing further includes a step of determining, as a first stage, continuous character strings within the same record in units of a plurality of bytes; a step of determining, as a second stage, continuous character strings within the same record in units of one byte if the result of the first determination is a mismatch; and a step of compressing the continuous character string if the determination finds that one exists.
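The two-stage determination for the first (inter-record) compression processing can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function name `two_stage_match_length` and its parameters are hypothetical, and the sketch assumes that the two records are compared at identical byte offsets.

```python
def two_stage_match_length(prev: bytes, cur: bytes, start: int, block: int = 10) -> int:
    """Return how many bytes of `cur` equal `prev` at the same offsets, from `start`.

    Hypothetical helper: stage 1 compares whole blocks of `block` bytes,
    stage 2 falls back to single-byte comparison after a block mismatch.
    """
    limit = min(len(prev), len(cur))
    pos = start
    # Stage 1: advance one block at a time while the blocks are identical.
    while pos + block <= limit and cur[pos:pos + block] == prev[pos:pos + block]:
        pos += block
    # Stage 2: a block differed (or fewer than `block` bytes remain),
    # so locate the exact end of the match byte by byte.
    while pos < limit and cur[pos] == prev[pos]:
        pos += 1
    return pos - start
```

For two records that share their first 25 bytes, the sketch performs two successful block comparisons and then five single-byte comparisons before it stops at the first differing byte.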

[0013] The computer data compression method according to claim 2 is the computer data compression method according to claim 1, further including a step of adding a predetermined code to the leading plurality of bytes of the first record of data that has undergone the first compression processing or the second compression processing.

[0014] The computer data compression method according to claim 3 is the computer data compression method according to claim 2, further including a step of examining the leading plurality of bytes of the first record of the data to be compressed, executing the compression processing if they are not the predetermined code, and aborting the compression processing if they are the predetermined code.

[0015] The computer data compression method according to claim 4 is the computer data compression method according to claim 3, wherein the plurality of bytes is 4 bytes and the predetermined code is ALL "1F".

[0016]

[0017]

[0018]

[Embodiments of the invention] Embodiments of the present application are described in detail below with reference to the drawings.

[0019] First, an embodiment of the compression method is described. Referring to FIG. 1, the arguments used when an application program requesting data compression calls the compression processing program, in the computer data compression method according to claims 1 to 5 of the present application, are described. Compression is requested by specifying, according to the characteristics of the data, the category (C1) indicating whether inter-record compression and continuous character compression are required, together with the data to be compressed (C5).

[0020] Referring to FIG. 3, a block flowchart showing the processing procedure of the main routine of the compression processing program, in the computer data compression method according to claims 1 to 4 of the present application, is described. According to the compression necessity category (C1) specified by the application program, inter-record compression processing (S3 to S5), continuous character compression processing (S7 to S9), or both compression processes (S10 to S15) are repeated until the end of the data to be compressed (C5). In the main routine, a 3-byte check (S3 or S10) is performed for inter-record compression and a 4-byte check (S7 or S13) for continuous character compression. In each case, when the match is longer than the number of bytes needed to store the compression control code (described later with FIGS. 6 and 7), the subroutine that performs inter-record compression or continuous character compression is called and compression is carried out. When the inter-record or continuous character comparison in a subroutine results in a mismatch, control returns to the main routine, and the characters are transferred to the output buffer (C6) in compressed form.

[0021] At the start of the main routine of the compression processing program, it is determined whether this is the first record to be compressed (S30). If it is, it is determined whether the leading 4 bytes of the data to be compressed (C5) are ALL "1F" (S31). If they are ALL "1F", the data has already been compressed (double compression), so error processing (S33) is performed and processing ends. If they are not ALL "1F", the data is normal, so ALL "1F" (4 bytes), which indicates compressed data, is transferred to the output buffer (C6). "1F" is used on the premise that it rarely occurs in ordinary data and that, even when the data is treated as binary numbers, a run of four consecutive "1F" bytes is almost nonexistent.
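The double-compression guard of steps S30 to S33 can be sketched as below. This is only an illustration under stated assumptions: the names `MARKER`, `DoubleCompressionError`, and `begin_compressed_output` are hypothetical, and raising an exception stands in for the error processing (S33), which the text does not detail.

```python
MARKER = bytes([0x1F]) * 4  # ALL "1F" over 4 bytes, as described in the text


class DoubleCompressionError(ValueError):
    """Hypothetical stand-in for the error processing of step S33."""


def begin_compressed_output(first_record: bytes) -> bytearray:
    """Check the first record and start the output buffer with the marker."""
    # S31: data that already starts with the marker is treated as compressed.
    if first_record[:4] == MARKER:
        raise DoubleCompressionError("input already carries the compression marker")
    # Otherwise the marker is written ahead of the compressed data.
    return bytearray(MARKER)
```

A caller would invoke `begin_compressed_output` once, for the first record only, and append the compressed records to the returned buffer.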

[0022] Referring to FIGS. 4 and 5, block flowcharts showing the processing procedures of the inter-record compression subroutine and the continuous character compression subroutine, in the computer data compression method according to claims 1 to 4 of the present application, are described. To limit the utilization of the computer's central processing unit during the comparisons, the comparison is first made over a large number of bytes, a 10-byte check (S17) for inter-record compression and a 5-byte check (S24) for continuous character compression, and a 1-byte check (S20 and S27) is performed only when that comparison results in a mismatch. When the 1-byte check results in a mismatch, the compressed character, the compression control code, and the number of compressed characters are output to the output buffer (ARG-6) (S22 and S29). The identical character string and continuous character string determinations are first made over multiple bytes in order to reduce the number of determinations. The values of 5 bytes and 10 bytes are empirical, and CPU performance improved in the performance evaluation of the compression processing. If these values are made too large, the multi-byte determination usually results in a mismatch, the processing ends up comparing byte by byte, and the reduction in the number of determinations is lost.
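The effect of the 5-byte check followed by the 1-byte check can be illustrated with a small sketch that also counts comparison operations. The function name `two_stage_run_length` and the returned comparison counter are assumptions made for this illustration; the patent does not specify how the checks are implemented.

```python
def two_stage_run_length(record: bytes, start: int, block: int = 5):
    """Return (run_length, comparisons) for the run of record[start].

    Hypothetical sketch: blocks of `block` bytes are tested first, and
    single bytes are tested only after a block mismatch or near the end.
    """
    ch = record[start]
    pattern = bytes([ch]) * block
    pos = start
    comparisons = 0
    # Stage 1: one comparison covers `block` bytes at a time.
    while pos + block <= len(record):
        comparisons += 1
        if record[pos:pos + block] != pattern:
            break
        pos += block
    # Stage 2: single-byte comparisons find the exact end of the run.
    while pos < len(record):
        comparisons += 1
        if record[pos] != ch:
            break
        pos += 1
    return pos - start, comparisons
```

For a run of 23 identical bytes followed by one different byte, this sketch needs 8 comparison operations, whereas a purely byte-wise scan would need 24, which is the kind of saving the paragraph describes.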

[0023] Referring to FIGS. 6 and 7, the processing of the inter-record compression subroutine and the continuous character compression subroutine, in the computer data compression method according to claims 1 to 4 of the present application, is outlined. FIGS. 6 and 7 supplement the block flowcharts of FIGS. 4 and 5.

[0024] FIGS. 6 and 7 also illustrate how the compression control codes are stored. For inter-record compression, the stored form consists of 2 bytes: the compression control code (1 byte) and the number of compressed characters (1 byte). For continuous character compression, it consists of 3 bytes: the compressed character (1 byte), the compression control code (1 byte), and the number of compressed characters (1 byte).
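The two storage layouts can be written down as small encoder helpers. This is a sketch under assumptions: the helper names are hypothetical, and the control code values 0x1E (inter-record) and 0x1F (continuous character) are taken from the later description of the worked example and the decompression routines.

```python
INTER_RECORD_CODE = 0x1E  # control code for inter-record compression
RUN_CODE = 0x1F           # control code for continuous character compression


def inter_record_token(count: int) -> bytes:
    """2-byte form: control code, then the number of compressed characters."""
    if not 1 <= count <= 255:
        raise ValueError("count must fit in one byte")
    return bytes([INTER_RECORD_CODE, count])


def run_token(char: int, count: int) -> bytes:
    """3-byte form: compressed character, control code, then the count."""
    if not 1 <= count <= 255:
        raise ValueError("count must fit in one byte")
    return bytes([char, RUN_CODE, count])


def saves_space(token: bytes, covered: int) -> bool:
    """True when the token is shorter than the bytes it replaces."""
    return len(token) < covered
```

The break-even points follow directly from the token sizes: a 2-byte token pays off from 3 matching bytes, and a 3-byte token from a run of 4, which matches the 3-byte and 4-byte checks of the main routine.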

[0025] The inter-record compression processing is outlined with reference to the data of the n-th and (n+1)-th records shown in FIG. 6. In (1) the leading 3 bytes are compared, and since they match in (2), inter-record compression is performed. In (3) the leading 10 bytes are compared, and since they do not match in (4), a 1-byte check is performed. In (5) the second 1-byte check results in a mismatch, and in (6) the inter-record compression control code and the character count ("1E" 4) are output. In (7) continuous character compression is attempted from the 5th byte: 4 bytes starting at the 5th byte are compared. Since they do not match in (8), a 1-byte check is performed, and in (9) the fourth 1-byte check results in a mismatch. In (10), because neither inter-record compression nor continuous character compression is possible, ddd is transferred to the output buffer as is. The same processing as (1) to (10) is applied to hii in bytes 8 to 10. In (11) processing returns to (1) and the leading 3 bytes are compared; since they match in (12), inter-record compression is performed. In (13) 10 bytes starting at the 11th byte are compared, and since they match in (14), 10 bytes starting at the 21st byte are compared. Since these do not match in (15), a 1-byte check is performed from the 21st byte, and in (16) the tenth 1-byte check results in a mismatch. In (17) the inter-record compression control code and the character count ("1E" 19) are output. In (18) the remaining 1 byte is checked in the same way and, as a result, is output uncompressed. The initial comparison uses 3 bytes because the inter-record compression control code and count occupy 2 bytes, so no compression gain is obtained unless at least 3 bytes match.
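The walk-through above can be reproduced with a simplified sketch. Several things here are assumptions rather than details from the patent: the function names are hypothetical, the two 30-byte records are invented so that they behave like the FIG. 6 example (only "ddd" and "hii" come from the text), the count of a run is taken to be the total run length, matching is done byte by byte instead of in two stages, and escaping of control-code bytes and the leading marker are left out.

```python
INTER, RUN = 0x1E, 0x1F  # control codes named in the text


def match_len(prev: bytes, cur: bytes, pos: int) -> int:
    """Bytes of `cur` equal to `prev` at the same offsets, starting at `pos`."""
    n = 0
    while pos + n < min(len(prev), len(cur)) and cur[pos + n] == prev[pos + n]:
        n += 1
    return n


def run_len(cur: bytes, pos: int) -> int:
    """Length of the run of identical bytes starting at `pos`."""
    n = 1
    while pos + n < len(cur) and cur[pos + n] == cur[pos]:
        n += 1
    return n


def compress_record(prev: bytes, cur: bytes) -> bytes:
    """Simplified sketch; assumes `cur` contains no 0x1E, 0x1F or 0x0C bytes."""
    out = bytearray()
    pos = 0
    while pos < len(cur):
        m = min(match_len(prev, cur, pos), 255)   # count must fit in one byte
        if m >= 3:                                # 2-byte token pays off from 3
            out += bytes([INTER, m])
            pos += m
            continue
        r = min(run_len(cur, pos), 255)
        if r >= 4:                                # 3-byte token pays off from 4
            out += bytes([cur[pos], RUN, r])
            pos += r
            continue
        out.append(cur[pos])                      # literal byte, copied as is
        pos += 1
    return bytes(out)


# Invented records shaped like the example: 4 shared bytes, "ddd", "hii",
# 19 shared bytes, and one differing final byte.
prev_record = b"abcd" + b"eee" + b"fgg" + b"0123456789ABCDEFGHI" + b"J"
cur_record = b"abcd" + b"ddd" + b"hii" + b"0123456789ABCDEFGHI" + b"K"
compressed = compress_record(prev_record, cur_record)
```

With these invented records the 30-byte record shrinks to 11 bytes: the token ("1E" 4), the six literal bytes of "dddhii", the token ("1E" 19), and the final literal byte.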

[0026] Special handling during compression is described with reference to FIG. 8. If a data byte coincides with a compression control code ("1E" or "1F"), the decompression process could no longer identify the control codes, so an auxiliary code ("0C") is inserted to make the distinction possible. That is, when data identical to a compression control code ("1E", "1F") or to the auxiliary code ("0C") is output, the auxiliary code ("0C") is placed one byte before it. Referring to FIG. 9, the control code storage scheme is described for the case where, in inter-record or continuous character compression, the number of compressed characters exceeds 255, the maximum that the one-byte count field can hold. A single compression covers at most 255 bytes; a compression longer than 255 bytes is output in units of 255 bytes.
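Both special cases are easy to express as small helpers. The names `escape_literal` and `split_count` are hypothetical, and the sketch only shows how a long count might be divided into units of at most 255; the exact token layout for such long matches is shown in FIG. 9 and is not assumed here.

```python
AUX = 0x0C                     # auxiliary code
RESERVED = (0x1E, 0x1F, AUX)   # bytes that need the auxiliary code in front


def escape_literal(byte: int) -> bytes:
    """Prefix a literal data byte with the auxiliary code when it is reserved."""
    if byte in RESERVED:
        return bytes([AUX, byte])
    return bytes([byte])


def split_count(total: int, max_count: int = 255) -> list:
    """Split a long match into counts that each fit in the one-byte field."""
    counts = []
    while total > 0:
        step = min(total, max_count)
        counts.append(step)
        total -= step
    return counts
```

For example, a match of 600 bytes would be divided into counts of 255, 255, and 90 under this sketch.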

[0027] Next, an embodiment of the decompression method is described. Referring to FIG. 2, the arguments used when an application program requesting data decompression calls the decompression processing program, in the computer data decompression method, are described. Decompression is requested by specifying the decompression conditions (E1), the data to be decompressed (E5), and so on.

[0028] Referring to FIGS. 10 and 11, the processing procedure of the main decompression routine in the computer data decompression method is described. The data to be decompressed (E5) is searched for the compression control codes ("1E" or "1F"), and inter-record decompression or continuous character decompression is performed accordingly.

[0029] Referring to FIG. 10, the inter-record decompression processing is described. When the compression control code "1E" appears, decompression is performed as follows. Once the compression control code is detected, the next byte is interpreted as the character count in binary representation. That number of characters is then transferred from the previous record (ARG-4) to the output buffer (ARG-6).
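A minimal sketch of this step follows. The function name `expand_inter_record` is hypothetical, and the sketch assumes that the characters are copied from the already decompressed previous record at the same offset as the current output position, which is consistent with the compression example but is not spelled out in this paragraph.

```python
def expand_inter_record(prev_record: bytes, out: bytearray, count: int) -> None:
    """Copy `count` bytes from the previous record into the output buffer.

    Assumption: the source offset in `prev_record` equals the number of
    bytes already written to `out` for the current record.
    """
    start = len(out)
    out += prev_record[start:start + count]
```

Called with an output buffer that already holds two bytes and a count of 3, the sketch appends bytes 3 to 5 of the previous record.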

[0030] Referring to FIG. 11, the continuous character decompression processing is described. When the compression control code "1F" appears, decompression is performed as follows. Once the compression control code is detected, the next byte is interpreted as the character count in binary representation. Next, the character most recently output to the output buffer (ARG-6) at that point is recognized as the compressed character. The compressed character is then transferred to the output buffer (ARG-6) for that number of characters.
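This step can be sketched as below. The name `expand_run` is hypothetical, and one point is an assumption that the text leaves open: the count is treated as the total run length, so the sketch appends `count - 1` further copies because the compressed character itself has already been written to the output buffer.

```python
def expand_run(out: bytearray, count: int) -> None:
    """Repeat the last output byte so that the run totals `count` bytes.

    Assumption: `count` is the total run length, including the copy of the
    compressed character that is already in `out`.
    """
    if not out:
        raise ValueError("no compressed character precedes the control code")
    out += bytes([out[-1]]) * (count - 1)
```

If the count instead meant the number of additional copies, the only change would be to append `count` copies rather than `count - 1`.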

[0031] Referring to FIG. 12, special handling during decompression is described. When an auxiliary code ("0C") is detected during continuous character decompression or character transfer processing, that character is skipped and the next character is treated as data. However, the auxiliary code ("0C") is skipped only once.
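Putting the auxiliary-code rule together with the two control codes gives a small record decoder. This is a hedged sketch, not the patented routine: the function name `decompress_record` is hypothetical, the count of a "1F" token is assumed to be the total run length, inter-record copies are assumed to come from the same offset in the previous record, the 4-byte leading marker is not handled, and malformed input (for example a control code at the very end) is not checked.

```python
INTER, RUN, AUX = 0x1E, 0x1F, 0x0C


def decompress_record(data: bytes, prev_record: bytes = b"") -> bytes:
    """Decode one compressed record against the previous decompressed record."""
    out = bytearray()
    i = 0
    while i < len(data):
        code = data[i]
        if code == AUX:
            # Skip the auxiliary code once; the next byte is plain data,
            # even if it is itself 0x0C, 0x1E or 0x1F.
            out.append(data[i + 1])
            i += 2
        elif code == INTER:
            count = data[i + 1]
            start = len(out)
            out += prev_record[start:start + count]
            i += 2
        elif code == RUN:
            count = data[i + 1]
            # The compressed character is the last byte already output.
            out += bytes([out[-1]]) * (count - 1)
            i += 2
        else:
            out.append(code)  # ordinary character transfer
            i += 1
    return bytes(out)
```

Because the auxiliary code is skipped only once, the sequence "0C" "0C" decodes to a single data byte "0C", and "0C" "1F" decodes to a data byte "1F" rather than starting a run.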

[0032]

[Effects of the invention] As described above, the computer data compression method according to claim 1 of the present application performs identical character compression between different records in addition to the conventional continuous character compression within a single record, and by combining the two it can raise the compression ratio of computer data considerably. Computer resources (the central processing unit, magnetic disks, and data communication equipment) can therefore be used effectively, and costs can be reduced.

[0033] In addition, the computer data compression method according to claim 1 searches the computer data to be compressed in stages, which makes it possible to provide a computer data compression method that performs compression efficiently.

[0034] According to the computer data compression methods of claims 2, 3, and 4, in addition to the effects of the invention according to claim 1, a computer data compression method that prevents double compression can be provided.

[0035] Furthermore, according to the computer data decompression method, data compressed by the computer data compression method according to any one of claims 1 to 4 can be decompressed and restored to the original data. Data that has been compressed can therefore be restored and used.

[Brief description of the drawings]

FIG. 1 is an explanatory diagram of the arguments used when the compression processing program is called from an application program in the embodiment.

FIG. 2 is an explanatory diagram of the arguments used when the decompression processing program is called from an application program in the embodiment.

FIG. 3 is a block flowchart showing the processing procedure of the main routine of the compression processing program in the embodiment.

FIG. 4 is a block flowchart showing the processing procedure of the inter-record compression subroutine in the embodiment.

FIG. 5 is a block flowchart showing the processing procedure of the continuous character compression subroutine in the embodiment.

FIG. 6 is an explanatory diagram outlining the processing of the inter-record compression subroutine and the storage scheme of the inter-record compression control code in the embodiment.

FIG. 7 is an explanatory diagram outlining the processing of the continuous character compression subroutine and the storage scheme of the continuous character compression control code in the embodiment.

FIG. 8 is an explanatory diagram of the special handling in compression processing and character transfer processing in the embodiment.

FIG. 9 is an explanatory diagram of the control code storage scheme used when the number of compressed characters exceeds 255 in the embodiment.

FIG. 10 is an explanatory diagram of the inter-record decompression processing in the embodiment.

FIG. 11 is an explanatory diagram of the continuous character decompression processing in the embodiment.

FIG. 12 is an explanatory diagram of the special handling in decompression processing in the embodiment.

Claims

(57) [Claims]

1. A computer data compression method, being a data compression method for compressing data to be stored on an external storage medium of a computer, in accordance with an external instruction parameter, before the data is stored on the external storage medium, and a data compression method for compressing communication data in data transfer processing between computers, in accordance with an external instruction parameter, before the data is transferred, the method comprising: a step of performing first compression processing that compresses identical character strings between different records; and a step of performing second compression processing that compresses continuous character strings within the same record, wherein the step of performing the first compression processing includes a step of determining, as a first stage, identical character strings between records in units of a plurality of bytes, a step of determining, as a second stage, identical character strings between records in units of one byte if the result of the determination is a mismatch, and a step of compressing the identical character string if the determination finds that one exists, and wherein the step of performing the second compression processing includes a step of determining, as a first stage, continuous character strings within the same record in units of a plurality of bytes, a step of determining, as a second stage, continuous character strings within the same record in units of one byte if the result of the determination is a mismatch, and a step of compressing the continuous character string if the determination finds that one exists.

2. The computer data compression method according to claim 1, further comprising a step of adding a predetermined code to the leading plurality of bytes of the first record of data that has undergone the first compression processing or the second compression processing.

3. The computer data compression method according to claim 2, further comprising a step of examining the leading plurality of bytes of the first record of the data to be compressed, executing the compression processing if they are not the predetermined code, and aborting the compression processing if they are the predetermined code.

4. The computer data compression method according to claim 3 , wherein the plurality of bytes are four bytes, and the predetermined code is ALL “1F”.