JP3132774B2

JP3132774B2 - Data compression / decompression device

Info

Publication number: JP3132774B2
Application number: JP34571191A
Authority: JP
Inventors: 佳之岡田; 茂吉田; 泰彦中野; 広隆千葉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-12-27
Filing date: 1991-12-27
Publication date: 2001-02-05
Anticipated expiration: 2016-02-05
Also published as: JPH05181641A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明はデータ圧縮・復元装置に
関し、特にユニバーサル符号化及び復号化によるデータ
の圧縮及び復元を行うデータ圧縮・復元装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data compression / decompression device, and more particularly to a data compression / decompression device for compressing and decompressing data by universal encoding and decoding.

【０００２】近年、著しい技術開発によって、コンピュ
ータの処理速度及び記憶容量等は飛躍的な発展を遂げつ
つある。しかし、コンピュータでベクトル情報及び画像
情報等のデータを扱うようになってからは、従来以上に
取り扱うデータ量が増加しつつある。このようなデータ
量の大幅な増加に対処するため、データの内容を損なわ
ずにデータ量を減らす種々のデータ圧縮方式が提案され
ている。In recent years, due to remarkable technological development, the processing speed, storage capacity, and the like of computers have been dramatically improved. However, since the computer handles data such as vector information and image information, the amount of data handled is increasing more than ever. In order to cope with such a large increase in the data amount, various data compression methods have been proposed for reducing the data amount without losing the data contents.

【０００３】これらのデータ圧縮方式は、データに含ま
れる冗長な部分を省いて符号化することによって、デー
タを圧縮する方式である。データ圧縮方式によって、デ
ータ量を減らすことができ、結果的に記憶容量を減らす
ことができる。また、通信では圧縮したデータを伝送す
ることによって、同一内容の情報を速く伝送することが
できる。[0003] These data compression schemes are schemes for compressing data by encoding without redundant portions included in the data. With the data compression method, the data amount can be reduced, and as a result, the storage capacity can be reduced. In communication, by transmitting compressed data, the same information can be transmitted at high speed.

【０００４】なお、「文字（Character ）」及び「文字
列（Character String）」の定義はＪＩＳ−Ｃ６２３０
に従うほか、情報理論で用いられている呼称を踏襲し、
１ワード単位で構成されるデータを「文字」と呼び、任
意のワード単位で構成されるデータを「文字列」と呼ぶ
ことにする。[0004] The definition of "Character" and "Character String" are defined in JIS-C6230.
In addition to following the names used in information theory,
Data composed of one word unit is called a "character", and data composed of an arbitrary word unit is called a "character string".

【０００５】[0005]

【従来の技術】従来、上記のようなデータを圧縮する一
例として、ユニバーサル符号化方式が提案されている。
ユニバーサル符号化方式の代表的な例として、ＬＺ（Le
mpel-Ziv）符号化法と算術符号化法とがある。また、Ｌ
Ｚ符号化法には、ユニバーサル型と増分分解型（Increm
ental persing ）のアルゴリズムが提案されている。さ
らに、これらのアルゴリズムを改良した符号化法とし
て、ユニバーサル型に属するＬＺＳＳ符号化法と、増分
分解型に属するＬＺＷ（Lempel-Ziv-Welch）符号化法と
がある。2. Description of the Related Art Conventionally, a universal coding method has been proposed as an example of compressing the above data.
As a typical example of the universal encoding method, LZ (Le
mpel-Ziv) coding method and arithmetic coding method. Also, L
Z encoding methods include universal type and incremental decomposition type (Increm
ental persing) algorithm has been proposed. Furthermore, there are LZSS encoding methods belonging to the universal type and LZW (Lempel-Ziv-Welch) encoding methods belonging to the incremental decomposition type as encoding methods in which these algorithms are improved.

【０００６】なお、ＬＺ符号化法は、例えば、宗像清治
著「Lempel-Zivデータ圧縮法」、情報処理、pp.2〜6, V
ol.26, No.1, 1985 に詳しく掲載されている。また、Ｌ
ＺＳＳ符号化法は、T.C. Bell, "Better OPM/L Text Co
mpression", IEEE Trans.onCommu., Vol.COM-34, No.1
2, Dec.1986 に詳しく掲載されている。さらに、ＬＺＷ
符号化法は、T.A. Welch, "A Technique for High-Perf
ormance Data Compression", Computer, Jun.1984 に詳
しく掲載されている。そして、増分分解型の符号化法及
びＬＺＷ符号化法は、特開昭59−231683号、米国特許Ｎ
o. 4,558,302号において開示されている。The LZ encoding method is described in, for example, “Lempel-Ziv Data Compression Method” by Seiji Munakata, Information Processing, pp. 2-6, V
ol.26, No.1, 1985. Also, L
ZSS encoding is described in TC Bell, "Better OPM / L Text Co.
mpression ", IEEE Trans.onCommu., Vol.COM-34, No.1
2, Dec.1986. Furthermore, LZW
The encoding method is described in TA Welch, "A Technique for High-Perf
Ormance Data Compression ", Computer, Jun. 1984. An incremental decomposition type encoding method and an LZW encoding method are described in JP-A-59-231683, U.S. Pat.
o. 4,558,302.

【０００７】これらの符号化法のうち、高速処理がで
き、アルゴリズムが簡単であるという利点から、一般的
にＬＺＷ符号化法が使用されてきた。ＬＺＷ符号化法
は、書き換え可能な辞書をもち、以下に示す処理によっ
て符号化を行う方法である。まず、新規の入力文字列を
相異なる部分文字列に分割し、この部分文字列が辞書に
登録されてなければ、出現した順に識別番号を付して全
て辞書に登録する。同時に、現在入力している部分文字
列のうち、最長の部分文字列と一致する部分文字列を辞
書から選択し、選択した部分文字列に付されている識別
番号で符号化する。[0007] Among these coding methods, the LZW coding method has been generally used because of its advantages of high-speed processing and simple algorithm. The LZW encoding method has a rewritable dictionary and performs encoding by the following processing. First, a new input character string is divided into different partial character strings, and if this partial character string is not registered in the dictionary, it is assigned an identification number in the order of appearance and registered in the dictionary. At the same time, of the currently input partial character strings, a partial character string that matches the longest partial character string is selected from the dictionary, and is encoded with the identification number assigned to the selected partial character string.

【０００８】以下、ＬＺＷ符号化法を使用したデータ圧
縮回路及びデータ復元回路の詳細について説明する。図
１５は、従来のデータ圧縮回路を示す図である。図にお
いて、従来のデータ圧縮回路は、辞書検索手段１２１、
辞書登録手段１２２、ユニバーサル符号化手段１２３及
び辞書Ｄ１２から構成される。The details of a data compression circuit and a data decompression circuit using the LZW encoding method will be described below. FIG. 15 is a diagram showing a conventional data compression circuit. In the figure, a conventional data compression circuit includes a dictionary search unit 121,
It comprises a dictionary registration unit 122, a universal encoding unit 123, and a dictionary D12.

【０００９】辞書検索手段１２１は、辞書Ｄ１２に登録
されている登録文字列のうち、入力された入力文字列と
一致する登録文字列を検索する。辞書登録手段１２２
は、辞書検索手段１２１によって検索できなかった入力
文字列に識別番号を付して辞書１２２に登録する。ユニ
バーサル符号化手段１２３は、辞書検索手段１２１によ
って検索された登録文字列に付された識別番号、あるい
は辞書登録手段１２２によって入力文字列に新たに付さ
れた識別番号を出力符号に符号化する。The dictionary search means 121 searches the registered character strings registered in the dictionary D12 for a registered character string that matches the input character string. Dictionary registration means 122
Adds an identification number to an input character string that could not be searched by the dictionary search means 121 and registers it in the dictionary 122. The universal encoding unit 123 encodes an identification number assigned to the registered character string searched by the dictionary search unit 121 or an identification number newly assigned to the input character string by the dictionary registration unit 122 into an output code.

【００１０】辞書検索手段１２１、辞書登録手段１２２
及びユニバーサル符号化手段１２３の符号化処理は、シ
ーケンシャルに行われる。この符号化処理手順の一例を
図１６に示す。[0010] Dictionary search means 121, dictionary registration means 122
The encoding process of the universal encoding unit 123 is performed sequentially. FIG. 16 shows an example of the encoding processing procedure.

【００１１】図１６は、従来のデータ圧縮回路による処
理手順を示す図である。図には、処理文字数欄１２５、
辞書検索欄１２６、辞書登録欄１２７及び符号化欄１２
８からなる表を示す。FIG. 16 is a diagram showing a processing procedure by a conventional data compression circuit. In the figure, the processing character number column 125,
Dictionary search field 126, dictionary registration field 127, and encoding field 12
8 shows a table.

【００１２】処理文字数欄１２５には、入力文字列が符
号化される際に処理される先頭からの文字順位を示す。
辞書検索欄１２６は、図１５の辞書検索手段１２１が処
理する内容を示す。辞書登録欄１２７は、辞書登録手段
１２２が処理する内容を示す。符号化欄１２８は、ユニ
バーサル符号化手段１２３が処理する内容を示す。な
お、円内の数字は処理手順の順番を示す。以下、「円内
の数字Ｘ」を「サイクルＸ」と表記する。The number-of-characters-to-be-processed column 125 indicates the order of characters from the beginning to be processed when the input character string is encoded.
The dictionary search field 126 shows the contents processed by the dictionary search means 121 in FIG. The dictionary registration column 127 indicates the content processed by the dictionary registration unit 122. The encoding column 128 indicates the content processed by the universal encoding means 123. The number in the circle indicates the order of the processing procedure. Hereinafter, “the number X in a circle” is referred to as “cycle X”.

【００１３】まず、サイクル１〜サイクル３では、入力
された入力文字列の最初の１文字について、辞書登録手
段１２２が辞書Ｄ１２を検索する過程を示す。具体的に
は、辞書検索の結果、入力文字列と登録文字列がサイク
ル１及びサイクル２では一致せず、サイクル３で一致し
たことを示す。First, in cycles 1 to 3, a process in which the dictionary registration means 122 searches the dictionary D12 for the first character of the input character string is shown. More specifically, as a result of the dictionary search, the input character string and the registered character string do not match in cycles 1 and 2, but indicate that they match in cycle 3.

【００１４】次に、サイクル４〜サイクル６では、入力
された入力文字列の２文字目について、辞書登録手段１
２２が辞書Ｄ１２を検索する過程を示す。具体的には、
入力文字列と登録文字列がサイクル４及びサイクル５で
は一致せず、サイクル６では辞書に登録されている文字
が無いことを示す。Next, in cycle 4 to cycle 6, the dictionary registration means 1 registers the second character of the input character string.
Reference numeral 22 denotes a process of searching the dictionary D12. In particular,
The input character string and the registered character string do not match in cycles 4 and 5, and cycle 6 indicates that there is no character registered in the dictionary.

【００１５】このとき、辞書登録手段１２２が入力文字
列に識別番号を付して辞書Ｄ１２に登録する。これがサ
イクル７に示す「登録」である。また、符号化するため
に、ユニバーサル符号化手段１２３が辞書登録の際に付
した識別番号を符号化する。これがサイクル８に示す
「符号化」である。以下、同様に符号化処理が行われ
る。At this time, the dictionary registration means 122 adds an identification number to the input character string and registers it in the dictionary D12. This is the “registration” shown in cycle 7. Further, in order to perform encoding, the universal encoding unit 123 encodes an identification number given at the time of dictionary registration. This is the “encoding” shown in cycle 8. Hereinafter, the encoding process is performed similarly.

【００１６】また、図１７は、従来のデータ復元回路を
示す図である。図において、従来のデータ復元回路は、
ユニバーサル復号化手段１３１、辞書検索手段１３２、
スタック蓄積手段１３３、辞書登録手段１３４及び辞書
Ｄ１３から構成される。FIG. 17 is a diagram showing a conventional data restoration circuit. In the figure, the conventional data restoration circuit
Universal decryption means 131, dictionary search means 132,
It comprises a stack storage unit 133, a dictionary registration unit 134, and a dictionary D13.

【００１７】ユニバーサル復号化手段１３１は、入力さ
れた入力符号を復号化する。辞書検索手段１２１は、辞
書Ｄ１３に登録されている登録文字符号のうち、復号化
された文字符号と一致する文字符号を検索する。スタッ
ク蓄積手段１３３は、辞書検索手段１２１によって検索
された文字をスタックに蓄積し、検索が終了した時点で
蓄積した文字を全て出力する。辞書登録手段１３４は、
スタック蓄積手段１３３によって出力された復元文字列
が辞書Ｄ１３に登録されていない場合に、この復元文字
列に新たな符号を付して辞書Ｄ１３に登録する。The universal decoding means 131 decodes the input code. The dictionary search unit 121 searches the registered character codes registered in the dictionary D13 for a character code that matches the decoded character code. The stack storage unit 133 stores the characters searched by the dictionary search unit 121 in a stack, and outputs all the stored characters when the search is completed. Dictionary registration means 134
If the restored character string output by the stack storage unit 133 is not registered in the dictionary D13, the restored character string is assigned a new code and registered in the dictionary D13.

【００１８】これら、ユニバーサル復号化手段１３１、
辞書検索手段１３２、スタック蓄積手段１３３及び辞書
登録手段１３４の復号化処理は、シーケンシャルに行わ
れる。この復号化処理手順の一例を図１８に示す。These universal decoding means 131,
The decryption processing of the dictionary search unit 132, the stack storage unit 133, and the dictionary registration unit 134 is performed sequentially. FIG. 18 shows an example of this decoding processing procedure.

【００１９】図１８は、従来のデータ復元回路による処
理手順を示す図である。図には、処理符号数欄１３６、
復号化欄１３７、検索・蓄積欄１３８、文字列出力欄１
３９及び辞書登録欄１４０からなる表を示す。FIG. 18 is a diagram showing a processing procedure by a conventional data restoration circuit. In the figure, the processing code number column 136,
Decryption field 137, search / accumulation field 138, character string output field 1
39 shows a table composed of 39 and a dictionary registration field 140.

【００２０】処理符号数欄１３６は、入力符号が復号化
される際に処理される符号数を示す。復号化欄１３７
は、図１７のユニバーサル復号化手段１３１が処理する
内容を示す。検索・蓄積欄１３８は、辞書検索手段１３
２が検索し、スタック蓄積手段１３３が検索された文字
を蓄積する処理を示す。文字列出力欄１３９は、スタッ
ク蓄積手段１３３が蓄積した文字を全て出力する処理を
示す。なお、円内の数字は処理手順の順番を示す。以
下、図１６と同様に、「円内の数字Ｘ」を「サイクル
Ｘ」と表記する。The processing code number column 136 indicates the number of codes to be processed when the input code is decoded. Decryption field 137
Indicates the contents to be processed by the universal decoding means 131 in FIG. The search / accumulation column 138 stores the dictionary search means 13
2 shows a process in which the stack storage means 133 stores the searched characters. The character string output column 139 shows a process of outputting all the characters accumulated by the stack accumulation means 133. The number in the circle indicates the order of the processing procedure. Hereinafter, similarly to FIG. 16, “the number X in a circle” is described as “cycle X”.

【００２１】まず、サイクル１〜サイクル８では、入力
された最初入力符号列について、復号化及び出力文字列
として出力するまでの過程を示す。具体的に、各処理手
順は次のようになっている。サイクル１では、ユニバー
サル復号化手段１３１が入力符号を復号化する。サイク
ル２〜サイクル４では、辞書検索手段１３２がサイクル
１で復号化された文字符号を再帰的に検索して、スタッ
ク蓄積手段１３３が検索された文字をスタックに蓄積す
る。サイクル５〜サイクル７では、スタック蓄積手段１
３３が蓄積した文字を全て出力する。サイクル８では、
辞書登録手段１３４がスタック蓄積手段１３３によって
出力された出力文字列に新たな符号を付して辞書Ｄ１３
に登録する。以下、同様に復号化処理が行われる。First, in a cycle 1 to a cycle 8, a process from decoding of an input first input code string to outputting as an output character string is shown. Specifically, each processing procedure is as follows. In cycle 1, the universal decoding means 131 decodes the input code. In cycles 2 to 4, the dictionary search unit 132 recursively searches for the character code decoded in cycle 1, and the stack storage unit 133 stores the searched character in the stack. In cycle 5 to cycle 7, the stack storage unit 1
33 outputs all the stored characters. In cycle 8,
The dictionary registration unit 134 assigns a new code to the output character string output by the stack
Register with. Hereinafter, the decoding process is performed similarly.

【００２２】[0022]

【発明が解決しようとする課題】しかし、従来のデータ
圧縮回路では、ある文字列に対して辞書検索、辞書登録
及び符号化を行なった後に、次の文字列を符号化する、
というようなバッチ・シリアル処理で圧縮処理を行なっ
ていた。また、データ復元回路でも同様に、ある符号に
対して復号化、辞書検索、スタック蓄積、辞書登録を行
なった後に、次の符号を復号化する、というようなバッ
チ・シリアル処理で復元処理を行なっていた。However, in the conventional data compression circuit, after performing a dictionary search, dictionary registration and encoding for a certain character string, the next character string is encoded.
Such compression processing was performed by batch serial processing. Similarly, the data restoration circuit performs restoration processing by batch serial processing, such as decoding a certain code, searching a dictionary, accumulating stacks, and registering a dictionary, and then decoding the next code. I was

【００２３】したがって、このようなバッチ・シリアル
処理では、符号化及び復号化処理の高速化が困難である
という問題点があった。本発明はこのような点に鑑みて
なされたものであり、入力データの圧縮処理を並列処理
により高速化するデータ圧縮回路を提供することを目的
とする。Therefore, in such batch serial processing, there is a problem that it is difficult to speed up the encoding and decoding processes. The present invention has been made in view of such a point, and an object of the present invention is to provide a data compression circuit that speeds up the compression processing of input data by parallel processing.

【００２４】また、本発明の他の目的は、入力符号の復
元処理を並列処理により高速化するデータ復元回路を提
供することである。さらに、本発明の他の目的は、入力
データの圧縮処理及び入力符号の復元処理を、並列処理
により高速化するデータ圧縮・復元装置を提供すること
である。It is another object of the present invention to provide a data restoration circuit which speeds up restoration processing of an input code by parallel processing. Still another object of the present invention is to provide a data compression / decompression device that speeds up the compression processing of input data and the decompression processing of input codes by parallel processing.

【００２５】[0025]

【課題を解決するための手段】本発明では上記目的を達
成するために、図１に示すように、データ圧縮回路は第
１の辞書検索手段１、第１の辞書登録手段２、符号化手
段３、初期辞書アクセス手段４、通常辞書アクセス手段
５、初期辞書Ｄ１及び通常辞書Ｄ２から構成する。第１
の辞書検索手段１は、初期辞書アクセス手段４を介して
初期辞書Ｄ１に登録されている登録文字列のうち、入力
された入力文字列と一致する最長の登録文字列を検索す
る。初期辞書Ｄ１で検索できなかった場合は検索を終了
する。また、検索できた場合には通常辞書アクセス手段
５を介して通常辞書Ｄ２に登録されている登録文字列か
ら検索する。第１の辞書登録手段２は、第１の辞書検索
手段１によって検索された最長の登録文字列に一文字を
加えた文字列に識別番号を付し、通常辞書アクセス手段
５を介して通常辞書Ｄ２に登録する。符号化手段３は、
最長の登録文字列に付された識別番号を、出力符号とし
て符号化する。According to the present invention, in order to achieve the above object, as shown in FIG. 1, a data compression circuit comprises a first dictionary retrieval unit 1, a first dictionary registration unit 2, an encoding unit. 3. Initial dictionary access means 4, normal dictionary access means 5, initial dictionary D1, and normal dictionary D2. First
The dictionary search means 1 searches the registered character strings registered in the initial dictionary D1 via the initial dictionary access means 4 for the longest registered character string that matches the input character string. If the search cannot be performed using the initial dictionary D1, the search is terminated. If the search is successful, the search is performed from the registered character string registered in the normal dictionary D2 via the normal dictionary access means 5. The first dictionary registration unit 2 attaches an identification number to a character string obtained by adding one character to the longest registered character string searched by the first dictionary search unit 1, and sends the normal dictionary D 2 via the normal dictionary access unit 5. Register with. The encoding means 3
The identification number assigned to the longest registered character string is encoded as an output code.

【００２６】また、初期辞書Ｄ１及び通常辞書Ｄ２は、
外部ハッシュによるデータ検索及びデータ登録を行うよ
うに構成する。さらに、初期辞書Ｄ１には第２文字まで
の文字列についてデータ検索及びデータ登録を行い、通
常辞書Ｄ２には第３文字以降の文字列についてデータ検
索及びデータ登録を行うように構成する。The initial dictionary D1 and the normal dictionary D2 are
It is configured to perform data search and data registration using an external hash. Further, the initial dictionary D1 is configured to perform data search and data registration for character strings up to the second character, and the normal dictionary D2 is configured to perform data search and data registration for character strings after the third character.

【００２７】そして、初期辞書Ｄ１及び通常辞書Ｄ２
は、完全ハッシュによるデータ検索及びデータ登録を行
うように構成する。それから、第１の辞書検索手段１、
第１の辞書登録手段２及び符号化手段３は、パイプライ
ンで接続され並列処理される。Then, the initial dictionary D1 and the normal dictionary D2
Is configured to perform data search and data registration using a complete hash. Then, the first dictionary search means 1,
The first dictionary registration unit 2 and the encoding unit 3 are connected by a pipeline and are processed in parallel.

【００２８】そのうえ、図３に示すように、データ復元
回路は、復号化手段１１、第２の辞書検索手段１２、第
１の蓄積手段１３、第２の蓄積手段１４、出力選択手段
１５、第２の辞書登録手段１６、辞書アクセス手段１７
及び辞書Ｄ３から構成する。復号化手段１１は、入力さ
れた入力符号を復号化する。第２の辞書検索手段１２
は、辞書アクセス手段１７を介して辞書Ｄ３に登録され
ている登録文字列のうち、復号化によって復元された文
字列と一致する文字列を検索する。第１の蓄積手段１３
及び第２の蓄積手段１４は、この検索によって復元され
た文字列を蓄積する。出力選択手段１５は、蓄積された
文字列のうち、出力する復元文字列を選択する。第２の
辞書登録手段１６は、選択された復元文字列のうち、辞
書Ｄ３に登録されていない文字列に、新たな符号を付し
て登録する。In addition, as shown in FIG. 3, the data restoration circuit includes a decoding unit 11, a second dictionary search unit 12, a first storage unit 13, a second storage unit 14, an output selection unit 15, 2 dictionary registration means 16 and dictionary access means 17
And the dictionary D3. The decoding unit 11 decodes the input code. Second dictionary search means 12
Searches a registered character string registered in the dictionary D3 via the dictionary access unit 17 for a character string that matches the character string restored by decoding. First storage means 13
The second storage unit 14 stores the character string restored by the search. The output selection unit 15 selects a restored character string to be output from the stored character strings. The second dictionary registration unit 16 registers character strings that are not registered in the dictionary D3 among the selected restored character strings with new codes.

【００２９】また、復号化手段１１、第２の辞書検索手
段１２、出力選択手段１５及び第２の辞書登録手段１６
は、パイプラインで接続され並列処理される。そして、
図５に示すように、データ圧縮・復元装置は、データ圧
縮回路及びデータ復元回路から構成する。データ圧縮回
路は第１の辞書検索手段１、第１の辞書登録手段２、符
号化手段３、初期辞書アクセス手段４、通常辞書アクセ
ス手段５、初期辞書Ｄ１及び通常辞書Ｄ２から構成す
る。また、データ復元回路は、復号化手段１１、第２の
辞書検索手段１２、第１の蓄積手段１３、第２の蓄積手
段１４、出力選択手段１５、第２の辞書登録手段１６、
辞書アクセス手段１７及び辞書Ｄ３から構成する。The decoding means 11, the second dictionary search means 12, the output selection means 15, and the second dictionary registration means 16
Are connected in a pipeline and processed in parallel. And
As shown in FIG. 5, the data compression / decompression device includes a data compression circuit and a data decompression circuit. The data compression circuit includes a first dictionary search unit 1, a first dictionary registration unit 2, an encoding unit 3, an initial dictionary access unit 4, a normal dictionary access unit 5, an initial dictionary D1, and a normal dictionary D2. The data restoration circuit includes a decoding unit 11, a second dictionary search unit 12, a first storage unit 13, a second storage unit 14, an output selection unit 15, a second dictionary registration unit 16,
It comprises a dictionary access means 17 and a dictionary D3.

【００３０】[0030]

【作用】データ圧縮回路において、第１の辞書検索手段
１が初期辞書アクセス手段４を介して、入力された入力
文字列を初期辞書Ｄ１から検索する。初期辞書Ｄ１から
検索されなかった場合は、同様に通常辞書アクセス手段
５を介して通常辞書Ｄ２から検索する。そして、第１の
辞書登録手段２が検索された最長の登録文字列に一文字
を加えた文字列に識別番号を付し、通常辞書アクセス手
段５を介して通常辞書Ｄ２に登録する。符号化手段３は
検索された最長の登録文字列に付された識別番号を出力
符号に符号化する。In the data compression circuit, the first dictionary retrieval means 1 retrieves the input character string from the initial dictionary D1 via the initial dictionary access means 4. If the search is not performed from the initial dictionary D1, the search is similarly performed from the normal dictionary D2 via the normal dictionary access means 5. Then, the first dictionary registration unit 2 attaches an identification number to the character string obtained by adding one character to the longest registered character string searched, and registers the character string in the normal dictionary D2 via the normal dictionary access unit 5. The encoding means 3 encodes the identification number given to the longest registered character string searched into an output code.

【００３１】また、初期辞書Ｄ１及び通常辞書Ｄ２は、
外部ハッシュによるデータ検索及びデータ登録を行う。
さらに、初期辞書Ｄ１には第２文字までの文字列につい
てデータ検索及びデータ登録を行い、通常辞書Ｄ２には
第３文字以降の文字列についてデータ検索及びデータ登
録を行う。The initial dictionary D1 and the normal dictionary D2 are
Perform data search and data registration by external hash.
Further, data search and data registration are performed for the character string up to the second character in the initial dictionary D1, and data search and data registration are performed for the character string after the third character in the normal dictionary D2.

【００３２】そして、初期辞書Ｄ１及び通常辞書Ｄ２
は、完全ハッシュによるデータ検索及びデータ登録を行
う。それから、第１の辞書検索手段１、第１の辞書登録
手段２及び符号化手段３は、パイプラインで接続して並
列処理する。Then, the initial dictionary D1 and the normal dictionary D2
Performs data search and data registration using a complete hash. Then, the first dictionary search means 1, the first dictionary registration means 2, and the encoding means 3 are connected by a pipeline to perform parallel processing.

【００３３】そのうえ、データ復元回路において、復号
化手段１１が入力された入力符号を復号化し、第２の辞
書検索手段１２が辞書アクセス手段１７を介して辞書Ｄ
３のうち、復号化によって復元された文字列と一致する
文字列を検索し、第１の蓄積手段１３及び第２の蓄積手
段１４に復元された文字列を出力する。出力選択手段１
５は出力する復元文字列を選択し、第２の辞書登録手段
１６が未登録の復元文字列に新たな符号を付し、辞書ア
クセス手段１７を介して辞書Ｄ３に登録する。Further, in the data restoration circuit, the decoding means 11 decodes the input code, and the second dictionary search means 12 transmits the dictionary D through the dictionary access means 17.
3, a character string that matches the character string restored by decoding is searched, and the restored character string is output to the first storage unit 13 and the second storage unit 14. Output selection means 1
5 selects a restored character string to be output, the second dictionary registration means 16 assigns a new code to the unregistered restored character string, and registers the new character string in the dictionary D3 via the dictionary access means 17.

【００３４】また、復号化手段１１、第２の辞書検索手
段１２、出力選択手段１５及び第２の辞書登録手段１６
は、パイプラインで接続して並列処理する。そして、デ
ータ圧縮・復元装置において、データ圧縮回路では第１
の辞書検索手段１が初期辞書アクセス手段４を介して、
入力された入力文字列を初期辞書Ｄ１から検索する。初
期辞書Ｄ１から検索されなかった場合は、同様に通常辞
書アクセス手段５を介して通常辞書Ｄ２から検索する。
そして、第１の辞書登録手段２が検索された最長の登録
文字列に一文字を加えた文字列に識別番号を付し、通常
辞書アクセス手段５を介して通常辞書Ｄ２に登録する。
符号化手段３は検索された最長の登録文字列に付された
識別番号を出力符号に符号化する。また、データ復元回
路では復号化手段１１が入力された入力符号を復号化
し、第２の辞書検索手段１２が辞書アクセス手段１７を
介して辞書Ｄ３のうち、復号化によって復元された文字
列と一致する文字列を検索し、第１の蓄積手段１３及び
第２の蓄積手段１４に復元された文字列を出力する。出
力選択手段１５は出力する復元文字列を選択し、第２の
辞書登録手段１６が未登録の復元文字列に新たな符号を
付し、辞書アクセス手段１７を介して辞書Ｄ３に登録す
る。The decoding means 11, the second dictionary search means 12, the output selection means 15, and the second dictionary registration means 16
Are connected in a pipeline to perform parallel processing. In the data compression / decompression device, the data compression circuit
Dictionary search means 1 via initial dictionary access means 4
The input character string is searched from the initial dictionary D1. If the search is not performed from the initial dictionary D1, the search is similarly performed from the normal dictionary D2 via the normal dictionary access means 5.
Then, the first dictionary registration unit 2 attaches an identification number to the character string obtained by adding one character to the longest registered character string searched, and registers the character string in the normal dictionary D2 via the normal dictionary access unit 5.
The encoding means 3 encodes the identification number given to the longest registered character string searched into an output code. In the data restoration circuit, the decoding means 11 decodes the input code, and the second dictionary search means 12 matches the character string restored by decoding in the dictionary D3 via the dictionary access means 17. Then, the restored character string is output to the first storage unit 13 and the second storage unit 14. The output selecting unit 15 selects the restored character string to be output, and the second dictionary registration unit 16 assigns a new code to the unregistered restored character string and registers the new character string in the dictionary D3 via the dictionary access unit 17.

【００３５】[0035]

【実施例】以下、本発明の一実施例を図面に基づいて説
明する。図１は、本発明のデータ圧縮回路の実施例を示
す図である。図において、データ圧縮回路は第１の辞書
検索手段１、第１の辞書登録手段２、符号化手段３、初
期辞書アクセス手段４、通常辞書アクセス手段５、初期
辞書Ｄ１及び通常辞書Ｄ２から構成される。An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing an embodiment of the data compression circuit of the present invention. In the figure, the data compression circuit comprises a first dictionary search unit 1, a first dictionary registration unit 2, an encoding unit 3, an initial dictionary access unit 4, a normal dictionary access unit 5, an initial dictionary D1, and a normal dictionary D2. You.

【００３６】第１の辞書検索手段１は、初期辞書アクセ
ス手段４を介して初期辞書Ｄ１に登録されている登録文
字列のうち、あるいは通常辞書アクセス手段５を介して
通常辞書Ｄ２に登録されている登録文字列のうち、情報
源から入力された入力文字列と一致する最長の登録文字
列を検索する。第１の辞書登録手段２は、検索された最
長の登録文字列に一文字を加えた文字列に、識別番号を
付して初期辞書アクセス手段４を介して初期辞書Ｄ１
に、あるいは通常辞書アクセス手段５を介して通常辞書
Ｄ２に登録する。初期辞書アクセス手段４は、初期辞書
Ｄ１とのアクセスを行うための手段であり、例えば第１
の辞書検索手段１及び第１の辞書登録手段２から指令さ
れた論理アドレスを物理アドレスに変換して、初期辞書
Ｄ１とのデータの入出力を行う。通常辞書アクセス手段
５は、通常辞書Ｄ２とのアクセスを行うための手段であ
り、例えば第１の辞書検索手段１及び第１の辞書登録手
段２から指令された論理アドレスを物理アドレスに変換
して、通常辞書Ｄ２とのデータの入出力を行う。符号化
手段３は、例えばＬＺＷ符号化手段が使用され、検索さ
れた最長の登録文字列に付された識別番号を出力符号と
して符号化する。The first dictionary search means 1 is one of the registered character strings registered in the initial dictionary D1 via the initial dictionary access means 4 or registered in the normal dictionary D2 via the normal dictionary access means 5. The longest registered character string that matches the input character string input from the information source is searched for among the registered character strings. The first dictionary registration unit 2 adds an identification number to a character string obtained by adding one character to the searched longest registered character string, and assigns an initial dictionary D1 via the initial dictionary access unit 4.
Or via the normal dictionary access means 5 in the normal dictionary D2. The initial dictionary access unit 4 is a unit for accessing the initial dictionary D1.
The logical address designated by the dictionary search means 1 and the first dictionary registration means 2 is converted into a physical address, and data is input / output to / from the initial dictionary D1. The normal dictionary access unit 5 is a unit for accessing the normal dictionary D2. For example, the normal dictionary access unit 5 converts a logical address specified by the first dictionary search unit 1 and the first dictionary registration unit 2 into a physical address. Performs data input / output with the normal dictionary D2. The encoding unit 3 uses, for example, an LZW encoding unit, and encodes, as an output code, an identification number given to the longest registered character string searched.

【００３７】ここで、通常辞書Ｄ２に登録される「最長
の登録文字列に一文字を加えた文字列」は、次のような
文字列である。例えば、入力文字列が「ａｂｃｄ」であ
り、既に通常辞書Ｄ２に登録されている最長の登録文字
列が「ａｂ」であるならば、入力文字列のうち最長の登
録文字列と一致している「ａｂ」の次の一文字「ｃ」を
加えた文字列「ａｂｃ」である。Here, the "character string obtained by adding one character to the longest registered character string" registered in the normal dictionary D2 is the following character string. For example, if the input character string is "abcd" and the longest registered character string already registered in the normal dictionary D2 is "ab", the input character string matches the longest registered character string. This is a character string “abc” to which the one character “c” next to “ab” is added.

【００３８】なお、初期辞書アクセス手段４から初期辞
書Ｄ１へのアクセス、及び通常辞書アクセス手段５から
通常辞書Ｄ２へのアクセスは、後述する完全ハッシュ法
又は外部ハッシュ法によって行われる。The access from the initial dictionary access means 4 to the initial dictionary D1 and the access from the normal dictionary access means 5 to the normal dictionary D2 are performed by a complete hash method or an external hash method described later.

【００３９】また、第１の辞書検索手段１、第１の辞書
登録手段２及び符号化手段３の各手段は、いずれも図示
されていないパイプラインで接続されており、並列に処
理することができる。このパイプライン並列処理によ
り、複数の手段での符号化処理が可能となり、高速に符
号化処理を行うことができる。Each of the first dictionary search means 1, the first dictionary registration means 2, and the encoding means 3 are connected by a pipeline (not shown), and can be processed in parallel. it can. By this pipeline parallel processing, encoding processing by a plurality of means becomes possible, and encoding processing can be performed at high speed.

【００４０】このような並列処理において、例外となる
のは、第１の辞書検索手段１及び第１の辞書登録手段２
が同時に初期辞書Ｄ１又は通常辞書Ｄ２へのアクセスを
要求する場合である。例えば、第１の辞書検索手段１と
第１の辞書登録手段２とが同時に初期辞書Ｄ１にアクセ
スを要求する場合である。このように、二以上の手段か
ら同時にアクセス要求がある状態を「衝突」という。衝
突が起きた場合には、例えば第１の辞書登録手段２が初
期辞書アクセス手段４を介して初期辞書Ｄ１とアクセス
した後、第１の辞書検索手段１が同様に初期辞書Ｄ１と
アクセスする、というようにシーケンシャルに辞書とア
クセスすることになる。しかしながら、このような衝突
は全体の符号化処理から見れば、極めて稀な現象であ
る。したがって、全体の処理時間から見れば無視できる
程の処理時間に過ぎないので、シーケンシャルに辞書と
アクセス処理を行なっても問題はない。以下、この符号
化処理手順の一例を図２に示す。An exception in such parallel processing is that the first dictionary search means 1 and the first dictionary registration means 2
Is a request for access to the initial dictionary D1 or the normal dictionary D2 at the same time. For example, there is a case where the first dictionary search unit 1 and the first dictionary registration unit 2 request access to the initial dictionary D1 at the same time. A state in which an access request is made from two or more means at the same time is called "collision". When a collision occurs, for example, after the first dictionary registration unit 2 accesses the initial dictionary D1 via the initial dictionary access unit 4, the first dictionary search unit 1 similarly accesses the initial dictionary D1. Thus, the dictionary is accessed sequentially. However, such a collision is an extremely rare phenomenon in view of the entire encoding process. Therefore, since the processing time is negligible in view of the entire processing time, there is no problem even if the dictionary and the access processing are sequentially performed. Hereinafter, an example of this encoding processing procedure is shown in FIG.

【００４１】図２は、本発明のデータ圧縮回路による処
理手順を示す図である。図には、処理文字数欄６、辞書
検索欄７、辞書登録欄８及び符号化欄９からなる表を示
す。処理文字数欄６には、入力文字列が符号化される際
に処理される先頭からの文字順位を示す。辞書検索欄７
は、図１の第１の辞書検索手段１が処理する内容を示
す。辞書登録欄８は、第１の辞書登録手段２が処理する
内容を示す。符号化欄９は、符号化手段３が処理する内
容を示す。FIG. 2 is a diagram showing a processing procedure by the data compression circuit of the present invention. The figure shows a table including a processing character number column 6, a dictionary search column 7, a dictionary registration column 8, and an encoding column 9. The number-of-processed characters column 6 indicates the character order from the head processed when the input character string is encoded. Dictionary search field 7
Shows the contents processed by the first dictionary search means 1 in FIG. The dictionary registration column 8 shows the contents processed by the first dictionary registration unit 2. The encoding column 9 shows the contents to be processed by the encoding means 3.

【００４２】なお、円内の数字は処理手順の順番を示
す。以下、「円内の数字Ｘ」を「サイクルＸ」と表記す
る。また、同じ円内の数字がある場合には、同時に並列
して処理が行われることを示す。そして、初期辞書Ｄ１
とのアクセスは初期辞書アクセス手段４を介して行わ
れ、また、通常辞書Ｄ２とのアクセスは通常辞書アクセ
ス手段５を介して行われるものとして、この過程の表記
を省略する。The numbers in the circle indicate the order of the processing procedure. Hereinafter, “the number X in a circle” is referred to as “cycle X”. If there is a number in the same circle, it indicates that the processing is performed in parallel at the same time. Then, the initial dictionary D1
The access to the normal dictionary D2 is performed through the normal dictionary access means 5, and the description of this process is omitted.

【００４３】まず、サイクル１では、情報源から入力さ
れた入力文字列の最初の第１文字に続く第２文字につい
て、第１の辞書検索手段１が初期辞書Ｄ１を検索する。
サイクル２は、入力文字列の第３文字について、第１の
辞書検索手段１が通常辞書Ｄ２を検索する。サイクル３
は、入力文字列の第４文字について、第１の辞書検索手
段１が通常辞書Ｄ２を検索する。このとき、サイクル３
では、入力文字列の第４文字が通常辞書Ｄ２に登録され
ていない場合を示す。これらのサイクル１〜サイクル３
の過程は、シーケンシャルに各処理が行われる。First, in cycle 1, the first dictionary search means 1 searches the initial dictionary D1 for the second character following the first character of the input character string input from the information source.
In cycle 2, the first dictionary search means 1 searches the normal dictionary D2 for the third character of the input character string. Cycle 3
The first dictionary search means 1 searches the normal dictionary D2 for the fourth character of the input character string. At this time, cycle 3
Shows a case where the fourth character of the input character string is not registered in the normal dictionary D2. These cycles 1 to 3
In this process, each process is performed sequentially.

【００４４】次に、辞書検索欄７のサイクル４では、入
力文字列の第４文字に続く第５文字について、第１の辞
書検索手段１が初期辞書Ｄ１を検索する。同時に、辞書
登録欄８のサイクル４では、辞書検索欄７のサイクル３
で通常辞書Ｄ２に登録されていなかった入力文字列の第
１文字〜第４文字の文字列に識別番号を付して、第１の
辞書登録手段２が通常辞書Ｄ２に登録する。Next, in cycle 4 of the dictionary search field 7, the first dictionary search means 1 searches the initial dictionary D1 for the fifth character following the fourth character of the input character string. At the same time, in cycle 4 of the dictionary registration section 8, cycle 3 of the dictionary search section 7
Then, an identification number is assigned to the character string of the first to fourth characters of the input character string that has not been registered in the normal dictionary D2, and the first dictionary registration unit 2 registers the character string in the normal dictionary D2.

【００４５】そして、辞書検索欄７のサイクル５では、
入力文字列の第５文字に続く第６文字について、第１の
辞書検索手段１が通常辞書Ｄ２を検索する。同時に、符
号化欄９のサイクル５では、辞書登録欄８のサイクル４
で行なった登録文字列に付された識別番号を符号化す
る。以下、同様な処理が続けられる。Then, in cycle 5 of the dictionary search field 7,
For the sixth character following the fifth character of the input character string, the first dictionary search means 1 searches the normal dictionary D2. At the same time, in cycle 5 of the encoding field 9, cycle 4 of the dictionary registration field 8
Encodes the identification number given to the registered character string performed in step. Hereinafter, similar processing is continued.

【００４６】このように、辞書検索欄７において初期辞
書Ｄ１又は通常辞書Ｄ２を検索した結果、入力文字列が
登録されていなかった場合には、辞書登録欄８における
文字列の登録、及び符号化欄９における符号化を、辞書
検索と並列して行う。このため、符号化処理を高速に行
うことができる。As a result of searching the initial dictionary D1 or the normal dictionary D2 in the dictionary search field 7, if the input character string is not registered, the registration of the character string in the dictionary registration field 8 and the encoding The encoding in column 9 is performed in parallel with the dictionary search. Therefore, the encoding process can be performed at high speed.

【００４７】図３は、本発明のデータ復元回路の実施例
を示す図である。図において、データ復元回路は、復号
化手段１１、第２の辞書検索手段１２、第１の蓄積手段
１３、第２の蓄積手段１４、出力選択手段１５、第２の
辞書登録手段１６、辞書アクセス手段１７及び辞書Ｄ３
から構成される。FIG. 3 is a diagram showing an embodiment of the data restoration circuit of the present invention. In the figure, the data restoration circuit includes a decoding unit 11, a second dictionary search unit 12, a first storage unit 13, a second storage unit 14, an output selection unit 15, a second dictionary registration unit 16, and a dictionary access unit. Means 17 and dictionary D3
Consists of

【００４８】復号化手段１１は、例えばＬＺＷ復号化手
段が使用され、情報源から入力された入力符号を復号化
する。第２の辞書検索手段１２は、辞書アクセス手段１
７を介して辞書Ｄ３に登録されている登録文字列のう
ち、復号化によって復元された文字列と一致する文字列
を検索する。第１の蓄積手段１３及び第２の蓄積手段１
４は、検索によって復元された文字列を蓄積する。出力
選択手段１５は、蓄積された文字列のうち、出力する復
元文字列を選択する。第２の辞書登録手段１６は、選択
された復元文字列のうち、辞書Ｄ３に登録されていない
文字列に、新たな符号を付して登録する。辞書アクセス
手段１７は、辞書Ｄ３とのアクセスを行うための手段で
あり、例えば第２の辞書検索手段１２及び第２の辞書登
録手段１６から指令された論理アドレスを物理アドレス
に変換して、辞書Ｄ３とのデータの入出力を行う。As the decoding means 11, for example, LZW decoding means is used, and decodes an input code inputted from an information source. The second dictionary search means 12 is a dictionary access means 1
7, a character string that matches the character string restored by decoding is searched from the registered character strings registered in the dictionary D3. First storage means 13 and second storage means 1
No. 4 stores the character string restored by the search. The output selection unit 15 selects a restored character string to be output from the stored character strings. The second dictionary registration unit 16 registers character strings that are not registered in the dictionary D3 among the selected restored character strings with new codes. The dictionary access unit 17 is a unit for accessing the dictionary D3. For example, the dictionary access unit 17 converts a logical address instructed from the second dictionary search unit 12 and the second dictionary registration unit 16 into a physical address, and converts the logical address into a physical address. Data is input / output to / from D3.

【００４９】なお、辞書アクセス手段１７から辞書Ｄ３
へのアクセスは、後述する完全ハッシュ法又は外部ハッ
シュ法によって行うこともできる。また、図１のデータ
圧縮回路で説明した「衝突」は、このデータ復元回路で
も発生する。例えば、第２の辞書検索手段１２が検索要
求を、第２の辞書登録手段１６が登録要求を同時にする
場合である。この場合、例えば第２の辞書登録手段１６
の登録処理後に、第２の辞書検索手段１２の検索処理を
行うようにシーケンシャルに行われる。データ圧縮回路
の場合と同様に、衝突は全体の復号化処理から見れば極
めて稀な現象であるため、全体の処理時間から見れば無
視できる程の処理時間に過ぎないので、シーケンシャル
に辞書とアクセス処理を行なっても問題はない。The dictionary access means 17 sends the dictionary D3
Can be accessed by the full hash method or the external hash method described later. The "collision" described in the data compression circuit of FIG. 1 also occurs in this data restoration circuit. For example, this is a case where the second dictionary search means 12 makes a search request and the second dictionary registration means 16 makes a registration request at the same time. In this case, for example, the second dictionary registration unit 16
After the registration processing, the search processing of the second dictionary search unit 12 is sequentially performed. As in the case of the data compression circuit, collision is a very rare phenomenon from the viewpoint of the entire decoding process, and is only negligible in the overall processing time. There is no problem in performing the processing.

【００５０】そして、復号化手段１１、第２の辞書検索
手段１２、第１の蓄積手段１３、第２の蓄積手段１４、
出力選択手段１５及び第２の辞書登録手段１６の各手段
は、いずれも図示されていないパイプラインで接続され
ており、並列に処理することができる。このパイプライ
ン並列処理により、複数の手段での符号化処理が可能と
なり、高速に符号化処理を行うことができる。以下、こ
の復号化処理手順の一例を図４に示す。Then, the decoding means 11, the second dictionary search means 12, the first storage means 13, the second storage means 14,
Each of the output selection means 15 and the second dictionary registration means 16 are connected by a pipeline (not shown), and can be processed in parallel. By this pipeline parallel processing, encoding processing by a plurality of means becomes possible, and encoding processing can be performed at high speed. Hereinafter, an example of this decoding processing procedure is shown in FIG.

【００５１】図４は、本発明のデータ復元回路による処
理手順を示す図である。図には、処理符号数欄２１、復
号化欄２２、検索・蓄積欄２３、文字列出力欄２４及び
辞書登録欄２５からなる表を示す。FIG. 4 is a diagram showing a processing procedure by the data restoration circuit of the present invention. The figure shows a table including a processing code number field 21, a decoding field 22, a search / accumulation field 23, a character string output field 24, and a dictionary registration field 25.

【００５２】処理符号数欄２１は、入力符号が復号化さ
れる際に処理される符号数を示す。復号化欄２２は、図
３の復号化手段１１が処理する内容を示す。検索・蓄積
欄２３は、第２の辞書検索手段１２が検索し、スタック
蓄積手段１３，１４が検索された文字を蓄積する処理を
示す。文字列出力欄２４は、出力選択手段１５がスタッ
ク蓄積手段１３，１４に蓄積した文字を全て出力する処
理を示す。辞書登録欄２５は、第２の辞書登録手段１６
が処理する内容を示す。The number of processed codes column 21 indicates the number of codes to be processed when the input code is decoded. The decryption column 22 indicates the content processed by the decryption means 11 of FIG. The search / accumulation column 23 indicates a process in which the second dictionary search means 12 searches and the stack storage means 13 and 14 accumulate the searched characters. The character string output column 24 shows a process in which the output selection unit 15 outputs all the characters stored in the stack storage units 13 and 14. The dictionary registration section 25 stores the second dictionary registration unit 16
Indicates the content to be processed.

【００５３】なお、円内の数字は処理手順の順番を示
す。以下、図２と同様に、「円内の数字Ｘ」を「サイク
ルＸ」と表記する。また、同じ円内の数字がある場合に
は、同時に並列して処理が行われることを示す。そし
て、初期辞書Ｄ３とのアクセスは辞書アクセス手段１７
を介して行われるものとして、この過程の表記を省略す
る。The numbers in the circle indicate the order of the processing procedure. Hereinafter, similarly to FIG. 2, “the number X in a circle” is described as “cycle X”. If there is a number in the same circle, it indicates that the processing is performed in parallel at the same time. The access to the initial dictionary D3 is performed by the dictionary access unit 17.
, The description of this process is omitted.

【００５４】まず、サイクル１では、情報源から入力さ
れた最初の入力符号について、復号化手段１１が復号化
する。復号化欄２２のサイクル２は、２番目の入力符号
について、復号化手段１１が復号化する。同時に、検索
・蓄積欄２３のサイクル２は、第２の辞書検索手段１２
が復号化した符号文字を辞書Ｄ３から検索する。また、
検索された復元文字は、スタック蓄積手段１３によって
スタックに蓄積される。これらのサイクル２は、並列し
て処理が行われる。First, in cycle 1, the decoding means 11 decodes the first input code input from the information source. In cycle 2 of the decoding field 22, the decoding means 11 decodes the second input code. At the same time, cycle 2 of the search / accumulation column 23
Searches the dictionary D3 for code characters decoded by. Also,
The retrieved restored characters are stored in the stack by the stack storage unit 13. These cycles 2 are performed in parallel.

【００５５】次に、サイクル３及びサイクル４では、後
述する再帰的な復号により、第２の辞書検索手段１２が
復号化した符号文字を辞書Ｄ３から検索し、スタック蓄
積手段１３が検索された復元文字をスタックに蓄積す
る。Next, in cycle 3 and cycle 4, the second dictionary search means 12 searches the dictionary D3 for code characters decoded by recursive decoding, which will be described later, and the stack storage means 13 restores the searched character. Accumulate characters on the stack.

【００５６】そして、復号化欄２２のサイクル５では、
３番目の入力符号について、復号化手段１１が復号化す
る。同時に、検索・蓄積欄２３のサイクル５は、第２の
辞書検索手段１２が復号化した符号文字を辞書Ｄ３から
検索し、スタック蓄積手段１４が検索された復元文字を
スタックに蓄積する。さらに同時に、文字列出力欄２４
のサイクル５では、サイクル３〜サイクル４で蓄積した
文字列の１文字目を出力する。これらのサイクル５は、
並列して処理が行われる。Then, in cycle 5 of the decryption column 22,
The decoding means 11 decodes the third input code. At the same time, in cycle 5 of the search / accumulation column 23, the code character decoded by the second dictionary search means 12 is searched from the dictionary D3, and the stack storage means 14 stores the searched restored character in the stack. At the same time, the character string output field 24
In cycle 5, the first character of the character string accumulated in cycle 3 to cycle 4 is output. These cycles 5
Processing is performed in parallel.

【００５７】それから、検索・蓄積欄２３のサイクル６
は、第２の辞書検索手段１２が復号化した符号文字を辞
書Ｄ３から検索し、スタック蓄積手段１４が検索された
復元文字をスタックに蓄積する。同時に、文字列出力欄
２４のサイクル６では、サイクル３〜サイクル４で蓄積
した文字列の２文字目を出力する。これらのサイクル６
も、並列して処理が行われる。以下、同様な処理が続け
られる。Then, cycle 6 of the search / accumulation column 23
Retrieves the code characters decoded by the second dictionary retrieval unit 12 from the dictionary D3, and the stack storage unit 14 stores the retrieved restored characters in the stack. At the same time, in cycle 6 of the character string output column 24, the second character of the character string accumulated in cycle 3 to cycle 4 is output. These cycles 6
Are also performed in parallel. Hereinafter, similar processing is continued.

【００５８】このように、復号化欄２２で行う復号化と
並列して、一方のスタック蓄積手段が検索した文字を蓄
積し、他方のスタック蓄積手段が蓄積した文字列を復元
文字列として出力する処理を行うことができる。このた
め、復号化処理を高速に行うことができる。As described above, in parallel with the decoding performed in the decoding section 22, the character retrieved by one of the stack storage units is stored, and the character string stored by the other stack storage unit is output as a restored character string. Processing can be performed. Therefore, the decoding process can be performed at high speed.

【００５９】図５は、本発明のデータ圧縮・復元装置の
実施例を示す図である。図において、データ圧縮・復元
装置はデータ圧縮回路及びデータ復元回路から構成され
る。データ圧縮回路は第１の辞書検索手段１、第１の辞
書登録手段２、符号化手段３、初期辞書アクセス手段
４、通常辞書アクセス手段５、初期辞書Ｄ１及び通常辞
書Ｄ２から構成される。また、データ復元回路は、デー
タ復元回路は、復号化手段１１、第２の辞書検索手段１
２、第１の蓄積手段１３、第２の蓄積手段１４、出力選
択手段１５、第２の辞書登録手段１６、辞書アクセス手
段１７及び辞書Ｄ３から構成される。FIG. 5 is a diagram showing an embodiment of the data compression / decompression device of the present invention. In the figure, the data compression / decompression device includes a data compression circuit and a data decompression circuit. The data compression circuit includes a first dictionary search unit 1, a first dictionary registration unit 2, an encoding unit 3, an initial dictionary access unit 4, a normal dictionary access unit 5, an initial dictionary D1, and a normal dictionary D2. The data restoration circuit includes a decoding unit 11 and a second dictionary search unit 1.
2, a first storage unit 13, a second storage unit 14, an output selection unit 15, a second dictionary registration unit 16, a dictionary access unit 17, and a dictionary D3.

【００６０】なお、図１及び図３と同一の要素には同一
の番号を付し、説明を省略する。また、データ圧縮回路
及びデータ復元回路の作動は、それぞれ図１及び図３と
同じであるので説明を省略する。The same elements as those in FIGS. 1 and 3 are denoted by the same reference numerals, and description thereof will be omitted. The operations of the data compression circuit and the data decompression circuit are the same as those shown in FIGS.

【００６１】この装置により、データの圧縮処理及び復
元処理の処理速度が向上する。なお、図２及び図４で示
した処理手順のように、複数の手段が並列して処理を行
うことができるので、より装置全体の処理速度を向上さ
せることができる。With this apparatus, the processing speed of the data compression processing and the decompression processing is improved. In addition, as in the processing procedure shown in FIGS. 2 and 4, a plurality of units can perform processing in parallel, so that the processing speed of the entire apparatus can be further improved.

【００６２】次に、上記の各実施例における辞書検索手
順及び辞書登録手順について、まずハッシュ法について
説明する。ハッシュ法は、ハッシュ表（hash table）と
呼ばれる表を用いてデータの格納及びデータの検索を行
う方法の一つであり、データを登録するために検索キー
の内部コードωを用いて格納アドレスを決定する方法で
ある。このために、検索キーの内部コードωからアドレ
スを求める関数が必要となり、この関数は「ハッシュ関
数（hash function ）」と呼ばれている。また、ハッシ
ュ関数Ｈによって得られたアドレスＨ（ω）は「ハッシ
ュアドレス（hash address）」と呼ばれている。なお、
検索も登録の場合と同様に、ハッシュ関数によりハッシ
ュアドレスを求めて目的のアドレスのデータを検索す
る。Next, the dictionary search procedure and the dictionary registration procedure in each of the above embodiments will be described first with regard to the hash method. The hash method is one of the methods of storing data and retrieving data using a table called a hash table. In order to register data, a storage address is stored using an internal code ω of a search key. How to decide. For this reason, a function for obtaining an address from the internal code ω of the search key is required, and this function is called a “hash function”. The address H (ω) obtained by the hash function H is called “hash address”. In addition,
In the search, similarly to the case of registration, a hash address is obtained by a hash function and data of a target address is searched.

【００６３】このようなハッシュ法を使用した辞書検索
及び辞書登録では、ハッシュ関数Ｈをどのように選んで
も、相異なる検索キーの内部コードω₁，ω₂に対し
て、ハッシュアドレスがＨ（ω₁）＝Ｈ（ω₂）となる場合が起こり得る。このような状態を「衝突」と
呼び、この衝突を回避するために外部ハッシュ法（「オ
ープンハッシュ法」又は「連鎖法」とも呼ばれている）
が用いられる。また、衝突が起こらないように、予め表
に検索キーの内部コードωにとり得る全ての値を用意す
る方法が完全ハッシュ法である。以下、外部ハッシュ法
による場合と完全ハッシュ法による場合とに分けて説明
する。In the dictionary search and dictionary registration using the hash method, no matter how the hash function H is selected, the hash address is H (ω) for the internal codes ω ₁ and ω _{2 of} different search keys. ₁ ) = H (ω ₂ ). Such a state is called "collision", and an external hash method (also called "open hash method" or "chain method") is used to avoid the collision.
Is used. In addition, a method of preparing all possible values for the internal code ω of the search key in a table in advance so as not to cause a collision is a complete hash method. Hereinafter, the case using the external hash method and the case using the complete hash method will be described separately.

【００６４】図６は、外部ハッシュ法のデータ構造を示
す図である。外部ハッシュ法では、まずハッシュ関数Ｈ
によって得られるアドレスに対応する表、すなわちバケ
ットヘッダ（Bucket Headder）ＢＨと呼ばれる配列が用
意される。図では、このバケットヘッダＢＨにはハッシ
ュアドレスが０から（ｂ−１）までのｂ個のリストヘッ
ダ（list headder）が用意されている。そして、リスト
ヘッダの一つと、同一のハッシュアドレスを有する一以
上のデータ要素、すなわちリストＬ、との間はリスト構
造によって結合される。ここで、一つのリストは、デー
タを格納するデータ格納域と、次のリストへのポインタ
を格納するポインタ格納域とから構成される。FIG. 6 is a diagram showing a data structure of the external hash method. In the external hash method, first, a hash function H
A table corresponding to the address obtained by the above, that is, an array called a bucket header (Bucket Headder) BH is prepared. In the figure, the bucket header BH is provided with b list headers having hash addresses from 0 to (b-1). Then, one of the list headers and one or more data elements having the same hash address, that is, the list L, are connected by a list structure. Here, one list includes a data storage area for storing data and a pointer storage area for storing a pointer to the next list.

【００６５】例えば、図６に示すように、ハッシュアド
レス「０」に示すリストヘッダにはリストＬ０１へのポ
インタの設定によって、リストヘッダとリストＬ０１と
が結合される。この関係を矢印Ａ１で示す。また、リス
トＬ０１にはリストＬ０２へのポインタの設定によっ
て、リストＬ０１とリストＬ０２とが結合される。この
関係を矢印Ａ２で示す。なお、リストＬ０２以降がポイ
ンタによってリスト間が結合されない場合は、終端記号
「０」が設定される。以下、ハッシュアドレスが１から
（ｂ−１）までのリストヘッダも同様に、リストＬとは
リスト構造によって結合される。For example, as shown in FIG. 6, the list header indicated by the hash address “0” is combined with the list L01 by setting a pointer to the list L01. This relationship is indicated by arrow A1. The list L01 is linked to the list L02 by setting a pointer to the list L02. This relationship is indicated by arrow A2. If the list L02 and subsequent lists are not linked by a pointer, the terminal symbol “0” is set. Hereinafter, the list headers having hash addresses 1 to (b-1) are similarly connected to the list L by the list structure.

【００６６】なお、データの検索は次の手順で処理が行
われる。なお、ここではハッシュ関数Ｈにより求められ
るハッシュアドレスが「３」である場合を例に説明す
る。求められたハッシュアドレス「３」に対応するバケ
ットヘッダＢＨのリストヘッダから、最初のリストＬ３
１へのポインタを取得する。そして、最初のリストＬ３
１内のデータと照合する。もし、データが一致しなけれ
ば、次のリストＬ３２へのポインタを取得し、次のリス
トＬ３２のデータと照合する。もし、データが一致しな
ければ、所望のデータが見つからなかったことを示す。
なお、リストＬ３２に次のリストへのポインタが設定さ
れているならば、同様の検索処理を経て、目的のデータ
を検索する。The data search is performed in the following procedure. Here, the case where the hash address obtained by the hash function H is “3” will be described as an example. From the list header of the bucket header BH corresponding to the obtained hash address “3”, the first list L3
Obtain a pointer to 1. And the first list L3
Check with the data in 1. If the data does not match, a pointer to the next list L32 is obtained and collated with the data of the next list L32. If the data does not match, it indicates that the desired data was not found.
Note that if a pointer to the next list is set in the list L32, the target data is searched through the same search processing.

【００６７】また、データの登録は次の手順で処理が行
われる。データの検索と同様に、ハッシュ関数Ｈにより
求められるハッシュアドレスが「３」である場合を例に
説明する。求められたハッシュアドレス「３」に対応す
るバケットヘッダＢＨのリストヘッダから最初のリスト
Ｌ３１へのポインタを取得する。そして、次のリストへ
のポインタが「０」になるまでリストをたどる。図６の
例ではリストＬ３２が最後のリストであるから、ここで
新たなデータのリストを生成し、生成したリストへのポ
インタをリストＬ３２内のポインタ格納域に設定する。The data registration is performed in the following procedure. Similar to the data search, a case where the hash address obtained by the hash function H is “3” will be described as an example. The pointer to the first list L31 is obtained from the list header of the bucket header BH corresponding to the obtained hash address “3”. The list is followed until the pointer to the next list becomes "0". In the example of FIG. 6, since the list L32 is the last list, a new data list is generated here, and a pointer to the generated list is set in the pointer storage area in the list L32.

【００６８】このようなデータ構造を有する辞書におい
て、データの検索及び登録処理は次のように行われる。
図７は、外部ハッシュ法による辞書検索手順及び辞書登
録手順の一例を示す図である。図において、辞書は検索
及び登録処理を速くするために firstメモリＦＭ、next
メモリＮＭ及び extentionメモリＥＭの３つの物理的な
メモリから構成されている。また、これらの３つのメモ
リは登録するデータの内容に応じて初期辞書と、通常辞
書とに分けられる。In a dictionary having such a data structure, data search and registration processing is performed as follows.
FIG. 7 is a diagram showing an example of a dictionary search procedure and a dictionary registration procedure using the external hash method. In the figure, the dictionary is first memory FM, next to speed up the search and registration process.
It is composed of three physical memories, a memory NM and an extention memory EM. These three memories are divided into an initial dictionary and a normal dictionary according to the contents of data to be registered.

【００６９】まず、この辞書を使用したデータの検索
は、次の手順で処理が行われる。なお、ここでは検索す
る文字列が「Ｋ₂₂Ｋ₃₂Ｋ₄₂」であり、この文字列のハッ
シュ関数Ｈにより求められるハッシュアドレスが
「ω₁」である場合を例に説明する。First, data retrieval using this dictionary is performed in the following procedure. Here, a case where the character string to be searched is “K ₂₂ K ₃₂ K ₄₂ ” and the hash address of the character string obtained by the hash function H is “ω ₁ ” will be described as an example.

【００７０】まず、ハッシュ関数Ｈによって得られるア
ドレスω₁から、初期辞書の firstメモリＦＭのアドレ
スω₁に格納されている次のリストへのポインタω₂₁を
取得する。そして、次のリストへのポインタω₂₁から、
extentionメモリＥＭのアドレスω₂₁に格納されている
データＫ₂₁と照合する。この場合、文字列Ｋ₂₂Ｋ₃₂Ｋ ₄₂
の最初の文字Ｋ₂₂とは一致しないため、nextメモリＮＭ
のアドレスω₂₁から次のリストへのポインタω₂₂を取得
する。First, the algorithm obtained by the hash function H
Dress ω₁From the address of the first memory FM of the initial dictionary
Su ω₁Pointer ω to the next list stored in_{twenty one}To
get. And a pointer ω to the next list_{twenty one}From
address ω of extention memory EM_{twenty one}Stored in
Data K_{twenty one}To match. In this case, the character string K_{twenty two}K₃₂K ₄₂
The first letter K_{twenty two}And the next memory NM
Address ω_{twenty one}Pointer to the next list ω_{twenty two}Get
I do.

【００７１】次に、次のリストへのポインタω₂₂から、
通常辞書の firstメモリＦＭのアドレスω₂₂に格納され
ている次のリストへのポインタω₃₁を取得する。以下、
初期辞書の場合と同様な検索が行われる。そして、最後
は extentionメモリＥＭのアドレスω₄₁に格納されてい
るデータＫ₂₁と照合したが、一致する文字列が検索でき
ずに終了した。この過程を図７では矢印で示す。Next, from the pointer ω ₂₂ to the next list,
To get a pointer ω ₃₁ to the next list that is stored in the first memory FM of the address ω ₂₂ of the normal dictionary. Less than,
A search similar to that of the initial dictionary is performed. Finally has been collated with the data K ₂₁ stored in the address omega ₄₁ of extention memory EM, matching string is terminated can not find. This process is indicated by an arrow in FIG.

【００７２】また、この辞書を使用したデータの登録
は、次の手順で処理が行われる。なお、ここでは上記に
おいて検索できなかった文字列「Ｋ₂₂Ｋ₃₂Ｋ₄₂」を登録
する場合を例に説明する。上記のデータの検索と同様の
処理によって、次のリストへのポインタが「０」になる
までリストをたどる。この場合、nextメモリＮＭのアド
レスがω₄₁になる。ここで、文字Ｋ₄₂を extentionメモ
リＥＭの他のアドレスに生成する。図７では、このアド
レスがω₄₂であり、同じアドレスのnextメモリＮＭには
「０」が設定される。また、リストを連結するため、ne
xtメモリＮＭのアドレスω₄₁には、新たに生成したリス
トへのポインタであるω₄₂を登録する。こうして、新た
な文字列の登録が行われる。The registration of data using this dictionary is performed in the following procedure. Here, a case will be described as an example where a character string “K ₂₂ K ₃₂ K ₄₂ ” that could not be searched above is registered. By the same processing as the above data search, the list is traced until the pointer to the next list becomes "0". In this case, the address of the next memory NM is ω _41. Here, to generate the character K ₄₂ to other addresses extention memory EM. In FIG. 7, this address is ω _42, the next memory NM of the same address is set to "0". Also, to concatenate the lists, ne
The address ω ₄₁ of xt memory NM, to register the ω ₄₂ is a pointer to the list, which was newly created. Thus, a new character string is registered.

【００７３】なお、上記のデータ検索及びデータ登録を
行う初期辞書には第２文字までの文字列について行い、
通常辞書には第３文字以降の文字列について行う。こう
することにより、入力文字列が最適に分散されるため、
データ検索及びデータ登録を行う際の処理時間を抑える
ことができる。In the initial dictionary for performing the data search and data registration, a character string up to the second character is set.
Normally, a dictionary is used for a character string after the third character. This allows the input strings to be optimally distributed,
The processing time when performing data search and data registration can be suppressed.

【００７４】また、初期辞書には第２文字までの文字
列、通常辞書は第３文字以降の文字列に限ることなく、
例えば初期辞書には第３文字までの文字列、通常辞書に
は第４文字以降の文字列というように、入力文字列の性
質に応じて変更してもよい。こうすることにより、様々
な入力文字列の性質に対応して最適な辞書を構成するこ
とができる。The initial dictionary is not limited to the character string up to the second character, and the normal dictionary is not limited to the character string up to the third character.
For example, the character string may be changed according to the properties of the input character string, such as a character string of up to the third character in the initial dictionary and a character string of the fourth and subsequent characters in the normal dictionary. By doing so, it is possible to construct an optimal dictionary corresponding to the properties of various input character strings.

【００７５】図８は、完全ハッシュ法による辞書検索手
順及び辞書登録手順の一例を示す図である。完全ハッシ
ュ法は、全く衝突が起こらないようなハッシュ関数によ
るキー検索法である。この完全ハッシュ法で構築される
辞書は、例えば後述する図１４に示す木構造であって、
各「ノード」ごとに２５６本の「枝」を持つような構造
を有する。FIG. 8 is a diagram showing an example of a dictionary search procedure and a dictionary registration procedure using the complete hash method. The complete hash method is a key search method using a hash function that does not cause any collision. The dictionary constructed by the perfect hash method has, for example, a tree structure shown in FIG.
It has a structure such that each "node" has 256 "branches".

【００７６】図において、辞書は完全ハッシュメモリＰ
Ｍのみの物理的なメモリから構成され、登録するデータ
の内容に応じて初期辞書と、通常辞書とに分けられる。
また、完全ハッシュメモリＰＭのアドレスは、各文字列
のデータと一対一に対応させるため、ハッシュアドレス
ωとデータアドレスＫとから構成される。In the figure, the dictionary is a complete hash memory P
It is composed of only M physical memories, and is divided into an initial dictionary and a normal dictionary according to the contents of data to be registered.
The address of the complete hash memory PM is composed of a hash address ω and a data address K in order to correspond one-to-one with data of each character string.

【００７７】例えば、ハッシュアドレスωが２０ビット
で表現されるアドレスであり、データアドレスＫが８ビ
ットで表現されるアドレスならば、これらを合わせた２
８ビットのアドレスが完全ハッシュメモリＰＭのアドレ
スとして用いられる。すなわち、各ハッシュアドレスω
は、２５６個のデータアドレスＫのブロックごとの先頭
アドレスを示す。なお、図８では完全ハッシュメモリＰ
Ｍのアドレスを「ω_nn・Ｋ_xx」で表す。For example, if the hash address ω is an address represented by 20 bits and the data address K is an address represented by 8 bits, the sum of these two addresses is obtained.
An 8-bit address is used as an address of the complete hash memory PM. That is, each hash address ω
Indicates the head address of each of the 256 data addresses K. In FIG. 8, the complete hash memory P
The address of M is represented by “ω _nn · K _xx ”.

【００７８】そして、この辞書を使用したデータの検索
は、次の手順で処理が行われる。なお、ここでは検索す
る文字列が「Ｋ₂Ｋ₃Ｋ₄Ｋ₅」であり、この文字列の
ハッシュ関数Ｈにより求められるハッシュアドレスが
「ω₁」である場合を例に説明する。The data retrieval using this dictionary is performed in the following procedure. Here, the case where the character string to be searched is “K ₂ K ₃ K ₄ K ₅ ” and the hash address of the character string obtained by the hash function H is “ω ₁ ” will be described as an example.

【００７９】まず、ハッシュ関数Ｈによって得られるア
ドレスω₁と、文字列の第１文字Ｋ ₂のデータから、初
期辞書のアドレスω₁・Ｋ₂に格納されているフラグＦ
Ｌを取得する。フラグＦＬは、このアドレスが有効か無
効かを「１」又は「０」で示す。この場合、フラグＦＬ
には「１」が設定されているので、文字Ｋ₂は有効であ
る。したがって、検索する文字列の第１文字が辞書と一
致したことを示す。First, an algorithm obtained by the hash function H
Dress ω₁And the first character K of the character string _TwoFrom the data of
Period dictionary address ω₁・ K_TwoF stored in the
Get L. The flag FL indicates whether this address is valid or not.
The effectiveness is indicated by “1” or “0”. In this case, the flag FL
Is set to "1", the character K_TwoIs valid
You. Therefore, the first character of the string to be searched matches the dictionary.
Indicates that you have done it.

【００８０】次に、文字列の第２文字Ｋ₃が有効か否か
を検査するため、アドレスω₁・Ｋ ₂に格納されている
次へのポインタω₂₁を取得する。そして、アドレスω₂₁
と、文字列の第２文字Ｋ₃のデータから、初期辞書のア
ドレスω₂₁・Ｋ₃に格納されているフラグＦＬを取得す
る。以下、同様の処理によって検索が行われる。そし
て、最後はアドレスω₄₁・Ｋ₅に格納されているフラグ
ＦＬと照合して、一致する文字列が検索できずに終了し
ている。この過程を図８では矢印で示す。Next, the second character K of the character string_ThreeIs valid or not
To check the address ω₁・ K _TwoStored in
Next pointer ω_{twenty one}To get. And the address ω_{twenty one}
And the second character K of the character string_ThreeFrom the data in
Dress ω_{twenty one}・ K_ThreeThe flag FL stored in the
You. Hereinafter, a search is performed by the same processing. Soshi
And finally the address ω₄₁・ K_FiveFlag stored in
Compared with FL, it ends without finding a matching character string
ing. This process is indicated by an arrow in FIG.

【００８１】また、この辞書を使用したデータの登録
は、次の手順で処理が行われる。なお、ここでは上記に
おいて検索できなかった文字列「Ｋ₂Ｋ₃Ｋ₄Ｋ₅」を
登録する場合を例に説明する。上記のデータの検索と同
様の処理によって、次のリストへのポインタが「０」に
なるまでリストをたどる。この場合、完全ハッシュメモ
リＰＭのアドレスがω₄₁・Ｋ₅になる。ここで、フラグ
ＦＬを「１」に設定する。こうして、新たな文字列の登
録が行われる。The registration of data using this dictionary is performed in the following procedure. Here, an example will be described in which a character string “K ₂ K ₃ K ₄ K ₅ ” that cannot be searched is registered. By the same processing as the above data search, the list is traced until the pointer to the next list becomes "0". In this case, the address of a complete hash memory PM is ω ₄₁ · K _5. Here, the flag FL is set to “1”. Thus, a new character string is registered.

【００８２】次に、上記の実施例で示したＬＺＷ符号化
手段及びＬＺＷ復号化手段について説明する。図９は、
入力文字列をＬＺＷ符号の符号化アルゴリズムによって
符号化する場合の具体例を示す図である。この入力文字
列は、ａ，ｂ，ｃの３文字だけの組み合わせからなる文
字列である。まず、予め辞書に、１文字のａ，ｂ，ｃだ
けをそれぞれ符号１，２，３に対応づけて登録する初期
化を行う。Next, the LZW encoding means and LZW decoding means shown in the above embodiment will be described. FIG.
FIG. 9 is a diagram illustrating a specific example of a case where an input character string is encoded by an LZW encoding algorithm. This input character string is a character string composed of a combination of only three characters a, b, and c. First, initialization is performed in which only a, b, and c of one character are registered in the dictionary in association with codes 1, 2, and 3, respectively.

【００８３】まず、入力文字列７１を左から右へ一字ず
つ読み込む。最初の文字ａを読み込み、このａを語頭文
字（列）(prefix string) とする。次に、２番目の文字
ｂを読み込み、先の語頭文字ａにこのｂを加えたａｂを
辞書の登録文字列と照合する。このとき、ａｂに一致す
る文字列が辞書にないので、先の語頭文字ａの対応符号
１を符号化出力として出力する。この出力される符号
を、出力符号欄７２に示す。同時に、文字列ａｂを符号
４に対応させて辞書に登録する。この辞書に登録される
内容を、登録内容欄７３に示す。ここで、改めて２番目
の入力文字ｂを語頭文字とする。First, the input character string 71 is read one character at a time from left to right. The first character a is read, and this a is set as a prefix string. Next, the second character b is read, and ab obtained by adding the b to the initial letter a is compared with the registered character string in the dictionary. At this time, since there is no character string that matches ab in the dictionary, the corresponding code 1 of the initial character a is output as an encoded output. The output code is shown in the output code column 72. At the same time, the character string ab is registered in the dictionary in correspondence with the code 4. The contents registered in this dictionary are shown in the registered contents column 73. Here, let the second input character b be the initial character again.

【００８４】次に、入力文字列７１の３番目の文字ａを
読み込み、語頭文字ｂにこのａを加えたｂａを辞書の登
録文字列と照合する。このとき、ｂａに一致する文字列
が辞書にはないので、語頭文字ｂの対応符号２を符号化
出力として出力するとともに、文字列ｂａを符号５に対
応させて辞書に登録する。また、改めて３番目の入力文
字ａを語頭文字とする。Next, the third character a of the input character string 71 is read, and ba obtained by adding the letter a to the initial character b is compared with the registered character string in the dictionary. At this time, since there is no character string matching ba, the corresponding code 2 of the initial character b is output as an encoded output, and the character string ba is registered in the dictionary in correspondence with code 5. Further, the third input character a is set as the initial character again.

【００８５】さらに、４番目の文字ｂを読み込み、語頭
文字ａにこのｂを加えたａｂを辞書の登録文字列と照合
する。このとき、辞書にはａｂに一致する文字列が登録
されているので、このときはａｂを語頭文字列とする。
さらに、５番目の入力文字ｃを読み込み、語頭文字列ａ
ｂにこのｃを加えたａｂｃを辞書の登録文字列と照合す
る。このとき、ａｂｃに一致する文字列が辞書にないの
で、語頭文字列ａｂの対応符号４を符号化出力として出
力するとともに、文字列ａｂｃを符号６に対応させて辞
書に登録する。そして、改めて５番目の入力文字ｃを語
頭文字とする。Further, the fourth character b is read, and ab obtained by adding the b to the initial letter a is collated with the registered character string in the dictionary. At this time, since a character string matching ab is registered in the dictionary, ab is set as the initial character string at this time.
Further, the fifth input character c is read and the initial character string a
The abc obtained by adding c to b is compared with the registered character string in the dictionary. At this time, since there is no character string matching abc in the dictionary, the corresponding code 4 of the initial character string ab is output as an encoded output, and the character string abc is registered in the dictionary in association with the code 6. Then, the fifth input character c is set as the initial character again.

【００８６】以下、同様のアルゴリズムにより、符号化
と辞書登録を続ける。このアルゴリズムで入力文字列
ａ，ｂ，ａ，ｂ，ｃ，・・・に対して符号化が行われ、
図９の出力符号欄７２に示すような符号１，２，４，
３，・・・が符号化出力として出力される。そして、図
１１（Ａ）に示すような登録文字列９１と対応符号９２
との対応関係が辞書に登録される。Hereinafter, encoding and dictionary registration are continued by the same algorithm. Encoding is performed on the input character strings a, b, a, b, c,.
Codes 1, 2, 4, and 4 as shown in the output code field 72 in FIG.
Are output as encoded outputs. Then, a registered character string 91 and a corresponding code 92 as shown in FIG.
Is registered in the dictionary.

【００８７】図１０は、以上に例示した符号化の処理手
順を示すフローチャートである。図において、Ｓに続く
数字はステップ番号を示す。〔Ｓ１０１〕予め初期化によって、入力される可能性の
ある全一文字に対しそれぞれ符号を対応させて辞書に登
録する。また、辞書において次に登録すべきアドレスｎ
を、例えば２５６に設定する。ここで、ｎは辞書に登録
される文字列に対応して符号を０，１，２，・・・と付
した場合、登録文字列の総数に相当する。さらに、入力
文字列を読み込み、入力した最初の文字を語頭文字列
（prefix string ）ωとする。FIG. 10 is a flowchart showing the procedure of the above-described encoding process. In the figure, numbers following S indicate step numbers. [S101] By initialization, a code is registered in the dictionary in such a manner that a code is associated with all the characters that may be input. The address n to be registered next in the dictionary
Is set to, for example, 256. Here, n is equivalent to the total number of registered character strings when the codes are assigned 0, 1, 2,... Corresponding to the character strings registered in the dictionary. Further, the input character string is read, and the first character input is set as a prefix string ω.

【００８８】〔Ｓ１０２〕次の入力文字Ｋを読み込む。〔Ｓ１０３〕ステップＳ１０２において、入力文字デー
タが存在したか否かを判別する。もし、入力文字データ
が存在すればステップＳ１０５へ進み、存在しなければ
ステップＳ１０４へ進む。[S102] The next input character K is read. [S103] In step S102, it is determined whether or not input character data exists. If the input character data exists, the process proceeds to step S105, and if not, the process proceeds to step S104.

【００８９】〔Ｓ１０４〕語頭文字列ωを辞書と照合
し、対応する符号code（ω）を読み出し、符号化出力と
して出力する。このとき、符号code（ω）のビット数が
〔log₂ｎ〕の２進数符号に変換して出力する。ここで、
記号〔ｘ〕は、数値ｘ以上の整数のうち、最小の整数を
表す。以下、この意味で記号〔ｘ〕を用いることにす
る。なお、このステップでは処理すべき文字列がないた
め、本ステップを実行後、本処理手順を終了する。[S104] The initial character string ω is collated with the dictionary, the corresponding code code (ω) is read, and output as an encoded output. At this time, the number of bits of the code (ω) is converted to a binary code of [log ₂ n] and output. here,
The symbol [x] represents the minimum integer among integers equal to or larger than x. Hereinafter, the symbol [x] will be used in this sense. In this step, since there is no character string to be processed, the present procedure is terminated after executing this step.

【００９０】〔Ｓ１０５〕語頭文字列ωに、ステップＳ
１０２で読み込んだ文字Ｋを加えた文字列ωＫを辞書と
照合し、文字列ωＫが辞書に登録されているか否かを判
別する。もし、登録されていればステップＳ１０６に進
み、登録されていなければステップＳ１０７に進む。[S105] Step S is added to the initial character string ω.
The character string ωK to which the character K read in 102 is added is collated with the dictionary, and it is determined whether or not the character string ωK is registered in the dictionary. If registered, the process proceeds to step S106, and if not registered, the process proceeds to step S107.

【００９１】〔Ｓ１０６〕文字列ωＫを改めて語頭文字
列ωとする。そして、再びステップＳ１０２に戻る。こ
のように、ステップＳ１０２〜ステップＳ１０６を繰り
返すことにより、入力文字列と一致する文字列として、
辞書に登録された文字列のうちの最大長の文字列が検索
される。[S106] The character string ωK is newly set as the initial character string ω. Then, the process returns to step S102 again. In this way, by repeating steps S102 to S106, a character string that matches the input character string
The longest character string among the character strings registered in the dictionary is searched.

【００９２】〔Ｓ１０７〕語頭文字列ωを辞書と照合
し、対応する符号code（ω）を読み出し、符号化出力と
して出力する。このときの符号code（ω）のビット数
は、〔log₂ｎ〕となる。また、文字列ωＫにｎの値を対
応させて辞書に登録する。すなわち、辞書のアドレスｎ
に文字列ωＫを記憶する。さらに、ステップＳ１０２で
読み込んだ文字Ｋを語頭文字列ωとするとともに、辞書
アドレスｎをインクリメントして、つぎの新たな入力文
字列に対するステップＳ１０２以降の実行に備える。[S107] The initial character string ω is collated with the dictionary, the corresponding code code (ω) is read, and output as an encoded output. The number of bits of the code code (ω) at this time is [log ₂ n]. Further, the value of n is associated with the character string ωK and registered in the dictionary. That is, the dictionary address n
Is stored as a character string ωK. Further, the character K read in step S102 is set as the initial character string ω, and the dictionary address n is incremented to prepare for execution of the next new input character string from step S102.

【００９３】図１２は、図９に示した符号化出力を復元
する場合の具体例を示す図である。予め復元側の辞書
に、初期化によって、符号１，２，３だけがそれぞれ文
字ａ，ｂ，ｃに対応づけられて登録されている。FIG. 12 is a diagram showing a specific example in the case where the encoded output shown in FIG. 9 is restored. In the dictionary on the restoration side, only codes 1, 2, and 3 are registered in association with characters a, b, and c, respectively, by initialization.

【００９４】まず、入力符号８１を左から右へ一つずつ
読み込む。最初の符号１を読み、辞書を参照して文字列
ａを復元する。このとき復元された文字列を、復元文字
列欄８２１に示す。最初の符号は、初期化によって必ず
辞書に登録されている。そして、２番目の符号２を読
み、辞書を参照して文字列ｂを復元する。このとき、前
回の入力符号１と今回復号した文字列の最初の一文字ｂ
とを組み合わせた「１ｂ」に符号４を対応させて辞書に
登録する。このときの登録された内容を、登録内容欄８
３に示す。First, the input code 81 is read one by one from left to right. The first code 1 is read, and the character string a is restored with reference to the dictionary. The character string restored at this time is shown in a restored character string column 821. The first code is always registered in the dictionary by initialization. Then, the second code 2 is read, and the character string b is restored with reference to the dictionary. At this time, the previous input code 1 and the first character b of the currently decoded character string
The code 4 is associated with “1b” obtained by combining the above and registered in the dictionary. The registered content at this time is stored in the registered content column 8
3 is shown.

【００９５】次に、入力文字列８１の３番目の符号４を
読み、辞書を参照して対応する「１ｂ」を読み出す。さ
らに、「１ｂ」の符号１を、辞書を参照して対応する文
字ａを読み出す。このような一連の読み出し繰り返し動
作を「再帰的な復号」と呼ぶ。これを、再帰的復号欄８
２に示す。これによって、文字列ａｂが復元され、復元
文字列として出力する。出力される文字列を、復元文字
列欄８３１に示す。同時に、前回の入力符号２と今回復
元した文字列の最初の一文字ａとを組み合わせた「２
ａ」に、符号５を対応させて辞書に登録する。Next, the third code 4 of the input character string 81 is read, and the corresponding "1b" is read with reference to the dictionary. Further, the code | symbol 1 of "1b" reads the corresponding character a with reference to a dictionary. Such a series of read repetition operations is called “recursive decoding”. This is referred to as a recursive decryption field 8
It is shown in FIG. Thus, the character string ab is restored and output as a restored character string. The output character string is shown in the restored character string column 831. At the same time, “2” is a combination of the previous input code 2 and the first character a of the character string restored this time.
"a" is registered in the dictionary in correspondence with the reference numeral 5.

【００９６】以下、同様のアルゴリズムにより文字列の
復元と辞書登録を続ける。このようにして入力符号１，
２，４，３，５，・・・に対して復元が行われ、図１２
の復元文字列欄８２１に示すような文字列ａ，ｂ，ａ
ｂ，ｃ，ｂａ，・・・が復元文字列として出力される。
そして、図１１（Ｂ）に示すような登録符号９３と対応
文字列９４との対応関係が辞書に登録される。Thereafter, restoration of a character string and registration of a dictionary are continued by the same algorithm. Thus, the input code 1,
Are restored for 2, 4, 3, 5,...
Character strings a, b, a as shown in the restored character string column 821 of FIG.
are output as restored character strings.
Then, the correspondence between the registration code 93 and the corresponding character string 94 as shown in FIG. 11B is registered in the dictionary.

【００９７】図１３は、以上に例示した復号化の処理手
順を示すフローチャートである。図において、Ｓに続く
数字はステップ番号を示す。〔Ｓ１１１〕予め初期化によって、入力される可能性の
ある符号に対しそれぞれ文字を対応させて辞書に登録す
る。また、辞書において次に登録すべきアドレスｎを、
例えば２５６に設定する。ここで、ｎは辞書に登録され
る文字列に対応して符号を０，１，２，・・・と付した
場合、登録文字列の総数に相当する。次に、入力符号を
読み込み、最初の入力符号CODE（バイナリコード）を１
０進数の入力符号ωに変換する。この場合、図１０の符
号化ではωは入力文字列であったが、この復号化ではω
は入力符号である点に注意されたい。そして、このωを
OLD ωとする。同時に、最初に入力する符号は既に辞書
に登録されているため、入力符号ωに対応する文字Ｄ
（ω）を辞書から検索し、復元された文字として出力す
る。なお、出力した文字を後述するステップＳ１１６の
例外処理のためにFINchar にセットしておく。FIG. 13 is a flowchart showing the procedure of the decoding process exemplified above. In the figure, numbers following S indicate step numbers. [S111] Characters are registered in the dictionary in advance by associating characters with codes which may be input by initialization. Also, the address n to be registered next in the dictionary is
For example, it is set to 256. Here, n is equivalent to the total number of registered character strings when the codes are assigned 0, 1, 2,... Corresponding to the character strings registered in the dictionary. Next, the input code is read, and the first input code CODE (binary code) is set to 1
It is converted into the input code ω of the 0-base number. In this case, ω is an input character string in the encoding of FIG.
Is an input code. And this ω
OLD ω. At the same time, since the code to be input first is already registered in the dictionary, the character D corresponding to the input code ω
(Ω) is retrieved from the dictionary and output as restored characters. The output character is set in FINchar for exception processing in step S116 described later.

【００９８】〔Ｓ１１２〕次の入力符号CODEを読み込
む。〔Ｓ１１３〕ステップＳ１１２において入力符号データ
が存在したか否かを判別する。もし、存在すればステッ
プＳ１１５へ進み、存在しなければ本処理手順を終了す
る。[S112] The next input code CODE is read. [S113] In step S112, it is determined whether or not the input code data exists. If it exists, the process proceeds to step S115, and if it does not exist, this processing procedure ends.

【００９９】〔Ｓ１１４〕読み込んだ入力符号CODEから
入力符号ωに変換するとともに、この入力符号ωをINω
にセットする。〔Ｓ１１５〕入力符号ωをｎと比較する。すなわち、入
力符号が辞書に登録されているか否か（ω≧ｎ）を判別
する。もし、ωがｎより小さいときにはステップＳ１１
７へ進み、ωがｎ以上のときにはステップＳ１１６へ進
む。なお、ωがｎ以上になるのは、例えば図１２の入力
符号欄８１が「８」のときである。[S114] The input code CODE is converted into the input code ω, and the input code ω is converted to INω.
Set to. [S115] The input code ω is compared with n. That is, it is determined whether or not the input code is registered in the dictionary (ω ≧ n). If ω is smaller than n, step S11
7 and when ω is equal to or more than n, the process proceeds to step S116. Note that ω is equal to or larger than n when, for example, the input code field 81 in FIG. 12 is “8”.

【０１００】〔Ｓ１１６〕ステップＳ１１１または前回
にステップＳ１１９で設定されたOLD ωおよびFINchar
の組（OLD ω，FINchar ）をωＫと置き換える。すなわ
ち、OLD ωにセットされた値をωに、FINchar にセット
された値をＫにセットする。そして、Ｋをスタックにプ
ッシュ（ＰＵＳＨ）する。なお、ωはステップＳ１１７
で復号化される。[S116] OLD ω and FINchar set in step S111 or last time in step S119
(OLD ω, FINchar) is replaced with ωK. That is, the value set in OLD ω is set to ω, and the value set in FINchar is set to K. Then, K is pushed (PUSH) onto the stack. Note that ω is set in step S117.
Is decrypted.

【０１０１】〔Ｓ１１７〕通常、入力符号ωは前回まで
の処理で辞書に登録されているため、入力符号ωに対応
する文字列Ｄ（ω）を辞書から読み出す。読み出した文
字列Ｄ（ω）をω_iＫに分解する。ω_iは符号、Ｋは復
号化文字である。そして、文字列Ｄ（ω）が、ω_iＫに
分解できない１文字であるか否かを判別する。Ｄ（ω）
がω_iＫに分解できるならばステップＳ１１８に進み、
Ｄ（ω）が１文字であるならばステップＳ１１９へ進
む。[S117] Normally, since the input code ω has been registered in the dictionary in the previous processing, the character string D (ω) corresponding to the input code ω is read from the dictionary. The read character string D (ω) is decomposed into ω _i K. ω _i is a sign and K is a decoded character. Then, it is determined whether or not the character string D (ω) is one character that cannot be decomposed into ω _i K. D (ω)
If can be decomposed into ω _i K, the process proceeds to step S118,
If D (ω) is one character, the process proceeds to step S119.

【０１０２】〔Ｓ１１８〕文字Ｋを一時的にスタックに
プッシュし、また符号ω_iを新たなωとし、再度ステッ
プＳ１１７に戻る。このステップＳ１１７およびステッ
プＳ１１８の実行を、Ｄ（ω）が１文字に至るまで繰り
返す。[0102] [S118] to push the character K to temporarily stack, also the sign ω _i as a new ω, returns to step S117 again. The execution of steps S117 and S118 is repeated until D (ω) reaches one character.

【０１０３】〔Ｓ１１９〕ステップＳ１１８でスタック
にプッシュした各文字をＬＩＦＯ（Last In Fast Out）
形式でポップ（ＰＯＰ）して復元文字列を出力する。例
えば、図１２の入力符号欄８１が「５」の場合ならば、
ａ，ｂの順でスタックにプッシュされ、ｂａという復元
文字列が出力される。同時に、今回復元した文字列の最
初の一文字をFINchar とし、前回セットされた OLDωと
FINchar との組（ OLDω，FINchar ）からなる文字列
を、ｎの値に対応させて辞書に登録する。すなわち、こ
の文字列を辞書のアドレスｎに記憶する。さらに、ｎを
インクリメントし、ステップＳ１１４でセットされたIN
ωをOLD ωにセットして、次のステップＳ１１２以降の
実行に備える。[S119] Each character pushed on the stack in step S118 is transferred to the LIFO (Last In Fast Out).
Pop (POP) in the format and output the restored character string. For example, if the input code field 81 in FIG. 12 is “5”,
The data is pushed onto the stack in the order of a and b, and a restored character string ba is output. At the same time, the first character of the restored string is set to FINchar, and the previously set OLDω and
A character string consisting of a pair with FINchar (OLDω, FINchar) is registered in the dictionary in correspondence with the value of n. That is, this character string is stored at the address n of the dictionary. Further, n is incremented and the IN set in step S114 is incremented.
ω is set to OLD ω to prepare for execution of the next step S112 and subsequent steps.

【０１０４】上述のように復号化処理では、図１３のス
テップＳ１１７〜ステップＳ１１９を繰り返し行うこと
によって符号化前のデータに復元する。すなわち、入力
符号ωは前回までの処理で辞書に登録されているため、
入力符号ωに対応する文字列Ｄ（ω）を辞書から読み出
す。また、読み出した文字列Ｄ（ω）をω_iＫに分解
し、この文字Ｋを一時的にスタックに退避させる。そし
て、符号ω_iを新たな入力符号ωとして、再度入力符号
ωに対応する文字列Ｄ（ω）を辞書から読み出す。これ
らの手順を、新たな入力符号ωが一文字になるまで再帰
的に繰り返す。そして、スタックに退避させた文字をＬ
ＩＦＯ形式でポップして出力するという方式である。As described above, in the decoding process, the data before encoding is restored by repeatedly performing steps S117 to S119 in FIG. That is, since the input code ω has been registered in the dictionary in the previous processing,
The character string D (ω) corresponding to the input code ω is read from the dictionary. Further, the read character string D (ω) is decomposed into ω _i K, and the character K is temporarily saved on the stack. Then, the character string D (ω) corresponding to the input code ω is read from the dictionary again, using the code ω _i as a new input code ω. These procedures are recursively repeated until the new input code ω becomes one character. Then, the character saved on the stack is L
This is a method of popping and outputting in the IFO format.

【０１０５】図１４は、上記処理手順等に使用した辞書
の木構造の一例を示す図である。この辞書の木構造は、
ＬＺＷ符号化手段及びＬＺＷ復号化手段において実現さ
れるアルゴリズムによる符号化及び復号化の際に用いら
れる辞書の内部構造を図示したものである。図１４にお
いて、円内の数字は識別番号を示し、この円内の数字が
付されている箇所を「ノード（node；節）」と呼ぶ。FIG. 14 is a diagram showing an example of the tree structure of the dictionary used in the above processing procedure and the like. The tree structure of this dictionary is
FIG. 3 illustrates an internal structure of a dictionary used for encoding and decoding by an algorithm realized in the LZW encoding unit and the LZW decoding unit. In FIG. 14, the numbers in the circles indicate identification numbers, and the places where the numbers in the circles are attached are called “nodes (nodes)”.

【０１０６】辞書５０は、ルート（root；根）５１を起
点とする。このルート５１には、文字は割り当てられな
い。そして、ルート５１の一階層下、すなわち第１階層
５２には一文字目の文字が登録される。この一文字目の
文字の登録は、相異なる文字が登録され、主に辞書５０
の初期化の時に行われる。図には「ａ」，「ｂ」及び
「ｃ」の３文字が登録されているが、実際には８ビット
のデータで表現可能な２５６文字が登録される。The dictionary 50 has a root (root) 51 as a starting point. No characters are assigned to this route 51. Then, the first character is registered at one level below the root 51, that is, at the first level 52. The registration of the first character is performed by registering different characters.
Is performed at the time of initialization of Although three characters “a”, “b” and “c” are registered in the figure, 256 characters that can be represented by 8-bit data are actually registered.

【０１０７】そして、第２階層５３から下の階層は、情
報源から入力された文字列を学習することによって登録
される文字である。なお、一つ下の階層を有するノード
を「枝（branch）」と呼び、一つ下の階層を有するノー
ドを「葉（leaf）」と呼ぶ。したがって、図では円内の
数字の２５，２６，１３，１４，２７，２８，１６，
６，・・・，２２，２３，２４のノードが「葉」であ
り、その他のノードは「枝」である。[0107] The layers below the second layer 53 are characters registered by learning a character string input from an information source. Note that a node having a lower hierarchy is called a "branch", and a node having a lower hierarchy is called a "leaf". Therefore, in the figure, the numbers 25, 26, 13, 14, 27, 28, 16,
The nodes 6,..., 22, 23, and 24 are “leaves”, and the other nodes are “branches”.

【０１０８】なお、あるノードが現在は「葉」であって
も、学習により「枝」となる可能性がある。例えば、
「ａｃｄ」という文字列を辞書５０に登録する場合、文
字列「ａｃ」は第１階層５２が「ａ」（円内の数字
１）、第２階層５３が「ｃ」（円内の数字６）として登
録されているので、第２階層５３の「ｃ」の下の第３階
層５４に、新たに「ｄ」を登録することになる。このと
き、円内の数字６のノードは「葉」から「枝」に変わ
る。Even if a certain node is currently a “leaf”, it may become a “branch” by learning. For example,
When the character string “acd” is registered in the dictionary 50, the character string “ac” is “a” (the number 1 in the circle) in the first hierarchy 52 and “c” (the number 6 in the circle) in the second hierarchy 53. ), “D” is newly registered in the third hierarchy 54 below “c” of the second hierarchy 53. At this time, the node of number 6 in the circle changes from “leaf” to “branch”.

【０１０９】上記の実施例の説明では、データ圧縮回路
において、符号はlog₂ｎ以上の最小の整数のビット数か
らなる出力符号で出力したが、本出願人が特願平3-1306
23号において開示したように、ビット端数補償、Phasin
g in Binary Codes 、あるいは多値算術符号からなる出
力符号で出力してもよい。In the above description of the embodiment, in the data compression circuit, the code is output as the output code having the minimum integer bit number of log ₂ n or more.
Bit fraction compensation, Phasin, as disclosed in No. 23
g in Binary Codes or an output code composed of multi-level arithmetic codes may be used.

【０１１０】また、初期辞書Ｄ１、通常辞書Ｄ２及び辞
書Ｄ３は外部ハッシュ又は完全ハッシュに基づき構築
し、辞書５０は木構造に基づき構築したが、他の構築法
に基づき辞書を構築してもよい。例えば、二進分木（バ
イナリ・ツリー）法によって辞書を構築し、辞書に登録
された文字列等のデータを二進検索（バイナリ・サー
チ）により検索するようにしてもよい。Although the initial dictionary D1, the normal dictionary D2, and the dictionary D3 are constructed based on an external hash or a complete hash, and the dictionary 50 is constructed based on a tree structure, the dictionary may be constructed based on another construction method. . For example, a dictionary may be constructed by a binary tree method, and data such as a character string registered in the dictionary may be searched by a binary search.

【０１１１】なお、上記の各実施例は、ワークステーシ
ョン等における文字コード、ベクトル情報、画像データ
などの圧縮及び復元に応用され、必要な記憶容量を大幅
に削減することができる。Each of the above embodiments is applied to compression and decompression of character codes, vector information, image data, and the like in a workstation or the like, and the required storage capacity can be greatly reduced.

【０１１２】また、通信回線を利用したデータ送受信に
おいても応用でき、通信時間の短縮を図ることができ
る。例えば、モデム、ファクシミリ等の通信機器に応用
することができる。Further, the present invention can be applied to data transmission / reception using a communication line, and the communication time can be reduced. For example, it can be applied to communication equipment such as a modem and a facsimile.

【０１１３】[0113]

【発明の効果】以上説明したように本発明では、データ
圧縮回路において、第１の辞書検索手段が初期辞書又は
通常辞書から、入力文字列と一致する最長の登録文字列
を検索し、同時に第１の辞書検索手段が最長の登録文字
列に一文字を加えた文字列に識別番号を付して通常辞書
に登録し、符号化手段が最長の登録文字列に付された識
別番号を出力符号に符号化するので、符号化処理を速く
行うことができる。As described above, according to the present invention, in the data compression circuit, the first dictionary search means searches the initial dictionary or the normal dictionary for the longest registered character string that matches the input character string, and The first dictionary search means attaches an identification number to the character string obtained by adding one character to the longest registered character string and registers it in the normal dictionary, and the encoding means uses the identification number assigned to the longest registered character string as an output code. Since the encoding is performed, the encoding process can be performed quickly.

【０１１４】また、データ圧縮回路において、初期辞書
及び通常辞書は外部ハッシュによるデータ検索及びデー
タ登録を行うように構成したので、データ検索及びデー
タ登録が速くなり、符号化処理をより速く行うことがで
きる。In the data compression circuit, the initial dictionary and the normal dictionary are configured to perform data search and data registration using an external hash, so that data search and data registration become faster, and the encoding process can be performed faster. it can.

【０１１５】さらに、データ圧縮回路において、初期辞
書には第２文字までの文字列についてデータ検索及びデ
ータ登録を行い、通常辞書には第３文字以降の文字列に
ついてデータ検索及びデータ登録を行うように構成した
ので、データ検索及びデータ登録に係る処理時間を抑え
ることができる。Further, in the data compression circuit, data search and data registration are performed for the character string up to the second character in the initial dictionary, and data search and data registration are performed for the character string after the third character in the normal dictionary. The processing time required for data search and data registration can be reduced.

【０１１６】そして、データ圧縮回路において、初期辞
書及び通常辞書は完全ハッシュによるデータ検索及びデ
ータ登録を行うように構成したので、データ検索及びデ
ータ登録が速くなり、符号化処理をより速く行うことが
できる。In the data compression circuit, the initial dictionary and the normal dictionary are configured to perform data search and data registration by using a complete hash, so that data search and data registration become faster, and encoding processing can be performed faster. it can.

【０１１７】それから、第１の辞書検索手段、第１の辞
書登録手段及び符号化手段は、パイプラインにより並列
処理するので、高速に符号化処理を行うことができる。
そのうえ、データ復元回路において、復号化手段が入力
符号を復号化し、同時に第２の辞書検索手段が辞書から
復号化によって復元された文字列を検索し、第１の蓄積
手段及び第２の蓄積手段が復元された文字列を蓄積し、
出力選択手段が出力する復元文字列を選択し、第２の辞
書登録手段が未登録の復元文字列に新たな符号を付して
辞書に登録するので、復号化処理を速く行うことができ
る。Since the first dictionary search means, the first dictionary registration means, and the encoding means perform parallel processing by the pipeline, the encoding processing can be performed at high speed.
In addition, in the data restoration circuit, the decoding means decodes the input code, and at the same time, the second dictionary search means searches the dictionary for a character string restored by decoding, and the first storage means and the second storage means Accumulates the restored string,
Since the output character selecting unit selects the restored character string to be output, and the second dictionary registration unit adds a new code to the unregistered restored character string and registers it in the dictionary, the decoding process can be performed quickly.

【０１１８】また、復号化手段、第２の辞書検索手段、
出力選択手段及び第２の辞書登録手段は、パイプライン
により並列処理するので、高速に復号化処理を行うこと
ができる。Also, the decoding means, the second dictionary search means,
Since the output selection means and the second dictionary registration means perform the parallel processing by the pipeline, the decoding processing can be performed at high speed.

【０１１９】そして、データ圧縮・復元装置において、
データ圧縮回路では第１の辞書検索手段が初期辞書又は
通常辞書から、入力文字列と一致する最長の登録文字列
を検索し、同時に第１の辞書検索手段が最長の登録文字
列に一文字を加えた文字列に識別番号を付して通常辞書
に登録し、符号化手段が最長の登録文字列に付された識
別番号を出力符号に符号化する。また、データ復元回路
では復号化手段が入力符号を復号化し、同時に第２の辞
書検索手段が辞書から復号化によって復元された文字列
を検索し、第１の蓄積手段及び第２の蓄積手段が復元さ
れた文字列を蓄積し、出力選択手段が出力する復元文字
列を選択し、第２の辞書登録手段が未登録の復元文字列
に新たな符号を付して辞書に登録するので、符号化処理
及び復号化処理を速く行うことができる。In the data compression / decompression device,
In the data compression circuit, the first dictionary search means searches the initial dictionary or the normal dictionary for the longest registered character string that matches the input character string, and at the same time, the first dictionary search means adds one character to the longest registered character string. The assigned character string is assigned an identification number and registered in a normal dictionary, and the encoding means encodes the identification number assigned to the longest registered character string into an output code. In the data restoration circuit, the decoding means decodes the input code, and at the same time, the second dictionary search means searches the dictionary for a character string restored by decoding, and the first storage means and the second storage means The restored character string is stored, the output selection means selects the restored character string to be output, and the second dictionary registration means assigns a new code to the unregistered restored character string and registers it in the dictionary. The decryption process and the decryption process can be performed quickly.

【０１２０】したがって、各回路及び装置全体の処理速
度を向上させることができる。Therefore, the processing speed of each circuit and the entire device can be improved.

[Brief description of the drawings]

【図１】本発明のデータ圧縮回路の原理説明図である。FIG. 1 is a diagram illustrating the principle of a data compression circuit according to the present invention.

【図２】本発明のデータ圧縮回路による処理手順を示す
図である。FIG. 2 is a diagram showing a processing procedure by a data compression circuit of the present invention.

【図３】本発明のデータ復元回路の原理説明図である。FIG. 3 is a diagram illustrating the principle of a data restoration circuit according to the present invention.

【図４】本発明のデータ復元回路による処理手順を示す
図である。FIG. 4 is a diagram showing a processing procedure by the data restoration circuit of the present invention.

【図５】本発明のデータ圧縮・復元装置の原理説明図で
ある。FIG. 5 is a diagram illustrating the principle of a data compression / decompression device according to the present invention.

【図６】外部ハッシュ法のデータ構造を示す図である。FIG. 6 is a diagram showing a data structure of an external hash method.

【図７】外部ハッシュ法による辞書検索手順及び辞書登
録手順の一例を示す図である。FIG. 7 is a diagram showing an example of a dictionary search procedure and a dictionary registration procedure using the external hash method.

【図８】完全ハッシュ法による辞書検索手順及び辞書登
録手順の一例を示す図である。FIG. 8 is a diagram showing an example of a dictionary search procedure and a dictionary registration procedure using the complete hash method.

【図９】符号化の具体例を示す図である。FIG. 9 is a diagram illustrating a specific example of encoding.

【図１０】符号化の処理手順を示す図である。FIG. 10 is a diagram showing an encoding processing procedure.

【図１１】文字列と符号との対応関係図である。FIG. 11 is a diagram illustrating a correspondence relationship between a character string and a code.

【図１２】復号化の具体例を示す図である。FIG. 12 is a diagram illustrating a specific example of decoding.

【図１３】復号化の処理手順を示す図である。FIG. 13 is a diagram showing a decoding processing procedure.

【図１４】辞書の木構造の一例を示す図である。FIG. 14 is a diagram illustrating an example of a tree structure of a dictionary.

【図１５】従来のデータ圧縮回路を示す図である。FIG. 15 is a diagram showing a conventional data compression circuit.

【図１６】従来のデータ圧縮回路による処理手順を示す
図である。FIG. 16 is a diagram showing a processing procedure by a conventional data compression circuit.

【図１７】従来のデータ復元回路を示す図である。FIG. 17 is a diagram showing a conventional data restoration circuit.

【図１８】従来のデータ復元回路による処理手順を示す
図である。FIG. 18 is a diagram showing a processing procedure by a conventional data restoration circuit.

【符号の説明】１辞書検索手段２辞書登録手段３符号化手段４初期辞書５通常辞書１１復号化手段１２辞書検索手段１３第１の蓄積手段１４第２の蓄積手段１５出力選択手段１６辞書登録手段１７辞書[Description of Signs] 1 Dictionary search means 2 Dictionary registration means 3 Encoding means 4 Initial dictionary 5 Normal dictionary 11 Decoding means 12 Dictionary search means 13 First storage means 14 Second storage means 15 Output selection means 16 Dictionary registration Means 17 Dictionary

───────────────────────────────────────────────────── フロントページの続き (72)発明者千葉広隆神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (56)参考文献特開平３−247167（ＪＰ，Ａ) 特開昭62−263529（ＪＰ，Ａ) 特開平４−267630（ＪＰ，Ａ) 特開平４−129429（ＪＰ，Ａ) 特開平４−123619（ＪＰ，Ａ) 特開平４−76727（ＪＰ，Ａ) 特開平３−262331（ＪＰ，Ａ) 特開昭61−13340（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 5/00 H03M 7/30 - 7/46 ──────────────────────────────────────────────────続き Continuation of the front page (72) Inventor Hirotaka Chiba 1015 Kamikodanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Inside Fujitsu Limited (56) References JP-A-3-247167 (JP, A) JP-A-62-263529 (JP, A) JP-A-4-267630 (JP, A) JP-A-4-129429 (JP, A) JP-A-4-123619 (JP, A) JP-A-4-76727 (JP, A) Kaihei 3-262331 (JP, A) JP-A-61-13340 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G06F 5/00 H03M ^7/ 30-7/46

Claims

(57) [Claims]

A data compression circuit for compressing an input character string input from an information source by encoding the character string, an initial dictionary (D1) for registering the character string, and a character string registered in the initial dictionary. Included in the initial dictionary
Normal dictionary that registers character strings longer than
And 2), the initial dictionary (D1) or the normal of the registered character strings registered in the dictionary (D2), the first dictionary to find the longest registered character string that matches the inputted input character string search Means (1); initial dictionary access means (4) for accessing the initial dictionary (D1); normal dictionary access means (5) for accessing the normal dictionary (D2); and the longest registration A first dictionary registration unit (2) for registering a character string obtained by adding one character to a character string to the ordinary dictionary (D2) by adding an identification number; and an identification number assigned to the longest registered character string. has a coding means (3) for encoding an output code, the initial dictionary (D1) of the first dictionary search means (1)
Search and the first dictionary registration means (2)
A data compression circuit characterized in that registration in the dictionary (D2) is performed simultaneously .

2. The data compression circuit according to claim 1, wherein the initial dictionary (D1) and the normal dictionary (D2) are configured to perform data search and data registration using an external hash.

3. The data compression circuit according to claim 1, wherein the initial dictionary (D1) and the normal dictionary (D2) are configured to perform data search and data registration using a complete hash.

4. A data search and data registration for a character string up to a second character in the initial dictionary (D1), and a data search and data registration for a character string after a third character in the normal dictionary (D2). 2. The data compression circuit according to claim 1, wherein the data compression circuit is configured to perform the following.

5. The first dictionary search unit (1), the first dictionary registration unit (2), and the encoding unit (3)
5. The data compression circuit according to claim 1, wherein the data compression circuit is processed in parallel by a pipeline.

6. A decoding means (11) for decoding an input code inputted in a data restoration circuit for restoring an input code inputted from an information source by decoding.
A second dictionary search means (12) for searching a registered character string registered in the dictionary (D3) for a registered character string that matches the character string restored by the decoding; To the first storage means (13), to store the registered character string, and to the first storage means (13) or the second storage means (14). Output selecting means (15) for selecting any of the stored registered character strings from the stored registered character strings and outputting the selected registered character string as a restored character string; home,
A second dictionary registration unit (16) for searching for a registered character string that matches the restored character string, assigning a new code to the unregistered restored character string, and registering the restored character string in the dictionary (D3); And dictionary access means (17) for accessing the dictionary (D3).

7. The decoding means (11), the second dictionary search means (12), the output selection means (15) and the second dictionary registration means (16) are processed in parallel by a pipeline. 7. The data restoration circuit according to claim 6, wherein:

8. An initial dictionary (D1) for registering a character string in a data compression / decompression device that compresses and decompresses an input character string or an input code input from an information source by encoding or decoding. , containing the string and the initial dictionary registered in the initial dictionary
Normal dictionary that registers character strings longer than
And 2), the initial dictionary (D1) or the normal of the registered character strings registered in the dictionary (D2), the first dictionary to find the longest registered character string that matches the inputted input character string search Means (1), initial dictionary access means (4) for accessing said initial dictionary (D1), and said normal dictionary (D2)
A normal dictionary access means (5) for accessing the first dictionary, and a first dictionary registration means for registering a character string obtained by adding one character to the longest registered character string to the normal dictionary (D2) with an identification number (2) and encoding means (3) for encoding the identification number given to the longest registered character string into an output code, wherein the initial dictionary of the first dictionary search means (1)
(D1) search and the first dictionary registration means (2)
The registration to the normal dictionary (D2) is performed simultaneously.
Decoding means for decoding and the data compression circuit, the inputted input code (11)
And of the registered character strings registered in the dictionary (D3)
A second dictionary search unit (12) for searching for a registered character string that matches the character string restored by the decoding, a first storage unit (13) for storing the registered character string, Storage means (14) for accumulating the data, and the first storage means (13) or the second storage means (14)
Output selection means (15) for selecting one of the stored registered character strings from the registered character strings stored in the dictionary and outputting as a restored character string; and a registered character string registered in the dictionary (D3). A second dictionary registration unit (16) for searching for a registered character string that matches the restored character string, assigning a new code to the unregistered restored character string, and registering the restored character string in the dictionary (D3). And a data decompression circuit comprising: a dictionary access unit (17) for accessing the dictionary (D3).