JP3117760B2

JP3117760B2 - Data restoration method

Info

Publication number: JP3117760B2
Application number: JP03275292A
Authority: JP
Inventors: 泰彦中野; 茂吉田; 佳之岡田; 広隆千葉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-10-23
Filing date: 1991-10-23
Publication date: 2000-12-18
Anticipated expiration: 2015-12-18
Also published as: JPH05113998A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は圧縮されたデータを復号
化するデータ復元方式に関し、特にユニバーサル符号化
の一種である増分分解型の改良としてのＬＺＷ符号によ
る圧縮されたデータのデータ復元方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data restoring method for decoding compressed data, and more particularly to a data restoring method for LZW-compressed data as an improvement of an incremental decomposition type which is a kind of universal encoding. .

【０００２】近年、文字コード、ベクトル情報、画像デ
ータなど各種のデータがコンピュータで扱われるように
なっており、扱われるデータ量も急速に増加してきてい
る。大量のデータを扱うときは、データの中の冗長な部
分を省いてデータ量を圧縮することで、記憶容量を減ら
したり、速く伝送したりすることが望まれる。In recent years, various types of data such as character codes, vector information, and image data have been handled by computers, and the amount of data handled has rapidly increased. When dealing with a large amount of data, it is desirable to reduce the storage capacity or to transmit data at high speed by omitting redundant portions in the data and compressing the data amount.

【０００３】このような様々なデータを１つの方式でデ
ータ圧縮する方法をしてユニバーサル符号化が提案され
ている。ユニバーサル符号化の代表的な方法として、Zi
v-Lempel符号を用いた方法が知られている（例えば、宗
像清治著、「Ziv-Lempelデータ圧縮法」、情報処理、p
p.2〜6 , Vol.26,No.1,1985を参照のこと）。Ziv-Lempe
l符号では、ユニバーサル型と、増分分解型（Increment
al persing ）の２つのアルゴリズムが提案されてい
る。[0003] Universal encoding has been proposed as a method of compressing such various data by one method. Zi is a typical universal coding method.
Methods using v-Lempel codes are known (for example, Seiji Munakata, "Ziv-Lempel Data Compression Method", Information Processing, p
p.2-6, Vol.26, No.1,1985). Ziv-Lempe
For the l code, the universal type and the incremental decomposition type (Increment
al persing) have been proposed.

【０００４】本発明は増分分解型アルゴリズムを改良し
たＬＺＷ(Lempel-Ziv-Welch)符号によって圧縮されたデ
ータの復号化をさらに改善したものに関わるものであ
る。なお、「文字（Character ）」及び「文字列（Char
acter String）」の定義はＪＩＳ−Ｃ６２３０に従うほ
か、情報理論で用いられている呼称を踏襲し、１ワード
単位で構成されるデータを「文字」と呼び、任意のワー
ド単位で構成されるデータを「文字列」と呼んで、本発
明を圧縮されたデータの復号化に関わるものとして説明
する。The present invention relates to a further improved decoding of data compressed by an LZW (Lempel-Ziv-Welch) code in which an incremental decomposition type algorithm is improved. In addition, "Character (Character)" and "character string (Char
In addition to the definition of "acter String", the data conforms to JIS-C6230 and follows the name used in information theory. Data composed of one word is called "character", and data composed of an arbitrary word is defined. The present invention will be described as a "character string" as it relates to the decoding of compressed data.

【０００５】[0005]

【従来の技術】従来のＬＺＷ符号の符号化アルゴリズム
によれば、入力文字列を互いに異なる部分文字列に分け
て符号化し、これらの部分文字列と符号との対応を辞書
に登録するとともに、入力文字列を、辞書中に登録され
た部分文字列と比較し、入力文字列と一致する部分文字
列の内、最大長の部分文字列を選択し、選択された部分
文字列に対応する符号によって入力文字列を符号化し、
これによってデータを圧縮することが行われている。2. Description of the Related Art According to a conventional LZW code encoding algorithm, an input character string is divided into different partial character strings and encoded, and the correspondence between these partial character strings and codes is registered in a dictionary and the input character string is input. The character string is compared with the partial character string registered in the dictionary, and the partial character string having the maximum length is selected from the partial character strings that match the input character string, and the code corresponding to the selected partial character string is used. Encode the input string,
Thus, data is compressed.

【０００６】図７は、例えばａ，ｂ，ｃの３文字だけの
組み合わせからなる入力文字列７１を、ＬＺＷ符号の符
号化アルゴリズムによって符号化する場合の具体例を示
す図である。予め辞書に、初期化によって、１文字の
ａ，ｂ，ｃだけをそれぞれ符号１，２，３に対応づけて
登録しておく。FIG. 7 is a diagram showing a specific example in which an input character string 71 composed of, for example, a combination of only three characters a, b, and c is encoded by an LZW encoding algorithm. In the dictionary, only one character a, b, and c are registered in association with the codes 1, 2, and 3, respectively, by initialization.

【０００７】まず、入力文字列７１を左から右へ一字ず
つ読み込む。最初の文字ａを読んで、このａを語頭文字
（列）(prefix string) とする。次に、２番目の文字ｂ
を読み、先の語頭文字ａにこのｂを加えたａｂを辞書の
登録文字列と照合する。辞書にはａｂに一致する文字列
がないから、このときは辞書を参照し、先の語頭文字ａ
の対応符号１を符号化出力として出力する（出力符号欄
７２）とともに、文字列ａｂを符号４に対応させて辞書
に登録する（登録内容欄７３）。そして今度は２番目の
入力文字ｂを語頭文字とする。First, the input character string 71 is read one character at a time from left to right. Read the first letter a, and let this a be a prefix string. Next, the second character b
Is read, and ab obtained by adding the b to the initial letter a is compared with the registered character string in the dictionary. Since there is no character string that matches ab in the dictionary, refer to the dictionary at this time and
The corresponding code 1 is output as an encoded output (output code field 72), and the character string ab is registered in the dictionary in association with the code 4 (registration content field 73). Then, the second input character b is set as the initial character.

【０００８】次に、入力文字列７１の３番目の文字ａを
読み、語頭文字ｂにこのａを加えたｂａを辞書の登録文
字列と照合する。辞書にはｂａに一致する文字列がない
から、このときも辞書から語頭文字ｂの対応符号２を符
号化出力として出力するとともに、文字列ｂａを符号５
に対応させて辞書に登録する。そして今度は３番目の入
力文字ａを語頭文字とする。Next, the third character a of the input character string 71 is read, and ba obtained by adding the letter a to the initial character b is compared with the registered character string in the dictionary. Since there is no character string matching ba in the dictionary, the corresponding code 2 of the initial character b is output from the dictionary as an encoded output and the character string ba is
And register it in the dictionary. Then, the third input character a is set as the initial character.

【０００９】さらに、４番目の文字ｂを読み、語頭文字
ａにこのｂを加えたａｂを辞書の登録文字列と照合す
る。辞書にはａｂに一致する文字列があるから、このと
きはａｂを語頭文字列とし、続けて、５番目の入力文字
ｃを読み、語頭文字列ａｂにこのｃを加えたａｂｃを辞
書の登録文字列と照合する。辞書にはａｂｃに一致する
文字列がないから、このときは辞書から語頭文字列ａｂ
の対応符号４を符号化出力として出力するとともに、文
字列ａｂｃを符号６に対応させて辞書に登録する。そし
て、今度はｃを語頭文字とする。Further, the fourth character b is read, and ab obtained by adding the b to the initial letter a is compared with the registered character string in the dictionary. In the dictionary, there is a character string that matches ab. In this case, ab is used as the initial character string, then the fifth input character c is read, and abc obtained by adding this c to the initial character string ab is written in the dictionary. Matches with the registered character string of. Since there is no character string that matches abc in the dictionary, the initial character string ab
Is output as an encoded output, and the character string abc is registered in the dictionary in correspondence with the code 6. Then, let c be the initial letter.

【００１０】以下、同様にこうしたアルゴリズムにより
符号化と辞書登録を続ける。このようにして入力文字列
ａ，ｂ，ａ，ｂ，ｃ・・・に対して符号化が行われ、図
７の出力符号欄７２に示すような符号１，２，４，３・
・・が符号化出力として出力される。そして、図９
（Ａ）に示すような登録文字列９１と対応符号９２との
対応関係が辞書に登録される。[0010] Similarly, encoding and dictionary registration are continued by such an algorithm. In this manner, encoding is performed on the input character strings a, b, a, b, c,..., And codes 1, 2, 4, 3,.
Is output as an encoded output. And FIG.
The correspondence between the registered character string 91 and the corresponding code 92 as shown in (A) is registered in the dictionary.

【００１１】図１０は、以上に例示したＬＺＷ符号の符
号化の処理手順を示すフローチャートである。図中、Ｓ
に続く数字はステップ番号を示す。〔Ｓ１０１〕予め初期化によって、入力される可能性の
ある全一文字に対しそれぞれ符号を対応させて辞書に登
録する。また、辞書において次に登録すべきアドレスｎ
を例えば２５６に設定する（図７の例では４）。ｎは、
辞書に登録される文字列に対応して符号を０，１，２・
・・と付した場合、登録文字列の総数に相当する。さら
に、入力文字列を読み込み、入力した最初の文字を語頭
文字列（prefix string ）ωとする。FIG. 10 is a flowchart showing a processing procedure for encoding the LZW code exemplified above. In the figure, S
The number following "" indicates a step number. [S101] By initialization, a code is registered in the dictionary in such a manner that a code is associated with all the characters that may be input. The address n to be registered next in the dictionary
Is set to, for example, 256 (4 in the example of FIG. 7). n is
Codes 0, 1, 2,... Corresponding to the character strings registered in the dictionary
-When it is added, it corresponds to the total number of registered character strings. Further, the input character string is read, and the first character input is set as a prefix string ω.

【００１２】〔Ｓ１０２〕次の入力文字Ｋを読み込む。〔Ｓ１０３〕ステップＳ１０２において入力文字データ
が存在したか否かを判別する。存在すればステップＳ１
０５へ進み、存在しなければステップＳ１０４へ進む。[S102] The next input character K is read. [S103] In step S102, it is determined whether or not the input character data exists. Step S1 if present
Go to step S05, if not, go to step S104.

【００１３】〔Ｓ１０４〕語頭文字列ωを辞書と照合
し、対応する符号code（ω）を読み出し、符号化出力と
して出力する。このときの符号code（ω）のビット数
は、log₂ｎ以上の最小の整数に相当する。このステップ
では文字列が入力しないから、本ステップを実行後、本
処理手順を終了する。[S104] The initial character string ω is collated with the dictionary, the corresponding code code (ω) is read, and output as an encoded output. The number of bits of the code code (ω) at this time corresponds to the smallest integer equal to or greater than log ₂ n. In this step, since no character string is input, the present processing procedure is terminated after executing this step.

【００１４】〔Ｓ１０５〕語頭文字列ωに、ステップＳ
１０２で読み込んだ文字Ｋを加えた文字列ωＫを辞書と
照合し、文字列ωＫが辞書に登録されているか否かを判
別する。登録されていればステップＳ１０６に進み、登
録されていなければステップＳ１０７に進む。[S105] Step S is added to the initial character string ω.
The character string ωK to which the character K read in 102 is added is collated with the dictionary, and it is determined whether or not the character string ωK is registered in the dictionary. If registered, the process proceeds to step S106. If not registered, the process proceeds to step S107.

【００１５】〔Ｓ１０６〕文字列ωＫを改めて語頭文字
列ωとする。そして、再びステップＳ１０２に戻る。ス
テップＳ１０２乃至ステップＳ１０６を繰り返すことに
より、入力文字列と一致する文字列として、辞書に登録
された文字列のうちの最大長の文字列が探索される。[S106] The character string ωK is newly set as the initial character string ω. Then, the process returns to step S102 again. By repeating steps S102 to S106, a character string having the maximum length among the character strings registered in the dictionary is searched for as a character string that matches the input character string.

【００１６】〔Ｓ１０７〕語頭文字列ωを辞書と照合
し、対応する符号code（ω）を読み出し、符号化出力と
して出力する。このときの符号code（ω）のビット数
は、log₂ｎ以上の最小の整数に相当する。また、文字列
ωＫにｎの値を対応させて辞書に登録する（実際には辞
書のアドレスｎに文字列ωＫを記憶する）。さらに、ス
テップＳ１０２で読み込んだ文字Ｋを語頭文字列ωとす
るとともに、辞書アドレスｎをインクリメントして、つ
ぎの新たな入力文字列に対するステップＳ１０２以降の
実行に備える。[S107] The initial character string ω is collated with the dictionary, the corresponding code code (ω) is read, and output as an encoded output. The number of bits of the code code (ω) at this time corresponds to the smallest integer equal to or greater than log ₂ n. Further, the value of n is associated with the character string ωK and registered in the dictionary (actually, the character string ωK is stored at the address n of the dictionary). Further, the character K read in step S102 is set as the initial character string ω, and the dictionary address n is incremented to prepare for execution of the next new input character string from step S102.

【００１７】図８は、図７に例示された符号化出力を今
度は復号化する場合の具体例を示す図である。予め復号
化側の辞書に、初期化によって、符号１，２，３だけが
それぞれ文字ａ，ｂ，ｃに対応づけられて登録されてい
る。FIG. 8 is a diagram showing a specific example in which the encoded output shown in FIG. 7 is decoded this time. In the dictionary on the decoding side, only codes 1, 2, and 3 are registered in association with characters a, b, and c, respectively, by initialization.

【００１８】まず、入力符号８１を左から右へ一つずつ
読み込む。最初の符号１を読み、辞書を参照して文字列
ａを復元する（復元文字列欄８２１）。最初の符号は、
初期化によって必ず辞書に登録されている。次に、２番
目の符号２を読み、辞書を参照して文字列ｂを復元す
る。このとき、前回の入力符号１と今回復号した文字列
の最初の一文字ｂ（今回の復号文字列は単に文字ｂのみ
であるが）とを組み合わせた（１ｂ）に符号４を対応さ
せて辞書に登録する（登録内容欄８３）。First, input codes 81 are read one by one from left to right. The first code 1 is read, and the character string a is restored with reference to the dictionary (restored character string column 821). The first sign is
It is always registered in the dictionary by initialization. Next, the second code 2 is read, and the character string b is restored with reference to the dictionary. At this time, code 4 is associated with (1b), which is a combination of the previous input code 1 and the first character b of the currently decoded character string (although the current decoded character string is simply character b), and stores it in the dictionary. Register (registered content column 83).

【００１９】次に、入力文字列８１の３番目の符号４を
読み、辞書を参照して対応する１ｂを読み出し、さらに
１ｂの符号１を、辞書を参照して対応する文字ａを読み
出す（この一連の読み出し繰り返し動作を「再帰的な復
号化」と呼ぶ。再帰的復号化欄８２を参照）。これによ
って、文字列ａｂを復号化された文字列として出力する
（復元文字列欄８３１）。同時に、前回の入力符号２と
今回復号した文字列の最初の一文字ａとを組み合わせた
（２ａ）に符号５を対応させて辞書に登録する（登録内
容欄８３）。Next, the third code 4 of the input character string 81 is read, the corresponding 1b is read with reference to the dictionary, and the code 1 of 1b is read with reference to the dictionary, and the corresponding character a is read with reference to the dictionary. A series of read repetition operations is referred to as "recursive decoding" (see recursive decoding column 82). As a result, the character string ab is output as a decrypted character string (restored character string column 831). At the same time, the combination of the previous input code 2 and the first character a of the currently decoded character string (2a) is registered in the dictionary in association with the code 5 (registered content column 83).

【００２０】以下、同様にこうしたアルゴリズムにより
復号化と辞書登録を続ける。このようにして入力符号
１，２，４，３，５・・・に対して復号化が行われ、図
８の復元文字列欄８２１に示すような文字列ａ，ｂ，ａ
ｂ，ｃ，ｂａ・・・が復号化出力として出力される。そ
して、図９（Ｂ）に示すような登録符号９３と対応文字
列９４との対応関係が辞書に登録される。Hereinafter, decoding and dictionary registration are continued by using such an algorithm. In this manner, the input codes 1, 2, 4, 3, 5,... Are decoded, and the character strings a, b, a shown in the restored character string column 821 in FIG.
are output as decoded outputs. Then, the correspondence between the registration code 93 and the corresponding character string 94 as shown in FIG. 9B is registered in the dictionary.

【００２１】図１１は、以上に例示した復号化の処理手
順を示すフローチャートである。図中、Ｓに続く数字は
ステップ番号を示す。〔Ｓ１１１〕予め初期化によって、入力される可能性の
ある符号に対しそれぞれ文字を対応させて辞書に登録す
る。また、辞書において次に登録すべきアドレスｎを例
えば２５６に設定する（図８の例では４）。ｎは、辞書
に登録される文字列に対応して符号を０，１，２・・・
と付した場合、登録文字列の総数に相当する。次に、入
力符号を読み込み、最初の入力符号CODE（バイナリコー
ド）を１０進数の入力符号ωに変換する（図１０の符号
化ではωは入力文字列であったが、復号化ではωは入力
符号である点に注意されたい）。このωをOLDωとする
とともに、最初に入力する符号は既に辞書に登録されて
いるから、入力符号ωに対応する文字Ｄ（ω）を辞書か
ら探し出し、復号化された文字として出力する。なお、
出力した文字を後述のステップＳ１１６の例外処理のた
めにFINchar にセットしておく。FIG. 11 is a flowchart showing the procedure of the decoding process exemplified above. In the figure, numbers following S indicate step numbers. [S111] Characters are registered in the dictionary in advance by associating characters with codes which may be input by initialization. Further, the address n to be registered next in the dictionary is set to, for example, 256 (4 in the example of FIG. 8). n is a code 0, 1, 2,... corresponding to a character string registered in the dictionary.
Is equivalent to the total number of registered character strings. Next, the input code is read, and the first input code CODE (binary code) is converted into a decimal input code ω (ω was an input character string in the encoding of FIG. Note that it is a sign). This ω is set to OLDω, and since the code to be input first is already registered in the dictionary, a character D (ω) corresponding to the input code ω is searched from the dictionary and output as a decoded character. In addition,
The output character is set in FINchar for exception processing in step S116 described later.

【００２２】〔Ｓ１１２〕次の入力符号CODEを読み込
む。〔Ｓ１１３〕ステップＳ１１２において入力符号データ
が存在したか否かを判別する。存在すればステップＳ１
１５へ進み、存在しなければ本処理手順を終了する。[S112] The next input code CODE is read. [S113] In step S112, it is determined whether or not the input code data exists. Step S1 if present
The process proceeds to step 15, and if not present, this processing procedure ends.

【００２３】〔Ｓ１１４〕読み込んだ入力符号CODEから
入力符号ωに変換するとともに、この入力符号ωをINω
にセットする。〔Ｓ１１５〕入力符号ωをｎと比較する。このステップ
は、入力符号が辞書に登録されているか否か（ω≧ｎ）
を判別するものである。ωがｎより小さい場合が通常で
あり、このときにはステップＳ１１７へ進み、ωがｎ以
上のときには（図８の入力符号欄８１が「８」のときが
これに相当する）ステップＳ１１６へ進む。[S114] The input code CODE is converted into the input code ω and the input code ω is converted to INω
Set to. [S115] The input code ω is compared with n. This step determines whether the input code is registered in the dictionary (ω ≧ n)
Is to be determined. It is normal that ω is smaller than n. At this time, the process proceeds to step S117, and when ω is equal to or more than n (the case where the input code column 81 in FIG. 8 is “8” corresponds to this), the process proceeds to step S116.

【００２４】〔Ｓ１１６〕ステップＳ１１１または前回
にステップＳ１１９で設定されたOLD ωおよびFINchar
の組（OLD ω，FINchar ）をωＫと置き換える。すなわ
ち、OLD ωにセットされた値をωに、FINchar にセット
された値をＫにセットする。そして、Ｋをスタックにプ
ッシュ（ＰＵＳＨ）する。ωはステップＳ１１７で復号
化される。（図８の入力符号欄８１が「８」のとき、OL
D ω（ω）は５であり、FINchar （Ｋ）はｂである。）〔Ｓ１１７〕通常、入力符号ωは前回までの処理で辞書
に登録されているため、入力符号ωに対応する文字列Ｄ
（ω）を辞書から読み出す。読み出した文字列Ｄ（ω）
をω_iＫに分解する。ω_iは符号、Ｋは復号化文字であ
る。そして、文字列Ｄ（ω）が、ω_iＫに分解できない
１文字であるか否かを判別する。Ｄ（ω）がω_iＫに分
解できるならばステップＳ１１８に進み、Ｄ（ω）が１
文字であるならばステップＳ１１９へ進む。[S116] OLD ω and FINchar set in step S111 or last time in step S119
(OLD ω, FINchar) is replaced with ωK. That is, the value set to OLD ω is set to ω, and the value set to FINchar is set to K. Then, K is pushed on the stack (PUSH). ω is decoded in step S117. (When the input code field 81 in FIG. 8 is "8", OL
Dω (ω) is 5 and FINchar (K) is b. [S117] Normally, since the input code ω has been registered in the dictionary in the previous processing, the character string D corresponding to the input code ω
(Ω) is read from the dictionary. Character string D (ω) read
Is decomposed into ω _i K. ω _i is a sign and K is a decoded character. Then, it is determined whether or not the character string D (ω) is one character that cannot be decomposed into ω _i K. If D (ω) can be decomposed into ω _i K, the process proceeds to step S118, where D (ω) is 1
If it is a character, the process proceeds to step S119.

【００２５】〔Ｓ１１８〕文字Ｋを一時的にスタックに
プッシュし、また符号ω_iを新たなωとし、再度ステッ
プＳ１１７に戻る。このステップＳ１１７およびステッ
プＳ１１８の実行を、Ｄ（ω）が１文字に至るまで繰り
返す。ステップＳ１１７およびステップＳ１１８の処理
を前述のように「再帰的復号化」と呼ぶ。[0025] [S118] to push the character K to temporarily stack, also the sign ω _i as a new ω, returns to step S117 again. The execution of steps S117 and S118 is repeated until D (ω) reaches one character. The processing in steps S117 and S118 is referred to as “recursive decoding” as described above.

【００２６】〔Ｓ１１９〕ステップＳ１１８でスタック
にプッシュした各文字をＬＩＦＯ（Last In Fast Out）
形式でポップ（ＰＯＰ）して復元文字列を出力する。例
えば、図８の入力符号欄８１が「５」の場合ならば、
ａ，ｂの順でスタックにプッシュされ、ｂａという復元
文字列が出力される。同時に、今回復元した文字列の最
初の一文字をFINchar とし、前回セットされた OLDωと
FINchar との組（ OLDω，FINchar ）からなる文字列
を、ｎの値に対応させて辞書に登録する（実際には、こ
の文字列を辞書のアドレスｎに記憶する）。さらに、ｎ
をインクリメントし、ステップＳ１１４でセットされた
INωをOLD ωにセットして、次のステップＳ１１２以降
の実行に備える。[S119] Each character pushed on the stack in step S118 is transferred to the LIFO (Last In Fast Out).
Pop (POP) in the format and output the restored character string. For example, if the input code field 81 in FIG. 8 is “5”,
The data is pushed onto the stack in the order of a and b, and a restored character string ba is output. At the same time, the first character of the restored string is set to FINchar, and the previously set OLDω and
A character string composed of a pair with FINchar (OLDω, FINchar) is registered in the dictionary in correspondence with the value of n (actually, this character string is stored at the address n of the dictionary). Furthermore, n
Is incremented and set in step S114.
INω is set to OLDω to prepare for the execution of the next step S112 and subsequent steps.

【００２７】上述のように従来の復号化では、図１１の
ステップＳ１１７〜ステップＳ１１９を繰り返し行うこ
とによって符号化前のデータに復元していた。すなわ
ち、入力符号ωは前回までの処理で辞書に登録されてい
るため、入力符号ωに対応する文字列Ｄ（ω）を辞書か
ら読み出す。また、読み出した文字列Ｄ（ω）をω_iＫ
に分解し、この文字Ｋを一時的にスタックに退避させ
る。そして、符号ω_iを新たな入力符号ωとして、再度
入力符号ωに対応する文字列Ｄ（ω）を辞書から読み出
す。これらの手順を、新たな入力符号ωが一文字になる
まで再帰的に繰り返す。そして、スタックに退避させた
文字をＬＩＦＯ形式でポップして出力するという方式で
あった。As described above, in the conventional decoding, the data before encoding is restored by repeatedly performing steps S117 to S119 in FIG. That is, since the input code ω has been registered in the dictionary in the previous processing, the character string D (ω) corresponding to the input code ω is read from the dictionary. Further, the read character string D (ω) is represented by ω _i K
And the character K is temporarily evacuated to the stack. Then, the character string D (ω) corresponding to the input code ω is read from the dictionary again, using the code ω _i as a new input code ω. These procedures are recursively repeated until the new input code ω becomes one character. Then, the character saved in the stack is popped and output in the LIFO format.

【００２８】[0028]

【発明が解決しようとする課題】しかし、このような方
式では、何度も復元される同一の文字列でも、上述のよ
うに再帰的な処理をその都度行わなければならない。し
たがって、辞書からの読み出し、一時的なスタックへの
退避及び退避させた文字のポップといった一連の処理が
必要であり、この処理のために無駄な時間が費やされる
という問題点があった。However, in such a system, as described above, recursive processing must be performed each time even for the same character string restored many times. Therefore, a series of processes such as reading from the dictionary, temporarily saving to the stack, and popping the saved characters are required, and there is a problem that useless time is spent for this process.

【００２９】本発明はこのような点に鑑みてなされたも
のであり、一度再帰的な復号化処理によって復元された
文字列を有効に利用し、復号化処理時間を短縮するデー
タ復元方式を提供することを目的とする。The present invention has been made in view of the above points, and provides a data restoration method which effectively utilizes a character string once restored by recursive decoding processing and shortens decoding processing time. The purpose is to do.

【００３０】[0030]

【課題を解決するための手段】本発明では上記目的を達
成するために、図１に示すように、入力符号を第１の辞
書を用いて再帰的に復号化する第１の復号化手段２と、
前回入力符号と今回復号化された文字列の第１文字とを
組にし、この組に新たな符号を対応させて第１の辞書に
登録する第１の辞書登録手段１とを備え、入力符号に基
づき復号化された文字列を、入力符号とともに第２の辞
書に登録する第２の辞書登録手段４と、第２の復号化手
段３とを備える。According to the present invention, in order to achieve the above object, as shown in FIG. 1, a first decoding means 2 for recursively decoding an input code using a first dictionary. When,
A first dictionary registering means for pairing a previously input code and a first character of a currently decoded character string with a new code corresponding to the set and registering the new code in a first dictionary; A second dictionary registration unit 4 and a second decoding unit 3 for registering a character string decoded on the basis of.

【００３１】第２の復号化手段３は、検索手段３ａと文
字列出力手段３ｂとを有する。すなわち、検索手段３ａ
は第１の復号化手段２に基づき復号化を行う前に、入力
符号を第２の辞書から検索する。また、文字列出力手段
３ｂは検索手段３ａにより入力符号が検出された場合、
第２の辞書から入力符号に対応する復号化された文字列
を出力する。The second decoding means 3 has a search means 3a and a character string output means 3b. That is, the search means 3a
Searches for an input code from the second dictionary before performing decoding based on the first decoding means 2. When the input code is detected by the search means 3a, the character string output means 3b
And outputting a decoded character string corresponding to the input code from the second dictionary.

【００３２】また、第１の辞書に登録されている入力符
号を参照した頻度を表す参照頻度をさらに設け、第２の
辞書登録手段４は新たに新規入力符号が入力されるごと
に、この新規入力符号に対応する参照頻度をカウントす
るカウント手段を有し、文字列出力手段３ｂは、参照頻
度が所定値を超えた場合は、第２の辞書から入力符号に
対応する復号化された文字列を出力する選択手段を有す
る。Further, a reference frequency indicating the frequency of referring to the input code registered in the first dictionary is further provided, and the second dictionary registration means 4 receives the new input code every time a new input code is input. The character string output unit 3b includes a counting unit that counts a reference frequency corresponding to the input code, and the character string output unit 3b, when the reference frequency exceeds a predetermined value, decodes the decoded character string corresponding to the input code from the second dictionary Is provided.

【００３３】さらに、検索手段３ａは入力符号を第２の
辞書から一度だけ検索し、第１の辞書から検索する。そ
して、第２の辞書登録手段４、検索手段３ａ及び文字列
出力手段３ｂは、いずれもハッシュ関数に基づいて、第
２の辞書とアクセスする。Further, the search means 3a searches for the input code only once from the second dictionary and searches for the input code from the first dictionary. Then, the second dictionary registration unit 4, the search unit 3a, and the character string output unit 3b all access the second dictionary based on the hash function.

【００３４】それから、第２の辞書登録手段４は、入力
符号に対応する文字列が第２の辞書に既に存在し、か
つ、第２の辞書に登録されている文字列よりも今回復号
化された文字列のほうが長い文字列である場合、今回復
号化された文字列を第２の辞書に登録する。Then, the second dictionary registering means 4 determines that the character string corresponding to the input code already exists in the second dictionary and that the character string registered in the second dictionary is decoded this time. If the extracted character string is a longer character string, the currently decoded character string is registered in the second dictionary.

【００３５】[0035]

【作用】第１の復号化手段２は、入力符号を第１の辞書
を用いて再帰的に復号化する。第１の辞書登録手段１
は、前回入力符号と今回復号化された文字列の第１文字
とを組にし、この組に新たな符号を対応させて第１の辞
書に登録する。The first decoding means 2 recursively decodes the input code using the first dictionary. First dictionary registration means 1
Sets a previously input code and the first character of the currently decoded character string as a set, and associates the set with a new code and registers it in the first dictionary.

【００３６】一方、第２の辞書登録手段４は、入力符号
に基づき復号化された最終的に文字列を、入力符号とと
もに第２の辞書に登録する。第２の復号化手段３は、検
索手段３ａと文字列出力手段３ｂとを有し、第１の復号
化手段２による復号化前に入力符号に対応する文字列が
第２の辞書に登録されている場合、この文字列を出力す
る。すなわち、検索手段３ａは第１の復号化手段２に基
づき復号化を行う前に、入力符号を第２の辞書から検索
する。そして、文字列出力手段３ｂは検索手段３ａによ
り入力符号が検出された場合、第２の辞書から入力符号
に対応する復号化された文字列を出力する。On the other hand, the second dictionary registration means 4 registers the finally decoded character string based on the input code in the second dictionary together with the input code. The second decoding unit 3 includes a search unit 3a and a character string output unit 3b. Before decoding by the first decoding unit 2, a character string corresponding to an input code is registered in a second dictionary. If this is the case, output this string. That is, the search unit 3a searches the second dictionary for an input code before performing decoding based on the first decoding unit 2. Then, when the input code is detected by the search means 3a, the character string output means 3b outputs a decoded character string corresponding to the input code from the second dictionary.

【００３７】また、第１の辞書に登録されている入力符
号を参照した頻度を表す参照頻度をさらに設け、第２の
辞書登録手段４は新たに新規入力符号が入力されるごと
に、この新規入力符号に対応する参照頻度をカウントす
るカウント手段を設ける。文字列出力手段３ｂは、参照
頻度が所定値を超えた場合は、第２の辞書から入力符号
に対応する復号化された文字列を出力する選択手段を設
けて、第２の辞書の検索を最小限度に抑える。Further, a reference frequency indicating the frequency of referring to the input code registered in the first dictionary is further provided, and the second dictionary registering means 4 receives this new input code every time a new input code is input. Counting means is provided for counting the reference frequency corresponding to the input code. The character string output means 3b is provided with a selection means for outputting a decoded character string corresponding to the input code from the second dictionary when the reference frequency exceeds a predetermined value. Minimize.

【００３８】さらに、検索手段３ａは、入力符号を第２
の辞書から一度だけ検索し、その後に第１の辞書から検
索するようにして、第１の辞書に蓄積された文字列を有
効に活用する。Further, the search means 3a sets the input code to the second
Is searched only once, and then the first dictionary is searched, thereby effectively utilizing the character strings stored in the first dictionary.

【００３９】そして、第２の辞書登録手段４、検索手段
３ａ及び文字列出力手段３ｂに、いずれもハッシュ関数
に基づいて、第２の辞書とアクセスするようにして、第
２の辞書への登録及び検索の処理速度を短縮する。Then, the second dictionary registration unit 4, the retrieval unit 3a, and the character string output unit 3b access the second dictionary based on the hash function, and register the second dictionary. And reduce the processing speed of the search.

【００４０】それから、第２の辞書登録手段４は、入力
符号に対応する文字列が第２の辞書に既に存在し、か
つ、第２の辞書に登録されている文字列よりも今回復号
化された文字列のほうが長い文字列である場合、今回復
号化された文字列を第２の辞書に登録するようにして、
復号化の効率を上げる。Then, the second dictionary registration means 4 determines that the character string corresponding to the input code already exists in the second dictionary, and that the character string registered in the second dictionary is decoded this time. If the extracted character string is a longer character string, the newly decoded character string is registered in the second dictionary,
Increase decoding efficiency.

【００４１】[0041]

【実施例】以下、本発明の一実施例を図面に基づいて説
明する。図２は復号化の第１の実施例の処理手順を示す
フローチャートである。図において、図１１に示したス
テップと同一の内容を持つステップには図１１のステッ
プ番号と同じ番号を付し、その説明も省略する。ただ
し、図１１の説明にある辞書は全て第１の辞書とする。
なお、Ｓに続く数字はステップ番号を示す。An embodiment of the present invention will be described below with reference to the drawings. FIG. 2 is a flowchart showing the processing procedure of the first embodiment of the decoding. In the figure, steps having the same contents as the steps shown in FIG. 11 are assigned the same step numbers as those in FIG. 11, and the description thereof is omitted. However, all the dictionaries described in FIG. 11 are the first dictionaries.
The number following S indicates a step number.

【００４２】〔Ｓ２１〕ハッシュ関数に基づいて入力符
号ωに対応するハッシュアドレスＨ（ω）を求め、第２
の辞書に入力符号ωが登録されているか否かを判別す
る。もし、入力符号ωが登録されている（ＹＥＳ）なら
ばステップＳ２２に進み、入力符号ωが登録されていな
い（ＮＯ）ならばステップＳ１１７に進む。なお、ハッ
シュアドレスＨ（ω）は入力符号ωを第２の辞書の最大
登録可能数で割った余りで演算される。[S21] A hash address H (ω) corresponding to the input code ω is obtained based on the hash function.
Is determined whether or not the input code ω is registered in the dictionary. If the input code ω is registered (YES), the process proceeds to step S22. If the input code ω is not registered (NO), the process proceeds to step S117. Note that the hash address H (ω) is calculated by the remainder obtained by dividing the input code ω by the maximum registrable number of the second dictionary.

【００４３】〔Ｓ２２〕入力符号ωに対応する復元文字
列を第２の辞書から読み出す。すなわち、ステップＳ２
１で求めたハッシュアドレスＨ（ω）に登録されている
登録文字列を読み出す。読み出し後、ステップＳ２４に
進む。[S22] The restored character string corresponding to the input code ω is read from the second dictionary. That is, step S2
The registered character string registered at the hash address H (ω) obtained in step 1 is read. After the reading, the process proceeds to step S24.

【００４４】〔Ｓ２３〕ステップＳ１１８でスタックに
プッシュした各文字をＬＩＦＯ形式でポップして復元文
字列を出力する。また、第２の辞書上のステップＳ２１
で求めたハッシュアドレスＨ（ω）に、この復元文字列
を入力符号ωとともに、第２の辞書に登録する。すなわ
ち、後述する図６において、入力符号ωは符号code（Ｈ
（ω））として、復元文字列は str（Ｈ（ω））として
登録される。[S23] Each character pushed on the stack in step S118 is popped in the LIFO format to output a restored character string. Step S21 on the second dictionary
The restored character string is registered in the second dictionary together with the input code ω in the hash address H (ω) obtained in (1). That is, in FIG. 6 described later, the input code ω is the code code (H
(Ω)), the restored character string is registered as str (H (ω)).

【００４５】〔Ｓ２４〕今回復元した文字列の最初の一
文字をFINcharとし、前回セットされた OLDωとFINchar
との組（ OLDω，FINchar ）からなる文字列を、ｎの
値に対応させて第１の辞書に登録する（実際には、この
文字列を第１の辞書のアドレスｎに登録する）。さら
に、ｎをインクリメントし、ステップＳ１１４でセット
されたINωをOLD ωにセットして、次のステップＳ１１
２以降の実行に備える。[S24] The first character of the character string restored this time is set to FINchar, and OLDω and FINchar set last time are set.
Is registered in the first dictionary corresponding to the value of n (actually, this character string is registered at the address n of the first dictionary). Further, n is incremented, and INω set in step S114 is set to OLDω, and the next step S11
Prepare for execution after 2.

【００４６】ただし、図１１ではステップＳ１１８の次
のステップがステップＳ１１７であるのに対して、本第
１の実施例ではステップＳ１１８の次のステップはステ
ップＳ２１である。これにより、逐次的にハッシュ関数
に基づき第２の辞書にアクセスされ、従来の復号化と比
べて速く得ることができる。However, in FIG. 11, the step following step S118 is step S117, whereas in the first embodiment, the step following step S118 is step S21. Thereby, the second dictionary is sequentially accessed based on the hash function, and can be obtained faster than in the conventional decoding.

【００４７】図３は復号化の第２の実施例の処理手順を
示すフローチャートである。図において、図２及び図１
１に示したステップと同一の内容を持つステップには図
２及び図１１のステップ番号と同じ番号を付し、その説
明も省略する。ただし、図１１の説明にある辞書は全て
第１の辞書とする。なお、Ｓに続く数字はステップ番号
を示す。FIG. 3 is a flowchart showing a processing procedure of the second embodiment of the decoding. In the figures, FIG. 2 and FIG.
Steps having the same contents as the steps shown in FIG. 1 are denoted by the same step numbers as in FIG. 2 and FIG. 11, and the description thereof is omitted. However, all the dictionaries described in FIG. 11 are the first dictionaries. The number following S indicates a step number.

【００４８】図２の第１の実施例と異なるのは、図２で
はステップＳ１１８の次のステップがステップＳ２１で
あるのに対して、本第２の実施例ではステップＳ１１８
の次のステップはステップＳ１１７である。これによ
り、第２の辞書へのアクセスは最初の入力符号ωに対し
てのみ行われる。このステップＳ２１に要する処理時間
は第２の辞書の登録数が増加するごとに長くなる。した
がって、ステップＳ２１を排除することによって復元化
の処理時間が抑えられ、第１の実施例と比べて速く復元
文字列を得ることができる。The difference from the first embodiment shown in FIG. 2 is that the step following step S118 is step S21 in FIG. 2, whereas step S118 is executed in the second embodiment.
Is a step S117. Thus, access to the second dictionary is performed only for the first input code ω. The processing time required for step S21 increases as the number of registered second dictionaries increases. Therefore, by eliminating step S21, the restoration processing time can be reduced, and a restored character string can be obtained faster than in the first embodiment.

【００４９】図４は復号化の第３の実施例の処理手順を
示すフローチャートである。図において、図３及び図１
１に示したステップと同一の内容を持つステップには図
３及び図１１のステップ番号と同じ番号を付し、その説
明も省略する。ただし、図１１の説明にある辞書は全て
第１の辞書とする。なお、Ｓに続く数字はステップ番号
を示す。FIG. 4 is a flowchart showing the processing procedure of the third embodiment of the decoding. In the figures, FIG. 3 and FIG.
Steps having the same contents as the steps shown in FIG. 1 are denoted by the same step numbers as in FIG. 3 and FIG. 11, and the description thereof is omitted. However, all the dictionaries described in FIG. 11 are the first dictionaries. The number following S indicates a step number.

【００５０】〔Ｓ４１〕ステップＳ１１８でスタックに
プッシュした各文字をＬＩＦＯ形式でポップして復元文
字列を出力する。そして、文字出力後はステップＳ４２
に進む。[S41] Each character pushed on the stack in step S118 is popped in the LIFO format to output a restored character string. Then, after outputting the characters, step S42 is executed.
Proceed to.

【００５１】〔Ｓ４２〕２種の判別を行う。すなわち、
一つは入力符号ωと、後述する図６に示す第２の辞書の
内容において、ステップＳ２１で求めたハッシュアドレ
スＨ（ω）に登録されている参照番号の符号code（Ｈ
（ω））とが同じか否かの判別である。もう一つは、今
回復元した文字列の長さが、同じく入力符号ωに対応す
るハッシュアドレスＨ（ω）に登録されている登録文字
列str（Ｈ（ω））の長さよりも長いか否かの判別であ
る。もし、入力符号ωと符号code（Ｈ（ω））が同じで
なく、かつ、今回復元した文字列の長さが第２の辞書に
登録されている登録文字列 str（Ｈ（ω））の長さより
も長い（ＹＥＳ）ならばステップＳ４３に進み、短い
（ＮＯ）ならばステップＳ２４に進む。[S42] Two types of discrimination are performed. That is,
One is the input code ω and the code code (H of the reference number registered in the hash address H (ω) obtained in step S21 in the content of the second dictionary shown in FIG.
(Ω)) is the same or not. The other is whether the length of the character string restored this time is longer than the length of the registered character string str (H (ω)) registered at the hash address H (ω) also corresponding to the input code ω. It is a judgment of whether or not. If the input code ω and the code code (H (ω)) are not the same, and the length of the character string restored this time is the registered character string str (H (ω)) registered in the second dictionary. If it is longer (YES), the process proceeds to step S43, and if shorter (NO), the process proceeds to step S24.

【００５２】〔Ｓ４３〕ステップＳ２１で求めたハッシ
ュアドレスＨ（ω）に、今回復元した文字列を復元文字
列として入力符号ωとともに第２の辞書に登録する。す
なわち、後述する図６において、入力符号ωは符号code
（Ｈ（ω））にとして、復元文字列は str（Ｈ（ω））
として登録される。辞書登録後、ステップＳ２４に進
む。[S43] At the hash address H (ω) obtained in step S21, the character string restored this time is registered in the second dictionary together with the input code ω as a restored character string. That is, in FIG. 6 described later, the input code ω is a code code
(H (ω)), the restored character string is str (H (ω))
Registered as After the dictionary is registered, the process proceeds to step S24.

【００５３】したがって、ハッシュ関数に基づき第２の
辞書への登録する場合、第２の辞書内においては同一の
登録箇所に、より長い文字列を登録することになるた
め、効率よく復元文字列を得ることができる。Therefore, when registering in the second dictionary based on the hash function, a longer character string is registered in the same registration location in the second dictionary. Obtainable.

【００５４】図５は復号化の第４の実施例の処理手順を
示すフローチャートである。図において、図２及び図１
１に示したステップと同一の内容を持つステップには図
２及び図１１のステップ番号と同じ番号を付し、その説
明も省略する。ただし、図１１の説明にある辞書は全て
第１の辞書とする。なお、Ｓに続く数字はステップ番号
を示す。そして、第１の辞書には、既に登録されている
入力符号ωを参照した頻度を表す参照頻度ｃ（ω）をさ
らに設ける。FIG. 5 is a flowchart showing a processing procedure of the fourth embodiment of the decoding. In the figures, FIG. 2 and FIG.
Steps having the same contents as the steps shown in FIG. 1 are denoted by the same step numbers as in FIG. 2 and FIG. 11, and the description thereof is omitted. However, all the dictionaries described in FIG. 11 are the first dictionaries. The number following S indicates a step number. Then, the first dictionary is further provided with a reference frequency c (ω) representing the frequency of referring to the already registered input code ω.

【００５５】〔Ｓ５１〕第１の辞書から入力符号ωに対
応する参照頻度ｃ（ω）を読み出し、所定値Ｔより大き
いか否か判別する。もし、参照頻度ｃ（ω）が所定値Ｔ
より大きい（ＹＥＳ）ならばステップＳ２１に進み、参
照頻度ｃ（ω）が所定値Ｔより小さい（ＮＯ）ならばス
テップＳ１１７に進む。[S51] The reference frequency c (ω) corresponding to the input code ω is read from the first dictionary, and it is determined whether or not it is larger than a predetermined value T. If the reference frequency c (ω) is a predetermined value T
If it is larger (YES), the process proceeds to step S21, and if the reference frequency c (ω) is smaller than the predetermined value T (NO), the process proceeds to step S117.

【００５６】〔Ｓ５２〕入力符号ωに対応する参照頻度
ｃ（ω）をインクリメントする。その後、ステップＳ５
１に戻る。ただし、図１１ではステップＳ１１８の次の
ステップがステップＳ１１７であるのに対して、本第４
の実施例ではステップＳ１１８の次のステップはステッ
プＳ５２である。したがって、参照頻度の高い入力符号
ωのみに対応して第２の辞書にアクセスされるため、効
率よく第２の辞書とアクセスすることができ、復号化処
理も速くなる。[S52] The reference frequency c (ω) corresponding to the input code ω is incremented. Then, step S5
Return to 1. However, in FIG. 11, the step following step S118 is step S117.
In the embodiment, the step following step S118 is step S52. Therefore, since the second dictionary is accessed in correspondence only with the input code ω having a high reference frequency, the second dictionary can be accessed efficiently and the decoding process can be performed quickly.

【００５７】図６は第２の辞書の内容の一例を示す図で
ある。第２の辞書は、ハッシュアドレスＨ（ω）に基づ
いて、参照番号および登録文字列から構成される。参照
番号は入力符号ωに相当し、登録文字列は復号化処理に
よって復号された文字列が登録される。なお、ハッシュ
アドレスＨ（ω）は入力符号ωを辞書最大登録可能数
（図６では１０００）で割った余りで求められる。FIG. 6 is a diagram showing an example of the contents of the second dictionary. The second dictionary includes a reference number and a registered character string based on the hash address H (ω). The reference number corresponds to the input code ω, and the registered character string is a character string decoded by the decoding process. Note that the hash address H (ω) is obtained by the remainder obtained by dividing the input code ω by the maximum number of dictionaries that can be registered (1000 in FIG. 6).

【００５８】上記の説明では、第２の辞書はハッシュ関
数に基づき構築したが、他の構築法に基づき構築しても
よい。例えば、二進分木（バイナリ・ツリー）法によっ
て第２の辞書を構築し、二進探索（バイナリ・サーチ）
により復元文字列の探索を行なってもよい。In the above description, the second dictionary is constructed based on the hash function, but may be constructed based on another construction method. For example, a second dictionary is constructed by a binary tree method, and a binary search is performed.
May be used to search for a restored character string.

【００５９】上述の各実施例は、ワークステーション等
における文字コード、ベクトル情報、画像データなどを
圧縮したデータの復元に応用され、記憶容量を大幅に削
減することができる。また、通信回線を利用したデータ
送受信（例えば、モデム、ファックス等）においても応
用でき、通信時間の短縮を図ることができる。Each of the above embodiments is applied to the restoration of data obtained by compressing character codes, vector information, image data, and the like in a workstation or the like, and the storage capacity can be greatly reduced. Further, the present invention can be applied to data transmission / reception (for example, a modem, a facsimile, etc.) using a communication line, and the communication time can be reduced.

【００６０】[0060]

【発明の効果】以上説明したように本発明では、第２の
復号化手段の検索手段によって入力符号を第２の辞書か
ら検索し、検出されたときは文字列出力手段が第２の辞
書から入力符号に対応する復号化文字列を出力し、検出
されない場合は第１の復号化手段が入力符号を第１の辞
書を用いて再帰的に復号化して出力し、その後第２の辞
書登録手段がこの復号化された文字列に所定の処理を施
して第２の辞書に登録して次の復号化処理に備えるよう
に構成したので、入力符号に対応する復元文字列を大幅
に速く得ることができる。As described above, according to the present invention, the input code is searched from the second dictionary by the search means of the second decoding means, and if it is detected, the character string output means is searched from the second dictionary. A decoded character string corresponding to the input code is output, and if not detected, the first decoding means recursively decodes and outputs the input code using the first dictionary, and then outputs the second dictionary registration means Is configured to apply a predetermined process to this decoded character string, register it in the second dictionary, and prepare for the next decoding process, so that the restored character string corresponding to the input code can be obtained much faster. Can be.

【００６１】また、第２の辞書に最終的な復元文字列を
登録するので、再帰的な復号化処理をその都度行うこと
なく復元文字列を得ることができる。しかも、第２の辞
書の登録をハッシュ関数に基づき行うので、さらに速く
復元文字列を得ることができる。Since the final restored character string is registered in the second dictionary, a restored character string can be obtained without performing recursive decoding processing each time. Moreover, since the registration of the second dictionary is performed based on the hash function, the restored character string can be obtained even faster.

[Brief description of the drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】第１の復号化の処理手順を示す図である。FIG. 2 is a diagram showing a procedure of a first decoding process.

【図３】第２の復号化の処理手順を示す図である。FIG. 3 is a diagram showing a procedure of a second decoding process.

【図４】第３の復号化の処理手順を示す図である。FIG. 4 is a diagram showing a procedure of a third decoding process.

【図５】第４の復号化の処理手順を示す図である。FIG. 5 is a diagram showing a fourth decryption processing procedure.

【図６】第２の辞書の内容の一例を示す図である。FIG. 6 is a diagram showing an example of the contents of a second dictionary.

【図７】ＬＺＷ符号化の具体例を示す図である。FIG. 7 is a diagram illustrating a specific example of LZW encoding.

【図８】復号化の具体例を示す図である。FIG. 8 is a diagram illustrating a specific example of decoding.

【図９】文字列と符号との対応関係を示す図である。FIG. 9 is a diagram showing a correspondence relationship between a character string and a code.

【図１０】従来の符号化の処理手順を示す図である。FIG. 10 is a diagram showing a conventional encoding processing procedure.

【図１１】従来の復号化の処理手順を示す図である。FIG. 11 is a diagram showing a conventional decoding processing procedure.

[Explanation of symbols]

１第１の辞書登録手段２第１の復号化手段３第１の復号化手段３ａ検索手段３ｂ文字列出力手段４第２の辞書登録手段 DESCRIPTION OF SYMBOLS 1 1st dictionary registration means 2 1st decoding means 3 1st decoding means 3a search means 3b character string output means 4 2nd dictionary registration means

───────────────────────────────────────────────────── フロントページの続き (72)発明者千葉広隆神奈川県川崎市中原区上小田中1015番地富士通株式会社内 (56)参考文献特開昭60−116228（ＪＰ，Ａ) 特開平３−68219（ＪＰ，Ａ) 特開平３−179520（ＪＰ，Ａ) 特開平３−204233（ＪＰ，Ａ) 特開平３−204235（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 7/40 ────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Hirotaka Chiba 1015 Kamiodanaka, Nakahara-ku, Kawasaki City, Kanagawa Prefecture Inside Fujitsu Limited (56) References JP-A-60-116228 (JP, A) JP-A-3-68219 (JP, A) JP-A-3-179520 (JP, A) JP-A-3-204233 (JP, A) JP-A-3-204235 (JP, A) (58) Fields investigated (Int. Cl. ⁷⁾ , DB name) H03M 7/40

Claims

(57) [Claims]

A first decoding means (2) for recursively decoding an input code using a first dictionary, and combining a previously input code with a first character of a currently decoded character string. A first dictionary registering means (1) for registering a new code in the first dictionary in association with the set, and in a data restoration method for decoding the input code, A second dictionary registration unit (4) for registering the decoded character string together with the input code in a second dictionary; and before performing decoding based on the first decoding unit (2), Search means (3a) for searching for an input code from the second dictionary; and when the input code is detected by the search means (3a), the decoded code corresponding to the input code is detected from the second dictionary. Character string output means (3b) for outputting a character string Data recovery method, characterized in that it comprises a Goka means (3), the.

2. The method according to claim 1, wherein the first dictionary further includes a reference frequency indicating a frequency of referring to the registered input code, and the second dictionary registration unit (4) determines whether a newly input code is newly input. The character string output means (3b) includes a counting means for counting the reference frequency corresponding to the new input code every time the reference frequency is input. 2. The data restoration method according to claim 1, further comprising selecting means for outputting the decoded character string corresponding to the input code from the dictionary of (1).

3. The data restoration method according to claim 1, wherein said search means (3a) searches said input code only once from said second dictionary and searches said first code from said first dictionary.

4. The second dictionary registration unit (4), the search unit (3a), and the character string output unit (3b) all access the second dictionary based on a hash function. The data restoration method according to claim 1, 2, or 3, wherein

5. The method according to claim 1, wherein the second dictionary registration unit (4) is configured to determine whether the character string corresponding to the input code already exists in the second dictionary and the character string registered in the second dictionary. 5. The data restoration method according to claim 4, wherein when the character string decoded this time is longer than the character string decoded this time, the character string decoded this time is registered in the second dictionary.