JPH0723238A

JPH0723238A - Picture data compression and decoding device

Info

Publication number: JPH0723238A
Application number: JP5153094A
Authority: JP
Inventors: Yoshiyuki Okada; 佳之岡田; Yasuhiko Nakano; 泰彦中野; Shigeru Yoshida; 茂吉田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1993-06-24
Filing date: 1993-06-24
Publication date: 1995-01-24

Abstract

PURPOSE:To enhance the compression ratio of universal coding by selecting an exclusive dictionary corresponding to the difference from statistic property depending on a line pitch and a dot period of a dot picture with respect to the picture data compression and decoding device applying universal coding to 2-dimension picture data such as a character and dots after they are converted into a pattern run length code. CONSTITUTION:A space dictionary 16-1 and a non-space dictionary 16-2 are provided as dictionaries used for universal coding and whether or not plural lines of conversion objects to intermediate data are space is discriminated by a dictionary selection means 14 by line pitch information of a character picture and one of the dictionaries 16-1, 16-2 is selected and a pattern run length code by an intermediate coding means 10 is coded by a universal coding means 12. A dictionary is selected similarly by a decoder. Moreover, a specific dictionary is selected from the dot picture according to its dot period.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字、網点などの１次
元ビットストリームの画像データをユニバーサル符号に
符号化して圧縮すると共に符号データを復号する画像デ
ータ圧縮及び復元装置に関し、特に、１次元ビットスト
リームをパターンとランレングスの中間データに変換し
た後にユニバーサル符号化する画像データ圧縮及び復元
装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image data compression and decompression device for encoding and compressing image data of a one-dimensional bit stream such as characters and halftone dots into a universal code and decoding the code data, and more particularly to 1. BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image data compression / decompression device that universally encodes a three-dimensional bitstream after converting it into intermediate data of a pattern and run length.

【０００２】近年、オフィスオートメーションが発展
し、文書が白黒２値の画像情報としてファクシミリや光
ディスクファイル・システムなどで扱われるようになっ
ている。文書情報をディジタルデータとして利用すると
き、画像情報のデータ量は、文字画像に比べ非常に大き
く１０数〜数１０倍になる。また、最近は、画像の品位
を向上させるため、ファクシミリにおいては、従来のＧ
３機の約２００ｄｐｉから、次のＧ４機では３００ｄｐ
ｉ，４００ｄｐｉへと解像度が上がり、データ量は増加
する方向にある。したがって、蓄積や伝送等で画像情報
を効率良く扱うには、効率的なデータ圧縮を加えること
でデータ量を減らすことが必須となる。In recent years, office automation has been developed, and a document has been handled as a binary image information in a facsimile or an optical disk file system. When the document information is used as digital data, the data amount of the image information is much larger than that of the character image and is ten to several tens of times. Further, recently, in order to improve the image quality, in the facsimile, the conventional G
About 200 dpi for three aircraft, 300 dpi for the next G4 aircraft
The resolution increases to i, 400 dpi, and the amount of data tends to increase. Therefore, in order to handle image information efficiently for storage and transmission, it is essential to reduce the data amount by adding efficient data compression.

【０００３】[0003]

【従来の技術】従来、白黒２値画像の圧縮方式として、
画像の二次元的性質を捉えた中間データへの変換処理を
行った後にユニバーサル符号化する方式が既に提案され
ており、この方法によれば従来の代表的な２つの方式で
あるＭＭＲ方式と予測符号化方式に匹敵する圧縮率が得
られる。2. Description of the Related Art Conventionally, as a compression method for a monochrome binary image,
A universal encoding method has been already proposed after performing conversion processing to intermediate data that captures the two-dimensional characteristics of an image. According to this method, two typical conventional methods, the MMR method and the prediction method, have been proposed. A compression ratio comparable to the encoding scheme is obtained.

【０００４】図２４は従来のデータ圧縮装置を示し、パ
ターン・ランレングス変換部１０と、ＬＺＷ符号化部１
２で構成される。また図２５は従来の画像データ復元装
置を示したもので、ＬＺＷ復号化部１８とパターン・ラ
ンレングス逆変換部２０で構成される。図２６は画像デ
ータとパターン・ランレングス符号の変換を、白黒画像
データを例にとって示す。画像データは通常、１次元の
ビットストリームとして入力される。この画像データは
実際の画像に対応した所定ビット長のラインデータに図
２６（ａ）に示すように分割する。この２次元の原画デ
ータについて、４ラインを対象に、図２６（ｂ）のよう
に白黒パターンの種類と同一パターンが連続する数、即
ちランレングス情報との組で表現される中間データに変
換する。このように変換されたパターン・ランレングス
符号を対象に、例えばＬＺＷ符号に変換するユニバーサ
ル符号化を適用する。FIG. 24 shows a conventional data compression apparatus, which includes a pattern run length conversion unit 10 and an LZW encoding unit 1.
It consists of 2. FIG. 25 shows a conventional image data restoration device, which is composed of an LZW decoding unit 18 and a pattern run length inverse conversion unit 20. FIG. 26 shows conversion of image data and pattern run-length code by taking monochrome image data as an example. Image data is usually input as a one-dimensional bit stream. This image data is divided into line data having a predetermined bit length corresponding to the actual image as shown in FIG. This two-dimensional original image data is converted into intermediate data represented by a combination of the number of consecutive black-and-white patterns and the same pattern, that is, run length information, as shown in FIG. . For the pattern run-length code converted in this way, universal coding for converting to, for example, an LZW code is applied.

【０００５】符号データの復元は、ＬＺＡ符号からパタ
ーン・ランレングス符号を復号した後に、図２６に示す
逆変換を行って元の原画データを復元し、最終的に１次
元のビットストリームとして出力する。この方法は、２
値画像の２次元的情報を前処理によって取り入れ、その
結果をユニバーサル符号化するため、データの種類に依
存せず、かつ簡単な回路で効率よくデータを圧縮するこ
とができる。To restore the code data, after decoding the pattern run length code from the LZA code, the inverse original transform shown in FIG. 26 is performed to restore the original original image data and finally output as a one-dimensional bit stream. . This method is 2
Since the two-dimensional information of the value image is taken in by preprocessing and the result is universally encoded, the data can be efficiently compressed by a simple circuit without depending on the type of the data.

【０００６】即ち、入力した画像を複数ラインのパター
ンとランレングスの中間データに変換し、中間データか
らパターンとランレングスの連続性等の統計的性質をユ
ニバーサル符号化の手法により学習しながら符号の最良
化を図り、種々の性質の画像において効率の良い圧縮を
行うようにするものである。また、元来ユニバーサル符
号は、情報保存型のデータ圧縮方法であり、データ圧縮
時に情報源の統計的な性質を予め仮定しないため、文字
コード、オブジェクトコードなどの種々のデータに適用
することができる。文書画像では、文字の輪郭等や文字
間隔には類似性がある。また、網点画像は網点周期性、
網点形状の同一性等が類似している。この類似性のもつ
冗長性をユニバーサル符号化により削減し、有効な圧縮
を行うことができる。That is, an input image is converted into intermediate data of a pattern and a run length of a plurality of lines, and the statistical properties such as continuity of the pattern and the run length are learned from the intermediate data by a universal coding method. The optimization is performed so that the images having various properties are efficiently compressed. Further, the universal code is originally an information storage type data compression method, and since the statistical property of the information source is not assumed in advance at the time of data compression, it can be applied to various data such as character codes and object codes. . In a document image, there are similarities in character outlines and character spacing. Also, the halftone dot image has halftone dot periodicity,
The dot shapes are similar in identity. The redundancy with this similarity can be reduced by universal coding, and effective compression can be performed.

【０００７】ここでは、ユニバーサル符号化の一つであ
るＬＺＷ符号を取り上げる（T.A.Welch,"A Technique f
or High-Performance Data Compression",Computer,Jun
e1984 参照）。ＬＺＷ符号では、次のシンボルを次の部
分列に組み込むようにしてインデックスのみで符号化で
きるようにしている。図２７にＬＺＷ符号における辞書
の木構成を示し、図２８にＬＺＷ符号における文字列の
符号化を示す。また、ＬＺＷ符号の符号化と復号化の詳
細なアルゴリズムを図２９と図３０に示す。ＬＺＷ符号
化は、書き替え可能な辞書をもち、入力文字コード、デ
ータ中を相異なる文字列に分け、この文字列を出現した
順に番号（インデックスｉ）を付けて辞書に登録すると
ともに、現在入力している文字列を辞書に登録してある
最長一致文字列の番号で表して、符号化するものであ
る。Here, the LZW code which is one of the universal coding is taken up (TAWelch, "A Technique f
or High-Performance Data Compression ", Computer, Jun
e1984). In the LZW code, the next symbol is incorporated in the next subsequence so that the next symbol can be encoded only by the index. FIG. 27 shows the tree structure of the dictionary in the LZW code, and FIG. 28 shows the encoding of the character string in the LZW code. Further, detailed algorithms of encoding and decoding of LZW code are shown in FIGS. 29 and 30. The LZW encoding has a rewritable dictionary, divides the input character code and data into different character strings, adds the number (index i) to the character string in the order of appearance, and registers it in the dictionary. The encoded character string is represented by the number of the longest matching character string registered in the dictionary and is encoded.

【０００８】図２９のＬＺＷ符号化処理では、まずステ
ップＳ１で予め全文字につき一文字からなる文字列を初
期値として登録してから符号化を始める。ステップＳ２
では入力した最初の文字Ｋを辞書検索する参照番号ωと
し、これを語頭文字列（prefix string ）とする。次に
ステップＳ３で入力データの次の文字Ｋを読み込み、ス
テップＳ４ではステップＳ２で求めた語頭文字列ωにス
テップＳ３で読み込んだ文字Ｋを加えた文字列（ωＫ）
が現在の辞書にあるか否か検索する。In the LZW encoding process of FIG. 29, first, in step S1, a character string consisting of one character for all characters is registered in advance as an initial value, and then encoding is started. Step S2
Then, the input first character K is used as a reference number ω for dictionary search, and this is used as a prefix string. Next, in step S3, the next character K of the input data is read, and in step S4, the character string (ωK) obtained by adding the character K read in step S3 to the initial character string ω obtained in step S2.
Is searched for in the current dictionary.

【０００９】ステップＳ４で文字列（ωＫ）が辞書にあ
れば、ステップＳ５で文字列（ωＫ）を参照番号ωに置
き換え、ステップＳ５で入力データが終了かどうかを判
断した後、再びステップＳ３に戻って文字列（ωＫ）が
辞書から探せなくなるまで最大一致長の検索を続ける。
次にステップＳ４で文字列（ωＫ）が辞書になければ、
ステップＳ７に進んでステップＳ２で求めた文字Ｋの参
照番号ωを符号語ｃｏｄｅ（ω）として出力する。If the character string (ωK) is found in the dictionary in step S4, the character string (ωK) is replaced with the reference number ω in step S5, and it is determined in step S5 whether or not the input data has ended. Returning to this, the search for the maximum matching length is continued until the character string (ωK) cannot be searched from the dictionary.
Next, in step S4, if the character string (ωK) is not in the dictionary,
Proceeding to step S7, the reference number ω of the character K obtained in step S2 is output as the code word code (ω).

【００１０】また文字列（ωＫ）に新たな参照番号を付
加して辞書に登録し、さらにステップＳ２の入力文字Ｋ
を参照番号ωに置き換えるとともに、辞書アドレスＮを
インクリメントして、ステップＳ５のチェックを受けた
後、ステップＳ２に戻って次の文字Ｋを読み込む。図３
１、図３２を参照して符号化を具体的に説明すると次の
ようになる。尚、説明を簡単にするためａｂｃの３文字
の組合せからなるデータを圧縮する場合を取上げてい
る。まず図３１の入力データは左から右へ読み込む。最
初の文字ａを入力したとき、辞書にはａの他に一致する
文字列がないので、出力符号（参照番号ω）を符号語と
して出力する。A new reference number is added to the character string (ωK) and registered in the dictionary, and the input character K in step S2 is added.
Is replaced with a reference number ω, the dictionary address N is incremented, and after the check in step S5, the process returns to step S2 to read the next character K. Figure 3
1, the encoding will be described in detail with reference to FIG. Note that, for simplification of description, the case where data consisting of a combination of three letters abc is compressed is taken up. First, the input data of FIG. 31 is read from left to right. When the first character a is input, the output code (reference number ω) is output as a code word because there is no matching character string in the dictionary other than a.

【００１１】そして、拡張した文字列ａｂに参照番号４
を付けて辞書に登録する。実際の登録は文字列（１ｂ）
の形となる。続いて２番目のｂが文字列の先頭になる。
辞書にはｂの他に一致する文字列がないので、参照番号
２を符号語として出力し、拡張した文字列ｂａを実際に
は２ａの形で参照番号５をつけて辞書に登録する。３番
目のａが次の文字列の先頭になる。以下、同様にこの処
理を続ける。The reference character 4 is added to the expanded character string ab.
And add it to the dictionary. The actual registration is a character string (1b)
Will be in the form of. Then, the second b becomes the beginning of the character string.
Since there is no matching character string other than b in the dictionary, reference number 2 is output as a code word, and the expanded character string ba is actually added to reference number 5 in the form of 2a and registered in the dictionary. The third a is the beginning of the next character string. Hereinafter, this processing is similarly continued.

【００１２】図３０の復号化処理は図２９の符号化の逆
の操作を行う。まずステップＳ１において符号化と同様
に予め辞書に全文字につき一文字からなる文字列を初期
値として登録してから復号を始める。まずステップＳ２
で最初の符号（参照番号）を読み込み、現在のＣＯＤＥ
をＯＬＤｃｏｄｅとし、最初の符号は既に辞書に登録さ
れた一文字の参照番号いずれかに該当することから、入
力符号ＣＯＤＥに一致する文字ｃｏｄｅ（Ｋ）を探し出
し、文字Ｋを出力する。なお、出力した文字（Ｋ）は後
の例外処理のためｃｈａｒにセットしておく。The decoding process of FIG. 30 performs the reverse operation of the encoding of FIG. First, in step S1, similarly to the encoding, a character string consisting of one character for every character is registered in the dictionary in advance as an initial value, and then decoding is started. First, step S2
The first code (reference number) is read with and the current CODE
Is the OLD code, and the first code corresponds to any one-character reference number already registered in the dictionary, so the character code (K) that matches the input code CODE is searched for and the character K is output. The output character (K) is set in char for exception processing later.

【００１３】次にステップＳ３に進んで次の符号を読み
込んでＣＯＤＥにＮＥＷｃｏｄｅとしてセットする。次
にステップＳ４に進み、ステップＳ３で入力された符号
ＣＯＤＥが辞書に定義（登録）されているか否かチェッ
クする。通常、入力した符号語は前回までの処理で辞書
に登録されているため、ステップＳ５に進んで符号ＣＯ
ＤＥに対応する文字列ｃｏｄｅ（ωＫ）を辞書から読み
出し、ステップＳ６で文字列Ｋを一時的にスタックし、
参照番号ｃｏｄｅ（ω）を新たなＣＯＤＥとして再度ス
テップＳ５に戻し、このステップＳ５，ステップＳ６の
手順を再帰的に参照番号ωが一文字にいたるまで繰り返
す。Next, in step S3, the next code is read and set in CODE as NEWcode. Next, in step S4, it is checked whether the code CODE input in step S3 is defined (registered) in the dictionary. Normally, since the input codeword is registered in the dictionary by the processing up to the previous time, the process proceeds to step S5 and the code CO
The character string code (ωK) corresponding to DE is read from the dictionary, and the character string K is temporarily stacked in step S6.
The reference number code (ω) is returned to step S5 as a new CODE, and the procedure of steps S5 and S6 is recursively repeated until the reference number ω reaches one character.

【００１４】最後にステップＳ７に進んでステップＳ６
でスタックした文字をＬＩＬＯ（Last In Fast Out）形
式でポップアップして出力する。同時にステップＳ７に
おいて、前回使った符号ωと今回復元した文字列の最初
の一文字Ｋを組（ω，Ｋ）と表した文字列に、新たな参
照番号を付加して辞書に登録する。なお、ステップＳ４
において登録されていない符号（符号化において直前の
参照番号を参照する場合に起きる）場合、ステップＳ９
にて、ＯＬＤｃｏｄｅをＣＯＤＥに、ｃｏｄｅ（ＯＬＤ
ｃｏｄｅ，ｃｈａｒ）をＮＥＷｃｏｄｅに戻した後にス
テップＳ５へ進むようにする。Finally, the process proceeds to step S7 and step S6.
The characters stacked at are popped up and output in the LILO (Last In Fast Out) format. At the same time, in step S7, a new reference number is added to the character string representing the previously used code ω and the first character K of the character string restored this time as a set (ω, K) and registered in the dictionary. Note that step S4
If the code is not registered in (which occurs when the immediately preceding reference number is referred to in encoding), step S9
, OLDcode to CODE, code (OLD
After returning (code, char) to NEWcode, the process proceeds to step S5.

【００１５】図３３を参照して復号化処理を具体的に説
明すると次のようになる。尚、説明を簡単にするためａ
ｂｃの３文字の組合せからなるデータを復元する場合を
取上げている。まず最初の入力文字は１であり、一文字
ａ，ｂ，ｃについては既に参照番号１，２，３として図
３２に示すように辞書に登録されているため、辞書の参
照により符号１に一致する参照番号の文字列ａに置き換
えて出力する。The decoding process will be specifically described with reference to FIG. 33 as follows. In addition, in order to simplify the explanation a
The case of restoring data consisting of a combination of three characters bc is described. First, the first input character is 1, and since the characters a, b, and c are already registered in the dictionary as reference numbers 1, 2, and 3 as shown in FIG. 32, the reference to the dictionary matches the code 1. The character string a of the reference number is replaced and output.

【００１６】次の符号２についても同様にして文字ｂに
置き換えて出力する。このとき前回処理した符号と今回
復号した最初の一文字ｂとを組み合わせた（１ｂ）に新
たな参照番号４を付加した辞書に登録する。３番目の符
号４は辞書の探索により１ｂからａｂと置き換えて文字
列ａｂを出力する。同様に前回処理した符号２と今回復
号した文字列の１番目の文字ａとの組合せた文字列２ａ
（＝ｂａ）を新たな参照番号５を付加して辞書に登録す
る。以下同様に、この処理を繰り返す。Similarly, the following code 2 is replaced with the character b and output. At this time, the code processed previously is combined with the first character b decoded this time (1b) and is added to the dictionary with a new reference number 4. The third code 4 replaces 1b with ab by searching the dictionary and outputs the character string ab. Similarly, a character string 2a obtained by combining the previously processed code 2 and the first character a of the character string decoded this time
A new reference number 5 is added to (= ba) and registered in the dictionary. Similarly, this process is repeated thereafter.

【００１７】ただし、図３３の復号化では次の例外処理
がある。この例外処理は、第６番目の入力符号８の復号
で生ずる。符号８は復号時に辞書に定義されておらず、
復号できない。この場合には、前回処理した符号５に前
回復号した文字列ｂａの最初の一文字ｂを加えた文字列
５ｂを求め、さらに２ａｂ，ｂａｂと置き換えられて出
力される。そして、文字列の出力語に前回の符号語５に
今回復号した文字列の文字ｂを加えた文字列５ｂに参照
番号８を付加して辞書に登録する。However, there is the following exception processing in the decoding of FIG. This exception processing occurs in the decoding of the sixth input code 8. Code 8 is not defined in the dictionary at the time of decoding,
I can't decrypt. In this case, the character string 5b obtained by adding the first character b of the previously decoded character string ba to the code 5 processed last time is obtained and further replaced with 2ab and bab and output. Then, the reference number 8 is added to the character string 5b obtained by adding the character b of the character string decoded this time to the previous code word 5 to the output word of the character string and registered in the dictionary.

【００１８】この例外処理は図３０の復号化処理フロー
のステップＳ４，ステップＳ９の処理を通じて行われ、
最終的にステップＳ７で文字列の出力と新たな文字列に
参照番号を付加した辞書への登録が行われる。尚、図２
９、図３０の符号化処理と復号化処理は、同じ辞書を作
り出しながら行う。This exception processing is performed through the processing of steps S4 and S9 in the decoding processing flow of FIG.
Finally, in step S7, the character string is output and the new character string is registered in the dictionary with the reference number added. Incidentally, FIG.
9. The encoding process and the decoding process in FIG. 30 are performed while creating the same dictionary.

【００１９】[0019]

【発明が解決しようとする課題】しかしながら、画像デ
ータをパターン・ランレングス符号に変換した後にユニ
バーサル符号に変換し、またユニバーサル符号から復号
したパターン・ランレングス符号を元の画像データに逆
変換する従来装置にあっては、次の問題があった。However, the conventional method of converting the image data into the pattern run-length code and then the universal code, and the pattern-run length code decoded from the universal code into the original image data is reversely converted. The device had the following problems.

【００２０】通常、ワープロ等で作成された文字画像の
場合、文字行間の行ピッチが一定であるために周期的に
空白部分が存在する。この場合、文字の部分と空白の部
分は分離可能であるが、画像の統計的性質が異なる。同
様に網点画像の場合も、網点周期毎に類似したパターン
が現れるが、網点周期の位相が異なるライン群では画像
の統計的性質が異なる。Usually, in the case of a character image created by a word processor or the like, blank spaces are periodically present because the line pitch between character lines is constant. In this case, the character portion and the blank portion can be separated, but the statistical properties of the image are different. Similarly, in the case of a halftone dot image, a similar pattern appears for each halftone dot period, but the statistical properties of the image are different in the line groups having different phases of the halftone dot period.

【００２１】しかしながら、従来の方法はＬＺＷ符号に
変換するユニバーサル符号化の部分において単一の辞書
を用いていたため、性質の異なるデータに対して混在さ
せた辞書を作成するか、データの種類毎に辞書をクリア
して再学習をするか２つの方法しかなかったため、圧縮
率を十分に高められない問題があった。本発明は、この
ような従来の問題点に鑑みてなされたもので、文字画像
の行ピッチや網点画像の網点周期に依存した統計的性質
の違いでデータを分類して専用の辞書を対応づけ、ユニ
バーサル符号化の圧縮率を高めるようにした画像データ
圧縮及び復元装置を提供することを目的とする。However, since the conventional method uses a single dictionary in the universal encoding part for converting to the LZW code, it is necessary to prepare a mixed dictionary for data having different properties, or to prepare for each data type. Since there are only two methods of clearing the dictionary and re-learning, there is a problem that the compression rate cannot be increased sufficiently. The present invention has been made in view of such a conventional problem, and classifies the data according to the difference in statistical properties depending on the line pitch of the character image and the halftone dot period of the halftone dot image to create a dedicated dictionary. It is an object of the present invention to provide an image data compression / decompression device that is associated with the compression rate of universal encoding.

【００２２】[0022]

【課題を解決するための手段】図１は本願第１発明の原
理説明図であり、文字画像を対象とする。まず本発明
は、図１（ａ）に示す中間符号化手段（パターン・ラン
レングス符号化手段）１０とユニバーサル符号化手段１
２を備えた画像データ圧縮装置を対象とする。ここで中
間符号化手段１０は、１次元のビットストリームで構成
される画像データを、所定長のラインデータに分割して
並べて２次元化し、この２次元データを所定ライン数毎
に分割方向でのパターンとライン方向での同一パターン
の連続数を示すランレングスの組でなる中間データに変
換する。またユニバーサル符号化手段１２は、中間符号
化手段１０で変換した中間データを辞書を用いてユニバ
ーサル符号に符号化する。FIG. 1 is a diagram for explaining the principle of the first invention of the present application, which targets character images. First, the present invention is based on the intermediate coding means (pattern run length coding means) 10 and the universal coding means 1 shown in FIG.
An image data compression device provided with 2. Here, the intermediate encoding means 10 divides the image data composed of the one-dimensional bit stream into line data of a predetermined length and arranges them into two-dimensional data, and the two-dimensional data is divided by a predetermined number of lines in the division direction. It is converted into intermediate data that is a set of run lengths indicating the number of consecutive patterns and the same pattern in the line direction. The universal encoding means 12 encodes the intermediate data converted by the intermediate encoding means 10 into a universal code using a dictionary.

【００２３】このような画像データ圧縮装置につき第１
発明は、ユニバーサル符号化に用いる辞書として空白用
辞書１６−１と非空白用辞書１６−２を設ける。そし
て、予め与えられた文字画像の行ピッチ情報により辞書
選択手段１４で、中間データへの変換対象となった複数
ラインが空白か否か判断し、２つの辞書１６−１，１６
−２から１つを選択することを特徴とする。First of all for such an image data compression apparatus
The invention provides a blank dictionary 16-1 and a non-blank dictionary 16-2 as dictionaries used for universal encoding. Then, based on the line pitch information of the character image given in advance, the dictionary selecting means 14 judges whether or not the plurality of lines which are the conversion targets to the intermediate data are blank, and the two dictionaries 16-1 and 16 are provided.
It is characterized in that one is selected from -2.

【００２４】また本発明は、図１（ｂ）に示すユニバー
サル復号化手段１８と中間復号化手段（パターン・ラン
レングス逆変換手段）２０とを備えた画像データ復元装
置を対象とする。ここでユニバーサル復号化手段１８
は、辞書を用いて符号データから、所定ライン数毎に分
割方向でのパターンとライン方向での同一パターンの連
続数を示すランレングスとの組でなる２次元の中間デー
タを復号する。また中間復号化手段２０は、ユニバーサ
ル復号化手段１８で復号した２次元の中間データを、１
次元のビットストリームで構成する画像データに逆変換
する。The present invention is also directed to an image data restoration device provided with a universal decoding means 18 and an intermediate decoding means (pattern / run length inverse conversion means) 20 shown in FIG. 1 (b). Here, the universal decoding means 18
Uses a dictionary to decode two-dimensional intermediate data, which is a set of a pattern in the dividing direction and a run length indicating the number of consecutive identical patterns in the line direction, for each predetermined number of lines from the code data. Further, the intermediate decoding means 20 converts the two-dimensional intermediate data decoded by the universal decoding means 18 into 1
Inverse conversion into image data composed of a three-dimensional bit stream.

【００２５】このような画像データ復元装置につき第１
発明にあっては、ユニバーサル復号化に用いる辞書とし
て、空白用辞書１６−１と非空白用辞書１６−２を設け
る。そして、予め与えられた文字画像の行ピッチ情報に
より辞書選択手段１４で、中間データへの変換対象とな
る複数ラインが空白か否か判断し、２つの辞書１６−
１，１６−２から対応する１つを選択する。First of all, regarding such an image data restoration device
In the present invention, the blank dictionary 16-1 and the non-blank dictionary 16-2 are provided as the dictionaries used for universal decoding. Then, based on the line pitch information of the character image given in advance, the dictionary selection means 14 determines whether or not a plurality of lines to be converted into intermediate data are blank, and the two dictionaries 16-
The corresponding one is selected from 1, 16-2.

【００２６】図２は本願の第２発明の原理説明図であ
り、同じく文字画像を対象としている。図２（ａ）の画
像データ圧縮装置では、ユニバーサル符号化に用いる辞
書として空白用辞書１６−１と非空白用辞書１６−２を
設け、辞書選択手段２２が中間データへの変換対象とな
った複数ライン毎にパターンの出現頻度を求めて空白か
否か判断し、２つの辞書の１つを選択とすると共に選択
した辞書番号を符号化させることを特徴とする。FIG. 2 is an explanatory view of the principle of the second invention of the present application, which is also intended for a character image. In the image data compression apparatus of FIG. 2A, a blank dictionary 16-1 and a non-blank dictionary 16-2 are provided as dictionaries used for universal encoding, and the dictionary selecting means 22 is a conversion target to intermediate data. It is characterized in that the frequency of appearance of the pattern is obtained for each of a plurality of lines to determine whether or not there is a blank, one of the two dictionaries is selected, and the selected dictionary number is encoded.

【００２７】また図２（ｂ）の画像データ復元装置で
は、同様に、ユニバーサル復号化に用いる辞書として空
白用辞書１６−１と非空白用辞書１６−２を設け、辞書
選択手段２４が符号データから復元した辞書番号によ
り、２つの辞書の１つを選択することを特徴とする。す
なわち第２発明は、中間データのパターンの出現頻度を
学習して自動的に辞書を選択する。In the image data restoration device of FIG. 2B, similarly, a blank dictionary 16-1 and a non-blank dictionary 16-2 are provided as dictionaries used for universal decoding, and the dictionary selection means 24 uses the code data. It is characterized in that one of the two dictionaries is selected according to the dictionary number restored from. That is, in the second invention, the appearance frequency of the pattern of the intermediate data is learned and the dictionary is automatically selected.

【００２８】図３は本願の第３発明であり、網点画像を
対象とする。図３（ａ）の画像データ圧縮装置では、ユ
ニバーサル符号化に用いる辞書として、網点周期に対応
した複数の辞書３２−１〜３２−ｎを設ける。そして、
辞書選択手段２６が、予め与えられた網点周期情報によ
り中間データへの変換対象となった複数ラインに対応し
た１つの辞書を複数の辞書３２−１〜３２−ｎから選択
することを特徴とする。FIG. 3 shows a third invention of the present application, which is intended for a halftone dot image. In the image data compression apparatus of FIG. 3A, a plurality of dictionaries 32-1 to 32-n corresponding to halftone dot periods are provided as dictionaries used for universal encoding. And
It is characterized in that the dictionary selecting means 26 selects one dictionary corresponding to a plurality of lines to be converted into intermediate data from a plurality of dictionaries 32-1 to 32-n according to halftone dot period information given in advance. To do.

【００２９】図３（ｂ）の画像データ復元装置では、同
様に、ユニバーサル復号化に用いる辞書として網点周期
に対応した複数の辞書３２−１〜３２−ｎを設け、辞書
選択手段２６が、予め与えられた網点周期情報により中
間データへの変換対象となる複数ラインに対応した１つ
の辞書を複数の辞書３２−１〜３２−ｎから選択するこ
とを特徴とする。In the image data restoration device of FIG. 3B, similarly, a plurality of dictionaries 32-1 to 32-n corresponding to halftone dot periods are provided as dictionaries used for universal decoding, and the dictionary selection means 26 It is characterized in that one dictionary corresponding to a plurality of lines to be converted into intermediate data is selected from a plurality of dictionaries 32-1 to 32-n according to halftone dot period information given in advance.

【００３０】図４は本願の第４発明の原理説明図であ
り、同じく網点画像を対象とする。図４（ａ）の画像デ
ータ圧縮装置では、ユニバーサル符号化に用いる辞書と
して、網点周期に対応した複数の辞書３２−１〜３２−
ｎを設け、そして、辞書選択手段３２が、中間データへ
の変換対象となった複数ライン毎にパターンの出現頻度
の分布を求め、過去に求めた出現頻度との差から複数の
辞書３２−１〜３２−ｎの１つを選択するか又は新たな
作成辞書を選択し、選択辞書の辞書番号と共に符号化さ
せることを特徴とする。FIG. 4 is an explanatory view of the principle of the fourth invention of the present application, which similarly targets a halftone dot image. In the image data compression apparatus of FIG. 4A, a plurality of dictionaries 32-1 to 32-corresponding to halftone dot periods are used as dictionaries used for universal encoding.
n is provided, and the dictionary selection means 32 obtains the distribution of the appearance frequency of the pattern for each of the plurality of lines that are the conversion target to the intermediate data, and the plurality of dictionaries 32-1 are obtained from the difference with the appearance frequency obtained in the past. 32 to 32-n or a new created dictionary is selected and encoded together with the dictionary number of the selected dictionary.

【００３１】図４（ｂ）の画像データ復元装置では、ユ
ニバーサル復号化に用いる辞書として、網点周期に対応
した複数の辞書３２−１〜３２−ｎを設け、辞書選択手
段３４が、符号データから復元した辞書番号により複数
の辞書３２−１〜３２−ｎの１つを選択するか、又は新
たな作成辞書を選択して復号化を行わせることを特徴と
する。In the image data restoration device of FIG. 4B, a plurality of dictionaries 32-1 to 32-n corresponding to halftone dot periods are provided as dictionaries used for universal decoding, and the dictionary selection means 34 causes the code data to be encoded. It is characterized in that one of the plurality of dictionaries 32-1 to 32-n is selected according to the dictionary number restored from, or a new created dictionary is selected for decoding.

【００３２】ここで図３及び図４の網点画像を対象とし
た画像データ圧縮又復元装置ついて、辞書３２−１〜３
２−ｎの分割数ｎは、網点周期と変換対象とするライン
数の最小公倍数を求め、最小公倍数を変換対象とする複
数ラインで割った値を辞書の分割数とする。即ち分割数Ｎ＝最小公倍数（網点周期，ライン数）／ライン
数として求める。Here, the image data compression or decompression device for the halftone image of FIGS.
For the division number n of 2-n, the least common multiple of the halftone dot period and the number of lines to be converted is obtained, and the value obtained by dividing the least common multiple by a plurality of lines to be converted is the number of divisions of the dictionary. That is, the number of divisions N = least common multiple (halftone dot period, number of lines) / number of lines.

【００３３】[0033]

【作用】このように本発明は、文字画像の場合は、文字
と空白に対応する２つの辞書２を用意し、前処理として
中間データ、例えばパターン・ランレングス符号に変換
し、続いて与えられた文字の行ピッチ等の情報により辞
書を選択してユニバーサル符号に符号化する。また中間
データとしてのパターン・ランレングス符号に変換する
複数ラインごとの２次元データにつき、パターンの出現
頻度分布を求め、全く空白であれば空白に対応する辞書
を選択し、そうでなければ文字に対応する辞書を選択し
て符号化する。As described above, according to the present invention, in the case of a character image, two dictionaries 2 corresponding to a character and a blank are prepared, intermediate data such as pattern run length code is converted as a pre-process, and is subsequently given. A dictionary is selected according to the information such as the line pitch of the characters and encoded into a universal code. In addition, the pattern appearance frequency distribution is calculated for the two-dimensional data for each line that is converted to the pattern run length code as the intermediate data, and if it is completely blank, the dictionary corresponding to the blank is selected. Select and encode the corresponding dictionary.

【００３４】一方、網点画像の場合は、網点周期の位相
の異なりに対応した辞書を複数用意し、前処理として中
間データ、例えばパターン・ランレングス符号に変換
し、続いて、与えられた網点周期の位相の違いにより辞
書を選択してユニバーサル符号に符号化する。また中間
データとしてのパターン・ランレングス符号に変換する
複数ラインごとの２次元データにつき、出現頻度の分布
を求め、この出現頻度分布を以前の出現頻度分布と比較
して、差異の最も小さい複数ラインに対応する辞書を選
択して符号化する。このとき以前の出現頻度の分布との
差異が所定の閾値以上で対応が得られなければ、新たな
辞書を作って符号化する。On the other hand, in the case of a halftone dot image, a plurality of dictionaries corresponding to different phases of the halftone dot period are prepared, intermediate data such as pattern run length code is converted as preprocessing, and then given. A dictionary is selected according to the phase of the halftone dot period and encoded into a universal code. In addition, the distribution of the appearance frequency is obtained for the two-dimensional data for each of the plurality of lines to be converted into the pattern run length code as the intermediate data, and the appearance frequency distribution is compared with the previous appearance frequency distribution, and the plurality of lines with the smallest difference are obtained. The dictionary corresponding to is selected and encoded. At this time, if the difference from the distribution of the previous appearance frequency is equal to or more than a predetermined threshold value and no correspondence is obtained, a new dictionary is created and encoded.

【００３５】[0035]

【実施例】図５は文書画像を処理する第１発明の画像デ
ータ圧縮装置の実施例を示す。この実施例において、画
像データ圧縮装置は中間符号化手段としてのパターン・
ランレングス変換器１０とＬＺＷ符号化を行うユニバー
サル符号器１２を備える。ユニバーサル符号器１２での
ＬＺＷ符号化に使用する辞書として空白用辞書１６−１
と文字用辞書１６−２を設けている。FIG. 5 shows an embodiment of the image data compression apparatus of the first invention for processing a document image. In this embodiment, the image data compression device uses a pattern
A run length converter 10 and a universal encoder 12 that performs LZW encoding are provided. Blank dictionary 16-1 as a dictionary used for LZW encoding in the universal encoder 12
And a character dictionary 16-2.

【００３６】辞書選択部１４は入力する文字画像につい
て予め定められている文書形式情報から文字の行ピッチ
を入力し、現在、ユニバーサル符号器１２で符号化する
パターン・ランレングス符号が文書画像の空白か文字か
を判断する。空白であれば空白用辞書１６−１を選択
し、文字であれば文字用辞書１６−２を選択する。ここ
で、パターン・ランレングス変換器１０は例えば図２６
に示したように、１次元のビットストリームとして入力
される文字データを所定長のラインデータに分割して２
次元的に配列し、例えば４ラインを対象にパターンの種
類と同一パターンの繰り返しでなるランレングスの組合
せでなるパターン・ランレングス符号に変換する。The dictionary selection unit 14 inputs the line pitch of characters from the document format information predetermined for the input character image, and the pattern run length code currently encoded by the universal encoder 12 is blank in the document image. Or character. If it is blank, the blank dictionary 16-1 is selected, and if it is a character, the character dictionary 16-2 is selected. Here, the pattern / run-length converter 10 is, for example, as shown in FIG.
, The character data input as a one-dimensional bit stream is divided into line data of a predetermined length and 2
For example, four lines are arranged in a dimension and converted into a pattern / run length code which is a combination of run lengths in which the type of pattern and the same pattern are repeated.

【００３７】また、ユニバーサル符号器１２は例えば、
図２９に示したＬＺＷ符号化アルゴリズムに従ってパタ
ーン・ランレングス変換器１０より得られた中間データ
としてのパターン・ランレングス符号を、そのとき選択
されている空白用辞書１６−１または文字用辞書１６−
２を使用してＬＺＷ符号化を行う。図６は図５の符号化
処理を示したフローチャートである。まずステップＳ１
でユニバーサル符号化に使用する辞書を空白用辞書１６
−１と文字用辞書１６−２の２つに分割する。続いてス
テップＳ２でパターン・ランレングス変換器１０に文字
画像の現画像データを入力する。この現画像データは１
次元のビットストリームとして入力される。続いてステ
ップＳ３で、処理対象とする文字画像の文書形式に従っ
て１行分の文字数に対応したビット長毎のラインデータ
に分割し、これを行方向に並べて図２６（ａ）に示した
２次元データに変換する。そして、この２次元データに
ついて例えば４ラインを１ブロックとして、行方向での
パターンとライン方向での同一パターンの繰返しを示す
ランレングスとの組合せでなるパターン・ランレングス
符号への変換を行う。The universal encoder 12 is, for example,
The pattern run length code as intermediate data obtained by the pattern run length converter 10 according to the LZW encoding algorithm shown in FIG. 29 is converted into the blank dictionary 16-1 or the character dictionary 16-
2 is used to perform LZW encoding. FIG. 6 is a flowchart showing the encoding process of FIG. First, step S1
The dictionary used for universal encoding is the blank dictionary 16
-1 and the character dictionary 16-2. Then, in step S2, the current image data of the character image is input to the pattern / run length converter 10. This current image data is 1
Input as a dimensional bitstream. Subsequently, in step S3, the line image is divided into line data for each bit length corresponding to the number of characters of one line according to the document format of the character image to be processed, and the line data is arranged in the line direction to form the two-dimensional image shown in FIG. Convert to data. Then, with respect to the two-dimensional data, for example, four lines are set as one block, and conversion into a pattern run length code is performed which is a combination of a pattern in the row direction and a run length indicating repetition of the same pattern in the line direction.

【００３８】続いてステップＳ４に進み、このときの文
字の行ピッチ情報に基づき、ユニバーサル符号器１２に
よる符号化に使用する辞書を選択する。いま図７に示す
ような２次元データに変換された文字画像から得られた
パターン・ランレングスを対象に、辞書を用いてユニバ
ーサル符号化を行うものとする。２次元データに変換さ
れた文字画像は文字部分及び空白部分が共に４ラインで
あることが文字の行ピッチ情報から認識される。Then, in step S4, a dictionary used for encoding by the universal encoder 12 is selected based on the line pitch information of the character at this time. Now, it is assumed that universal encoding is performed using a dictionary for a pattern run length obtained from a character image converted into two-dimensional data as shown in FIG. It is recognized from the character line pitch information that the character image converted into the two-dimensional data has a character portion and a blank portion each having four lines.

【００３９】即ち、先頭の４ラインで構成された第１ブ
ロックは文字の行ピッチ情報から文字部分と認識され、
従って文字用辞書１６−２が選択される。次の４ライン
の第２ブロックについては、文字の行ピッチ情報から空
白部分であることが認識され、空白用辞書１６−１が選
択される。以下、奇数ブロックについては文字用辞書１
６−２が選択され、偶数ブロックについては空白用辞書
１６−１が選択される。That is, the first block composed of the first four lines is recognized as a character portion from the character line pitch information,
Therefore, the character dictionary 16-2 is selected. Regarding the second block of the next four lines, it is recognized from the character line pitch information that it is a blank portion, and the blank dictionary 16-1 is selected. Below, for odd-numbered blocks, character dictionary 1
6-2 is selected, and the blank dictionary 16-1 is selected for even blocks.

【００４０】再び図６を参照するに、ステップＳ４で文
字の行ピッチ情報から辞書選択を行うと、ステップＳ５
で選択された空白用辞書１６−１または文字用辞書１６
−２を参照しながら符号化及び新たな辞書登録を行う。
この対象辞書を用いた符号化は、例えば図２９に示した
ＡＺＷ符号化アルゴリズムに従った処理となる。ステッ
プＳ５で１ブロック分の符号化が済むと、ステップＳ６
で符号化データを出力し、再びステップＳ４に戻って次
のブロックについて同様な処理を繰り返し、全ブロック
を終了するまで同様な処理を繰り返すことになる。Referring again to FIG. 6, when the dictionary is selected from the character line pitch information in step S4, step S5
The blank dictionary 16-1 or the character dictionary 16 selected in
-2, encoding and new dictionary registration are performed.
The encoding using this target dictionary is, for example, a process according to the AZW encoding algorithm shown in FIG. When the encoding for one block is completed in step S5, step S6
Then, the encoded data is output, the process returns to step S4, the same process is repeated for the next block, and the same process is repeated until all blocks are completed.

【００４１】図８は第１発明の画像データ復元装置の実
施例を示し、図５の符号データ圧縮装置で得られた符号
データを元の文書画像の画像データに戻す。この画像デ
ータ復元装置はユニバーサル復号器１８，中間復号手段
としてのパターン・ランレングス逆変換器２０，空白用
辞書１６−１，文字用辞書１６−２及び辞書選択部１４
で構成される。FIG. 8 shows an embodiment of the image data decompression device of the first invention, in which the code data obtained by the code data compression device of FIG. 5 is restored to the image data of the original document image. This image data restoration device includes a universal decoder 18, a pattern run length inverse converter 20 as an intermediate decoding means, a blank dictionary 16-1, a character dictionary 16-2, and a dictionary selection unit 14.
Composed of.

【００４２】ユニバーサル復号器１８は辞書を用いて、
例えば図３０に示した符号化アルゴリズムに従って符号
データからパターン・ランレングス符号を復元する。こ
のとき辞書選択部１４は予め与えられた復元しようとす
る文書画像の文字の行ピッチ情報に基づき、復元するパ
ターン・ランレングス符号のブロック単位に空白用辞書
１６−１または文字用辞書１６−２を選択する。パター
ン・ランレングス逆変換器２０はユニバーサル復号器１
８で復元されたパターン・ランレングス符号を４ライン
の画素の並びでなる２次元データに逆変換した後、１次
元のビットストリームとして出力する。The universal decoder 18 uses a dictionary to
For example, the pattern run length code is restored from the code data according to the coding algorithm shown in FIG. At this time, the dictionary selection unit 14 uses the blank line dictionary 16-1 or the character dictionary 16-2 for each block of the pattern run length code to be restored based on the line pitch information of the characters of the document image to be restored which is given in advance. Select. The pattern run length inverse converter 20 is a universal decoder 1
The pattern run length code restored in 8 is inversely converted into two-dimensional data consisting of a pixel array of four lines, and then output as a one-dimensional bit stream.

【００４３】図９は図８の復元処理を示したフローチャ
ートである。まずステップＳ１で、ユニバーサル復号化
に使用する辞書を空白用辞書１６−１と文字用辞書１６
−２の２つに分割する。次にステップＳ２で符号データ
を入力する。続いてステップＳ３で１ブロック分の符号
データを対象に予め与えられている文字の行ピッチ情報
に基づき、空白用辞書１６−１または文字用辞書１６−
２のいずれか１つを選択する。FIG. 9 is a flow chart showing the restoration processing of FIG. First, in step S1, the dictionaries used for universal decoding are the blank dictionary 16-1 and the character dictionary 16.
-2. Next, in step S2, code data is input. Subsequently, in step S3, the blank dictionary 16-1 or the character dictionary 16- is created on the basis of the line pitch information of the character given in advance for the code data for one block.
Select one of the two.

【００４４】具体的には、図７に示したように先頭ブロ
ックの符号データについては文字部分となることから文
字用辞書１６−２を選択し、次の第２ブロックについて
は空白部分となることから空白用辞書１６−１を選択
し、以下同様に、奇数ブロックは文字用辞書１６−２、
偶数ブロックは空白用辞書１６−１を選択する。ステッ
プＳ３で辞書選択が済むとステップＳ４に進み、選択し
た対象辞書を参照して符号データからパターン・ランレ
ングス符号に変換する復号化を行い、新たな辞書登録を
繰り返す。１ブロック分のユニバーサル復号化が済むと
ステップＳ５に進み、パターン・ランレングス符号をパ
ターンの種類とランレングスに従って２次元データに逆
変換して、元の文書画像の画像データを復元する。Specifically, as shown in FIG. 7, since the code data of the first block is a character portion, the character dictionary 16-2 is selected, and the next second block is a blank portion. Select the blank dictionary 16-1 from the following, and similarly, for the odd blocks, the character dictionary 16-2,
For even-numbered blocks, the blank dictionary 16-1 is selected. When the dictionary is selected in step S3, the process proceeds to step S4, the selected target dictionary is referred to, decoding is performed to convert the code data into the pattern run length code, and new dictionary registration is repeated. When the universal decoding for one block is completed, the process proceeds to step S5, the pattern run length code is inversely converted into two-dimensional data according to the type of pattern and the run length, and the image data of the original document image is restored.

【００４５】このようなステップＳ３〜Ｓ５の処理を全
ブロックあるいは予め定めた数のブロック単位に繰り返
す。全ブロックあるいは所定のブロック単位の復元処理
が終了するとステップＳ６に進み、２次元画像に復元さ
れた復元データを１次元のビットストリームとして出力
する。図１０は第２発明による画像データ圧縮装置の実
施例を示し、この実施例にあってはパターン・ランレン
グス符号に変換する所定ライン数、例えば４ラインで構
成されたブロック毎にパターンの出現頻度を求めて空白
か否か検出するようにしたことを特徴とする。即ち、パ
ターン・ランレングス変換器１０，ユニバーサル符号器
１２，空白用辞書１６−１及び文字用辞書１６−２は図
５の第１発明の実施例と同じであるが、新たに空白の有
無を検出する辞書選択部２２を設けている。The processes of steps S3 to S5 are repeated for all blocks or a predetermined number of blocks. When the restoration processing for all blocks or a predetermined block unit is completed, the process proceeds to step S6, and the restored data restored to the two-dimensional image is output as a one-dimensional bit stream. FIG. 10 shows an embodiment of the image data compression apparatus according to the second invention. In this embodiment, the frequency of appearance of a pattern for each block composed of a predetermined number of lines to be converted into a pattern run length code, for example, 4 lines. Is characterized by detecting whether or not there is a blank. That is, the pattern / run length converter 10, the universal encoder 12, the blank dictionary 16-1 and the character dictionary 16-2 are the same as those in the embodiment of the first invention shown in FIG. A dictionary selection unit 22 for detecting is provided.

【００４６】辞書選択部はパターン・ランレングス変換
器１０で変換された２次元データの１ブロックについて
空白パターンの出現頻度を検出する。即ち、空白を示す
ビット００００の空白パターンの出現頻度が他のパター
ンに対し極めて高い値として得られている場合には空白
と判断し、空白用辞書１６−１を選択してユニバーサル
符号器１２で符号化させる。これに対し文字部分につい
ては、オールビット０の空白パターン以外に他のビット
パターンの出現頻度もあることから、文字部分と判断し
て文字用辞書１６−２を選択してユニバーサル符号器１
２での符号化に使用させる。The dictionary selection unit detects the appearance frequency of the blank pattern for one block of the two-dimensional data converted by the pattern / run length converter 10. That is, when the appearance frequency of the blank pattern of the bit 0000 indicating a blank is obtained as an extremely high value with respect to other patterns, it is determined to be blank, the blank dictionary 16-1 is selected, and the universal encoder 12 is selected. Encode. On the other hand, with respect to the character portion, since there is the frequency of appearance of other bit patterns in addition to the blank pattern of all bits 0, it is judged that the character portion and the character dictionary 16-2 is selected and the universal encoder 1 is selected.
2 to be used for encoding.

【００４７】図１１は図１０の空白検出を行う辞書選択
部２２を用いた画像データ圧縮装置の処理動作を示した
フローチャートである。この場合にはステップＳ３でブ
ロック単位にパターンの出現頻度を求め、空白のみの場
合は空白用辞書１６−１を、そうでない場合は文字用辞
書１６−２を選択する。更に、復元側での辞書選択を可
能とするため、ユニバーサル符号器１２ではパターン・
ランレングス符号のユニバーサル符号化に先立って最初
に辞書番号を符号化する。それ以外の処理は図６の符号
化と同じになる。FIG. 11 is a flow chart showing the processing operation of the image data compression apparatus using the dictionary selection unit 22 for performing the blank detection of FIG. In this case, the appearance frequency of the pattern is obtained in block units in step S3, and if there is only a blank, the blank dictionary 16-1 is selected, and if not, the character dictionary 16-2 is selected. Furthermore, in order to enable dictionary selection on the restoration side, the universal encoder 12
The dictionary number is first encoded prior to universal encoding of the run length code. The other processing is the same as the encoding in FIG.

【００４８】図１２は図１０の画像データ圧縮装置で得
られた符号データから元の文字画像の画像データを復元
する画像データ復元装置の実施例を示す。この実施例は
ユニバーサル復号器１８に対し空白用辞書１６−１と文
字用辞書１６−２の２つを設けており、辞書選択部２４
でいずれか１つを選択する。辞書選択部２４は符号デー
タのブロック部分の先頭に符号化されている辞書番号を
復号し、復号した辞書番号に基づき空白用辞書１６−１
または文字用辞書１６−２を選択する。ユニバーサル復
号器１８で空白用辞書１６−１または文字用辞書１６−
２を用いて復号されたパターン・ランレングス符号は、
パターン・ランレングス逆変換器２０で２次元の画像デ
ータに復元され、最終的に１次元のビットストリームと
して出力される。FIG. 12 shows an embodiment of an image data decompression device for decompressing the image data of the original character image from the coded data obtained by the image data compression device of FIG. In this embodiment, the universal decoder 18 is provided with two blank dictionaries 16-1 and character dictionaries 16-2.
Select any one. The dictionary selection unit 24 decodes the dictionary number encoded at the beginning of the block portion of the coded data, and based on the decoded dictionary number, the blank dictionary 16-1
Alternatively, the character dictionary 16-2 is selected. In the universal decoder 18, the blank dictionary 16-1 or the character dictionary 16-
The pattern run length code decoded using 2 is
The pattern / run length inverse converter 20 restores the two-dimensional image data and finally outputs it as a one-dimensional bit stream.

【００４９】図１３は図１２の画像データ復元装置の復
元処理を示したフローチャートであり、ステップＳ２で
符号データのブロック部分の先頭の符号化された辞書番
号を復元し、復元した辞書番号に基づいて空白用辞書１
６−１または文字用辞書１６−２を選択する点が特徴と
なる。それ以外は図９の復元処理と同じになる。図１４
は網点画像を対象とした符号化を行う第３発明の画像デ
ータ圧縮装置の実施例を示す。この画像データ圧縮装置
はパターン・ランレングス変換器１０，ユニバーサル符
号器１２，辞書選択部２６，辞書部２８で構成される。
辞書部２８は予め与えられた網点周期と変換対象となっ
ている１ブロックを構成するライン数との関係で辞書を
辞書３０−１〜３０−ｎに分割する。FIG. 13 is a flow chart showing the restoration processing of the image data restoration device of FIG. 12, wherein the head encoded dictionary number of the block portion of the code data is restored in step S2, and based on the restored dictionary number. And blank dictionary 1
A feature is that 6-1 or the character dictionary 16-2 is selected. The rest is the same as the restoration process of FIG. 14
Shows an embodiment of an image data compression apparatus of the third invention for performing encoding on a halftone image. This image data compression apparatus is composed of a pattern / run length converter 10, a universal encoder 12, a dictionary selection unit 26, and a dictionary unit 28.
The dictionary unit 28 divides the dictionary into the dictionaries 30-1 to 30-n based on the relationship between the halftone dot period given in advance and the number of lines forming one block to be converted.

【００５０】図１５は網点画像を対象とした辞書分割を
具体的に示す。この網点画像は網点周期が１２で４ライ
ン毎にパターン・ランレングス符号に変換する場合、網
点周期の位相が異なる３つのブロックが存在する。従っ
て、３つのブロック毎に出易いパターンが異なり、従っ
て辞書を３つの辞書３０−１〜３０−３に分割して、対
応する４ラインのブロック毎にユニバーサル符号化を行
う必要がある。FIG. 15 specifically shows the dictionary division for the halftone image. In this halftone dot image, the halftone dot period is 12, and when converted into a pattern run length code every four lines, there are three blocks having different phases of the halftone dot period. Therefore, the patterns that are likely to appear in each of the three blocks are different, and therefore it is necessary to divide the dictionary into three dictionaries 30-1 to 30-3 and perform universal coding for each corresponding block of 4 lines.

【００５１】即ち、網点周期と変換対象とする１ブロッ
クのライン数の最小公倍数を１ブロックのライン数で割
った値が辞書の分割数ｎとなる。図１５の場合、網点周
期１２と１ブロックのライン数４の最小公倍数は１２で
あり、これを１ブロックのライン数４で割ると分割数ｎ
＝３となる。再び図１４を参照するに、辞書選択部２６
はパターン・ランレングス変換器１０で網点画像データ
をブロック単位にパターン・ランレングス符号に変換し
た後、予め与えられている網点周期と現在変換対象とな
っているブロックの複数ラインとの関係から、辞書部２
８の辞書３０−１〜３０−ｎの中から対象となる辞書を
選択して、ユニバーサル符号器１２による符号化を行わ
せる。That is, a value obtained by dividing the least common multiple of the halftone dot period and the number of lines in one block to be converted by the number of lines in one block is the number of divisions n in the dictionary. In the case of FIG. 15, the least common multiple of the halftone dot period 12 and the number of lines in one block is 12, and when this is divided by the number of lines in one block, the division number n.
= 3. Referring again to FIG. 14, the dictionary selection unit 26
Is a pattern run length converter 10 that converts halftone dot image data into a pattern run length code in block units, and then a relationship between a halftone dot period given in advance and a plurality of lines of the block currently being converted. From the dictionary section 2
The target dictionary is selected from the eight dictionaries 30-1 to 30-n, and the universal encoder 12 performs the encoding.

【００５２】図１６は図１４の網点画像を対象とした画
像データ圧縮装置の処理動作を示したフローチャートで
ある。まずステップＳ１で網点の周期情報と位相の違
い、即ち１ブロックを構成するライン数により決めた分
割数ｎにより１つの辞書を複数の辞書３０−１〜３０−
ｎに分割する。続いて、網点画像の現画像データをステ
ップＳ２で入力し、パターン・ランレングス変換器１０
で２次元のパターン・ランレングス符号に変換する。FIG. 16 is a flow chart showing the processing operation of the image data compression apparatus for the halftone image of FIG. First, in step S1, one dictionary is divided into a plurality of dictionaries 30-1 to 30-in accordance with the difference between the halftone dot period information and the phase, that is, the division number n determined by the number of lines forming one block.
Divide into n. Then, the current image data of the halftone image is input in step S2, and the pattern / run length converter 10 is input.
Is converted into a two-dimensional pattern run length code.

【００５３】続いてステップＳ４に進み、ブロック単位
に与えられた網点周期と現在変換の対象となっているブ
ロックの複数ラインとの関係から、対応する辞書３０−
ｉを選択する。続いてステップＳ５で、選択した辞書を
使用してユニバーサル符号化と辞書登録を行い、１ブロ
ック分の符号化が済むと、ステップＳ６で符号データを
出力する。以上のステップＳ４〜Ｓ６を所定ブロックま
たは全ブロックを終了するまで繰り返す。Then, in step S4, the corresponding dictionary 30-is determined from the relationship between the halftone dot period given in block units and the plurality of lines of the block currently being converted.
Select i. Subsequently, in step S5, universal coding and dictionary registration are performed using the selected dictionary, and when coding of one block is completed, code data is output in step S6. The above steps S4 to S6 are repeated until a predetermined block or all blocks are completed.

【００５４】図１７は図１４の画像データ圧縮装置で得
られた符号データから元の網点画像を復元する画像デー
タ復元装置の実施例を示す。この実施例においても、図
１４の場合と同様、網点周期と変換対象となっている１
ブロックのライン数との関係から定まる分割数ｎで辞書
部２８を辞書３０−１〜３０−ｎに分割する。また、辞
書選択部２６で、現在ユニバーサル復号器１８で復号し
ようとするブロックに対応する複数ラインと予め与えら
れた網点周期の関係から、対象となる辞書を選択する。FIG. 17 shows an embodiment of an image data decompression device for decompressing the original halftone dot image from the coded data obtained by the image data compression device of FIG. Also in this embodiment, as in the case of FIG. 14, the halftone dot period and the conversion target 1
The dictionary unit 28 is divided into the dictionaries 30-1 to 30-n with the division number n determined from the relationship with the number of lines of the block. Further, the dictionary selection unit 26 selects a target dictionary from the relationship between a plurality of lines corresponding to the block currently to be decoded by the universal decoder 18 and a halftone dot period given in advance.

【００５５】図１８は図１７の復元処理を示したフロー
チャートである。この復元処理にあっては、ステップＳ
１で図１６の場合と同様、網点周期と位相の違いにより
決められた１ブロックのライン数で決まる分割数ｎで辞
書を分割する。またステップＳ３において、現在復元対
象となっているブロックに対応する複数ラインと予め与
えられた網点周期の関係から、対応する辞書を選択す
る。FIG. 18 is a flow chart showing the restoration processing of FIG. In this restoration process, step S
16, the dictionary is divided by the number of divisions n determined by the number of lines in one block determined by the difference in halftone dot period and phase, as in the case of FIG. Further, in step S3, the corresponding dictionary is selected from the relationship between the plurality of lines corresponding to the block currently being restored and the halftone dot period given in advance.

【００５６】図１９は網点画像を対象とした第４発明の
画像データ復元装置の実施例を示す。この画像データ復
元装置にあっては、辞書選択部３２で変換対象となるブ
ロックを構成する複数ラインの網点画像データを対象に
パターンの出現頻度の分布を求め、この出現頻度の分布
を以前のブロックの出現頻度の分布と比較し、差異の最
も小さいブロックに対応する辞書を選択して符号化する
ようにしたことを特徴とする。また、以前のブロックの
出現頻度との差異が所定の閾値より大きい場合には、新
たな辞書を作って符号化する。FIG. 19 shows an embodiment of the image data restoration device of the fourth invention for a halftone image. In this image data restoration device, the dictionary selection unit 32 obtains the distribution of the appearance frequency of the pattern for the halftone dot image data of a plurality of lines forming the block to be converted, and the distribution of the appearance frequency is calculated as before. It is characterized in that the dictionary corresponding to the block having the smallest difference is selected and encoded by comparing with the distribution of the appearance frequency of the block. If the difference from the appearance frequency of the previous block is larger than a predetermined threshold value, a new dictionary is created and encoded.

【００５７】図２２は辞書選択部３２における過去のブ
ロックの出現頻度の分布と、現在処理対象としているブ
ロックの出現頻度分布を示す。図２２（ａ）のＡ，Ｂ，
Ｃは以前のブロックの出現頻度の分布である。このよう
な状態で図２２（ｂ）に示す現在処理対象としているブ
ロックの出現頻度の分布Ｄが得られたとする。これを出
現頻度分布Ａ，Ｂ，Ｃに重ねると破線で示すようにな
る。この場合、出現頻度分布Ｂは以前のブロックの出現
頻度分布Ｂとの差異が最も小さいことから、出現頻度分
布Ｂをもつブロックの符号化で使用した辞書を選択す
る。また、辞書選択部３２は辞書部２８の中の辞書３０
−１〜３０−ｎのいずれかを選択したとき、あるいは新
たな辞書３０−（ｎ＋１）を選択した場合には、ユニバ
ーサル符号器１２におけるパターン・ランレングス符号
の符号化に先立ってブロックの先頭に選択した辞書番号
を符号化する。FIG. 22 shows the distribution of the appearance frequencies of past blocks in the dictionary selection unit 32 and the appearance frequency distribution of the blocks currently being processed. 22A, A, B,
C is the distribution of the appearance frequency of the previous block. In such a state, it is assumed that the distribution D of the appearance frequency of the block currently being processed shown in FIG. 22B is obtained. When this is overlaid on the appearance frequency distributions A, B, and C, it becomes as shown by the broken line. In this case, since the appearance frequency distribution B has the smallest difference from the appearance frequency distribution B of the previous block, the dictionary used for encoding the block having the appearance frequency distribution B is selected. In addition, the dictionary selection unit 32 uses the dictionary 30 in the dictionary unit 28.
When any one of -1 to 30-n is selected, or when a new dictionary 30- (n + 1) is selected, the block is added to the head of the block before the universal run encoder 12 encodes the pattern run length code. Encode the selected dictionary number.

【００５８】図２１は図１９のパターンの出現頻度の分
布から辞書を選択する画像選択処理を示したフローチャ
ートであり、ステップＳ３でブロック単位に２次元デー
タとした１ブロックの複数ラインのパターンの出現頻度
を求め、以前の出現頻度の分布との差から辞書を選択す
る。また、以前の出現頻度との差が所定の閾値を越えて
いる場合には新たな辞書を作成する。そして、選択した
辞書番号または新たに作成した辞書番号を最初に符号化
するようになる。これをブロック単位に繰り返す。FIG. 21 is a flow chart showing an image selection process for selecting a dictionary from the distribution of the appearance frequencies of the patterns shown in FIG. The frequency is calculated, and the dictionary is selected based on the difference from the distribution of the previous appearance frequency. If the difference from the previous appearance frequency exceeds a predetermined threshold, a new dictionary is created. Then, the selected dictionary number or the newly created dictionary number is encoded first. This is repeated for each block.

【００５９】図２２は図１９の画像データ圧縮装置で得
られた符号データから元の網点画像を復元する第４発明
の画像データ復元装置の実施例を示す。ユニバーサル復
号器１８，パターン・ランレングス逆変換器２０及び辞
書３０−１〜３０−ｎに分割された辞書部２８は、図１
７の第３発明と同じであるが、新たに辞書選択部３４が
設けられる。FIG. 22 shows an embodiment of the image data decompression device of the fourth invention for decompressing the original halftone dot image from the coded data obtained by the image data compression device of FIG. The universal decoder 18, the pattern run length inverse converter 20, and the dictionary unit 28 divided into the dictionaries 30-1 to 30-n are shown in FIG.
7 is the same as the third invention, but a dictionary selecting unit 34 is newly provided.

【００６０】辞書選択部３４は符号データのブロック先
頭に符号化されている辞書番号を復号し、復号した辞書
番号に基づいて辞書部２８の中の辞書３０−１〜３０−
ｎのいずれかを選択するか新たな辞書を作成する。図２
３は図２２の復元処理を示したフローチャートであり、
ステップＳ２で送られてきた辞書番号を復元して複数の
辞書３０−１〜３０−ｎの中から対応する辞書を選択す
るか、あるいは新たな辞書を作成してユニバーサル復号
器１８による復号化を行う。The dictionary selection unit 34 decodes the dictionary number encoded at the head of the block of the coded data, and the dictionaries 30-1 to 30-in the dictionary unit 28 based on the decoded dictionary number.
Either select n or create a new dictionary. Figure 2
3 is a flowchart showing the restoration processing of FIG.
The dictionary number sent in step S2 is restored and the corresponding dictionary is selected from the plurality of dictionaries 30-1 to 30-n, or a new dictionary is created and decoded by the universal decoder 18. To do.

【００６１】[0061]

【発明の効果】以上説明してきたように本発明によれ
ば、複数の辞書を用いて、文字画像の場合は空白部分と
文字部分に分けてそれぞれ対応する専用の辞書を選択
し、また網点画像の場合は網点周期と処理単位となる複
数ラインから位相の異なった数分の辞書に分けてそれぞ
れ対応する専用の辞書を選択することで、統計的性質の
異なる部分を考慮せずに、混在した単一の辞書を用いて
符号化した場合に比べ、より高い圧縮率を得ることがで
きる。As described above, according to the present invention, a plurality of dictionaries are used, and in the case of a character image, a blank part and a character part are separately selected and corresponding dedicated dictionaries are selected. In the case of an image, by dividing a plurality of dictionaries having different phases from a plurality of lines as a halftone dot period and a processing unit and selecting a dedicated dictionary corresponding to each, without considering a portion having different statistical properties, A higher compression rate can be obtained as compared with the case of encoding using a single mixed dictionary.

[Brief description of drawings]

【図１】文字画像を対象とした第１発明の原理説明図FIG. 1 is an explanatory view of the principle of the first invention for character images.

【図２】文字画像を対象とした第２発明の原理説明図FIG. 2 is an explanatory view of the principle of the second invention for character images.

【図３】網点画像を対象とした第３発明の原理説明図FIG. 3 is an explanatory diagram of the principle of the third invention for a halftone image.

【図４】網点画像を対象とした第４発明の原理説明図FIG. 4 is an explanatory view of the principle of the fourth invention for a halftone image.

【図５】第１発明の画像データ圧縮装置の実施例構成図FIG. 5 is a block diagram of an embodiment of an image data compression apparatus of the first invention.

【図６】図５の実施例の符号化処理を示したフローチャ
ートFIG. 6 is a flowchart showing an encoding process of the embodiment of FIG.

【図７】文字画像と辞書選択の説明図FIG. 7 is an explanatory diagram of character images and dictionary selection.

【図８】第１発明の画像データ復元装置の実施例構成図FIG. 8 is a configuration diagram of an embodiment of an image data restoration device of the first invention.

【図９】図８の実施例の復号化処理を示したフローチャ
ートFIG. 9 is a flowchart showing a decoding process of the embodiment of FIG.

【図１０】第２発明の画像データ圧縮装置の実施例構成
図FIG. 10 is a block diagram of an embodiment of an image data compression apparatus of the second invention.

【図１１】図１０の実施例の符号化処理を示したフロー
チャート11 is a flowchart showing the encoding process of the embodiment of FIG.

【図１２】第２発明の画像データ復元装置の実施例構成
図FIG. 12 is a block diagram of an embodiment of an image data restoration device of the second invention.

【図１３】図１２の実施例の復号化処理を示したフロー
チャートFIG. 13 is a flowchart showing a decoding process of the embodiment of FIG.

【図１４】第３発明の画像データ圧縮装置の実施例構成
図FIG. 14 is a block diagram of an embodiment of an image data compression apparatus of the third invention.

【図１５】網点画像と辞書選択の説明図FIG. 15 is an explanatory diagram of halftone image and dictionary selection.

【図１６】図１４の実施例の符号化処理を示したフロー
チャート16 is a flowchart showing the encoding process of the embodiment of FIG.

【図１７】第３発明の画像データ復元装置の実施例構成
図FIG. 17 is a block diagram of an embodiment of an image data restoration device of the third invention.

【図１８】図１７の実施例の復号化処理を示したフロー
チャートFIG. 18 is a flowchart showing a decoding process of the embodiment of FIG.

【図１９】第４発明の画像データ圧縮装置の実施例構成
図FIG. 19 is a block diagram of an embodiment of an image data compression apparatus of the fourth invention.

【図２０】辞書選択に用いる複数ラインごとのパターン
出現頻度分布の相違を示した説明図FIG. 20 is an explanatory diagram showing a difference in pattern appearance frequency distribution for each of a plurality of lines used for dictionary selection.

【図２１】図１９の実施例の符号化処理を示したフロー
チャートFIG. 21 is a flowchart showing the encoding process of the embodiment of FIG.

【図２２】第４発明の画像データ復元装置の実施例構成
図FIG. 22 is a configuration diagram of an embodiment of an image data restoration device of the fourth invention.

【図２３】図２２の実施例の復号化処理を示したフロー
チャートFIG. 23 is a flowchart showing a decoding process of the embodiment of FIG.

【図２４】従来の画像データ圧縮装置の説明図FIG. 24 is an explanatory diagram of a conventional image data compression device.

【図２５】従来の画像データ復元装置の説明図FIG. 25 is an explanatory diagram of a conventional image data restoration device.

【図２６】２次元画像データとパターン・ランレングス
符号との間の変換および逆変換を示した説明図FIG. 26 is an explanatory diagram showing conversion and inverse conversion between two-dimensional image data and a pattern run length code.

【図２７】ユニバーサル符号化で用いる辞書の木構造を
示した説明図FIG. 27 is an explanatory diagram showing a tree structure of a dictionary used in universal encoding.

【図２８】ユニバーサル符号の説明図FIG. 28 is an explanatory diagram of a universal code.

【図２９】従来のＬＺＡ符号化アルゴリズムを示したフ
ローチャートFIG. 29 is a flowchart showing a conventional LZA encoding algorithm.

【図３０】従来のＬＺＷ復号化アルゴリズムを示したフ
ローチャートFIG. 30 is a flowchart showing a conventional LZW decoding algorithm.

【図３１】ａｂｃ３文字を例にとってＬＺＷ符号化を具
体的に示した説明図FIG. 31 is an explanatory diagram concretely showing LZW encoding by taking abc3 characters as an example.

【図３２】図３１の符号化で使用される辞書の説明図32 is an explanatory diagram of a dictionary used in the encoding of FIG. 31.

【図３３】ａｂｃ３文字を例にとってＬＺＷ復号化を具
体的に示した説明図FIG. 33 is an explanatory diagram concretely showing LZW decoding by taking abc3 characters as an example.

[Explanation of symbols]

１０：中間符号化手段（パターン・ランレングス符号
器）１２：ユニバーサル符号化手段（ユニバーサル符号器）１４，２２，２４，３２，３４：辞書選択手段（辞書選
択部）１６−１：空白用辞書１６−２：文字用辞書１８：ユニバーサル復号化手段（ユニバーサル復号器）２０：中間復号化手段（パターン・ランレングス復号
器）２８：辞書機構３０−１〜３０−２：辞書10: Intermediate Coding Means (Pattern Run Length Encoder) 12: Universal Coding Means (Universal Encoder) 14, 22, 24, 32, 34: Dictionary Selection Means (Dictionary Selection Unit) 16-1: Blank Dictionary 16-2: Character dictionary 18: Universal decoding means (universal decoder) 20: Intermediate decoding means (pattern run length decoder) 28: Dictionary mechanism 30-1 to 30-2: Dictionary

Claims

[Claims]

1. Image data composed of a one-dimensional bit stream is divided into line data of a predetermined length and converted into two-dimensional data, and the two-dimensional data is divided into patterns in a dividing direction for every predetermined number of lines. Intermediate coding means (10) for converting into intermediate data consisting of a set of run lengths indicating the number of consecutive identical patterns in the line direction, and the intermediate coding means (1)
In the image data compression apparatus provided with a universal encoding means (12) for encoding the intermediate data converted in 0) into a universal code using the dictionary, a blank dictionary (16) is used as the dictionary for the universal encoding. -1) and a non-blank dictionary (16-2) are provided, and it is judged from the two dictionaries whether or not a plurality of lines to be converted into the intermediate data are blank based on the pitch information of the character given in advance. An image data compression apparatus, characterized in that a dictionary selection means (14) for selecting one is provided.

2. A two-dimensional intermediate consisting of a set of a pattern in a dividing direction and a run length indicating the number of consecutive identical patterns in a line direction for every predetermined number of lines from code data universally coded using a dictionary. Universal decoding means (18) for decoding data, and intermediate decoding means (20) for inversely converting the two-dimensional intermediate data decoded by the universal decoding means (18) into image data composed of a one-dimensional bit stream. ) And an image data decompression device including a blank dictionary (16-1) and a non-blank dictionary (16-2) as dictionaries to be used for the universal decoding, and a predetermined pitch of characters is provided. A dictionary selecting means (14) for selecting one of the two dictionaries by judging whether or not a plurality of lines to be converted into the intermediate data are blank based on information is provided. An image data compression device characterized by:

3. Image data composed of a one-dimensional bit stream is divided into line data of a predetermined length and converted into two-dimensional data, and the two-dimensional data is divided into patterns in a dividing direction for every predetermined number of lines. Intermediate coding means (10) for converting into intermediate data consisting of a set of run lengths indicating the number of consecutive identical patterns in the line direction, and the intermediate coding means (1)
In the image data compression apparatus provided with the universal encoding means (12) for encoding the intermediate data converted in 0) into a universal code using the dictionary, a blank dictionary ( 16-1) and a non-blank dictionary (16-2) are provided, the appearance frequency of the pattern is determined for each of a plurality of lines that have been converted to the intermediate data, and it is determined whether the pattern is blank or not.
An image data compression apparatus comprising a dictionary selection means (22) for selecting one of the two dictionaries and encoding the selected dictionary number.

4. A two-dimensional intermediate consisting of a set of a pattern in the dividing direction and a run length indicating the number of consecutive identical patterns in the line direction for every predetermined number of lines from the code data universally encoded using a dictionary. Universal decoding means (18) for decoding data, and intermediate decoding means (20) for inversely converting the two-dimensional intermediate data decoded by the universal decoding means (18) into image data composed of a one-dimensional bit stream. ) And a blank dictionary (16-1) and a non-blank dictionary (16-2) are provided as dictionaries used for the universal decoding, and the dictionaries restored from the code data are provided. An image data compression apparatus, characterized by further comprising dictionary selection means (24) for selecting one of the two dictionaries according to a number.

5. Image data composed of a one-dimensional bit stream is divided into line data of a predetermined length and converted into two-dimensional data, and the two-dimensional data is divided into patterns in a dividing direction for every predetermined number of lines. Intermediate coding means (10) for converting into intermediate data consisting of a set of run lengths indicating the number of consecutive identical patterns in the line direction, and the intermediate coding means (1)
In the image data compression apparatus provided with the universal encoding means (12) for encoding the intermediate data converted in 0) into the universal code using the dictionary, the dictionary used in the universal encoding has a halftone dot cycle. A plurality of corresponding dictionaries (32-1 to 32-n) are provided,
Dictionary selecting means (26) for selecting, from the plurality of dictionaries (32-1 to 32-n), one dictionary corresponding to a plurality of lines which have been converted into the intermediate data according to halftone dot period information given in advance. An image data compression device, characterized by being provided with.

6. A two-dimensional intermediate consisting of a set of a pattern in a dividing direction and a run length indicating the number of consecutive identical patterns in a line direction for every predetermined number of lines from code data universally coded using a dictionary. Universal decoding means (18) for decoding data, and intermediate decoding means (20) for inversely converting the two-dimensional intermediate data decoded by the universal decoding means (18) into image data composed of a one-dimensional bit stream. ), And a plurality of dictionaries (32-1 to 32-n) corresponding to halftone dot periods are provided as dictionaries used for the universal decoding.
Halftone dot dictionary selecting means (26) for selecting one dictionary corresponding to a plurality of lines to be converted into the intermediate data from the plurality of dictionaries (32-1 to 32-n) according to halftone dot period information given in advance. ) Is provided, the image data compression device.

7. Image data composed of a one-dimensional bit stream is divided into line data of a predetermined length and converted into two-dimensional data, and the two-dimensional data is divided into patterns in a dividing direction for every predetermined number of lines. Intermediate coding means (10) for converting into intermediate data consisting of a set of run lengths indicating the number of consecutive identical patterns in the line direction, and the intermediate coding means (1)
In an image data compression apparatus equipped with a universal encoding means (12) for encoding the intermediate data converted in 0) into a universal code using a dictionary, a dictionary used for the universal encoding is A plurality of corresponding dictionaries (32-1 to 32-n) are provided,
The distribution of the appearance frequency of the pattern is obtained for each of the plurality of lines that are the conversion targets to the intermediate data, and the plurality of dictionaries (32-1 to 32-n) are obtained from the difference with the appearance frequency obtained in the past.
Dictionary selecting means (3) for selecting one of the selected dictionary or a newly created dictionary and encoding with the dictionary number of the selected dictionary.
2. An image data compression device characterized by being provided with 2).

8. A two-dimensional intermediate consisting of a set of a pattern in the dividing direction and a run length indicating the continuous number of the same pattern in the line direction for every predetermined number of lines from code data universally encoded using a dictionary. Universal decoding means (18) for decoding the data, and intermediate decoding means (20) for inversely converting the two-dimensional intermediate data decoded by the universal decoding means (18) into image data composed of a one-dimensional bit stream. ), And a plurality of dictionaries (32-1 to 32-n) corresponding to halftone dot periods are provided as dictionaries used for the universal decoding.
Dictionary selection means (34) for selecting one of the plurality of dictionaries (32-1 to 32-n) according to the dictionary number restored from the code data or selecting a new created dictionary and performing decoding. An image data compression device characterized by being provided.

9. The image data compression or decompression device according to claim 7 or 8, wherein the least common multiple of the halftone dot period and the number of lines to be converted is obtained, and the least common multiple is converted into a plurality of lines to be converted. An image data compression or decompression device characterized in that a divided value is used as the number of divisions of a dictionary.