JP2010027051A

JP2010027051A - Image compression apparatus and image compression method

Info

Publication number: JP2010027051A
Application number: JP2009162958A
Authority: JP
Inventors: Atsushi Tabata; 淳田畑; Masaaki Yasunaga; 真明安永
Original assignee: Toshiba Corp; Toshiba TEC Corp
Current assignee: Toshiba Corp; Toshiba TEC Corp
Priority date: 2008-07-22
Filing date: 2009-07-09
Publication date: 2010-02-04

Abstract

PROBLEM TO BE SOLVED: To provide an image compression apparatus and an image compression method attaining high-speed image compression processing by a dictionary system. SOLUTION: The image compression apparatus has an analysis section 12 analyzing an input image and outputting object layout information and page attribute information; a segmenting section 13 segmenting the image based on the object layout information and outputting a component image ; a dictionary section for selecting an image dictionary corresponding to the page attribute information out of existing image dictionaries, comparing the selected image dictionary with the component image to determine the corresponding image dictionary, and outputting the determined image dictionary and a dictionary index; and an encoding section 15 encoding the image dictionary and the dictionary index. COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、画像情報からオブジェクト配置情報と頁属性情報を抽出し、これらに基いて画像圧縮する画像圧縮装置および画像圧縮方法に関する。 The present invention relates to an image compression apparatus and an image compression method for extracting object arrangement information and page attribute information from image information and compressing an image based on the extracted information.

従来から、２値画像を効率良く圧縮するために、画像ビットマップを辞書化して、インデックス（該当辞書番号とその配置）と辞書を圧縮データとする方式を標準化した技術としてＪＰＥＧ２が知られている。ＪＰＥＧ２では、圧縮率を上げるため、例えば編み点文字を構成する、形状情報（位置含む）と網点情報に基づいて、それぞれ対となる別の符号化手段で符号化する構成が示されている。 Conventionally, in order to efficiently compress a binary image, JPEG2 is known as a standardized technique for converting an image bitmap into a dictionary and using an index (corresponding dictionary number and its arrangement) and a dictionary as compressed data. . In JPEG2, in order to increase the compression ratio, for example, a configuration is shown in which encoding is performed by another pair of encoding means based on shape information (including position) and halftone dot information, which form a braided dot character, for example. .

画像情報は大容量となるため、一般に圧縮を行い保存・使用している。通常、文字画像などは２値化して可逆圧縮、写真などは多値の非可逆圧縮を行っている。 Since image information has a large capacity, it is generally compressed and stored / used. Usually, character images and the like are binarized to perform lossless compression, and photographs and the like are subjected to multi-value irreversible compression.

しかしながら、２値文字画像に関しても非可逆で圧縮する方式が提案され、ISO/IEC１４４９２（ＪＰＥＧ２）として標準化されている。 However, a method of irreversibly compressing a binary character image has been proposed and standardized as ISO / IEC14492 (JPEG2).

この方式の特徴的な点は、文書画像などは同一文字が多く出現することから、文字のビットマップ画像を辞書化することによって、同じ文字はビットマップでなく、辞書インデックスと画像上の位置情報で表現することで、圧縮対象のビットマップ量を減らして圧縮する点である。このような圧縮技術に関し、例えば以下のような特許文献が知られている。 Characteristic of this method is that many of the same characters appear in the document image etc., so by making a bitmap image of the character into a dictionary, the same character is not a bitmap, but a dictionary index and position information on the image This is to reduce the amount of bitmap to be compressed and compress it. For such compression technology, for example, the following patent documents are known.

特許文献１（特開２００６−２５４３２９）は、単純な線描画文字だけでなく、ハーフトーンなどのパターンで表現された文字も効率良く圧縮するように、文字を構成する形状情報とそのパターン情報に分離して圧縮することで、パターン表現された文字等を効率よく圧縮する方式を開示している。 Patent Document 1 (Japanese Patent Application Laid-Open No. 2006-254329) discloses not only simple line-drawing characters but also shape information and pattern information constituting the characters so as to efficiently compress characters represented by patterns such as halftones. A method of efficiently compressing characters and the like represented by patterns by separating and compressing them is disclosed.

特許文献２（特開２００７−１７４００８）は、画像を文字や写真領域を識別し、それぞれに適した圧縮を適用する方式において、文字領域の圧縮に辞書方式の圧縮を用いる例が開示されており、辞書圧縮前に入力された文字領域のデータから非テキストとみなせる情報を辞書化圧縮対象から除外することで、高速に圧縮する方法を開示している。 Patent Document 2 (Japanese Patent Laid-Open No. 2007-174008) discloses an example in which dictionary-based compression is used for character area compression in a system that identifies characters and photo areas of an image and applies compression suitable for each. A method of high-speed compression is disclosed by excluding information that can be regarded as non-text from character area data input before dictionary compression from a lexicographic compression target.

特許文献３（特開２００５−３０１６６３）は、ＯＣＲ（Optical Character Recognition）で取得した文字単位を元にビットマップ辞書化の単位を設定し、必要以上に画像が部品化されることを防ぐ技術を開示している。 Patent Document 3 (Japanese Patent Laid-Open No. 2005-301663) is a technique for setting a bitmap dictionary unit based on a character unit acquired by OCR (Optical Character Recognition) and preventing an image from being partized more than necessary. Disclosure.

しかしこれら特許文献１乃至３の従来技術においては、部品画像のマッチング処理において、様々な文書フォーマットやフォント種類をもった文書の頁属性情報を考慮しておらず、効率的な画像辞書の探索・登録を行なっていないという問題がある。 However, in the prior arts of these patent documents 1 to 3, the page image attribute information of documents having various document formats and font types is not considered in the matching process of the component images, and an efficient image dictionary search / There is a problem of not registering.

本発明は、辞書化方式による画像圧縮処理を高速化することができる画像圧縮装置および画像圧縮方法を提供することを目的とする。 An object of the present invention is to provide an image compression apparatus and an image compression method capable of speeding up image compression processing using a dictionary method.

課題を解決する一実施形態は、
入力した画像を解析してオブジェクト配置情報と頁属性情報を出力する解析部と、
前記オブジェクト配置情報に基いて、前記画像を部品化して、部品画像を出力する部品化部と、
既にある画像辞書の中から前記解析部からの前記頁属性情報に対応する画像辞書を選び、前記選ばれた画像辞書と前記部品化部からの前記部品画像を比較して対応する画像辞書を決定し、前記決定した画像辞書と前記部品画像の辞書インデックスを出力する辞書化部と、
前記選択された画像辞書と前記辞書インデックスを符号化する符号化部と、
を具備することを特徴とする画像圧縮装置である。 One embodiment to solve the problem is:
An analysis unit that analyzes the input image and outputs object placement information and page attribute information;
Based on the object arrangement information, the image is converted into a component, and a component converting unit that outputs a component image;
An image dictionary corresponding to the page attribute information from the analysis unit is selected from among existing image dictionaries, and the corresponding image dictionary is determined by comparing the selected image dictionary with the component image from the componentizing unit. A dictionary forming unit for outputting the determined image dictionary and a dictionary index of the component image;
An encoding unit for encoding the selected image dictionary and the dictionary index;
An image compression apparatus comprising:

画像圧縮処理を行なう際の部品画像と画像辞書のマッチング処理において、圧縮しようとする画像の頁属性情報（例えば文書、プレゼン、地図等）に対応する画像辞書を優先し対応しない画像辞書は排除するなどして、マッチング処理を効率化する。これにより、画像圧縮処理を高速化することができる。 In the matching process between the component image and the image dictionary when performing the image compression process, the image dictionary corresponding to the page attribute information (for example, document, presentation, map, etc.) of the image to be compressed is given priority and the image dictionary not corresponding is excluded. Etc. to make the matching process more efficient. Thereby, the image compression processing can be speeded up.

本発明の第１実施形態の画像圧縮装置の構成の一例を示すブロック図。1 is a block diagram showing an example of the configuration of an image compression apparatus according to a first embodiment of the present invention. 当該画像圧縮装置のレイアウト解析部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the layout analysis part of the said image compression apparatus. 当該画像圧縮装置のレイアウト解析部の動作の一例を説明する説明図。Explanatory drawing explaining an example of operation | movement of the layout analysis part of the said image compression apparatus. 当該画像圧縮装置の頁属性判定部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the page attribute determination part of the said image compression apparatus. 当該画像圧縮装置の頁判定データの一例の説明図。Explanatory drawing of an example of the page determination data of the said image compression apparatus. 当該画像圧縮装置の画像部品化部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image componentization part of the said image compression apparatus. 当該画像圧縮装置の画像部品化部の動作の一例を説明する説明図。Explanatory drawing explaining an example of operation | movement of the image componentization part of the said image compression apparatus. 当該画像圧縮装置の画像辞書化部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image dictionary part of the said image compression apparatus. 当該画像圧縮装置の画像辞書化部の動作の一例を説明する説明図。Explanatory drawing explaining an example of operation | movement of the image dictionary part of the said image compression apparatus. 第１実施形態の変形例である画像圧縮装置の構成を示すブロック図。The block diagram which shows the structure of the image compression apparatus which is a modification of 1st Embodiment. 当該画像圧縮装置のレイアウト解析部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the layout analysis part of the said image compression apparatus. 当該画像圧縮装置の頁属性判定部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the page attribute determination part of the said image compression apparatus. 当該画像圧縮装置のための傾き計算補助図の一例を示す説明図。Explanatory drawing which shows an example of the inclination calculation auxiliary | assistance figure for the said image compression apparatus. 当該画像圧縮装置の頁判定データの一例の説明図。Explanatory drawing of an example of the page determination data of the said image compression apparatus. 当該画像圧縮装置の画像辞書化部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image dictionary part of the said image compression apparatus. 本発明の第２実施形態の画像圧縮装置の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image compression apparatus of 2nd Embodiment of this invention. 当該画像圧縮装置の文字処理部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the character processing part of the said image compression apparatus. 本発明の第３実施形態の画像圧縮装置の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image compression apparatus of 3rd Embodiment of this invention. 当該画像圧縮装置の画像辞書化部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image dictionary part of the said image compression apparatus. 当該画像圧縮装置の辞書の探索順序入れ替え効果を説明する説明図。Explanatory drawing explaining the search order switching effect of the dictionary of the said image compression apparatus. 当該画像圧縮装置の画像辞書化部の構成の一例を示すブロック図。The block diagram which shows an example of a structure of the image dictionary part of the said image compression apparatus. 当該画像圧縮装置のビットマップカウント部の働きを説明する説明図。Explanatory drawing explaining the function of the bitmap count part of the said image compression apparatus.

以下、この発明の実施の形態について図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

本発明の要旨は、入力された画像から頁属性情報を抽出し、画像を部品化して複数の画像辞書に辞書化すると共に、抽出した頁属性情報に基づいて複数の画像辞書から一つを選択することで、辞書マッチング処理を効率化して迅速な圧縮処理を実現する画像圧縮装置および画像圧縮方法である。 The gist of the present invention is that page attribute information is extracted from an input image, the image is converted into a part into a plurality of image dictionaries, and one is selected from a plurality of image dictionaries based on the extracted page attribute information Thus, the present invention provides an image compression apparatus and an image compression method that realizes quick compression processing by making dictionary matching processing efficient.

このような構成をとることにより、属性に適応した辞書を生成するため、辞書の作成が高速で圧縮率も高い辞書およびインデックス方式の圧縮ファイルを生成することができる。 By adopting such a configuration, a dictionary adapted to attributes is generated, so that a dictionary can be generated at a high speed and a compression ratio can be high, and a compressed file of an index method can be generated.

なお、辞書の切り替え情報はレイアウト解析部以外にも、原稿モード等のユーザー指示手段を用いて辞書を切り替えることも可能である。また切り替え方法も頁単位でなく、頁内のオブジェクト単位で切り替えることも可能である。 In addition to the layout analysis unit, the dictionary switching information can be switched using user instruction means such as a document mode. Also, the switching method can be switched not for each page but for each object in the page.

また、この辞書の切り替え（選択）は、頁単位毎、地図等の原稿種別の頁情報毎、スキュー角度、天地等の頁単位の原稿方向情報毎、文字情報毎に行なうことも可能であり、それぞれ、高速・高圧縮な辞書圧縮ファイルを生成することができる。 Further, this switching (selection) of the dictionary can be performed for each page, for each page information of a document type such as a map, for each document direction information of a page unit such as a skew angle and top and bottom, and for each character information. Each can generate a high-speed and high-compression dictionary compressed file.

（第１実施形態）
図１は、本発明の第１実施形態の画像圧縮装置の構成の一例を示すブロック図である。本発明の一実施形態に係る画像圧縮装置１は、全体の動作を制御する制御部１０と、原稿画像を読み取るスキャナ１１と、スキャナ１１から画像信号ｍ１が供給されるレイアウト解析部１２と、スキャナ１１から画像信号ｍ１が供給されレイアウト解析部１２からオブジェクト配置情報ｍ２が供給される画像部品化部１３を有する。さらに、画像圧縮装置１は、レイアウト解析部１２から頁属性情報ｍ３を受け画像部品化部１３から部品画像ｍ４を受ける画像辞書化部１４と、画像辞書化部１４から画像辞書信号ｍ５と辞書インデックスｍ６が供給される画像符号化部１５と、画像符号化部１５から符号データｍ７が供給される記憶部１６と、記憶部１６の後段に設けられたインターフェース部１７を有している。 (First embodiment)
FIG. 1 is a block diagram showing an example of the configuration of the image compression apparatus according to the first embodiment of the present invention. An image compression apparatus 1 according to an embodiment of the present invention includes a control unit 10 that controls the overall operation, a scanner 11 that reads an original image, a layout analysis unit 12 that is supplied with an image signal m1 from the scanner 11, and a scanner. 11 includes an image component converting unit 13 to which the image signal m1 is supplied from 11 and the object arrangement information m2 is supplied from the layout analyzing unit 12. Further, the image compression apparatus 1 receives the page attribute information m3 from the layout analysis unit 12, the image dictionary unit 14 that receives the component image m4 from the image component unit 13, and the image dictionary signal m5 and the dictionary index from the image dictionary unit 14. An image encoding unit 15 to which m6 is supplied, a storage unit 16 to which code data m7 is supplied from the image encoding unit 15, and an interface unit 17 provided at the subsequent stage of the storage unit 16 are provided.

このような構成をもつ画像圧縮装置１において、以下に動作を説明すると、スキャナ１１から入力された画像信号ｍ１はレイアウト解析部１２に供給され、レイアウト解析部１２では、供給された画像信号ｍ１を既知のレイアウト解析技術により処理することで、オブジェクト配置情報ｍ２と頁属性情報ｍ３を出力する。画像部品化部１３は、供給された画像信号ｍ１とオブジェクト配置情報ｍ２を用いて、部品画像ｍ４を出力する。 In the image compression apparatus 1 having such a configuration, the operation will be described below. The image signal m1 input from the scanner 11 is supplied to the layout analysis unit 12, and the layout analysis unit 12 receives the supplied image signal m1. By processing with a known layout analysis technique, object arrangement information m2 and page attribute information m3 are output. The image component converting unit 13 outputs a component image m4 using the supplied image signal m1 and object arrangement information m2.

画像辞書化部１４は、部品画像ｍ４を既知のビットマップ辞書化技術と頁属性情報ｍ３を用いて、画像辞書ｍ５と辞書インデックスｍ６を出力する。画像符号化部１５は、これらの信号を受けて符号データｍ７を生成し、記憶部１６に格納する。 The image dictionary unit 14 outputs an image dictionary m5 and a dictionary index m6 using the known bitmap dictionary conversion technique and the page attribute information m3 for the component image m4. The image encoding unit 15 receives these signals, generates code data m7, and stores it in the storage unit 16.

図２は、当該画像圧縮装置のレイアウト解析部の構成の一例を示すブロック図である。レイアウト解析部１２は、図２に示すように、画像信号ｍ１を受ける縮小処理部２１と、縮小処理部２１から縮小画像信号ｍ１１が供給される連結画素探索部２２と、連結画素探索部２２から領域の左上座標、左下座標、右上座標、右下座標である領域情報ｍ１２を受ける領域座標変換部２３と、同じく領域情報ｍ１２を受ける頁属性判定部２４と、頁属性判定部２４に接続されるテーブル２５を有している。 FIG. 2 is a block diagram illustrating an example of the configuration of the layout analysis unit of the image compression apparatus. As shown in FIG. 2, the layout analysis unit 12 includes a reduction processing unit 21 that receives the image signal m 1, a connected pixel search unit 22 to which the reduced image signal m 11 is supplied from the reduction processing unit 21, and a connection pixel search unit 22. The region coordinate conversion unit 23 that receives the region information m12 that is the upper left coordinate, the lower left coordinate, the upper right coordinate, and the lower right coordinate of the region, the page attribute determination unit 24 that similarly receives the region information m12, and the page attribute determination unit 24 are connected. A table 25 is provided.

このような構成をもつレイアウト解析部１２において、縮小処理部２１は、画像信号ｍ１を受けると既定の縮小率で縮小し、縮小画像信号ｍ１１を出力する。連結画素探索部２２は、既知のチェインアルゴリズムで８方向探索して、画素の連結した領域の左上座標、左下座標、右上座標、右下座標である領域情報ｍ１２を出力する。ここで、領域情報ｍ１２の座標は、縮小処理部２１で縮小処理した座標系であるので、画像信号ｍ１と同じ座標系に領域座標変換部２３で変換し、オブジェクト配置情報ｍ２として出力する。一方、頁属性判定部２４は、頁内の領域情報ｍ１２を集計し、テーブル２５から読み出した頁判定データｍ１３と比較して、頁属性情報ｍ３を出力する。 In the layout analysis unit 12 having such a configuration, upon receiving the image signal m1, the reduction processing unit 21 reduces the image at a predetermined reduction rate and outputs a reduced image signal m11. The connected pixel search unit 22 searches in eight directions using a known chain algorithm, and outputs area information m12 that is the upper left coordinates, lower left coordinates, upper right coordinates, and lower right coordinates of the areas where the pixels are connected. Here, since the coordinates of the area information m12 are in the coordinate system reduced by the reduction processing unit 21, the area coordinate conversion unit 23 converts the coordinate system to the same coordinate system as the image signal m1 and outputs it as object arrangement information m2. On the other hand, the page attribute determination unit 24 totals the area information m12 in the page, compares it with the page determination data m13 read from the table 25, and outputs page attribute information m3.

ここで、頁属性判定部２４の動作を除いたレイアウト解析部１２の動作例を図３に示す。画像信号ｍ１を縮小処理して得た縮小画像ｍ１１では、文字等は連結した画素になることがわかる。この画像に対して既知のチェインアルゴリズムを用いて、連結した領域を算出すると、領域情報ｍ１２の点線で示したような各固まり毎の複数の領域が算出される。そして、領域座標変換部２３は、領域情報ｍ１２を受けて、座標系を縮小前に戻したオブジェクト配置情報ｍ２を算出する。 Here, an operation example of the layout analysis unit 12 excluding the operation of the page attribute determination unit 24 is shown in FIG. In the reduced image m11 obtained by reducing the image signal m1, it can be seen that characters and the like are connected pixels. When a connected region is calculated for this image using a known chain algorithm, a plurality of regions for each cluster as indicated by the dotted line in the region information m12 are calculated. Then, the area coordinate conversion unit 23 receives the area information m12 and calculates object arrangement information m2 in which the coordinate system is returned to before reduction.

次に、図４は、当該画像圧縮装置の頁属性判定部の構成の一例を示すブロック図である。頁属性判定部２４は、領域情報ｍ１２が供給される領域傾き算出部３１と、同じく領域情報ｍ１２が供給される領域距離算出部３２と、領域傾き算出部３１から傾き情報ｍ２１が供給され領域距離算出部３２から距離情報ｍ２２が供給される判断部３３を有している。 Next, FIG. 4 is a block diagram illustrating an example of the configuration of the page attribute determination unit of the image compression apparatus. The page attribute determination unit 24 includes a region inclination calculation unit 31 to which the region information m12 is supplied, a region distance calculation unit 32 to which the region information m12 is supplied, and a region distance to which the inclination information m21 is supplied from the region inclination calculation unit 31. It has the judgment part 33 to which distance information m22 is supplied from the calculation part 32.

このような構成をもつ頁属性判定部２４において、領域傾き算出部３１は、領域情報ｍ１２を用いて、各領域毎の傾き＝ＭＡＸ（｜左上Ｘ座標−左下Ｘ座標｜、｜左上Ｙ座標−右上Ｙ座標｜）を計算して求め、頁内の全領域の傾きの平均を傾き情報ｍ２１として算出する。 In the page attribute determination unit 24 having such a configuration, the region inclination calculation unit 31 uses the region information m12 to determine the inclination for each region = MAX (| upper left X coordinate−lower left X coordinate |, | upper left Y coordinate− The upper right Y coordinate |) is calculated and obtained, and the average of the inclinations of all the areas in the page is calculated as the inclination information m21.

また、領域距離算出部３２は、各領域のセンター座標を求め、各センター座標毎に最も近いセンター座標を持つ領域ペア（下記でＰ１及びＰ２を求め、
各領域毎の距離＝ＭＩＮ（｜Ｐ１のＸ座標−Ｐ２のＸ座標｜、｜Ｐ１のＹ座標−Ｐ２のＹ座標｜）
を求めることで、頁内のペアの平均を距離情報ｍ２２として算出する。 Further, the area distance calculation unit 32 obtains the center coordinates of each area, obtains an area pair having the nearest center coordinates for each center coordinate (P1 and P2 below are obtained,
Distance for each area = MIN (| X coordinate of P1−X coordinate of P2 |, Y coordinate of | P1−Y coordinate of P2 |)
Is obtained as the distance information m22.

判断部３３は、傾き情報ｍ２１と距離情報ｍ２２を受け取り、テーブル２５から図５に示すような頁判定データｍ１３を読み込む。判断部３３は、傾き情報ｍ２１と距離情報ｍ２２を所定の閾値と比較した結果と頁判定データｍ１３に基づいて原稿タイプを決定し、頁属性情報ｍ３として出力する。 The determination unit 33 receives the inclination information m21 and the distance information m22, and reads page determination data m13 as shown in FIG. The determination unit 33 determines the document type based on the result of comparing the inclination information m21 and the distance information m22 with a predetermined threshold value and the page determination data m13, and outputs it as page attribute information m3.

次に、図６は、当該画像圧縮装置の画像部品化部の構成の一例を示すブロック図である。画像部品化部１３は、図６に示すように、画像信号ｍ１とオブジェクト配置情報ｍ２がそれぞれ供給される縦方向画素カウント部４１と横方向画素カウント部４２と、縦方向画素カウント部４１からの縦方向の射影ｍ３１と横方向画素カウント部４２からの横方向の射影ｍ３２が供給される比較器４３と、同じく縦方向画素カウント部４１からの縦方向の射影ｍ３１と横方向画素カウント部４２からの横方向の射影ｍ３２と比較器４３からの制御信号ｍ３３が供給される選択部４４と、選択部４４からの選択信号ｍ３４と画像信号ｍ１が供給される画素分割部４５を有している。 Next, FIG. 6 is a block diagram illustrating an example of the configuration of the image component converting unit of the image compression apparatus. As shown in FIG. 6, the image componentization unit 13 receives the vertical direction pixel count unit 41, the horizontal direction pixel count unit 42, and the vertical direction pixel count unit 41 to which the image signal m 1 and the object arrangement information m 2 are respectively supplied. From the comparator 43 to which the vertical projection m31 and the horizontal projection m32 from the horizontal pixel count unit 42 are supplied, and from the vertical projection m31 and the horizontal pixel count unit 42 from the vertical pixel count unit 41. A horizontal projection m32 and a selection unit 44 to which the control signal m33 from the comparator 43 is supplied, and a pixel division unit 45 to which the selection signal m34 and the image signal m1 from the selection unit 44 are supplied.

このような構成をもつ画像部品化部１３において、縦方向画素カウント部４１は、オブジェクト配置情報ｍ２の座標単位で画像信号ｍ１を分割して、縦軸への画素カウントの射影を縦方向の射影ｍ３１として出力する。また、横方向画素カウント部４２も、オブジェクト配置情報ｍ２の座標単位で画像信号ｍ１を分割して、横軸への画素カウントの射影を横方向の射影ｍ３２として比較器４３に出力する。比較器４３は、選択部４４が射影値の分散の大きいほうを選択するような制御信号ｍ３３を選択部４４に出力する。選択部４４は、射影値の分散の大きいほうを選択する選択信号ｍ３４を画素分割部４５に供給する。画素分割部４５は、選択された射影値を用いて、画像信号ｍ１を分割して部品画像ｍ４を出力する。 In the image componentizing unit 13 having such a configuration, the vertical pixel counting unit 41 divides the image signal m1 by the coordinate unit of the object arrangement information m2, and the projection of the pixel count on the vertical axis is projected in the vertical direction. Output as m31. The horizontal pixel count unit 42 also divides the image signal m1 by the coordinate unit of the object arrangement information m2, and outputs the projection of the pixel count on the horizontal axis to the comparator 43 as the horizontal projection m32. The comparator 43 outputs to the selection unit 44 a control signal m33 that allows the selection unit 44 to select the one with the larger variance of the projection values. The selection unit 44 supplies the pixel division unit 45 with a selection signal m34 for selecting the one with the larger variance of the projection values. The pixel dividing unit 45 divides the image signal m1 using the selected projection value and outputs a component image m4.

これらの動作を図７を用いて説明する。図７は、当該画像圧縮装置の画像部品化部の動作の一例を説明する説明図である。比較器４３では、縦方向の射影ｍ３１と横方向の射影ｍ３２を比較し、分散の大きい横方向の射影をする。画素分割部４５で、射影に対して閾値処理を行うことで、横方向の分割座標を算出し（点線）、図７で○で示された単位により部品画像として出力する。 These operations will be described with reference to FIG. FIG. 7 is an explanatory diagram illustrating an example of the operation of the image component converting unit of the image compression apparatus. The comparator 43 compares the projection m31 in the vertical direction and the projection m32 in the horizontal direction, and performs a horizontal projection with a large variance. The pixel dividing unit 45 performs threshold processing on the projection to calculate horizontal division coordinates (dotted line), and outputs the result as a component image in units indicated by ◯ in FIG.

つぎに、図８は、当該画像圧縮装置の画像辞書化部の構成の一例を示すブロック図である。画像辞書化部１４は、図５で示した頁属性情報ｍ３に対応する文書辞書５３、プレゼン辞書５２、地図辞書５１、非文書辞書５０を有しており、さらに、頁属性情報ｍ３が供給され各辞書を選択する選択部５５と、選択部５５により選択された辞書が接続され、部品画像ｍ４が供給され画像辞書信号ｍ５と辞書インデックスｍ６を出力するマッチング部５４を有している。 Next, FIG. 8 is a block diagram showing an example of the configuration of the image dictionary unit of the image compression apparatus. The image dictionary unit 14 includes a document dictionary 53, a presentation dictionary 52, a map dictionary 51, and a non-document dictionary 50 corresponding to the page attribute information m3 shown in FIG. 5, and is further supplied with page attribute information m3. A selection unit 55 that selects each dictionary and a dictionary selected by the selection unit 55 are connected, and a matching unit 54 that supplies a component image m4 and outputs an image dictionary signal m5 and a dictionary index m6 is provided.

このような構成をもつ画像辞書化部１４において、選択部５５は、頁属性情報ｍ３に従って文書辞書５３、プレゼン辞書５２、地図辞書５１、非文書辞書５０のうちの適切な辞書を選択出力する。マッチング部５４は、既知の位置シフトとマッチング処理を組み合せにより、部品画像ｍ４が、選択部５５により選択された辞書に存在するかどうかを判断し、存在すれば、その辞書インデックスｍ６を出力する。辞書インデックスｍ６は、対応辞書名と辞書のインデックスと画像上の部品画像の位置情報である。 In the image dictionary unit 14 having such a configuration, the selection unit 55 selects and outputs an appropriate dictionary among the document dictionary 53, the presentation dictionary 52, the map dictionary 51, and the non-document dictionary 50 according to the page attribute information m3. The matching unit 54 determines whether or not the component image m4 exists in the dictionary selected by the selection unit 55 by combining the known position shift and matching processing, and outputs the dictionary index m6 if it exists. The dictionary index m6 is the corresponding dictionary name, dictionary index, and position information of the component image on the image.

マッチング部５４は、対応するビットマップパターンが辞書に無ければ、その部品画像ｍ４を現在選択されている辞書に登録してインデックスを付与する。マッチング部５４は、スキャナ１１から入力された全原稿に関して、上記のマッチング処理が終了すれば、各辞書データを画像辞書ｍ５として出力する。そして、画像符号化部１５は、画像辞書ｍ５を既知の画像圧縮技術（例えばランレングス）で圧縮し、符号データｍ７として、辞書インデックスｍ６と一緒にて出力する。 If there is no corresponding bitmap pattern in the dictionary, the matching unit 54 registers the component image m4 in the currently selected dictionary and assigns an index. The matching unit 54 outputs each dictionary data as the image dictionary m5 when the above matching processing is completed for all the documents input from the scanner 11. Then, the image encoding unit 15 compresses the image dictionary m5 with a known image compression technique (for example, run length), and outputs it as code data m7 together with the dictionary index m6.

次に、本発明による特徴的な辞書の効果を図９を用いて説明する。一般に、複数の枚数の文書をスキャンすると、図９に示すように一般文書やプレゼン用のドキュメント、図３で示した地図など、原稿のタイプが異なる文書が混在することがある。これらの文書では、各文書タイプ毎には同一種のフォントデザインやサイズが使われている可能性が高いが、文書タイプが異なるとデザインやサイズが異なる可能性が高い。 Next, the effect of the characteristic dictionary according to the present invention will be described with reference to FIG. In general, when a plurality of documents are scanned, there may be a mixture of documents of different document types such as a general document, a document for presentation as shown in FIG. 9, and a map shown in FIG. In these documents, there is a high possibility that the same font design and size are used for each document type, but there is a high possibility that the design and size will be different for different document types.

一般的には、これらの辞書データをまとめて一つの辞書に登録し、検索及びマッチングを行なう。しかし、上述した本発明に係る実施形態においては、辞書データを原稿タイプ毎に複数の辞書として辞書化し、辞書を使用する際に、頁属性情報に従って適切な辞書を選択し切り替えて、検索、マッチングを行う。これにより、頁属性情報に対応しない不適切な辞書を検索・マッチングの対象としないため、辞書マッチングを効率的に高速に行なうことができる。すなわち、文書画像をマッチングする場合、文書辞書を選択し、プレゼン辞書、地図辞書、非文書辞書を使用しないことで、効率的で高速な辞書マッチングを行なうことができる。 Generally, these dictionary data are collectively registered in one dictionary, and search and matching are performed. However, in the embodiment according to the present invention described above, dictionary data is dictionaryd as a plurality of dictionaries for each document type, and when using the dictionary, an appropriate dictionary is selected and switched according to the page attribute information, and search and matching are performed. I do. Thereby, since an inappropriate dictionary that does not correspond to the page attribute information is not searched and matched, dictionary matching can be performed efficiently and at high speed. That is, when matching document images, efficient and high-speed dictionary matching can be performed by selecting a document dictionary and not using a presentation dictionary, a map dictionary, or a non-document dictionary.

また、マッチング部５４でマッチング精度を緩めれば、非可逆で辞書ビットマップの圧縮ファイルが作成されるが、様々なフォントデザインを一つの辞書で管理する方式では、画質とマッチング精度の制御が難しい。しかしながら、本発明の方法のように原稿タイプ毎に辞書を切り替えれば、同一フォントデザイン群内でマッチングが行われる確率が高いので、画質低下を抑えつつマッチング精度を緩めることができるので、従来よりも圧縮率の高い辞書ビットマップ圧縮ファイルを提供することができる。 If the matching unit 54 loosens the matching accuracy, a dictionary bitmap compressed file is created irreversibly. However, it is difficult to control image quality and matching accuracy with a system that manages various font designs in a single dictionary. . However, if the dictionary is switched for each document type as in the method of the present invention, the probability of matching within the same font design group is high, so the matching accuracy can be relaxed while suppressing degradation in image quality. A dictionary bitmap compressed file having a high compression rate can be provided.

なおレイアウト解析方法や、そこから算出される頁情報、辞書化や圧縮方法等や、辞書の切り替え方法は、上述した実施形態に限定されるものではなく、本発明の実施可能な範囲は当業者が想到する範囲に及ぶものである。 Note that the layout analysis method, page information calculated therefrom, dictionary formation, compression method, and dictionary switching method are not limited to the above-described embodiments, and the scope of implementation of the present invention is within the scope of those skilled in the art. Is in the range that can be expected.

（第１実施形態の変形例）
次に、本発明の第１実施形態の変形例について以下に図面を用いて詳細に説明する。図１０は、第１実施形態の変形例である画像圧縮装置の構成を示すブロック図である。 (Modification of the first embodiment)
Next, a modification of the first embodiment of the present invention will be described in detail with reference to the drawings. FIG. 10 is a block diagram illustrating a configuration of an image compression apparatus that is a modification of the first embodiment.

第１実施形態の変形例は、（領域距離を考慮せずに）領域傾きのみから頁属性情報ｍ３を決定し、決定した頁属性情報ｍ３により辞書を生成し選択する画像圧縮装置および画像圧縮方法を提供する。第１実施形態の変形例は、基本的に第１実施形態と同様の処理部は同一番号を付与しており、レイアウト解析部１２’、その頁属性情報ｍ３’、それを元に辞書化を行う画像辞書化部１４’が若干異なる以外は同一であるので、変更ブロックのみを説明する。 The modification of the first embodiment is an image compression apparatus and an image compression method for determining page attribute information m3 from only the region inclination (without considering the region distance), and generating and selecting a dictionary based on the determined page attribute information m3. I will provide a. In the modified example of the first embodiment, the same processing unit as that of the first embodiment is basically assigned the same number, and the layout analysis unit 12 ′, its page attribute information m3 ′, and a dictionary based on it. Since the image dictionary conversion unit 14 ′ to be performed is the same except that it is slightly different, only the changed block will be described.

レイアウト解析部１２’の構成は、図１１に示す通りであり、第１実施形態との違いは頁属性判定部２４’のテーブル２５’、頁判定データｍ１３、および頁属性判定部２４’の構成、生成される頁属性情報ｍ３’が異なる。 The configuration of the layout analysis unit 12 ′ is as shown in FIG. 11. The difference from the first embodiment is the configuration of the table 25 ′, page determination data m13, and page attribute determination unit 24 ′ of the page attribute determination unit 24 ′. The generated page attribute information m3 ′ is different.

また、頁属性判定部２４’の構成を図１２に示す。第１実施形態の変形例の頁属性判定部２４’と第１実施形態の頁属性判定部２４との構成上の相違は、第１実施形態の変形例の頁属性判定部２４’が、領域距離算出部３２を用いることなく、領域傾き算出部３１’のみにより傾き角度ｍ２１’と傾き分散ｍ２２’を出力することである。 The configuration of the page attribute determination unit 24 'is shown in FIG. The structural difference between the page attribute determination unit 24 ′ according to the modification of the first embodiment and the page attribute determination unit 24 according to the first embodiment is that the page attribute determination unit 24 ′ according to the modification according to the first embodiment has a region. Without using the distance calculation unit 32, only the region inclination calculation unit 31 ′ outputs the inclination angle m21 ′ and the inclination variance m22 ′.

傾き角度ｍ２１’は、例えば図１３の様に傾きを定義し、簡易的に以下の式で算出する。 The inclination angle m21 'defines an inclination as shown in FIG. 13, for example, and is simply calculated by the following equation.

ｉｆ（（左上Ｙ座標−右上Ｙ座標）＞０）｛
θ＝ａｔａｎ（（右下Ｙ座標−左下Ｙ座標）／（右下Ｘ座標−左下Ｘ座標））
｝
ｅｌｓｅ｛
θ＝ａｔａｎ（（右上Ｙ座標−左上Ｙ座標）／（右上Ｘ座標−左上Ｘ座標））
｝
縮小処理部２１は、傾き角度ｍ２１’としてこれらθの平均値を出力し、傾き分散ｍ２２’として同分散を出力する。 if ((upper left Y coordinate−upper right Y coordinate)> 0) {
θ = atan ((lower right Y coordinate−lower left Y coordinate) / (lower right X coordinate−lower left X coordinate))
}
else {
θ = atan ((upper right Y coordinate−upper left Y coordinate) / (upper right X coordinate−upper left X coordinate))
}
The reduction processing unit 21 outputs the average value of these θ as the inclination angle m21 ′, and outputs the same variance as the inclination variance m22 ′.

図１４は、当該画像圧縮装置の頁判定データの一例の説明図である。判断部３３は、第１実施形態と同様に、図１４に示す頁判定データｍ１３を用いて、傾き角度ｍ２１’の範囲と傾き分散ｍ２２’を閾値と比較した分散の大きさから、角度カテゴリーとして、標準、プラス角度、マイナス角度、特別の４種のカテゴリーに分類して頁属性情報ｍ３’を出力する。ここで特別のカテゴリーは第１実施形態で例示したような地図原稿の場合、領域に角度があるがスキューによる角度ではない可能性が高いカテゴリーも存在するためである。 FIG. 14 is an explanatory diagram of an example of page determination data of the image compression apparatus. As in the first embodiment, the determination unit 33 uses the page determination data m13 illustrated in FIG. 14 as an angle category based on the magnitude of dispersion obtained by comparing the range of the inclination angle m21 ′ and the inclination dispersion m22 ′ with a threshold value. The page attribute information m3 ′ is output after being classified into four categories of standard, plus angle, minus angle, and special. This is because, in the case of a map document as exemplified in the first embodiment, there is a category that has a high possibility that the region has an angle but is not an angle due to skew.

図１５は、画像辞書化部の構成の一例を示すブロック図である。第１実施形態の変形例の画像辞書化部１４’は、図１４の角度カテゴリーに対応した辞書を有していることが特徴であり、標準辞書５６と、プラス角度辞書５７と、マイナス角度辞書５８と、特別辞書５９と、これらの辞書が接続され頁属性情報ｍ３が供給される選択部５５と、選択部５５により選択された辞書が接続されるマッチング部５４を有している。 FIG. 15 is a block diagram illustrating an example of the configuration of the image dictionary unit. The image dictionary conversion unit 14 ′ according to the modification of the first embodiment is characterized by having a dictionary corresponding to the angle category of FIG. 14, and includes a standard dictionary 56, a plus angle dictionary 57, and a minus angle dictionary. 58, a special dictionary 59, a selection unit 55 to which these dictionaries are connected and page attribute information m3 is supplied, and a matching unit 54 to which the dictionary selected by the selection unit 55 is connected.

一般に複数の原稿画像の入力処理を行なうと、入力コンディションにより原稿の方向が若干傾いて入力される可能性があり、この角度をスキューと呼ぶ。またスキューだけでなく原稿の向き（上下逆さま、９０度方向違い）なども混在して入力されることがある。 In general, when a plurality of document images are input, there is a possibility that the direction of the document is slightly inclined depending on the input condition, and this angle is called skew. Further, not only skew but also the orientation of the document (upside down, 90 degrees difference) may be input together.

通常、このような条件は画像の頁全体で共通の性質であるが、単純に辞書化を行うと正しく入力されれば同一の文字であっても、スキューによって別々の文字と判別され辞書化されてしまう。そのため辞書化に当たっては、検索対象の文字ビットマップが増大し検索に時間が掛かってしまう。 Normally, such conditions are common to the entire page of the image, but if lexicalization is simply performed, even if the same character is input correctly, it is discriminated as a separate character due to skew and lexicographically. End up. Therefore, when creating a dictionary, the character bitmap to be searched increases, and the search takes time.

第１実施形態の変形例の構成を取ることにより、原稿全体の方向依存による歪みに対しても、高速に辞書化処理を行うことができる。また、各角度毎にマッチング対象のデータがそろっているので、辞書マッチング精度を緩めても第１実施形態同様、画質低下を抑制した高圧縮な辞書ビットマップ圧縮ファイルを提供することができる。 By adopting the configuration of the modified example of the first embodiment, it is possible to perform dictionary processing at high speed against distortion due to the direction dependence of the entire document. In addition, since matching target data is prepared for each angle, a highly compressed dictionary bitmap compressed file in which image quality deterioration is suppressed can be provided as in the first embodiment even if the dictionary matching accuracy is relaxed.

また、スキューに対しては角度を補正後一律に処理することも考えられるが、補正により角度は直ってもビットマップが微妙に崩れる可能性があり、スキュー補正を行った画像に対しても同様に効果が得られる。 For skew, it is possible to process the angle uniformly after correcting the angle, but even if the angle is corrected by the correction, there is a possibility that the bitmap may be slightly corrupted. The effect is obtained.

また、この実施形態の更なる変形例としては、レイアウト解析部１２’で、頁単位でなく領域毎に種別を特定し、その領域毎の種別ごとに辞書を切り替える構成を取ることで、より細かな辞書化制御が可能となる。例えば、文書領域とグラフや表領域では、使用するフォントデザインが異なる可能性があるので、領域毎の種別毎に辞書を切り替えることでより高速な辞書マッチング処理が可能となる。 Further, as a further modification of this embodiment, the layout analysis unit 12 ′ uses a configuration in which a type is specified for each area, not for each page, and a dictionary is switched for each type for each area. Lexical control is possible. For example, there is a possibility that the font design to be used is different between the document area and the graph or table area. Therefore, the dictionary matching process can be performed at a higher speed by switching the dictionary for each type of area.

（第２実施形態）
第２実施形態は、レイアウト解析部の代わりに文字処理部１８を使用し、オブジェクト配置情報ｍ２’と文字情報ｍ３”を出力し、画像部品化部１３で、より高精度な画像の部品化と画像辞書化部１４において、文字の特徴に応じた辞書を適用して高速に辞書ビットマップ圧縮ファイルを作成することである。 (Second Embodiment)
In the second embodiment, the character processing unit 18 is used in place of the layout analysis unit, and the object arrangement information m2 ′ and the character information m3 ″ are output. In the image dictionary unit 14, a dictionary bitmap compressed file is created at high speed by applying a dictionary according to the character characteristics.

図１６は、本発明の第２実施形態の画像圧縮装置の構成の一例を示すブロック図である。本発明の第２実施形態の画像圧縮装置は、図１６に示すように、基本的に第１実施形態と同様の処理部は同一番号を付与しており、文字処理部１８が異なる以外は同一であるので、変更ブロックのみ説明する。 FIG. 16 is a block diagram showing an example of the configuration of the image compression apparatus according to the second embodiment of the present invention. In the image compression apparatus according to the second embodiment of the present invention, as shown in FIG. 16, the same processing units as those in the first embodiment are basically assigned the same numbers, and are the same except that the character processing unit 18 is different. Therefore, only the changed block will be described.

文字処理部１８は、図１７に示す構成を示している。すなわち、文字処理部１８は、既知のＯＣＲ（Optical Character Recognition）を用いた文字認識部４６と、文字認識部４６から文字のコード情報ｍ５１が供給される言語判定部４７を有している。文字処理部１８は、画像信号ｍ１を受けて特許文献３の様に文字単位の情報としてオブジェクト配置情報ｍ２’を出力する。また、言語判定部４７は、文字処理部１８からコード情報ｍ５１を受けると、処理中の文字が英語か日本語かその他の言語種別情報を判断して文字情報ｍ３”を出力する。 The character processing unit 18 has the configuration shown in FIG. That is, the character processing unit 18 includes a character recognition unit 46 using known OCR (Optical Character Recognition), and a language determination unit 47 to which character code information m51 is supplied from the character recognition unit 46. The character processing unit 18 receives the image signal m1 and outputs object arrangement information m2 'as character unit information as in Patent Document 3. When the language determination unit 47 receives the code information m51 from the character processing unit 18, the language determination unit 47 determines whether the character being processed is English or Japanese and other language type information and outputs the character information m3 ″.

画像部品化部１３は、文字処理部１８から文字単位の情報としてオブジェクト配置情報ｍ２’の供給を受けるため、第１実施形態よりも正確な部品画像ｍ４を出力することができる。また、画像辞書化部１４では、文字情報が英語か日本語かその他の３種の辞書を用意して切り替える。このように、第２実施形態によれば、言語種別に応じたマッチング動作を行なうことにより、高速な辞書ビットマップによる圧縮処理を行なうことができる。また、第２実施形態では、文字情報として言語種別を用いて切り替えたが、文字情報はこの実施形態に限定されるものではない。 Since the image component converting unit 13 receives the object arrangement information m2 'as character unit information from the character processing unit 18, it can output a component image m4 that is more accurate than in the first embodiment. In addition, the image dictionary unit 14 prepares and switches three types of dictionaries for whether the character information is English or Japanese. As described above, according to the second embodiment, it is possible to perform a compression process using a high-speed dictionary bit map by performing a matching operation according to a language type. Moreover, in 2nd Embodiment, although it switched using the language classification as character information, character information is not limited to this embodiment.

（第３実施形態）
第３実施形態は、特にビットマップを辞書化して圧縮する画像圧縮技術を提供する。 (Third embodiment)
The third embodiment particularly provides an image compression technique for compressing a bitmap by converting it into a dictionary.

図１８は、本発明の第３実施形態の画像圧縮装置の構成の一例を示すブロック図である。本発明の第３実施形態の画像圧縮装置は、図１８に示すように、基本的に第１実施形態と同様の処理部は同一番号を付与しており、ビットマップ辞書を用いる画像辞書化部１４”が異なる以外は同一である。これにより、第３実施形態は、圧縮時のインデックス探索順序をビットマップの発生頻度に応じて更新することにより、高速に圧縮することが可能となる。 FIG. 18 is a block diagram showing an example of the configuration of the image compression apparatus according to the third embodiment of the present invention. In the image compression apparatus according to the third embodiment of the present invention, as shown in FIG. 18, basically the same processing units as those in the first embodiment are assigned the same numbers, and the image dictionary conversion unit uses a bitmap dictionary. 14 ″ is the same except that it is different. Accordingly, the third embodiment can perform high-speed compression by updating the index search order at the time of compression according to the occurrence frequency of the bitmap.

画像辞書化部１４”−１は、図１９に示すように、既出のビットマップが複数のインデックス付きで格納されているビットマップ辞書６１と、頁属性情報ｍ３を受けビットマップ辞書６１の探索順序を変更してビットマップ辞書データｍ４１を読み出す探索インデックス並べ替え部６２と、読み出し番号ｍ４２を探索インデックス並べ替え部６２に供給し候補ビットマップｍ４３を探索インデックス並べ替え部６２から供給されるデータ読み出し部６３と、候補ビットマップｍ４３をデータ読み出し部６３から供給され部品画像ｍ４とマッチング処理を行なうマッチング部５４を有している。 As shown in FIG. 19, the image dictionary unit 14 ″ -1 receives the bitmap dictionary 61 in which the existing bitmaps are stored with a plurality of indexes and the page attribute information m3, and searches the bitmap dictionary 61. Index reordering unit 62 that reads out the bitmap dictionary data m41 by changing the above, and a data reading unit that supplies the read number m42 to the search index reordering unit 62 and supplies the candidate bitmap m43 from the search index reordering unit 62 63, and a matching unit 54 that is supplied with the candidate bitmap m43 from the data reading unit 63 and performs a matching process with the component image m4.

このような構成をもつ画像辞書化部１４”−１においては、与えられる頁属性情報ｍ３に応じてビットマップ辞書６１の探索順序を変更し、変更されたビットマップ辞書６１の探索順序により、与えられる読み出し番号ｍ４２に対応するビットマップ辞書データｍ４１を読み出し、候補ビットマップｍ４３としてビットマップそのものと番号インデックスをデータ読み出し部６３に供給する。マッチング部５４は候補ビットマップｍ４３と部品画像ｍ４を既知の位置シフトとマッチング処理を組み合せて、部品画像ｍ４が候補ビットマップｍ４３に一致するかどうかを判断し、一致すれば、その番号インデックスを辞書インデックスｍ６として出力する。マッチング部５４は、対応するビットマップパターンが辞書に無ければ、その部品画像ｍ４に新規番号インデックスと頁属性情報ｍ３を合わせてビットマップ辞書６１に登録する。 In the image dictionary unit 14 ″ -1 having such a configuration, the search order of the bitmap dictionary 61 is changed according to the given page attribute information m3, and the search order of the bitmap dictionary 61 is changed. The bitmap dictionary data m41 corresponding to the read number m42 to be read is read, and the bitmap itself and the number index are supplied as the candidate bitmap m43 to the data readout unit 63. The matching unit 54 knows the candidate bitmap m43 and the component image m4 as known. The position shift and the matching process are combined to determine whether or not the component image m4 matches the candidate bitmap m43, and if it matches, the number index is output as the dictionary index m6. If the pattern is not in the dictionary, the part image It is registered in the bitmap dictionary 61 in accordance with a new number index and page attribute information m3 to 4.

画像辞書化部１４”−１は、スキャナ１１から入力された全原稿のマッチング処理が終了すれば、ビットマップ辞書を番号インデックス順にソートして画像辞書ｍ５として出力する。そして画像符号化部１５で画像辞書ｍ５を既知の画像圧縮技術（例えばランレングス）で圧縮し、辞書インデックスｍ６と合わせて符号データｍ７として記憶部１６に出力する。 When the matching process for all the originals input from the scanner 11 is completed, the image dictionary unit 14 ″ -1 sorts the bitmap dictionary in the order of the number index and outputs it as an image dictionary m5. The image dictionary m5 is compressed by a known image compression technique (for example, run length), and is output to the storage unit 16 as code data m7 together with the dictionary index m6.

次に、第３実施形態において特徴的な辞書の探索順序入れ替え効果を図２０を用いて説明する。例えば、４頁の画像に関する処理で、現在３枚目まで終了しているとすると、新規文字が出現するたびに番号インデックスが付与され、その頁属性情報が付与される。 Next, a characteristic dictionary search order switching effect in the third embodiment will be described with reference to FIG. For example, assuming that the processing for the four-page image has been completed up to the third page, a number index is assigned each time a new character appears, and the page attribute information is assigned.

ここで４頁目は『プレゼン』であるので、探索インデックスを並べ替え、頁属性がプレゼンの辞書ビットマップを探索の先頭に持ってくる。実際にデータの並び替えをする必要は無く、探索インデックス（読み出し番号ｍ４２）に対して読み出すべき番号インデックスの対応表を作るだけである。 Here, since the fourth page is “Presentation”, the search index is rearranged, and a dictionary bitmap whose presentation is the page attribute is brought to the head of the search. It is not necessary to actually rearrange the data, and only a correspondence table of number indexes to be read is created for the search index (read number m42).

複数頁をスキャンし一つのファイルを生成する時は、同一文書タイプでは同一のフォント（サイズ、デザイン）が使われる可能性が高い。そのため、頁属性に応じて辞書マッチングのためのデータの探索順序を、処理中の頁属性を優先的に探索することで高速に辞書マッチングが行われるので、高速に辞書ビットマップ圧縮ファイルを提供することができる。 When scanning multiple pages and creating one file, the same font (size, design) is likely to be used for the same document type. Therefore, dictionary matching is performed at high speed by preferentially searching the page attribute being processed for the search order of data for dictionary matching in accordance with the page attribute, so a dictionary bitmap compressed file is provided at high speed be able to.

また、辞書マッチングが完全一致で無い場合、符号データｍ７は非可逆圧縮されることになるが、マッチングアルゴリズムで候補が複数あった場合、頁属性情報を合わせて用いることで、よりマッチング精度が高まるので高画質・高圧縮な辞書ビットマップ圧縮ファイルを提供することができる。 In addition, when the dictionary matching is not complete matching, the code data m7 is irreversibly compressed. However, when there are a plurality of candidates in the matching algorithm, matching accuracy is further improved by using the page attribute information together. Therefore, it is possible to provide a dictionary bitmap compressed file with high image quality and high compression.

また第３実施形態では、頁番号も管理し頁の探索終了後のビットマップ辞書６１への格納時は、頁番号順に再格納しているが（３頁目）、しかし、頁番号は無視して文書は文書、プレゼンはプレゼンと頁属性単位に一つにまとめる方法も可能である。 In the third embodiment, page numbers are also managed, and when stored in the bitmap dictionary 61 after the page search is completed, the page numbers are re-stored in the order of page numbers (third page), but the page numbers are ignored. It is also possible to combine documents into documents and presentations into presentations and page attributes.

なおレイアウト解析方法や、そこから算出される頁情報、辞書化やインデックスの付け方、探索方法や圧縮方法等、探索順序の切り替え方法は、上述した実施形態に限定されるものではない。 Note that the layout analysis method, page information calculated therefrom, dictionary formation, indexing method, search method switching method, and the like are not limited to the above-described embodiments.

（第４実施形態）
第４実施形態は、第３実施形態のビットマップ辞書の探索インデックス並べ替え部にビットマップカウント部を設けたことを特徴とする。 (Fourth embodiment)
The fourth embodiment is characterized in that a bitmap count unit is provided in the search index rearrangement unit of the bitmap dictionary of the third embodiment.

図２１に第４実施形態の構成の一例である画像辞書化部１４”−２を示す。基本的に３実施形態の画像辞書化部１４”−１と同様の構成であり同一番号を付与しているが、ビットマップカウント部６４が新規に追加されている。 FIG. 21 shows an image dictionary unit 14 ″ -2 which is an example of the configuration of the fourth embodiment. The configuration is basically the same as the image dictionary unit 14 ″ -1 of the third embodiment, and the same numbers are assigned. However, a bitmap count unit 64 is newly added.

このような画像辞書化部１４”−２において、マッチング部５４は、候補ビットマップｍ４３と部品画像ｍ４を既知の位置シフトとマッチング処理を組み合せて、部品画像ｍ４が候補ビットマップｍ４３に一致するかどうかを判断し、一致すれば、その番号インデックスを辞書インデックスｍ６として出力する。対応するビットマップパターンが辞書に無ければ、その部品画像ｍ４に新規番号インデックスと頁属性情報ｍ３を合わせてビットマップ辞書６１に登録する。画像辞書化部１４”−２は、スキャナ１１から入力された全原稿のマッチング処理が終了すれば、ビットマップ辞書を番号インデックス順にソートして画像辞書ｍ５として出力する。 In such an image lexicon 14 ″ -2, the matching unit 54 combines the candidate bitmap m43 and the component image m4 with a known position shift and matching process to determine whether the component image m4 matches the candidate bitmap m43. If there is no corresponding bitmap pattern in the dictionary, the bitmap image is combined with the new number index and page attribute information m3 in the component image m4. The image dictionary converting unit 14 ″ -2 sorts the bitmap dictionary in the order of the number index and outputs it as an image dictionary m5 when the matching process for all the documents input from the scanner 11 is completed.

ここで、第４実施形態に特徴的なビットマップカウント部６４について、図２２を用いて説明する。ビットマップカウント部６４は、図２２の（ａ）に示すような構成を有しており、図１７（ｂ）に示す各辞書ビットマップを生成した時の頁属性情報ｍ３と番号インデックスｍ４６とその出現頻度であるカウント値であるビットマップ統計情報ｍ５３を格納している番号インデックステーブル６６と、ビットマップ統計情報ｍ５３の中の番号インデックスｍ４６の示すカウント値をインクリメントする番号インデックスカウンタ６５と、カウント値が所定の閾値を超えた際に現在処理中の頁属性内で出現頻度に合わせたアクセスの並び替え情報である出現頻度並び替え情報ｍ４５を生成する出現頻度並び替え情報生成部６７を有している。 Here, a bitmap count unit 64 characteristic of the fourth embodiment will be described with reference to FIG. The bitmap count unit 64 has a configuration as shown in FIG. 22 (a). The page attribute information m3, the number index m46 and its index when each dictionary bitmap shown in FIG. 17 (b) is generated. A number index table 66 that stores bitmap statistical information m53 that is a count value that is an appearance frequency, a number index counter 65 that increments a count value indicated by a number index m46 in the bitmap statistical information m53, and a count value An appearance frequency rearrangement information generation unit 67 for generating appearance frequency rearrangement information m45 that is access rearrangement information in accordance with the appearance frequency within the currently processed page attribute when the value exceeds a predetermined threshold. Yes.

次に、カウント値をどのようにインクリメントして出現頻度並び替え情報が更新されるかを、図２２（ｂ）に具体的数値を付与して説明する。並び替えを発生させる閾値が“７９９”である場合、頁属性情報ｍ３として『プレゼン』カテゴリーの番号インデックス“６”が入力されると、番号インデックスカウンタ６５によりインクリメントされたカウンタ値“８００”がビットマップ統計情報ｍ５３として更新され、番号インデックステーブル６６で保持される。ここで、出現頻度並び替え情報生成部６７は、更新されたカウンタ値“８００”に基づき、『プレゼン』カテゴリーの番号インデックス“６”の出現頻度並び替え情報ｍ４５を“２”から“１”に変更し、『プレゼン』カテゴリーの番号インデックス“５”の出現頻度並び替え情報ｍ４５を“１”から“２”に変更することで、各頁属性内で並び替えを行う。 Next, how the count value is incremented to update the appearance frequency rearrangement information will be described with specific numerical values given in FIG. When the threshold value for generating the rearrangement is “799” and the number index “6” of the “presentation” category is input as the page attribute information m3, the counter value “800” incremented by the number index counter 65 is the bit. It is updated as map statistical information m53 and held in the number index table 66. Here, the appearance frequency rearrangement information generation unit 67 changes the appearance frequency rearrangement information m45 of the “presentation” category number index “6” from “2” to “1” based on the updated counter value “800”. By changing and changing the appearance frequency rearrangement information m45 of the “presentation” category number index “5” from “1” to “2”, rearrangement is performed within each page attribute.

なお、画像辞書化部１４”−２全体の動作例で判るように、ここで入力される番号インデックスｍ４６はマッチングした結果として特定される情報であるため、更新された出現頻度並び替え情報ｍ４５は、カウント値ｍ５２が閾値を越えた次の部品画像から適用される。 As can be seen from the operation example of the entire image dictionary unit 14 ″ -2, the number index m46 input here is information specified as a result of matching, and thus the updated appearance frequency rearrangement information m45 is The count value m52 is applied from the next part image that exceeds the threshold value.

このように、第４実施形態では、探索インデックス並び替え部６２は、基本的に第１実施形態同様に頁属性情報ｍ３に応じて並び替えを行うと同時に、頁属性内では出現頻度並び替え情報ｍ４５を用いて頁属性内の辞書の探索順序も変更することにより、いっそう高速にマッチング処理を行なうことができる。 As described above, in the fourth embodiment, the search index rearranging unit 62 basically performs rearrangement according to the page attribute information m3 as in the first embodiment, and at the same time, the appearance frequency rearrangement information in the page attribute. By using m45 to change the dictionary search order in the page attribute, matching processing can be performed at higher speed.

なお、第４実施形態では、並び替えの高速化のために頁属性情報ｍ３も用いて頁属性内での並び替えを実施したが、より処理を簡潔にするために頁属性情報を用いずに個々の辞書ビットマップの頻度情報のみで並び替え情報を生成することも可能である。 In the fourth embodiment, the page attribute information m3 is also used to rearrange the page attributes for speeding up the rearrangement, but the page attribute information is not used in order to simplify the processing. It is also possible to generate rearrangement information only with frequency information of individual dictionary bitmaps.

また、辞書ビットマップの統計処理方法や更新方法等についても、上述した実施形態に限定されるものではない。更に、統計処理は辞書化ビットマップの情報を直接用いたが、特許文献３にあるようにＯＣＲ結果に基づき画像部品化を行うと共に、ＯＣＲの結果で同一の文字コード例えば“Ａ”に判別された辞書ビットマップをグループ化し、そのグループの発生頻度で並び替えを実施することが可能である。これにより、近似したビットマップ群全体で優先的に探索を行なうことができるので、高速に辞書ビットマップ圧縮ファイルを提供することができる。 Also, the dictionary bitmap statistical processing method, update method, and the like are not limited to the above-described embodiment. In addition, the statistical processing directly uses the information of the lexicographic bitmap, but as described in Patent Document 3, image parts are formed based on the OCR result, and the same character code such as “A” is determined based on the OCR result. It is possible to group dictionary bitmaps and rearrange them according to the frequency of occurrence of the groups. As a result, a search can be performed preferentially in the entire approximate bitmap group, and a dictionary bitmap compressed file can be provided at high speed.

（請求案１）
入力した画像を解析してオブジェクト配置情報と頁属性情報を出力する解析部と、
前記オブジェクト配置情報に基いて、前記画像を部品化して、部品画像を出力する部品化部と、
既にある画像辞書の中から前記解析部からの前記頁属性情報に対応する複数の画像辞書を選び、前記選ばれた複数の画像辞書の中から所定順序に従って一つの画像辞書を選び、選ばれた画像辞書と前記部品化部からの前記部品画像を比較して対応する画像辞書を決定し、前記決定した画像辞書と前記部品画像の辞書インデックスを出力する辞書化部と、
前記画像辞書と前記辞書インデックスを符号化する符号化部と、
前記辞書化部の所定順序を前記頁属性情報に基づき並び替えて更新する更新部と、
を具備することを特徴とする画像圧縮装置。 (Draft 1)
An analysis unit that analyzes the input image and outputs object placement information and page attribute information;
Based on the object arrangement information, the image is converted into a component, and a component converting unit that outputs a component image;
A plurality of image dictionaries corresponding to the page attribute information from the analysis unit are selected from the already existing image dictionaries, and one image dictionary is selected from the selected plurality of image dictionaries according to a predetermined order and selected. An image dictionary and the component image from the component conversion unit are compared to determine a corresponding image dictionary, and a dictionary conversion unit that outputs the determined image dictionary and a dictionary index of the component image;
An encoding unit for encoding the image dictionary and the dictionary index;
An update unit that rearranges and updates the predetermined order of the dictionary unit based on the page attribute information;
An image compression apparatus comprising:

（請求案２）
前記更新部が参照する前記頁属性情報は、文書、地図の少なくとも一方を含む原稿種別情報であることを特徴とする請求案１に記載の画像圧縮装置。 (Draft 2)
2. The image compression apparatus according to claim 1, wherein the page attribute information referred to by the updating unit is document type information including at least one of a document and a map.

（請求案３）
前記更新部が参照する前記頁属性情報は、スキュー角度、天地の少なくとも一方を含む原稿方向情報であることを特徴とする請求案１に記載の画像圧縮装置。 (Draft 3)
The image compression apparatus according to claim 1, wherein the page attribute information referred to by the updating unit is document direction information including at least one of a skew angle and a top and bottom.

（請求案４）
入力した画像を解析してオブジェクト配置情報と頁属性情報を出力し、
前記オブジェクト配置情報に基いて、前記画像を部品化して部品画像を出力し、
既にある画像辞書の中から前記頁属性情報に対応する複数の画像辞書を選び、
前記選ばれた複数の画像辞書の中から所定順序に従って一つの画像辞書を選び、
選ばれた画像辞書と前記部品画像を比較して対応する画像辞書を決定し、
前記画像辞書が決定した部品画像の辞書インデックスを決定し、
前記決定した画像辞書と前記部品画像の辞書インデックスを符号化し、
前記所定順序を前記頁属性情報に基づき並び替えて更新することを特徴とする画像圧縮方法。 (Draft bill 4)
Analyzes the input image and outputs object placement information and page attribute information.
Based on the object arrangement information, the image is converted into a component and a component image is output.
Select a plurality of image dictionaries corresponding to the page attribute information from the existing image dictionaries,
Select one image dictionary according to a predetermined order from the selected plurality of image dictionaries,
Compare the selected image dictionary with the component image to determine the corresponding image dictionary,
Determine a dictionary index of the component image determined by the image dictionary;
Encode the determined image dictionary and the dictionary index of the component image,
An image compression method, wherein the predetermined order is rearranged and updated based on the page attribute information.

特開２００６−２５４３２９JP 2006-254329 A 特開２００７−１７４００８JP2007-174008 特開２００５−３０１６６３JP 2005-301663 A

Claims

An analysis unit that analyzes the input image and outputs object placement information and page attribute information;
Based on the object arrangement information, the image is converted into a component, and a component converting unit that outputs a component image;
An image dictionary corresponding to the page attribute information from the analysis unit is selected from among existing image dictionaries, and the corresponding image dictionary is determined by comparing the selected image dictionary with the component image from the componentizing unit. A dictionary forming unit for outputting the determined image dictionary and a dictionary index of the component image;
An encoding unit for encoding the image dictionary and the dictionary index;
An image compression apparatus comprising:

The image compression apparatus according to claim 1, wherein the input image includes a plurality of pages, and the analysis unit outputs page attribute information for each page.

2. The image compression apparatus according to claim 1, wherein the page attribute information is document type information including at least one of a document and a map.

2. The image compression apparatus according to claim 1, wherein the page attribute information is document direction information including at least one of a skew angle and a top and bottom.

The image compression apparatus according to claim 1, wherein the lexicographic unit selects an image dictionary corresponding to the object attribute information from the existing image dictionary.

A character recognition unit that character-recognizes the component image output by the componentization unit and outputs character information;
2. The image compression apparatus according to claim 1, wherein the lexicographic unit selects an image dictionary corresponding to character information of the character recognition unit from existing image dictionaries.

Analyzes the input image and outputs object placement information and page attribute information.
Based on the object arrangement information, the image is converted into a component, and a component image is output.
An image dictionary corresponding to the page attribute information is selected from existing image dictionaries, the selected image dictionary is compared with the component image to determine a corresponding image dictionary, and a dictionary index of the component image is output. ,
An image compression method, wherein the corresponding image dictionary and the dictionary index are encoded.