JP2014072750A

JP2014072750A - Image processor, and computer program

Info

Publication number: JP2014072750A
Application number: JP2012218014A
Authority: JP
Inventors: Ryohei Ozawa; 良平小澤
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 2012-09-28
Filing date: 2012-09-28
Publication date: 2014-04-21
Anticipated expiration: 2032-09-28
Also published as: JP6145983B2

Abstract

PROBLEM TO BE SOLVED: To effectively compress image data representing an image including a character area and a non-character area. SOLUTION: An image processor includes: a specification part for specifying a character area, a gray area representing a gray image, and a color area representing a color image in an object image; a first processing part for generating compressed character image data by executing first processing using partial image data corresponding to the character area; a second processing part for generating compressed gray image data by using partial image data corresponding to the gray area to perform processing for acquiring gray component data composed of one kind of component value and processing for compressing the gray component data; and a third processing part for generating compressed color image data by using partial image data corresponding to the color area to perform processing for acquiring color component data including a plurality of pieces of component data and processing for compressing the color component data.

Description

The present invention relates to image processing for image data representing an image that includes a character region representing characters and a non-character region different from the character region.

A technique for compressing target image data representing a target image that includes characters at a high compression rate is known (for example, Patent Document 1). This technique separates the target image data into color character image data representing color characters, monochrome character image data representing monochrome characters, and non-character image data containing no characters. It improves the compression rate of the target image data as a whole by compressing each separated image with a compression method suited to that image.

For example, the monochrome character image data is compressed in MMR (Modified Modified Read) format, the color character image data is compressed in GIF (Graphics Interchange Format) format, and the non-character image data is compressed in JPEG (Joint Photographic Experts Group) format.

Patent Document 1: JP 2005-124066 A

However, the above technique cannot be said to be sufficiently devised with respect to compression of the non-character image data, and it may fail to compress the target image data effectively. For example, for target image data in which the proportion of character image data is small relative to the proportion of non-character image data, the data size may not be reduced sufficiently, so the data size may not be reduced effectively.

A main advantage of the present invention is to provide a new technique for effectively compressing image data representing an image that includes a character region representing characters and a non-character region different from the character region.

The present invention has been made to solve at least part of the problem described above, and can be realized as the following application example.

[Application Example 1] An image processing apparatus comprising:
an acquisition unit that acquires target image data representing a target image, the target image data including a plurality of component data corresponding to a plurality of color components;
a specifying unit that specifies, in the target image, a character region representing characters, a gray region representing a gray image other than characters, and a color region representing a color image other than characters;
a reduction processing unit for reducing the data size of the target image data, the reduction processing unit including:
a first processing unit that generates compressed character image data by executing a first process using partial image data corresponding to the character region;
a second processing unit that generates compressed gray image data composed of one type of component value by executing a second process, different from the first process, using partial image data corresponding to the gray region, the second process including a process of acquiring gray component data composed of one type of component value and a process of compressing the gray component data; and
a third processing unit that generates compressed color image data composed of a plurality of types of component values by executing a third process, different from the first process and the second process, using partial image data corresponding to the color region, the third process including a process of acquiring color component data including a plurality of component data and a process of compressing the color component data; and
a generation unit that generates compressed target image data representing the target image using the compressed character image data, the compressed gray image data, and the compressed color image data.

According to the above configuration, the compressed character image data, the compressed gray image data, and the compressed color image data are each generated using a different process. The compressed gray image data is generated by the second process, which includes a process of acquiring gray component image data composed of one type of component value and a process of compressing the gray component image data. The compressed color image data is generated by the third process, which includes a process of acquiring color image data including a plurality of component data and a process of compressing the color image data. As a result, processing suited to each of characters, gray images other than characters, and color images other than characters is executed, so the target image data can be compressed effectively to generate the compressed target image data.

The present invention can be realized in various forms, for example, as a method for realizing the functions of the above apparatus, a computer program for realizing the functions of the above apparatus, or a recording medium on which the computer program is recorded.

A block diagram showing the configuration of a computer 200 as one embodiment of the present invention.
A flowchart of the image processing.
A schematic diagram showing the overall flow of the image region specifying process.
The formula for calculating the edge strength in this embodiment.
A flowchart of the integration process.
A schematic diagram showing the integration of two regions.
The formula for calculating the gradation difference TD of the third condition F3.
A schematic diagram showing the integration of four regions L34 to L37.
A diagram showing an example of the determination table 292.
An explanatory diagram of the distribution width W and the number of colors C.
A flowchart of the gray determination process.
A flowchart of histogram creation.
A diagram showing an example of the outer edge portion of a non-character object region.
A diagram explaining the determination of gray regions and color regions.
A flowchart of the reduction process of the first embodiment.
A diagram explaining the image data generated by the reduction process of the first embodiment.
A flowchart of the reduction process of the second embodiment.
A diagram explaining the image data generated by the reduction process of the second embodiment.

A. First Embodiment:
A-1: Configuration of the Computer 200
Next, modes for carrying out the present invention will be described based on embodiments. FIG. 1 is a block diagram showing the configuration of a computer 200 as one embodiment of the present invention. The computer 200 is, for example, a personal computer, and includes a CPU 210, a volatile storage device 240 including DRAM or the like, a nonvolatile storage device 290 including flash memory, a hard disk drive, or the like, an operation unit 270 such as a touch panel or a keyboard, and a communication unit 280, which is an interface for communicating with external devices.

The computer 200 is communicably connected to external devices (here, a scanner 300 and a multifunction device 400) via the communication unit 280. The scanner 300 is an image reading device that acquires scan data by optically reading an object (for example, a paper document). The multifunction device 400 includes an image reading unit (not shown) that acquires scan data by optically reading an object.

The volatile storage device 240 is provided with a buffer area 241 that temporarily stores various intermediate data generated when the CPU 210 performs processing. The nonvolatile storage device 290 stores a driver program 291 and a determination table 292 used in the image processing described later. The driver program 291 is provided, for example, stored on a CD-ROM or DVD-ROM. Alternatively, the driver program 291 is provided by download from a server connected to the computer 200 via a network.

The CPU 210 functions as a scanner driver 100 by executing the driver program 291. The scanner driver 100 has an image processing function that executes the image processing described later on target image data (for example, scan data) to generate a compressed PDF file. As functional units for realizing this image processing function, the scanner driver 100 includes an acquisition unit 110, a specifying unit 120, a reduction processing unit 130, and a generation unit 150. The acquisition unit 110 acquires the target image data. The specifying unit 120 analyzes the target image data and performs region specification. The specifying unit 120 includes a determination unit 125 that performs a gray region determination process. The reduction processing unit 130 executes a reduction process for reducing the data size of the target image data. The reduction processing unit 130 includes a first processing unit 131 that performs processing on a character region representing characters in the target image, a second processing unit 132 that performs processing on a gray region representing a gray image other than characters in the target image, and a third processing unit 133 that performs processing on a color region representing a color image other than characters in the target image. The generation unit 150 generates the compressed PDF file using the image data processed by the reduction processing unit 130. The processing executed by each of these functional units 110 to 150 will be described later.

A-2: Image Processing
FIG. 2 is a flowchart of the image processing. In step S100, the acquisition unit 110 (FIG. 1) acquires scan data as the target image data via the communication unit 280 and stores the acquired scan data in the buffer area 241 (FIG. 1). Specifically, the acquisition unit 110 controls the scanner 300 or the image reading unit of the multifunction device 400 to acquire the scan data. The scan data represents, for example, the result of reading a paper document (also called an original). The scan data is bitmap data that represents the color of each of a plurality of pixels as an RGB value.

FIG. 3 is a schematic diagram showing the overall flow of the image region specifying process (steps S150 to S450 in FIG. 2). The target image SI in FIG. 3(A) is an example of an image represented by the target image data (for example, scan data). In the target image SI, a plurality of pixels (not shown) are arranged in a matrix along a first direction D1 and a second direction D2 orthogonal to the first direction D1. The pixel data of one pixel (that is, its RGB value) includes, for example, gradation values (hereinafter also called component values) of three color components: red (R), green (G), and blue (B). In this embodiment, each component value has 256 gradations. In other words, the target image data can be said to include three component data respectively corresponding to the three RGB components. Each component data is image data composed of one of the three component values.

In the example of FIG. 3(A), the target image SI includes a background Bg, three non-character objects Ob1 to Ob3, and four character objects Ob4 to Ob7. A non-character object is, for example, a photographic object or a drawing object. A photographic object is an object representing a photograph obtained, for example, by shooting with a digital camera. A drawing object is an object representing a drawing, such as an illustration, a table, a graph, a diagram, vector graphics, or a pattern. Of the three non-character objects in this embodiment, the two non-character objects Ob1 and Ob2 are gray (achromatic) objects, and the one non-character object Ob3 is a color (chromatic) object. Of the four character objects Ob4 to Ob7 in this embodiment, the three character objects Ob4 to Ob6 (the characters E, F, and G) have the same color as one another, and the one character object Ob7 (the character H) has a color different from that of the other three character objects Ob4 to Ob6.

Here, a gray object is an object that looks gray when an observer views it at a normal viewing distance. Consider, for example, an original printed using three kinds of printing material, C, M, and Y. In a scanned image obtained by reading this original, the individual pixels representing an object that appears achromatic at a normal viewing distance have chromatic colors, but the object that appears achromatic is nevertheless included among the gray objects of this embodiment. A color object is an object that looks chromatic when an observer views it at a normal viewing distance.

Within the target image SI, a partial image representing a character object is also called a character image, a partial image representing a photographic object is also called a photographic image, and a partial image representing a drawing object is also called a drawing image. Photographic images and drawing images are collectively called non-character images. A partial image representing a gray object is also called a gray image, and a partial image representing a color object is also called a color image.

In step S150 of FIG. 2, the specifying unit 120 (FIG. 1) generates edge image data using the target image SI (scan data) and stores it in the buffer area 241. FIG. 3(B) is a schematic diagram of the edge image EI represented by the edge image data.

The edge image EI represents the edge strength at each pixel position in the target image SI. The edge strength represents the magnitude of the change in gradation value relative to a change in position within the image (for example, a derivative), that is, the magnitude of the difference in gradation value between a plurality of mutually adjacent pixels. FIG. 4 shows the formula for calculating the edge strength in this embodiment. In this embodiment, the specifying unit 120 calculates the edge strength Se for each of the three RGB color components using the so-called Sobel operator.

The gradation value P(x, y) in FIG. 4 represents the gradation value at a specific pixel position (x, y) in the target image SI. The position x indicates the pixel position in the first direction D1, and the position y indicates the pixel position in the second direction D2. As shown, the edge strength Se(x, y) at the pixel position (x, y) in the target image SI is calculated using the nine pixels in the 3-row by 3-column neighborhood centered on that pixel position. The first and second terms of the formula in FIG. 4 are each the absolute value of a sum of the gradation values of the pixels at the nine positions multiplied by their corresponding coefficients. The first term is the derivative of the gradation value in the first direction D1 (that is, the horizontal derivative), and the second term is the derivative of the gradation value in the second direction D2 (that is, the vertical derivative).
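The calculation described above can be sketched as follows. This is only an illustrative sketch: FIG. 4 is not reproduced in this text, so the standard 3x3 Sobel coefficients are assumed, the function name `sobel_edge_strength` is hypothetical, and the handling of pixels at the image border (repeating the nearest pixel) is an assumption that the description does not specify.

```python
def sobel_edge_strength(plane):
    """Return Se(x, y) for one color-component plane given as a list of rows."""
    height, width = len(plane), len(plane[0])
    # Assumed standard Sobel coefficients (FIG. 4 itself is not available here).
    kx = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]    # derivative along D1 (horizontal)
    ky = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]    # derivative along D2 (vertical)

    def p(x, y):
        # Assumption: positions outside the image reuse the nearest border pixel.
        x = min(max(x, 0), width - 1)
        y = min(max(y, 0), height - 1)
        return plane[y][x]

    strength = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            gx = sum(kx[j][i] * p(x + i - 1, y + j - 1)
                     for j in range(3) for i in range(3))
            gy = sum(ky[j][i] * p(x + i - 1, y + j - 1)
                     for j in range(3) for i in range(3))
            # Sum of the absolute values of the two directional terms.
            strength[y][x] = abs(gx) + abs(gy)
    return strength
```

In the embodiment this calculation would be applied once to each of the R, G, and B component planes.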

The edge image EI in FIG. 3(B) represents, at each pixel position, the edge strength obtained by averaging the edge strength of the R component, the edge strength of the G component, and the edge strength of the B component (hereinafter called the reference edge strength). The dash-dot lines Eg1 to Eg7 in FIG. 3(B) represent the positions of pixels whose reference edge strength is relatively large (also called edge pixels). It can be seen that the edge image EI in FIG. 3(B) includes edge pixels Eg1 to Eg7 respectively corresponding to the objects Ob1 to Ob7 of the target image SI.

After the edge image data is generated, in the subsequent step S200 the specifying unit 120 sets blocks BL, each including a plurality of pixels, on the edge image EI. The broken lines in FIG. 3(B) indicate the blocks BL arranged in a matrix on the edge image EI. One block BL is, for example, a block composed of pixels PX in BLn rows × BLn columns (BLn is an integer of 2 or more). A value in the range of 10 to 50, for example, can be adopted for BLn. Since the edge image EI and the target image SI have the same size (the same number of pixels vertically and horizontally), the blocks BL can also be said to be set on the target image SI.

Once the blocks BL are set, in the subsequent step S250 the specifying unit 120 specifies solid regions and non-solid regions in units of blocks BL. A solid region is a region whose edge strength is less than a predetermined reference, and a non-solid region is a region whose edge strength is greater than or equal to the predetermined reference. Specifically, the specifying unit 120 calculates an average edge strength (ERave, EGave, EBave) for each block BL. The average edge strength (ERave, EGave, EBave) is calculated for each of the three RGB color components. The specifying unit 120 compares the average edge strength of the block BL being processed with the predetermined reference and classifies that block as either a solid block or a non-solid block. A solid block is a block BL whose average edge strength is smaller than the predetermined reference. A non-solid block is a block BL whose average edge strength is greater than or equal to the predetermined reference. In this embodiment, the specifying unit 120 compares the average edge strength (ERave, EGave, EBave) with reference values (ETr, ETg, ETb) determined for each color component. When ERave < ETr, EGave < ETg, and EBave < ETb all hold, the specifying unit 120 classifies the block BL being processed as a solid block. When at least one of ERave ≥ ETr, EGave ≥ ETg, and EBave ≥ ETb holds, the specifying unit 120 classifies the block BL being processed as a non-solid block.
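The block classification of step S250 can be illustrated with a minimal sketch. The function name `classify_blocks`, the representation of the edge strengths as three lists of rows, and the truncation of partial blocks at the right and bottom edges of the image are all assumptions made for illustration; the description does not state how partial blocks are handled.

```python
def classify_blocks(edge_r, edge_g, edge_b, bln, thresholds):
    """Return a grid of booleans: True = solid block, False = non-solid block."""
    etr, etg, etb = thresholds          # reference values (ETr, ETg, ETb)
    height, width = len(edge_r), len(edge_r[0])
    grid = []
    for top in range(0, height, bln):
        row = []
        for left in range(0, width, bln):
            # Assumption: blocks that overrun the image are truncated.
            ys = range(top, min(top + bln, height))
            xs = range(left, min(left + bln, width))
            count = len(ys) * len(xs)
            # Average edge strength of the block for each color component.
            er_ave = sum(edge_r[y][x] for y in ys for x in xs) / count
            eg_ave = sum(edge_g[y][x] for y in ys for x in xs) / count
            eb_ave = sum(edge_b[y][x] for y in ys for x in xs) / count
            # Solid only if every component is below its reference value.
            row.append(er_ave < etr and eg_ave < etg and eb_ave < etb)
        grid.append(row)
    return grid
```

Because the solid test is a conjunction of three strict inequalities, a block becomes non-solid as soon as any single component reaches its reference value, which matches the two cases stated above.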

In the edge image EI of FIG. 3(B), non-solid blocks are hatched and solid blocks are not. After all blocks BL have been classified as solid blocks or non-solid blocks, the specifying unit 120 specifies the region corresponding to one or more mutually adjacent (contiguous) non-solid blocks as one non-solid region. Likewise, the specifying unit 120 specifies the region corresponding to one or more mutually adjacent solid blocks as one solid region. Because one or more contiguous non-solid blocks are incorporated into one non-solid region in this way, a non-solid region is normally surrounded by a solid region. In the example of FIG. 3(B), three non-solid regions L11 to L13 are specified, respectively corresponding to the three non-character objects Ob1 to Ob3 of the target image SI (FIG. 3(A)). In addition, one non-solid region L14 corresponding to the two character objects Ob4 and Ob5 of the target image SI and one non-solid region L15 corresponding to the two character objects Ob6 and Ob7 are specified. Furthermore, one solid region L10 corresponding to the background Bg of the target image SI is specified. Specifying solid regions and non-solid regions in the edge image EI is equivalent to specifying the same solid regions and non-solid regions in the target image SI.

In the subsequent step S300, the specifying unit 120 determines, for each of the non-solid regions L11 to L15, a reference value for binarizing that non-solid region in the target image SI (hereinafter also called a binarization reference value), using the pixel values (in other words, color values) in the solid region that surrounds the non-solid region in the target image SI. In this embodiment, the binarization reference value is determined for each RGB component. Specifically, the average values (Rr, Gr, Br) of the RGB component values over all pixels of the solid region surrounding the non-solid region are adopted as the binarization reference values. In the example of FIG. 3(B), all of the non-solid regions L11 to L15 are surrounded by the single solid region L10 corresponding to the background Bg, so the binarization reference values of all the non-solid regions L11 to L15 are the same, namely the average of each component value within the solid region L10.
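As a rough illustration of step S300, the binarization reference values can be computed as a per-component mean over the surrounding solid region. The function name `binarization_reference` and the use of a boolean mask to mark the pixels of the surrounding solid region are hypothetical choices for this sketch, not part of the description.

```python
def binarization_reference(pixels, solid_mask):
    """Average (R, G, B) over the pixels flagged True in solid_mask.

    pixels:     rows of (R, G, B) tuples for the target image.
    solid_mask: rows of booleans marking the surrounding solid region
                (a hypothetical representation chosen for this sketch).
    """
    totals = [0, 0, 0]
    count = 0
    for row, mask_row in zip(pixels, solid_mask):
        for rgb, is_solid in zip(row, mask_row):
            if is_solid:
                for c in range(3):
                    totals[c] += rgb[c]
                count += 1
    if count == 0:
        raise ValueError("the surrounding solid region contains no pixels")
    # (Rr, Gr, Br): one mean value per color component.
    return tuple(total / count for total in totals)
```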

Once the binarization reference values (Rr, Gr, Br) are determined, in the next step S350 the specifying unit 120 generates binary image data for each of the non-solid regions L11 to L15 and stores it in the buffer area 241. In this embodiment, the specifying unit 120 executes the binarization process using six thresholds R1, R2, G1, G2, B1, and B2 calculated from the binarization reference values (Rr, Gr, Br).
R component: lower threshold R1 = Rr - dV, upper threshold R2 = Rr + dV
G component: lower threshold G1 = Gr - dV, upper threshold G2 = Gr + dV
B component: lower threshold B1 = Br - dV, upper threshold B2 = Br + dV
Here, dV is a predetermined value. The values R1, R2, G1, G2, B1, and B2 define a range of colors relatively close to the binarization reference values (Rr, Gr, Br), that is, to the average color of the solid region surrounding the non-solid region to be binarized; in other words, a range of colors relatively close to the background color.

Using these six thresholds R1, R2, G1, G2, B1, and B2, the specifying unit 120 generates binary image data for a non-solid region by classifying each pixel in the non-solid region of the target image SI, pixel by pixel, as either an object pixel or a non-object pixel. For example, in the generated binary image data, a pixel value of "1" indicates an object pixel and a pixel value of "0" indicates a non-object pixel.

Specifically, when the gradation values (Ri, Gi, Bi) of the three color components (RGB) of a pixel Pxi in the non-solid region satisfy all three of the following conditions, the specifying unit 120 classifies the pixel Pxi as a non-object pixel; when at least one of the three conditions is not satisfied, it classifies the pixel Pxi as an object pixel.
(First condition) R1 < Ri < R2
(Second condition) G1 < Gi < G2
(Third condition) B1 < Bi < B2

In this way, pixels whose color is relatively close to the background color calculated from the colors of the pixels in the solid region are classified as non-object pixels, and all other pixels are classified as object pixels. This makes it possible to generate binary image data that accurately identifies the object pixels constituting an object.
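The threshold-based classification described above can be sketched as follows. This is a minimal illustration, not the embodiment's implementation: the function name `binarize`, the nested-list image layout, and the value of `dv` used in the usage note are assumptions made for the example.

```python
def binarize(pixels, ref, dv):
    """Classify RGB pixels: 0 = non-object (near background), 1 = object.

    pixels: rows of (R, G, B) tuples (hypothetical layout for this sketch).
    ref:    binarization reference values (Rr, Gr, Br).
    dv:     predetermined half-width of the accepted color range.
    """
    rr, gr, br = ref
    # Six thresholds placed symmetrically around the reference values.
    r1, r2 = rr - dv, rr + dv
    g1, g2 = gr - dv, gr + dv
    b1, b2 = br - dv, br + dv
    result = []
    for row in pixels:
        out_row = []
        for ri, gi, bi in row:
            # All three strict inequalities must hold for a non-object pixel.
            near_background = (r1 < ri < r2) and (g1 < gi < g2) and (b1 < bi < b2)
            out_row.append(0 if near_background else 1)
        result.append(out_row)
    return result
```

With a white reference `(255, 255, 255)` and an assumed `dv` of 20, a pixel `(250, 250, 250)` falls inside all three ranges and becomes a non-object pixel, while `(10, 10, 10)` becomes an object pixel. Because the inequalities are strict, a component exactly equal to a threshold counts as outside the range.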

FIG. 3C shows a binary image BI represented by the generated binary image data. In practice, separate binary image data is generated for each of the non-solid regions L11 to L15 described above, but FIG. 3C shows them as a single binary image BI.

After the binary image data has been generated, in the subsequent step S400 the specifying unit 120 uses the binary image data to identify object regions and non-object regions, and performs labeling, which assigns an identifier to each identified region. As a result of the labeling, for example, label data associating each region with its identifier is generated and stored in the buffer area 241.

Specifically, the specifying unit 120 identifies one region made up of one or more contiguous object pixels (that is, pixels whose binarized gradation value is "1") as one object region. Likewise, the specifying unit 120 identifies one region made up of one or more contiguous non-object pixels (that is, pixels whose binarized gradation value is "0") as one non-object region.
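The labeling step can be illustrated with a small connected-component sketch. The text does not say whether "contiguous" means 4- or 8-connectivity, so 4-connectivity is assumed here; the function name `label_regions` and the list-of-lists representation are likewise assumptions made for illustration.

```python
from collections import deque

def label_regions(binary):
    """Assign one identifier to each contiguous run of equal-valued pixels.

    binary: rows of 0/1 values. Returns (labels, region_count), where
    labels has the same shape as binary. 4-connectivity is an assumption.
    """
    height, width = len(binary), len(binary[0])
    labels = [[-1] * width for _ in range(height)]
    next_id = 0
    for y in range(height):
        for x in range(width):
            if labels[y][x] != -1:
                continue  # already part of a labeled region
            value = binary[y][x]
            labels[y][x] = next_id
            queue = deque([(y, x)])
            # Breadth-first flood fill over neighbors with the same value.
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and labels[ny][nx] == -1 and binary[ny][nx] == value):
                        labels[ny][nx] = next_id
                        queue.append((ny, nx))
            next_id += 1
    return labels, next_id
```

Both object regions (value 1) and non-object regions (value 0) receive identifiers, which matches the description of label data covering every region.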

In the example of FIG. 3C, seven object regions L21 to L27, corresponding respectively to the seven objects Ob1 to Ob7 of the target image SI (FIG. 3A), and one non-object region L20, corresponding to the background Bg of the target image SI, are identified. The specifying unit 120 assigns to each identified region an identifier that identifies the region. Because each pixel of the binary image BI corresponds to a pixel of the target image SI, identifying the regions L20 to L27 in the binary image BI is equivalent to identifying the regions L30 to L37 in the target image SI, as shown in FIG. 3D. In the following, the reference signs L30 to L37 shown in FIG. 3D are, as a rule, used for the regions (that is, the object regions and the non-object region). In addition, "the image corresponding to an object region" refers to the corresponding partial image of the target image SI, and "the pixel value of a pixel in an object region" refers to the pixel value of the corresponding pixel of the target image SI, that is, the pixel value (for example, an RGB value) in the target image data.

Following the labeling, in step S450 the specifying unit 120 executes an integration process that merges those labeled object regions that satisfy an integration condition. The purpose of this process is to identify a plurality of character regions that have been separated into different object regions as a single object region.

FIG. 5 is a flowchart of the integration process. In step S4500, the specifying unit 120 selects a background region from among the regions it has identified (for example, the regions L30 to L37 in FIG. 3D). The background region is the solid region corresponding to the edge portion of the target image SI (FIG. 3A). In the example of FIG. 3D, the non-object region L30 is selected as the background region. This background region L30 is excluded from integration.

After the background region has been selected, in step S4505 the specifying unit 120 (FIG. 1) selects one unprocessed region as the processing target region N. Next, in step S4510, the specifying unit 120 determines whether the number of pixels in the processing target region N is equal to or less than a predetermined pixel count reference. The pixel count reference is determined in advance. For example, it may be set to a value slightly above the maximum number of pixels that the processing target region N can have when it represents a single character that should be merged with other regions. When the number of pixels in the processing target region N exceeds the pixel count reference (step S4510: NO), the specifying unit 120 returns to step S4505, so the selected processing target region N is excluded from integration. In this case, the current processing target region N is larger than a typical character and therefore likely represents an object other than a character. With the pixel count reference set appropriately, in the example of FIG. 3C the three object regions L31 to L33, which represent non-character objects, are excluded from integration, while the four object regions L34 to L37, which represent character objects, are subject to integration. In this embodiment, the number of pixels in the processing target region N is the number of pixels contained in the minimum rectangle circumscribing the processing target region N in the target image SI.

FIG. 6 is a schematic diagram showing the integration of two regions. The figure shows a processing target region Ln representing the character "E". The rectangle LnR in the figure is the minimum rectangle circumscribing the processing target region Ln, and the number of pixels contained in the rectangle LnR is the number of pixels of the processing target region Ln. Here, "the minimum rectangle circumscribing a region" is defined as follows. The rectangle consists of two sides parallel to the first direction D1 and two sides parallel to the second direction D2. The top side of the rectangle touches the top edge of the region, the bottom side touches the bottom edge, the left side touches the left edge, and the right side touches the right edge. The top side and top edge are those on the side opposite to the second direction D2, the bottom side and bottom edge are those on the second direction D2 side, the left side and left edge are those on the side opposite to the first direction D1, and the right side and right edge are those on the first direction D1 side. Alternatively, the specifying unit 120 may calculate the number of pixels of the processing target region N by counting only the pixels belonging to the processing target region N, that is, without counting those pixels inside the circumscribed rectangle that are not included in the processing target region N.
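The two ways of counting pixels described above can be sketched as follows. The representation of a region as a list of `(x, y)` coordinates, with x along the first direction D1 and y along the second direction D2, and the helper names are assumptions made for this example.

```python
def bounding_rect(region_pixels):
    """Return the minimum circumscribing rectangle as (left, top, right, bottom).

    region_pixels: iterable of (x, y) coordinates, with x along the first
    direction D1 and y along the second direction D2 (assumed layout).
    """
    xs = [x for x, _ in region_pixels]
    ys = [y for _, y in region_pixels]
    return min(xs), min(ys), max(xs), max(ys)

def rect_pixel_count(rect):
    """Number of pixels inside an inclusive rectangle."""
    left, top, right, bottom = rect
    return (right - left + 1) * (bottom - top + 1)
```

For an L-shaped region `[(2, 1), (2, 2), (2, 3), (3, 3)]`, the circumscribing rectangle is `(2, 1, 3, 3)` and contains 6 pixels, whereas counting only the region's own pixels (`len(region_pixels)`) gives 4. The first figure is the one the embodiment compares against the pixel count reference; the second corresponds to the alternative mentioned at the end of the paragraph.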

When the number of pixels in the processing target region N is equal to or less than the predetermined pixel count reference in step S4510 of FIG. 5 (step S4510: YES), in step S4515 the specifying unit 120 initializes a list of candidate regions M for integration. The specifying unit 120 generates a list of the regions that have not yet been selected as the processing target region N in step S4505. In the example of FIG. 3D, if the region L31 is selected as the processing target region N the first time step S4505 is executed, the remaining six regions L32 to L37 are listed. If the region L32 is selected as the processing target region N the next time step S4505 is executed, the remaining five regions L33 to L37 are listed. Regions that have already been merged into another region are removed from the list.

Next, in step S4520, the specifying unit 120 selects one not-yet-selected region from the generated list as the candidate region M. In the following three steps S4525, S4530, and S4535, the specifying unit 120 determines whether to merge the candidate region M into the processing target region N. The following conditions are evaluated in steps S4525, S4530, and S4535, respectively.

(Step S4525: first condition F1) number of pixels in candidate region M ≤ pixel count reference
(Step S4530: second condition F2) first distance Dis1 ≤ distance reference, and second distance Dis2 ≤ distance reference
(Step S4535: third condition F3) gradation difference TD ≤ gradation difference reference

When the candidate region M satisfies all of the conditions F1, F2, and F3 (step S4525: YES, S4530: YES, and S4535: YES), the specifying unit 120 merges the candidate region M into the processing target region N in step S4540 of FIG. 5.

The first condition F1 of step S4525 is the same kind of condition as that of step S4510. When the candidate region M does not satisfy the first condition F1 (step S4525: NO), the candidate region M likely represents an object of a type other than a character. In this case, the specifying unit 120 skips step S4540 and does not merge the candidate region M into the processing target region N.

The second condition F2 of step S4530 is satisfied when the candidate region M is relatively close to the processing target region N. FIG. 6 outlines the first distance Dis1 and the second distance Dis2 of the second condition F2. The figure shows a processing target region Ln and a candidate region Lm. The target rectangle LnR is the minimum rectangle circumscribing the processing target region Ln, and the candidate rectangle LmR is the minimum rectangle circumscribing the candidate region Lm.

As shown in FIG. 6A, the first distance Dis1 is the shortest distance along the first direction D1 between the target rectangle LnR and the candidate rectangle LmR, expressed, for example, as a number of pixels. As shown in FIG. 6B, when the range of positions of the target rectangle LnR in the first direction D1 (left edge PnL to right edge PnR) overlaps at least part of the range of positions of the candidate rectangle LmR in the first direction D1 (left edge PmL to right edge PmR), the first distance Dis1 is zero.

As shown in FIG. 6B, the second distance Dis2 is the shortest distance along the second direction D2 between the target rectangle LnR and the candidate rectangle LmR, expressed, for example, as a number of pixels. As shown in FIG. 6A, when the range of positions of the target rectangle LnR in the second direction D2 (that is, top edge PnT to bottom edge PnB) overlaps at least part of the range of positions of the candidate rectangle LmR in the second direction D2 (that is, top edge PmT to bottom edge PmB), the second distance Dis2 is zero.
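The distances Dis1 and Dis2 and the second condition F2 can be sketched as below. Rectangles are assumed to be inclusive `(left, top, right, bottom)` tuples, and the gap is assumed to be the number of pixel columns or rows lying strictly between the two rectangles; the text only says the distance is expressed in pixels, so this exact convention, like the function names, is an assumption.

```python
def axis_gap(a_min, a_max, b_min, b_max):
    """Gap between two inclusive 1-D ranges; zero when they overlap or touch."""
    if a_max < b_min:
        return b_min - a_max - 1  # pixels strictly between a and b
    if b_max < a_min:
        return a_min - b_max - 1
    return 0  # ranges overlap at least partly

def rect_distances(target, candidate):
    """Return (Dis1, Dis2) for rectangles given as (left, top, right, bottom)."""
    dis1 = axis_gap(target[0], target[2], candidate[0], candidate[2])  # along D1
    dis2 = axis_gap(target[1], target[3], candidate[1], candidate[3])  # along D2
    return dis1, dis2

def satisfies_f2(target, candidate, distance_ref):
    """Second condition F2: both distances are at most the distance reference."""
    dis1, dis2 = rect_distances(target, candidate)
    return dis1 <= distance_ref and dis2 <= distance_ref
```

For a target rectangle `(0, 0, 9, 9)` and a candidate `(13, 2, 20, 8)` on the same text line, the vertical ranges overlap, so Dis2 is zero, and three pixel columns (10 to 12) separate the rectangles horizontally, so Dis1 is 3.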

The distance reference of the second condition F2 is determined in advance. For example, a value slightly above the maximum distance that can occur between two characters that should be merged may be used as the distance reference. When the candidate region M satisfies the second condition F2, the candidate region M and the processing target region N likely represent characters belonging to the same character string. When the candidate region M does not satisfy the second condition F2 (step S4530: NO), the candidate region M likely represents an object unrelated to the processing target region N. In this case, the specifying unit 120 skips step S4540 and does not merge the candidate region M into the processing target region N.

The third condition F3 of step S4535 is satisfied when the color of the candidate region M is relatively close to that of the processing target region N. FIG. 7 shows the formula for calculating the gradation difference TD of the third condition F3. In this embodiment, the gradation difference TD is the square of the Euclidean distance in the RGB color space between the average color of the processing target region N (Rav_n, Gav_n, Bav_n) and the average color of the candidate region M (Rav_m, Gav_m, Bav_m). The gradation difference reference of the third condition F3 is determined in advance. For example, the upper limit of the color difference between two colors that an ordinary observer perceives as substantially the same color may be used as the gradation difference reference. Two colors can be judged to be substantially the same if, when they are used as the colors of two characters, an ordinary observer looking at the two characters perceives their colors as identical. The third condition F3 is imposed because, as described later, characters rendered in substantially different colors must be processed separately when a character image is compressed. When the candidate region M does not satisfy the third condition F3 (step S4535: NO), the specifying unit 120 skips step S4540 and does not merge the candidate region M into the processing target region N.
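The gradation difference TD can be written out as a short sketch. The helper names and the representation of a region's pixels as a list of `(R, G, B)` tuples are assumptions; only the squared-Euclidean-distance definition comes from the text.

```python
def average_color(rgb_pixels):
    """Average (R, G, B) of a non-empty list of RGB tuples."""
    count = len(rgb_pixels)
    return tuple(sum(pixel[i] for pixel in rgb_pixels) / count for i in range(3))

def gradation_difference(avg_n, avg_m):
    """Squared Euclidean distance between two average colors in RGB space."""
    return sum((a - b) ** 2 for a, b in zip(avg_n, avg_m))

def satisfies_f3(avg_n, avg_m, gradation_ref):
    """Third condition F3: TD is at most the gradation difference reference."""
    return gradation_difference(avg_n, avg_m) <= gradation_ref
```

For average colors `(10, 20, 30)` and `(13, 24, 30)`, TD is 3² + 4² + 0² = 25. Because TD is a squared distance, the gradation difference reference must be chosen on the same squared scale.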

After the candidate region M has been merged into the processing target region N in step S4540 of FIG. 5, or after a NO determination in any of steps S4525, S4530, and S4535, in step S4545 the specifying unit 120 determines whether all candidate regions M in the list have been processed. If an unprocessed candidate region M remains (step S4545: NO), the specifying unit 120 returns to step S4520 and executes steps S4520 to S4540 for the unprocessed candidate region M. When all candidate regions M in the list have been processed (step S4545: YES), the specifying unit 120 proceeds to step S4550.

In step S4550, the specifying unit 120 determines whether the processing target region N has been expanded since step S4515 was last executed, that is, whether at least one candidate region M has been merged into the processing target region N. If the processing target region N has been expanded (step S4550: YES), the specifying unit 120 executes steps S4515 to S4545 again using the expanded processing target region N. The specifying unit 120 can therefore merge three or more regions.
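The repeat-until-no-growth structure of steps S4505 to S4555 can be sketched as follows. This is a simplified illustration under several assumptions: regions are sets of `(x, y)` coordinates, the background region has already been removed, all of the checks F1 to F3 (and the S4510 size check) are folded into a caller-supplied predicate `can_merge`, and each pass compares candidates against the region as it stood at the start of the pass. None of these names or representations come from the text.

```python
def merge_regions(regions, can_merge):
    """Greedily merge regions until no processing target region grows.

    regions:   list of sets of (x, y) pixel coordinates (assumed layout).
    can_merge: hypothetical predicate (target, candidate) -> bool standing in
               for the size, distance, and color conditions.
    """
    regions = [set(region) for region in regions]
    merged_away = [False] * len(regions)
    for i in range(len(regions)):               # S4505: pick target region N
        if merged_away[i]:
            continue
        while True:
            snapshot = set(regions[i])          # N as of the start of this pass
            grew = False
            # S4515: candidates are regions not yet chosen as N and not merged.
            for j in range(i + 1, len(regions)):
                if merged_away[j]:
                    continue
                if can_merge(snapshot, regions[j]):
                    regions[i] |= regions[j]    # S4540: merge M into N
                    merged_away[j] = True
                    grew = True
            if not grew:                        # S4550: NO, move to next N
                break
    return [r for r, gone in zip(regions, merged_away) if not gone]
```

Repeating the pass after every expansion is what lets a chain of neighboring characters collapse into one region even when the first and last characters are too far apart to satisfy the distance condition directly.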

FIG. 8 is a schematic diagram showing the integration of the four regions L34 to L37. Here, the integration process proceeds in the order of FIG. 8A to FIG. 8C.

In FIG. 8A, the region L34 representing the character "E" is the processing target region N (FIG. 5: S4505). The region L35 representing the character "F", located next to the region L34, satisfies the conditions F1 to F3, so the specifying unit 120 (FIG. 1) merges the region L35 into the region L34 (step S4540). The region L36 representing the character "G" and the region L37 representing the character "H" are far from the region L34 and therefore do not satisfy the second condition F2. Accordingly, the two regions L36 and L37 are not merged into the region L34.

When the region L35 has been merged into the region L34 as described above, in step S4550 of FIG. 5 the specifying unit 120 determines that the processing target region N (here, the region L34) has been expanded (S4550: YES). In the subsequent step S4515, the specifying unit 120 generates a list for the expanded region L34B (FIG. 8B), which includes the region L35. The generated list contains the regions L36 and L37.

In FIG. 8B, the expanded region L34B is the processing target region N. The region L36, located next to the region L34B, satisfies the conditions F1 to F3, so the specifying unit 120 merges the region L36 into the region L34B (step S4540). The conditions F1 to F3 are evaluated using the minimum rectangle circumscribing the expanded region L34B (that is, the region representing the characters "E" and "F"). The region L37 is far from the region L34B and is therefore not merged into it.

When the region L36 has been merged into the expanded region L34B as described above, in step S4550 of FIG. 5 the specifying unit 120 (FIG. 1) determines that the processing target region N (in this case, the region L34B) has been expanded (S4550: YES). In the subsequent step S4515, the specifying unit 120 generates a list for the expanded region L34C (FIG. 8C), which includes the region L36. The generated list contains the region L37.

In FIG. 8C, the expanded region L34C is the processing target region. The region L37, located next to the region L34C, is relatively close to the region L34C and therefore satisfies the distance condition F2, but its color differs substantially from that of the region L34C, so it does not satisfy the condition F3. The specifying unit 120 therefore does not merge the region L37 into the region L34C. Consequently, after the integration process, two character object regions are identified: the character object region L34C, into which the three character object regions L34 to L36 of FIG. 3D and FIG. 8A have been merged, and the character object region L37 (FIG. 3D, FIG. 8C).

If the processing target region N has not been expanded in step S4550 of FIG. 5 (step S4550: NO), in step S4555 the specifying unit 120 determines whether all regions have been processed. If an unprocessed region remains (step S4555: NO), the specifying unit 120 returns to step S4505. When all regions have been processed (step S4555: YES), the specifying unit 120 updates the region identifiers in step S4560. Specifically, the specifying unit 120 updates the label data that was generated in S400 of FIG. 2 and stored in the buffer area 241. When the label data has been updated, the specifying unit 120 ends the integration process.

After the integration process, in step S500 of FIG. 2 the determination unit 125 (FIG. 1) executes an object attribute determination process, which determines, for each of the regions remaining after the integration process, whether the type of the image, that is, the type of the object in the region, is "character".

FIG. 9 shows an example of the determination table 292. The determination unit 125 executes the object attribute determination process by referring to the determination table 292. The determination unit 125 identifies the type according to the color distribution width W, the number of colors C, and the pixel density S. For each identified object region, the determination unit 125 generates a histogram such as the one shown in FIG. 10 and calculates the distribution width W, the number of colors C, and the pixel density S. The generated histogram and the values of W, C, and S are stored in the buffer area 241 and are erased, for example, after the object attribute determination process.

FIG. 10 illustrates the distribution width W and the number of colors C. The figure shows a luminance histogram, calculated from the pixel values in the object region being processed (hereinafter called the target region). In this embodiment, the luminance of each pixel is calculated from the gradation values of the pixel (the gradation values of the three color components red R, green G, and blue B). For example, a formula that calculates the Y component (luminance component) of the YCbCr color space from the RGB gradation values is used.

The number of colors C is the number of luminance values, among the 256 luminance values from 0 to 255, whose frequency (that is, number of pixels) is equal to or greater than a predetermined threshold Th. The histogram in FIG. 10 contains three peaks P1, P2, and P3 that exceed the threshold Th. In the example of FIG. 10, the number of colors C corresponds to the sum of the width C1 of the part of the first peak P1 exceeding the threshold Th, the width C2 of the part of the second peak P2 exceeding the threshold Th, and the width C3 of the part of the third peak P3 exceeding the threshold Th. Characters are generally rendered in few colors, so the number of colors C is relatively small when the target region represents a character object and relatively large when it represents a non-character object. For example, a photographic object shows the various colors of the photographed subject, so the number of colors C is relatively large when the target region represents a photographic object.

The distribution width W is the difference (width) between the lowest and the highest of the luminance values whose frequency (that is, number of pixels) is equal to or greater than the predetermined threshold Th. For the same reason as given for the number of colors C, the distribution width W is relatively small when the target region represents a character object and relatively large when it represents a non-character object.

The pixel density S is the ratio of the number of object pixels to the total number of pixels in the minimum rectangle circumscribing the target region (that is, the number of object pixels per unit area). Characters are generally written on a background in thin lines of a color different from the background. Therefore, the pixel density S is relatively small when the target region represents a character object and relatively large when it represents a non-character object. For example, a photographic object is likely to occupy most of its circumscribing minimum rectangle, so the pixel density S is relatively large when the target region represents a photographic object.
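The three feature values can be computed along the following lines. This sketch assumes the ITU-R BT.601 luma coefficients for the Y component (the text only says that a YCbCr luminance formula is used), builds the histogram from the object pixels of the region, and takes the rectangle's total pixel count as an input; the function name and argument layout are hypothetical.

```python
def region_features(object_pixels, rect_pixel_total, th):
    """Return (C, W, S) for one target region.

    object_pixels:    list of (R, G, B) tuples for the region's object pixels.
    rect_pixel_total: number of pixels in the circumscribing rectangle.
    th:               frequency threshold Th applied to the histogram.
    """
    histogram = [0] * 256
    for r, g, b in object_pixels:
        # BT.601 luma weights (an assumption; the text names no coefficients).
        y = int(round(0.299 * r + 0.587 * g + 0.114 * b))
        histogram[min(255, max(0, y))] += 1
    frequent = [value for value, count in enumerate(histogram) if count >= th]
    color_count = len(frequent)                                       # C
    distribution_width = frequent[-1] - frequent[0] if frequent else 0  # W
    pixel_density = len(object_pixels) / rect_pixel_total             # S
    return color_count, distribution_width, pixel_density
```

For example, three black pixels, two white pixels, and one mid-gray pixel inside a 24-pixel rectangle, with an assumed `th` of 2, give C = 2 (luminance 0 and 255 reach the threshold), W = 255, and S = 0.25.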

The determination table 292 of FIG. 9 was created with the above considerations in mind. Specifically, when a predetermined determination condition is satisfied, the determination unit 125 determines that the target region is a character object region representing a character object. When the predetermined determination condition is not satisfied, the determination unit 125 determines that the target region is a non-character object region representing a non-character object.

As can be seen from the determination table 292, the predetermined determination condition is that either of the following two conditions is satisfied.
1) The distribution width W is equal to or greater than the distribution width threshold Wth, the number of colors C is less than the color count threshold Cth, and the pixel density S is less than the pixel density threshold Sth.
2) The distribution width W is less than the distribution width threshold Wth, and the pixel density S is less than the pixel density threshold Sth.
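The determination condition can be expressed directly as a small predicate. The function name is hypothetical and the threshold values in the usage note are placeholders chosen for illustration; the text does not give concrete values for Wth, Cth, or Sth.

```python
def is_character_region(w, c, s, w_th, c_th, s_th):
    """Return True when either condition of the determination table holds."""
    # Condition 1: wide distribution, but few colors and low density.
    condition_1 = w >= w_th and c < c_th and s < s_th
    # Condition 2: narrow distribution and low density (color count ignored).
    condition_2 = w < w_th and s < s_th
    return condition_1 or condition_2
```

With placeholder thresholds `w_th=100`, `c_th=10`, and `s_th=0.5`, a sparse two-color region (`w=255, c=2, s=0.25`) is classified as a character region, while a dense region (`s=0.9`) is not, regardless of W and C. In both conditions a pixel density at or above Sth rules out a character region.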

In the example of FIG. 3D, the object regions L31, L32, and L33 are determined to be regions representing non-character objects (hereinafter also called non-character object regions), and the object regions L34C and L37 are each determined to be regions representing character objects (hereinafter also called character object regions).

Following the object attribute determination process, in step S550 the determination unit 125 executes a gray determination process.

FIG. 11 is a flowchart of the gray determination process.
In step S551, the determination unit 125 selects a region that was determined in step S500 to be a non-character object region. In the example of FIG. 3D, the three non-character object regions L31 to L33 are selected in turn.

In step S552, a smoothing process that smooths the image in the selected non-character object region is executed. Specifically, the smoothing filter FL (FIG. 11) is applied to the partial image data that corresponds to the selected non-character object region within the target image data. The smoothing filter FL is applied to each of the three sets of component data (that is, R component data, G component data, and B component data) included in the partial image data to be processed. More specifically, the determination unit 125 places the smoothing filter FL so that its center position FC overlaps the pixel of interest in the image represented by the component data being processed. The determination unit 125 calculates the average of the component values within the smoothing filter FL (for example, 5 pixels high by 5 pixels wide) centered on the pixel of interest, and changes the component value of the pixel of interest to the calculated average. The determination unit 125 sets every pixel in each set of component data as the pixel of interest in turn and performs the same processing. The partial image data after the smoothing process is stored in the buffer area 241.
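The averaging filter of step S552 can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, the averages are written to a separate output plane, and pixels near the border are averaged over the part of the window that lies inside the image. The specification does not state how borders are handled, so that choice is an assumption.

```python
def smooth_channel(channel, size=5):
    """Apply a size x size averaging filter to one 2-D component plane."""
    height, width = len(channel), len(channel[0])
    half = size // 2
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            total = 0
            count = 0
            # Window centered on the pixel of interest, clipped to the image.
            for yy in range(max(0, y - half), min(height, y + half + 1)):
                for xx in range(max(0, x - half), min(width, x + half + 1)):
                    total += channel[yy][xx]
                    count += 1
            out[y][x] = round(total / count)
    return out


def smooth_rgb(r_plane, g_plane, b_plane, size=5):
    """Smooth the R, G, and B component planes independently."""
    return tuple(smooth_channel(p, size) for p in (r_plane, g_plane, b_plane))
```

Because each plane is filtered independently, isolated chromatic dots such as halftone C, M, and Y dots are pulled toward the local average color before the gray determination is made.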

The significance of the smoothing process is as follows. As described above, in an image represented by scan data obtained by reading a document, the individual pixels of an image that appears achromatic at a normal viewing distance may nevertheless have chromatic colors. This occurs, for example, when the document was printed using three printing materials, C, M, and Y. Even when individual pixels have chromatic colors, in an image (that is, a region) that appears gray at a normal viewing distance, the average color of the pixels within a relatively small region (for example, a region of the smallest size distinguishable at a normal viewing distance) is considered to be achromatic. Therefore, when the individual pixels in a region that appears achromatic have chromatic colors, the smoothing process brings the colors of those pixels closer to achromatic. This improves the accuracy of the determination, described later, of whether a region is a gray region.

In step S553 following the smoothing process, the determination unit 125 creates a histogram representing the distribution of a* values and b* values in the CIELAB color space (hereinafter also simply referred to as the Lab color space), which is a device-independent color space, and stores the histogram in the buffer area 241. This histogram is obtained, for example, by counting, for each a* value and each b* value, the number of pixels having that a* value and the number of pixels having that b* value.

FIG. 12 is a flowchart of the histogram creation.
In step S5531, the determination unit 125 selects, in the target image SI, a pixel within the non-character object region being processed as the processing target pixel. In step S5532, the determination unit 125 calculates the edge strength EP of the pixel in the binary image BI (FIG. 3(C)) that corresponds to the processing target pixel. The edge strength EP is calculated using, for example, the formula shown in FIG. 4.

Once the edge strength EP has been calculated, in the subsequent step S5533 the determination unit 125 determines whether the edge strength EP is less than or equal to the edge reference value Eth. When the edge strength EP is less than or equal to the edge reference value Eth (step S5533: YES), the determination unit 125 updates the histogram using the pixel value of the processing target pixel (step S5534). That is, the determination unit 125 converts the pixel value (RGB value) of the processing target pixel into a Lab value and updates the above-described histogram according to that Lab value.

When the edge strength EP is greater than the edge reference value Eth (step S5533: NO), the determination unit 125 skips step S5534. That is, when the edge strength EP of the processing target pixel is greater than the edge reference value Eth, the processing target pixel is not used to create the histogram.

FIG. 13 is a diagram illustrating an example of the outer edge portions of non-character object regions. As shown in FIG. 3(C), in the binary image BI all pixel values inside an object region are "1" (shown in black in FIG. 3(C)). Consequently, pixels whose edge strength EP is greater than the edge reference value Eth appear in the outer edge portion of an object region (the portion in contact with the non-object region) and do not appear in the rest of the object region. Therefore, as shown in FIG. 13, the pixels in the outer edge portions OE1 to OE3 (the hatched regions in FIG. 13) of the non-character object regions L31 to L33 are excluded from the histogram creation. In other words, the pixels in the outer edge portions OE1 to OE3 of the non-character object regions L31 to L33 are not used in the determination of whether a region is a gray region, which is performed in steps S554 to S556 described later.

In the subsequent step S5535, the determination unit 125 determines whether all pixels in the non-character object region being processed have been selected. When all pixels have been selected (step S5535: YES), the determination unit 125 ends the histogram creation. When an unselected pixel remains (step S5535: NO), the determination unit 125 returns to step S5531, selects a new unselected pixel, and repeats the processing of steps S5532 to S5535 described above.
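The loop of steps S5531 to S5535 can be sketched as below. Several points are assumptions rather than statements from the specification: the RGB values are treated as sRGB and converted with the usual D65 matrix, the histogram is kept as a joint count over rounded (a*, b*) pairs, and the edge strength of each pixel is supplied as a precomputed input because the formula of FIG. 4 is not reproduced here. All names are hypothetical.

```python
from collections import Counter

# Linear sRGB -> XYZ matrix (D65); the white point is taken as the row sums
# so that R = G = B maps onto the achromatic axis a* = b* = 0.
_M = ((0.4124, 0.3576, 0.1805),
      (0.2126, 0.7152, 0.0722),
      (0.0193, 0.1192, 0.9505))
_WHITE = tuple(sum(row) for row in _M)


def srgb_to_lab(r, g, b):
    """Convert one 8-bit sRGB pixel to (L*, a*, b*)."""
    def linearize(v):
        v /= 255.0
        return v / 12.92 if v <= 0.04045 else ((v + 0.055) / 1.055) ** 2.4

    def f(t):
        delta = 6 / 29
        return t ** (1 / 3) if t > delta ** 3 else t / (3 * delta ** 2) + 4 / 29

    lin = (linearize(r), linearize(g), linearize(b))
    xyz = (sum(m * c for m, c in zip(row, lin)) for row in _M)
    fx, fy, fz = (f(v / w) for v, w in zip(xyz, _WHITE))
    return 116 * fy - 16, 500 * (fx - fy), 200 * (fy - fz)


def build_ab_histogram(pixels, edge_strengths, e_th):
    """Count (a*, b*) pairs, skipping pixels whose edge strength exceeds e_th."""
    hist = Counter()
    for (r, g, b), ep in zip(pixels, edge_strengths):
        if ep > e_th:
            continue  # S5533: NO, so the histogram update (S5534) is skipped
        _, a_star, b_star = srgb_to_lab(r, g, b)
        hist[(round(a_star), round(b_star))] += 1
    return hist
```

Here `pixels` is a flat sequence of `(R, G, B)` tuples for the region and `edge_strengths` is a parallel sequence of EP values; pixels on the outer edge of the region carry a large EP and therefore never reach the histogram.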

Once the histogram has been created, in step S554 of FIG. 11 the determination unit 125 determines whether the ratio PN of chromatic pixels to all pixels in the non-character object region being processed (that is, the chromatic color ratio) is less than the gray determination reference value Nth.

FIG. 14 is a diagram explaining the determination of gray regions and color regions.
In this embodiment, the determination unit 125 determines that a color whose distance R from the achromatic axis in the Lab color space (that is, the axis a* = b* = 0) is less than a predetermined reference distance Rth (FIG. 14) is achromatic, and that a color whose distance R from the achromatic axis is greater than or equal to the reference distance Rth is chromatic. Here, the distance R is the Euclidean distance in the Lab color space, given by R^2 = (a*)^2 + (b*)^2. The reference distance Rth is a value that separates colors an observer perceives as achromatic from colors perceived as chromatic, and is determined empirically. The determination unit 125 refers to the histogram, calculates the chromatic color ratio PN, and determines whether it is less than the gray determination reference value Nth.

When the chromatic color ratio PN is less than the gray determination reference value Nth (step S554: YES), the determination unit 125 determines that the non-character object region being processed is a gray region (step S555). When the chromatic color ratio PN is greater than or equal to the gray determination reference value Nth (step S554: NO), the determination unit 125 determines that the non-character object region being processed is a color region (step S556). FIG. 14(A) shows an example of the color distribution of an image in a non-character object region determined to be a gray region, and FIG. 14(B) shows an example of the color distribution of an image in a non-character object region determined to be a color region. In the example of FIG. 14(A), the colors of most pixels are distributed within the reference distance Rth of the achromatic axis (also referred to as the achromatic range). In the example of FIG. 14(B), by contrast, the colors of a relatively high proportion of pixels are distributed at the reference distance Rth or more from the achromatic axis (also referred to as the chromatic range). As an example, the gray determination reference value Nth is preferably 0.3 or less, and more preferably 0.15 or less. After determining whether the non-character object region being processed is a gray region or a color region, the determination unit 125 deletes the smoothed partial image data and the histogram data from the buffer area 241.
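The decision of steps S554 to S556 can be sketched as follows, assuming the histogram is a mapping from (a*, b*) pairs to pixel counts. The names are hypothetical, the denominator is taken to be the number of pixels actually counted in the histogram, and an empty histogram is treated as having no chromatic pixels; these are assumptions of this sketch.

```python
import math


def chromatic_ratio(hist, r_th):
    """Return PN: the share of counted pixels at distance >= r_th from a* = b* = 0."""
    total = sum(hist.values())
    if total == 0:
        return 0.0  # assumption: an empty histogram has no chromatic pixels
    chromatic = sum(n for (a, b), n in hist.items()
                    if math.hypot(a, b) >= r_th)
    return chromatic / total


def classify_region(hist, r_th, n_th):
    """Return "gray" when PN < n_th (S555), otherwise "color" (S556)."""
    return "gray" if chromatic_ratio(hist, r_th) < n_th else "color"
```

For example, with an illustrative `r_th=10` and `n_th=0.15` (the upper bound the text calls more preferable), a region in which 5 of 100 counted pixels lie outside the achromatic range is classified as gray, while a region in which half of the pixels lie outside it is classified as color.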

In the subsequent step S557, the determination unit 125 determines whether all non-character object regions have been selected. When all non-character object regions have been selected (step S557: YES), the determination unit 125 ends the gray determination process. When an unselected non-character object region remains (step S557: NO), the determination unit 125 returns to step S551, selects a new unselected non-character object region, and repeats the processing of steps S552 to S557 described above.

The reason the pixels in the outer edge portions OE1 to OE3 of the non-character object regions L31 to L33 are not used in the gray determination is as follows. These outer edge portions OE1 to OE3 lie on the boundary between an object region and the non-object region (background region), and such boundary portions may have a color different from the true color of the object region. A typical scanner generates scan data by receiving light from the document with an image sensor while moving the position of the image sensor relative to the document in the sub-scanning direction. In this process, the pixel data of a boundary portion may be generated from both the light from the object region of the document and the light from the background region. Accordingly, when the target image data is scan data, as in this embodiment, a boundary portion may take on a color intermediate between the true color of the object region and the color of the background region. In this embodiment, by not using the pixels in the outer edge portions OE1 to OE3 in the determination of whether a region is a gray region, it is possible to determine accurately whether the true color of the object region is achromatic or chromatic.

As a result of the gray determination process of FIG. 11, in the example of FIG. 3(D) the two non-character object regions L31 and L32 are determined to be gray regions, and the one non-character object region L33 is determined to be a color region.

When the gray determination process ends, in step S600 of FIG. 2 the reduction processing unit 130 executes a reduction process for reducing the data size of the target image data.

FIG. 15 is a flowchart of the reduction process of the first embodiment.
When the reduction process starts, first, in step S605, one object region to be processed (a character object region or a non-character object region) is selected. In the example of FIG. 3(D), the five object regions L31 to L33, L34C, and L37 are selected sequentially, one at a time.

Once the object region to be processed has been selected, in the next step S610 the reduction processing unit 130 determines whether the object region being processed is a character object region. When it is a character object region (step S610: YES), the first processing unit 131 of the reduction processing unit 130 determines a character color value TC representing the color of the character image in the character object region being processed (step S620). For example, the first processing unit 131 calculates, as the character color value TC, the averages (Rave1, Gave1, Bave1) of the R, G, and B component values over all pixels in the character object region being processed. The first processing unit 131 may calculate a different value as the character color value TC; for example, it may calculate the character color value TC using only those pixels in the character object region that are not in its outer edge portion (the boundary with the background).

Once the character color value TC has been determined, in the subsequent step S625 the third processing unit 133 of the reduction processing unit 130 changes, in the target image data stored in the buffer area 241, the pixel values of the pixels in the character object region being processed to a background color value BC. That is, the third processing unit 133 erases the character image of the character object region being processed from the target image SI. As the background color value BC, for example, the averages (Rave2, Gave2, Bave2) of the R, G, and B component values over all pixels in the non-object region (background region) surrounding the character object region being processed are used. In the example of FIG. 3(D), the background color value BC for the character object regions L34C and L37 is calculated using the pixel values of the pixels in the non-object region L30. A different value may be used as the background color value BC; for example, the third processing unit 133 may calculate the background color value BC using only those pixels in the surrounding non-object region that are relatively close to the character object region.
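Steps S620 and S625 can be sketched as two small helpers. This is a minimal illustration in which the image is a list of rows of `(R, G, B)` tuples and each region is given as a list of `(y, x)` coordinates; this representation and the function names are assumptions made for the example.

```python
def average_color(image, coords):
    """Average the R, G, and B components over the given (y, x) coordinates."""
    sums = [0, 0, 0]
    for y, x in coords:
        for i in range(3):
            sums[i] += image[y][x][i]
    return tuple(round(s / len(coords)) for s in sums)


def erase_region(image, region_coords, background_coords):
    """Overwrite the region with the background color value BC and return BC."""
    bc = average_color(image, background_coords)
    for y, x in region_coords:
        image[y][x] = bc
    return bc
```

The character color value TC would be obtained first with `average_color(image, text_coords)`, after which `erase_region(image, text_coords, background_coords)` removes the character image from the working copy of the target image.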

In the subsequent step S630, the first processing unit 131 acquires the binary image data corresponding to the character object region being processed and compresses that binary image data with the FLATE compression method. This step generates compressed character image data representing the characters in the character object region being processed. The generated compressed character image data is stored in the buffer area 241 until the compressed PDF file described later is generated (FIG. 2: S700). The binary image data corresponding to a character object region represents the binary image corresponding to the smallest rectangle circumscribing the character object. For example, the binary image represented by the binary image data corresponding to the character object region L34C in FIG. 3(D) is the binary image BI4 in FIG. 3(C), and the binary image represented by the binary image data corresponding to the character object region L37 in FIG. 3(D) is the binary image BI7 in FIG. 3(C).

The FLATE compression method is a lossless compression method used, for example, in creating ZIP files, and is suited to compressing images with a relatively small number of gradations. With the FLATE compression method, binary image data can be compressed at a high compression ratio without lowering the resolution. For characters, legibility is considered more important than color reproducibility, so maintaining resolution takes priority over maintaining gradation. For this reason, in this embodiment the compressed character image data is generated by compressing binary image data with the FLATE compression method. Once the compressed character image data has been generated with the FLATE compression method, the reduction processing unit 130 proceeds to step S650.
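As an illustration of lossless FLATE compression of a binary image, the sketch below packs 0/1 pixels into bytes and compresses them with Python's standard `zlib` module, which produces a zlib-wrapped DEFLATE stream. The bit layout (most significant bit first, each row padded to a whole byte) and the function names are assumptions chosen for the example, not details given in the specification.

```python
import zlib


def pack_bits(rows):
    """Pack rows of 0/1 pixels MSB-first, padding each row to a whole byte."""
    out = bytearray()
    for row in rows:
        for i in range(0, len(row), 8):
            chunk = row[i:i + 8]
            byte = 0
            for bit in chunk:
                byte = (byte << 1) | bit
            byte <<= 8 - len(chunk)  # pad the last byte of the row with zeros
            out.append(byte)
    return bytes(out)


def flate_compress_binary(rows):
    """Compress a binary image losslessly with DEFLATE (zlib container)."""
    return zlib.compress(pack_bits(rows), level=9)
```

Decompressing the result with `zlib.decompress` returns exactly the packed bytes, so every pixel of the character image is preserved at full resolution.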

When it is determined in step S610 of FIG. 15 described above that the object region being processed is not a character object region (step S610: NO), the reduction processing unit 130 determines whether the object region being processed is a gray region (step S615). As described above, a gray region is a non-character object region that was determined to be a gray region in the gray determination process (FIG. 2: step S550). When the object region being processed is not a gray region (step S615: NO), that is, when it is a color region, the reduction processing unit 130 proceeds to step S650.

When the object region being processed is a gray region (step S615: YES), the second processing unit 132 of the reduction processing unit 130 acquires one set of component data representing the image in the gray region being processed (step S635). In this embodiment, the second processing unit 132 selects one of the three sets of component data (that is, R component data, G component data, and B component data) included in the partial image data that corresponds to the gray region being processed within the target image data. Because an achromatic color (that is, gray) is represented by an RGB value whose three component values are equal to one another, the three sets of component data included in the partial image data corresponding to a gray region are considered to be approximately equal to one another. Therefore, any of the three sets of component data may be selected in this step.

In the subsequent step S640, the third processing unit 133 changes, in the target image data stored in the buffer area 241, the pixel values of the pixels in the gray region being processed to the background color value BC. That is, the third processing unit 133 erases the image in the gray region being processed from the target image SI. Like the background color value BC used to erase the characters in a character object region in step S625 described above, this background color value BC is calculated using the pixel values of the pixels in the non-object region (that is, the background region) surrounding the gray region being processed.

In the next step S645, the second processing unit 132 places the one set of component data acquired in step S635 onto the gray component data. Specifically, the gray component data is a single set of component data covering a region of the same size as the target image SI, and in its initial state all pixel values are set to a value representing white (that is, the maximum luminance value), for example, "255". The gray component data in its initial state is first prepared in the buffer area 241, and in this step the second processing unit 132 replaces the partial data of the gray component data that corresponds to the gray region being processed with the one set of component data acquired in step S635.

In the next step S647, the second processing unit 132 acquires the binary image data corresponding to the gray region being processed and places it onto the gray mask data. Specifically, the gray mask data is binary image data covering a region of the same size as the target image SI, that is, the same size as the gray component data, and in its initial state all pixel values are set to "0". The gray mask data in its initial state is first prepared in the buffer area 241, and in this step the second processing unit 132 replaces the partial data of the gray mask data that corresponds to the gray region being processed with the binary image data corresponding to the gray region. The binary image data corresponding to a gray region represents the binary image corresponding to the smallest rectangle circumscribing the gray region (that is, the non-character object). For example, the binary image represented by the binary image data corresponding to the gray region (that is, the non-character object region) L31 in FIG. 3(D) is the binary image BI1 in FIG. 3(C), and the binary image represented by the binary image data corresponding to the gray region (that is, the non-character object region) L32 in FIG. 3(D) is the binary image BI2 in FIG. 3(C). After acquiring the binary image data corresponding to the gray region being processed and placing it onto the gray mask data, the reduction processing unit 130 proceeds to step S650.
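Steps S635, S645, and S647 amount to copying one component of a bounding box into a full-size plane and copying the matching binary patch into a full-size mask. The sketch below illustrates this with hypothetical helper names and a list-of-rows image representation; the component is extracted before the region is erased from the target image, following the order of the steps.

```python
def new_plane(width, height, fill):
    """Create a full-size single-component plane filled with one value."""
    return [[fill] * width for _ in range(height)]


def extract_component(image, top, left, box_h, box_w, channel=0):
    """S635: copy one RGB component of the bounding box (any channel may be used)."""
    return [[image[top + dy][left + dx][channel] for dx in range(box_w)]
            for dy in range(box_h)]


def paste(plane, patch, top, left):
    """S645 / S647: overwrite the part of the plane covered by the patch."""
    for dy, row in enumerate(patch):
        plane[top + dy][left:left + len(row)] = row
```

The gray component plane would start as `new_plane(width, height, 255)` and the gray mask plane as `new_plane(width, height, 0)`; each gray region then contributes one `paste` call to each plane.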

In step S650, the reduction processing unit 130 determines whether all object regions have been selected. When all object regions have been selected (step S650: YES), the reduction processing unit 130 proceeds to step S655. When an unselected object region remains (step S650: NO), the reduction processing unit 130 returns to step S605, selects a new unselected object region, and repeats the processing of steps S610 to S650 described above.

FIG. 16 is a diagram explaining the image data generated by the reduction process of the first embodiment.
When the above processing has been completed for the target image SI and the process moves to step S655, one set of color component data (FIG. 16(A)), two sets of compressed character image data (FIG. 16(B)), one set of gray component data (FIG. 16(C)), and one set of gray mask data (FIG. 16(D)) have been generated and stored in the buffer area 241.

The color component data is generated by the processing of the third processing unit 133 of the reduction processing unit 130 (FIG. 15: steps S625 and S640). As can be seen from steps S625 and S640, the color component data is the image data obtained by replacing, among the pixel values in the target image data, the pixel values of the pixels in the character object regions and the pixel values of the pixels in the gray regions with the background color value BC. The color component data includes the partial image data of the target image data that corresponds to the color region. Accordingly, as shown in FIG. 16(A), the color component image CI represented by the color component data includes partial images representing the background Bg, which corresponds to the non-object region L30 (FIG. 3(D)) of the target image SI, and the color object Ob3, which corresponds to the color region L33 (FIG. 3(D)). Like the target image data, the color component data includes three sets of component data corresponding to three color components (that is, R component data, G component data, and B component data).

The compressed character image data is generated by the processing of the first processing unit 131 of the reduction processing unit 130 (FIG. 15: step S630). As can be seen from step S630, the compressed character image data is obtained by compressing, with the FLATE compression method, the binary image data obtained by binarizing the partial image data of the target image data that corresponds to a character object region. In the example of FIG. 16(B), two sets of compressed character image data are generated: one representing the binary image TI1, which represents the three character objects Ob4 to Ob6 corresponding to the character object region L34C (FIG. 3(D)) of the target image SI, and one representing the binary image TI2, which represents the single character object Ob7 corresponding to the character object region L37 (FIG. 3(D)).

The gray component data is generated by the processing of the second processing unit 132 of the reduction processing unit 130 (FIG. 15: steps S635 and S645). As can be seen from steps S635 and S645, the gray component data includes partial component data representing each of the one or more gray regions included in the target image SI. Accordingly, as shown in FIG. 16(C), the gray component image GI represented by the gray component data includes two gray partial images PG1 and PG2 representing the two gray objects Ob1 and Ob2, which correspond respectively to the two gray regions L31 and L32 in the target image SI. Unlike the target image data, the gray component data is a single set of component data consisting of one kind of component value.

The gray mask data is generated by the second processing unit 132 of the reduction processing unit 130 (FIG. 15: step S647). As can be seen from step S647, the gray mask image MI represented by the gray mask data includes two mask partial images PM1 and PM2, which correspond to the two gray partial images PG1 and PG2 in the gray component image GI (FIG. 16(C)). The mask partial images PM1 and PM2 indicate the positions of the object pixels in the corresponding gray partial images PG1 and PG2. That is, the gray mask data is binary data consisting of the value "1", which marks the position of an object pixel in the gray component image GI, and the value "0", which marks the position of a non-object pixel. In other words, the gray mask image MI is a binary image that defines, for the case where the gray component image GI is displayed over the color component image CI, which pixels are to be displayed (pixel value "1") and which are not (pixel value "0").
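The role of the gray mask when the layers are superimposed can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described in the embodiment: images are plain nested lists of equal size, and the function name `overlay_gray` is hypothetical.

```python
def overlay_gray(color_img, gray_img, mask):
    """Superimpose a one-component gray image on an RGB image.

    color_img: rows of (R, G, B) tuples (stands in for color component image CI)
    gray_img:  rows of single gray values (stands in for gray component image GI)
    mask:      rows of 0/1 values (stands in for gray mask image MI)
    All three are assumed to have the same width and height.
    """
    result = [list(row) for row in color_img]  # copy so the input is untouched
    for y, mask_row in enumerate(mask):
        for x, m in enumerate(mask_row):
            if m == 1:  # object pixel: show the gray value
                g = gray_img[y][x]
                result[y][x] = (g, g, g)
            # m == 0: non-object pixel, the color layer stays visible
    return result


color = [[(255, 0, 0), (255, 0, 0)]]
gray = [[40, 200]]
mask = [[1, 0]]
composited = overlay_gray(color, gray, mask)
```

Only the first pixel is replaced by its gray value; the second keeps the color layer because its mask value is "0".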

In step S655, the second processing unit 132 of the reduction processing unit 130 first compresses the gray component data with the JPEG compression method. JPEG is a lossy compression method used, for example, to compress image data produced by digital cameras. It is suited to images with relatively rich gradation, such as photographs, that is, images with a large number of colors C and gradual changes in gradation. On the other hand, JPEG compression noticeably degrades edges where the gradation changes abruptly, so it is ill suited to images such as characters, for which edge reproducibility matters for readability and appearance. The gray component data compressed in this step is also called compressed gray image data. The compressed gray image data is stored in the buffer area 241, and the uncompressed gray component data is erased.

Once the compressed gray image data has been generated, in the next step S660 the second processing unit 132 compresses the gray mask data with the FLATE compression method to generate compressed gray mask data. As described above, FLATE compression is suited to images with relatively few gradations, such as binary data. The compressed gray mask data is stored in the buffer area 241, and the uncompressed gray mask data is erased.
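As a rough sketch of compressing binary mask data with a FLATE-style method, the following uses Python's standard `zlib` module, which implements deflate compression. The bit packing (most significant bit first, each row padded to a byte boundary) and the function names `pack_bits` and `flate_compress` are assumptions made for illustration; the embodiment does not specify them.

```python
import zlib


def pack_bits(mask):
    """Pack rows of 0/1 values into bytes, MSB first, padding each row to a byte."""
    out = bytearray()
    for row in mask:
        for i in range(0, len(row), 8):
            chunk = row[i:i + 8]
            byte = 0
            for bit in chunk:
                byte = (byte << 1) | bit
            byte <<= 8 - len(chunk)  # pad a short final chunk with zero bits
            out.append(byte)
    return bytes(out)


def flate_compress(mask):
    """Deflate-compress the packed mask at the highest compression level."""
    return zlib.compress(pack_bits(mask), 9)


sample_mask = [[1, 0, 0, 0, 0, 0, 0, 0, 1]] * 64  # 64 identical 9-pixel rows
compressed_mask = flate_compress(sample_mask)
```

Because the compression is lossless, `zlib.decompress(compressed_mask)` returns exactly the packed bytes, which is why this kind of method preserves sharp edges in binary data.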

Once the compressed gray mask data has been generated, in the next step S665 the third processing unit 133 of the reduction processing unit 130 compresses the color component data with the JPEG compression method. As described above, the color component data includes three pieces of component data, so each of the three is compressed. The color component data compressed in this step is also called compressed color image data. The compressed color image data is stored in the buffer area 241, and the uncompressed color component data is erased. Once the compressed color image data has been generated, the reduction processing unit 130 ends the reduction process.

When the reduction process ends, in step S700 of FIG. 2 the generation unit 150 generates a compressed PDF file using the compressed character image data, the compressed gray image data, the compressed color image data, and the compressed gray mask data.

Specifically, the generation unit 150 stores the compressed color image data in the PDF file as image data to be displayed as the lowest layer.

The generation unit 150 also stores the compressed character image data in the PDF file as image data to be displayed as a layer above the compressed color image data. The compressed character image data is stored in the PDF file in association with a character color value TC and a coordinate value CD. The character color value TC is an RGB value representing the color of the characters and is the value calculated in step S620 of FIG. 15. The coordinate value CD is information indicating where the binary image TI represented by the compressed character image data is to be placed relative to the color component image CI represented by the compressed color image data. The coordinate value CD is expressed, for example, as the coordinates (X, Y) of the pixel at the upper left corner of the smallest rectangle circumscribing the binary image TI. In the example of FIG. 16(B), the character image data representing the binary image TI1 of the three character objects Ob4 to Ob6 is associated with a character color value TC1 (R1, G1, B1) and a coordinate value CD1 (X1, Y1). Likewise, the binary image TI2 of the single character object Ob7 is associated with a character color value TC2 (R2, G2, B2) and a coordinate value CD2 (X2, Y2).
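The association of a binary character image with a character color value TC and a coordinate value CD can be sketched as below. The `CharLayer` class and `draw_char_layer` function are hypothetical names introduced for illustration, and images are modeled as nested lists; an actual PDF viewer performs the equivalent placement when rendering the file.

```python
from dataclasses import dataclass


@dataclass
class CharLayer:
    bitmap: list   # rows of 0/1 values (binary image TI)
    color: tuple   # character color value TC as (R, G, B)
    origin: tuple  # coordinate value CD as (X, Y), upper-left corner


def draw_char_layer(base, layer):
    """Paint the layer's "1" pixels in its character color onto a copy of base."""
    x0, y0 = layer.origin
    out = [list(row) for row in base]
    for dy, row in enumerate(layer.bitmap):
        for dx, bit in enumerate(row):
            if bit:
                out[y0 + dy][x0 + dx] = layer.color
    return out


white = (255, 255, 255)
base_image = [[white] * 4 for _ in range(3)]
layer = CharLayer(bitmap=[[1, 0], [1, 1]], color=(0, 0, 128), origin=(1, 1))
page = draw_char_layer(base_image, layer)
```

Storing one color per binary image, rather than a color per pixel, is what lets the character layer stay small while still reproducing the character color.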

The generation unit 150 also stores the compressed gray image data in the PDF file as image data to be displayed as a layer above the compressed color image data. The compressed gray image data is stored in the PDF file in association with the compressed gray mask data.

When the compressed PDF file has been generated, the scanner driver 100, for example, stores the generated PDF file in the nonvolatile storage device 290, erases the compressed character image data, the compressed gray image data, the compressed color image data, and the compressed gray mask data stored in the buffer area 241, and then ends the image processing.

A PDF file can store multiple pieces of image data in different formats in a single file, and the standard is defined so that, when an image is displayed from the file, the stored pieces of image data can be superimposed and reproduced as a single image. In step S700, the generation unit 150 stores each piece of compressed image data (FIG. 16) in the PDF file in accordance with the PDF standard, so the target image SI (FIG. 3(A)) can be reproduced when the compressed PDF file created in this embodiment is displayed with PDF viewing software.

According to the first embodiment described above, the first processing unit 131 generates compressed character image data using the partial image data of the target image data that corresponds to a character region. The second processing unit 132 generates compressed gray image data composed of one type of component value using the partial image data that corresponds to a gray region. It does so by acquiring gray component data composed of one type of component value (FIG. 15: steps S635 and S645) and compressing that gray component data (FIG. 15: step S655). The third processing unit 133 generates compressed color image data composed of multiple types of component values using the partial image data that corresponds to a color region. It does so by acquiring color component data that includes multiple pieces of component data (FIG. 15: steps S625 and S640) and compressing that color component data (FIG. 15: step S665). The processes performed by the first processing unit 131, the second processing unit 132, and the third processing unit 133 differ from one another. As a result, processing suited to each of characters, gray images other than characters, and color images other than characters is executed, so the target image data can be compressed effectively to generate compressed target image data.

Specifically, for character regions, where edge reproducibility is considered more important than gradation, binary image data is compressed with the FLATE compression method to generate compressed character image data. As a result, an image representing a character region can be saved at a high compression ratio without impairing the readability of the characters.

Furthermore, among the non-character regions, which may include photographic regions where gradation is considered more important than edge reproducibility, gray regions are handled by compressing gray component data composed of a single component with the JPEG compression method to generate compressed gray image data. Using gray component data composed of a single component raises the compression ratio, and using the JPEG compression method, which is suited to multi-gradation images, allows an image representing a gray region to be saved without impairing the image quality (gradation and the like) of a gray region that may include gray photographs.

Furthermore, among the non-character regions, which may include photographic regions where gradation is considered more important than edge reproducibility, color regions are handled by compressing color component data composed of multiple types of color components (for example, the three RGB components) with the JPEG compression method to generate compressed color image data. As a result, an image representing a color region can be saved without impairing the image quality (hue and gradation) of a color region that may include color photographs.

As a result of the above, target image data representing a target image SI that includes character regions, gray regions, and color regions can be converted to and saved in a format that improves the compression ratio while maintaining high image quality overall.

Furthermore, the second processing unit 132 generates the gray component data by selecting one of the three pieces of component data (that is, the R component data, the G component data, and the B component data) included in the partial image data corresponding to a gray region (FIG. 15: step S635). As a result, the gray component data can be acquired easily.

Furthermore, the compressed gray image data is image data representing the gray component image GI shown in FIG. 16(C) and, as FIG. 16(C) shows, is a single piece of image data representing a region corresponding to the whole of the target image SI. Likewise, the compressed color image data is image data representing the color component image CI shown in FIG. 16(A) and, as FIG. 16(A) shows, is a single piece of image data representing a region corresponding to the whole of the target image SI. As a result, even when there are relatively many gray regions, for example, an excessive increase in the number of pieces of compressed image data can be suppressed.

Furthermore, when the ratio PN of pixels representing a chromatic color among the pixels included in the object region under determination is equal to or greater than the gray determination reference value Nth, the determination unit 125 determines that the object region is a color region; when the ratio PN is less than the gray determination reference value Nth, it determines that the object region is a gray region (FIG. 11). The determination unit 125 can therefore appropriately determine whether a region is a gray region or a color region.
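The comparison of the ratio PN with the reference value Nth can be written as a short sketch. The function name `classify_region`, the boolean-flag input, and the threshold value used in the example are assumptions for illustration; the source does not give a concrete value for Nth in this section.

```python
def classify_region(chromatic_flags, n_th):
    """Return "color" if the share of chromatic pixels is at least n_th, else "gray".

    chromatic_flags: one boolean per pixel in the object region
                     (True means the pixel represents a chromatic color)
    n_th:            gray determination reference value Nth, as a ratio in [0, 1]
    """
    if not chromatic_flags:
        raise ValueError("region contains no pixels")
    pn = sum(chromatic_flags) / len(chromatic_flags)  # ratio PN
    return "color" if pn >= n_th else "gray"


# Hypothetical threshold of 0.5, chosen only for this example.
mostly_gray = classify_region([True, False, False, False], 0.5)
half_color = classify_region([True, True, False, False], 0.5)
```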

Furthermore, the determination unit 125 identifies character regions and non-character regions (which include gray regions and color regions) according to the number of colors C (in other words, the number of distinct colors) contained in the object region being processed (FIGS. 9 and 10). As a result, the specifying unit 120 can accurately identify character regions and non-character regions by exploiting the difference in gradation between regions, as expressed by the number of colors C.

Furthermore, in the above embodiment, the specifying unit 120 performs labeling and integration processing (FIG. 5) to identify character regions separately for each character color. Compressed binary data is then created for each character region, and a character color value TC is associated with each piece of binary data (FIG. 16). A compressed PDF file that is compressed at a high ratio while reproducing the character colors can therefore be created.

B. Second Embodiment
FIG. 17 is a flowchart of the reduction process of the second embodiment. In FIG. 17, steps identical to those of the reduction process of the first embodiment (FIG. 15) carry the same reference signs as in FIG. 15, and steps that differ from the first embodiment carry the suffix "A".

In the reduction process of the second embodiment, steps S645A and S647A shown in FIG. 17 are executed in place of steps S645 and S647 of the first embodiment (FIG. 15), and steps S655 and S660 of the first embodiment are not executed.

In step S645A of FIG. 17, the second processing unit 132 reduces the number of gradations (for example, 256 gradations) of the single piece of component data that was acquired in step S635, is stored in the buffer area 241, and represents the image of the gray region being processed, thereby generating gray component data representing the image of the gray region. In this embodiment, the second processing unit 132 converts that component data into binary data (that is, it reduces the number of gradations to two).

In the subsequent step S647A, the second processing unit 132 compresses the two-gradation gray component data generated in step S645A with the FLATE compression method, which is suited to images with relatively few gradations, to generate compressed gray component data. This step yields compressed gray image data representing the image of the gray region being processed. The compressed gray image data is stored in the buffer area 241, and the uncompressed binarized component data is erased.

As the above description shows, in this embodiment compressed gray image data is generated for each gray region. When there are multiple gray regions, therefore, multiple pieces of compressed gray image data are generated and stored in the buffer area 241.

In the reduction process of the second embodiment, compressed gray image data is generated for each gray region, so the process of step S655 of the first embodiment (compressing a single piece of gray component data) is not performed. In addition, because the compressed gray image data in the second embodiment is binary data, the distinction between pixels to be displayed (pixel value "1") and pixels not to be displayed (pixel value "0") when it is superimposed on the color component image CI is already clear. For this reason, gray mask data (FIG. 16(D)) is not generated in the reduction process of the second embodiment, and the process of step S660 of the first embodiment is not performed either.

FIG. 18 is a diagram illustrating the image data generated by the reduction process of the second embodiment.
In the reduction process of the second embodiment, when the target image data representing the target image SI of FIG. 3(A) is used, one piece of compressed color component data (FIG. 18(A)), two pieces of compressed character image data (FIG. 18(B)), and two pieces of compressed gray component data (FIG. 18(C)) are generated and stored in the buffer area 241, as shown in FIG. 18.

The one piece of compressed color component data (FIG. 18(A)) and the two pieces of compressed character image data (FIG. 18(B)) are the same as the data of the same names in the first embodiment (FIG. 16(A) and FIG. 16(B)).

The compressed gray component data is generated for each gray region, that is, for each non-character object region determined to be gray, and represents a binary image whose size corresponds to the smallest rectangle circumscribing the gray region. In the example of FIG. 18(C), therefore, two pieces of compressed gray image data are generated, representing the two binary images GI1 and GI2 of the two gray objects Ob1 and Ob2, respectively. In step S700 of FIG. 2, the compressed gray image data representing the binary image GI1 is stored in the PDF file in association with a coordinate value CD3 (X3, Y3), and the compressed gray image data representing the binary image GI2 is stored in the PDF file in association with a coordinate value CD4 (X4, Y4).
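Finding the smallest circumscribing rectangle of a region and cropping a partial image to it can be sketched as follows. The region is modeled here as a 0/1 mask over the whole image, and the names `bounding_box` and `crop` are hypothetical; the upper-left corner of the returned box plays the role of the coordinate value CD.

```python
def bounding_box(region_mask):
    """Return (x_min, y_min, x_max, y_max) of the nonzero pixels, or None if empty."""
    ys = [y for y, row in enumerate(region_mask) if any(row)]
    xs = [x for row in region_mask for x, v in enumerate(row) if v]
    if not ys:
        return None
    return min(xs), min(ys), max(xs), max(ys)


def crop(image, box):
    """Cut out the rectangle given by an inclusive bounding box."""
    x0, y0, x1, y1 = box
    return [row[x0:x1 + 1] for row in image[y0:y1 + 1]]


region = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 0],
]
box = bounding_box(region)   # inclusive corners of the circumscribing rectangle
partial = crop(region, box)  # partial image of the region only
origin = box[:2]             # upper-left corner, analogous to a coordinate value CD
```

Keeping only the cropped rectangle plus its origin is what makes per-region storage efficient when the gray regions are small relative to the page.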

According to the second embodiment described above, as in the first embodiment, target image data representing a target image SI that includes character regions, gray regions, and color regions can be converted to and saved in a format that improves the compression ratio while maintaining high image quality overall.

Furthermore, according to the second embodiment, the second processing unit 132 executes a process that reduces the number of gradations of the component values included in the gray component data (FIG. 17: step S645A). As a result, the amount of gray component data can be reduced further.

Furthermore, because the number of gradations of the gray component data has been reduced to two, the FLATE compression method, which is suited to relatively few gradations, is adopted to compress the gray component data (FIG. 17: step S647A). As a result, the compression ratio can be improved further in accordance with the number of gradations of the gray component data.

The compressed color image data is a single piece of image data representing a region corresponding to the whole of the target image SI (that is, the color component image CI (FIG. 18(A))). The compressed gray image data, in contrast, is one or more pieces of partial image data representing one or more gray component images (the two gray component images GI1 and GI2 in the example of FIG. 18(C)), each of which represents a region corresponding to part of the target image SI. Each piece of compressed gray image data is associated with position information indicating its position in the target image SI (the coordinate values CD3 and CD4 in the example of FIG. 18). As a result, when a gray region is relatively small, for example, only the gray image data for that partial region needs to be held, which is efficient.

C. Variations:
(1) In the first and second embodiments above, the FLATE compression method is used to generate the compressed character image data, but the method is not limited to this. Another compression method suited to binary image data may be adopted, for example the lossless MMR (Modified Modified Read) method (also called the CCITT-G4 method).

(2) In the first embodiment above, the JPEG compression method is used to generate both the compressed gray image data and the compressed color image data, but the method is not limited to this. For example, LZW compression, which is used to compress image files in the GIF (Graphics Interchange Format) and TIFF (Tagged Image File Format) formats, may be adopted.

(3) In step S645A (FIG. 17) of the second embodiment above, the second processing unit 132 reduces the number of gradations of the single piece of component data representing the image of the gray region being processed (for example, 256 gradations (8 bits)) to 2 gradations (1 bit), but the reduction is not limited to this. Instead, the second processing unit 132 may reduce the number of gradations of that component data to any of 4 gradations (2 bits), 8 gradations (3 bits), 16 gradations (4 bits), 32 gradations (5 bits), 64 gradations (6 bits), or 128 gradations (7 bits).
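One simple way to reduce 8-bit values to a smaller number of gradations is to keep only the most significant bits, as sketched below. This uniform quantization and the function name `reduce_gradations` are assumptions for illustration; the source does not state how the reduction (or the binarization threshold) is chosen.

```python
def reduce_gradations(values, bits):
    """Map 8-bit values (0-255) to 2**bits levels by keeping the top `bits` bits."""
    if not 1 <= bits <= 8:
        raise ValueError("bits must be between 1 and 8")
    shift = 8 - bits
    return [v >> shift for v in values]


samples = [0, 127, 128, 255]
two_levels = reduce_gradations(samples, 1)   # 2 gradations (1 bit)
four_levels = reduce_gradations(samples, 2)  # 4 gradations (2 bits)
```

With 1 bit, values below 128 map to 0 and the rest to 1; with 2 bits, the same samples fall into four evenly spaced levels.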

(4) The scanner driver 100 may execute image processing that combines part of the image processing of the first embodiment with part of the image processing of the second embodiment. Specifically, the scanner driver 100 may apply the gradation-reducing process of step S645A of the reduction process of the second embodiment (FIG. 17) to the single piece of component data, acquired in step S635 of the reduction process of the first embodiment (FIG. 15), that represents the image in a gray region. In this case, a single piece of gray component data with a reduced number of gradations is generated, representing a region corresponding to the whole of the target image SI. Conversely, the scanner driver 100 may omit the gradation-reducing process of step S645A in the reduction process of the second embodiment (FIG. 17). In this case, one or more pieces of gray component data are generated whose gradations are not reduced (composed of component values with, for example, 256 gradations), each representing a region corresponding to part of the target image SI.

(5) In the first embodiment above, in step S635 (FIG. 15), the second processing unit 132 acquires a single piece of component data representing a gray region by selecting one of the three pieces of component data (that is, the R component data, the G component data, and the B component data) included in the partial image data of the target image data that corresponds to the gray region being processed. Instead, the second processing unit 132 may generate the single piece of component data representing the image in the gray region from all three pieces of component data (that is, the R component data, the G component data, and the B component data) included in the partial image data corresponding to that gray region. Specifically, the second processing unit 132 may calculate the luminance value Y of each of the pixels in the gray region with the following formula.
Y = R × 0.29891 + G × 0.58661 + B × 0.11448
R, G, and B in the above formula are the pixel values of the corresponding pixel in the three pieces of component data, that is, the R component value, the G component value, and the B component value. The second processing unit 132 may use a single piece of component data whose component value for each pixel is the calculated luminance value as the single piece of component data representing the image in the gray region.
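The luminance calculation can be tried out with a short sketch. The coefficients are those of the formula in the text; rounding to the nearest integer and the function names `luminance` and `to_gray_component` are assumptions added for illustration.

```python
def luminance(r, g, b):
    """Luminance value Y from the R, G, and B component values."""
    return r * 0.29891 + g * 0.58661 + b * 0.11448


def to_gray_component(region):
    """Convert rows of (R, G, B) pixels to rows of single luminance values.

    Rounding to the nearest integer is an assumption; the source does not
    say how fractional values are handled.
    """
    return [[round(luminance(*px)) for px in row] for row in region]


gray_rows = to_gray_component([[(255, 0, 0), (0, 255, 0), (0, 0, 255)]])
```

The three coefficients sum to 1, so a pixel with R = G = B keeps its value (up to rounding) when converted.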

(6) The method of calculating the edge strength Se in step S150 of FIG. 2 and the edge strength EP in step S5532 of FIG. 12 is not limited to the calculation using the Sobel operator of FIG. 5; any other method may be adopted. For example, various edge detection operators such as the Prewitt operator or the Roberts Cross operator may be used. In addition, the edge strength is not limited to being calculated from the RGB color components and may be calculated using the gradation values of another color component (for example, luminance).
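For orientation, the sketch below computes an edge strength at one pixel with a 3×3 operator and lets the Sobel kernel be swapped for the Prewitt kernel. Combining the horizontal and vertical responses as a sum of absolute values is an assumption; the exact formula of the embodiment is given elsewhere in the specification and is not reproduced here. The function name `edge_strength` is hypothetical.

```python
SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
PREWITT_X = ((-1, 0, 1), (-1, 0, 1), (-1, 0, 1))


def edge_strength(img, x, y, kernel_x=SOBEL_X):
    """Edge strength at interior pixel (x, y) of a single-component image.

    img is a list of rows of gradation values; (x, y) must not lie on the border.
    The vertical kernel is the transpose of the horizontal one.
    """
    kernel_y = tuple(zip(*kernel_x))
    gx = gy = 0
    for j in range(3):
        for i in range(3):
            p = img[y + j - 1][x + i - 1]
            gx += kernel_x[j][i] * p
            gy += kernel_y[j][i] * p
    return abs(gx) + abs(gy)  # assumed combination of the two responses


vertical_edge = [[0, 0, 255]] * 3
flat = [[128, 128, 128]] * 3
sobel_value = edge_strength(vertical_edge, 1, 1)
prewitt_value = edge_strength(vertical_edge, 1, 1, PREWITT_X)
```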

(7) In the above embodiment, the determination unit 125 uses the Lab color space to determine whether the color of each pixel is chromatic. That is, the determination unit 125 converts the pixel value (in other words, the color value) of the pixel into a color value in the Lab color space and determines whether the color of the pixel is chromatic according to whether the distance R between that color value and the achromatic axis of the Lab color space is less than the reference distance Rth. The determination is not limited to this. In general, in any color space that has an achromatic axis, for example the RGB color space, the Lab color space, the HSV color space, or the YCrCb color space, the determination unit 125 may determine that a pixel representing a color relatively close to the achromatic axis is an achromatic pixel and that a pixel representing a color relatively far from the achromatic axis is a chromatic pixel.
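In the Lab color space the achromatic axis is the L axis (a = b = 0), so the distance R from that axis depends only on the a and b components. The sketch below takes an already converted Lab value as input; the conversion from RGB is omitted, and the function name `is_chromatic` and the threshold used in the example are assumptions.

```python
import math


def is_chromatic(lab, r_th):
    """Return True if the Lab color lies at distance r_th or more from the L axis.

    lab:  (L, a, b) color value, assumed to be already converted from RGB
    r_th: reference distance Rth (the value used below is hypothetical)
    """
    _, a, b = lab
    distance = math.hypot(a, b)  # distance R from the achromatic axis
    return distance >= r_th


near_gray = is_chromatic((50.0, 3.0, 4.0), 10.0)     # R = 5
saturated = is_chromatic((50.0, 30.0, 40.0), 10.0)   # R = 50
```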

(8) The image processing function of the scanner driver 100 of the computer 200 may instead be realized by an image processing apparatus that includes an image reading unit that generates image data representing an object by optically reading the object (for example, the multifunction device 400, the scanner 300, or a digital camera (not shown)). In this case, the image processing apparatus may perform the image processing (for example, the processing of FIG. 2) using the image data generated by its own image reading unit.

In general, the image processing apparatus that realizes the image processing (for example, the processing of FIG. 2) is not limited to the computer 200 and may be any of various apparatuses. For example, a computer inside an image-related device such as a printer, a digital camera, or a scanner, a general-purpose personal computer, or a server connected to a network may be employed. Alternatively, a plurality of computers that can communicate with one another via a network may each take charge of part of the functions required for the image processing and together provide the image processing function as a whole. In this case, the plurality of computers as a whole corresponds to the image processing apparatus recited in the claims.

(9) In each of the above embodiments, part of the configuration realized by hardware may be replaced with software, and conversely, part or all of the configuration realized by software may be replaced with hardware.

The present invention has been described above on the basis of embodiments and modifications. The embodiments described above are intended to facilitate understanding of the present invention and do not limit it. The present invention may be modified and improved without departing from its spirit and the scope of the claims, and the present invention includes equivalents thereof.

DESCRIPTION OF SYMBOLS: 100 ... scanner driver, 110 ... acquisition unit, 120 ... specifying unit, 125 ... determination unit, 130 ... reduction processing unit, 131 ... first processing unit, 132 ... second processing unit, 133 ... third processing unit, 150 ... generating unit, 200 ... computer, 210 ... CPU, 240 ... volatile storage device, 241 ... buffer area, 270 ... operation unit, 280 ... communication unit, 290 ... nonvolatile storage device, 291 ... driver program, 292 ... determination table, 300 ... scanner, 400 ... multifunction device

Claims

An image processing apparatus comprising:
an acquisition unit that acquires target image data representing a target image, the target image data including a plurality of component data corresponding to a plurality of color components;
a specifying unit that specifies, in the target image, a character region representing a character, a gray region representing a gray image different from the character, and a color region representing a color image different from the character;
a reduction processing unit that reduces the data size of the target image data, the reduction processing unit including:
a first processing unit that generates compressed character image data by executing a first process using partial image data corresponding to the character region;
a second processing unit that generates compressed gray image data composed of one type of component value by executing, using partial image data corresponding to the gray region, a second process different from the first process, wherein the second process generates the compressed gray image data by performing a process of acquiring gray component data composed of one type of component value and a process of compressing the gray component data; and
a third processing unit that generates compressed color image data composed of a plurality of types of component values by executing, using partial image data corresponding to the color region, a third process different from the first process and the second process, wherein the third process generates the compressed color image data by performing a process of acquiring color component data including a plurality of component data and a process of compressing the color component data; and
a generating unit that generates compressed target image data representing the target image using the compressed character image data, the compressed gray image data, and the compressed color image data.

The image processing apparatus according to claim 1,
wherein the gray component data includes one component data selected from the plurality of component data included in the partial image data corresponding to the gray region.

The image processing apparatus according to claim 1 or 2,
wherein the compression method used by the second processing unit to compress the gray component data is the same as the compression method used by the third processing unit to compress each of the plurality of component data included in the color component data.

The image processing apparatus according to any one of claims 1 to 3,
wherein the second process includes a process of reducing the number of gradations of the component values included in the gray component data.

The image processing apparatus according to any one of claims 1 to 4,
wherein the compressed gray image data is one piece of image data representing a region corresponding to the entire target image, and
the compressed color image data is one piece of image data representing a region corresponding to the entire target image.

The image processing apparatus according to any one of claims 1 to 4,
wherein the compressed color image data is one piece of image data representing a region corresponding to the entire target image, and
the compressed gray image data is one or more pieces of image data each representing a region corresponding to a part of the target image, and is associated with position information representing a position in the target image.

The image processing apparatus according to any one of claims 1 to 6,
wherein the specifying unit includes a determination unit that determines whether a region in the target image is the gray region or the color region,
the determination unit determines that a determination target region is the color region when the ratio of pixels representing a chromatic color included in the determination target region is equal to or greater than a reference value, and
the determination unit determines that the determination target region is the gray region when the ratio of pixels representing a chromatic color included in the determination target region is less than the reference value.

The image processing apparatus according to claim 7,
wherein the determination unit determines whether the determination target region is the gray region or the color region by using, among the plurality of pixels included in the determination target region, the pixels that remain after excluding a plurality of outer edge pixels located at the outer edge portion of the determination target region.

The image processing apparatus according to claim 1, wherein:
the specifying unit specifies the character region, and a region including the gray region and the color region, according to the number of kinds of colors included in a processing target region.

A computer program that causes a computer to realize:
an acquisition function of acquiring target image data representing a target image, the target image data including a plurality of component data corresponding to a plurality of color components;
a specifying function of specifying, in the target image, a character region representing a character, a gray region representing a gray image different from the character, and a color region representing a color image different from the character;
a reduction processing function of reducing the data size of the target image data, the reduction processing function including:
a first processing function of generating compressed character image data by executing a first process using partial image data corresponding to the character region;
a second processing function of generating compressed gray image data composed of one type of component value by executing, using partial image data corresponding to the gray region, a second process different from the first process, wherein the second process generates the compressed gray image data by performing a process of acquiring gray component data composed of one type of component value and a process of compressing the gray component data; and
a third processing function of generating compressed color image data composed of a plurality of types of component values by executing, using partial image data corresponding to the color region, a third process different from the first process and the second process, wherein the third process generates the compressed color image data by performing a process of acquiring color component data including a plurality of component data and a process of compressing the color component data; and
a generation function of generating compressed target image data representing the target image using the compressed character image data, the compressed gray image data, and the compressed color image data.