JP2008099149A

JP2008099149A - Image processor, image processing method and image processing program

Info

Publication number: JP2008099149A
Application number: JP2006280790A
Authority: JP
Inventors: Kazuya Yago; 和也矢後
Original assignee: Konica Minolta Business Technologies Inc
Current assignee: Konica Minolta Business Technologies Inc
Priority date: 2006-10-13
Filing date: 2006-10-13
Publication date: 2008-04-24

Abstract

PROBLEM TO BE SOLVED: To easily and highly accurately detect the colors of characters. SOLUTION: An MFP includes: a character region extraction part 310 for extracting the character region of a character attribute where the character is indicated from color images; a thinning part 331 for thinning the character included in the character region; a character color determination part 330 for determining a character color on the basis of the color of the pixels of the color image corresponding to a plurality of character center pixels constituting the thinned character; a noise elimination part 320 for converting the color image to a binary image by binarizing it; a character region encoding part 340 for encoding the character region of the binary image and generating code data; and an integration part 370 for relating the generated code data, the determined character color and the position in the color image of the character region. COPYRIGHT: (C)2008,JPO&INPIT

Description

この発明は、画像処理装置、画像処理方法および画像処理プログラムに関し、特にカラー画像を効率的に圧縮する画像処理装置、画像処理方法および画像処理プログラムに関する。 The present invention relates to an image processing apparatus, an image processing method, and an image processing program, and more particularly, to an image processing apparatus, an image processing method, and an image processing program for efficiently compressing a color image.

近年、情報の電子化が進み、文書を紙でなく電子データとして保存あるいは送信する需要が高まっている。さらに、文書がカラーで表されることが多く、電子データもカラー画像であることが望まれる。一方で、カラー画像は、モノクロ画像に比較してデータ量が多くなるため、カラー画像の電子データを圧縮し、圧縮後のデータを保存あるいは送信することがなされている。 In recent years, with the progress of computerization of information, there is an increasing demand for storing or transmitting documents as electronic data instead of paper. Further, the document is often expressed in color, and it is desired that the electronic data is also a color image. On the other hand, since a color image has a larger data amount than a monochrome image, the electronic data of the color image is compressed and the compressed data is stored or transmitted.

しかしながら、カラー画像に文字が表されている場合、写真等の圧縮に適した非可逆方式で圧縮すると、伸張後のカラー画像中で文字が読めなくなってしまうといった問題がある。この問題に対応するために、カラー画像を、文字が表された文字領域と、文字領域以外の領域とに分離し、カラー画像の文字領域を可逆圧縮方式で圧縮し、文字領域以外の領域を非可逆圧縮方式で圧縮する技術が、特開２０００−３５００４０（特許文献１）または特開平５−１３０４２６号公報（特許文献２）に記載されている。さらに、カラー画像に複数の色で文字が表されている場合、複数の色別に文字領域を２値化した画像を可逆圧縮すると、カラー画像を効率的に圧縮することができる。 However, when characters are represented in a color image, there is a problem that if the image is compressed by an irreversible method suitable for compression of a photograph or the like, the characters cannot be read in the expanded color image. To cope with this problem, a color image is separated into a character area where characters are represented and an area other than the character area, the character area of the color image is compressed by a lossless compression method, and an area other than the character area is compressed. A technique for compressing with a lossy compression method is described in Japanese Patent Laid-Open No. 2000-350040 (Patent Document 1) or Japanese Patent Laid-Open No. 5-130426 (Patent Document 2). Furthermore, when characters are represented in a plurality of colors in a color image, the color image can be efficiently compressed by reversibly compressing an image in which character regions are binarized for a plurality of colors.

しかしながら、原稿を読み取るスキャナの特性上、スキャナが出力するカラー画像データで、文字の中心部分の画素と文字の輪郭部分の画素とで色が異なるため、文字を構成する画素の値から文字の色を高精度に検出するためには、複雑な処理を行わなければならず、処理に時間がかかるといった問題があった。
特開２００１−３１２７２５号公報特開２００１−３１２７２６号公報 However, due to the characteristics of the scanner that reads the document, the color image data output by the scanner has a different color between the pixel at the center of the character and the pixel at the outline of the character. In order to detect this with high accuracy, complicated processing must be performed, and there is a problem that processing takes time.
JP 2001-31725 A JP 2001-31726 A

この発明は上述した問題点を解決するためになされたもので、この発明の目的の一つは、文字の色を容易かつ高精度に検出することが可能な画像処理装置、画像圧縮方法および画像圧縮プログラムを提供することである。 The present invention has been made to solve the above-described problems, and one of the objects of the present invention is to provide an image processing apparatus, an image compression method, and an image capable of easily and accurately detecting the color of a character. It is to provide a compression program.

この発明の他の目的は、画像から文字が表された領域を抽出する精度を高めた画像処理装置を提供することである。 Another object of the present invention is to provide an image processing apparatus with improved accuracy for extracting a region where characters are represented from an image.

上述した目的を達成するためにこの発明のある局面によれば、画像処理装置は、カラー画像から文字が表された文字属性の文字領域を抽出する文字領域抽出手段と、文字領域に含まれる文字を細線化する細線化手段と、細線化された文字を構成する複数の文字中心画素に対応するカラー画像の画素の色に基づいて、文字色を決定する文字色決定手段と、カラー画像を２値化することによって、２値画像に変換する２値化手段と、２値画像の文字領域を符号化し、符号データを生成する文字領域符号化手段と、生成された符号データと、決定された文字色と、文字領域のカラー画像中の位置とを関連付ける関連付手段と、を備える。 In order to achieve the above-described object, according to one aspect of the present invention, an image processing device includes: a character area extracting unit that extracts a character area having a character attribute representing a character from a color image; and a character included in the character area. Thinning means for thinning the character, character color determining means for determining the character color based on the color of the pixel of the color image corresponding to the plurality of character center pixels constituting the thinned character, and two color images. By binarization, binarization means for converting to a binary image, character area encoding means for encoding a character area of the binary image and generating code data, and generated code data are determined Association means for associating the character color with the position of the character region in the color image.

この局面に従えば、カラー画像から抽出された文字領域に含まれる文字が細線化され、細線化された文字を構成する複数の文字中心画素に対応するカラー画像の画素の色に基づいて、文字色が決定される。そして、カラー画像を２値化した２値画像の文字領域が符号化され、符号データと、決定された文字色と、文字領域のカラー画像中の位置とが関連付けられる。カラー画像を２値化した２値画像の文字領域が符号化されるので、文字全体を同じ色の文字として２値化することができる。また、細線化後の文字を構成する複数の文字中心画素に対応する画像データの画素の色に基づいて文字色が決定されるので、文字の色を正確に検出することができる。その結果、画像中の文字の色を容易かつ高精度に検出することが可能な画像処理装置を提供することができる。 According to this aspect, the character included in the character region extracted from the color image is thinned, and the character is based on the color of the pixel of the color image corresponding to the plurality of character center pixels constituting the thinned character. The color is determined. Then, the character area of the binary image obtained by binarizing the color image is encoded, and the code data, the determined character color, and the position of the character area in the color image are associated with each other. Since the character area of the binary image obtained by binarizing the color image is encoded, the entire character can be binarized as a character of the same color. Further, since the character color is determined based on the color of the pixel of the image data corresponding to the plurality of character center pixels constituting the thinned character, the character color can be accurately detected. As a result, it is possible to provide an image processing apparatus that can easily and accurately detect the color of characters in an image.

好ましくは、細線化手段は、２値画像において主走査方向に連続して配置された黒画素のうちから中心の画素を抽出する主走査方向検出手段と、主走査方向と交わる副走査方向に連続して配置された黒画素のうちから中心の画素を抽出する副走査方向検出手段と、主走査方向検出手段で抽出された中心画素と、副走査方向検出手段で抽出された中心画素とを文字中心画素に決定する決定手段と、を含む。 Preferably, the thinning means is continuous in the sub-scanning direction intersecting with the main scanning direction and main scanning direction detecting means for extracting a central pixel from black pixels continuously arranged in the main scanning direction in the binary image. Sub-scanning direction detecting means for extracting the central pixel from the black pixels arranged in the same manner, the central pixel extracted by the main scanning direction detecting means, and the central pixel extracted by the sub-scanning direction detecting means Determining means for determining the center pixel.

この局面に従えば、主走査方向と副走査方向それぞれで、連続して配置された黒画素のうちから中心の画素を文字中心画素として抽出するので、文字中心画素を容易に抽出することができ、処理速度を速くすることができる。 According to this aspect, since the center pixel is extracted as the character center pixel from the continuously arranged black pixels in the main scanning direction and the sub-scanning direction, the character center pixel can be easily extracted. , Processing speed can be increased.

好ましくは、細線化手段は、主走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素を第１候補画素として抽出する主走査方向検出手段と、主走査方向と交わる副走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素を第２候補画素として抽出する副走査方向検出手段と、第１候補画素および第２候補画素のいずれでもある画素を文字中心画素に決定する決定手段と、を含む。 Preferably, the thinning unit includes a main scanning direction detecting unit that extracts at least one pixel excluding a predetermined number of pixels from both ends of a plurality of black pixels continuously arranged in the main scanning direction as a first candidate pixel. Sub-scanning direction detecting means for extracting at least one pixel excluding a predetermined number of pixels from both ends of a plurality of black pixels continuously arranged in the sub-scanning direction intersecting with the main scanning direction; Determining means for determining a pixel that is both a first candidate pixel and a second candidate pixel as a character center pixel.

この局面に従えば、主走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素であって、副走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素を文字中心画素とするので、文字中心画素を容易に抽出することができ、処理速度を早くすることができる。 According to this aspect, at least one pixel obtained by removing a predetermined number of pixels from both ends of a plurality of black pixels continuously arranged in the main scanning direction, and a plurality of pixels arranged continuously in the sub-scanning direction Since at least one pixel excluding a predetermined number of pixels from both ends of the black pixel is used as the character center pixel, the character center pixel can be easily extracted, and the processing speed can be increased.

好ましくは、文字色決定手段は、複数の文字中心画素それぞれの細線化する前の色に基づき、複数の文字中心画素それぞれを複数の色相階級のいずれかに分類する分類手段と、度数が第１のしきい値以下の色相階級に分類された画素をノイズとして除去するノイズ除去手段を含む。 Preferably, the character color determining means includes a classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on the color before thinning of each of the plurality of character center pixels, and the frequency is first. Noise removal means for removing pixels classified into a hue class that is equal to or less than the threshold value as noise.

好ましくは、文字色決定手段は、複数の文字中心画素それぞれの細線化する前の色に基づき、複数の文字中心画素それぞれを複数の色相階級のいずれかに分類する分類手段と、複数の色相階級のうちから度数が第１のしきい値を超える色相階級を代表色相として選択する選択手段と、を含み、複数の文字中心画素のうち選択された代表色相の色相階級に分類された文字中心画素の細線化する前の色に基づき文字色を決定する。 Preferably, the character color determination means includes a classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on a color before thinning of each of the plurality of character center pixels, and a plurality of hue classes. Selecting a hue class whose frequency exceeds a first threshold value as a representative hue, and the character center pixel classified into the hue class of the selected representative hue among a plurality of character center pixels The character color is determined based on the color before thinning.

好ましくは、選択手段は、度数が第１しきい値を超える色相階級が２以上連続する場合、連続する色相階級のうち度数が最大の色相階級を代表色相として選択する色相統合手段を含む。 Preferably, the selection unit includes a hue integration unit that selects, as a representative hue, a hue class having the highest frequency among consecutive hue classes when two or more hue classes having a frequency exceeding the first threshold value are consecutive.

好ましくは、文字色決定手段は、選択手段により複数の代表色相が選択された場合、複数の代表色相にそれぞれ対応する複数の文字色を決定し、文字領域符号化手段は、複数の代表色相それぞれにおいて、２値画像の文字領域に含まれる黒画素のうちから該黒画素に対応するカラー画像の画素の色が該代表色相に対応する文字色と近似する画素を抽出することにより、該代表色相に対応する文字色別２値画像を生成する文字色別２値化データ生成手段と、決定された複数の文字色にそれぞれ対応する複数の文字色別２値化データを符号化し、文字色別符号データを生成する文字色別符号化手段と、を含み、関連付手段は、決定された複数の文字色ごとに、該文字色と、該文字色に対応する文字色別符号データと、文字領域の画像データ中の位置とを関連付ける文字色別関連付手段を含む。 Preferably, the character color determining means determines a plurality of character colors respectively corresponding to the plurality of representative hues when the selection means selects a plurality of representative hues, and the character area encoding means In this case, by extracting a pixel in which the color of the pixel of the color image corresponding to the black pixel approximates the character color corresponding to the representative hue from among the black pixels included in the character region of the binary image, the representative hue Character color binarized data generating means for generating a character color binarized image corresponding to the character color, and a plurality of character color binarized data respectively corresponding to the determined character colors A character color encoding unit that generates code data, and the association unit includes, for each of the plurality of determined character colors, the character color, the character color code data corresponding to the character color, and a character Position in the image data of the area Including the character color by association means for associating the door.

好ましくは、文字色決定手段は、複数の文字中心画素それぞれの細線化する前の色に基づき、複数の文字中心画素それぞれを複数の色相階級のいずれかに分類する分類手段と、複数の色相階級の度数分布に基づいて、文字領域の属性を文字属性以外の属性に変更する属性変更手段と、を含む。 Preferably, the character color determination means includes a classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on a color before thinning of each of the plurality of character center pixels, and a plurality of hue classes. And an attribute changing means for changing the attribute of the character area to an attribute other than the character attribute based on the frequency distribution.

この発明の他の局面によれば、画像処理装置は、カラー画像から文字が表された文字属性の文字領域を抽出する文字領域抽出手段と、カラー画像から写真または図形が表された写真／図属性の写真／図領域を抽出する写真／図領域抽出手段と、文字領域に含まれる文字を細線化する細線化手段と、細線化された文字を構成する複数の文字中心画素それぞれの細線化する前の色に基づき、複数の文字中心画素それぞれを複数の色相階級のいずれかに分類する分類手段と、複数の色相階級の度数分布に基づいて、文字領域の属性を文字属性以外の属性に変更する属性変更手段と、を備える。 According to another aspect of the present invention, an image processing apparatus includes a character area extracting unit that extracts a character area having a character attribute representing a character from a color image, and a photograph / figure representing a photograph or a figure from the color image. A photo / figure area extracting means for extracting a photograph / figure area of an attribute, a thinning means for thinning a character included in the character area, and thinning each of a plurality of character center pixels constituting the thinned character. Based on the previous color, the character area attribute is changed to an attribute other than the character attribute based on the classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes and the frequency distribution of the plurality of hue classes Attribute changing means.

この局面に従えば、文字中心画素それぞれの細線化する前の色の複数の色相階級における度数分布に基づいて、文字領域の属性が文字属性以外の属性に変更される。このため、写真または図が表されている領域が誤って文字領域に判別された場合であっても、適切に領域の属性を判別することができる。その結果、画像から文字が表された領域を抽出する精度を高めた画像処理装置を提供することができる。 According to this aspect, the attribute of the character area is changed to an attribute other than the character attribute based on the frequency distribution in a plurality of hue classes of the color before thinning of each character center pixel. For this reason, even when the area where the photograph or the figure is represented is erroneously determined as the character area, the attribute of the area can be appropriately determined. As a result, it is possible to provide an image processing apparatus with improved accuracy for extracting a region where characters are represented from an image.

この発明のさらに他の局面によれば、画像処理方法は、カラー画像から文字が表された文字属性の文字領域を抽出するステップと、文字領域に含まれる文字を細線化するステップと、細線化された文字を構成する複数の文字中心画素に対応するカラー画像の画素の色に基づいて、文字色を決定するステップと、カラー画像を２値化することによって、カラー画像を２値画像に変換するステップと、２値画像の文字領域を符号化し、符号データを生成するステップと、生成された符号データと、決定された文字色と、文字領域のカラー画像中の位置とを関連付けるステップと、を含む。 According to still another aspect of the present invention, an image processing method includes a step of extracting a character region having a character attribute representing a character from a color image, a step of thinning a character included in the character region, and a thinning step Determining a character color based on the color of a pixel of a color image corresponding to a plurality of character center pixels constituting the character and converting the color image into a binary image by binarizing the color image Encoding the character region of the binary image and generating code data, associating the generated code data, the determined character color, and the position of the character region in the color image; including.

この局面に従えば、画像中の文字の色を容易かつ高精度に検出することが可能な画像処理方法を提供することができる。 According to this aspect, it is possible to provide an image processing method capable of easily and accurately detecting the color of characters in an image.

この発明のさらに他の局面によれば、画像処理方法は、カラー画像から文字が表された文字属性の文字領域を抽出するステップと、カラー画像から写真または図形が表された写真／図属性の写真／図領域を抽出するステップと、文字領域に含まれる文字を細線化するステップと、細線化された文字を構成する複数の文字中心画素それぞれの細線化する前の色に基づき、複数の文字中心画素それぞれを複数の色相階級のいずれかに分類するステップと、
複数の色相階級の度数分布に基づいて、文字領域の属性を文字属性以外の属性に変更するステップと、を含む。 According to still another aspect of the present invention, an image processing method includes a step of extracting a character area having a character attribute representing a character from a color image, and a photo / graphic attribute representing a photograph or figure from the color image. A step of extracting a photo / figure region, a step of thinning characters included in the character region, and a plurality of characters based on the color before thinning of each of a plurality of character center pixels constituting the thinned character Classifying each central pixel into one of a plurality of hue classes;
Changing the attribute of the character region to an attribute other than the character attribute based on the frequency distribution of a plurality of hue classes.

この局面に従えば、画像から文字が表された領域を抽出する精度を高めた画像処理方法を提供することができる。 If this aspect is followed, the image processing method which improved the precision which extracts the area | region where the character was represented from the image can be provided.

この発明のさらに他の局面によれば、画像処理プログラムは、カラー画像から文字が表された文字属性の文字領域を抽出するステップと、文字領域に含まれる文字を細線化するステップと、細線化された文字を構成する複数の文字中心画素に対応するカラー画像の画素の色に基づいて、文字色を決定するステップと、カラー画像を２値化することによって、カラー画像を２値画像に変換するステップと、２値画像の文字領域を符号化し、符号データを生成するステップと、生成された符号データと、決定された文字色と、文字領域のカラー画像中の位置とを関連付けるステップと、をコンピュータに実行させる。 According to still another aspect of the present invention, an image processing program extracts a character region having a character attribute representing a character from a color image, thinning characters included in the character region, and thinning Determining a character color based on the color of a pixel of a color image corresponding to a plurality of character center pixels constituting the character and converting the color image into a binary image by binarizing the color image Encoding the character region of the binary image and generating code data, associating the generated code data, the determined character color, and the position of the character region in the color image; Is executed on the computer.

この局面に従えば、画像中の文字の色を容易かつ高精度に検出することが可能な画像処理プログラムを提供することができる。 If this aspect is followed, the image processing program which can detect the color of the character in an image easily and with high precision can be provided.

この発明のさらに他の局面によれば、画像処理プログラムは、カラー画像から文字が表された文字属性の文字領域を抽出するステップと、カラー画像から写真または図形が表された写真／図属性の写真／図領域を抽出するステップと、文字領域に含まれる文字を細線化するステップと、細線化された文字を構成する複数の文字中心画素それぞれの細線化する前の色に基づき、複数の文字中心画素それぞれを複数の色相階級のいずれかに分類するステップと、
複数の色相階級の度数分布に基づいて、文字領域の属性を文字属性以外の属性に変更するステップと、をコンピュータに実行させる。 According to still another aspect of the present invention, an image processing program extracts a character region having a character attribute representing characters from a color image, and a photo / graphic attribute representing a photograph or figure from a color image. A step of extracting a photo / figure region, a step of thinning characters included in the character region, and a plurality of characters based on the color before thinning of each of a plurality of character center pixels constituting the thinned character Classifying each central pixel into one of a plurality of hue classes;
And causing the computer to execute a step of changing the attribute of the character region to an attribute other than the character attribute based on the frequency distribution of the plurality of hue classes.

この局面に従えば、画像から文字が表された領域を抽出する精度を高めた画像処理プログラムを提供することができる。 If this aspect is followed, the image processing program which improved the precision which extracts the area | region where the character was represented from the image can be provided.

以下、本発明の実施の形態について図面を参照して説明する。以下の説明では同一の部品には同一の符号を付してある。それらの名称および機能も同じである。したがってそれらについての詳細な説明は繰返さない。 Embodiments of the present invention will be described below with reference to the drawings. In the following description, the same parts are denoted by the same reference numerals. Their names and functions are also the same. Therefore, detailed description thereof will not be repeated.

図１は、本発明の実施の形態の一つにおける画像形成システムの全体概要を示す図である。図１を参照して、画像形成システム１は、ネットワーク２にそれぞれ接続された画像処理装置としてのＭＦＰ（ＭｕｌｔｉＦｕｎｃｔｉｏｎＰｅｒｉｐｈｅｒａｌ）１００と、コンピュータ３，３Ａ，３Ｂ，３Ｃと、それぞれがＭＦＰ１００に接続されるデジタルカメラ４と、デジタルビデオカメラ５と、携帯情報端末６とを含む。 FIG. 1 is a diagram showing an overall outline of an image forming system according to one embodiment of the present invention. Referring to FIG. 1, an image forming system 1 includes an MFP (Multi Function Peripheral) 100 as an image processing apparatus connected to a network 2, and computers 3, 3 A, 3 B, and 3 C, each connected to the MFP 100. Digital camera 4, digital video camera 5, and portable information terminal 6.

ＭＦＰ１００は、原稿を読取るためのスキャナ、画像データに基づいて紙などの記録媒体に画像を形成するための画像形成部、ファクシミリを含み、画像読取機能、複写機能、ファクシミリ送受信機能を備えている。 MFP 100 includes a scanner for reading a document, an image forming unit for forming an image on a recording medium such as paper based on image data, and a facsimile, and includes an image reading function, a copying function, and a facsimile transmission / reception function.

コンピュータ３，３Ａ，３Ｂ，３Ｃは、一般的なパーソナルコンピュータであり、ＭＦＰ１００を制御するためのプリンタドライバプログラムがインストールされている。コンピュータ３，３Ａ，３Ｂ，３Ｃは、それらがアプリケーションプログラムを実行することにより、生成されるプリントデータまたは画像データを、ＭＦＰ１００に出力する。ＭＦＰ１００は、携帯情報端末６から入力されるプリントデータまたは画像データを画像処理する、または、そのプリントデータまたは画像データに基づいて、記録媒体に画像を形成する。さらに、コンピュータ３，３Ａ，３Ｂ，３Ｃは、画像処理装置としても機能する。 Computers 3, 3 A, 3 B, and 3 C are general personal computers, and a printer driver program for controlling MFP 100 is installed. Computers 3, 3 A, 3 B, and 3 C output print data or image data generated when they execute an application program to MFP 100. MFP 100 performs image processing on print data or image data input from portable information terminal 6, or forms an image on a recording medium based on the print data or image data. Furthermore, the computers 3, 3A, 3B, 3C also function as image processing apparatuses.

デジタルカメラ４およびデジタルビデオカメラ５は、撮像した静止画像または動画像を記憶し、それらが接続されたＭＦＰ１００に静止画像または動画像の画像データを出力する。ＭＦＰ１００は、入力される静止画像の画像データを画像処理する、また、静止画像の画像データに基づき用紙等の記録媒体に静止画像を形成する。また、動画像の画像データが入力される場合、１フレームの画像の画像データを画像処理する、また、１フレームの画像の画像データに基づき記録媒体に画像を形成する。携帯情報端末６は、コンピュータ３，３Ａ，３Ｂ，３Ｃと同様に、それがアプリケーションプログラムを実行することにより生成されるデータまたは画像データを、ＭＦＰ１００に出力する。ＭＦＰ１００は、携帯情報端末６から入力されるデータまたは画像データを画像処理する、また、その画像データに基づいて、記録媒体に画像を形成する。 The digital camera 4 and the digital video camera 5 store captured still images or moving images, and output image data of the still images or moving images to the MFP 100 to which they are connected. The MFP 100 performs image processing on the input still image data, and forms a still image on a recording medium such as paper based on the still image data. When image data of a moving image is input, image data of one frame image is processed, and an image is formed on a recording medium based on the image data of one frame image. As with the computers 3, 3 A, 3 B, 3 C, the portable information terminal 6 outputs data or image data generated by executing an application program to the MFP 100. MFP 100 performs image processing on data or image data input from portable information terminal 6 and forms an image on a recording medium based on the image data.

ネットワーク２は、ローカルエリアネットワーク（ＬＡＮ）であり、接続形態は有線または無線を問わない。またネットワーク２は、ＬＡＮに限らず、ワイドエリアネットワーク（ＷＡＮ）、インターネット等であってもよい。 The network 2 is a local area network (LAN), and the connection form may be wired or wireless. The network 2 is not limited to a LAN, and may be a wide area network (WAN), the Internet, or the like.

図２は、ＭＦＰの外観を示す斜視図である。図２を参照して、ＭＦＰ１００は、自動原稿搬送装置（ＡＤＦ）１０と、画像読取部２０と、画像形成部４０と、給紙部５０と、を含む。ＡＤＦ１０は、原稿台に搭載された複数枚の原稿をさばいて１枚ずつ順に、画像読取部２０に搬送する。画像読取部２０は、写真、文字、絵等の画像情報を原稿から光学的に読み取って画像データを取得する。画像形成部４０は、画像データが入力されると、画像データに基づいて用紙上に画像を形成する。画像形成部４０は、シアン、マゼンタ、イエロー、ブラックの４色のトナーを用いて画像を形成する。給紙部５０は、用紙を格納しており、格納した用紙を１枚ずつ画像形成部４０に供給する。ＭＦＰ１００は、その上面に操作パネル９を備える。 FIG. 2 is a perspective view showing the appearance of the MFP. Referring to FIG. 2, MFP 100 includes an automatic document feeder (ADF) 10, an image reading unit 20, an image forming unit 40, and a paper feeding unit 50. The ADF 10 handles a plurality of documents mounted on the document table, and sequentially conveys them to the image reading unit 20 one by one. The image reading unit 20 optically reads image information such as photographs, characters, pictures, and the like from a document and acquires image data. When the image data is input, the image forming unit 40 forms an image on a sheet based on the image data. The image forming unit 40 forms an image using toners of four colors, cyan, magenta, yellow, and black. The paper feed unit 50 stores paper and supplies the stored paper to the image forming unit 40 one by one. MFP 100 includes an operation panel 9 on the upper surface thereof.

図３は、ＭＦＰのハード構成の一例を示すブロック図である。図３を参照して、ＭＦＰ１００は、メイン回路１０１と、ファクシミリ部６０と、通信制御部６１と、ＡＤＦ１０と、画像読取部２０と、画像処理部３０と、画像形成部４０と、給紙部５０とを含む。メイン回路１０１は、中央演算装置（ＣＰＵ）１１１と、ＣＰＵ１１１の作業領域として使用されるＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１１２と、ＣＰＵ１１１が実行するプログラム等を記憶するためのＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１１３と、表示部１１４と、操作部１１５と、大容量記憶装置としてのハードディスクドライブ（ＨＤＤ）１１６と、データ通信制御部１１７と、を含む。ＣＰＵ１１１は、表示部１１４、操作部１１５、ＨＤＤ１１６およびデータ通信制御部１１７とそれぞれ接続され、メイン回路１０１の全体を制御する。また、ＣＰＵ１１１は、ファクシミリ部６０、通信制御部６１、ＡＤＦ１０、画像読取部２０、画像処理部３０、画像形成部４０、給紙部５０、と接続され、ＭＦＰ１００の全体を制御する。 FIG. 3 is a block diagram illustrating an example of a hardware configuration of the MFP. Referring to FIG. 3, MFP 100 includes main circuit 101, facsimile unit 60, communication control unit 61, ADF 10, image reading unit 20, image processing unit 30, image forming unit 40, and paper feeding unit. 50. The main circuit 101 includes a central processing unit (CPU) 111, a RAM (Random Access Memory) 112 used as a work area of the CPU 111, a ROM (Read Only Memory) 113 for storing programs executed by the CPU 111, and the like. A display unit 114, an operation unit 115, a hard disk drive (HDD) 116 as a mass storage device, and a data communication control unit 117. The CPU 111 is connected to the display unit 114, the operation unit 115, the HDD 116, and the data communication control unit 117, and controls the entire main circuit 101. The CPU 111 is connected to the facsimile unit 60, the communication control unit 61, the ADF 10, the image reading unit 20, the image processing unit 30, the image forming unit 40, and the paper feeding unit 50, and controls the entire MFP 100.

表示部１１４は、液晶表示装置（ＬＣＤ）、有機ＥＬＤ（ＥｌｅｃｔｒｏＬｕｍｉｎｅｓｃｅｎｃｅＤｉｓｐｌａｙ）等の表示装置であり、ユーザに対する指示メニューや取得した画像データに関する情報等を表示する。操作部１１５は、複数のキーを備え、キーに対応するユーザの操作による各種の指示、文字、数字などのデータの入力を受付ける。操作部１１５は、表示部１１４上に設けられたタッチパネルを含む。表示部１１４と操作部１１５とで、操作パネル９が構成される。 The display unit 114 is a display device such as a liquid crystal display (LCD) or an organic ELD (Electro Luminescence Display), and displays an instruction menu for the user, information about acquired image data, and the like. The operation unit 115 includes a plurality of keys, and accepts input of various instructions, data such as characters and numbers by user operations corresponding to the keys. The operation unit 115 includes a touch panel provided on the display unit 114. The display unit 114 and the operation unit 115 constitute the operation panel 9.

データ通信制御部１１７は、ＴＣＰ（ＴｒａｎｓｍｉｓｓｉｏｎＣｏｎｔｒｏｌＰｒｏｔｏｃｏｌ）またはＦＴＰ（ＦｉｌｅＴｒａｎｓｆｅｒＰｒｏｔｏｃｏｌ）等の通信プロトコルで通信するためのインターフェースであるＬＡＮ端子１１８と、シリアル通信インターフェース端子１１９とを有する。データ通信制御部１１７は、ＣＰＵ１１１からの指示に従って、ＬＡＮ端子１１８またはシリアル通信インターフェース端子１１９に接続された外部の機器との間でデータを送受信する。 The data communication control unit 117 includes a LAN terminal 118 that is an interface for communicating with a communication protocol such as TCP (Transmission Control Protocol) or FTP (File Transfer Protocol), and a serial communication interface terminal 119. The data communication control unit 117 transmits / receives data to / from an external device connected to the LAN terminal 118 or the serial communication interface terminal 119 in accordance with an instruction from the CPU 111.

ＬＡＮ端子１１８に、ネットワーク２に接続するためのＬＡＮケーブルが接続される場合、データ通信制御部１１７は、ＬＡＮ端子１１８を介して接続された他のＭＦＰ、スキャナ、プリンタ、またはコンピュータ３，３Ａ，３Ｂ，３Ｃと通信する。シリアル通信インターフェース端子１１９に機器が接続された場合、データ通信制御部１１７は、シリアル通信インターフェース端子１１９に接続された機器、例えば、デジタルカメラ４、デジタルビデオカメラ５または携帯情報端末６との間で通信して画像データを入出力する。また、シリアル通信インターフェース端子１１９には、フラッシュメモリを内蔵したメモリカード１１９Ａが接続可能である。ＣＰＵ１１１は、データ通信制御部１１７を制御して、メモリカード１１９ＡからＣＰＵ１１１または画像処理部３０が実行するためのプログラムを読出し、読み出したプログラムをＲＡＭ１１２に記憶し、実行する。 When a LAN cable for connecting to the network 2 is connected to the LAN terminal 118, the data communication control unit 117 is connected to another MFP, scanner, printer, or computer 3, 3A, which is connected via the LAN terminal 118. Communicate with 3B, 3C. When a device is connected to the serial communication interface terminal 119, the data communication control unit 117 communicates with a device connected to the serial communication interface terminal 119, for example, the digital camera 4, the digital video camera 5, or the portable information terminal 6. Communicate and input / output image data. The serial communication interface terminal 119 can be connected to a memory card 119A incorporating a flash memory. The CPU 111 controls the data communication control unit 117 to read a program to be executed by the CPU 111 or the image processing unit 30 from the memory card 119A, and stores the read program in the RAM 112 and executes it.

なお、ＣＰＵ１１１または画像処理部３０が実行するためのプログラムを記憶する記録媒体としては、メモリカード１１９Ａに限られず、フレキシブルディスク、カセットテープ、光ディスク（ＣＤ−ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃ−ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）／ＭＯ（ＭａｇｎｅｔｉｃＯｐｔｉｃａｌＤｉｓｃ／ＭＤ（ＭｉｎｉＤｉｓｃ）／ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ））、ＩＣカード（メモリカードを含む）、光カード、マスクＲＯＭ、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲＯＭ）、ＥＥＰＲＯＭ（ＥｌｅｃｔｒｏｎｉｃａｌｌｙＥＰＲＯＭ）などの半導体メモリ等の媒体でもよい。さらに、ＣＰＵ１１１がインターネットに接続されたコンピュータからプログラムをダウンロードしてＨＤＤ１１６に記憶する、または、インターネットに接続されたコンピュータがプログラムをＨＤＤ１１６に書込みするようにして、ＨＤＤ１１６に記憶されたプログラムをＲＡＭ１１２にロードしてＣＰＵ１１１または画像処理部３０で実行するようにしてもよい。ここでいうプログラムは、ＣＰＵ１１１または画像処理部３０により直接実行可能なプログラムだけでなく、ソースプログラム、圧縮処理されたプログラム、暗号化されたプログラム等を含む。 A recording medium for storing a program to be executed by the CPU 111 or the image processing unit 30 is not limited to the memory card 119A, but a flexible disk, a cassette tape, an optical disk (CD-ROM (Compact Disc-Read Only Memory) / MO). (Magnetic Optical Disc / MD (Mini Disc) / DVD (Digital Versatile Disc)), IC card (including memory card), optical card, mask ROM, EPROM (Erasable Programmable ROM), EEPROM (Electrical ROM, etc.) Further, the CPU 111 may be programmed from a computer connected to the Internet. The program is downloaded and stored in the HDD 116, or the computer connected to the Internet writes the program in the HDD 116, and the program stored in the HDD 116 is loaded into the RAM 112 and executed by the CPU 111 or the image processing unit 30. The program here includes not only a program that can be directly executed by the CPU 111 or the image processing unit 30 but also a source program, a compressed program, an encrypted program, and the like.

通信制御部６１は、ＣＰＵ１１１をＰＳＴＮ（ＰｕｂｌｉｃＳｗｉｔｃｈｅｄＴｅｌｅｐｈｏｎｅＮｅｔｗｏｒｋｓ）７に接続するためのモデムである。ＭＦＰ１００には、ＰＳＴＮ７における電話番号が予め割り当てられており、ＰＳＴＮ７に接続されたファクシミリ装置からＭＦＰ１００に割り当てられた電話番号に発呼があると、通信制御部６１がその発呼を検出する。通信制御部６１は、発呼を検出すると通話を確立し、ファクシミリ部６０に通信させる。 The communication control unit 61 is a modem for connecting the CPU 111 to a PSTN (Public Switched Telephony Networks) 7. The MFP 100 is assigned a telephone number in the PSTN 7 in advance. When a call is made from the facsimile apparatus connected to the PSTN 7 to the telephone number assigned to the MFP 100, the communication control unit 61 detects the call. When the communication control unit 61 detects a call, the communication control unit 61 establishes a call and causes the facsimile unit 60 to communicate.

ファクシミリ部６０は、ＰＳＴＮ７に接続され、ＰＳＴＮ７にファクシミリデータを送信する、またはＰＳＴＮ７からファクシミリデータを受信する。ファクシミリ部６０は、受信したファクシミリデータを、ＣＰＵ１１１に出力する。また、ファクシミリ部６０は、受信したファクシミリデータを画像形成部４０でプリント可能なプリントデータに変換して、画像形成部４０に出力する。これにより、画像形成部４０は、ファクシミリ部６０により受信されたファクシミリデータを記録シートにプリントする。また、ファクシミリ部６０は、ＨＤＤ１１６に記憶された画像データをファクシミリデータに変換して、ＰＳＴＮ７に接続されたファクシミリ装置または他のＭＦＰに出力する。これにより、ＨＤＤ１１６に記憶されたデータをファクシミリ装置または他のＭＦＰに出力することができる。なお、ファクシミリデータは画像データに含まれる。このように、ＭＦＰ１００は、ファクシミリ送受信機能を有する。 The facsimile unit 60 is connected to the PSTN 7 and transmits facsimile data to the PSTN 7 or receives facsimile data from the PSTN 7. The facsimile unit 60 outputs the received facsimile data to the CPU 111. The facsimile unit 60 converts the received facsimile data into print data that can be printed by the image forming unit 40 and outputs the print data to the image forming unit 40. As a result, the image forming unit 40 prints the facsimile data received by the facsimile unit 60 on a recording sheet. The facsimile unit 60 converts the image data stored in the HDD 116 into facsimile data, and outputs the facsimile data to a facsimile machine connected to the PSTN 7 or another MFP. As a result, the data stored in HDD 116 can be output to a facsimile machine or another MFP. The facsimile data is included in the image data. As described above, the MFP 100 has a facsimile transmission / reception function.

ＭＦＰ１００は、スキャナ機能を有し、画像読取部２０が原稿を読み取って出力する画像データが、ＲＡＭ１１２に一時的に記憶される。また、画像形成部４０は、画像読取部２０が原稿を読み取って出力する画像データに基づいて、用紙等の記録シート上に画像を形成する。このため、ＭＦＰ１００は、コピー機能を有する。さらに、データ通信制御部１１７は、ネットワーク２に接続された他のＰＣ３、３Ａ，３Ｂ，３Ｃのいずれかから受信した画像データ、または、シリアル通信インターフェース端子１１９に接続されたデジタルカメラ４、デジタルビデオカメラ５および携帯情報端末６のいずれかから受信された画像データをＲＡＭ１１２に一時的に記憶する。そして、ＲＡＭ１１２に一時的に記憶された画像データに基づいて、用紙等の記録シート上に画像を形成する。このため、ＭＦＰ１００は、プリント機能を有する。 The MFP 100 has a scanner function, and image data that the image reading unit 20 reads and outputs a document is temporarily stored in the RAM 112. The image forming unit 40 forms an image on a recording sheet such as paper based on image data output by the image reading unit 20 reading a document. Therefore, the MFP 100 has a copy function. Further, the data communication control unit 117 receives image data received from any of the other PCs 3, 3A, 3B, 3C connected to the network 2, or the digital camera 4 connected to the serial communication interface terminal 119, the digital video. Image data received from either the camera 5 or the portable information terminal 6 is temporarily stored in the RAM 112. Then, an image is formed on a recording sheet such as paper based on the image data temporarily stored in the RAM 112. Therefore, MFP 100 has a print function.

画像処理部３０は、ＲＡＭ１１２に記憶された画像データを画像処理する。画像処理部３０は、ＭＦＰ１００がスキャナ機能またはコピー機能を能動化する場合には、画像読取部２０が原稿を読み取って出力する画像データを画像処理する。また、画像処理部３０は、ＭＦＰ１００がファクシミリ機能を能動化する場合には、ファクシミリ部６０が受信するファクシミリデータ（画像データ）を画像処理する。さらに、画像処理部３０は、データ通信制御部１１７が、ネットワーク２に接続された他のＰＣ３、３Ａ，３Ｂ，３Ｃ、スキャナ、ＭＦＰのいずれかから受信した画像データ、またはシリアル通信インターフェース端子１１９に接続されたデジタルカメラ４、デジタルビデオカメラ５および携帯情報端末６のいずれかから受信された画像データを画像処理する。 The image processing unit 30 performs image processing on the image data stored in the RAM 112. When the MFP 100 activates the scanner function or the copy function, the image processing unit 30 performs image processing on image data output by the image reading unit 20 reading a document. Further, when MFP 100 activates the facsimile function, image processing unit 30 performs image processing on facsimile data (image data) received by facsimile unit 60. Further, the image processing unit 30 is connected to the image data received by the data communication control unit 117 from any of the other PCs 3, 3 A, 3 B, 3 C, scanner, and MFP connected to the network 2, or to the serial communication interface terminal 119. Image data received from any of the connected digital camera 4, digital video camera 5, and portable information terminal 6 is subjected to image processing.

図４は、画像処理部の機能の一例を示す機能ブロック図である。図４を参照して、画像処理部３０は、画像データに前処理を実行し明度画像を出力する前処理部３０１と、明度画像から文字の表された文字領域を抽出する文字領域抽出部３１０と、文字領域の画像からノイズを除去するノイズ除去部３２０と、文字領域に含まれる文字の色を決定する文字色決定部３３０と、文字領域の画像を符号化する文字領域符号化部３４０と、明度画像から写真属性および図属性の領域をそれぞれ抽出する写真／図領域抽出部３５０と、写真属性および図属性の領域の画像を符号化する写真／図領域符号化部３６０と、符号データを統合する統合部３７０と、を含む。 FIG. 4 is a functional block diagram illustrating an example of functions of the image processing unit. Referring to FIG. 4, an image processing unit 30 performs preprocessing on image data and outputs a brightness image, and a character region extraction unit 310 that extracts a character region in which characters are represented from the brightness image. A noise removing unit 320 that removes noise from the character region image, a character color determining unit 330 that determines the color of the character included in the character region, and a character region encoding unit 340 that encodes the character region image. A photo / figure region extraction unit 350 that extracts a photo attribute and a graphic attribute region from the brightness image, a photo / figure region encoding unit 360 that encodes a photo attribute and a graphic attribute region, and encoded data And an integration unit 370 for integration.

本実施の形態においては、画像処理部３０に入力される画像データは、１画素が赤（Ｒ）、緑（Ｇ）および青（Ｂ）の３つの値（８ビット）を含む２４ビットの値を有するカラー画像である。 In the present embodiment, the image data input to the image processing unit 30 is a 24-bit value in which one pixel includes three values (8 bits) of red (R), green (G), and blue (B). Is a color image.

前処理部３０１は、カラー画像の下地部分を除去する下地除去処理と、下地の除去されたカラー画像を明度画像に変換する明度変換処理と、明度画像をスムージングするスムージング処理とを実行し、スムージング処理した明度画像を文字領域抽出部３１０および写真／図領域抽出部３５０に出力する。下地除去処理は、カラー画像の濃度の薄い下地色を除去し、コントラストを伸張する。これにより、カラー画像の傾向を整えることができる。明度変換処理は、カラー画像の各画素値（２４ビット）に基づいて明度（８ビット）を算出し、明度画像を生成する。明度画像は、１画素が８ビットの値を有するグレースケールの画像である。明度画像を生成するのは、写真／図領域抽出部３５０および文字領域抽出部３１０で、明度画像を処理の対象としているからである。スムージング処理は、明度画像をスムージングする。スムージングすることにより、ノイズが除去されるので、写真／図領域抽出部３５０および文字領域抽出部３１０において領域を抽出する精度を向上させることができる。 The pre-processing unit 301 performs background removal processing for removing the background portion of the color image, lightness conversion processing for converting the color image from which the background is removed into a brightness image, and smoothing processing for smoothing the brightness image. The processed brightness image is output to the character region extraction unit 310 and the photograph / figure region extraction unit 350. The background removal process removes a background color having a low density in a color image and extends the contrast. Thereby, the tendency of a color image can be adjusted. In the brightness conversion process, brightness (8 bits) is calculated based on each pixel value (24 bits) of the color image, and a brightness image is generated. The brightness image is a grayscale image in which one pixel has an 8-bit value. The brightness image is generated because the brightness / image area extraction unit 350 and the character area extraction unit 310 set the brightness image as a processing target. In the smoothing process, the brightness image is smoothed. Since the noise is removed by the smoothing, it is possible to improve the accuracy of extracting a region in the photo / figure region extracting unit 350 and the character region extracting unit 310.

写真／図領域抽出部３５０の主目的は、文字、図、写真および表の表された領域を含んだカラー画像から図が表された領域および写真が表された領域を抽出することである。図が表された領域および写真が表された領域を抽出するのは、それらの領域は文字が含まれることが少ないからである。文字が含まれることが少ない図および写真の表された領域に対して文字抽出処理を行うと、図および写真の表された領域の一部を誤って文字と判定してしまうため、そのような誤判定を回避するためである。このため、本実施の形態においては、写真／図領域抽出部３５０において、図属性または写真属性と判定された領域の明度データを、文字領域抽出部３１０に入力しないようにしている。表の罫線については文字領域抽出部３１０において罫線除去を行い、罫線の位置情報を写真／図領域符号化部３６０に伝え、罫線を写真／図領域符号化部３６０で符号化する。 The main purpose of the photograph / drawing area extraction unit 350 is to extract the area where the figure is represented and the area where the photograph is represented from the color image including the area where the character, the figure, the photograph and the table are represented. The reason why the region where the figure is represented and the region where the photograph is represented is extracted because these regions rarely include characters. If character extraction processing is performed on an area where figures and photographs that contain few characters are included, a part of the area where figures and photographs are displayed will be erroneously determined as characters. This is to avoid erroneous determination. For this reason, in the present embodiment, the photo / figure region extraction unit 350 does not input the brightness data of the region determined to be a figure attribute or a photo attribute to the character region extraction unit 310. For the ruled lines in the table, the ruled line is removed by the character area extracting unit 310, the ruled line position information is transmitted to the photo / figure region encoding unit 360, and the ruled lines are encoded by the photo / figure region encoding unit 360.

写真／図領域抽出部３５０は、明度画像を２値化し、２値画像を生成する。そして、生成した２値画像をラベリングする。ラベリングは、２値画像で値が「１」の黒画素のうちで隣接する複数の画素の集合に外接する矩形をブロックとして抽出する処理である。ブロックを矩形とするのは、一般的に、写真、図形などは矩形の画像として原稿に表されることが多いからである。ラベリングにより抽出されたブロックは、それの画像データ中における位置を示す位置情報と関連付けられる。 The photograph / drawing region extraction unit 350 binarizes the brightness image and generates a binary image. Then, the generated binary image is labeled. Labeling is a process of extracting a rectangle circumscribing a set of a plurality of adjacent pixels among black pixels having a value of “1” in a binary image as a block. The reason why the block is rectangular is that, in general, a photograph, a figure, and the like are often displayed on a document as a rectangular image. The block extracted by labeling is associated with position information indicating its position in the image data.

さらに、写真／図領域抽出部３５０は、抽出されたブロックの大きさやそれぞれの明度画像の特徴量を算出し、ブロックの属性を判別する。複数のブロックが抽出された場合は、複数のブロックそれぞれの属性を判別する。属性は、明度画像のブロックに表されている対象を特定し、文字の表された文字属性と、図形の表された図属性と、写真の表された写真属性と、表属性とを含む。そして、写真／図領域抽出部３５０は、写真属性または図属性のブロックの位置情報を写真画像符号化部３６０と文字領域抽出部３１０に出力する。位置情報は、例えば、画像データにおけるブロックの対角の画素の座標である。 Further, the photograph / figure region extraction unit 350 calculates the size of the extracted block and the feature amount of each brightness image, and determines the attribute of the block. When a plurality of blocks are extracted, the attribute of each of the plurality of blocks is determined. The attribute specifies the object represented in the block of the brightness image, and includes a character attribute represented by a character, a figure attribute represented by a graphic, a photo attribute represented by a photograph, and a table attribute. Then, the photograph / figure region extraction unit 350 outputs the position information of the block of the photograph attribute or the diagram attribute to the photograph image encoding unit 360 and the character region extraction unit 310. The position information is, for example, the coordinates of the diagonal pixels of the block in the image data.

図５は、文字領域抽出部の詳細な機能を示す機能ブロック図である。図５を参照して、文字領域抽出部３１０は、明度画像からエッジを抽出し、エッジ画像を生成するエッジ画像生成部３１１と、抽出されたエッジで構成される線からエッジ画像の傾きを補正する傾き補正部３１２と、傾きが補正されたエッジ画像から罫線を除去する罫線除去部３１３と、エッジ画像から文字領域の候補となるブロックを抽出する候補ブロック抽出部３１４と、ブロックのエッジ画像が文字属性か否かを判定する判定部３１５と、文字属性とされたブロックを文字領域に決定する文字領域決定部３１６と、文字属性でないと判定されたブロックを写真属性の領域に変更する第１属性変更部３１７と、を含む。 FIG. 5 is a functional block diagram showing detailed functions of the character area extraction unit. Referring to FIG. 5, the character region extraction unit 310 extracts an edge from the brightness image, corrects the inclination of the edge image from the edge image generation unit 311 that generates the edge image, and a line composed of the extracted edges. An inclination correction unit 312, a ruled line removal unit 313 that removes a ruled line from the edge image whose inclination is corrected, a candidate block extraction unit 314 that extracts a block that is a candidate for a character region from the edge image, and an edge image of the block A determination unit 315 that determines whether or not a character attribute is present, a character region determination unit 316 that determines a block that is a character attribute as a character region, and a first that changes a block that is determined not to be a character attribute to a region of a photo attribute Attribute change unit 317.

エッジ画像生成部３１１は、前処理部３０１が出力する明度画像が入力され、明度画像を、例えば微分フィルタを用いてフィルタ処理することにより、エッジ画像を生成する。エッジ画像は、明度画像において、画素の値が、周辺の画素の値と所定の差のある画素をエッジ画素（値が「１」）として含む。傾き補正部３１２は、抽出されたエッジ画素で構成される線が、水平または垂直となるように、エッジ画像を回転させる。これにより、次の罫線除去部３１３による罫線の検出精度が向上する。罫線除去部３１３は、エッジ画像から罫線を検出し、その罫線をエッジ画像から除去する。判定部３１５における文字判定の精度を向上させるためである。また、エッジ画像の罫線に対応する部分の周辺の明度データを調べ、罫線領域を決定し、罫線の位置情報を写真／図領域符号化部３６０に出力する。 The edge image generation unit 311 receives the brightness image output from the preprocessing unit 301 and generates an edge image by filtering the brightness image using, for example, a differential filter. The edge image includes, as edge pixels (value is “1”), in the brightness image, the pixel value has a predetermined difference from the values of surrounding pixels. The inclination correction unit 312 rotates the edge image so that the line formed by the extracted edge pixels is horizontal or vertical. Thereby, the detection accuracy of the ruled line by the next ruled line removing unit 313 is improved. The ruled line removal unit 313 detects a ruled line from the edge image and removes the ruled line from the edge image. This is to improve the accuracy of character determination in the determination unit 315. Further, the brightness data around the part corresponding to the ruled line of the edge image is examined, the ruled line area is determined, and the position information of the ruled line is output to the photo / figure area encoding unit 360.

候補ブロック抽出部３１４は、エッジ画像中から文字領域の候補となる候補ブロックを抽出する。具体的には、まず行間を処理対象から除外する。行間は、エッジ画像の主走査方向（Ｘ軸方向）の画素値がすべて「０」のラインである。エッジ画像の行間以外の画素で、近傍のエッジ画素（値が「１」の画素）を連結することにより、連結されたエッジ画素の集合に外接する矩形を候補ブロックとして抽出する。候補ブロックは、画像中で、１行の文字列の外接矩形に相当する。 The candidate block extraction unit 314 extracts candidate blocks that are candidates for character regions from the edge image. Specifically, the line spacing is first excluded from the processing target. Between the lines, the pixel values in the main scanning direction (X-axis direction) of the edge image are all “0” lines. By connecting neighboring edge pixels (pixels having a value of “1”) with pixels other than between the rows of the edge image, a rectangle circumscribing the set of connected edge pixels is extracted as a candidate block. The candidate block corresponds to a circumscribed rectangle of a character string in one line in the image.

判定部３１５は、抽出された候補ブロックが、文字属性か否かを判定する。判定方法は、例えば、候補ブロックの縦横比、横方向の長さ、縦方向の長さ、候補ブロックのエッジ画像におけるエッジ画素の占める割合等を基に判定するようにすればよい。判定部３１５は、候補ブロック抽出部３１４により複数の候補ブロックが抽出された場合には、複数の候補ブロックそれぞれを判定する。 The determination unit 315 determines whether or not the extracted candidate block has a character attribute. The determination method may be determined based on, for example, the aspect ratio of the candidate block, the length in the horizontal direction, the length in the vertical direction, the ratio of edge pixels in the edge image of the candidate block, and the like. When a plurality of candidate blocks are extracted by the candidate block extraction unit 314, the determination unit 315 determines each of the plurality of candidate blocks.

文字領域決定部３１６は、判定部３１５により文字属性と判定された候補ブロックを文字領域に決定し、文字領域の画像データ中の位置を示す位置情報をノイズ除去部３２０に出力する。第１属性変更部３１７は、判定部３１５により文字属性と判定されなかった候補ブロックを写真領域に変更し、候補ブロックの画像データ中の位置を示す位置情報を写真／図領域符号化部３６０に出力する。 The character region determination unit 316 determines the candidate block determined as the character attribute by the determination unit 315 as the character region, and outputs position information indicating the position of the character region in the image data to the noise removal unit 320. The first attribute changing unit 317 changes a candidate block that has not been determined to be a character attribute by the determining unit 315 to a photo area, and sends position information indicating the position of the candidate block in the image data to the photo / figure area encoding unit 360. Output.

図６は、ノイズ除去部３２０の詳細な機能を示す機能ブロック図である。図６を参照して、ノイズ除去部３２０は、文字領域抽出部３１０から文字領域の位置情報が入力され、前処理部３０１から明度画像が入力される。ノイズ除去部３２０は、明度画像の文字領域に含まれる複数の画素の画素値に基づいて明度ヒストグラムを生成するための明度ヒストグラム生成部３２１と、ノイズおよび背景を除去するための網点・背景除去部３２３と、文字領域の属性を写真属性に変更する第２属性変更部３２５とを含む。 FIG. 6 is a functional block diagram showing detailed functions of the noise removing unit 320. Referring to FIG. 6, noise removal unit 320 receives character region position information from character region extraction unit 310 and brightness image from preprocessing unit 301. The noise removing unit 320 includes a lightness histogram generating unit 321 for generating a lightness histogram based on the pixel values of a plurality of pixels included in the character area of the lightness image, and a halftone dot / background removal for removing noise and background. Part 323 and a second attribute changing part 325 for changing the attribute of the character area to a photo attribute.

網点のノイズは、明度が低い。このため、網点・背景除去部３２３は、明度画像の文字領域に含まれる複数の画素のうちで所定のしきい値より明度の低い画素を、網点のノイズとして除去する。具体的には、しきい値より明度の低い画素の画素値を「０」にする。さらに、明度画像の文字領域に文字が表されている場合、文字領域に含まれる画素は、文字を構成する画素または背景を構成する画素のいずれかである。文字と背景とでは明度が異なる。このため、明度ヒストグラムは、文字の明度で度数が高くなり、背景の明度で度数が高くなる。複数色の文字を含む場合、複数色それぞれの文字を構成する画素の明度で度数が高くなる。また、背景は、明度が高く、文字は明度が低い。したがって、ノイズ除去部３２０は、ヒストグラムの度数から文字の明度と背景の明度との間に背景除去のためのしきい値を設定し、明度画像を２値化し、２値画像を生成する。具体的には、明度画像の文字領域に含まれる複数の画素のうちで背景除去のためのしきい値よりも明度の高い画素の画素値を「０」にし、他の画像の値を「１」にする。網点・背景除去部３２３は、文字領域の位置情報と２値画像とを、文字色決定部３３０および文字領域符号化部３４０に出力する。 Halftone noise has low brightness. For this reason, the halftone dot / background removal unit 323 removes pixels having lightness lower than a predetermined threshold among a plurality of pixels included in the character area of the lightness image as halftone noise. Specifically, the pixel value of a pixel whose brightness is lower than the threshold value is set to “0”. Furthermore, when a character is represented in the character area of the brightness image, the pixel included in the character area is either a pixel constituting the character or a pixel constituting the background. The brightness differs between the character and the background. For this reason, in the brightness histogram, the frequency increases with the brightness of the character, and the frequency increases with the brightness of the background. In the case of including characters of a plurality of colors, the frequency increases with the brightness of the pixels constituting each character of the plurality of colors. In addition, the background has high lightness and the characters have low lightness. Therefore, the noise removal unit 320 sets a threshold for background removal between the lightness of the character and the lightness of the background from the frequency of the histogram, and binarizes the lightness image to generate a binary image. Specifically, among the plurality of pixels included in the character area of the brightness image, the pixel value of the pixel having a brightness higher than the threshold for background removal is set to “0”, and the values of the other images are set to “1”. " The halftone dot / background removal unit 323 outputs the character region position information and the binary image to the character color determination unit 330 and the character region encoding unit 340.

網点・背景除去部３２３は、背景除去のためのしきい値を設定することができない場合がある。明度画像の文字領域に写真または図が表されている場合である。したがって、網点・背景除去部３２３は、背景除去のためのしきい値を設定することができない場合、第２属性変更部にエラー信号を出力する。第２属性変更部３２５は、網点・背景除去部３２３からエラー信号が入力されると、文字領域の属性を写真領域に変更し、文字領域の画像データ中の位置を示す位置情報を写真／図領域符号化部３６０に出力する。 The halftone dot / background removal unit 323 may not be able to set a threshold for background removal. This is a case where a photograph or a figure is represented in the character area of the brightness image. Therefore, the halftone / background removal unit 323 outputs an error signal to the second attribute change unit when the threshold for background removal cannot be set. When an error signal is input from the halftone dot / background removal unit 323, the second attribute change unit 325 changes the attribute of the character area to a photo area, and displays position information indicating the position of the character area in the image data. It outputs to the figure area | region encoding part 360. FIG.

図７は、文字色決定部の詳細な機能を示す機能ブロック図である。図７を参照して、文字色決定部３３０は、ノイズ除去部３２０から２値画像と位置情報とが入力され、カラー画像が入力される。文字色決定部３３０は、２値画像に含まれる文字の線幅を細くする細線化部３３１と、カラー画像中における細線の画素値から色相ヒストグラムを生成する色相ヒストグラム生成部３３２と、ノイズ色相除去部３３３と、隣接色相統合部３３４と、第３属性変更部３３５と、代表色相選択部３３６と、第４属性変更部３３７と、文字色算出部３３８とを含む。 FIG. 7 is a functional block diagram showing detailed functions of the character color determining unit. Referring to FIG. 7, character color determination unit 330 receives a binary image and position information from noise removal unit 320, and receives a color image. The character color determination unit 330 includes a thinning unit 331 that narrows the line width of characters included in the binary image, a hue histogram generation unit 332 that generates a hue histogram from pixel values of the thin lines in the color image, and noise hue removal. A section 333, an adjacent hue integration section 334, a third attribute change section 335, a representative hue selection section 336, a fourth attribute change section 337, and a character color calculation section 338.

細線化部３３１は、２値画像において主走査方向に連続して配置された黒画素のうちから中心の画素を主走査方向中心画素として抽出し、主走査方向と交わる副走査方向に連続して配置された黒画素のうちから中心の画素を副走査方向中心画素として抽出する。そして、抽出した主走査方向中心画素と、副走査方向中心画素とを文字を構成する文字中心画素に決定する。図８は、文字「Ａ」を含む２値画像の一例を示す図である。文字を構成する画素をハッチングで示している。図９（Ａ）は、主走査方向中心画素の一例を示す図である。図８に示した２値画像から抽出された主走査方向中心画素を、ハッチングで示している。図９（Ｂ）は、副主走査方向中心画素の一例を示す図である。図８に示した２値画像から抽出された副走査方向中心画素をハッチングで示している。図９（Ｃ）は、文字中心画素の一例を示す図である。文字中心画素は、図９（Ａ）に示した主走査方向中心画素と、図９（Ｂ）に示した副走査方向中心画素とを含み、それをハッチングして示している。 The thinning unit 331 extracts the central pixel from the black pixels continuously arranged in the main scanning direction in the binary image as the central pixel in the main scanning direction, and continuously in the sub-scanning direction intersecting with the main scanning direction. A central pixel is extracted from the arranged black pixels as a central pixel in the sub-scanning direction. Then, the extracted central pixel in the main scanning direction and the central pixel in the sub-scanning direction are determined as character central pixels constituting the character. FIG. 8 is a diagram illustrating an example of a binary image including the character “A”. Pixels constituting the character are indicated by hatching. FIG. 9A is a diagram illustrating an example of the central pixel in the main scanning direction. The central pixel in the main scanning direction extracted from the binary image shown in FIG. 8 is indicated by hatching. FIG. 9B is a diagram illustrating an example of the central pixel in the sub main scanning direction. The center pixel in the sub-scanning direction extracted from the binary image shown in FIG. 8 is indicated by hatching. FIG. 9C is a diagram illustrating an example of the character center pixel. The character center pixel includes the main scanning direction center pixel shown in FIG. 9A and the sub-scanning direction center pixel shown in FIG. 9B, and is hatched.

図７に戻って、細線化部３３１は、文字中心画素の値を「１」とし、他の画素の値を「０」にした２値画像を色相ヒストグラム生成部３３２に出力する。上述したように、スキャナの特性からそれが出力する画像中の黒以外の色の文字の輪郭の色が、文字の中心の色と異なるため、文字の色を文字の中心を構成する色から判定するためである。文字の主走査方向の中心の画素と、副走査方法の中心の画素とを抽出する簡単な処理で文字中心画素を抽出することができるので、処理速度が向上する。さらに、処理対象とする画素の数が少なくなるので、後の処理の速度を早くすることができる。 Returning to FIG. 7, the thinning unit 331 outputs a binary image in which the value of the character center pixel is “1” and the values of the other pixels are “0” to the hue histogram generation unit 332. As described above, because the color of the outline of the non-black character in the image that it outputs is different from the color of the center of the character, the character color is determined from the color that forms the center of the character. It is to do. Since the character center pixel can be extracted by a simple process of extracting the center pixel in the main scanning direction of the character and the center pixel of the sub-scanning method, the processing speed is improved. Furthermore, since the number of pixels to be processed is reduced, the speed of subsequent processing can be increased.

また、中心画素を文字の輪郭部分を除いた画素とするようにしてもよい。この場合、細線化部３３１は、２値画像において主走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素を第１候補画素として抽出し、副走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素を第２候補画素として抽出する。そして、第１候補画素および第２候補画素のいずれでもある画素を文字中心画素に決定する。所定数は、文字の太さ、スキャナの読取解像度により定めるようにすればよい。この場合においても、文字の輪郭部分の画素を除去する簡単な処理で文字中心画素を抽出することができるので、処理速度が向上する。さらに、処理対象とする画素の数が少なくなるので、後の処理の速度を早くすることができる。 The central pixel may be a pixel excluding the outline portion of the character. In this case, the thinning unit 331 extracts at least one pixel obtained by removing a predetermined number of pixels from both ends of a plurality of black pixels continuously arranged in the main scanning direction in the binary image as a first candidate pixel, At least one pixel obtained by removing a predetermined number of pixels from both ends of a plurality of black pixels continuously arranged in the sub-scanning direction is extracted as a second candidate pixel. Then, the pixel that is both the first candidate pixel and the second candidate pixel is determined as the character center pixel. The predetermined number may be determined by the thickness of the character and the reading resolution of the scanner. Even in this case, since the character center pixel can be extracted by a simple process of removing the pixels in the outline portion of the character, the processing speed is improved. Furthermore, since the number of pixels to be processed is reduced, the speed of subsequent processing can be increased.

色相ヒストグラム生成部３３２は、細線化部３３１から入力される２値画像で値が「１」の画素のカラー画像の画素値に基づいて、色相ヒストグラムを生成する。換言すれば、細線化された文字（文字中心画素）の細線化する前の色に基づき、細線化された文字（文字中心画素）を複数の色相階級のいずれかに分類する。具体的には、２値画像中の処理対象とする画素の位置と、２値化画像の位置情報とから処理対象とする画素に対応するカラー画像中の画素の画素値を取得する。カラー画像中の画素の画素値は、ＲＧＢ値を有するため、このＲＧＢ値をＨＳＬ表色系に変換する。そして、色相（Ｈ）に基づいて、色相ヒストグラムを生成する。ただし、彩度（Ｓ）が所定の値よりも低い場合、明度（Ｌ）をしきい値と比較し、白または黒のいずれかの階級に分類する。ここでは、色相ヒストグラムは、色相を３２等分した３２区間の階級と、白の階級と、黒の階級とを含む。図１０に、色相ヒストグラムの階級の一例を示す。 The hue histogram generation unit 332 generates a hue histogram based on the pixel value of the color image of the pixel having the value “1” in the binary image input from the thinning unit 331. In other words, the thinned character (character center pixel) is classified into one of a plurality of hue classes based on the color before thinning of the thinned character (character center pixel). Specifically, the pixel value of the pixel in the color image corresponding to the pixel to be processed is acquired from the position of the pixel to be processed in the binary image and the position information of the binarized image. Since the pixel value of the pixel in the color image has an RGB value, the RGB value is converted into the HSL color system. Then, a hue histogram is generated based on the hue (H). However, when the saturation (S) is lower than a predetermined value, the lightness (L) is compared with a threshold value and classified into either a white or black class. In this case, the hue histogram includes a class of 32 sections obtained by dividing the hue into 32 equal parts, a white class, and a black class. FIG. 10 shows an example of the class of the hue histogram.

ノイズ色相除去部３３３は、色相ヒストグラム生成部３３２により生成された色相ヒストグラムにおいて、最大度数を基準に第１しきい値を設定し、第１しきい値以下の度数の階級に振り分けられた画素をノイズとみなし、その画素を処理対象から除外する。ノイズを除去することにより、文字の色の検出精度を向上させることができる。図１１は、色相ヒストグラムの一例を示す第１の図である。図１１を参照して、度数が１以上の１２の階級が存在する。色相の階級「１０」で度数が最大の「２５００」である。図１２は、色相ヒストグラムの一例を示す第２の図である。第１しきい値を、最大度数の３０％の「７５０」としている。図１３は、色相ヒストグラムの一例を示す第３の図である。図１３は、ノイズ色相を除去した後の色相ヒストグラムを示す。ノイズ色相を除去することにより、４階級の色相に限定される。 The noise hue removing unit 333 sets the first threshold value based on the maximum frequency in the hue histogram generated by the hue histogram generating unit 332, and the pixels distributed to the frequency class equal to or lower than the first threshold value. It is regarded as noise and the pixel is excluded from the processing target. By removing noise, it is possible to improve the accuracy of character color detection. FIG. 11 is a first diagram illustrating an example of a hue histogram. Referring to FIG. 11, there are 12 classes whose frequency is 1 or more. The hue class is “10” and the frequency is “2500” which is the maximum. FIG. 12 is a second diagram illustrating an example of a hue histogram. The first threshold value is “750” which is 30% of the maximum frequency. FIG. 13 is a third diagram illustrating an example of a hue histogram. FIG. 13 shows a hue histogram after removing the noise hue. By removing the noise hue, the hue is limited to the 4th class hue.

隣接色相統合部３３４は、第１のしきい値を超える度数の階級を、代表色相選択部３３６に出力する。また、第１しきい値を超える度数の階級が連続する場合、連続する階級を統合した１つの階級を、代表色相選択部３３６に出力する。具体的には、連続する階級のうち度数が最大の階級に統合する。図１４は、色相ヒストグラムの一例を示す第４の図である。図１４は、階級「２９」と階級「３０」とを階級「３０」に統合した後の色相ヒストグラムを示す。これにより、色相の統合により３階級の色相に限定される。 The adjacent hue integration unit 334 outputs a class having a frequency exceeding the first threshold value to the representative hue selection unit 336. In addition, when classes having a frequency exceeding the first threshold value continue, one class obtained by integrating the consecutive classes is output to the representative hue selection unit 336. Specifically, it integrates into the class with the highest frequency among consecutive classes. FIG. 14 is a fourth diagram illustrating an example of a hue histogram. FIG. 14 shows a hue histogram after the class “29” and the class “30” are integrated into the class “30”. Thereby, it is limited to the hue of the 3rd class by the integration of the hue.

ただし、隣接色相統合部３３４は、第１しきい値を超える度数の階級が第２のしきい値以上連続する場合、位置情報で特定される文字領域が、文字以外のもの（写真、図形、罫線など）が表わされた領域と判断し、第３属性変更部３３５にエラー信号を出力する。第２のしきい値は、色相ヒストグラムの階級数により定まり、色相を３２等分して３２階級とする場合には、４階級とすれば、良好な結果が得られる。 However, when the frequency class exceeding the first threshold value continues for the second threshold value or more, the adjacent hue integration unit 334 determines that the character region specified by the position information is other than characters (photograph, figure, A ruled line or the like) is output, and an error signal is output to the third attribute changing unit 335. The second threshold value is determined by the number of classes in the hue histogram, and when the hue is divided into 32 equal parts to obtain the 32nd class, good results can be obtained if the fourth class is used.

第３属性変更部３３５は、隣接色相統合部３３４からエラー信号が入力されると、文字領域の属性を写真領域に変更し、文字領域の画像データ中の位置を示す位置情報を写真／図領域符号化部３６０に出力する。 When the error signal is input from the adjacent hue integration unit 334, the third attribute change unit 335 changes the attribute of the character region to a photo region, and displays position information indicating the position of the character region in the image data as a photo / figure region. The data is output to the encoding unit 360.

代表色相選択部３３６は、隣接色相統合部３３４から入力される階級の色相を、文字領域に含まれる文字の色相に決定し、入力された階級を文字色算出部３３８に出力する。隣接色相統合部３３４から複数の階級が入力される場合、複数の階級それぞれの色相を代表色相として決定する。ただし、代表色相選択部３３６は、隣接色相統合部３３４から第３のしきい値以上の階級が入力される場合、位置情報で特定される文字領域が、文字でなく写真または図形が表された領域と判断し、第４属性変更部３３７にエラー信号を出力する。第３のしきい値は、文字領域に含まれる文字の色数の上限であり、第３のしきい値を４とすれば良好な結果が得られる。 The representative hue selection unit 336 determines the hue of the class input from the adjacent hue integration unit 334 as the hue of the character included in the character area, and outputs the input class to the character color calculation unit 338. When a plurality of classes are input from the adjacent hue integration unit 334, the hues of the plurality of classes are determined as representative hues. However, the representative hue selection unit 336 displays a photograph or a figure instead of a character in the character area specified by the position information when a class higher than the third threshold value is input from the adjacent hue integration unit 334. An area is determined, and an error signal is output to the fourth attribute changing unit 337. The third threshold value is the upper limit of the number of colors of characters included in the character area. If the third threshold value is 4, good results can be obtained.

第４属性変更部３３７は、代表色相選択部３３６からエラー信号が入力されると、文字領域の属性を写真領域に変更し、文字領域の画像データ中の位置を示す位置情報を写真／図領域符号化部３６０に出力する。 When an error signal is input from the representative hue selection unit 336, the fourth attribute change unit 337 changes the attribute of the character region to a photo region, and displays position information indicating the position of the character region in the image data as a photo / figure region. The data is output to the encoding unit 360.

文字色算出部３３８は、代表色相選択部３３６から入力された階級に振り分けられた複数の画素のカラー画像の画素値から文字色を算出し、算出した文字色を文字領域符号化部３４０に出力する。具体的には、複数の画素のＲＧＢ値の平均を算出し、文字色とする。代表色相選択部３３６から複数の階級が入力される場合、複数の階級毎に文字色を算出する。 The character color calculation unit 338 calculates the character color from the pixel values of the color image of the plurality of pixels distributed to the class input from the representative hue selection unit 336, and outputs the calculated character color to the character region encoding unit 340. To do. Specifically, an average of RGB values of a plurality of pixels is calculated and set as a character color. When a plurality of classes are input from the representative hue selection unit 336, the character color is calculated for each of the plurality of classes.

図１５は、文字領域符号化部の詳細な機能を示す機能ブロック図である。図１５を参照して、文字領域符号化部３４０は、ノイズ除去部３２０から２値画像と位置情報とが入力され、文字色決定部３３０から文字色が入力され、カラー画像が入力される。文字領域符号化部３４０は、文字色別の２値画像を生成する文字色別２値画像生成部３４１と、文字色別の２値画像を圧縮符号化する文字色別符号化部３４３とを含む。文字色別２値画像生成部３４１は、２値画像のうち画素値が「１」の画素のうちからカラー画像の画素値が文字色に近い画素を抽出することにより、その文字色に対応する文字色別２値画像を生成する。カラー画像の画素値が文字色と近いか否かは、カラー画像の画素値と文字色との間のＲＧＢ色空間における距離が所定の範囲にあるか否かにより判断すればよい。 FIG. 15 is a functional block diagram showing detailed functions of the character area encoding unit. Referring to FIG. 15, character region encoding unit 340 receives a binary image and position information from noise removal unit 320, receives a character color from character color determination unit 330, and receives a color image. The character region encoding unit 340 includes: a character color-specific binary image generation unit 341 that generates a binary image for each character color; and a character color-specific encoding unit 343 that compresses and encodes a binary image for each character color. Including. The binary image generating unit 341 for each character color corresponds to the character color by extracting a pixel having a pixel value of the color image close to the character color from the pixels having the pixel value “1” in the binary image. A binary image for each character color is generated. Whether or not the pixel value of the color image is close to the character color may be determined by whether or not the distance in the RGB color space between the pixel value of the color image and the character color is within a predetermined range.

例えば、文字領域に「文字色別に圧縮」の７文字が含まれており、カラー画像において７文字のうち「文字色別」の４文字が赤色、「に」の１文字が黒色、および「圧縮」の２文字が青色の場合、文字領域符号化部３４０には、赤、黒および青の３つの文字色が入力される。文字色別２値画像生成部３４１は、赤色の文字色に対応する２値画像として、「文字色別」の４文字を含む文字色別２値化画像を生成し、黒色の文字色に対応する２値画像として、「に」の１文字を含む文字色別２値化画像を生成し、青色の文字色に対応する２値画像として、「圧縮」の２文字を含む文字色別２値化画像を生成する。文字色別２値画像生成部３４１は、文字色と、その文字色に対応する文字色別２値画像とを文字色別符号化部３４３に出力する。 For example, the character area includes seven characters “compressed by character color”, and among the seven characters in the color image, four characters “by character color” are red, one character “ni” is black, and “compressed” ”Is blue, three character colors of red, black, and blue are input to the character region encoding unit 340. The binary image generation unit 341 for each character color generates a binary image for each character color including four characters “for each character color” as a binary image corresponding to the red character color, and corresponds to the black character color. As a binary image to be generated, a binary image classified by character color including one character “ni” is generated, and as a binary image corresponding to a blue character color, binary by character color including two characters “compressed” Generate a digitized image. The character color-specific binary image generation unit 341 outputs the character color and the character color-specific binary image corresponding to the character color to the character color-specific encoding unit 343.

文字色別符号化部３４３は、文字色別２値画像を可逆圧縮方式で符号化し、符号データを生成する。そして、符号データと、文字色と、文字領域の位置情報とを統合部３７０に出力する。可逆圧縮方式は、いずれの方式を用いてもよいが、ここでは、ＭＭＲ（ＭｏｄｉｆｉｅｄＭｏｄｉｆｉｅｄＲｅａｄ）圧縮方式を用いる。 The character color encoding unit 343 encodes the character color binary image using a lossless compression method to generate code data. Then, the code data, the character color, and the position information of the character region are output to the integration unit 370. Any method may be used as the lossless compression method, but here, an MMR (Modified Modified Read) compression method is used.

図４に戻って、写真／図領域符号化部３６０は、文字以外（写真、図、罫線）のカラー画像を、非可逆圧縮方式で符号化し、符号データを生成する。圧縮方式は、非可逆圧縮方式であればよく、ここではＪＰＥＧ（ＪｏｉｎｔＰｈｏｔｏｇｒａｐｈｉｃＥｘｐｅｒｔｓＧｒｏｕｐ）方式を用いている。そして、符号データを統合部３７０に出力する。統合部３７０は、文字領域符号化部３４０から入力される符号データ、文字色および文字領域の位置情報と、写真／図領域符号化部３６０から入力される符号データとを併せて１つのファイルとして出力する。符号データは、それぞれ別々の符号データとして出力する。 Returning to FIG. 4, the photo / figure region encoding unit 360 encodes a color image other than characters (photos, diagrams, ruled lines) by the lossy compression method to generate code data. The compression method may be an irreversible compression method, and here, the JPEG (Joint Photographic Experts Group) method is used. Then, the code data is output to the integration unit 370. The integration unit 370 combines the code data input from the character region encoding unit 340, the character color and the position information of the character region, and the code data input from the photo / figure region encoding unit 360 as one file. Output. The code data is output as separate code data.

図１６は、圧縮処理の流れの一例を示すフローチャートである。圧縮処理は、圧縮プログラムを実行する画像処理部３０により実行される処理である。図１６を参照して、画像処理部３０は、画像データを受け付けたか否かを判断する（ステップＳ０１）。画像データを受け付けるまで待機状態となり（ステップＳ０１でＮＯ）、画像データを受け付けたならば処理をステップＳ０２に進める。すなわち、画像処理は、画像データを受け付けることにより実行される処理である。画像データは、ＣＰＵ１１１からの指示により、画像処理部３０が、ＲＡＭ１１２から画像データを読み出すことにより受け付ける。画像データは、１画素がＲＧＢの３つの値（８ビット）を含むカラー画像である。 FIG. 16 is a flowchart illustrating an example of the flow of compression processing. The compression process is a process executed by the image processing unit 30 that executes a compression program. Referring to FIG. 16, image processing unit 30 determines whether image data has been received (step S01). The process waits until image data is received (NO in step S01). If image data is received, the process proceeds to step S02. That is, the image processing is processing that is executed by receiving image data. The image data is received by the image processing unit 30 reading the image data from the RAM 112 in accordance with an instruction from the CPU 111. The image data is a color image in which one pixel includes three values of RGB (8 bits).

次に、カラー画像に前処理を実行することにより明度画像を生成し（ステップＳ０２）、写真領域および図領域を抽出する写真／図領域抽出処理を実行し（ステップＳ０３）、文字領域を抽出する文字領域抽出処理を実行する（ステップＳ０４）。文字領域抽出処理については後述するが、ステップＳ０２において抽出された明度画像から文字領域を抽出する。文字領域は、カラー画像中の位置を示す位置情報で特定される。文字領域抽出処理は、複数の文字領域が抽出される場合がある。 Next, a brightness image is generated by performing pre-processing on the color image (step S02), and a photo / figure region extraction process for extracting a photograph region and a diagram region is performed (step S03), thereby extracting a character region. Character area extraction processing is executed (step S04). The character region extraction process will be described later, but a character region is extracted from the brightness image extracted in step S02. The character area is specified by position information indicating a position in the color image. In the character area extraction process, a plurality of character areas may be extracted.

ステップＳ０５〜ステップＳ０８の処理は、１つの文字領域に対して実行される処理である。このため、複数の文字領域が抽出された場合は、それらのうち１つを処理対象に設定し、ステップＳ０５以降の処理が実行される。ステップＳ０５においては、ステップＳ０２で生成された明度画像の文字領域からノイズを除去することにより明度画像の文字領域を２値画像に変換する。ノイズを除去できたか否かにより明度画像の文字領域が文字属性か否かを判断する（ステップＳ０６）。文字属性ならば処理をステップＳ０７に進め、文字属性でなければ処理をステップＳ１０に進める。 The processes in steps S05 to S08 are executed for one character area. For this reason, when a plurality of character regions are extracted, one of them is set as a processing target, and the processing after step S05 is executed. In step S05, the character area of the brightness image is converted into a binary image by removing noise from the character area of the brightness image generated in step S02. It is determined whether or not the character area of the brightness image has a character attribute based on whether or not noise has been removed (step S06). If it is a character attribute, the process proceeds to step S07, and if it is not a character attribute, the process proceeds to step S10.

ステップＳ０７においては、カラー画像の文字領域に含まれる文字の色を決定するための文字色決定処理を実行する。そして、カラー画像の文字領域に含まれる文字の色相の状態により、文字領域が文字属性か否かを判断する（ステップＳ０８）。文字属性ならば処理をステップＳ０９に進め、文字属性でなければ処理をステップＳ１０に進める。 In step S07, a character color determination process for determining the color of the character included in the character region of the color image is executed. Then, it is determined whether or not the character area has a character attribute according to the state of the hue of the character included in the character area of the color image (step S08). If it is a character attribute, the process proceeds to step S09, and if it is not a character attribute, the process proceeds to step S10.

ステップＳ０９においては、２値画像の文字領域を符号化するための文字領域符号化処理を実行し、処理をステップＳ１０に進める。この段階で、２値画像の文字領域が圧縮符号化される。ステップＳ１０においては、次に処理対象とするべき未処理の文字領域が存在するか否かを判断する。そのような文字領域が存在すればその文字領域を処理対象に設定し、処理をステップＳ０５に戻すが、存在しなければ処理をステップＳ１１に進める。 In step S09, a character area encoding process for encoding the character area of the binary image is executed, and the process proceeds to step S10. At this stage, the character area of the binary image is compression encoded. In step S10, it is determined whether there is an unprocessed character area to be processed next. If such a character area exists, the character area is set as a processing target, and the process returns to step S05. If the character area does not exist, the process proceeds to step S11.

ステップＳ１１においては、ステップＳ０３で抽出された写真／図領域と、ステップＳ０４、ステップＳ０５およびステップＳ０７において文字ではないと判定された領域を圧縮符号化するための写真／図領域符号化処理を実行する。複数の写真／図領域が抽出された場合には、複数の写真／図領域毎に圧縮符号化する。このため、複数の写真／図領域のうち１つを処理対処に設定し、ステップＳ１１を実行する。そして、次に処理対象とするべき未処理の写真／図領域が存在するか否かを判断する（ステップＳ１２）。そのような写真／文字領域が存在すればその写真／文字領域を処理対象に設定し、処理をステップＳ１１に戻すが、存在しなければ処理をステップＳ１３に進める。 In step S11, the photo / figure region encoding process is performed to compress and encode the photo / figure region extracted in step S03 and the region determined not to be a character in step S04, step S05, and step S07. To do. When a plurality of photograph / figure areas are extracted, compression coding is performed for each of the plurality of photograph / figure areas. For this reason, one of the plurality of photograph / drawing areas is set to be processed, and step S11 is executed. Then, it is determined whether or not there is an unprocessed photo / drawing area to be processed next (step S12). If such a photograph / character area exists, the photograph / character area is set as a processing target, and the process returns to step S11. If not, the process proceeds to step S13.

ステップＳ１３においては、ステップＳ０９において２値画像の文字領域を符号化した符号データと、その文字領域の文字色と、その文字領域の位置情報と、ステップＳ１１においてカラー画像の写真／文字領域を符号化した符号データとを統合し、１つの圧縮データを生成する。 In step S13, code data obtained by encoding the character area of the binary image in step S09, the character color of the character area, the position information of the character area, and the photograph / character area of the color image are encoded in step S11. The compressed code data is integrated to generate one compressed data.

図１７は、前処理の流れの一例を示すフローチャートである。前処理は、図１６のステップＳ０２で実行される処理である。図１７を参照して、カラー画像の下地部分を除去する（ステップＳ２１）。カラー画像の濃度の薄い下地色を除去し、コントラストを伸張する。これにより、カラー画像の傾向を整えることができる。そして、下地の除去されたカラー画像を明度画像に変換する（ステップＳ２２）。カラー画像の各画素値（２４ビット）の値を明度（８ビット）に変換することにより、カラー画像を明度画像に変換する。次に、明度画像をスムージングする（ステップＳ２３）。明度画像をスムージングすることにより、ノイズが除去されるので、後の処理の精度を向上させることができる。 FIG. 17 is a flowchart illustrating an example of the flow of preprocessing. The preprocessing is processing executed in step S02 in FIG. Referring to FIG. 17, the background portion of the color image is removed (step S21). Removes the background color with low density in the color image and extends the contrast. Thereby, the tendency of a color image can be adjusted. Then, the color image from which the background is removed is converted into a brightness image (step S22). By converting the value of each pixel value (24 bits) of the color image into lightness (8 bits), the color image is converted into a lightness image. Next, the brightness image is smoothed (step S23). Since the noise is removed by smoothing the brightness image, the accuracy of the subsequent processing can be improved.

図１８は、写真／図領域抽出処理の流れの一例を示すフローチャートである。図１８を参照して、図１６のステップＳ０２において生成した明度画像を２値化し、２値画像を生成する（ステップＳ３１）。そして、生成した２値画像をラベリングする（ステップＳ３２）。ラベリングにより、２値画像で値が「１」の複数の画素の集合に外接する矩形がブロックとして抽出される。２値画像から複数のブロックが抽出される場合がある。 FIG. 18 is a flowchart showing an example of the flow of the photograph / drawing area extraction process. Referring to FIG. 18, the brightness image generated in step S02 of FIG. 16 is binarized to generate a binary image (step S31). Then, the generated binary image is labeled (step S32). By labeling, a rectangle circumscribing a set of a plurality of pixels having a value of “1” in the binary image is extracted as a block. A plurality of blocks may be extracted from the binary image.

ステップＳ３３〜ステップＳ３５の処理は、１つのブロックに対して実行される処理である。このため、複数のブロックが抽出された場合は、それらのうち１つを処理対象に設定し、ステップＳ３３以降の処理を実行する。ステップＳ３３においては、ステップＳ３１で生成された２値画像の処理対象ブロックの特徴量を算出する。そして、算出した特徴量から処理対象ブロックの属性を判別する。属性は、文字属性と、図属性と、写真属性と、下地属性とを含む。そして、ステップＳ３４においては、処理対象ブロックの属性が、写属性または図属性であれば処理をステップＳ３５に進め、そうでなければステップＳ３５をスキップして処理をステップＳ３６に進める。ステップＳ３５においては、処理対象ブロックを、写真／文字領域と判断し、その処理対象ブロックの２値画像中の位置を示す位置情報を写真／文字領域の位置情報として記憶する。例えば、処理対象ブロックの対抗する２つの頂点の２値画像中の座標をＲＡＭ１１２の写真／文字領域の位置情報を格納する領域に記憶する。 The processes in steps S33 to S35 are processes executed for one block. For this reason, when a plurality of blocks are extracted, one of them is set as a processing target, and the processing after step S33 is executed. In step S33, the feature amount of the processing target block of the binary image generated in step S31 is calculated. Then, the attribute of the processing target block is determined from the calculated feature amount. The attributes include a character attribute, a figure attribute, a photograph attribute, and a background attribute. In step S34, if the attribute of the processing target block is a copy attribute or a graphic attribute, the process proceeds to step S35, and if not, step S35 is skipped and the process proceeds to step S36. In step S35, the processing target block is determined to be a photo / character area, and position information indicating the position of the processing target block in the binary image is stored as position information of the photo / character area. For example, the coordinates in the binary image of the two vertices opposed to the processing target block are stored in the area for storing the position information of the photo / character area in the RAM 112.

ステップＳ３６においては、次に処理対象とするべき未処理のブロックが存在するか否かを判断する。そのようなブロックが存在すればそのブロックを処理対象に設定し、処理をステップＳ３３に戻すが、存在しなければ処理を圧縮処理に戻す。 In step S36, it is determined whether there is an unprocessed block to be processed next. If such a block exists, the block is set as a processing target, and the process returns to step S33. If not, the process returns to the compression process.

図１９は、文字領域抽出処理の流れの一例を示すフローチャートである。文字領域抽出処理は、図１６のステップＳ０４において実行される処理である。画像処理部３０は、図１６のステップＳ０２において生成された明度画像を、例えば微分フィルタを用いてフィルタ処理することにより、エッジ画像を生成する（ステップＳ４１）。そして、エッジ画像の傾きを補正する（ステップＳ４２）。抽出されたエッジ画素で構成される線が、水平または垂直となるように、エッジ画像を回転させる。画像データが、画像読取部２０から入力される場合があり、原稿が傾いて読み取られる場合があるからである。そして、エッジ画像から罫線を検出し、その罫線をエッジ画像から除去する（ステップＳ４３）。次のステップＳ４５における文字判定の精度を向上させるためである。 FIG. 19 is a flowchart illustrating an example of the flow of character area extraction processing. The character area extraction process is a process executed in step S04 in FIG. The image processing unit 30 generates an edge image by filtering the lightness image generated in step S02 of FIG. 16 using, for example, a differential filter (step S41). Then, the inclination of the edge image is corrected (step S42). The edge image is rotated so that the line composed of the extracted edge pixels is horizontal or vertical. This is because image data may be input from the image reading unit 20 and the document may be read while being tilted. Then, a ruled line is detected from the edge image, and the ruled line is removed from the edge image (step S43). This is to improve the accuracy of character determination in the next step S45.

次のステップＳ４４においては、エッジ画像中から文字領域の候補となる候補ブロックを抽出する。具体的には、まず行間を処理対象から除外し、エッジ画像のエッジ画素を近傍のエッジ画素と連結することにより、連結された複数のエッジ画素の集合に外接する矩形を候補ブロックとして抽出する。候補ブロックは、画像中で、１行の文字列の外接矩形に相当する。複数の候補ブロックが抽出される場合がある。 In the next step S44, candidate blocks as character area candidates are extracted from the edge image. Specifically, first, a line space is excluded from the processing target, and edge pixels of the edge image are connected to neighboring edge pixels to extract a rectangle circumscribing a set of a plurality of connected edge pixels as a candidate block. The candidate block corresponds to a circumscribed rectangle of a character string in one line in the image. A plurality of candidate blocks may be extracted.

ステップＳ４５〜ステップＳ４８の処理は、１つの候補ブロックに対して実行される処理である。このため、複数の候補ブロックが抽出された場合は、それらのうち１つを処理対象に設定し、ステップＳ４５以降の処理が実行される。ステップＳ４５においては、処理対象の候補ブロックが、文字属性か否かを判定する。判定方法は、例えば、候補ブロックの縦横比、横方方向の長さ、縦方向の長さ、候補ブロックのエッジ画像におけるエッジ画素の占める割合等を基に判定する。処理対象の候補ブロックが文字属性ならば処理をステップＳ４６に進めるが、そうでなければ処理をステップＳ４７に進める。 The processes in steps S45 to S48 are executed for one candidate block. For this reason, when a plurality of candidate blocks are extracted, one of them is set as a processing target, and the processing after step S45 is executed. In step S45, it is determined whether the candidate block to be processed has a character attribute. The determination method is determined based on, for example, the aspect ratio of the candidate block, the length in the horizontal direction, the length in the vertical direction, the ratio of edge pixels in the edge image of the candidate block, and the like. If the candidate block to be processed is a character attribute, the process proceeds to step S46; otherwise, the process proceeds to step S47.

ステップＳ４６においては、文字属性と判定された候補ブロックを、文字領域と判断し、その候補ブロックのエッジ画像中の位置を示す位置情報を文字領域の位置情報として記憶する。例えば、候補ブロックの対抗する２つの頂点のエッジ画像中の座標をＲＡＭ１１２の文字領域の位置情報を格納する領域に記憶する。 In step S46, the candidate block determined as the character attribute is determined as the character area, and the position information indicating the position of the candidate block in the edge image is stored as the position information of the character area. For example, the coordinates in the edge image of the two vertices opposed by the candidate block are stored in the area for storing the position information of the character area in the RAM 112.

一方、ステップＳ４７においては、文字属性と判定されなかった候補ブロックの属性を写真属性に変更し、処理をステップＳ４８に進める。具体的には、その候補ブロックのエッジ画像中の位置を示す位置情報を写真領域の位置情報として記憶するため、候補ブロックの位置情報をＲＡＭ１１２の写真領域の位置情報を格納する領域に記憶する。 On the other hand, in step S47, the attribute of the candidate block that has not been determined to be a character attribute is changed to a photo attribute, and the process proceeds to step S48. Specifically, in order to store the position information indicating the position of the candidate block in the edge image as the position information of the photograph area, the position information of the candidate block is stored in the area for storing the position information of the photograph area in the RAM 112.

ステップＳ４８においては、次に処理対象とするべき未処理の候補ブロックが存在するか否かを判断する。そのような候補ブロックが存在すればその候補ブロックを処理対象に設定し、処理をステップＳ４５に戻すが、存在しなければ処理を圧縮処理に戻す。 In step S48, it is determined whether there is an unprocessed candidate block to be processed next. If such a candidate block exists, the candidate block is set as a processing target, and the process returns to step S45. If not, the process returns to the compression process.

図２０は、ノイズ除去処理の流れの一例を示すフローチャートである。ノイズ除去処理は、図１６のステップＳ０５において実行される処理である。図２０を参照して、画像処理部３０は、文字領域の位置情報をＲＡＭ１１２から読み出す（ステップＳ５１）。そして、図１６のステップＳ０２で生成された明度画像の文字領域に含まれる複数の画素の画素値に基づいて明度ヒストグラムを生成する（ステップＳ５２）。明度画像の文字領域に含まれる複数の画素のうちで所定のしきい値より明度の低い画素を、網点のノイズとして除去する（ステップＳ５３）。具体的には、しきい値より明度の低い画素の画素値を「０」にする。さらに、ヒストグラムの度数から文字の明度と背景の明度との間に背景除去のためのしきい値を設定する（ステップＳ５４）。しきい値を設定できたならば処理をステップＳ５５に進めるが、しきい値を設定できない場合は処理をステップＳ５７に進める。背景除去のためのしきい値を設定できない場合には、文字領域に文字でなく写真または図が表されている可能性が高いからである。 FIG. 20 is a flowchart illustrating an example of the flow of noise removal processing. The noise removal process is a process executed in step S05 in FIG. Referring to FIG. 20, the image processing unit 30 reads the position information of the character area from the RAM 112 (step S51). Then, a brightness histogram is generated based on the pixel values of a plurality of pixels included in the character area of the brightness image generated in step S02 of FIG. 16 (step S52). Of the plurality of pixels included in the character area of the brightness image, pixels having a brightness lower than a predetermined threshold are removed as halftone noise (step S53). Specifically, the pixel value of a pixel whose brightness is lower than the threshold value is set to “0”. Further, a threshold for background removal is set between the brightness of the character and the brightness of the background from the frequency of the histogram (step S54). If the threshold value can be set, the process proceeds to step S55. If the threshold value cannot be set, the process proceeds to step S57. This is because when the threshold value for background removal cannot be set, there is a high possibility that a photo or a figure is displayed in the character area instead of characters.

ステップＳ５５においては、背景除去のためのしきい値を用いて、明度画像の文字領域を２値化し、２値画像を生成する。次のステップＳ５６においては、生成した２値画像と位置情報とを関連付け、処理を圧縮処理に戻す。具体的には、その２値画像と位置情報との組をＲＡＭ１１２の所定の領域に記憶する。 In step S55, the character area of the brightness image is binarized using a threshold value for background removal, and a binary image is generated. In the next step S56, the generated binary image is associated with the position information, and the process returns to the compression process. Specifically, the set of the binary image and position information is stored in a predetermined area of the RAM 112.

一方、ステップＳ５７においては、しきい値を設定できなかった文字領域の属性を写真属性に変更し、処理を圧縮処理に戻す。具体的には、その文字領域の位置情報をＲＡＭ１１２の写真領域の位置情報を格納する領域に記憶する。 On the other hand, in step S57, the attribute of the character area for which the threshold value could not be set is changed to a photo attribute, and the process returns to the compression process. Specifically, the position information of the character area is stored in an area for storing the position information of the photo area in the RAM 112.

図２１は、文字色決定処理の流れの一例を示すフローチャートである。文字色決定処理は、図１６のステップＳ０７において実行される処理である。図２１を参照して、画像処理部３０は、２値画像と位置情報との組をＲＡＭ１１２から読み出す（ステップＳ６１）。読み出した２値画像中の文字の線幅を細くするために細線化処理を実行する（ステップＳ６２）。細線化処理が実行されることにより文字中心画素の値を「１」とし、他の画素の値を「０」にした２値画像が生成される。 FIG. 21 is a flowchart illustrating an example of the flow of character color determination processing. The character color determination process is a process executed in step S07 in FIG. Referring to FIG. 21, the image processing unit 30 reads a pair of a binary image and position information from the RAM 112 (step S61). Thinning processing is executed to narrow the line width of the characters in the read binary image (step S62). By executing the thinning process, a binary image in which the value of the character center pixel is “1” and the values of the other pixels are “0” is generated.

ステップＳ６３においては、文字領域の色相ヒストグラムを生成する。ステップＳ６１で読み出した位置情報は、カラー画像における文字領域の位置を示す。この位置情報からステップＳ６２で生成された２値画像が含む各画素に対応するカラー画像中の画素を特定することができる。このため、ステップＳ６２で生成され２値画像で値が「１」の画素に対応するカラー画像の画素の画素値に基づいて、色相ヒストグラムを生成する。ここでは、色相ヒストグラムは、色相を３２等分した３２区間の階級と、白の階級と、黒の階級とを含む。 In step S63, a hue histogram of the character area is generated. The position information read in step S61 indicates the position of the character area in the color image. From this position information, the pixel in the color image corresponding to each pixel included in the binary image generated in step S62 can be specified. Therefore, a hue histogram is generated based on the pixel value of the pixel of the color image corresponding to the pixel having the value “1” in the binary image generated in step S62. In this case, the hue histogram includes a class of 32 sections obtained by dividing the hue into 32 equal parts, a white class, and a black class.

そして、ノイズとなる色相の画素を処理対象から除去する（ステップＳ６４）。具体的には、色相ヒストグラムにおいて、最大度数を基準に第１しきい値を設定し、第１しきい値以下の度数の階級に振り分けられた画素をノイズとみなし、その画素を処理対象から除外する。ノイズを除去することにより、文字の色の検出精度を向上させることができる。第１しきい値は、最大度数の３０％とするのが好ましい。 Then, the pixel of the hue that causes noise is removed from the processing target (step S64). Specifically, in the hue histogram, a first threshold value is set based on the maximum frequency, pixels assigned to a frequency class lower than the first threshold value are regarded as noise, and the pixel is excluded from the processing target. To do. By removing noise, it is possible to improve the accuracy of character color detection. The first threshold value is preferably 30% of the maximum frequency.

次に、色相ヒストグラムにおいて第１のしきい値を超える度数の階級が３つ以上連続するか否かを判断する（ステップＳ６５）。第１しきい値を超える度数の階級が３つ以上連続する場合、処理をステップＳ７１に進めるが、そうでなければ処理をステップＳ６６に進める。第１しきい値を超える度数の階級が３つ以上連続する場合、位置情報で特定される文字領域が、文字でなく写真または図形が表わされている可能性が高いからである。なお、第１しきい値を超える度数の階級が連続する最大のしきい値を２としたが、このしきい値は、これに限定されることなく、色相ヒストグラムの階級に応じて定めることができる。 Next, it is determined whether or not three or more frequency classes exceeding the first threshold value continue in the hue histogram (step S65). If three or more classes having a frequency exceeding the first threshold value continue, the process proceeds to step S71; otherwise, the process proceeds to step S66. This is because, when three or more classes having a frequency exceeding the first threshold value continue, there is a high possibility that the character area specified by the position information represents a photograph or a figure instead of characters. In addition, although the maximum threshold value with which the frequency class exceeding the first threshold value continues is 2, this threshold value is not limited to this and may be determined according to the class of the hue histogram. it can.

ステップＳ６６においては、第１しきい値を超える度数の階級が連続する場合、連続する階級を度数が最大の代表色相の階級に統合する。これにより、文字色の近い画素を１つの代表色相の階級にすることで、文字色の数を少なくし、圧縮率を向上することができる。また、文字の輪郭部分と中央部分との色を同じ色とすることができ、文字の見栄えをよくすることができる。 In step S66, when the frequency class exceeding the first threshold value continues, the continuous class is integrated into the class of the representative hue having the maximum frequency. Thereby, the number of character colors can be reduced and the compression rate can be improved by making pixels close to the character color into one representative hue class. Moreover, the color of the outline part and center part of a character can be made into the same color, and the appearance of a character can be improved.

次のステップＳ６７においては、代表色相の階級数が４以上か否かを判断する。代表色相の階級数が４以上であれば処理をステップＳ７１に進め、そうでなければ処理をステップＳ６８に進める。代表色相の階級数が４以上の場合、カラー画像が文字領域において４色以上を含むことになり、文字でなく写真または図形が表わされている可能性が高いからである。なお、代表色相の階級数が４以上の場合に文字領域を写真属性に変更するようにしたが、このしきい値「４」は、これに限定されることなく、色相ヒストグラムの階級に応じて定めることができる。 In the next step S67, it is determined whether the number of representative hues is 4 or more. If the number of representative hues is 4 or more, the process proceeds to step S71; otherwise, the process proceeds to step S68. This is because when the number of representative hues is 4 or more, the color image includes four or more colors in the character area, and there is a high possibility that a photograph or a figure is represented instead of characters. The character area is changed to the photo attribute when the number of classes of the representative hue is 4 or more. However, the threshold value “4” is not limited to this, and the character area is changed according to the class of the hue histogram. Can be determined.

ステップＳ６８においては、文字色を算出する。文字色は、代表色相の階級に基づいて算出する。代表色相の階級が複数の場合、複数の代表色相の階級毎に文字色を算出するため、複数の代表色相の階級の１つを処理対象に設定する。そして、色相ヒストグラムにおいて、処理対象の代表色相の階級に振り分けられた複数の画素のカラー画像における画素値から文字色を算出する。文字色は、複数の画素のＲＧＢ値それぞれを平均したＲＧＢ値である。 In step S68, the character color is calculated. The character color is calculated based on the representative hue class. If there are a plurality of representative hue classes, one of the plurality of representative hue classes is set as a processing target in order to calculate a character color for each of the plurality of representative hue classes. Then, in the hue histogram, the character color is calculated from the pixel values in the color image of a plurality of pixels assigned to the representative hue class to be processed. The character color is an RGB value obtained by averaging the RGB values of a plurality of pixels.

次のステップＳ６９においては、文字色と２値画像と位置情報とを関連付け、処理をステップＳ７０に進める。具体的には、ステップＳ６８で算出された文字色と、ステップＳ６４でノイズが除去された２値画像と、位置情報との組をＲＡＭ１１２の所定の領域に記憶する。 In the next step S69, the character color, the binary image, and the position information are associated, and the process proceeds to step S70. Specifically, a set of the character color calculated in step S68, the binary image from which noise is removed in step S64, and the position information is stored in a predetermined area of the RAM 112.

次のステップＳ７０においては、次に処理対象とするべき未処理の代表色相の階級が存在するか否かを判断する。未処理の代表色相の階級が存在すれば処理をステップＳ０８に戻し、存在しなければ処理を圧縮処理に戻す。 In the next step S70, it is determined whether or not there is an unprocessed representative hue class to be processed next. If there is an unprocessed representative hue class, the process returns to step S08; otherwise, the process returns to the compression process.

一方、処理をステップＳ７１に進める場合は、ステップＳ６５において第１しきい値を超える度数の階級が３つ以上連続すると判断された場合、または、ステップＳ６７において代表色相の階級数が４以上存在すると判断された場合である。この場合、文字領域には文字でなく写真または図形が表されている可能性が高いので、ステップＳ７１においては、文字領域の属性を写真属性に変更し、処理を圧縮処理に戻す。具体的には、文字領域の位置情報をＲＡＭ１１２の写真領域の位置情報を格納する領域に記憶する。 On the other hand, when the process proceeds to step S71, when it is determined in step S65 that three or more classes having a frequency exceeding the first threshold value are consecutive, or in step S67, there are four or more representative hue classes. This is the case. In this case, since there is a high possibility that a photo or a figure is represented in the character area instead of a character, in step S71, the attribute of the character area is changed to a photo attribute, and the process returns to the compression process. Specifically, the position information of the character area is stored in the area for storing the position information of the photo area in the RAM 112.

図２２は、文字領域符号化処理の流れの一例を示すフローチャートである。文字領域符号化処理は、図１６のステップＳ０９において実行される処理である。図２２を参照して、画像処理部３０は、文字色と、２値画像と、位置情報との組をＲＡＭ１１２から読み出す（ステップＳ８１）。文字色と２値画像と位置情報との組で同じ位置情報の組が複数記憶されている場合には、それらの組のうちから１つを読み出す。そして、文字色別２値画像を生成する（ステップＳ８２）。文字色別２値画像は、カラー画像で文字色に近い色の画素の画素値を「１」とし、他の画素の画素値を「０」とした画像である。具体的には、画像処理部３０は、２値画像のうち画素値が「１」の画素のうちからカラー画像の画素値が文字色に近い画素を抽出することにより、その文字色に対応する文字色別２値画像を生成する。カラー画像の画素値が文字色と近いか否かは、カラー画像の画素値と文字色との間のＲＧＢ色空間における距離が所定の範囲にあることを条件とする。 FIG. 22 is a flowchart showing an example of the flow of character area encoding processing. The character area encoding process is a process executed in step S09 in FIG. Referring to FIG. 22, the image processing unit 30 reads a set of a character color, a binary image, and position information from the RAM 112 (step S81). When a plurality of sets of the same position information are stored as a set of the character color, the binary image, and the position information, one of the sets is read out. Then, a binary image for each character color is generated (step S82). The binary image for each character color is an image in which the pixel value of a pixel of a color close to the character color is “1” and the pixel values of other pixels are “0”. Specifically, the image processing unit 30 extracts a pixel having a pixel value of a color image close to a character color from pixels having a pixel value “1” in the binary image, thereby corresponding to the character color. A binary image for each character color is generated. Whether the pixel value of the color image is close to the character color is based on the condition that the distance in the RGB color space between the pixel value of the color image and the character color is within a predetermined range.

次のステップＳ８３においては、文字色別２値画像を圧縮符号化し、符号データを生成する。ここでの圧縮符号化は、可逆圧縮方式であり、例えばＭＭＲ圧縮方式である。次のステップＳ８４においては、文字色と符号データと位置情報とを関連付け、処理をステップＳ８５に進める。具体的には、ステップＳ８１で読み出した文字色および位置情報と、ステップＳ８４で生成した符号データの組をＲＡＭ１１２の所定の領域に記憶する。 In the next step S83, the binary image for each character color is compression-encoded to generate code data. The compression coding here is a lossless compression method, for example, an MMR compression method. In the next step S84, the character color, the code data, and the position information are associated with each other, and the process proceeds to step S85. Specifically, the character color and position information read in step S81 and the set of code data generated in step S84 are stored in a predetermined area of the RAM 112.

次のステップＳ８５においては、次に処理対象とするべき文字色と２値画像と位置情報との組が存在するか否かを判断する。そのような組が存在するならば、処理をステップＳ８１に戻し、存在しなければ処理を圧縮処理に戻す。 In the next step S85, it is determined whether or not there is a set of a character color, a binary image, and position information to be processed next. If such a group exists, the process returns to step S81, and if not, the process returns to the compression process.

以上説明したように、本実施の形態におけるＭＦＰ１００は、カラー画像から抽出された文字領域に含まれる文字を細線化し、細線化された文字を構成する複数の文字中心画素に対応するカラー画像の画素の色に基づいて、文字色を決定する。そして、カラー画像を２値化した２値画像の文字領域を符号化し、符号データと、決定された文字色と、文字領域のカラー画像中の位置とを関連付ける。カラー画像を２値化した２値画像の文字領域を符号化するので、文字全体を同じ色の文字として２値化することができる。また、細線化後の文字を構成する複数の文字中心画素に対応する画像データの画素の色に基づいて文字色を決定するので、文字の色を正確に検出することができる。その結果、画像中の文字の色を容易かつ高精度に検出することができる。 As described above, MFP 100 according to the present embodiment thins a character included in a character region extracted from a color image, and color image pixels corresponding to a plurality of character center pixels constituting the thinned character. The character color is determined based on the color. Then, the character area of the binary image obtained by binarizing the color image is encoded, and the code data, the determined character color, and the position of the character area in the color image are associated with each other. Since the character area of the binary image obtained by binarizing the color image is encoded, the entire character can be binarized as a character of the same color. In addition, since the character color is determined based on the color of the pixel of the image data corresponding to the plurality of character center pixels constituting the thinned character, the character color can be accurately detected. As a result, the color of characters in the image can be detected easily and with high accuracy.

また、主走査方向と副走査方向それぞれで、連続して配置された黒画素のうちから中心の画素を文字中心画素として抽出するので、文字中心画素を容易に抽出することができ、処理速度を速くすることができる。 In addition, since the center pixel is extracted as the character center pixel from the continuously arranged black pixels in each of the main scanning direction and the sub-scanning direction, the character center pixel can be easily extracted, and the processing speed can be reduced. Can be fast.

さらに、主走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素であって、副走査方向に連続して配置された複数の黒画素の両端から所定数の画素を除いた少なくとも１つの画素を文字中心画素とするので、文字中心画素を容易に抽出することができ、処理速度を早くすることができる。 Furthermore, at least one pixel obtained by removing a predetermined number of pixels from both ends of a plurality of black pixels arranged continuously in the main scanning direction, and both ends of the plurality of black pixels arranged continuously in the sub-scanning direction Since at least one pixel excluding a predetermined number of pixels is used as the character center pixel, the character center pixel can be easily extracted, and the processing speed can be increased.

なお、上述した実施の形態においては画像処理装置の一例としてＭＦＰ１００について説明したが、図１６〜図２１に示した圧縮処理をＭＦＰ１００に実行させるための画像圧縮方法、または図１６〜図２１に示した圧縮処理をＭＦＰ１００またはＰＣ３，３Ａ，３Ｂ、３Ｃに実行させるための画像圧縮プログラムとして発明を捉えることができるのは言うまでもない。 In the above-described embodiment, the MFP 100 has been described as an example of the image processing apparatus. However, an image compression method for causing the MFP 100 to execute the compression processing illustrated in FIGS. 16 to 21 or illustrated in FIGS. Needless to say, the invention can be understood as an image compression program for causing the MFP 100 or the PCs 3, 3A, 3B, and 3C to execute the compression process.

今回開示された実施の形態はすべての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The embodiment disclosed this time should be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the description above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.

＜付記＞
（１）前記第１のしきい値は、度数の最大値に基づき定められる、請求項４に記載の画像処理装置。
（２）前記属性変更手段は、度数が第１しきい値を超える色相階級が第２のしきい値を超えて連続する場合、前記文字領域の属性を文字属性以外の属性に変更する、請求項８または９に記載の画像処理装置。
（３）前記文字色決定手段は、前記複数の色相階級のうちから度数が第１のしきい値を超える色相階級を代表色相として選択する選択手段をさらに含み、
前記属性変更手段は、度数が第１しきい値を超える代表色相が第３のしきい値を越えて存在する場合、前記文字領域の属性を文字属性以外の属性に変更する、請求項８または９に記載の画像処理装置。
（４）前記文字領域符号化手段は、前記２値化データを可逆圧縮する可逆圧縮手段を含む、請求項１に記載の画像処理装置。
（５）カラー画像から写真または図形が表された写真／図属性の写真／図領域を抽出する写真／図領域抽出手段と、
前記写真／図領域のカラー画像を符号化する写真／図形領域符号化手段と、をさらに備えた請求項１に記載の画像処理装置。
（６）前記文字領域符号化手段は、前記２値画像の前記文字領域を可逆圧縮する可逆圧縮手段を含み、
前記写真／図形領域符号化手段は、前記カラー画像の前記写真／図領域を非可逆圧縮する非可逆圧縮手段を含む、（５）に記載の画像処理装置。 <Appendix>
(1) The image processing apparatus according to claim 4, wherein the first threshold value is determined based on a maximum frequency value.
(2) The attribute changing unit changes the attribute of the character region to an attribute other than the character attribute when the hue class whose frequency exceeds the first threshold value exceeds the second threshold value. Item 10. The image processing device according to Item 8 or 9.
(3) The character color determination means further includes a selection means for selecting, as a representative hue, a hue class whose frequency exceeds a first threshold value from the plurality of hue classes.
The attribute changing unit changes the attribute of the character region to an attribute other than a character attribute when a representative hue whose frequency exceeds a first threshold exceeds a third threshold. The image processing apparatus according to 9.
(4) The image processing apparatus according to claim 1, wherein the character area encoding unit includes a reversible compression unit that reversibly compresses the binarized data.
(5) a photograph / figure area extracting means for extracting a photograph / figure attributed photograph / figure area representing a photograph or a figure from a color image;
The image processing apparatus according to claim 1, further comprising a photograph / graphic region encoding unit that encodes a color image of the photograph / graphic region.
(6) The character area encoding means includes lossless compression means for reversibly compressing the character area of the binary image,
The image processing apparatus according to (5), wherein the photograph / graphic region encoding unit includes an irreversible compression unit that performs irreversible compression on the photograph / figure region of the color image.

本発明の実施の形態の一つにおける画像形成システムの全体概要を示す図である。1 is a diagram showing an overall outline of an image forming system in one embodiment of the present invention. ＭＦＰの外観を示す斜視図である。1 is a perspective view showing an appearance of an MFP. ＭＦＰのハード構成の一例を示すブロック図である。2 is a block diagram illustrating an example of a hardware configuration of an MFP. FIG. 画像処理部の機能の一例を示す機能ブロック図である。It is a functional block diagram which shows an example of the function of an image process part. 文字領域抽出部の詳細な機能を示す機能ブロック図である。It is a functional block diagram which shows the detailed function of a character area extraction part. ノイズ除去部３２０の詳細な機能を示す機能ブロック図である。3 is a functional block diagram showing detailed functions of a noise removing unit 320. FIG. 文字色決定部の詳細な機能を示す機能ブロック図である。It is a functional block diagram which shows the detailed function of a character color determination part. 文字「Ａ」を含む２値画像の一例を示す図である。It is a figure which shows an example of the binary image containing the character "A". （Ａ）は主走査方向中心画素の一例を示す図であり、（Ｂ）は副走査方向中心画素の一例を示す図であり、（Ｃ）は文字中心画素の一例を示す図である。(A) is a figure which shows an example of a main scanning direction center pixel, (B) is a figure which shows an example of a subscanning direction center pixel, (C) is a figure which shows an example of a character center pixel. 色相ヒストグラムの階級の一例を示す図である。It is a figure which shows an example of the class of a hue histogram. 色相ヒストグラムの一例を示す第１の図である。It is a 1st figure which shows an example of a hue histogram. 色相ヒストグラムの一例を示す第２の図である。It is a 2nd figure which shows an example of a hue histogram. 色相ヒストグラムの一例を示す第３の図である。It is a 3rd figure which shows an example of a hue histogram. 色相ヒストグラムの一例を示す第４の図である。It is a 4th figure which shows an example of a hue histogram. 文字領域符号化部の詳細な機能を示す機能ブロック図である。It is a functional block diagram which shows the detailed function of a character area encoding part. 圧縮処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of a compression process. 前処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of pre-processing. 写真／図領域抽出処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of a photograph / figure area | region extraction process. 文字領域抽出処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of a character area extraction process. ノイズ除去処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of a noise removal process. 文字色決定処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of a character color determination process. 文字領域符号化処理の流れの一例を示すフローチャートである。It is a flowchart which shows an example of the flow of a character area encoding process.

Explanation of symbols

１画像形成システム、２ネットワーク、３，３Ａ，３Ｂ，３Ｃコンピュータ、４デジタルカメラ、５デジタルビデオカメラ、６携帯情報端末、９操作パネル、２０画像読取部、２０画像読取部、３０画像処理部、４０画像形成部、５０給紙部、６０ファクシミリ部、６１通信制御部、１００ＭＦＰ、１０１メイン回路、１１１ＣＰＵ、１１２ＲＡＭ、１１３ＲＯＭ、１１４表示部、１１５操作部、１１６ＨＤＤ、１１７データ通信制御部、１１８ＬＡＮ端子、１１９シリアル通信インターフェース端子、１１９Ａメモリカード、３０１前処理部、３１０文字領域抽出部、３１１エッジ画像生成部、３１２傾き補正部、３１３罫線除去部、３１４候補ブロック抽出部、３１５判定部、３１６文字領域決定部、３１７第１属性変更部、３２０ノイズ除去部、３２１明度ヒストグラム生成部、３２３網点・背景除去部、３２５第２属性変更部、３３０文字色決定部、３３１細線化部、３３２色相ヒストグラム生成部、３３３ノイズ色相除去部、３３４隣接色相統合部、３３５第３属性変更部、３３６代表色相選択部、３３７第４属性変更部、３３８文字色算出部、３４０文字領域符号化部、３４１値画像生成部、３４３文字色別符号化部、３５０写真／図領域抽出部、３６０写真画像符号化部、３７０統合部。 DESCRIPTION OF SYMBOLS 1 Image forming system 2 Network 3 3, 3A, 3B, 3C Computer 4 Digital camera 5 Digital video camera 6 Mobile information terminal 9 Operation panel 20 Image reading part 20 Image reading part 30 Image processing part 40 image forming unit, 50 paper feeding unit, 60 facsimile unit, 61 communication control unit, 100 MFP, 101 main circuit, 111 CPU, 112 RAM, 113 ROM, 114 display unit, 115 operation unit, 116 HDD, 117 data communication control , 118 LAN terminal, 119 serial communication interface terminal, 119A memory card, 301 preprocessing unit, 310 character area extraction unit, 311 edge image generation unit, 312 tilt correction unit, 313 ruled line removal unit, 314 candidate block extraction unit, 315 Judgment part, 316 sentences Area determination unit, 317 first attribute change unit, 320 noise removal unit, 321 brightness histogram generation unit, 323 halftone dot / background removal unit, 325 second attribute change unit, 330 character color determination unit, 331 thinning unit, 332 hue Histogram generation section, 333 noise hue removal section, 334 adjacent hue integration section, 335 third attribute change section, 336 representative hue selection section, 337 fourth attribute change section, 338 character color calculation section, 340 character area encoding section, 341 Value image generation unit, 343 Character color coding unit, 350 photo / figure region extraction unit, 360 photo image coding unit, 370 integration unit.

Claims

A character area extracting means for extracting a character area having a character attribute representing a character from a color image;
Thinning means for thinning characters included in the character region;
A character color determining means for determining a character color based on a color of a pixel of the color image corresponding to a plurality of character center pixels constituting the thinned character;
Binarization means for converting the color image into a binary image by binarizing;
A character area encoding means for encoding the character area of the binary image and generating code data;
An image processing apparatus comprising: association means for associating the generated code data, the determined character color, and a position of the character region in the color image.

The thinning unit is a main scanning direction detecting unit that extracts a central pixel from black pixels continuously arranged in the main scanning direction in the binary image;
Sub-scanning direction detecting means for extracting a central pixel from black pixels continuously arranged in the sub-scanning direction intersecting with the main scanning direction;
The image processing apparatus according to claim 1, further comprising: a determining unit that determines the center pixel extracted by the main scanning direction detection unit and the center pixel extracted by the sub-scanning direction detection unit as the character center pixel. .

The thinning means extracts main scanning direction detection means for extracting at least one pixel excluding a predetermined number of pixels from both ends of a plurality of black pixels arranged continuously in the main scanning direction as a first candidate pixel;
Sub-scanning direction detecting means for extracting at least one pixel excluding a predetermined number of pixels from both ends of a plurality of black pixels continuously arranged in the sub-scanning direction intersecting with the main scanning direction;
The image processing apparatus according to claim 1, further comprising: a determination unit that determines a pixel that is both the first candidate pixel and the second candidate pixel as the character center pixel.

The character color determining means is a classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on a color before thinning of each of the plurality of character center pixels.
The image processing apparatus according to claim 1, further comprising a noise removing unit that removes, as noise, pixels classified into a hue class whose frequency is equal to or less than a first threshold value.

The character color determining means includes a classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on a color before thinning of each of the plurality of character center pixels; Selection means for selecting, as a representative hue, a hue class whose frequency exceeds a first threshold value from among the classes,
The image processing device according to claim 1, wherein the character color is determined based on a color before thinning of the character center pixel classified into a hue class of the selected representative hue among the plurality of character center pixels. .

The selection means includes hue integration means for selecting, as a representative hue, a hue class having the highest frequency among consecutive hue classes when two or more hue classes having a frequency exceeding the first threshold value are consecutive. 5. The image processing apparatus according to 5.

The character color determining means determines a plurality of the character colors respectively corresponding to the plurality of representative hues when a plurality of the representative hues are selected by the selecting means;
The character region encoding means includes, in each of the plurality of representative hues, a color of a pixel of the color image corresponding to the black pixel among the black pixels included in the character region of the binary image. Binarized data generation unit for each character color that generates a binary image for each character color corresponding to the representative hue by extracting pixels that approximate the corresponding character color;
A character color coding unit that codes a plurality of character color binarized data respectively corresponding to the determined plurality of character colors, and generates character color code data.
The association means associates, for each of the determined plurality of character colors, the character color, the character color code data corresponding to the character color, and the position of the character region in the image data. The image processing apparatus according to claim 5, comprising color-specific association means.

The character color determining means includes a classification means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on a color before thinning of each of the plurality of character center pixels; The image processing apparatus according to claim 1, further comprising: an attribute changing unit that changes an attribute of the character region to an attribute other than the character attribute based on a class frequency distribution.

A character area extracting means for extracting a character area having a character attribute representing a character from a color image;
A photograph / figure region extracting means for extracting a photograph / figure attribute / photo / figure region representing a photograph or a figure from a color image;
Thinning means for thinning characters included in the character region;
Classifying means for classifying each of the plurality of character center pixels into one of a plurality of hue classes based on the color before thinning of each of the plurality of character center pixels constituting the thinned character;
An image processing apparatus comprising: an attribute changing unit that changes an attribute of the character region to an attribute other than the character attribute based on the frequency distribution of the plurality of hue classes.

Extracting a character region having a character attribute representing a character from a color image;
Thinning characters included in the character region;
Determining a character color based on a color of a pixel of the color image corresponding to a plurality of character center pixels constituting the thinned character;
Converting the color image into a binary image by binarizing the color image;
Encoding the character region of the binary image to generate code data;
An image processing method comprising: associating the generated code data, the determined character color, and a position of the character region in the color image.

Extracting a character region having a character attribute representing a character from a color image;
Extracting a photo / picture attribute of a picture / picture attribute representing a picture or figure from a color image;
Thinning characters included in the character region;
Classifying each of the plurality of character center pixels into one of a plurality of hue classes based on the color before thinning of each of the plurality of character center pixels constituting the thinned character;
Changing the attribute of the character region to an attribute other than the character attribute based on the frequency distribution of the plurality of hue classes.

Extracting a character region having a character attribute representing a character from a color image;
Thinning characters included in the character region;
Determining a character color based on a color of a pixel of the color image corresponding to a plurality of character center pixels constituting the thinned character;
Converting the color image into a binary image by binarizing the color image;
Encoding the character region of the binary image to generate code data;
An image processing program that causes a computer to execute the step of associating the generated code data, the determined character color, and the position of the character region in the color image.

Extracting a character region having a character attribute representing a character from a color image;
Extracting a photo / picture attribute of a picture / picture attribute representing a picture or figure from a color image;
Thinning characters included in the character region;
Classifying each of the plurality of character center pixels into one of a plurality of hue classes based on the color before thinning of each of the plurality of character center pixels constituting the thinned character;
An image processing program for causing a computer to execute the step of changing the attribute of the character region to an attribute other than the character attribute based on the frequency distribution of the plurality of hue classes.