JP4442651B2

JP4442651B2 - Image processing apparatus and program

Info

Publication number: JP4442651B2
Application number: JP2007176900A
Authority: JP
Inventors: 俊浩森
Original assignee: Konica Minolta Business Technologies Inc
Current assignee: Konica Minolta Business Technologies Inc
Priority date: 2007-07-05
Filing date: 2007-07-05
Publication date: 2010-03-31
Anticipated expiration: 2027-07-05
Also published as: JP2009017247A

Description

本発明は、画像処理技術に関し、特にスキャン画像に対する画像処理技術に関する。 The present invention relates to an image processing technique, and more particularly to an image processing technique for a scanned image.

スキャナ機器で文書（紙原稿等）を読み取ったスキャン画像を高い圧縮率で圧縮し、比較的小さな画像サイズのファイルを生成して保存する技術が存在する。このようなファイルとしては、例えば、高圧縮ＰＤＦなどのＰＤＦ（Portable Document Format）ファイルが存在する。 There is a technique for compressing a scanned image obtained by scanning a document (paper document or the like) with a scanner device at a high compression rate, and generating and storing a file having a relatively small image size. Examples of such a file include a PDF (Portable Document Format) file such as a highly compressed PDF.

このようなファイルにおいては、文字領域とその他の領域とが区別され、各領域の特質に応じた処理が施されることによって、文字の判読性とファイル容量の低減とが実現される。具体的には、文字領域に関しては、文字の判読性を保つため、高解像度のまま二値化処理が施されて生成されたデータが保存される。一方、文字領域以外の領域に関しては、階調性を保ちつつファイル容量を低減するため、二値化処理が施されることなく低解像度化された後に高い圧縮率で圧縮されたデータが保存される。また、文字領域に関しては、所定の範囲（単語あるいは行等）を１つの単位として、当該単位ごとに一つの代表色が文字色として決定される。これによれば、ファイルサイズの更なる低減を図ることができる。 In such a file, a character area is distinguished from other areas, and processing according to the characteristics of each area is performed, so that character readability and file capacity reduction are realized. Specifically, with respect to the character area, in order to maintain the legibility of the character, data generated by performing binarization processing with high resolution is stored. On the other hand, for areas other than the character area, data that has been compressed at a high compression rate after being reduced in resolution without being binarized is stored in order to reduce the file capacity while maintaining gradation. The As for the character area, a predetermined range (word or line) is set as one unit, and one representative color is determined as the character color for each unit. According to this, the file size can be further reduced.

このようなファイルにおいては、文字が小さくなると様々な問題が生じる。例えば、文字が小さくなると、文字のエッジの抽出が困難になる。このような問題を解決する技術としては、例えば特許文献１に記載の技術が存在する。 In such a file, various problems occur when the characters become smaller. For example, when the character becomes small, it becomes difficult to extract the edge of the character. As a technique for solving such a problem, there is a technique described in Patent Document 1, for example.

特開２００２−１００７５号公報JP 2002-10075 A

ところで、上記のようなファイルに関しては、別の問題、具体的には文字の色に関する問題も存在する。 By the way, regarding the above file, there is another problem, specifically, a problem regarding the color of characters.

詳細には、文字が細い場合には、スキャン時の条件等によっては、文字のエッジ付近の画素の色が下地の色と混ざりやすくなる。例えば、文字が赤色且つ下地が緑色の場合には、スキャン後のファイル（スキャンファイルとも称する）における文字の赤色は、本来の「赤」ではなく、緑色（ないし黄色）側に寄った色になる。すなわち、スキャンファイルの文字色が、背景色の影響を受けて、原稿（紙文書等）上での本来の色とは異なる色になってしまう。そして、この結果、文字の判読性が低下するという問題が存在する。 Specifically, when the character is thin, the color of the pixel near the edge of the character is likely to be mixed with the background color depending on scanning conditions and the like. For example, when the character is red and the background is green, the red color of the character in the scanned file (also referred to as a scan file) is not the original “red” but a color closer to the green (or yellow) side. . In other words, the character color of the scan file is influenced by the background color and becomes a color different from the original color on the original (paper document or the like). As a result, there is a problem that the legibility of the character is lowered.

そこで、この発明の課題は、スキャン画像における背景色の影響を排除して文字の判読性を向上することが可能な技術を提供することにある。 SUMMARY OF THE INVENTION An object of the present invention is to provide a technique capable of improving the legibility of a character by eliminating the influence of a background color in a scanned image.

上記課題を解決すべく、請求項１の発明は、画像処理装置であって、スキャン画像に対して二値化処理を施し、文字に対応する部分である文字部を背景から分離して抽出する文字部抽出手段と、前記文字部の代表色である文字色を決定する決定手段であって、前記文字部の色成分値に基づいて前記文字部の色の候補である第１の色を求め、前記第１の色と前記スキャン画像における前記背景の色である背景色との差を拡大するように前記第１の色を補正して得られる第２の色を前記文字色として決定する決定手段と、前記文字の太さを検出する検出手段と、を備え、前記決定手段は、前記検出手段によって検出された前記文字の太さに応じて、補正時における前記差の拡大の程度を変更するものであって、前記文字が細くなるにつれて、前記差の拡大の程度を大きくするものであることを特徴とする In order to solve the above problems, the invention of claim 1 is an image processing apparatus, which performs binarization processing on a scanned image and extracts a character portion corresponding to a character from a background. A character part extraction unit and a determination unit that determines a character color that is a representative color of the character part, and obtains a first color that is a candidate color of the character part based on a color component value of the character part; Determination to determine, as the character color, a second color obtained by correcting the first color so as to enlarge a difference between the first color and a background color that is the background color in the scanned image And a detecting means for detecting the thickness of the character, wherein the determining means changes the degree of enlargement of the difference at the time of correction according to the thickness of the character detected by the detecting means. as be one which in the character becomes narrower, the Characterized in that it is intended to increase the degree of expansion

請求項２の発明は、請求項１の発明に係る画像処理装置において、前記検出手段は、前記文字部の画素総数に対する非境界画素の割合を前記文字の太さとして算出することを特徴とする。 According to a second aspect of the present invention, in the image processing apparatus according to the first aspect of the invention, the detecting means calculates a ratio of non-boundary pixels to the total number of pixels of the character portion as the thickness of the character. .

請求項３の発明は、請求項１または請求項２の発明に係る画像処理装置において、前記決定手段は、前記文字部に含まれる少なくとも一部の画素の平均色を前記第１の色として決定することを特徴とする。 According to a third aspect of the invention, in the image processing apparatus according to the first or second aspect of the invention, the determining means determines an average color of at least some of the pixels included in the character portion as the first color. It is characterized by doing.

請求項４の発明は、請求項１または請求項２の発明に係る画像処理装置において、前記決定手段は、前記文字部に含まれる代表画素の色を前記第１の色として決定することを特徴とする。 According to a fourth aspect of the invention, in the image processing apparatus according to the first or second aspect of the invention, the determining means determines the color of a representative pixel included in the character portion as the first color. And

請求項５の発明は、コンピュータに、ａ）スキャン画像に対して二値化処理を施し、文字に対応する部分である文字部を背景から分離して抽出する手順と、ｂ）前記文字の太さを検出する手順と、ｃ）前記文字部の色成分値に基づいて前記文字部の色の候補である第１の色を求め、前記第１の色と前記スキャン画像における前記背景の色である背景色との差を拡大するように前記第１の色を補正して得られる第２の色を、前記文字部の代表色である文字色として決定する手順と、を実行させるためのプログラムであって、前記手順ｃ）は、ｃ−１）前記手順ｂ）で検出された前記文字の太さに応じて、補正時における前記差の拡大の程度を変更する手順であって、前記文字が細くなるにつれて前記差の拡大の程度が大きくなるように、前記差の拡大の程度を変更する手順を含むことを特徴とする。
請求項６の発明は、請求項５の発明に係るプログラムにおいて、前記手順ｂ）は、前記文字部の画素総数に対する非境界画素の割合を前記文字の太さとして算出する手順を含むことを特徴とする。
請求項７の発明は、請求項５または請求項６の発明に係るプログラムにおいて、前記手順ｃ）は、前記文字部に含まれる少なくとも一部の画素の平均色を前記第１の色として決定する手順を含むことを特徴とする。
請求項８の発明は、請求項５または請求項６の発明に係るプログラムにおいて、前記手順ｃ）は、前記文字部に含まれる代表画素の色を前記第１の色として決定する手順を含むことを特徴とする。 The invention according to claim 5 is: a) a procedure for performing binarization processing on a scanned image and extracting a character part, which is a part corresponding to a character, from a background; b) a thickness of the character And c) obtaining a first color that is a candidate for the color of the character portion based on the color component value of the character portion, and using the first color and the background color in the scanned image A program for executing a procedure for determining a second color obtained by correcting the first color so as to enlarge a difference from a certain background color as a character color that is a representative color of the character portion. a is one of the steps c) is, c-1) the procedure b) in accordance with the thickness of said detected characters, a procedure for changing the degree of expansion of the difference in the correction time, the character The difference is increased so that the degree of enlargement of the difference increases as the thickness becomes thinner. The method includes a procedure for changing the degree of enlargement .
According to a sixth aspect of the invention, in the program according to the fifth aspect of the invention, the step b) includes a step of calculating a ratio of non-boundary pixels with respect to the total number of pixels of the character portion as the thickness of the character. And
According to a seventh aspect of the invention, in the program according to the fifth or sixth aspect of the invention, the procedure c) determines an average color of at least some of the pixels included in the character portion as the first color. It includes a procedure.
According to an eighth aspect of the present invention, in the program according to the fifth or sixth aspect , the step c) includes a step of determining a color of a representative pixel included in the character portion as the first color. It is characterized by.

請求項１ないし請求項８に記載の発明によれば、文字部の色成分値に基づいて文字部の色の候補である第１の色が求められ、当該第１の色とスキャン画像における背景の色である背景色との差を拡大するように第１の色を補正して得られる第２の色が文字色として決定されるので、スキャン画像における背景色の影響を排除して文字の判読性を向上することができる。また特に、検出された文字の太さに応じて第１の色と背景色との差の拡大の程度が変更され、より詳細には、当該文字が細くなるにつれて当該差の拡大の程度が大きくされるので、文字の太さに応じて補正の程度の適正化を図ることが可能である。 According to the first to eighth aspects of the present invention, the first color that is a candidate for the color of the character portion is obtained based on the color component value of the character portion, and the background in the first color and the scanned image is obtained. Since the second color obtained by correcting the first color so as to enlarge the difference from the background color which is the color of the character is determined as the character color, the influence of the background color in the scanned image is eliminated and the character color The legibility can be improved. In particular, the degree of enlargement of the difference between the first color and the background color is changed according to the detected thickness of the character , and more specifically, the degree of enlargement of the difference increases as the character becomes thinner. since the, it is possible to achieve an appropriate degree of correction in accordance with the thickness of a character.

＜１．第１実施形態＞
＜１−１．構成＞
図１は、第１実施形態に係る原稿電子化システム１００（１００Ａ）を示す概略図である。ここでは、原稿電子化システム１００は、マルチ・ファンクション・ペリフェラル（ＭＦＰとも略称する）１（１Ａ）を備えて構成される。ＭＦＰ１は、その画像処理部３を用いて所定の画像処理を実行することによって、画像処理装置としても機能する。 <1. First Embodiment>
<1-1. Configuration>
FIG. 1 is a schematic view showing an original digitizing system 100 (100A) according to the first embodiment. Here, the electronic document system 100 includes a multi-function peripheral (abbreviated as MFP) 1 (1A). The MFP 1 also functions as an image processing apparatus by executing predetermined image processing using the image processing unit 3.

この原稿電子化システム１００は、原稿（紙文書等）を光学的に読み取って得られる画像（原稿画像とも称する）を電子データ化するシステムである。詳細には、ＭＦＰ１が原稿画像を生成するとともに当該原稿画像に対する画像処理等を実行することによって、原稿が電子データ化される。 The document digitizing system 100 is a system that converts an image (also referred to as a document image) obtained by optically reading a document (such as a paper document) into electronic data. Specifically, when the MFP 1 generates a document image and executes image processing on the document image, the document is converted into electronic data.

ＭＦＰ１は、スキャナ機能、プリンタ機能、コピー機能およびファクシミリ機能などを備える装置（複合機とも称する）である。具体的には、ＭＦＰ１は、画像読取部２と、画像処理部３と、印刷出力部４と、通信部５と、入出力部６と、格納部８と、コントローラ９とを備えており、これらの各部を複合的に動作させることによって、上記の各機能を実現する。 The MFP 1 is a device (also referred to as a multi function device) having a scanner function, a printer function, a copy function, a facsimile function, and the like. Specifically, the MFP 1 includes an image reading unit 2, an image processing unit 3, a print output unit 4, a communication unit 5, an input / output unit 6, a storage unit 8, and a controller 9. The above functions are realized by operating these units in a complex manner.

画像読取部２は、ＭＦＰ１の所定の位置に載置された原稿を光学的に読み取って、当該原稿の画像（原稿画像とも称する）を生成する処理部である。画像読取部２は、スキャナ部とも称される。 The image reading unit 2 is a processing unit that optically reads a document placed at a predetermined position of the MFP 1 and generates an image (also referred to as a document image) of the document. The image reading unit 2 is also referred to as a scanner unit.

画像処理部３は、画像読取部２によって生成されたスキャン画像に対して各種の画像処理を施す処理部である。画像処理部３は、文字部抽出部３１および文字色決定部３２を有している。文字部抽出部３１は、スキャン画像に含まれる文字を背景から分離して抽出する処理等を実行する。また、文字色決定部３２は、文字部抽出部３１によって抽出された文字部の色（代表色）を決定する処理等を実行する。これらの処理については後に詳述する。 The image processing unit 3 is a processing unit that performs various types of image processing on the scanned image generated by the image reading unit 2. The image processing unit 3 includes a character part extraction unit 31 and a character color determination unit 32. The character part extraction unit 31 performs a process of separating and extracting characters included in the scanned image from the background. Further, the character color determination unit 32 executes a process for determining the color (representative color) of the character part extracted by the character part extraction unit 31. These processes will be described in detail later.

印刷出力部４は、対象画像に関する画像データに基づいて紙などの各種の媒体に画像を印刷出力する出力部である。 The print output unit 4 is an output unit that prints an image on various media such as paper based on image data related to the target image.

通信部５は、公衆回線等を介したファクシミリ通信を行うことが可能な処理部である。また、通信部５は、通信ネットワークＮＷを介したネットワーク通信が可能である。このネットワーク通信を利用することによって、ＭＦＰ１は、所望の相手先との間で各種のデータを授受することが可能である。また、ＭＦＰ１は、このネットワーク通信を利用することによって、電子メールの送受信を行うことも可能である。 The communication unit 5 is a processing unit capable of performing facsimile communication via a public line or the like. The communication unit 5 can perform network communication via the communication network NW. By using this network communication, the MFP 1 can exchange various data with a desired destination. The MFP 1 can also send and receive e-mails using this network communication.

入出力部６は、ＭＦＰ１に対する入力を受け付ける操作入力部６１と、各種情報の表示出力を行う表示部６２とを備えている。 The input / output unit 6 includes an operation input unit 61 that receives input to the MFP 1 and a display unit 62 that displays and outputs various types of information.

格納部８は、ハードディスクドライブ（ＨＤＤ）等の格納装置で構成される。この格納部８には、画像読取部２等で生成された原稿画像等が格納される。 The storage unit 8 includes a storage device such as a hard disk drive (HDD). The storage unit 8 stores a document image or the like generated by the image reading unit 2 or the like.

コントローラ９は、ＭＦＰ１を統括的に制御する制御装置であり、ＣＰＵと、各種の半導体メモリ（ＲＡＭおよびＲＯＭ等）とを備えて構成される。コントローラ９の制御下において各種の処理部が動作することによって、ＭＦＰ１の各種の機能が実現される。例えば、コントローラ９の制御下において、画像読取部２を用いて所望の画像を光学的に読み取ることによって、原稿をスキャニングした画像（原稿画像ないしスキャン画像とも称する）が取得され、スキャナ機能が実現される。 The controller 9 is a control device that controls the MFP 1 in an integrated manner, and includes a CPU and various semiconductor memories (such as a RAM and a ROM). Various functions of the MFP 1 are realized by operating various processing units under the control of the controller 9. For example, by scanning a desired image optically using the image reading unit 2 under the control of the controller 9, an image obtained by scanning a document (also referred to as a document image or a scanned image) is acquired, and a scanner function is realized. The

＜１−２．画像処理の概要＞
ＭＦＰ１は、スキャン画像に対して所定の画像処理を施して、所定の形式のファイル（例えば、ＰＤＦ（Portable Document Format）ファイル）を生成する機能を有している。以下では、このようなファイル生成処理について説明する。なお、このファイル生成機能は、画像処理部３などによって実現される。 <1-2. Overview of image processing>
The MFP 1 has a function of performing predetermined image processing on a scanned image to generate a file in a predetermined format (for example, a PDF (Portable Document Format) file). Hereinafter, such a file generation process will be described. This file generation function is realized by the image processing unit 3 or the like.

図２に示されるように、まず、画像読取部２によって取得されたスキャン画像に対して、所定の前処理が施される（ステップＳ１１）。この前処理には、例えば画像形式の変換処理、および解像度の調整処理等が含まれ得る。 As shown in FIG. 2, first, a predetermined pre-process is performed on the scanned image acquired by the image reading unit 2 (step S11). This preprocessing can include, for example, image format conversion processing, resolution adjustment processing, and the like.

次に、文字に関する領域である文字領域（第１の領域とも称する）とその他の領域（第２の領域とも称する）とを区別する領域判別処理が実行される（ステップＳ１３）。その後、各領域の特質に応じた処理が施されることによって、文字の判読性を維持しつつファイル容量を低減することが可能である。なお、第２の領域としては、写真あるいは図表等を含む領域が例示される。 Next, an area determination process for distinguishing between a character area (also referred to as a first area) that is an area related to characters and another area (also referred to as a second area) is executed (step S13). Thereafter, by performing processing according to the characteristics of each area, it is possible to reduce the file capacity while maintaining the legibility of the characters. In addition, as a 2nd area | region, the area | region containing a photograph or a chart is illustrated.

文字領域（第１の領域）の画像に関しては、文字領域に対する処理（ステップＳ１５）および可逆圧縮処理（ステップＳ１６）が施される。文字領域に関しては、文字の判読性を保つため、高解像度のまま二値化処理等が施されてデータＤＴが生成される。また、このデータＤＴは、所定形式（ＭＭＲ（Modified Modified READ）等）で可逆圧縮されて保存される。可逆圧縮することによれば、文字の判読性が低下することを回避できる。 With respect to the image of the character area (first area), the process for the character area (step S15) and the lossless compression process (step S16) are performed. With respect to the character region, in order to maintain the legibility of the character, binarization processing or the like is performed with high resolution, and data DT is generated. The data DT is stored in a reversible compression format in a predetermined format (MMR (Modified Modified READ) or the like). According to the reversible compression, it is possible to avoid deterioration of the legibility of characters.

一方、文字領域以外の領域（第２の領域）の画像に関しては、階調性を保ちつつファイル容量を低減するため、二値化処理が施されることなく低解像度化処理が施されてデータＤＧが生成される（ステップＳ１７，Ｓ１８）。また、このデータＤＧは、所定形式（ＪＰＥＧ等）で非可逆圧縮されて保存される。非可逆圧縮することによれば、生成されるファイルのファイルサイズを非常に小さくすることが可能である。 On the other hand, regarding the image of the area other than the character area (second area), the data is subjected to the resolution reduction process without being subjected to the binarization process in order to reduce the file capacity while maintaining the gradation. A DG is generated (steps S17 and S18). The data DG is stored in a predetermined format (such as JPEG) after being irreversibly compressed. By irreversible compression, the file size of the generated file can be made very small.

そして、両データＤＴ，ＤＧが合成されることなどによって、所定の形式のファイルが生成される（ステップＳ１９）。 Then, a file in a predetermined format is generated by combining both data DT and DG (step S19).

＜１−３．文字データ生成処理＞
次に、文字領域に対する画像処理（ステップＳ１５）についてさらに詳細に説明する。 <1-3. Character data generation processing>
Next, the image processing (step S15) for the character area will be described in more detail.

図３は、ステップＳ１５における処理の概要を示す図である。また、図４〜図１１は、白色背景の赤色文字に関する処理結果の変遷を示す図である。詳細には、図４〜図７は、６ポイントの大きさを有する文字「Ｉ」に関する処理結果の変遷を示す図であり、図８〜図１１は、１０．５ポイントの大きさを有する文字「Ｉ」に関する処理結果の変遷を示す図である。図４および図８は、原稿上の文字「Ｉ」を示す図であり、図５および図９は、スキャン画像における文字「Ｉ」を示す図である。例えば、上述のようにＭＦＰ１が原稿をスキャンしてスキャン画像を生成すると、原稿上の６ポイント文字「Ｉ」（図４参照）は、スキャン画像において図５に示すような状態で得られる。同様に、原稿上の１０．５ポイント文字「Ｉ」（図８参照）は、スキャン画像において図９に示すような状態で得られる。図５および図９の状態においては、各画素が多階調値を有している。また、図６および図１０は、スキャン画像を二値化して得られる画像、詳細には、後述する補正処理（ステップＳ２４）前の画像における文字「Ｉ」を示す図であり、図７および図１１は、当該補正処理（ステップＳ２４）後の画像における文字「Ｉ」を示す図である。 FIG. 3 is a diagram showing an overview of the processing in step S15. Moreover, FIGS. 4-11 is a figure which shows the transition of the processing result regarding the red character of a white background. Specifically, FIGS. 4 to 7 are diagrams showing the transition of the processing result related to the character “I” having a size of 6 points, and FIGS. 8 to 11 are characters having a size of 10.5 points. It is a figure which shows the transition of the processing result regarding "I". 4 and 8 are diagrams showing the letter “I” on the document, and FIGS. 5 and 9 are diagrams showing the letter “I” in the scanned image. For example, when the MFP 1 scans an original to generate a scanned image as described above, the 6-point character “I” (see FIG. 4) on the original is obtained in the state shown in FIG. 5 in the scanned image. Similarly, the 10.5 point character “I” (see FIG. 8) on the document is obtained in the scan image as shown in FIG. 5 and 9, each pixel has a multi-tone value. 6 and 10 are diagrams showing a character “I” in an image obtained by binarizing the scanned image, specifically, an image before a correction process (step S24) described later, and FIG. 7 and FIG. 11 is a diagram showing the character “I” in the image after the correction process (step S24).

ここにおいて、ＭＦＰ１は、カラー画像をスキャン画像として取得しているものとする。具体的には、Ｒ（レッド）成分、Ｇ（グリーン）成分、Ｂ（ブルー）成分の３つの三原色成分で構成されるカラー画像(ＲＧＢ画像とも称する)が、スキャン画像として取得されているものとする。各色成分画像（すなわち、Ｒ成分画像、Ｇ成分画像、Ｂ成分画像）は、それぞれ、所定ビット（例えば８ビット）に対応する多段階（例えば２５６段階）の階調値を有している。したがって、ＲＧＢ画像は、各色成分の所定ビット数の３倍のビット数（例えば２４ビット）のデータとして構成される。 Here, it is assumed that the MFP 1 acquires a color image as a scan image. Specifically, a color image (also referred to as an RGB image) composed of three primary color components of an R (red) component, a G (green) component, and a B (blue) component is acquired as a scan image. To do. Each color component image (that is, an R component image, a G component image, and a B component image) has gradation values of multiple levels (for example, 256 levels) corresponding to predetermined bits (for example, 8 bits). Therefore, the RGB image is configured as data having a bit number (for example, 24 bits) that is three times the predetermined bit number of each color component.

この文字データの生成処理においては、ＲＧＢ画像のデータ（単にＲＧＢデータとも称する）の他に、ＲＧＢデータに基づいて「明度データ」をも生成する。この明度データは、その各画素の画素値を、ＲＧＢデータの各色成分値にそれぞれ対応係数を乗じた値を加算した値として算出することによって、取得される。また、明度データは、所定ビット数（例えば８ビット）に応じた階調値を有する値として取得される。明度データは、グレースケール画像データとも表現される。以下の文字抽出処理等においては基本的に明度データを用いることによって、演算負荷を低減することが可能である。 In this character data generation process, in addition to RGB image data (also simply referred to as RGB data), “brightness data” is also generated based on RGB data. The brightness data is obtained by calculating the pixel value of each pixel as a value obtained by adding a value obtained by multiplying each color component value of RGB data by a corresponding coefficient. Further, the brightness data is acquired as a value having a gradation value corresponding to a predetermined number of bits (for example, 8 bits). The brightness data is also expressed as grayscale image data. In the following character extraction processing and the like, the calculation load can be reduced by basically using the brightness data.

ステップＳ２１（図３）においては、文字領域の明度データに対して二値化処理が施される。この二値化処理によって、スキャン画像の文字領域内の「文字」に対応する部分（「文字部」とも称する）が、背景から分離して抽出される。 In step S21 (FIG. 3), binarization processing is performed on the brightness data of the character area. By this binarization processing, a portion (also referred to as “character portion”) corresponding to “character” in the character region of the scanned image is extracted separately from the background.

例えば、図５のスキャン画像上の６ポイント文字「Ｉ」は、ステップＳ２１の二値化処理によって、図６に示すように、文字「Ｉ」を示す画素集合として得られることになる。また、図１２は、この状態を模式的に表現した図である。図１２においては、小さな矩形が１つの画素を表現しており、文字「Ｉ」が６２画素の集合として構成される状態が示されている。換言すれば、全６２画素の集合体である文字部ＣＰによって、文字「Ｉ」が構成されている。 For example, the 6-point character “I” on the scanned image in FIG. 5 is obtained as a pixel set indicating the character “I” as shown in FIG. 6 by the binarization processing in step S21. FIG. 12 is a diagram schematically representing this state. In FIG. 12, a small rectangle represents one pixel, and the state where the character “I” is configured as a set of 62 pixels is shown. In other words, the character “I” is constituted by the character portion CP which is an aggregate of all 62 pixels.

同様に、図９のスキャン画像上の１０．５ポイント文字「Ｉ」は、ステップＳ２１の二値化処理によって、図１０に示すように、文字「Ｉ」を示す画素集合として得られることになる。また、図１３は、この状態を模式的に表現した図である。図１３においては、小さな矩形が１つの画素を表現しており、文字「Ｉ」が１９６画素の集合として構成される状態が示されている。換言すれば、全１９６画素の集合体である文字部ＣＰによって、文字「Ｉ」が構成されている。 Similarly, the 10.5 point character “I” on the scanned image of FIG. 9 is obtained as a pixel set indicating the character “I” as shown in FIG. 10 by the binarization processing in step S21. . FIG. 13 is a diagram schematically representing this state. In FIG. 13, a small rectangle represents one pixel, and the state where the character “I” is configured as a set of 196 pixels is shown. In other words, the character “I” is constituted by the character portion CP which is an aggregate of all 196 pixels.

ここにおいて、この文字部ＣＰの色は、１文字以上を含む所定の範囲（１文字、単語、行、段落、あるいはブロック等）を１つの単位として、当該単位ごとに一つの代表色が、「文字色」として決定される。これによれば、当該文字部ＣＰを構成する画素ごとに異なる色が決定される場合に比べて、ファイルサイズの更なる低減を図ることができる。なお、以下では説明の簡易化のため、１文字単位で文字色を決定する場合について例示する。 Here, the color of the character part CP is a predetermined range (one character, word, line, paragraph, block, etc.) including one or more characters as one unit, and one representative color for each unit is “ It is determined as “character color”. According to this, compared with the case where a different color is determined for each pixel constituting the character part CP, the file size can be further reduced. In the following, for simplification of description, the case where the character color is determined in units of one character will be exemplified.

そのため、第１段階の処理として、ステップＳ２２（図３）の処理が実行される。具体的には、文字部ＣＰに含まれる全画素の平均色を第１の色Ｖ１として求める。換言すれば、文字部ＣＰの代表色（「文字色」）の候補として、第１の色Ｖ１を求める。例えば、６ポイント文字「Ｉ」（図６ないし図１２参照）の全６２画素について、各色成分ごとに平均値を求め、各色成分の平均値で構成される色を「第１の色」として求めることができる。 Therefore, the process of step S22 (FIG. 3) is executed as the first stage process. Specifically, the average color of all the pixels included in the character part CP is obtained as the first color V1. In other words, the first color V1 is obtained as a candidate for the representative color (“character color”) of the character part CP. For example, for all 62 pixels of the 6-point character “I” (see FIGS. 6 to 12), an average value is obtained for each color component, and a color constituted by the average value of each color component is obtained as the “first color”. be able to.

ただし、この第１の色Ｖ１は、原稿における文字の色とは異なっていることがある。特に、細い文字においては、背景色の影響を受けて本来の色よりも背景色に近い色になっていることが多い。例えば、図１４に示すように、本来赤色（Ｒ＝２５５，Ｇ＝０，Ｂ＝０）の６ポイント文字に関する「第１の色」は、背景（下地）の白色（Ｒ＝２５５，Ｇ＝２５５，Ｂ＝２５５）の影響を受けて、少しくすんだ赤色（Ｒ＝２４６，Ｇ＝７９，Ｂ＝８１）になる。なお、図１４は、補正前後の色の変化等を示す図である。 However, the first color V1 may be different from the character color in the document. In particular, thin characters are often closer to the background color than the original color due to the influence of the background color. For example, as shown in FIG. 14, the “first color” related to a six-point character that is originally red (R = 255, G = 0, B = 0) is white (R = 255, G = 255, B = 255), it becomes a slightly dull red color (R = 246, G = 79, B = 81). FIG. 14 is a diagram showing a change in color before and after correction.

そこで、この実施形態においては、ステップＳ２４（図３）において、第１の色Ｖ１と背景の色（背景色）ＶＢとの差を拡大するように第１の色Ｖ１を補正して得られる第２の色Ｖ２を、「文字色」として決定する。 Therefore, in this embodiment, in step S24 (FIG. 3), the first color V1 obtained by correcting the first color V1 so as to increase the difference between the first color V1 and the background color (background color) VB. The second color V2 is determined as the “character color”.

また、ここでは、文字の太さに応じて、補正時における、第１の色Ｖ１と背景色ＶＢとの差の拡大の程度を変更する。これは、文字の混色の程度が文字の太さに応じて異なる、という知見に基づくものである。そのため、ステップＳ２３で文字の太さを求めた後に、ステップＳ２４の文字色の決定動作を実行する。 Here, the degree of enlargement of the difference between the first color V1 and the background color VB at the time of correction is changed according to the thickness of the character. This is based on the knowledge that the degree of color mixing of characters varies depending on the thickness of the characters. Therefore, after obtaining the thickness of the character in step S23, the character color determining operation in step S24 is executed.

まず、ステップＳ２３の処理について説明する。 First, the process of step S23 will be described.

ステップＳ２３では、文字の太さを検出する。 In step S23, the thickness of the character is detected.

具体的には、まず、文字部ＣＰを構成する全画素の数を求めることに加えて、全画素のうち、背景（下地）との境界上の画素（境界画素）を除いた内部画素（非境界画素とも称する）の数を求める。図１２の場合には、全画素数が６２画素、内部画素数が１６画素である、として算出される。 Specifically, first, in addition to obtaining the number of all pixels constituting the character part CP, the internal pixels (non-pixels) excluding the pixels (boundary pixels) on the boundary with the background (background) among all the pixels. The number of boundary pixels). In the case of FIG. 12, it is calculated that the total number of pixels is 62 pixels and the number of internal pixels is 16 pixels.

そして、全画素に対する内部画素（非境界画素）の比率（内部比率）αを算出する。図１２の場合には、内部比率αは、２６％（＝（１６／６２）×１００）、として算出される。 Then, the ratio (internal ratio) α of internal pixels (non-boundary pixels) to all pixels is calculated. In the case of FIG. 12, the internal ratio α is calculated as 26% (= (16/62) × 100).

また、図１３の場合には、全画素数が１９６画素、内部画素数が１０２画素であり、内部比率αが５２％（＝（１０２／１９６）×１００）、として算出される。 In the case of FIG. 13, the total number of pixels is 196 pixels, the number of internal pixels is 102 pixels, and the internal ratio α is calculated as 52% (= (102/196) × 100).

内部比率α、換言すれば文字部ＣＰの画素総数に対する内部画素（非境界画素）の割合は、文字が太くなるにつれて大きな値となる。このような特質を利用して、この実施形態においては、この内部比率αを「文字の太さ」を表す指標値として用いる。 The ratio of the internal pixel (non-boundary pixel) to the internal ratio α, in other words, the total number of pixels of the character part CP increases as the character becomes thicker. Using this characteristic, in this embodiment, the internal ratio α is used as an index value representing “character thickness”.

つぎに、ステップＳ２４の処理について説明する。 Next, the process of step S24 will be described.

ステップＳ２４においては、次の式（１）にしたがって、背景色ＶＢの各成分値（ＶＢｒ，ＶＢｇ，ＶＢｂ）等を用いて上記の第１の色Ｖ１の各成分値（Ｖ１ｒ，Ｖ１ｇ，Ｖ１ｂ）を補正し、第２の色Ｖ２の各成分値（Ｖ２ｒ，Ｖ２ｇ，Ｖ２ｂ）を算出する。なお、式（１）における値βは、適宜の定数であり、例えば０．７５（＝７５％）として定められる。また、背景色ＶＢは、上記の二値化処理（ステップＳ２１）によって文字部ＣＰ以外の部分として判定された背景部分（下地部分）の平均色を求めることによって、得ることができる。 In step S24, the component values (V1r, V1g, V1b) of the first color V1 using the component values (VBr, VBg, VBb) of the background color VB and the like according to the following equation (1). Are corrected, and the component values (V2r, V2g, V2b) of the second color V2 are calculated. Note that the value β in the equation (1) is an appropriate constant, and is determined as 0.75 (= 75%), for example. The background color VB can be obtained by obtaining the average color of the background portion (background portion) determined as a portion other than the character portion CP by the binarization process (step S21).

ただし、式（１）による各算出値（Ｖ２ｒ，Ｖ２ｇ，Ｖ２ｂ）は、それぞれ、最小値Ｖmin（例えば、０（ゼロ））よりも小さい場合には当該最小値Ｖminに修正され、最大値Ｖmax（例えば、２５５）よりも大きい場合には当該最大値Ｖmaxに修正される。 However, each calculated value (V2r, V2g, V2b) according to the equation (1) is corrected to the minimum value Vmin when it is smaller than the minimum value Vmin (for example, 0 (zero)), and the maximum value Vmax ( For example, when it is larger than 255), it is corrected to the maximum value Vmax.

式（１）に基づく補正原理について、図１５等を参照しながら説明する。なお、記号Ｖ１は、ＲＧＢの各色成分値を総称する場合にも用いるものとする。記号Ｖ２，ＶＢについても同様である。 The correction principle based on Expression (1) will be described with reference to FIG. The symbol V1 is also used when collectively referring to the RGB color component values. The same applies to the symbols V2 and VB.

例えば、内部比率αがβ（例えば７５％）に等しい場合には、ｋ＝１となる。このとき、式（１）によって、第２の色の成分値Ｖ２は、第１の色の成分値Ｖ１に等しくなる。特に図１５を参照すると、背景色の成分値ＶＢから、背景色の成分値ＶＢと第１の色の成分値Ｖ１との差（ＶＢ−Ｖ１）を引くと、式（１）と同様に、値Ｖ１になることが判る。 For example, when the internal ratio α is equal to β (for example, 75%), k = 1. At this time, the component value V2 of the second color is equal to the component value V1 of the first color according to the expression (1). Referring to FIG. 15 in particular, when the difference (VB−V1) between the background color component value VB and the first color component value V1 is subtracted from the background color component value VB, as in Equation (1), It can be seen that the value is V1.

一方、内部比率αがβ（例えば７５％）より小さい場合を想定すると、ｋ＞１となる。例えば、α＝２６％、β＝７５％のときには、ｋ＝１．４９（１４９％）となる。この場合、図１５に示すように、背景色の成分値ＶＢから、背景色の成分値ＶＢと第１の色の成分値Ｖ１との差をｋ倍（１．４９倍）に拡大した値（（ＶＢ−Ｖ１）×ｋ）を差し引いた値が値Ｖ２として算出される。換言すれば、第２の色の各成分値（Ｖ２ｒ，Ｖ２ｇ，Ｖ２ｂ）が、第１の色の各対応成分値（Ｖ１ｒ，Ｖ１ｇ，Ｖ１ｂ）と背景色の各対応成分値（ＶＢｒ，ＶＢｇ，ＶＢｂ）との差（詳細には差分値の絶対値）を拡大するように決定される。 On the other hand, assuming that the internal ratio α is smaller than β (for example, 75%), k> 1. For example, when α = 26% and β = 75%, k = 1.49 (149%). In this case, as shown in FIG. 15, a value obtained by expanding the difference between the background color component value VB and the first color component value V1 from the background color component value VB to k times (1.49 times) ( A value obtained by subtracting (VB−V1) × k) is calculated as a value V2. In other words, the component values (V2r, V2g, V2b) of the second color are the corresponding component values (V1r, V1g, V1b) of the first color and the corresponding component values (VBr, VBg, V1b) of the background color. VBb) is determined so as to enlarge the difference (specifically, the absolute value of the difference value).

例えば、文字が細い場合（６ポイント文字の場合）においては、図１４に示すように、色Ｖ１（Ｒ＝２４６，Ｇ＝７９，Ｂ＝８１）は式（１）にしたがって色Ｖ２（Ｒ＝２４２，Ｇ＝０，Ｂ＝０）へと変更される。 For example, when the character is thin (in the case of a 6-point character), as shown in FIG. 14, the color V1 (R = 246, G = 79, B = 81) is the color V2 (R = 242, G = 0, B = 0).

詳細には、Ｒ成分Ｖ２ｒは、Ｖ２ｒ＝２５５−（２５５−２４６）×１．４９＝２４１．６、として算出される。また、Ｇ成分Ｖ２ｇは、Ｖ２ｇ＝２５５−（２５５−７９）×１．４９＝−７．２、として算出され、Ｂ成分Ｖ２ｂは、Ｖ２ｇ＝２５５−（２５５−８１）×１．４９＝−４．３、として算出される。その後、小数点以下を四捨五入して整数化するとともに、負の数をゼロに修正して上述のような値（Ｖ２ｒ＝２４２，Ｖ２ｇ＝０，Ｖ２ｂ＝０）が算出される。 Specifically, the R component V2r is calculated as V2r = 255− (255−246) × 1.49 = 241.6. The G component V2g is calculated as V2g = 255− (255−79) × 1.49 = −7.2, and the B component V2b is calculated as V2g = 255− (255−81) × 1.49 = −. Calculated as 4.3. After that, the numbers after the decimal point are rounded to an integer, and the negative numbers are corrected to zero to calculate the above values (V2r = 242, V2g = 0, V2b = 0).

これによれば、ステップＳ２２で本来の色Ｖ０が背景色の影響により背景色に近づいた色Ｖ１として算出されている場合においても、その色Ｖ１を背景色から引き離した色Ｖ２に補正することができる。この場合には、特に補正後の色Ｖ２のＧ成分およびＢ成分（値０）が、それぞれ、補正前の色Ｖ１（値７９あるいは値８１）に比べて背景色のＧ成分およびＢ成分（値２５５）から大きく離れることになる。したがって、補正後の色Ｖ２は、全体として背景色から離れた色になる。このような色（すなわち背景色との差を拡げた色）で文字を表現することによれば、文字の色がくっきりとし、文字が見やすくなる。すなわち、文字の判読性を向上することができる。 According to this, even when the original color V0 is calculated as the color V1 approaching the background color due to the influence of the background color in step S22, the color V1 can be corrected to the color V2 separated from the background color. it can. In this case, in particular, the G component and B component (value 0) of the color V2 after correction are compared with the G component and B component (value) of the background color compared to the color V1 (value 79 or value 81) before correction, respectively. 255). Therefore, the corrected color V2 is a color that is separated from the background color as a whole. By expressing a character with such a color (that is, a color in which the difference from the background color is widened), the color of the character is clear and the character is easy to see. That is, the legibility of characters can be improved.

また、値ｋは、文字の太さを表す指標値αに応じて変更される。具体的には、文字が太くなる（指標値αが大きくなる）につれて、値ｋが小さくなる。逆に、文字が細くなる（指標値αが小さくなる）につれて、値ｋが大きくなる。 The value k is changed according to the index value α representing the thickness of the character. Specifically, the value k decreases as the character becomes thicker (the index value α increases). Conversely, the value k increases as the character becomes thinner (the index value α decreases).

ここにおいて、図１４に示すように、文字が太い場合（１０．５ポイント文字の場合）における補正前の色Ｖ１（Ｒ＝２３３，Ｇ＝２０，Ｂ＝１６）は、文字が細い場合（６ポイント文字の場合）における補正前の色Ｖ１（Ｒ＝２４６，Ｇ＝７９，Ｂ＝８１）に比べて、本来の色（Ｒ＝２５５，Ｇ＝０，Ｂ＝０）に比較的近い。 Here, as shown in FIG. 14, the color V1 (R = 233, G = 20, B = 16) before correction when the character is thick (in the case of a 10.5 point character) is when the character is thin (6 Compared to the original color V1 (R = 246, G = 79, B = 81) before correction in the case of a point character, it is relatively close to the original color (R = 255, G = 0, B = 0).

そこで、この実施形態では、文字が太い場合は、値kが比較的小さくなるような式（１）を用いることによって補正の程度を比較的小さくしている。これによれば過剰な補正を防止できる。逆に、文字が細い場合には、式（１）により値kを比較的大きな値とし、補正の程度を比較的大きくしている。 Therefore, in this embodiment, when the character is thick, the degree of correction is made relatively small by using the equation (1) in which the value k becomes relatively small. According to this, excessive correction can be prevented. On the contrary, when the character is thin, the value k is set to a relatively large value by the equation (1), and the degree of correction is made relatively large.

このように、文字の太さに応じて、補正時における、第１の色と背景色との差の拡大の程度を変更することによれば、補正の程度の適正化を図ることができる。 As described above, the degree of correction can be optimized by changing the degree of enlargement of the difference between the first color and the background color at the time of correction according to the thickness of the character.

＜１−４．背景色が緑色の場合＞
また、図１６〜図２３は、（文字色は上記と同様に赤色であるが）背景色が白色ではなく緑色の場合における処理結果の変遷を示す図であり、それぞれ、図４〜図１１に対応している。図１６〜図１９は、６ポイントの大きさを有する文字「Ｉ」に関する処理結果の変遷を示す図であり、図２０〜図２３は、１０．５ポイントの大きさを有する文字「Ｉ」に関する処理結果の変遷を示す図である。図１６および図２０は、原稿上の文字「Ｉ」を示す図であり、図１７および図２１は、スキャン画像における文字「Ｉ」を示す図である。また、図１８および図２２は、スキャン画像を二値化して得られる画像、詳細には、補正処理（ステップＳ２４）前の画像における文字「Ｉ」を示す図であり、図１９および図２３は、当該補正処理（ステップＳ２４）後の画像における文字「Ｉ」を示す図である。 <1-4. When the background color is green>
FIGS. 16 to 23 are diagrams showing changes in processing results when the background color is not white but green (although the character color is red as described above), and FIGS. It corresponds. 16 to 19 are diagrams showing the transition of the processing result related to the character “I” having a size of 6 points, and FIGS. 20 to 23 are related to the character “I” having a size of 10.5 points. It is a figure which shows the transition of a process result. 16 and 20 are diagrams showing the letter “I” on the document, and FIGS. 17 and 21 are diagrams showing the letter “I” in the scanned image. 18 and 22 are diagrams showing the character “I” in the image obtained by binarizing the scan image, specifically, the image before the correction process (step S24). FIG. 19 and FIG. FIG. 10 is a diagram showing the letter “I” in the image after the correction process (step S24).

また、図２４は、背景色が白色ではなく緑色の場合の補正前後の色等を示す図である。 FIG. 24 is a diagram showing the color before and after correction when the background color is not white but green.

図２４に示すように、例えば６ポイント文字に関しては補正前の文字色Ｖ１（Ｒ＝１９５，Ｇ＝２３，Ｂ＝１）は、補正後の文字色Ｖ２（Ｒ＝２２０，Ｇ＝０，Ｂ＝０）へと変更される。すなわち、背景の緑色（Ｒ＝１４５，Ｇ＝２０１，Ｂ＝５２）の影響によって補正前の文字色Ｖ１が本来の赤色（Ｒ＝２５５，Ｇ＝０，Ｂ＝０）よりも「くすんだ」色（Ｒ＝１９５，Ｇ＝２３，Ｂ＝１）になっている場合であっても、補正後の文字の色Ｖ２は、背景との差を大きくした色（Ｒ＝２２０，Ｇ＝０，Ｂ＝０）に補正される。そのため、観察者は比較的くっきりとした色の文字を視認することができる。したがって、文字の判読性を向上させることができる。 As shown in FIG. 24, for example, for a 6-point character, the character color V1 before correction (R = 195, G = 23, B = 1) is the character color V2 after correction (R = 220, G = 0, B). = 0). That is, the character color V1 before correction is more dull than the original red color (R = 255, G = 0, B = 0) due to the influence of the background green color (R = 145, G = 201, B = 52). Even when the color (R = 195, G = 23, B = 1), the corrected character color V2 is a color (R = 220, G = 0, B = 0). Therefore, the observer can visually recognize characters with relatively clear colors. Therefore, the legibility of characters can be improved.

また、１０．５ポイント文字に関しては、図１６に示すように補正前の色Ｖ１（Ｒ＝２１１，Ｇ＝１６，Ｂ＝１）は、補正後の色Ｖ２（Ｒ＝２２６，Ｇ＝０，Ｂ＝０）へと変更される。このような補正によれば、背景の緑色（Ｒ＝１４５，Ｇ＝２０１，Ｂ＝５２）の影響によって補正前の文字の色Ｖ１が本来の赤色（Ｒ＝２５５，Ｇ＝０，Ｂ＝０）よりも「くすんだ」色になっている場合であっても、補正後の文字の色Ｖ２は、背景との差を大きくした色に補正される。 For the 10.5 point character, as shown in FIG. 16, the uncorrected color V1 (R = 211, G = 16, B = 1) is the corrected color V2 (R = 226, G = 0, B = 0). According to such correction, the color V1 of the character before correction is the original red color (R = 255, G = 0, B = 0) due to the influence of the green background (R = 145, G = 201, B = 52). ), The corrected character color V2 is corrected to a color with a larger difference from the background.

なお、図１７〜図１９、図２１〜図２３においては、文字部ＣＰに隣接する背景部分（ないし下地部分）が、写真あるいは図表と同様に上述の第２の領域として扱われ、文字部ＣＰと合成された状態で示されている。そのため、背景部分は、文字部ＣＰのように二値化されておらず、多段階の階調値を有するものとして示されている。また、上述の計算における背景色の緑色（Ｒ＝１４５，Ｇ＝２０１，Ｂ＝５２）は、このような背景部分の平均色として取得されたものである。 In FIG. 17 to FIG. 19 and FIG. 21 to FIG. 23, the background portion (or the ground portion) adjacent to the character portion CP is treated as the above-described second region in the same manner as the photograph or chart, and the character portion CP. And is shown in a synthesized state. Therefore, the background portion is not binarized like the character portion CP, and is shown as having multi-level gradation values. Further, the green background color (R = 145, G = 201, B = 52) in the above calculation is acquired as the average color of such a background portion.

＜１−５．背景色が青色の場合＞
また、図２５は、（文字色は上記と同様に赤色であるが）背景色が緑色ではなく青色の場合における補正前後の色等を示す図である。さらに、図２６〜図３３は、背景色が白色ではなく青色の場合における処理結果の変遷を示す図であり、それぞれ、図４〜図１１に対応している。図２６〜図２９は、６ポイントの大きさを有する文字「Ｉ」に関する処理結果の変遷を示す図であり、図３０〜図３３は、１０．５ポイントの大きさを有する文字「Ｉ」に関する処理結果の変遷を示す図である。図２６および図３０は、原稿上の文字「Ｉ」を示す図であり、図２７および図３１は、スキャン画像における文字「Ｉ」を示す図である。また、図２８および図３２は、スキャン画像を二値化して得られる画像、詳細には、補正処理（ステップＳ２４）前の画像における文字「Ｉ」を示す図であり、図２９および図３３は、当該補正処理（ステップＳ２４）後の画像における文字「Ｉ」を示す図である。 <1-5. When the background color is blue>
FIG. 25 is a diagram illustrating the color before and after correction when the background color is blue instead of green (although the character color is red as described above). Furthermore, FIGS. 26 to 33 are diagrams showing the transition of the processing results when the background color is blue instead of white, and correspond to FIGS. 4 to 11 respectively. 26 to 29 are diagrams showing the transition of the processing result related to the letter “I” having a size of 6 points, and FIGS. 30 to 33 are related to the letter “I” having a size of 10.5 points. It is a figure which shows the transition of a process result. 26 and 30 are diagrams showing the letter “I” on the document, and FIGS. 27 and 31 are diagrams showing the letter “I” in the scanned image. FIGS. 28 and 32 are diagrams showing the character “I” in an image obtained by binarizing the scanned image, specifically, the image before the correction process (step S24). FIGS. FIG. 10 is a diagram showing the letter “I” in the image after the correction process (step S24).

図２５に示すように、例えば６ポイント文字に関しては補正前の色Ｖ１（Ｒ＝１９２，Ｇ＝８０，Ｂ＝７４）は、補正後の色Ｖ２（Ｒ＝２５５，Ｇ＝７７，Ｂ＝３１）へと変更される。このような補正によれば、背景の青色（Ｒ＝２６，Ｇ＝８６，Ｂ＝１６１）の影響によって補正前の文字の色Ｖ１が本来の赤色（Ｒ＝２５５，Ｇ＝０，Ｂ＝０）よりも「くすんだ」色（Ｒ＝１９２，Ｇ＝８０，Ｂ＝７４）になっている場合であっても、補正後の文字の色Ｖ２は、背景との差を大きくした色（Ｒ＝２５５，Ｇ＝７７，Ｂ＝３１）に補正される。そのため、観察者は比較的くっきりとした色の文字を視認することができる。したがって、文字の判読性を向上させることができる。 As shown in FIG. 25, for a six-point character, for example, the color V1 before correction (R = 192, G = 80, B = 74) is the color V2 after correction (R = 255, G = 77, B = 31). ). According to such correction, the character color V1 before correction is changed to the original red color (R = 255, G = 0, B = 0) due to the influence of the blue color of the background (R = 26, G = 86, B = 161). ), The corrected character color V2 is a color (R) that has a larger difference from the background (R = 192, G = 80, B = 74). = 255, G = 77, B = 31). Therefore, the observer can visually recognize characters with relatively clear colors. Therefore, the legibility of characters can be improved.

また、１０．５ポイント文字に関しては、図２５に示すように補正前の色Ｖ１（Ｒ＝２２５，Ｇ＝６３，Ｂ＝５９）は、補正後の色Ｖ２（Ｒ＝２５５，Ｇ＝５８，Ｂ＝３６）へと変更される。このような補正によれば、背景の青色（Ｒ＝２６，Ｇ＝８６，Ｂ＝１６１）の影響によって補正前の文字の色Ｖ１が本来の赤色（Ｒ＝２５５，Ｇ＝０，Ｂ＝０）よりも「くすんだ」色（Ｒ＝２２５，Ｇ＝６３，Ｂ＝５９）になっている場合であっても、補正後の文字の色Ｖ２は、背景との差を大きくした色（Ｒ＝２５５，Ｇ＝５８，Ｂ＝３６）に補正される。 For the 10.5 point character, as shown in FIG. 25, the uncorrected color V1 (R = 225, G = 63, B = 59) is the corrected color V2 (R = 255, G = 58, B = 36). According to such correction, the character color V1 before correction is changed to the original red color (R = 255, G = 0, B = 0) due to the influence of the blue color of the background (R = 26, G = 86, B = 161). ), The corrected character color V2 is a color (R) having a larger difference from the background (R = 225, G = 63, B = 59). = 255, G = 58, B = 36).

なお、図２７〜図２９、図３１〜図３３においても、図１７〜図１９、図２１〜図２３と同様に、背景部分は、文字部ＣＰのように二値化されておらず、多段階の階調値を有するものとして示されている。また、上述の計算における背景色の青色（Ｒ＝２６，Ｇ＝８６，Ｂ＝１６１）は、このような背景部分の平均色として取得されたものである。 27 to 29 and FIGS. 31 to 33, as in FIGS. 17 to 19 and FIGS. 21 to 23, the background portion is not binarized like the character portion CP. It is shown as having graded gradation values. Further, the background color blue (R = 26, G = 86, B = 161) in the above calculation is obtained as the average color of such a background portion.

＜２．第２実施形態＞
第２実施形態は、第１実施形態の変形例である。以下では、第１実施形態との相違点を中心に説明する。 <2. Second Embodiment>
The second embodiment is a modification of the first embodiment. Below, it demonstrates centering on difference with 1st Embodiment.

上記第１実施形態においては、ＭＦＰ１が画像処理装置として機能する場合を例示したが、この第２実施形態においては、コンピュータ（パーソナルコンピュータ等）が画像処理装置として機能する場合を例示する。 In the first embodiment, the case where the MFP 1 functions as an image processing apparatus is illustrated, but in the second embodiment, the case where a computer (such as a personal computer) functions as an image processing apparatus is illustrated.

図３４は、第２実施形態に係る原稿電子化システム１００（１００Ｂ）を示す概略図である。ここでは、原稿電子化システム１００は、複数のスキャナ装置８０と、パーソナルコンピュータなどのコンピュータ９０とを備えている。なお、スキャナ装置８０は、スキャナ専用機器であってもよく、あるいは、上述のＭＦＰ１のような複合機等であってもよい。 FIG. 34 is a schematic diagram showing an original digitizing system 100 (100B) according to the second embodiment. Here, the document digitizing system 100 includes a plurality of scanner devices 80 and a computer 90 such as a personal computer. The scanner device 80 may be a scanner-dedicated device, or may be a multifunction device such as the MFP 1 described above.

また、コンピュータ９０は、複数のスキャナ装置８０に対してネットワークＮＷを介して通信可能に接続されている。コンピュータ９０は、各スキャナ装置８０によって生成されたスキャン画像をネットワークＮＷを介して受信し、当該スキャン画像に対して上述のような画像処理を施す。 The computer 90 is connected to a plurality of scanner devices 80 via a network NW so as to be communicable. The computer 90 receives the scan image generated by each scanner device 80 via the network NW, and performs the above-described image processing on the scan image.

詳細には、コンピュータ９０は、所定のプログラムＰＧが記録された記録媒体９１（例えば、フレキシブルディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ等）から当該プログラムＰＧを読み出し、当該プログラムＰＧをそのＣＰＵ等を用いて実行することによって、上記の画像処理部３と同様の機能を実現する。なお、当該プログラムＰＧは、記録媒体によって供給されてもよいが、インターネットを介してダウンロードすることなどによって供給されてもよい。 Specifically, the computer 90 reads the program PG from a recording medium 91 (for example, a flexible disk, a CD-ROM, a DVD-ROM, etc.) on which a predetermined program PG is recorded, and uses the program PG using the CPU or the like. By executing these functions, the same function as that of the image processing unit 3 is realized. The program PG may be supplied by a recording medium, or may be supplied by downloading via the Internet.

これによって、コンピュータ９０は、第１実施形態と同様の画像処理を実現し、文字の色をくっきりとさせた状態のファイルを作成することができる。 As a result, the computer 90 can realize image processing similar to that of the first embodiment, and can create a file in which the character color is clear.

また、このコンピュータ９０は、複数のスキャナ装置８０から送信されてきた画像に基づいて、上述のようなファイル（例えばＰＤＦファイル）を作成することができる。特に、コンピュータ９０は、上式（１）におけるパラメータβを、各スキャナ装置に固有の値に設定して、その記憶部に格納している。例えば、或るスキャナ装置向けにはβ＝０．７５を設定し、別のスキャナ装置向けにはβ＝０．８５を設定する。これによれば、各スキャナ装置８０の特性が互いに異なる場合であっても、それぞれの特性に応じた処理を行うことが可能である。 Further, the computer 90 can create the above-described file (for example, PDF file) based on the images transmitted from the plurality of scanner devices 80. In particular, the computer 90 sets the parameter β in the above equation (1) to a value unique to each scanner device and stores it in the storage unit. For example, β = 0.75 is set for a certain scanner device, and β = 0.85 is set for another scanner device. According to this, even if the characteristics of the scanner devices 80 are different from each other, it is possible to perform processing according to the characteristics.

＜３．変形例＞
以上、この発明の実施の形態について説明したが、この発明は上記説明した内容のものに限定されるものではない。 <3. Modification>
Although the embodiments of the present invention have been described above, the present invention is not limited to the contents described above.

たとえば、上記各実施形態においては、値ｋを文字の太さ（α）に応じて変更する場合を例示したが、これに限定されず、値ｋを固定値にしてもよい。 For example, in each of the above-described embodiments, the case where the value k is changed according to the thickness (α) of the character is exemplified, but the present invention is not limited to this, and the value k may be a fixed value.

また、上記実施形態においては、文字部ＣＰ（図１２等参照）に含まれる全画素の平均色を第１の色Ｖ１として求める場合を例示しているが、これに限定されない。 Moreover, although the case where the average color of all the pixels contained in the character part CP (refer FIG. 12 etc.) is calculated | required as 1st color V1 is illustrated in the said embodiment, it is not limited to this.

例えば、文字部ＣＰに含まれる全画素のうちの一部の画素（例えば、非境界画素の全て又は一部）の平均色を第１の色Ｖ１として求めるようにしてもよい。 For example, an average color of a part of all the pixels included in the character part CP (for example, all or part of the non-boundary pixels) may be obtained as the first color V1.

また、文字部ＣＰに含まれる代表画素の色を第１の色Ｖ１として決定するようにしてもよい。より詳細には、図１２における文字部ＣＰの中央の代表画素Ｒ１の色を第１の色Ｖ１として決定するようにしてもよい。 Further, the color of the representative pixel included in the character part CP may be determined as the first color V1. More specifically, the color of the representative pixel R1 at the center of the character portion CP in FIG. 12 may be determined as the first color V1.

また、上記各実施形態においては、１文字単位で文字色を決定する場合を中心に例示したが、上述したように、或る範囲（単語、行、段落、あるいはブロック等）を１つの単位として、当該単位ごとに「文字色」を決定してもよい。 Further, in each of the above embodiments, the case where the character color is determined in units of one character has been exemplified. However, as described above, a certain range (word, line, paragraph, block, or the like) is set as one unit. The “character color” may be determined for each unit.

例えば、単語ごとに１つの文字色を決定してもよい。具体的には、単語に含まれる複数の文字の「文字色」をそれぞれ求め、その平均値を当該単語の「文字色」として決定すればよい。 For example, one character color may be determined for each word. Specifically, the “character color” of a plurality of characters included in a word may be obtained, and the average value may be determined as the “character color” of the word.

あるいは、「行」ごとに１つの文字色を決定してもよい。具体的には、「行」に含まれる複数の文字の「文字色」をそれぞれ求め、その平均値を当該「行」の「文字色」として決定すればよい。 Alternatively, one character color may be determined for each “line”. Specifically, the “character color” of a plurality of characters included in the “line” is obtained, and the average value is determined as the “character color” of the “line”.

あるいは、複数の行（あるいは複数の段落）で構成される文字ブロックごとに１つの文字色を決定するようにしてもよい。この場合においては、複数の行ごとに文字色を決定しておき、近傍の行のうち文字色が互いに近い行をまとめて文字のブロックを構成し、当該ブロックに対する文字色を再度決定することが好ましい。これによれば、互いにその色が近似する複数の行を１つのブロックにまとめて「色」を管理することができるので、効率的にファイルサイズを低減することが可能である。 Alternatively, one character color may be determined for each character block composed of a plurality of lines (or a plurality of paragraphs). In this case, the character color is determined for each of a plurality of lines, and among the neighboring lines, the character colors that are close to each other are combined to form a character block, and the character color for the block is determined again. preferable. According to this, since a plurality of rows whose colors are similar to each other can be combined into one block to manage “color”, it is possible to efficiently reduce the file size.

第１実施形態に係る原稿電子化システムを示す概略図である。1 is a schematic diagram illustrating a document digitization system according to a first embodiment. ＭＦＰ内での画像処理の概要を示す図である。FIG. 2 is a diagram illustrating an overview of image processing in an MFP. 文字に関する処理の概要を示す図である。It is a figure which shows the outline | summary of the process regarding a character. 原稿上の文字「Ｉ」（６ポイント文字）を示す図である。It is a figure which shows the character "I" (6 point character) on a manuscript. スキャン画像における文字「Ｉ」（６ポイント文字）を示す図である。It is a figure which shows the character "I" (6 point character) in a scan image. 補正処理前の文字「Ｉ」（６ポイント文字）を示す図である。It is a figure which shows the character "I" (6-point character) before a correction process. 補正処理後の文字「Ｉ」（６ポイント文字）を示す図である。It is a figure which shows the character "I" (6 point character) after a correction process. 原稿上の文字「Ｉ」（１０．５ポイント文字）を示す図である。It is a figure which shows the character "I" (10.5 point character) on a manuscript. スキャン画像における文字「Ｉ」（１０．５ポイント文字）を示す図である。It is a figure which shows the character "I" (10.5 point character) in a scan image. 補正処理前の文字「Ｉ」（１０．５ポイント文字）を示す図である。It is a figure which shows the character "I" (10.5 point character) before a correction process. 補正処理後の文字「Ｉ」（１０．５ポイント文字）を示す図である。It is a figure which shows the character "I" (10.5 point character) after a correction process. 補正処理前の文字「Ｉ」（６ポイント文字）を模式的に示す図である。It is a figure which shows typically the character "I" (6-point character) before a correction process. 補正処理前の文字「Ｉ」（１０．５ポイント文字）を模式的に示す図である。It is a figure which shows typically character "I" (10.5 point character) before a correction process. 補正前後の色の変化を示す図である（背景が白色の場合）。It is a figure which shows the change of the color before and behind correction | amendment (when a background is white). 補正原理を説明するための図である。It is a figure for demonstrating the correction principle. 原稿上の６ポイント文字（緑色背景）を示す図である。It is a figure which shows the 6 point character (green background) on a manuscript. スキャン画像における６ポイント文字（緑色背景）を示す図である。It is a figure which shows the 6 point character (green background) in a scan image. 補正処理前の６ポイント文字（緑色背景）を示す図である。It is a figure which shows the 6 point character (green background) before a correction process. 補正処理後の６ポイント文字（緑色背景）を示す図である。It is a figure which shows 6 point characters (green background) after a correction process. 原稿上の文字１０．５ポイント文字（緑色背景）を示す図である。It is a figure which shows the character 10.5 point character (green background) on a manuscript. スキャン画像における１０．５ポイント文字（緑色背景）を示す図である。It is a figure which shows the 10.5 point character (green background) in a scanning image. 補正処理前の１０．５ポイント文字（緑色背景）を示す図である。It is a figure which shows the 10.5 point character (green background) before a correction process. 補正処理後の１０．５ポイント文字（緑色背景）を示す図である。It is a figure which shows the 10.5 point character (green background) after a correction process. 補正前後の色の変化を示す図である（背景が緑色の場合）。It is a figure which shows the change of the color before and behind correction | amendment (when a background is green). 補正前後の色の変化を示す図である（背景が青色の場合）。It is a figure which shows the change of the color before and behind correction | amendment (when a background is blue). 原稿上の６ポイント文字（青色背景）を示す図である。It is a figure which shows the 6 point character (blue background) on a manuscript. スキャン画像における６ポイント文字（青色背景）を示す図である。It is a figure which shows the 6 point character (blue background) in a scan image. 補正処理前の６ポイント文字（青色背景）を示す図である。It is a figure which shows 6 point characters (blue background) before a correction process. 補正処理後の６ポイント文字（青色背景）を示す図である。It is a figure which shows 6 point characters (blue background) after a correction process. 原稿上の文字１０．５ポイント文字（青色背景）を示す図である。It is a figure which shows the character 10.5 point character (blue background) on a manuscript. スキャン画像における１０．５ポイント文字（青色背景）を示す図である。It is a figure which shows the 10.5 point character (blue background) in a scanning image. 補正処理前の１０．５ポイント文字（青色背景）を示す図である。It is a figure which shows the 10.5 point character (blue background) before a correction process. 補正処理後の１０．５ポイント文字（青色背景）を示す図である。It is a figure which shows the 10.5 point character (blue background) after a correction process. 第２実施形態に係る原稿電子化システムを示す概略図である。It is the schematic which shows the original digitization system which concerns on 2nd Embodiment.

Explanation of symbols

１ＭＦＰ（画像処理装置）
９０コンピュータ（画像処理装置）
９１記録媒体
１００，１００Ａ，１００Ｂ原稿電子化システム
ＰＧプログラム
Ｒ１代表画素
Ｖ１第１の色
Ｖ２第２の色
ＶＢ背景色
α 内部比率 1 MFP (image processing device)
90 Computer (image processing device)
91 Recording Medium 100, 100A, 100B Electronic Document System PG Program R1 Representative Pixel V1 First Color V2 Second Color VB Background Color α Internal Ratio

Claims

An image processing apparatus,
Character part extraction means for performing binarization processing on the scanned image and separating and extracting a character part that is a part corresponding to the character from the background;
Determining means for determining a character color that is a representative color of the character portion, wherein a first color that is a candidate color of the character portion is obtained based on a color component value of the character portion; Determining means for determining, as the character color, a second color obtained by correcting the first color so as to enlarge a difference between a background color that is a background color of the scanned image and the background color;
Detecting means for detecting the thickness of the character;
With
Said determining means, depending on the thickness of the character detected by the detecting means, be one that changes the degree of expansion of the difference in the correction time, as the character becomes narrower, the expansion of the difference An image processing apparatus characterized by increasing the degree .

The image processing apparatus according to claim 1.
The image processing apparatus according to claim 1, wherein the detection unit calculates a ratio of a non-boundary pixel to a total number of pixels of the character portion as a thickness of the character.

The image processing apparatus according to claim 1 or 2,
The image processing apparatus according to claim 1, wherein the determining unit determines an average color of at least some of the pixels included in the character portion as the first color.

The image processing apparatus according to claim 1 or 2,
The image processing apparatus according to claim 1, wherein the determining unit determines a color of a representative pixel included in the character portion as the first color.

  On the computer,
  a) A procedure for performing binarization processing on the scanned image and separating and extracting a character portion that is a portion corresponding to the character from the background;
  b) a procedure for detecting the thickness of the character;
  c) obtaining a first color that is a candidate for the color of the character part based on the color component value of the character part, and calculating a difference between the first color and a background color that is the background color of the scanned image. Determining a second color obtained by correcting the first color so as to enlarge as a character color that is a representative color of the character portion;
A program for executing
  The procedure c)
    c-1) A procedure for changing the degree of enlargement of the difference at the time of correction according to the thickness of the character detected in step b), and the degree of enlargement of the difference as the character becomes thinner A program comprising a procedure for changing the degree of enlargement of the difference so as to increase.

The program according to claim 5, wherein
The procedure b) includes a procedure for calculating a ratio of non-boundary pixels to the total number of pixels in the character portion as the thickness of the character.

In the program according to claim 5 or 6,
The procedure c) includes a procedure for determining an average color of at least some pixels included in the character portion as the first color.

In the program according to claim 5 or 6,
The procedure c) includes a procedure for determining a color of a representative pixel included in the character portion as the first color.