JP7357853B2

JP7357853B2 - Image processing device, image processing method, and image processing program

Info

Publication number: JP7357853B2
Application number: JP2019196511A
Authority: JP
Inventors: 亮祐加藤; 敏明六尾
Original assignee: Kyocera Document Solutions Inc
Current assignee: Kyocera Document Solutions Inc
Priority date: 2019-10-29
Filing date: 2019-10-29
Publication date: 2023-10-10
Anticipated expiration: 2039-10-29
Also published as: JP2021072484A

Description

本発明は、画像処理装置、画像処理方法及び画像処理プログラムに関し、特に、二値化処理技術に関する。 The present invention relates to an image processing device, an image processing method, and an image processing program, and particularly relates to a binarization processing technique.

地肌ノイズが含まれている原稿を読取り、その読取画像を二値化処理すると不要な地肌ノイズが残存する問題がある。このような問題に対し、たとえば特許文献１は、入力画像の濃度ヒストグラムにおいて、各濃度に対する度数の移動平均を求め、その結果の最大度数に対する濃度値に基づいて二値化スライスレベルを求め、二値化を行う技術を提案している。しかしながら、閾値は、移動平均を取った後のヒストグラムが明確な双峰性を持つことを前提として簡易な方法で設定されているので、明確な双峰性が無い場合には、閾値を設定することができないという問題がある。一方、特許文献２は、ｎ×ｎ画素の「セル」を取得し、そのセルを中心にｍ×ｍ画素（ｎ＜ｍ）の測定領域を設定し、測定領域内の輝度ヒストグラムからセルに対する局所的しきい値を求め、予め設定された制限値の範囲内で局所的しきい値を用いてセルを２値化する技術を提案している。予め設定された制限値の範囲内としているのは、ノイズ成分の取り出しを抑制するためである。 When a document containing background noise is read and the read image is subjected to binarization processing, there is a problem that unnecessary background noise remains. To solve this problem, for example, Patent Document 1 calculates a moving average of the frequency for each density in the density histogram of an input image, calculates a binarized slice level based on the resulting density value for the maximum frequency, and We are proposing a technology for converting values into values. However, the threshold is set using a simple method on the assumption that the histogram after taking the moving average has clear bimodality, so if there is no clear bimodality, the threshold cannot be set. The problem is that I can't. On the other hand, in Patent Document 2, a "cell" of n×n pixels is acquired, a measurement area of m×m pixels (n<m) is set around the cell, and a local area for the cell is determined from a brightness histogram in the measurement area. proposed a technique to obtain a local threshold value and binarize cells using a local threshold value within a preset limit value. The reason for setting the value within the preset limit value is to suppress extraction of noise components.

特開平７－２６２３４８号公報Japanese Patent Application Publication No. 7-262348 特開２００１－２４５１４８号公報Japanese Patent Application Publication No. 2001-245148

しかしながら、本願発明者の知見によれば、地肌の濃度分布が広い場合には、明確な双峰性が認められない場合も多く、特許文献１は、このような画像の二値化に対応することができないという問題がある。一方、本願発明者の解析によれば、ノイズ成分の発生は、地肌の濃度に応じて変動するので、予め設定されている固定の制限値では、ノイズ成分の取り出しと薄い図形や文字の消滅のトレードオフの問題から逃れられないという問題がある。 However, according to the knowledge of the inventor of the present application, when the density distribution of the background is wide, clear bimodality is often not recognized, and Patent Document 1 deals with the binarization of such an image. The problem is that I can't. On the other hand, according to the inventor's analysis, since the generation of noise components varies depending on the density of the background, it is difficult to extract noise components and eliminate thin figures and characters using fixed limit values set in advance. The problem is that we cannot escape the problem of trade-offs.

本発明は、このような状況に鑑みてなされたものであり、低濃度画像の判別と画像ノイズの抑制の両立を改善する二値化処理技術を提供することを目的とする。 The present invention has been made in view of this situation, and an object of the present invention is to provide a binarization processing technique that improves both the discrimination of low-density images and the suppression of image noise.

本発明の画像処理装置は、入力画像データに対して判別分析法で第１のクラスの平均階調値である第１の平均階調値と、背景画像を比較的に多く含む側である第２のクラスの平均階調値である第２の平均階調値とを算出するヒストグラム解析部と、前記第１の平均階調値と前記第２の平均階調値の間において、閾値を調整するためのユーザー入力を受け付ける操作表示部と、前記操作表示部に入力されたユーザー入力に応じて前記閾値を調整し、前記調整された閾値を使用して二値化処理を実行する二値化処理部とを備え、前記操作表示部は、前記閾値を操作に応じて実時間で前記二値化処理後の画像を表示する。 The image processing device of the present invention uses a discriminant analysis method for input image data to determine a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is an average gradation value of a first class, for input image data. a histogram analysis unit that calculates a second average gradation value that is an average gradation value of the second class; and a threshold value is adjusted between the first average gradation value and the second average gradation value. an operation display unit that accepts user input to perform the operation; and binarization that adjusts the threshold according to the user input input to the operation display unit and executes binarization processing using the adjusted threshold. and a processing section, and the operation display section displays the image after the binarization process in real time according to the operation of the threshold value.

本発明の画像形成方法は、入力画像データに対して判別分析法で第１のクラスの平均階調値である第１の平均階調値と、背景画像を比較的に多く含む側である第２のクラスの平均階調値である第２の平均階調値とを算出するヒストグラム解析工程と、前記第１の平均階調値と前記第２の平均階調値の間において、閾値を調整するためのユーザー入力を受け付ける操作表示工程と、前記操作表示工程で入力されたユーザー入力に応じて前記閾値を調整し、前記調整された閾値を使用して二値化処理を実行する二値化処理工程とを備え、前記操作表示工程は、前記閾値を操作に応じて実時間で前記二値化処理後の画像を表示する。 The image forming method of the present invention uses a discriminant analysis method for input image data to determine a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, and a histogram analysis step of calculating a second average gradation value that is an average gradation value of the second class; and adjusting a threshold between the first average gradation value and the second average gradation value. an operation/display step for accepting user input for the operation and display; and a binarization step for adjusting the threshold according to the user input input in the operation/display step, and performing a binarization process using the adjusted threshold. and a processing step, and the operation display step displays the binarized image in real time according to the operation of the threshold value.

本発明の画像処理プログラムは、入力画像データに対して判別分析法で第１のクラスの平均階調値である第１の平均階調値と、背景画像を比較的に多く含む側である第２のクラスの平均階調値である第２の平均階調値とを算出するヒストグラム解析部、前記第１の平均階調値と前記第２の平均階調値の間において、閾値を調整するためのユーザー入力を受け付ける操作表示部、及び前記操作表示部に入力されたユーザー入力に応じて前記閾値を調整し、前記調整された閾値を使用して二値化処理を実行する二値化処理部として画像処理装置を機能させ、前記操作表示部は、前記閾値を操作に応じて実時間で前記二値化処理後の画像を表示する。 The image processing program of the present invention uses a discriminant analysis method for input image data to determine a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is an average gradation value of a first class, and a first average gradation value, which is an average gradation value of a first class. a histogram analysis unit that calculates a second average gradation value that is an average gradation value of the second class, and adjusts a threshold between the first average gradation value and the second average gradation value; an operation display unit that accepts user input for the operation, and a binarization process that adjusts the threshold according to the user input input to the operation display unit and executes the binarization process using the adjusted threshold. The operation display section displays the binarized image in real time according to the operation of the threshold value.

本発明によれば、低濃度画像の判別と画像ノイズの抑制の両立を改善する二値化処理技術を提供することができる。 According to the present invention, it is possible to provide a binarization processing technique that improves both the discrimination of low-density images and the suppression of image noise.

本発明の第１実施形態に係る画像形成装置１０の全体構成を示す概略構成図である。1 is a schematic configuration diagram showing the overall configuration of an image forming apparatus 10 according to a first embodiment of the present invention. 第１実施形態に係る画像形成装置１０の機能的構成を示すブロックダイアグラムである。1 is a block diagram showing a functional configuration of an image forming apparatus 10 according to a first embodiment. 読取画像及び読取画像の輝度ヒストグラムを示す説明図である。FIG. 3 is an explanatory diagram showing a read image and a brightness histogram of the read image. 比較例に係る二値化処理及び比較例の二値化処理用閾値ＴＨｃを示す説明図である。FIG. 7 is an explanatory diagram showing a binarization process according to a comparative example and a threshold value THc for the binarization process of the comparative example. 第１実施形態に係る二値化処理手順の内容を示すフローチャートである。It is a flowchart which shows the content of the binarization processing procedure based on 1st Embodiment. 第１実施形態に係るローパスフィルタ処理の内容を示す説明図である。FIG. 3 is an explanatory diagram showing the contents of low-pass filter processing according to the first embodiment. 第１実施形態に係るページ閾値設定処理の内容を示すフローチャートである。It is a flowchart which shows the content of page threshold value setting processing concerning a 1st embodiment. 第１実施形態に係るページ閾値設定処理の内容を示す説明図である。It is an explanatory diagram showing the contents of page threshold value setting processing concerning a 1st embodiment. 第１実施形態及び比較例に係る地肌ノイズ判定用閾値設定処理の内容を示す説明図である。FIG. 7 is an explanatory diagram illustrating the content of background noise determination threshold setting processing according to the first embodiment and the comparative example. 第１実施形態及び比較例に係る地肌ノイズ除去処理の内容を示す説明図である。FIG. 3 is an explanatory diagram showing the content of background noise removal processing according to the first embodiment and the comparative example. 第１実施形態に係る地肌ノイズ除去処理及び二値化処理の内容を示す説明図である。FIG. 3 is an explanatory diagram showing the contents of background noise removal processing and binarization processing according to the first embodiment. 第１実施形態に係る二値化処理の内容を示すフローチャートである。It is a flowchart which shows the content of the binarization process based on 1st Embodiment. 第１実施形態に係る着目局所領域選択の様子を示す説明図である。FIG. 3 is an explanatory diagram showing how to select a local region of interest according to the first embodiment. 第１実施形態に係る切替用閾値ＴＨＳを示すグラフである。It is a graph which shows the threshold value THS for switching based on 1st Embodiment. 第１実施形態に係る二値化処理の内容を示す説明図である。It is an explanatory diagram showing the contents of binarization processing concerning a 1st embodiment. 第２実施形態に係る二値化用閾値調整処理の内容を示すフローチャートである。It is a flowchart which shows the content of the threshold value adjustment process for binarization based on 2nd Embodiment. 第２実施形態に係る操作表示画面を示す説明図である。FIG. 7 is an explanatory diagram showing an operation display screen according to the second embodiment. 変形例に係るフィルタ処理の内容を示す説明図である。It is an explanatory view showing the contents of filter processing concerning a modification.

以下、本発明を実施するための形態（以下、「実施形態」という）を、図面を参照して以下の順序で説明する。
Ａ．第１実施形態：
Ｂ．第２実施形態：
Ｃ．変形例： DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, modes for carrying out the present invention (hereinafter referred to as "embodiments") will be described in the following order with reference to the drawings.
A. First embodiment:
B. Second embodiment:
C. Variant:

Ａ．第１実施形態：
図１は、本発明の第１実施形態に係る画像形成装置１０の全体構成を示す概略構成図である。図２は、第１実施形態に係る画像形成装置１０の機能的構成を示すブロックダイアグラムである。画像形成装置１０は、画像読取部１００と、制御部２１０と、画像形成部２２０と、操作表示部２３０と、記憶部２４０と、ＦＡＸ処理部２５０とを備えている。画像読取部１００は、自動原稿送り装置（ＡＤＦ）１６０と原稿台（コンタクトガラス）１５０とを有し、原稿から画像（原画像）を読み取ってデジタルデータである画像データＩＤを生成する。 A. First embodiment:
FIG. 1 is a schematic configuration diagram showing the overall configuration of an image forming apparatus 10 according to a first embodiment of the present invention. FIG. 2 is a block diagram showing the functional configuration of the image forming apparatus 10 according to the first embodiment. The image forming apparatus 10 includes an image reading section 100, a control section 210, an image forming section 220, an operation display section 230, a storage section 240, and a FAX processing section 250. The image reading unit 100 includes an automatic document feeder (ADF) 160 and a document table (contact glass) 150, reads an image (original image) from a document, and generates an image data ID that is digital data.

画像形成部２２０は、画像データＩＤに基づいて印刷媒体（図示せず）に画像を形成して排出する。画像形成部２２０は、色変換処理部２２１と、ハーフトーン処理部２２２と、画像出力部２２３とを備える。色変換処理部２２１は、ＲＧＢデータである画像データＩＤをＣＭＹＫ画像データに色変換する。ハーフトーン処理部２２２は、ハーフトーン処理を実行してＣＭＹＫ画像データのハーフトーンデータを生成する。画像出力部２２３は、ハーフトーンデータに基づいて画像を形成する。操作表示部２３０は、タッチパネルとして機能するディスプレイ（図示せず）や各種ボタンやスイッチ（図示せず）からユーザーの操作入力を受け付ける。 The image forming unit 220 forms an image on a print medium (not shown) based on the image data ID and discharges the print medium. The image forming section 220 includes a color conversion processing section 221, a halftone processing section 222, and an image output section 223. The color conversion processing unit 221 converts the image data ID, which is RGB data, into CMYK image data. The halftone processing unit 222 performs halftone processing to generate halftone data of CMYK image data. The image output unit 223 forms an image based on the halftone data. The operation display unit 230 receives user operation inputs from a display (not shown) functioning as a touch panel and various buttons and switches (not shown).

制御部２１０は、ＲＡＭやＲＯＭ等の主記憶手段、及びＭＰＵ（ＭｉｃｒｏＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）やＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等の制御手段を備えている。また、制御部２１０は、各種Ｉ／Ｏ、ＵＳＢ（ユニバーサル・シリアル・バス）、バス、その他ハードウェア等のインターフェイスに関連するコントローラ機能を備え、画像形成装置１０全体を制御する。 The control unit 210 includes main storage means such as RAM and ROM, and control means such as MPU (Micro Processing Unit) and CPU (Central Processing Unit). Further, the control unit 210 has controller functions related to interfaces such as various I/Os, a USB (universal serial bus), a bus, and other hardware, and controls the entire image forming apparatus 10 .

記憶部２４０は、非一時的な記録媒体であるハードディスクドライブやフラッシュメモリー等からなる記憶装置で、制御部２１０や二値化処理部１４０が実行する処理の制御プログラム（画像処理プログラムを含む。）やデータを記憶する。 The storage unit 240 is a storage device that is a non-temporary recording medium such as a hard disk drive or flash memory, and stores control programs (including image processing programs) for processing executed by the control unit 210 and the binarization processing unit 140. and memorize data.

画像読取部１００は、図２に示されるように、光源ドライバ１１１と、光源１１２とを備えている。光源１１２は、原稿Ｄに光を照射する複数のＬＥＤ（図示せず）を有する。光源ドライバ１１１は、主走査方向に配列されている複数のＬＥＤを駆動するＬＥＤドライバであり、光源１１２のオンオフ駆動制御を行う。これにより、光源１１２は、可変の駆動デューティのパルス幅変調（ＰＷＭ）で原稿Ｄの原稿面を照射光Ｌ１で照射することができる。 The image reading unit 100 includes a light source driver 111 and a light source 112, as shown in FIG. The light source 112 includes a plurality of LEDs (not shown) that irradiate the document D with light. The light source driver 111 is an LED driver that drives a plurality of LEDs arranged in the main scanning direction, and performs on/off drive control of the light source 112. Thereby, the light source 112 can irradiate the document surface of the document D with the irradiation light L1 using pulse width modulation (PWM) with variable drive duty.

照射光Ｌ１は、原稿Ｄの面に垂直な方向に対して４５度（傾斜した方向）の角度で照射される。原稿Ｄは、拡散反射光Ｌ２と、正反射光とを含む反射光を反射する。受光素子１２２は、拡散反射光Ｌ２を受光する。 The irradiation light L1 is irradiated at an angle of 45 degrees (in a tilted direction) with respect to a direction perpendicular to the surface of the document D. The document D reflects reflected light including diffusely reflected light L2 and specularly reflected light. The light receiving element 122 receives the diffusely reflected light L2.

画像読取部１００は、さらに、図１に示されるように、原稿Ｄとイメージセンサ１２１との間に、第１反射鏡１１３と、第１キャリッジ１１４と、第２反射鏡１１５と、第３反射鏡１１６と、第２キャリッジ１１７と、集光レンズ１１８とを備えている。第１反射鏡１１３は、原稿Ｄからの拡散反射光Ｌ２を第２反射鏡１１５の方向に反射する。第２反射鏡１１５は、拡散反射光Ｌ２を第３反射鏡１１６の方向に反射する。第３反射鏡１１６は、拡散反射光Ｌ２を集光レンズ１１８の方向に反射する。集光レンズ１１８は、拡散反射光Ｌ２をイメージセンサ１２１が有する複数の受光素子１２２の各受光面（図示せず）に結像する。 As shown in FIG. 1, the image reading unit 100 further includes a first reflecting mirror 113, a first carriage 114, a second reflecting mirror 115, and a third reflecting mirror between the document D and the image sensor 121. It includes a mirror 116, a second carriage 117, and a condenser lens 118. The first reflecting mirror 113 reflects the diffusely reflected light L2 from the document D toward the second reflecting mirror 115. The second reflecting mirror 115 reflects the diffusely reflected light L2 in the direction of the third reflecting mirror 116. The third reflecting mirror 116 reflects the diffusely reflected light L2 in the direction of the condenser lens 118. The condensing lens 118 forms an image of the diffusely reflected light L2 on each light receiving surface (not shown) of a plurality of light receiving elements 122 included in the image sensor 121.

イメージセンサ１２１は、Ｒ，Ｇ，Ｂの各色成分のカラーフィルタ（図示略）を使用してＲ，Ｇ，Ｂの３つの色をそれぞれ検知する３本のＣＣＤラインセンサ（図示せず）である。イメージセンサ１２１は、主走査方向に延びている３本のＣＣＤラインセンサで原稿を走査（副走査）して原稿上の画像をＲ，Ｇ，Ｂに対応する電圧値の組合せとして取得する。このように、イメージセンサ１２１は、光電変換処理を行って、主走査方向の画素毎のＲ，Ｇ，Ｂのアナログ電気信号を出力することができる。 The image sensor 121 is three CCD line sensors (not shown) that detect each of the three colors R, G, and B using color filters (not shown) for each color component of R, G, and B. . The image sensor 121 scans (sub-scans) a document with three CCD line sensors extending in the main scanning direction and acquires an image on the document as a combination of voltage values corresponding to R, G, and B. In this way, the image sensor 121 can perform photoelectric conversion processing and output R, G, and B analog electrical signals for each pixel in the main scanning direction.

第１キャリッジ１１４は、光源１１２と第１反射鏡１１３とを搭載し、副走査方向に往復動する。第２キャリッジ１１７は、第２反射鏡１１５と第３反射鏡１１６とを搭載し、副走査方向に往復動する。第１キャリッジ１１４及び第２キャリッジ１１７は、走査制御部として機能する制御部２１０によって制御される。これにより、光源１１２は原稿を副走査方向に走査することができるので、イメージセンサ１２１は、原稿上の２次元画像に応じてアナログ電気信号を出力することができる。 The first carriage 114 carries a light source 112 and a first reflecting mirror 113, and reciprocates in the sub-scanning direction. The second carriage 117 carries a second reflecting mirror 115 and a third reflecting mirror 116, and reciprocates in the sub-scanning direction. The first carriage 114 and the second carriage 117 are controlled by a control section 210 that functions as a scanning control section. Thereby, the light source 112 can scan the original in the sub-scanning direction, so the image sensor 121 can output an analog electrical signal in accordance with the two-dimensional image on the original.

なお、自動原稿送り装置（ＡＤＦ）１６０が使用される場合には、第１キャリッジ１１４及び第２キャリッジ１１７は、予め設定されている副走査位置に固定され、原稿Ｄの自動送りによって副走査方向の走査が行われる。なお、ＡＤＦ１６０には、片面だけでなく両面を同時にあるいは逐次に読み取るものもある。 Note that when the automatic document feeder (ADF) 160 is used, the first carriage 114 and the second carriage 117 are fixed at a preset sub-scanning position, and the document D is automatically fed in the sub-scanning direction. is scanned. Note that some ADFs 160 read not only one side but also both sides simultaneously or sequentially.

ＡＤＦ１６０は、紙送りローラー１６１と、原稿読取スリット１６２とを備えている。紙送りローラー１６１は、原稿の自動送りを行い、原稿の読み取りが原稿読取スリット１６２を介して行われる。この場合には、第１キャリッジ１１４が予め設定されている副走査位置に固定されるので、第１キャリッジ１１４に搭載されている光源１１２も所定位置に固定されることになる。 The ADF 160 includes a paper feed roller 161 and a document reading slit 162. The paper feed roller 161 automatically feeds the original, and the original is read through the original reading slit 162. In this case, since the first carriage 114 is fixed at a preset sub-scanning position, the light source 112 mounted on the first carriage 114 is also fixed at a predetermined position.

画像読取部１００は、図２に示されるように、さらに、信号処理部１２３と、シェーディング補正部１２４と、シェーディング補正テーブル１２４ａと、ガンマ変換部１２５と、二値化処理部１４０と、ＡＧＣ処理部１３０と、白基準板１３２（図１参照）とを備えている。二値化処理部１４０は、フィルタ処理部１４１と、閾値設定部１４２とを有している。 As shown in FIG. 2, the image reading section 100 further includes a signal processing section 123, a shading correction section 124, a shading correction table 124a, a gamma conversion section 125, a binarization processing section 140, and an AGC processing section. 130, and a white reference plate 132 (see FIG. 1). The binarization processing section 140 includes a filter processing section 141 and a threshold value setting section 142.

信号処理部１２３は、Ａ／Ｄ変換機能を有する可変利得増幅器である。信号処理部１２３は、ＡＧＣ処理部１３０で設定され、記憶部２４０に格納されている利得でアナログ電気信号を増幅し、増幅されたアナログ電気信号をＡ／Ｄ変換してデジタルデータとする。 The signal processing section 123 is a variable gain amplifier having an A/D conversion function. The signal processing section 123 amplifies the analog electrical signal with the gain set by the AGC processing section 130 and stored in the storage section 240, and A/D converts the amplified analog electrical signal into digital data.

ＡＧＣ処理部１３０は、本実施形態では、黒基準信号と白基準信号とを使用して複数の受光素子１２２のそれぞれに対して最適な利得とオフセット値とを設定する利得調整部である。黒基準信号は、光源１１２がオフの状態における受光素子１２２のアナログ電気信号である。白基準信号は、原稿Ｄの代わりに白基準板１３２を照射したときの受光素子１２２のアナログ電気信号である。 In this embodiment, the AGC processing unit 130 is a gain adjustment unit that sets optimal gains and offset values for each of the plurality of light receiving elements 122 using a black reference signal and a white reference signal. The black reference signal is an analog electrical signal of the light receiving element 122 when the light source 112 is off. The white reference signal is an analog electrical signal of the light receiving element 122 when the white reference plate 132 is irradiated instead of the original D.

ＡＧＣ処理部１３０は、黒基準信号が信号処理部１２３によってＡ／Ｄ変換されたときの画像データＩＤのＲＧＢの各階調値が最小値「０」となるようにオフセット値を設定する。ＡＧＣ処理部１３０は、このオフセット値を使用して白基準信号が信号処理部１２３によってＡ／Ｄ変換されたときの画像データＩＤのＲＧＢの各階調値が最大値「２５５」となるように利得を設定する。これにより、最小値「０」から最大値「２５５」までのダイナミックレンジを有効に利用することができる。 The AGC processing unit 130 sets the offset value so that each RGB gradation value of the image data ID becomes the minimum value “0” when the black reference signal is A/D converted by the signal processing unit 123. The AGC processing unit 130 uses this offset value to set the gain so that each RGB gradation value of the image data ID becomes the maximum value “255” when the white reference signal is A/D converted by the signal processing unit 123. Set. Thereby, the dynamic range from the minimum value "0" to the maximum value "255" can be effectively utilized.

シェーディング補正部１２４は、デジタルデータに対してシェーディング補正を実行して画像データＩＤを生成する。シェーディング補正は、光源１１２の長さ方向の光量不均一性やレンズのコサイン４乗則による周辺減光、主走査方向に配列されている複数の受光素子１２２の感度ムラに起因して発生するシェーディングを抑制するための補正である。ガンマ変換部１２５は、画像読取部１００の特性に基づいてガンマ変換を行う。これにより、画像読取部１００は、ＲＧＢの各階調値を有する画像データＩＤを生成することができる。 The shading correction unit 124 performs shading correction on the digital data to generate image data ID. Shading correction is performed to correct shading that occurs due to non-uniformity of light amount in the length direction of the light source 112, marginal light reduction due to the cosine fourth power law of the lens, and uneven sensitivity of the plurality of light receiving elements 122 arranged in the main scanning direction. This is a correction to suppress the The gamma conversion unit 125 performs gamma conversion based on the characteristics of the image reading unit 100. Thereby, the image reading unit 100 can generate image data ID having each RGB tone value.

二値化処理部１４０は、輝度値Ｌの計算式（輝度値Ｌ≒０．３Ｒ＋０．６Ｇ＋０．１Ｂ）を使用し、画像データＩＤのＲＧＢの各階調値を使用して輝度値Ｌを算出してモノクロ画像データＭＤ０を生成する。二値化処理部１４０は、さらに、モノクロ画像データＭＤ０に対して二値化処理を実行して各画素が１ビットで表される二値データＢＤとする。ＦＡＸ処理部２５０は、データサイズが小さな二値データＢＤを使用してファクシミリの送信処理を実行することができる。 The binarization processing unit 140 uses the calculation formula for the brightness value L (brightness value L≈0.3R+0.6G+0.1B) and calculates the brightness value L using each RGB gradation value of the image data ID. Then, monochrome image data MD0 is generated. The binarization processing unit 140 further performs binarization processing on the monochrome image data MD0 to generate binary data BD in which each pixel is represented by 1 bit. The FAX processing unit 250 can perform facsimile transmission processing using binary data BD having a small data size.

このように、画像読取部１００は、原稿Ｄ上の画像を読み取って画像データＩＤを生成し、必要に応じて二値データＢＤを生成する。画像データＩＤは、原稿Ｄ上の画像をＲＧＢの各階調値（０～２５５）で表すＲＧＢ画像データである。 In this way, the image reading unit 100 reads the image on the document D, generates the image data ID, and generates the binary data BD as necessary. The image data ID is RGB image data that represents the image on the document D using RGB gradation values (0 to 255).

図３は、読取画像及び読取画像の輝度ヒストグラムを示す説明図である。図３（ａ）は、原稿画像Ｄ０を示している。原稿画像Ｄ０は、地肌の輝度が比較的に低い視認性の悪い画像である。原稿画像Ｄ０は、画像読取部１００の読取りによって生成された画像データＩＤによって表される画像である。 FIG. 3 is an explanatory diagram showing a read image and a brightness histogram of the read image. FIG. 3(a) shows the original image D0. The original image D0 is an image with relatively low background brightness and poor visibility. The original image D0 is an image represented by image data ID generated by reading by the image reading section 100.

図３（ｂ）は、モノクロ画像データＭＤ０の輝度ヒストグラムＨ０を示している。モノクロ画像データＭＤ０は、原稿画像Ｄ０を表す画像データＩＤの輝度値Ｌのヒストグラムである。輝度ヒストグラムの横軸は、輝度階調値（０（黒）乃至２５５（白））であり、縦軸は、画素数（頻度）である。輝度ヒストグラムＨ０は、黒文字のテキストや罫線といった線画の画像を構成する画素群である線画画素群と、地肌画像を構成する画素群である地肌画素群とを含んでいる。 FIG. 3(b) shows a brightness histogram H0 of the monochrome image data MD0. Monochrome image data MD0 is a histogram of brightness values L of image data ID representing original image D0. The horizontal axis of the brightness histogram is the brightness gradation value (0 (black) to 255 (white)), and the vertical axis is the number of pixels (frequency). The brightness histogram H0 includes a line pixel group that is a pixel group that makes up a line drawing image such as black text or a ruled line, and a background pixel group that is a pixel group that makes up a background image.

図４は、比較例に係る二値化処理及び比較例の二値化処理用閾値ＴＨｃを示す説明図である。二値化処理用閾値ＴＨｃは、比較例としての判別分析法（大津の方法）を用いて設定されている。判別分析法は、輝度ヒストグラムを暫定閾値によって２つのクラスに分け、各クラス内の分散値とクラス間の分散値からヒストグラムの形状を調べ、もっとも谷部に位置すると思われる閾値を採用する手法である。なお、判別分析法の処理内容については後述する。 FIG. 4 is an explanatory diagram showing the binarization process according to the comparative example and the binarization process threshold THc of the comparative example. The binarization processing threshold THc is set using a discriminant analysis method (Otsu's method) as a comparative example. Discriminant analysis is a method that divides a brightness histogram into two classes using a provisional threshold value, examines the shape of the histogram from the variance values within each class and the variance values between classes, and adopts the threshold value that is thought to be located in the lowest valley. be. Note that the processing details of the discriminant analysis method will be described later.

図４（ａ）は、比較例に係る二値化処理の処理結果としての二値化画像Ｄｃを示している。二値化画像Ｄｃは、二値化処理後の二値データＢＤによって表される画像である。二値データＢＤは、二値化処理部１４０によって、入力画像データとしてのモノクロ画像データＭＤ０の線画画素群の輝度階調値を「０（黒）」とし、地肌画素群のうち二値化処理用閾値ＴＨｃよりも輝度階調値が低い画素の輝度階調値を「０」とする一方、他の画素の輝度階調値を「１（白）」とすることによって生成される画像データである。 FIG. 4A shows a binarized image Dc as a result of the binarization process according to the comparative example. The binarized image Dc is an image represented by the binary data BD after the binarization process. The binary data BD is processed by the binarization processing unit 140 to set the brightness gradation value of the line pixel group of the monochrome image data MD0 as input image data to "0 (black)", and to perform the binarization processing on the background pixel group. Image data that is generated by setting the luminance gradation value of a pixel whose luminance gradation value is lower than the threshold value THc to "0", while setting the luminance gradation value of other pixels to "1 (white)". be.

図４（ｂ）は、比較例に係る二値化処理用閾値ＴＨｃを示している。二値化処理用閾値ＴＨｃは、黒文字のテキスト画像をパターン領域と判別するだけでなく、地肌画像のうち輝度の低い画素から構成される画像領域もパターン領域と判別している。パターン領域と判別された地肌画像の輝度は、二値化処理部１４０によって「０（黒）」に輝度が変更されるので、二値化画像Ｄｃは、二値化処理によって顕著に粒状性が悪化した（荒い）画像となっている（図４（ａ）参照）。 FIG. 4(b) shows the binarization processing threshold THc according to the comparative example. The binarization processing threshold THc not only discriminates a black text image as a pattern area, but also discriminates an image area composed of pixels with low brightness in a background image as a pattern area. The brightness of the background image determined to be a pattern area is changed to "0 (black)" by the binarization processing unit 140, so that the binarized image Dc has noticeable graininess due to the binarization process. The image is deteriorated (rough) (see FIG. 4(a)).

図５は、第１実施形態に係る二値化処理手順の内容を示すフローチャートである。第１実施形態に係る二値化処理は、上述のような地肌画像に起因する粒状性の悪化を顕著に抑制することができる。 FIG. 5 is a flowchart showing the contents of the binarization processing procedure according to the first embodiment. The binarization process according to the first embodiment can significantly suppress deterioration of graininess caused by the background image as described above.

ステップＳ１００では、二値化処理部１４０のフィルタ処理部１４１は、ローパスフィルタ処理を実行する。ローパスフィルタ処理では、二値化処理部１４０は、たとえば３００ｄｐｉのモノクロ画像データＭＤ０（入力画像データとも呼ばれる。）に対して、５×５画素のガウシアンフィルタを使用して平滑化処理を実行する。これにより、ローパスフィルタ処理は、地肌画像のノイズを平滑化してモノクロ画像データＭＤ１を生成することができる。地肌画像のノイズは、背景ノイズの一例である。 In step S100, the filter processing unit 141 of the binarization processing unit 140 performs low-pass filter processing. In the low-pass filter processing, the binarization processing unit 140 performs smoothing processing on, for example, 300 dpi monochrome image data MD0 (also referred to as input image data) using a 5×5 pixel Gaussian filter. Thereby, the low-pass filter processing can smooth noise in the background image and generate monochrome image data MD1. Noise in the background image is an example of background noise.

図６は、第１実施形態に係るローパスフィルタ処理の内容を示す説明図である。図６（ａ）は、ローパスフィルタ処理前のモノクロ画像データＭＤ０の輝度ヒストグラムＨ０を示している。図６（ｂ）は、ローパスフィルタ処理後のモノクロ画像データＭＤ１の輝度ヒストグラムＨ１を示している。輝度ヒストグラムＨ１から分かるように、離散的となっている地肌画素群の階調値分布が単一の山にまとまっている。 FIG. 6 is an explanatory diagram showing the contents of the low-pass filter processing according to the first embodiment. FIG. 6A shows a brightness histogram H0 of monochrome image data MD0 before low-pass filter processing. FIG. 6(b) shows a brightness histogram H1 of the monochrome image data MD1 after low-pass filter processing. As can be seen from the brightness histogram H1, the gradation value distribution of the discrete background pixel group is gathered into a single peak.

ローパスフィルタ処理は、モノクロ画像データＭＤ０の高周波成分（画素値の変動が激しい地肌の階調領域（ノイジーでギザギザな輝度成分））を平滑化し、これにより、背景ノイズ判定用閾値設定処理（ステップＳ３００）で適切な閾値を設定可能とするために実行される。 The low-pass filter process smoothes the high frequency component (the gradation area of the background where the pixel value fluctuates significantly (the noisy and jagged brightness component)) of the monochrome image data MD0, and thereby performs the background noise determination threshold setting process (step S300). ) to be able to set appropriate thresholds.

ステップＳ２００では、二値化処理部１４０の閾値設定部１４２は、ローパスフィルタ処理後の輝度ヒストグラムＨ１に基づいてページ閾値ＴＨＰを設定する。ページ閾値設定処理では、二値化処理部１４０は、判別分析法（大津の方法）を使用してページ閾値を設定する。ページ閾値ＴＨＰは、画像の全面に適用可能な二値化用閾値である。 In step S200, the threshold value setting unit 142 of the binarization processing unit 140 sets the page threshold value THP based on the brightness histogram H1 after the low-pass filter processing. In the page threshold setting process, the binarization processing unit 140 sets a page threshold using a discriminant analysis method (Otsu's method). The page threshold THP is a binarization threshold that can be applied to the entire image.

図７は、第１実施形態に係るページ閾値設定処理（ステップＳ２００）の内容を示すフローチャートである。図８は、第１実施形態に係るページ閾値設定処理の内容を示す説明図である。ステップＳ２１０では、二値化処理部１４０は、ヒストグラム解析部として機能し、暫定閾値を使用して輝度ヒストグラムＨ１を暫定的にクラス１（第１のクラスとも呼ばれる。）とクラス２（第２のクラスとも呼ばれる。）の２つのクラスにクラス分けする（図８（ａ）参照）。第２のクラスは、背景画像を比較的に多く含む側のクラスである。 FIG. 7 is a flowchart showing the contents of the page threshold setting process (step S200) according to the first embodiment. FIG. 8 is an explanatory diagram showing the contents of the page threshold setting process according to the first embodiment. In step S210, the binarization processing unit 140 functions as a histogram analysis unit, and uses the provisional threshold to temporarily divide the brightness histogram H1 into class 1 (also called the first class) and class 2 (second class). (also called class) (see FIG. 8(a)). The second class is a class that includes a relatively large number of background images.

ステップＳ２２０では、二値化処理部１４０は、クラス内分散算出処理を実行する。クラス内分散算出処理では、二値化処理部１４０は、計算式Ｆ１を使用してクラス内分散を算出する。ステップＳ２３０では、二値化処理部１４０は、クラス間分散算出処理を実行する。クラス間分散算出処理では、二値化処理部１４０は、計算式Ｆ２を使用してクラス間分散を算出する。ステップＳ２４０では、二値化処理部１４０は、全分散算出処理を実行する。全分散算出処理では、二値化処理部１４０は、計算式Ｆ３を使用して全分散を算出する。全分散は、暫定閾値の評価値として使用することができる。 In step S220, the binarization processing unit 140 executes intra-class variance calculation processing. In the intra-class variance calculation process, the binarization processing unit 140 calculates the intra-class variance using calculation formula F1. In step S230, the binarization processing unit 140 executes an inter-class variance calculation process. In the inter-class variance calculation process, the binarization processing unit 140 calculates the inter-class variance using calculation formula F2. In step S240, the binarization processing unit 140 executes total variance calculation processing. In the total variance calculation process, the binarization processing unit 140 calculates the total variance using calculation formula F3. The total variance can be used as an evaluation value for the provisional threshold.

二値化処理部１４０は、輝度階調値の全階調値でステップＳ２１０乃至ステップＳ２４０の処理を繰り返して実行する（ステップＳ２５０）。ステップＳ２６０では、二値化処理部１４０は、全分散が最大の階調値にページ閾値ＴＨＰを設定する。 The binarization processing unit 140 repeatedly executes the processing of steps S210 to S240 for all the luminance gradation values (step S250). In step S260, the binarization processing unit 140 sets the page threshold THP to the gradation value with the maximum total variance.

ステップＳ３００では、二値化処理部１４０は、背景ノイズ判定用閾値設定部として機能し、地肌ノイズ判定用の閾値である地肌ノイズ判定用閾値ＴＨＢを設定する。なお、本処理（ステップＳ３００）は、地肌が黒の場合には、スキップするようにしてもよい。地肌ノイズ判定用閾値ＴＨＢは、背景ノイズ判定用閾値とも呼ばれる。 In step S300, the binarization processing unit 140 functions as a background noise determination threshold setting unit and sets a background noise determination threshold THB, which is a threshold for background noise determination. Note that this process (step S300) may be skipped when the background is black. The background noise determination threshold THB is also called a background noise determination threshold.

図９は、第１実施形態及び比較例に係る地肌ノイズ判定用閾値設定処理の内容を示す説明図である。図１０は、第１実施形態及び比較例に係る地肌ノイズ除去処理の内容を示す説明図である。図９（ａ）は、第１実施形態に係る地肌ノイズ判定用閾値ＴＨＢを設定する方法を示している。図９（ｂ）は、比較例に係る地肌ノイズ判定用閾値設定処理の内容を示す説明図である。図１０（ａ）は、第１実施形態に係る地肌ノイズ除去処理の内容を示す説明図である。図１０（ｂ）は、比較例に係る地肌ノイズ除去処理の内容を示す説明図である。 FIG. 9 is an explanatory diagram showing the contents of the background noise determination threshold setting process according to the first embodiment and the comparative example. FIG. 10 is an explanatory diagram showing the details of the background noise removal processing according to the first embodiment and the comparative example. FIG. 9A shows a method for setting the background noise determination threshold THB according to the first embodiment. FIG. 9B is an explanatory diagram showing the contents of the background noise determination threshold setting process according to the comparative example. FIG. 10A is an explanatory diagram showing the details of the background noise removal process according to the first embodiment. FIG. 10(b) is an explanatory diagram showing the details of the background noise removal process according to the comparative example.

この例では、二値化処理部１４０は、地肌画像の分布のピーク値ＰＢの画素数であるピーク画素数の所定範囲の割合（たとえば１０％）の画素数（頻度）となる輝度階調値を、ピーク値ＰＢの輝度階調値である背景ピーク階調値の側から輝度階調値の低い側（濃度が濃くなる側）に検索する。 In this example, the binarization processing unit 140 generates a brightness gradation value that is the number of pixels (frequency) of a predetermined range (for example, 10%) of the peak pixel number, which is the number of pixels of the peak value PB of the distribution of the background image. is searched from the side of the background peak gradation value, which is the luminance gradation value of the peak value PB, to the side of the lower luminance gradation value (the side where the density becomes higher).

比較例に係る地肌ノイズ判定用閾値ＴＨＢ（図９（ｂ））では、地肌画素群の階調値分布が離散的（ギザギザ）となっているので、二値化処理部１４０は、地肌ノイズを効果的に排除することができる地肌ノイズ判定用閾値を設定してことができないことが分かる（図１０（ｂ）参照）。これに対して、第１実施形態に係る地肌ノイズ判定用閾値ＴＨＢ（図９（ａ））では、地肌画素群の階調値分布が単一の山にまとまっているので、二値化処理部１４０は、円滑に適切な地肌ノイズ判定用閾値を設定して地肌ノイズを効果的に排除することができる（図１０（ａ））。 In the background noise determination threshold THB (FIG. 9(b)) according to the comparative example, the gradation value distribution of the background pixel group is discrete (jagged), so the binarization processing unit 140 eliminates the background noise. It can be seen that it is not possible to set a threshold for background noise determination that can effectively eliminate background noise (see FIG. 10(b)). On the other hand, in the background noise determination threshold THB (FIG. 9(a)) according to the first embodiment, the gradation value distribution of the background pixel group is gathered into a single peak, so the binarization processing unit 140 can effectively eliminate background noise by smoothly setting an appropriate background noise determination threshold (FIG. 10(a)).

これにより、二値化処理部１４０は、検索された閾値を地肌ノイズ判定用閾値ＴＨＢとして設定することができる。輝度ヒストグラムＨ１では、上述のように高周波成分（画素値の変動が激しい地肌の階調領域（ノイジーでギザギザな輝度成分））が平滑化されているので、二値化処理部１４０は、地肌の残存が過度とならないような地肌ノイズ判定用閾値ＴＨＢを円滑に設定することができる。所定範囲の割合は、５％乃至２０％が好ましく、５％乃至１５％がより好まく、１０％の近傍が最も好ましい。 Thereby, the binarization processing unit 140 can set the searched threshold value as the background noise determination threshold value THB. In the brightness histogram H1, as described above, the high frequency component (the gradation region of the background where pixel values fluctuate significantly (the noisy and jagged brightness component)) is smoothed, so the binarization processing unit 140 It is possible to smoothly set the background noise determination threshold THB so that the residual amount does not become excessive. The ratio in the predetermined range is preferably 5% to 20%, more preferably 5% to 15%, and most preferably around 10%.

ステップＳ４００では、二値化処理部１４０は、ページ閾値ＴＨＰと地肌ノイズ判定用閾値ＴＨＢとを比較し、ページ閾値ＴＨＰが地肌ノイズ判定用閾値ＴＨＢよりも大きな場合には、処理をステップＳ５００に進め、ページ閾値ＴＨＰが地肌ノイズ判定用閾値ＴＨＢ以下の場合には、処理をステップＳ６００に進める。 In step S400, the binarization processing unit 140 compares the page threshold THP and the background noise determination threshold THB, and if the page threshold THP is larger than the background noise determination threshold THB, the process proceeds to step S500. , if the page threshold THP is less than or equal to the background noise determination threshold THB, the process advances to step S600.

ステップＳ５００では、二値化処理部１４０は、地肌ノイズ判定用閾値ＴＨＢの値をページ閾値ＴＨＰの値で置き換えて再設定する。これにより、二値化処理部１４０は、地肌ノイズ判定用閾値ＴＨＢが過度に小さく設定された場合に、地肌ノイズ除去処理で過度に地肌ノイズが消去され、残すべきテキスト画像等が削除されるといたことが発生しないようにすることができる。 In step S500, the binarization processing unit 140 replaces and resets the value of the background noise determination threshold THB with the value of the page threshold THP. As a result, the binarization processing unit 140 can detect that when the background noise determination threshold THB is set too small, the background noise is excessively erased in the background noise removal process, and text images etc. that should be left are deleted. You can prevent what happened.

ステップＳ６００では、二値化処理部１４０は、背景ノイズ低減処理部として機能し、地肌ノイズ除去処理（背景ノイズ低減処理とも呼ばれる。）を実行する。地肌ノイズ除去処理では、二値化処理部１４０は、地肌ノイズ判定用閾値ＴＨＢよりも輝度階調値が高い（濃度が薄い）画素の画素値を地肌ノイズ判定用閾値ＴＨＢの輝度階調値に設定する。 In step S600, the binarization processing unit 140 functions as a background noise reduction processing unit and executes background noise removal processing (also referred to as background noise reduction processing). In the background noise removal process, the binarization processing unit 140 converts the pixel value of a pixel whose luminance gradation value is higher (lower density) than the background noise determination threshold THB into the luminance gradation value of the background noise determination threshold THB. Set.

これにより、二値化処理部１４０は、輝度ヒストグラムＨ２を有するモノクロ画像データＭＤ２を生成することができる。輝度ヒストグラムＨ２では、地肌画像の輝度階調値分布が地肌ノイズ判定用閾値ＴＨＢに集中して平滑化され、コントラストが低くなるため後述の二値化処理（ステップＳ７００）においてノイズ（特にエッジ）のパターン画像との誤判別を抑制することができる。 Thereby, the binarization processing unit 140 can generate monochrome image data MD2 having the brightness histogram H2. In the brightness histogram H2, the brightness gradation value distribution of the background image is smoothed to concentrate on the background noise determination threshold THB, and the contrast becomes low. Misidentification with pattern images can be suppressed.

図１１は、第１実施形態に係る地肌ノイズ除去処理及び二値化処理の内容を示す説明図である。この例では、二値化処理部１４０は、後述の二値化処理（ステップＳ７００）とは異なり、二値化処理の閾値をページ閾値ＴＨＰに固定して処理した場合を想定している。図１１（ａ）は、地肌ノイズ除去処理後のモノクロ画像Ｄ１を示している。モノクロ画像Ｄ１は、地肌画像において、顕著に輝度の低い画素の輝度が高くなる一方、顕著に輝度の高い画素の輝度が低くなるように処理されているので、モノトーンの粒状性が向上した（細かい）背景画像を有している。 FIG. 11 is an explanatory diagram showing the contents of the background noise removal process and the binarization process according to the first embodiment. In this example, unlike the binarization process (step S700) described later, it is assumed that the binarization processing unit 140 performs processing by fixing the threshold value of the binarization process to the page threshold value THP. FIG. 11A shows a monochrome image D1 after background noise removal processing. The monochrome image D1 is processed so that the luminance of pixels with significantly low luminance increases while the luminance of pixels with significantly high luminance decreases in the background image, so the graininess of the monochrome image has improved (fine ) has a background image.

図１１（ｂ）は、第１実施形態に係る二値化処理後の二値画像Ｄ２を示している。二値画像Ｄ２は、第１実施形態に係るページ閾値ＴＨＰの設定によって、パターン領域の画素と誤判別される背景領域の画素が顕著に減少して、粒状性を悪化させることなく適切な画像となっている。このような誤判別の低減は、地肌ノイズ除去処理によって粒状性が低下し、局所コントラストが低下したことによって実現されている。 FIG. 11(b) shows the binary image D2 after the binarization process according to the first embodiment. By setting the page threshold value THP according to the first embodiment, the binary image D2 has a significantly reduced number of pixels in the background area that are misidentified as pixels in the pattern area, so that the binary image D2 becomes an appropriate image without deteriorating graininess. It has become. Such a reduction in misjudgment is achieved by reducing graininess and reducing local contrast through background noise removal processing.

このように、第１実施形態に係る画像形成装置１０は、二値化処理用の閾値を設定する前に前処理としてのローパスフィルタ処理を実行する。これにより、二値化処理部１４０は、地肌画像のノイズを低減させるとともに輝度階調値の分散を小さくすることができる。地肌画像のノイズ除去は、二値化処理に伴う粒状性の悪化を低減させることができる。輝度階調値の分散の低減は、地肌画像の輝度階調値の平均値を中心として地肌画像の分布の幅を狭くするとともに、分布の階調値変化を滑らかにして地肌ノイズ判定用閾値ＴＨＢの設定を円滑化することができる。 In this way, the image forming apparatus 10 according to the first embodiment executes low-pass filter processing as pre-processing before setting the threshold value for binarization processing. Thereby, the binarization processing unit 140 can reduce the noise in the background image and reduce the variance of the brightness gradation values. Removal of noise from the background image can reduce deterioration in graininess caused by binarization processing. Reducing the variance of the brightness gradation values is achieved by narrowing the width of the distribution of the background image around the average value of the brightness gradation values of the background image, and by smoothing the changes in the gradation values of the distribution to set the background noise determination threshold THB. settings can be made more smoothly.

図１２は、第１実施形態に係る二値化処理（ステップＳ７００）の内容を示すフローチャートである。ステップＳ７１０では、二値化処理部１４０は、着目局所領域選択部として機能し、モノクロ画像データＭＤ２（入力画像データとも呼ばれる。）を構成する複数の画素を（Ｍ×Ｎ）画素（Ｍ及びＮは、たとえば３以上の整数）から構成される複数の局所領域に分割し、前記分割された複数の局所領域を着目局所領域として順に選択する。着目局所領域は、たとえば参照サイズが５×５画素の領域である。参照サイズは、たとえば処理対象となる画像の解像度等に応じて設定される。 FIG. 12 is a flowchart showing the contents of the binarization process (step S700) according to the first embodiment. In step S710, the binarization processing unit 140 functions as a local region of interest selection unit and converts a plurality of pixels constituting the monochrome image data MD2 (also referred to as input image data) into (M×N) pixels (M and N (for example, an integer of 3 or more), and the divided local regions are sequentially selected as local regions of interest. The local area of interest is, for example, an area with a reference size of 5×5 pixels. The reference size is set depending on, for example, the resolution of the image to be processed.

図１３は、第１実施形態に係る着目局所領域選択の様子を示す説明図である。図１４は、第１実施形態に係る切替用閾値ＴＨＳを示すグラフである。この例では、２つの局所領域ＬＲ１，ＬＲ２が示されている。着目局所領域として、局所領域ＬＲ１が選択されたときには、黒文字のテキスト画像が参照範囲に入っているので、最小輝度値は０に近い値となり、最大輝度値は地肌の輝度値、すなわち地肌ノイズ判定用閾値ＴＨＢの近傍の輝度値となる。一方、局所領域ＬＲ２が選択されたときには、薄いテキスト画像が参照範囲に入っているので、最小輝度値は薄いテキスト画像の輝度値となって、地肌ノイズ判定用閾値ＴＨＢの近傍の輝度値となり、最大輝度値は地肌の輝度値となる。 FIG. 13 is an explanatory diagram showing how to select a local region of interest according to the first embodiment. FIG. 14 is a graph showing the switching threshold THS according to the first embodiment. In this example, two local regions LR1 and LR2 are shown. When the local region LR1 is selected as the local region of interest, the black text image is within the reference range, so the minimum brightness value is close to 0, and the maximum brightness value is the brightness value of the background, that is, the background noise determination. The brightness value is near the threshold value THB. On the other hand, when the local region LR2 is selected, since the thin text image is within the reference range, the minimum brightness value becomes the brightness value of the thin text image, and becomes the brightness value near the background noise determination threshold THB, The maximum brightness value is the brightness value of the background.

ステップＳ７２０では、二値化処理部１４０は、最大輝度値取得処理を実行する。最大輝度値取得処理では、二値化処理部１４０は、着目局所領域内の最大輝度値を取得する。具体的には、二値化処理部１４０は、着目局所領域が局所領域ＬＲ１及び局所領域ＬＲ２のいずれの場合においても、各局所領域内の地肌の最大輝度値を取得することになる。この例では、二値化処理部１４０は、局所領域ＬＲ２において、局所最大輝度値Ｌ＿Ｍａｘ（輝度値１７０）を取得したものとする（図１４（ａ）参照）。最大輝度値は、輝度の最大階調値であり、最も濃度が低い側の濃度階調値である最大階調値でもよい。 In step S720, the binarization processing unit 140 executes maximum brightness value acquisition processing. In the maximum brightness value acquisition process, the binarization processing unit 140 acquires the maximum brightness value within the local area of interest. Specifically, the binarization processing unit 140 obtains the maximum brightness value of the background in each local area, regardless of whether the local area of interest is the local area LR1 or the local area LR2. In this example, it is assumed that the binarization processing unit 140 has acquired the local maximum brightness value L_Max (brightness value 170) in the local region LR2 (see FIG. 14(a)). The maximum luminance value is the maximum gradation value of luminance, and may be the maximum gradation value that is the density gradation value on the lowest density side.

ステップＳ７３０では、二値化処理部１４０は、切替用閾値設定部として機能し、切替用閾値設定処理を実行する。切替用閾値設定処理では、二値化処理部１４０は、記憶部２４０から予め設定されている二値化閾値切替用閾値テーブル（閾値切替テーブルとも呼ばれる。）Ｔを読出して切替用閾値ＴＨＳを設定可能な状態とする。この例では、二値化処理部１４０は、局所最大輝度値Ｌ＿Ｍａｘ（輝度値１７０）に基づき、切替用閾値ＴＨＳとして、「３０」を取得したものとする。 In step S730, the binarization processing unit 140 functions as a switching threshold setting unit and executes switching threshold setting processing. In the switching threshold setting process, the binarization processing unit 140 reads out a preset binarization threshold switching threshold table (also referred to as a threshold switching table) T from the storage unit 240 and sets the switching threshold THS. Make it possible. In this example, it is assumed that the binarization processing unit 140 has acquired "30" as the switching threshold THS based on the local maximum luminance value L_Max (luminance value 170).

閾値切替テーブルＴは、計算式Ｆ４で定義される切替用閾値ＴＨＳを離散化してテーブルとしたものである。切替用閾値ＴＨＳは、背景領域（この例では地肌画像）の輝度が低いほど、局所領域内で発生しているコントラストがパターン領域に起因するのではなく、背景領域の画像のノイズに起因する可能性が高いという発明者の知見に基づいて設定されている。本願発明者は、最大階調値に応じて切替用閾値ＴＨＳを変化させることが好ましいことを実験やシミュレーションで見出した。具体的には、切替用閾値ＴＨＳは、最大階調値が小さくなるほど（濃度が濃くなるほど）大きくなるように設定し、特に線形に変化するように設定することが好ましいことが見いだされた。 The threshold switching table T is a table obtained by discretizing the switching threshold THS defined by the calculation formula F4. The switching threshold THS indicates that the lower the brightness of the background area (the background image in this example), the more likely that the contrast occurring in the local area is not due to the pattern area but to noise in the image of the background area. This setting is based on the inventor's knowledge that the The inventor of the present application found through experiments and simulations that it is preferable to change the switching threshold THS according to the maximum gradation value. Specifically, it has been found that it is preferable to set the switching threshold THS to become larger as the maximum gradation value becomes smaller (as the density becomes darker), and particularly to set it so that it changes linearly.

ステップＳ７４０では、二値化処理部１４０は、局所コントラスト算出処理を実行する。局所コントラスト算出処理では、二値化処理部１４０は、局所領域内の５×５の２５画素のうちの最大輝度値（この例では１７０）と最小輝度値（この例では１２０）の間の差を局所コントラストＬＣとして算出する（計算式Ｆ５参照）。この例では、二値化処理部１４０は、局所コントラストＬＣとして「５０」（＝１７０－１２０）を取得したものとする。最小輝度値は、輝度の最小階調値であり、最も濃度が濃い側の階調値である最小階調値でもよい。 In step S740, the binarization processing unit 140 executes local contrast calculation processing. In the local contrast calculation process, the binarization processing unit 140 calculates the difference between the maximum brightness value (170 in this example) and the minimum brightness value (120 in this example) of 25 pixels of 5×5 in the local area. is calculated as the local contrast LC (see calculation formula F5). In this example, it is assumed that the binarization processing unit 140 obtains "50" (=170-120) as the local contrast LC. The minimum luminance value is the minimum gradation value of luminance, and may be the minimum gradation value that is the gradation value on the darkest side.

ステップＳ７５０では、二値化処理部１４０は、局所コントラストＬＣが切替用閾値ＴＨＳよりも大きい場合には、処理をステップＳ７６０に進め、局所コントラストＬＣが切替用閾値ＴＨＳ以下の場合には、処理をステップＳ７７０に進める。これにより、二値化処理部１４０は、背景領域のノイズがパターン領域のコントラストであるとの誤判別や文字の中抜けを抑制することができる。 In step S750, the binarization processing unit 140 advances the process to step S760 if the local contrast LC is greater than the switching threshold THS, and if the local contrast LC is less than or equal to the switching threshold THS, the binarization processing unit 140 continues the process. The process advances to step S770. Thereby, the binarization processing unit 140 can suppress erroneous determination that noise in the background area is the contrast of the pattern area and suppress hollow characters.

ステップＳ７６０では、二値化処理部１４０は、局所領域ＬＲ２の二値化処理に使用される二値化処理用閾値として局所閾値ＴＨＬ（計算式Ｆ６参照）を設定する。この例では、計算式Ｆ６において、係数αは、たとえば０．１以上の数値である。ステップＳ７７０では、二値化処理部１４０は、局所領域ＬＲ２の二値化処理に使用される二値化処理用閾値としてページ閾値ＴＨＰを設定する。ステップＳ７１０乃至ステップＳ７７０の処理は、全ての局所領域に対して実行される（ステップＳ７８０）。なお、局所閾値は、最大階調値と最小階調値の間の範囲で設定されるものであればよい。 In step S760, the binarization processing unit 140 sets the local threshold THL (see calculation formula F6) as the binarization processing threshold used for the binarization processing of the local region LR2. In this example, in calculation formula F6, the coefficient α is, for example, a numerical value of 0.1 or more. In step S770, the binarization processing unit 140 sets the page threshold THP as the binarization processing threshold used for the binarization processing of the local region LR2. The processes from step S710 to step S770 are executed for all local regions (step S780). Note that the local threshold value may be set within a range between the maximum gradation value and the minimum gradation value.

図１５は、第１実施形態に係る二値化処理の内容を示す説明図である。図１５（ａ）は、薄いテキスト画像を示す原稿画像Ｄ０ａを示している。原稿画像Ｄ０ａは、原稿画像Ｄ０（図３（ａ）参照）と同様の背景画像を有しているが、説明を分かりやすくするために、地肌を除去して示している。図１５（ｂ）は、比較例に係る二値化処理後の画像Ｄｃａを示している。この例において、画像Ｄｃａでは、薄いテキスト画像が背景画像と誤判別されて二値化処理で消滅している。図１５（ｂ）は、第１実施形態に係る二値化処理後の画像Ｄ２を示している。この例において、画像Ｄ２は、薄いテキスト画像がパターン領域の画像と判別されて二値化処理で明確に再現されている。 FIG. 15 is an explanatory diagram showing the contents of the binarization process according to the first embodiment. FIG. 15(a) shows a document image D0a showing a thin text image. The original image D0a has the same background image as the original image D0 (see FIG. 3(a)), but the background image is shown with the background image removed to make the explanation easier to understand. FIG. 15(b) shows the image Dca after the binarization process according to the comparative example. In this example, in image Dca, a thin text image is erroneously determined to be a background image and disappears in the binarization process. FIG. 15(b) shows the image D2 after the binarization process according to the first embodiment. In this example, in image D2, a thin text image is determined to be an image of a pattern area and is clearly reproduced by binarization processing.

このように、第１実施形態に係る画像形成装置１０は、局所領域内の最大輝度値に応じて輝度階調値に応じて切替用閾値ＴＨＳを調整しつつ、切替用閾値ＴＨＳに基づいて局所閾値ＴＨＬとページ閾値ＴＨＰとを適切に切り替えて二値化処理を実行する。これにより、低濃度画像（薄いテキスト等）の判別と画像ノイズの抑制の両立を改善することができる。一方、本画像形成装置１０は、地肌ノイズ除去処理によって地肌ノイズのコントラストを低減させているので、背景領域のノイズが低濃度画像のコントラストであるとの誤判別を抑制することもできる。 In this way, the image forming apparatus 10 according to the first embodiment adjusts the switching threshold THS according to the luminance gradation value according to the maximum luminance value in the local area, and locally adjusts the switching threshold THS based on the switching threshold THS. The binarization process is executed by appropriately switching between the threshold THL and the page threshold THP. Thereby, it is possible to improve both the discrimination of low-density images (such as thin text) and the suppression of image noise. On the other hand, since the present image forming apparatus 10 reduces the contrast of the background noise through the background noise removal process, it is also possible to suppress erroneous determination that noise in the background area is the contrast of a low-density image.

Ｂ．第２実施形態：
図１６は、第２実施形態に係る二値化用閾値調整処理（ステップＳ８００）の内容を示すフローチャートである。第２実施形態に係る二値化処理は、第１実施形態による二値化処理の後で、ユーザーの希望に応じて二値化用閾値を調整し、改めて二値化処理を実行可能となるように構成されている。第２実施形態に係る二値化処理は、第１実施形態の二値化処理に対して、二値化用閾値調整処理（ステップＳ８００）が処理の最後に追加されている点で第１実施形態と相違する。 B. Second embodiment:
FIG. 16 is a flowchart showing the contents of the binarization threshold adjustment process (step S800) according to the second embodiment. In the binarization process according to the second embodiment, after the binarization process according to the first embodiment, the binarization threshold can be adjusted according to the user's wishes, and the binarization process can be executed again. It is configured as follows. The binarization process according to the second embodiment is different from the binarization process of the first embodiment in that a binarization threshold adjustment process (step S800) is added at the end of the process. It differs from the form.

図１７は、第２実施形態に係る操作表示画面を示す説明図である。二値化用閾値調整処理（ステップＳ８００）は、二値化処理後において、操作表示部２３０への所定のユーザー入力に応じて対話型微調整処理モードとして起動される。操作表示部２３０は、ユーザーインターフェース画面２３１とスタートボタン２３２とを有している。ユーザーインターフェース画面２３１は、対話型微調整処理モードを有し、選択アイコン２３３と、解除アイコン２３４と、ＯＫアイコン２３５と、濃度減少アイコン２３７ｄと、濃度増加アイコン２３７ｕとを有している。 FIG. 17 is an explanatory diagram showing an operation display screen according to the second embodiment. The binarization threshold adjustment process (step S800) is activated as an interactive fine adjustment process mode in response to a predetermined user input to the operation display unit 230 after the binarization process. The operation display section 230 has a user interface screen 231 and a start button 232. The user interface screen 231 has an interactive fine adjustment processing mode, and includes a selection icon 233, a cancel icon 234, an OK icon 235, a density decrease icon 237d, and a density increase icon 237u.

ステップＳ８１０では、操作表示部２３０は、領域選択のためのユーザー入力を受け付ける。ユーザーは、選択アイコン２３３をタッチして反転させた状態において、ユーザーインターフェース画面２３１上において指をスライドさせて選択領域ＳＲを指定することができる。選択領域を改めて選択するときには、解除アイコン２３４をタッチすることによって選択領域ＳＲの指定を解除することができる。この状態において、ＯＫアイコン２３５をタッチすると、操作表示部２３０は、文字入力用ポップアップ画面（図示略）を表示する。 In step S810, the operation display unit 230 receives user input for region selection. The user can designate the selection region SR by touching and inverting the selection icon 233 and sliding his or her finger on the user interface screen 231. When reselecting the selection area, the designation of the selection area SR can be canceled by touching the cancellation icon 234. In this state, when the OK icon 235 is touched, the operation display section 230 displays a pop-up screen for character input (not shown).

ステップＳ８２０では、操作表示部２３０は、文字入力のためのポップアップ画面（図示略）においてユーザー入力を受け付ける。ユーザーが文字を入力し、あるいは入力することなく、ポップアップ画面上のＯＫアイコン（図示略）をタッチすると、処理がステップＳ８３０に進められる。 In step S820, the operation display unit 230 accepts user input on a pop-up screen (not shown) for inputting characters. When the user inputs characters or touches an OK icon (not shown) on the pop-up screen without inputting characters, the process proceeds to step S830.

ステップＳ８３０では、制御部２１０のＯＣＲ処理部２１１は、ＯＣＲ確度算出処理を実行する。ＯＣＲ確度算出処理では、ＯＣＲ処理部２１１は、入力された文字の認識を実行し、ＯＣＲ確度（認識の確からしさ）を数値化し、各文字の認識確度の積算値を算出する。ＯＣＲ処理部２１１は、第１実施形態に係る閾値ＴＨ１やページ閾値の際に設定されたクラス１の平均輝度値Ｍ_１（図８参照）とクラス２の平均輝度値Ｍ_２との間において閾値を変更しつつＯＣＲ確度を閾値毎に算出する（ステップＳ８４０）。ＯＣＲ処理部２１１は、文字認識処理部とも呼ばれる。平均輝度値Ｍ_１は、第１の平均階調値とも呼ばれる。平均輝度値Ｍ_２は、第２の平均階調値とも呼ばれる。 In step S830, the OCR processing unit 211 of the control unit 210 executes OCR accuracy calculation processing. In the OCR accuracy calculation process, the OCR processing unit 211 recognizes input characters, digitizes the OCR accuracy (recognition certainty), and calculates an integrated value of the recognition accuracy of each character. The OCR processing unit 211 sets a threshold value between the average brightness value M ₁ of class 1 (see FIG. 8) and the average brightness value M ₂ of class 2, which are set at the time of threshold TH1 and page threshold according to the first embodiment. The OCR accuracy is calculated for each threshold while changing the OCR accuracy (step S840). The OCR processing section 211 is also called a character recognition processing section. The average brightness value _M1 is also called the first average gradation value. The average brightness value _M2 is also called the second average gradation value.

ステップＳ８５０では、制御部２１０のＯＣＲ処理部２１１は、最高確度閾値を設定する。最高確度閾値は、入力された各文字の認識確度の積算値が最も高い閾値である。二値化処理部１４０は、最高確度閾値を使用して選択領域ＳＲの二値化処理を実行し、二値化処理後の二値化画像をユーザーインターフェース画面２３１に実時間で表示する。なお、たとえばユーザーの選択によって、最高確度閾値を使用して二値化処理が行われる対象をページ全体とするようにしてもよい。 In step S850, the OCR processing unit 211 of the control unit 210 sets the highest accuracy threshold. The highest accuracy threshold is a threshold with the highest cumulative value of recognition accuracy for each input character. The binarization processing unit 140 executes the binarization process on the selected region SR using the highest accuracy threshold value, and displays the binarized image after the binarization process on the user interface screen 231 in real time. Note that, for example, depending on the user's selection, the entire page may be subjected to the binarization process using the highest accuracy threshold.

ステップＳ８６０では、操作表示部２３０は、微調整処理のためのユーザー入力を受け付ける。微調整処理では、濃度減少アイコン２３７ｄをタッチすることによって閾値を大きくする微調整を可能とし、濃度増加アイコン２３７ｕをタッチすることによって閾値を小さくする微調整を可能とする。 In step S860, the operation display unit 230 receives user input for fine adjustment processing. In the fine adjustment process, touching the density decrease icon 237d allows fine adjustment to increase the threshold value, and touching the density increase icon 237u allows fine adjustment to decrease the threshold value.

このように、第２実施形態に係る画像形成装置１０は、第１実施形態における自動処理の結果に満足できない場合には、特定の領域又はページ全体において、クラス１の平均輝度値Ｍ_１とクラス２の平均輝度値Ｍ_２との間においてＯＣＲ確度が最も高い閾値を自動的に変更して、たとえば自動処理の二値化処理で消滅した文字を対話型微調整処理モードで再現することができる。 In this manner, when the image forming apparatus 10 according to the second embodiment is not satisfied with the result of the automatic processing in the first embodiment, the average brightness value _M1 of class 1 and the class By automatically changing the threshold value with the highest OCR accuracy between the average brightness value M2 of 2 and the average brightness value _M2 of 2, it is possible to reproduce, for example, characters that disappeared during automatic binarization processing in the interactive fine-tuning processing mode. .

これにより、画像形成装置１０は、たとえば異常画像（全白や全黒）に簡易に対処できるとともに、ユーザー好みの二値化処理を実現する事ができる。なお、濃度減少アイコン２３７ｄ及び濃度増加アイコン２３７ｕは、ステップＳ８１０乃至ステップＳ８５０の処理を実行しなかった場合にも全階調値の範囲又は平均輝度値Ｍ_１と平均輝度値Ｍ_２との間で利用可能とすることが好ましい。 Thereby, the image forming apparatus 10 can easily deal with, for example, an abnormal image (all white or all black), and can also realize binarization processing according to the user's preference. Note that the density decrease icon 237d and the density increase icon 237u are displayed within the range of all gradation values or between the average brightness value _M1 and the average brightness value _M2 even if the processes of steps S810 to S850 are not executed. It is preferable to make it available.

Ｃ．変形例：
本発明は、上記実施形態だけでなく、以下のような変形例でも実施することができる。 C. Variant:
The present invention can be implemented not only in the above embodiment but also in the following modifications.

変形例１：上記実施形態では、ローパスフィルタ処理で使用されるフィルタは、ガウシアンフィルタであるが、それに限定されず、たとえば移動平均フィルタを使用してもよい。さらに、背景画像（たとえば地肌画像）のノイズを除去できる特性を有していれば、ＤＯＧ（ＤｉｆｆｅｒｅｎｃｅｏｆＧａｕｓｓｉａｎ）フィルタ（ガウシアン差分フィルタとも呼ばれる。）のようなノイズの除去とともに線画の強調が可能なバンドパスフィルタも利用可能である。 Modification 1: In the above embodiment, the filter used in the low-pass filter processing is a Gaussian filter, but is not limited thereto, and for example, a moving average filter may be used. Furthermore, if it has the characteristic of removing noise from a background image (for example, a background image), it is possible to remove noise and emphasize line drawings, such as with a DOG (Difference of Gaussian) filter (also called a Gaussian difference filter). Bandpass filters are also available.

図１８は、変形例に係るフィルタ処理の内容を示す説明図である。図１８（ａ）は、図９（ａ）と同一であり、ガウシアンフィルタ処理後のモノクロ画像データＭＤ１のヒストグラムＨ１を示している。図１８（ｂ）は、ＤＯＧフィルタ処理後のモノクロ画像データＭＤ１ａのヒストグラムＨ１ａを示している。ヒストグラムＨ１ａでは、薄文字のテキストが強調されている。これにより、二値化処理の閾値設定において、薄文字のテキストの残存性を高めるような適切な閾値を設定可能とすることができる。 FIG. 18 is an explanatory diagram showing the contents of filter processing according to a modified example. FIG. 18(a) is the same as FIG. 9(a), and shows the histogram H1 of the monochrome image data MD1 after Gaussian filter processing. FIG. 18(b) shows a histogram H1a of the monochrome image data MD1a after the DOG filter processing. In the histogram H1a, light text is emphasized. Thereby, in threshold setting for binarization processing, it is possible to set an appropriate threshold that increases the survivability of light text.

変形例２：上記実施形態では、輝度階調値と輝度ヒストグラムが使用されているが、たとえば濃度階調値や濃度ヒストグラムを使用してもよい。濃度は、たとえばＫの色材を使用して印刷したと仮定した場合のＫの色材の濃度階調値である。 Modification 2: In the above embodiment, brightness gradation values and brightness histograms are used, but for example, density gradation values and density histograms may be used. The density is, for example, the density gradation value of the K color material when printing is performed using the K color material.

変形例３：上記実施形態では、本発明は、画像形成装置に適用されているが、必ずしもい画像形成装置に限定されず、画像読み取り装置や携帯端末用のような画像処理装置として機能する電子機器に適用可能である。 Modified Example 3: In the above embodiment, the present invention is applied to an image forming apparatus, but is not necessarily limited to an image forming apparatus, and is applicable to an electronic device that functions as an image processing apparatus such as an image reading apparatus or a mobile terminal. Applicable to equipment.

１０画像形成装置
２１０制御部
２２０画像形成部
２３０操作表示部
２４０記憶部
１００画像読取部
１１１光源ドライバ
１１２光源
１２１イメージセンサ
１２２受光素子
１２３信号処理部
１２４シェーディング補正部
１３０ＡＧＣ処理部
１４０二値化処理部
１４１フィルタ処理部
１４２閾値設定部
１５０原稿台
１６０自動原稿送り装置
１６２原稿読取スリット

10 Image forming apparatus 210 Control section 220 Image forming section 230 Operation display section 240 Storage section 100 Image reading section 111 Light source driver 112 Light source 121 Image sensor 122 Light receiving element 123 Signal processing section 124 Shading correction section 130 AGC processing section 140 Binarization processing Section 141 Filter processing section 142 Threshold setting section 150 Document table 160 Automatic document feeder 162 Document reading slit

Claims

a filter processing unit that performs low-pass filter processing as preprocessing on input image data before setting a threshold for binarization processing;
a threshold setting unit that sets a page threshold and a local threshold, which are set using a discriminant analysis method, as thresholds for binarization processing, based on the input image data that has been subjected to the low-pass filter processing;
A plurality of pixels included in the input image data subjected to the low-pass filter processing are divided into local regions composed of (M×N) pixels (M and N are integers of 3 or more), and the divided plurality of pixels are a local region of interest selection unit that sequentially selects local regions of interest as local regions of interest;
Obtain the maximum gradation value, which is the gradation value on the lowest density side among the gradation values of the plurality of pixels constituting the local area of interest, and switch so that the smaller the maximum gradation value is, the larger the gradation value becomes. a switching threshold setting unit that sets a threshold for
The local contrast, which is the difference between the maximum gradation value and the minimum gradation value among the gradation values of a plurality of pixels included in the local area of interest, is compared with the switching threshold, and the local contrast is determined by comparing the switching threshold with the local contrast. is larger than the switching threshold, a value obtained by multiplying the local contrast by a predetermined coefficient in the range between the maximum gradation value and the minimum gradation value is added to the minimum gradation value. Binarization processing that executes binarization processing using the set local threshold, and executes binarization processing using the page threshold when the local contrast is less than or equal to the switching threshold. Department and
A first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, are determined by the discriminant analysis method on the input image data that has been subjected to the low-pass filter processing . a histogram analysis unit that calculates a second average gradation value that is an average gradation value of the second class;
an operation display section that accepts user input for specifying a selection area ;
a character recognition processing unit that recognizes characters included in the input image data in the selection area, calculates recognition accuracy that is the certainty of the recognition, and calculates an integrated value of the recognition accuracy of each character;
Equipped with
The binarization processing unit executes binarization processing using the changed threshold while changing the threshold between the first average gradation value and the second average gradation value . ,
The character recognition processing unit calculates the recognition accuracy for each of the changed thresholds, and sets a highest accuracy threshold that is a threshold with the highest cumulative value of recognition accuracy for each input character,
The binarization processing unit executes binarization processing of the input image data within the selected area using the highest accuracy threshold,
The operation display unit is an image processing device that displays the image after the binarization process.

The image processing device according to claim 1, further comprising:
comprising a character recognition processing unit that recognizes characters included in the input image data, calculates recognition accuracy that is the certainty of the recognition, and calculates an integrated value of the recognition accuracy of each character,
The binarization processing unit changes the threshold between the first average gradation value and the second average gradation value,
The character recognition processing unit calculates the integrated value for each of the selected threshold values,
The binarization processing unit is an image processing device that executes binarization processing using a threshold value having the largest integrated value.

The image processing device according to claim 1 or 2, further comprising :
A brightness histogram of the image data subjected to the low-pass filter processing, whose horizontal axis is the brightness gradation value and the vertical axis is the number of pixels, is generated, and the brightness histogram is calculated using the brightness gradation value of the peak value of the distribution of the background image of the brightness histogram. Background noise is determined by searching for a gradation value that has the number of pixels in a predetermined range of the number of pixels of the peak value, which is the number of pixels of the peak value, from the side of a certain background peak gradation value to the side of the gradation value where the density becomes darker. a background noise determination threshold setting unit that sets a threshold for
a background noise reduction processing unit that reduces background noise by setting all the gradation values of pixels having gradation values on the side where the density is lighter than the background noise determination threshold as the background noise determination threshold;
Equipped with
In the image processing apparatus , the low-pass filter processing is smoothing processing .

The image processing device according to claim 3,
The filter processing unit is an image processing device that performs the low-pass filter processing using a Gaussian filter.

The image processing device according to claim 3,
The filter processing unit is an image processing device that performs the low-pass filter processing using a Gaussian difference filter.

a filter processing step of performing low-pass filter processing as preprocessing on input image data before setting a threshold for binarization processing;
a threshold setting step of setting a page threshold and a local threshold, which are set using a discriminant analysis method, as thresholds for binarization processing, based on the input image data on which the low-pass filter processing has been performed;
A plurality of pixels included in the input image data subjected to the low-pass filter processing are divided into local regions composed of (M×N) pixels (M and N are integers of 3 or more), and the divided plurality of pixels are a step of selecting a local region of interest as a local region of interest;
Obtain the maximum gradation value, which is the gradation value on the lowest density side among the gradation values of the plurality of pixels constituting the local area of interest, and switch so that the smaller the maximum gradation value is, the larger the gradation value becomes. a switching threshold setting step of setting a threshold for
The local contrast, which is the difference between the maximum gradation value and the minimum gradation value among the gradation values of a plurality of pixels included in the local area of interest, is compared with the switching threshold, and the local contrast is determined by comparing the switching threshold with the local contrast. is larger than the switching threshold, a value obtained by multiplying the local contrast by a predetermined coefficient in the range between the maximum gradation value and the minimum gradation value is added to the minimum gradation value. Binarization processing that executes binarization processing using the set local threshold, and executes binarization processing using the page threshold when the local contrast is less than or equal to the switching threshold. process and
A first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, are determined by the discriminant analysis method on the input image data that has been subjected to the low-pass filter processing . a histogram analysis step of calculating a second average gradation value that is an average gradation value of the second class;
an operation display step of accepting user input for specifying a selection area ;
a character recognition processing step of recognizing characters included in the input image data in the selected area, calculating a recognition accuracy that is the certainty of the recognition, and calculating an integrated value of the recognition accuracy of each character;
Equipped with
In the binarization processing step, a threshold is changed between the first average gradation value and the second average gradation value, and the binarization processing is performed using the changed threshold . ,
The character recognition processing step calculates the recognition accuracy for each of the changed thresholds, and sets a highest accuracy threshold that is the threshold with the highest cumulative value of recognition accuracy for each input character,
The binarization processing step executes binarization processing of the input image data within the selected area using the highest accuracy threshold,
The operation display step is an image processing method of displaying the image after the binarization processing.

a filter processing unit that performs low-pass filter processing as preprocessing on input image data before setting a threshold for binarization processing;
A threshold setting unit that sets a page threshold and a local threshold, which are set using a discriminant analysis method, as thresholds for binarization processing, based on the input image data on which the low-pass filter processing has been performed;
A plurality of pixels included in the input image data subjected to the low-pass filter processing are divided into local regions composed of (M×N) pixels (M and N are integers of 3 or more), and the divided plurality of pixels are a local region of interest selection unit that sequentially selects the local regions of as the local region of interest;
Obtain the maximum gradation value, which is the gradation value on the lowest density side among the gradation values of the plurality of pixels constituting the local area of interest, and switch so that the smaller the maximum gradation value is, the larger the gradation value becomes. a switching threshold setting unit that sets a threshold for
The local contrast, which is the difference between the maximum gradation value and the minimum gradation value among the gradation values of a plurality of pixels included in the local area of interest, is compared with the switching threshold, and the local contrast is determined by comparing the switching threshold with the local contrast. is larger than the switching threshold, a value obtained by multiplying the local contrast by a predetermined coefficient in the range between the maximum gradation value and the minimum gradation value is added to the minimum gradation value. Binarization processing that executes binarization processing using the set local threshold, and executes binarization processing using the page threshold when the local contrast is less than or equal to the switching threshold. Department,
A first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, and a first average gradation value, which is the average gradation value of the first class, are determined by the discriminant analysis method on the input image data that has been subjected to the low-pass filter processing . a histogram analysis unit that calculates a second average gradation value that is an average gradation value of the second class;
an operation display unit that accepts user input for specifying a selection area ; and an operation display unit that recognizes characters included in the input image data within the selection area, calculates recognition accuracy that is the certainty of the recognition, and calculates the recognition accuracy of each character. causing the image processing device to function as a character recognition processing unit that calculates the cumulative value of the recognition accuracy;
The binarization processing unit executes binarization processing using the changed threshold while changing the threshold between the first average gradation value and the second average gradation value . ,
The character recognition processing unit calculates the recognition accuracy for each of the changed thresholds, and sets a highest accuracy threshold that is a threshold with the highest cumulative value of recognition accuracy for each input character,
The binarization processing unit executes binarization processing of the input image data within the selected area using the highest accuracy threshold,
The operation display section is an image processing program that displays the image after the binarization process.