JP2001291056A

JP2001291056A - Document picture recognizing device and recording medium

Info

Publication number: JP2001291056A
Application number: JP2000102968A
Authority: JP
Inventors: Masaki Hamaguchi; 昌己濱口; Katsuto Fujimoto; 克仁藤本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2000-04-05
Filing date: 2000-04-05
Publication date: 2001-10-19
Anticipated expiration: 2020-04-05
Also published as: JP4409713B2

Abstract

PROBLEM TO BE SOLVED: To properly and quickly binarize a character picture and the other picture area whose characteristics are different from an inputted multi-level picture. SOLUTION: In this document picture recognizing device equipped with a picture binarizing means 10a for preparing a binary picture from an input multi- level picture 1 and a picture recognizing means 14a for recognizing the prepared binary picture, the picture binarizing means 10a separates an input multi-level picture 1 into a character picture area and a background picture area, and binarizes each separated character picture area, and decides the binarization threshold of the background picture area from the binarization threshold at the time of the binarization processing, and binarizes the background picture area.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、スキャナやデジタ
ルカメラ等で取り込んだ多値画像から、文字を認識する
文書画像認識装置及び記録媒体に関する。[0001] 1. Field of the Invention [0002] The present invention relates to a document image recognition apparatus and a recording medium for recognizing characters from a multi-valued image captured by a scanner, a digital camera, or the like.

【０００２】[0002]

【従来の技術】画像の特徴を解析するとき、多値画像か
ら対象図形と背景を分離した２値画像を作成して取り扱
うことが多い。画像の２値化は、多値画像の階調値がし
きい値以下のときは黒とし、しきい値を超えるときは白
とするしきい値処理によって行われる。通常、２値画像
の黒の部分は対象図形を、白の部分は背景を表してい
る。2. Description of the Related Art When analyzing features of an image, a binary image in which a target graphic and a background are separated from a multi-valued image is often created and handled. The binarization of the image is performed by a threshold value process in which when the gradation value of the multi-valued image is equal to or smaller than the threshold value, the image is set to black, and when it exceeds the threshold value, it is set to white. Usually, the black part of the binary image represents the target figure, and the white part represents the background.

【０００３】しきい値を決める手法には、与えられた多
値画像の階調ヒストグラムを求めた結果、２つのピーク
をもつ分布になる場合、この２つの山の間の谷をしきい
値とするモード法や、階調ヒストグラムにおいて、階調
値の集合をしきい値ｔで２つのクラス（ｔ以上とｔ未
満）に分割したと仮定したとき、２つのクラス間の分離
（分散値）が最も良くなるようにパラメータ（しきい値
ｔ）を決める（文献：電子情報通信学会論文誌８０／４
Ｖｏｌ．Ｊ６３−ＤＮｏ．４，ｐ．３４９−３５６参
照）、判別分析における判別基準を用いた２値化処理な
どがある。[0003] As a method of determining a threshold value, when a distribution having two peaks is obtained as a result of obtaining a gradation histogram of a given multi-valued image, a valley between these two peaks is defined as a threshold value. Assuming that a set of tone values is divided into two classes (more than t and less than t) by a threshold value t in a mode method or a tone histogram, the separation (variance value) between the two classes is Determine the parameter (threshold value t) to be the best (literature: IEICE Transactions on Electronics 80/4
Vol. J63-D No. 4, p. 349-356), and binarization processing using a discriminant criterion in discriminant analysis.

【０００４】文書画像認識装置では、このような２値化
処理によって作成された２値画像を認識用画像として扱
い、文字の抽出（コード化）が行われていた。In a document image recognition apparatus, a binary image created by such a binarization process is treated as a recognition image, and characters are extracted (coded).

【０００５】[0005]

【発明が解決しようとする課題】前記従来のものには、
次のような課題があった。SUMMARY OF THE INVENTION
There were the following issues.

【０００６】従来の２値化処理では、抽出対象である文
字画像の背景に色が付いている場合、読み取られた文字
画像と背景色の濃度が近くなるため、適切なしきい値を
求めることが困難となり、作成された２値画像において
背景領域の一部が文字画像と共に黒画素として表される
ことがあった。このような２値画像を用いて文字認識を
行うと認識精度が低くなるといった問題が発生してい
た。また、このような２値画像は、見やすさといった点
から、表示用の画像としては適さないといった問題があ
った。In the conventional binarization process, if the background of a character image to be extracted has a color, the density of the read character image and the background color are close to each other. It became difficult, and a part of the background area was sometimes represented as a black pixel together with the character image in the created binary image. When character recognition is performed using such a binary image, there has been a problem that recognition accuracy is reduced. In addition, such a binary image has a problem that it is not suitable as an image for display from the point of viewability.

【０００７】本発明は、このような従来の課題を解決
し、入力された多値画像から文字画像とそれ以外の性質
の異なる画像領域を適切に、かつ高速に２値化すること
を目的とする。An object of the present invention is to solve such a conventional problem and appropriately and rapidly binarize a character image and other image regions having different properties from an input multi-valued image. I do.

【０００８】[0008]

【課題を解決するための手段】図１は本発明の原理説明
図である。図１中、１は入力多値画像、１０ａは画像２
値化手段、１４ａは画像認識手段である。FIG. 1 is a diagram illustrating the principle of the present invention. In FIG. 1, 1 is an input multi-value image, 10a is an image 2
The value conversion means 14a is an image recognition means.

【０００９】本発明は前記従来の課題を解決するため次
のように構成した。The present invention is configured as follows in order to solve the above-mentioned conventional problems.

【００１０】（１）：入力多値画像１から２値画像を作
成する画像２値化手段１０ａと、該作成した２値画像の
認識を行う画像認識手段１４ａとを備えた文書画像認識
装置において、前記画像２値化手段１０ａは、前記入力
多値画像１を文字画像領域と背景画像領域とに分離し、
該分離した各文字画像領域に対して２値化処理を行い、
該２値化処理時の２値化しきい値から前記背景画像領域
の２値化しきい値を決定して前記背景画像領域の２値化
処理を行う。(1): A document image recognition apparatus provided with an image binarizing means 10a for creating a binary image from an input multi-valued image 1 and an image recognizing means 14a for recognizing the created binary image. The image binarizing means 10a separates the input multi-valued image 1 into a character image area and a background image area,
Performing a binarization process on each of the separated character image regions;
The binarization threshold of the background image area is determined from the binarization threshold at the time of the binarization processing, and the binarization processing of the background image area is performed.

【００１１】（２）：前記（１）の文書画像認識装置に
おいて、前記画像２値化手段１０ａは、前記入力多値画
像１に含まれる低階調の代表値を選んで２値化処理を行
い、該作成された２値画像から文字画像を多く含む文字
画像領域とそれ以外の背景画像領域とに分離する。(2): In the document image recognition device of (1), the image binarizing means 10a selects a low gradation representative value included in the input multi-valued image 1 and performs a binarizing process. Then, the generated binary image is separated into a character image region including many character images and a background image region other than the character image region.

【００１２】（３）：前記（１）の文書画像認識装置に
おいて、前記画像２値化手段１０ａは、前記分離した文
字画像領域内が全て文字画像の単一階調である場合、該
単一階調の文字画像領域に背景画像が含まれるように該
文字画像領域を変更して、２値化処理を行う。(3) In the document image recognition apparatus of (1), the image binarizing means 10a is configured to determine whether the separated character image area has a single gradation when the entire character image area has a single gradation. The character image area is changed so that the background image is included in the gradation character image area, and the binarization process is performed.

【００１３】（４）：前記（１）の文書画像認識装置に
おいて、前記画像２値化手段１０ａは、前記分離した各
文字画像領域に対して行った２値化処理時の各２値化し
きい値を求め、前記２値化処理された領域を含むように
拡大した矩形内の領域を前記求めた２値化しきい値で２
値化処理する。(4) In the document image recognition apparatus of (1), the image binarizing means 10a performs each binarization threshold in the binarization processing performed on each of the separated character image areas. The area within the rectangle enlarged to include the binarized area is calculated by the obtained binarization threshold value.
Perform value processing.

【００１４】（５）：入力多値画像を文字画像領域と背
景画像領域とに分離し、該分離した各文字画像領域に対
して２値化処理を行い、該２値化処理時の２値化しきい
値から前記背景画像領域の２値化しきい値を決定して前
記背景画像領域の２値化処理を行う画像２値化手段１０
ａと、該２値化処理で作成した２値画像の認識を行う画
像認識手段１４ａと、してコンピュータを機能させるた
めのプログラムを記録したコンピュータ読み取り可能な
記録媒体とする。(5): The input multi-valued image is separated into a character image region and a background image region, and a binarization process is performed on each of the separated character image regions. Image binarizing means 10 for determining a binarization threshold value of the background image region from a binarization threshold value and performing binarization processing of the background image region
a, and an image recognizing unit 14a for recognizing the binary image created by the binarization process, and a computer-readable recording medium storing a program for causing a computer to function.

【００１５】（作用）前記構成に基づく作用を説明す
る。(Operation) The operation based on the above configuration will be described.

【００１６】画像２値化手段１０ａで入力多値画像１か
ら２値画像を作成し、画像認識手段１４ａで作成した２
値画像の認識を行う文書画像認識装置において、前記画
像２値化手段１０ａで、前記入力多値画像１を文字画像
領域と背景画像領域とに分離し、該分離した各文字画像
領域に対して２値化処理を行い、該２値化処理時の２値
化しきい値から前記背景画像領域の２値化しきい値を決
定して前記背景画像領域の２値化処理を行う。このた
め、文字画像の背景に色が付いている画像であっても高
精度に２値化でき、かつ一定しきい値で２値化処理を行
うため高速に２値化することができる。A binary image is created from the input multivalued image 1 by the image binarizing means 10a, and the binary image created by the image recognizing means 14a.
In a document image recognition apparatus for recognizing a value image, the input multi-valued image 1 is separated into a character image region and a background image region by the image binarizing means 10a. A binarization process is performed, and a binarization threshold of the background image region is determined from the binarization threshold value at the time of the binarization process, and the binarization process of the background image region is performed. For this reason, even if the image of the character image has a color on the background, the image can be binarized with high accuracy, and the binarization can be performed at a high speed because the binarization process is performed with a fixed threshold value.

【００１７】また、前記画像２値化手段１０ａで、前記
入力多値画像１に含まれる低階調の代表値を選んで２値
化処理を行い、該作成された２値画像から文字画像を多
く含む文字画像領域とそれ以外の背景画像領域とに分離
する。このため、低階調である黒い部分のみ２値化し
て、容易に文字画像領域を分離することができる。Further, the image binarizing means 10a selects a low gradation representative value included in the input multi-valued image 1 and performs a binarizing process, and converts a character image from the created binary image. It is separated into a character image area including many and a background image area other than that. For this reason, it is possible to binarize only a black portion having a low gradation and easily separate a character image region.

【００１８】さらに、前記画像２値化手段１０ａで、前
記分離した文字画像領域内が全て文字画像の単一階調で
ある場合、該単一階調の文字画像領域に背景画像が含ま
れるように該文字画像領域を変更して、２値化処理を行
う。このため、文字画像領域内が全て文字画像の場合で
も、文字画像が細く出力されることを防止し、適切な２
値化を行うことができる。Further, in the image binarizing means 10a, when the separated character image area has a single gradation of the character image, the background image is included in the single gradation character image area. Then, the character image area is changed and the binarization process is performed. For this reason, even when the entire character image area is a character image, it is possible to prevent the character image from being output thinly,
Value conversion can be performed.

【００１９】また、前記画像２値化手段１０ａで、前記
分離した各文字画像領域に対して行った２値化処理時の
各２値化しきい値を求め、前記２値化処理された領域を
含むように拡大した矩形内の領域を前記求めた２値化し
きい値で２値化処理する。このため、文字画像領域とし
て、抽出されなかった文字画像部分を適切に２値化処理
することができる。The image binarizing means 10a obtains each binarization threshold value at the time of the binarization process performed on each of the separated character image regions, and determines the binarized region. The area within the rectangle enlarged so as to be included is subjected to binarization processing using the obtained binarization threshold. For this reason, a character image portion that has not been extracted can be appropriately binarized as a character image region.

【００２０】さらに、入力多値画像を文字画像領域と背
景画像領域とに分離し、該分離した各文字画像領域に対
して２値化処理を行い、該２値化処理時の２値化しきい
値から前記背景画像領域の２値化しきい値を決定して前
記背景画像領域の２値化処理を行う画像２値化手段１０
ａと、該２値化処理で作成した２値画像の認識を行う画
像認識手段１４ａと、してコンピュータを機能させるた
めのプログラムを記録したコンピュータ読み取り可能な
記録媒体とする。このため、この記録媒体のプログラム
をコンピュータにインストールすることで、文字画像の
背景に色が付いている画像であっても高精度に２値化で
き、かつ高速に２値化することができる文書画像認識装
置を容易に提供することができる。Further, the input multi-valued image is separated into a character image region and a background image region, and each of the separated character image regions is subjected to a binarization process, and a binarization threshold in the binarization process. Image binarization means 10 for determining a binarization threshold value of the background image area from the value and performing a binarization process of the background image area
a, and an image recognizing unit 14a for recognizing the binary image created by the binarization process, and a computer-readable recording medium storing a program for causing a computer to function. Therefore, by installing the program on the recording medium into a computer, a document that can be binarized with high precision and at high speed even if the image has a character image with a colored background. An image recognition device can be easily provided.

【００２１】[0021]

【発明の実施の形態】本発明の文書画像認識装置は、入
力多値画像から文字画像を多く含む画像領域とそれ以外
の画像領域に分離し、該分離された２種類の画像領域に
対して異なる画像処理を行い、高速に２値画像を作成す
る２値化処理手段を備えるものである。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A document image recognition apparatus according to the present invention separates an input multi-valued image into an image area containing many character images and an image area other than the image area. It is provided with a binarization processing means for performing different image processing and creating a binary image at high speed.

【００２２】また、入力多値画像の文字画像に含まれる
低階調（黒）の代表値を選んで２値化処理を行い、該作
成された２値画像から文字画像に含まれる画素を抽出
し、文字画像を多く含む画像領域とそれ以外の背景画像
領域を分離する２値画像領域分離手段を備えるものであ
る。Also, a low gradation (black) representative value included in the character image of the input multi-valued image is selected and binarized, and pixels included in the character image are extracted from the generated binary image. And a binary image area separating means for separating an image area containing many character images from other background image areas.

【００２３】さらに、前記２値画像領域分離手段により
求められた文字画像を多く含む画像領域に対して個々に
判別分析における判別基準を用いた２値化処理を行い、
２値画像を作成する文字画像領域２値化手段と、該文字
画像領域２値化手段で求められた文字画像領域の分散値
から、その文字画像領域内が単一階調であることを判断
する単一階調画像領域判別手段と、該単一階調画像領域
判別手段により単一階調と判断された文字画像領域に、
背景画像が含まれるように文字画像領域を膨張し、再度
判別分析における判別基準を用いた２値化処理を行う単
一階調画像領域２値化手段とを備えるものである。Further, a binarization process using a discrimination criterion in discriminant analysis is individually performed on an image region containing a large number of character images obtained by the binary image region separating means,
From the character image area binarizing means for creating a binary image and the variance of the character image area obtained by the character image area binarizing means, it is determined that the inside of the character image area has a single gradation. A single tone image area determining means, and a character image area determined to be a single tone by the single tone image area determining means,
A single-tone image area binarizing means for expanding a character image area so as to include a background image and performing binarization processing again using a criterion in the discriminant analysis.

【００２４】また、前記文字画像領域２値化手段により
求められた個々の文字画像領域のしきい値から最適なし
きい値を求め、前記文字画像領域２値化手段により２値
化処理された領域を含むｎ×ｍ矩形内の背景画像領域を
２値化することで、文字画像領域に含まれなかった文字
画像部分を２値化処理する背景画像領域２値化手段を備
えるものである。Further, an optimum threshold value is obtained from the threshold values of the individual character image areas obtained by the character image area binarizing means, and the binarized area is obtained by the character image area binarizing means. The background image area is binarized by binarizing the background image area in the n × m rectangle including the character image area, thereby performing a binarization process on a character image portion not included in the character image area.

【００２５】このような手段を備えることにより、入力
された多値画像から文字画像とそれ以外の性質の異なる
画像領域とに分離された２値画像が作成されるため、従
来技術で問題になっていた背景色のある文字画像の適切
な２値化を行うことができる。By providing such means, a binary image separated from a character image and an image region having other properties from an input multi-valued image is created, which is a problem in the prior art. Suitable binarization of a character image having a background color can be performed.

【００２６】以下、図面を参照して本発明の実施の形態
を説明する。Hereinafter, embodiments of the present invention will be described with reference to the drawings.

【００２７】（１）：文書画像認識装置の構成の説明図２は本発明の実施の形態における文書画像認識装置の
構成の説明図である。図２において、文書画像認識装置
には、多値画像２値化部１０と２値画像認識部１４が設
けてある。また、多値画像２値化部１０には、文字画像
領域検出部１１、文字画像領域２値化部１２、背景画像
領域２値化部１３が設けてある。(1) Description of the Configuration of the Document Image Recognition Apparatus FIG. 2 is an explanatory diagram of the configuration of the document image recognition apparatus according to the embodiment of the present invention. In FIG. 2, the document image recognition apparatus includes a multi-value image binarization unit 10 and a binary image recognition unit 14. The multi-value image binarizing unit 10 includes a character image region detecting unit 11, a character image region binarizing unit 12, and a background image region binarizing unit 13.

【００２８】この文書画像認識装置は、多値画像（グレ
ースケールイメージ）を入力とし、多値画像２値化部１
０で各画素を文字画像領域と背景画像領域のいずれかを
意味する値を持つ２値画像に変換するものである。This document image recognition device receives a multi-valued image (gray scale image) as an input,
At 0, each pixel is converted into a binary image having a value meaning either the character image area or the background image area.

【００２９】文字画像領域検出部１１は、グレースケー
ルイメージ全体に代表的なしきい値で２値化処理を実行
し、文字画像を多く含む領域を決定するものである。文
字画像領域２値化部１２は、文字画像を多く含む領域に
ついて個々に２値化処理を実行するものである。背景画
像領域２値化部１３は、文字画像領域のしきい値を基に
背景画像領域の２値化処理を実行し、最終的な２値画像
を出力するものである。２値画像認識部１４は、多値画
像２値化部１０で作成した２値画像から文字を認識し、
認識結果を出力するものである。The character image area detecting section 11 executes a binarization process on the entire gray scale image with a typical threshold value to determine an area containing a large number of character images. The character image area binarizing unit 12 individually performs a binarization process on an area including many character images. The background image area binarization unit 13 performs a binarization process on the background image area based on the threshold value of the character image area, and outputs a final binary image. The binary image recognizing unit 14 recognizes a character from the binary image created by the multi-value image binarizing unit 10,
It outputs the recognition result.

【００３０】（２）：多値画像２値化部の文字画像領域
検出部の説明図３は文字画像領域検出部の処理の説明図である。図３
において、文字画像領域検出部１１の処理は、先ず一定
しきい値２値化処理Ｓ１を行い、次にラベリング処理Ｓ
２を行い、最後に有効セグメント領域抽出処理（文字画
像領域検出）Ｓ３を行うものである（２値画像領域分離
手段）。(2) Description of Character Image Area Detecting Section of Multivalued Image Binarizing Section FIG. 3 is an explanatory diagram of processing of the character image area detecting section. FIG.
In the processing of the character image area detecting section 11, first, a constant threshold value binarization processing S1 is performed, and then a labeling processing S1 is performed.
2, and finally, an effective segment area extraction process (character image area detection) S3 is performed (binary image area separation means).

【００３１】一定しきい値２値化処理Ｓ１では、入力グ
レースケールイメージに対して、一定しきい値で２値化
を行う。このときのしきい値は、文字画像の中でもより
黒い部分のみが２値化される階調を選ぶ。In the constant threshold binarization process S1, binarization is performed on the input gray scale image with a constant threshold. As the threshold value at this time, a gradation is selected at which only the darker part in the character image is binarized.

【００３２】ラベリング処理Ｓ２では、前記一定しきい
値２値化処理Ｓ１で得られた２値画像の連結成分の集合
をラベリング処理により抽出する。In the labeling process S2, a set of connected components of the binary image obtained in the constant threshold binarization process S1 is extracted by a labeling process.

【００３３】有効セグメント領域抽出処理（文字画像領
域検出）Ｓ３では、文字画像に適さないサイズの連結部
分を排除する（サイズにより罫線や１ドットのゴミ等を
除く）ことで、有効なセグメントを選び、その選んだセ
グメントの外接矩形をとり、文字画像領域とする。In the effective segment area extraction processing (character image area detection) S3, an effective segment is selected by eliminating connected portions of a size unsuitable for a character image (excluding ruled lines and 1-dot dust depending on the size). Then, a circumscribed rectangle of the selected segment is taken as a character image area.

【００３４】（具体的イメージによる説明）図４は２値
画像領域分離手段のイメージによる説明図であり、図４
（ａ）は入力グレースケールイメージである。図４
（ａ）において、文字画像は、薄い黒い部分（灰色部
分）ａ１、ｂ１とより黒い部分ａ２、ｂ２とから成り、
背景は白い部分である。FIG. 4 is an explanatory diagram based on an image of the binary image area separating means.
(A) is an input grayscale image. FIG.
In (a), the character image is composed of light black parts (gray parts) a1, b1 and darker parts a2, b2,
The background is the white part.

【００３５】図４（ｂ）は一定しきい値２値化処理Ｓ１
の処理結果である。図４（ｂ）において、一定しきい値
２値化処理Ｓ１の処理で、一定しきい値で２値化処理し
て文字画像中のより黒い部分ａ２、ｂ２のみを２値化
し、ラベリング処理Ｓ２、文字画像領域検出Ｓ３を行っ
て、より黒い部分ａ２、ｂ２の外接矩形（点線で示して
ある）を抽出する。FIG. 4B shows a constant threshold value binarization process S1.
Is the processing result. In FIG. 4B, in the process of the constant threshold value binarization process S1, only the darker portions a2 and b2 in the character image are binarized by the constant threshold value, and the labeling process S2 is performed. Then, the character image area detection S3 is performed to extract a circumscribed rectangle (shown by a dotted line) of the darker portions a2 and b2.

【００３６】図４（ｃ）は背景領域と文字画像領域に分
離したグレースケールイメージの説明である。図４
（ｃ）において、図４（ｂ）の外接矩形座標をグレース
ケールイメージ（図４（ａ）参照））に適応して、背景
領域を含んだ文字画像領域（点線内）と、他の背景領域
（点線外）とに分離する。FIG. 4C illustrates a gray scale image separated into a background area and a character image area. FIG.
In (c), the circumscribed rectangular coordinates of FIG. 4 (b) are adapted to a grayscale image (see FIG. 4 (a)), and a character image area including a background area (within a dotted line) and another background area (Outside the dotted line).

【００３７】このように外接矩形を取るのは、文字画像
中のより黒い部分以外の文字画像領域と背景画像領域を
部分２値化領域に含ませるためである。本実施の形態の
２値化処理で使用している、判別分析における判別基準
を用いた２値化処理では、抽出対象階調画像（文字画
像）と非抽出対象画像（背景画像）を含む領域にするこ
とで、最適な２値化が行われるからである。The reason why the circumscribed rectangle is obtained in this way is to include the character image area and the background image area other than the blacker part in the character image in the partial binarized area. In the binarization processing using the discriminant criterion in the discriminant analysis used in the binarization processing of the present embodiment, an area including an extraction target gradation image (character image) and a non-extraction target image (background image) By doing so, the optimal binarization is performed.

【００３８】また、文字画像領域を決める従来の方法と
して、文字画像全体のエッジを抽出する（画素間の階調
の変化量を計算する）ようなソーベルフィルタ処理が用
いられることがある。これに対し、本発明では、一定し
きい値２値化処理を用いることで計算量を減らし、より
高速な文字画像領域の検出を行っている。As a conventional method for determining a character image area, a Sobel filter process for extracting edges of the entire character image (calculating a change in gradation between pixels) may be used. On the other hand, in the present invention, the amount of calculation is reduced by using the constant threshold value binarization processing, and the character image area is detected at higher speed.

【００３９】（３）：多値画像２値化部の文字画像領域
２値化部の説明文字画像領域２値化部１２では、文字画像領域検出部１
１で決定した文字画像を多く含む領域（外接矩形の文字
画像領域）について、領域毎に判別分析における判別基
準を用いた２値化処理を行うものである。(3): Explanation of the character image area binarization section of the multi-value image binarization section In the character image area binarization section 12, the character image area detection section 1
The binarization process is performed on a region including a large number of character images determined in step 1 (character image region of a circumscribed rectangle) using a discrimination criterion in discriminant analysis for each region.

【００４０】図５は文字画像領域２値化部の説明図であ
り、図５（ａ）は文字画像領域２値化部の処理の説明で
ある。図５（ａ）において、文字画像領域２値化部１２
の処理は、先ず文字画像領域の２値化しきい値・分散値
算出処理Ｓ１１を行い、次に文字画像領域に背景が含ま
れているかを領域内分散値により判定Ｓ１２を行う。こ
の判定で、文字画像領域に背景が含まれている場合は文
字画像領域内の２値化処理Ｓ１３を行い、もし背景が含
まれていない場合は文字画像領域膨張処理Ｓ１４を行い
再度文字画像領域の２値化しきい値・分散値算出処理Ｓ
１１に戻る。FIG. 5 is an explanatory diagram of the character image area binarizing section, and FIG. 5A is an explanatory diagram of the processing of the character image area binarizing section. In FIG. 5A, the character image area binarization unit 12
First, the binarization threshold value / variance value calculation processing S11 of the character image area is performed, and then the determination S12 is performed based on the variance value in the area to determine whether the background is included in the character image area. In this determination, if the background is included in the character image area, the binarization processing S13 in the character image area is performed. If the background is not included, the character image area expansion processing S14 is performed and the character image area is re-executed. Thresholding / variance calculating process S
Return to 11.

【００４１】・この文字画像領域の２値化しきい値・分
散値算出処理Ｓ１１では、先ず各文字画像領域につい
て、判別分析における判別基準を用いた２値化しきい値
の算出とクラス内分散値の算出処理が行われる。In the character image area binarization threshold value / variance calculation processing S11, first, for each character image area, the binarization threshold value is calculated using the criterion in the discriminant analysis, and the in-class variance value is calculated. Calculation processing is performed.

【００４２】（２値化しきい値とクラス内分散値の算出
方法の説明）以下、２値化しきい値とクラス内分散値の
算出方法を説明する。図５（ｂ）は階調ヒストグラムの
説明である。図５（ｂ）において、横軸は階調、縦軸は
画素数を示している。なお、階調は左から右方向に黒か
ら白に変化するものである。(Description of Calculation Method of Binarization Threshold and Intra-Class Variance Value) Hereinafter, a method of calculating the binarization threshold value and the in-class variance value will be described. FIG. 5B is an explanation of the gradation histogram. In FIG. 5B, the horizontal axis represents the gradation, and the vertical axis represents the number of pixels. Note that the gradation changes from black to white from left to right.

【００４３】与えられた領域のしきい値をｔとして、ｔ
以上の階調を持つ画素と、それより小さな値を持つ画素
の２つのグループに分ける。この２つのグループをクラ
ス１、クラス２とする。クラス１の画素数をω₁(t)、ク
ラス１の平均階調をＭ₁(t)、クラス２の画素数をω
₂(t)、クラス２の平均階調をＭ₂(t)とおき、全画素の平
均階調をＭ_Tとおくと、クラス間分散σＢ²は次の式で
与えられる。Assuming that the threshold value of a given region is t, t
Pixels having the above-mentioned gradation and pixels having smaller values are divided into two groups. These two groups are referred to as class 1 and class 2. The number of pixels of class 1 is ω ₁ (t), the average gradation of class 1 is M ₁ (t), and the number of pixels of class 2 is ω
₂ (t), the average gradation of class 2 is M ₂ (t), and the average gradation of all pixels is M _T , the inter-class variance σB ² is given by the following equation.

【００４４】σＢ²＝ω₁(Ｍ₁ −Ｍ_T)²＋ω₂(Ｍ₂ −Ｍ
_T)²＝ω₁ ω₂(Ｍ₁−Ｍ₂)² ここで、ｔを変化させてクラス間分散σＢ²を最大にす
るｔの値を求め、その領域内の２値化しきい値とする。ΣB ² = ω ₁ (M ₁ −M _T ) ² + ω ₂ (M ₂ −M
_T ) ² = ω ₁ ω ₂ (M ₁ −M ₂ ) ² Here, the value of t that maximizes the inter-class variance σB ² by changing t is determined, and is set as a binarization threshold value in the region. .

【００４５】（文字画像領域に背景が含まれているかの
判定の説明）・次に文字画像領域に背景が含まれているかを領域内分
散値による判定Ｓ１２を行う（単一階調画像領域判別手
段）。これは、文字画像領域内がすべて文字画像で背景
画像を含まない場合、上記の判別分析における判別基準
を用いた２値化しきい値の算出方法では、微小な階調差
を感知してしきい値を求めてしまうため、そのしきい値
で正しく２値化できずに文字画像が細く出力されるとい
った現象が起きる。(Explanation of Judgment of Whether Background is Included in Character Image Area) Next, judgment S12 is performed based on the variance value within the area to determine whether the background is included in the character image area (single gradation image area determination). means). This is because when the entire character image area does not include a background image as a character image, the above-described method of calculating the binarization threshold using the criterion in the discriminant analysis senses a small gradation difference. Since the value is obtained, a phenomenon occurs in which the character image cannot be correctly binarized at the threshold value and the character image is output thinly.

【００４６】図６は判別分析における判別基準を用いた
２値化処理例の説明図であり、図６（ａ）は領域内がす
べて文字画像で背景画像を含まない例の説明である。図
６（ａ）において、文字画像領域内がすべて文字画像
で、ａ１の階調の画素とａ２の階調の画素よりなり、文
字画像領域内の画素がすべて２値化対象の階調となって
いる。この場合、その領域内で２値化しきい値を求めて
しまうため、図６（ａ）の右図のように文字画像が細く
出力される（ａ２の階調の画素のみ出力される）ことに
なる。このため、判別分析における判別基準を用いた２
値化処理では正しく２値化できないことになる。FIG. 6 is a diagram illustrating an example of a binarization process using a discriminant criterion in discriminant analysis. FIG. 6A is a diagram illustrating an example in which the entire area is a character image and does not include a background image. In FIG. 6A, the entire character image area is a character image, and is composed of pixels of a1 gradation and pixels of a2 gradation, and all the pixels in the character image area are binarization target gradations. ing. In this case, since the binarization threshold value is obtained in the area, the character image is output thinly (only the pixels of the gradation of a2 are output) as shown in the right diagram of FIG. Become. For this reason, 2 using the discriminant criterion in discriminant analysis
In the binarization processing, binarization cannot be correctly performed.

【００４７】・文字画像領域内がすべて文字画像で背景
画像を含まない場合、このような領域のクラス間分散値
は、背景を含む領域に比べ小さい値を取ることから判断
できる。このため、クラス間分散しきい値を定め、その
しきい値以下の領域に対しては、周囲の背景を取り込む
ように領域膨張を行い（文字画像領域膨張処理Ｓ１
４）、該膨張した新たな領域について２値化しきい値と
クラス間分散値の算出を再度行う。When the entire character image area is a character image and does not include a background image, it can be determined from the fact that the inter-class variance of such an area takes a smaller value than that of the area including the background. For this reason, the inter-class variance threshold value is determined, and for an area equal to or less than the threshold value, area expansion is performed to capture the surrounding background (character image area expansion processing S1).
4) The binarization threshold and the inter-class variance are calculated again for the expanded new area.

【００４８】図６（ｂ）は単一階調画像領域２値化手段
の説明である。図６（ｂ）において、図６（ｂ）の左の
ように、文字画像領域内のすべての画素（ａ１の階調の
画素とａ２の階調の画素）が２値化対象の階調となって
いる場合、図６（ｂ）の中央のように、領域を広げて
（膨張して）背景画像（白い部分）を取り込む。次に、
広げた領域について、再度、判別分析における判別基準
を用いた２値化しきい値とクラス間分散値の算出を行
う。FIG. 6B is an illustration of the single-tone image area binarizing means. In FIG. 6B, as shown on the left side of FIG. 6B, all the pixels in the character image area (the pixel of the gray scale of a1 and the pixel of the gray scale of a2) correspond to the gray scale to be binarized. If so, the background image (white portion) is captured by expanding (expanding) the area as shown in the center of FIG. 6B. next,
For the expanded area, the binarization threshold and the inter-class variance are calculated again using the criterion in the discriminant analysis.

【００４９】・クラス間分散がクラス間分散しきい値よ
り大きく、文字画像領域内に背景画像が含まれると判断
した領域については、求めた２値化しきい値により、２
値画像を作成する（文字画像領域２値化手段）。For an area where the inter-class variance is larger than the inter-class variance threshold and the background image is determined to be included in the character image area, the obtained binarization threshold
A value image is created (character image area binarization means).

【００５０】図６（ｃ）は領域内に文字画像と背景画像
が含まれると判断した領域の例の説明である。図６
（ｃ）において、左図のように文字画像領域内に背景
（白い部分）が含まれている場合は、判別分析における
判別基準を用いた２値化処理で適切なしきい値が求ま
り、右図のように適切な２値化処理が行われる。FIG. 6C is an explanation of an example of an area determined to include a character image and a background image in the area. FIG.
In (c), if the background (white portion) is included in the character image area as shown in the left diagram, an appropriate threshold value is obtained by binarization processing using the discrimination standard in discriminant analysis. An appropriate binarization process is performed as shown in FIG.

【００５１】以上の処理を文字画像領域検出部１１で決
定した全ての文字画像領域に対して行うことで、文字画
像の２値化が行われる。By performing the above processing on all the character image areas determined by the character image area detection unit 11, the character image is binarized.

【００５２】（４）：背景画像領域２値化部の説明前記（３）では文字画像領域検出部１１で検出した領域
について２値化処理を行った。しかし、この領域は、文
字画像の中でもより黒い部分の外接矩形であるため、こ
の領域外にも文字画像が含まれている可能性がある。し
たがって、背景画像領域２値化部１３では、背景画像領
域について２値化を行い、この領域に含まれる文字画像
の抽出を行う。(4) Description of Background Image Area Binarization Unit In (3), the area detected by the character image area detection unit 11 was subjected to binarization processing. However, since this area is a circumscribed rectangle of a darker portion in the character image, the character image may be included outside this area. Therefore, the background image area binarization unit 13 binarizes the background image area and extracts a character image included in this area.

【００５３】図７は背景画像領域２値化部の処理の説明
図である。図７において、先ず、文字画像領域２値化し
きい値を用いて背景画像領域の２値化しきい値を算出す
る（Ｓ２１）。次に、該算出した２値化しきい値を用い
て背景画像領域の２値化処理を行う（Ｓ２２）。FIG. 7 is an explanatory diagram of the processing of the background image area binarization unit. In FIG. 7, first, the binarization threshold value of the background image area is calculated using the binarization threshold value of the character image area (S21). Next, a binarization process is performed on the background image area using the calculated binarization threshold (S22).

【００５４】（背景画像領域の２値化例の説明）ａ：文字画像領域２値化しきい値から代表例を選ぶ方法
（第１の方法）第１の方法として、文字画像領域検出部１１で求めた各
領域のしきい値（記憶装置等に記憶しておく）から代表
値を選び、背景画像領域全面に適応する方法である。例
えば、各領域のしきい値から最も低い（黒い）階調を背
景画像領域のしきい値とする。このしきい値が文字画像
領域検出部１１で用いた一定しきい値の階調より高い
（白い）場合、本来掠れるはずであった背景画像領域中
の文字画像が２値化されるため、より文字らしくなる。
なお、しきい値の選び方として、各領域のしきい値の平
均、又は、最も高い（白い）階調を用いることもでき
る。(Description of Binarization Example of Background Image Area) a: Method of Selecting a Representative Example from Character Image Area Binarization Threshold (First Method) As a first method, the character image area detection unit 11 In this method, a representative value is selected from the obtained threshold values (stored in a storage device or the like) of each area, and is applied to the entire background image area. For example, the lowest (black) gradation from the threshold value of each area is set as the threshold value of the background image area. If this threshold value is higher (white) than the fixed threshold gradation used in the character image area detection unit 11, the character image in the background image area that should have been blurred is binarized. It becomes more character-like.
As a method of selecting the threshold value, an average of the threshold values of the respective regions or the highest (white) gradation can be used.

【００５５】図８は背景画像領域２値化の例１の説明図
である。図８において、先ず、上図ののように、２値
画像領域分離手段を用いて、文字画像領域（点線内の領
域１、領域２）を求める。次に、中図ののように、各
文字画像領域内を文字画像領域２値化手段、及び単一階
調画像領域２値化手段を用いて２値化する。この時、領
域１内の２値化しきい値をｔ₁、領域２内の２値化しき
い値をｔ₂とする（ｔ ₁＞ｔ₂）。最後に、下図のの
ように、文字画像領域内の２値化しきい値のうち最も階
調が低いｔ₂で背景画像領域の２値化を行い、文字画像
部分の抽出を行う。FIG. 8 is an explanatory view of Example 1 of the binarization of the background image area.
It is. In FIG. 8, first, as shown in the above figure,
The character image area (the area within the dotted line) is
The area 1 and the area 2) are obtained. Next, as shown in the middle figure,
A character image area binarizing means in the character image area, and a single floor
Binarization is performed using a tonal image area binarization unit. At this time,
Let the binarization threshold in region 1 be t₁, Binarize in area 2
T_Two(T ₁> T_Two). Finally, in the figure below
As described above, of the binarization thresholds in the character image area,
Low tone_TwoPerforms binarization of the background image area with
Extract the part.

【００５６】ｂ：文字画像領域２値化部で求めた各領域
のしきい値をその領域の周囲の背景画像領域に適応させ
る方法（第２の方法）第２の方法として、文字画像領域２値化部で求めた各領
域のしきい値をその領域の周囲の背景画像領域に適応さ
せる方法である。この方法では、狭い領域で求めた２値
化しきい値がその周囲にのみ用いられるため、１文字毎
の２値化がより適切に行えるようになる。B: Method of adapting the threshold value of each area obtained by the character image area binarization unit to the background image area surrounding the area (second method) As a second method, the character image area 2 This is a method in which the threshold value of each area obtained by the value conversion unit is adapted to a background image area around the area. In this method, since the binarization threshold value obtained in a narrow region is used only around the narrow region, binarization for each character can be performed more appropriately.

【００５７】図９は背景画像領域２値化の例２の説明図
である。図９において、先ず、上図ののように、２値
画像領域分離手段を用いて、文字画像領域（点線内の領
域１、領域２）を求める。次に、中図ののように、各
文字画像領域内を文字画像領域２値化手段、及び単一階
調画像領域２値化手段を用いて２値化する。この時、領
域１内の２値化しきい値をｔ₁、領域２内の２値化しき
い値をｔ₂とする（ｔ ₁＞ｔ₂）。最後に、下図のの
ように、各文字画像領域を含むｎ×ｍ領域（一点鎖線
内）を各文字画像領域内の２値化しきい値で２値化す
る。即ち、領域１を含むｎ×ｍ領域は２値化しきい値ｔ
₁で２値化し、領域２を含むｎ×ｍ領域は２値化しきい
値ｔ₂で２値化する。これにより、背景画像領域内に残
った文字画像部分を２値化する。なお、領域ｎ×ｍの範
囲は、文字となると予測できる範囲まで拡大するもので
ある。また、２値化の結果は黒で示してある。FIG. 9 is an explanatory diagram of Example 2 of the binarization of the background image area.
It is. In FIG. 9, first, as shown in the above figure,
The character image area (the area within the dotted line) is
The area 1 and the area 2) are obtained. Next, as shown in the middle figure,
A character image area binarizing means in the character image area, and a single floor
Binarization is performed using a tonal image area binarization unit. At this time,
Let the binarization threshold in region 1 be t₁, Binarize in area 2
T_Two(T ₁> T_Two). Finally, in the figure below
As described above, an nxm area including each character image area (dashed line
Is binarized by the binarization threshold in each character image area.
You. That is, the n × m region including the region 1 is a binarization threshold value t.
₁And the n × m region including the region 2 is binarized by
Value t_TwoTo binarize. As a result, the image remains in the background image area.
The digitized character image portion is binarized. Note that the range of the area n × m
The box expands to a range that can be predicted to be characters.
is there. The result of binarization is shown in black.

【００５８】以上、実施の形態で説明したように、入力
された多値画像から文字画像とそれ以外の性質の異なる
画像領域（背景画像領域）とに分離し、先ず、分離した
各文字画像領域に対して２値化処理を行い、該２値化処
理時の２値化しきい値から背景画像領域の２値化しきい
値を決定して背景画像領域の２値化処理を行い、２値画
像を作成する。そのため、従来できなかった文字画像の
背景に色が付いているような画像であっても、高精度か
つ高速に２値画像が作成できる。また、この２値画像を
用いることで認識精度を向上することができる。As described in the above embodiment, the input multi-valued image is separated into a character image and an image region having other properties (background image region). , A binary threshold value of the background image area is determined from the binary threshold value at the time of the binary processing, and the binary image processing is performed on the background image area. Create For this reason, a binary image can be created with high accuracy and high speed even for an image in which the background of a character image, which has not been conventionally possible, has a colored background. The recognition accuracy can be improved by using the binary image.

【００５９】（５）：プログラムのインストールの説明画像２値化手段１０ａ、画像認識手段１４ａ、多値画像
２値化部１０、文字画像領域検出部１１、文字画像領域
２値化部１２、背景画像領域２値化部１３、２値画像認
識部１４等は、プログラムで構成でき、主制御部（ＣＰ
Ｕ）が実行するものであり、主記憶に格納されているも
のである。このプログラムは、一般的な、コンピュータ
で処理されるものである。このコンピュータは、主制御
部、主記憶、ファイル装置、表示装置、キーボード等の
入力手段である入力装置などのハードウェアで構成され
ている。(5): Description of installation of program Image binarizing means 10a, image recognizing means 14a, multi-value image binarizing section 10, character image area detecting section 11, character image area binarizing section 12, background The image area binarizing unit 13, the binary image recognizing unit 14, and the like can be configured by a program, and can be configured by a main control unit (CP
U) is executed and stored in the main memory. This program is generally processed by a computer. This computer is configured by hardware such as a main control unit, a main memory, a file device, a display device, and an input device such as a keyboard.

【００６０】このコンピュータに、本発明のプログラム
をインストールする。このインストールは、フロッピ
ィ、光磁気ディスク等の可搬型の記録（記憶）媒体に、
これらのプログラムを記憶させておき、コンピュータが
備えている記録媒体に対して、アクセスするためのドラ
イブ装置を介して、或いは、ＬＡＮ等のネットワークを
介して、コンピュータに設けられたファイル装置にイン
ストールされる。そして、このファイル装置から処理に
必要なプログラムステップを主記憶に読み出し、主制御
部が実行するものである。The program of the present invention is installed on this computer. This installation is performed on portable recording (storage) media such as floppy disks, magneto-optical disks, etc.
These programs are stored and installed in a file device provided in the computer via a drive device for accessing a recording medium provided in the computer or via a network such as a LAN. You. Then, the program steps necessary for the processing are read out from the file device to the main memory, and are executed by the main control unit.

【００６１】[0061]

【発明の効果】以上説明したように、本発明によれば次
のような効果がある。As described above, the present invention has the following effects.

【００６２】（１）：画像２値化手段で、入力多値画像
を文字画像領域と背景画像領域とに分離し、該分離した
各文字画像領域に対して２値化処理を行い、該２値化処
理時の２値化しきい値から前記背景画像領域の２値化し
きい値を決定して前記背景画像領域の２値化処理を行う
ため、文字画像の背景に色が付いている画像であっても
高精度に２値化でき、かつ一定しきい値で２値化処理を
行うため高速に２値化することができる。(1): The input multi-valued image is separated into a character image region and a background image region by an image binarizing means, and each of the separated character image regions is subjected to a binarization process. The binarization threshold value of the background image area is determined from the binarization threshold value at the time of the binarization processing, and the binarization processing of the background image area is performed. Even if there is, binarization can be performed with high accuracy, and binarization can be performed at high speed because the binarization process is performed with a fixed threshold value.

【００６３】（２）：画像２値化手段で、入力多値画像
に含まれる低階調の代表値を選んで２値化処理を行い、
該作成された２値画像から文字画像を多く含む文字画像
領域とそれ以外の背景画像領域とに分離するため、低階
調である黒い部分のみ２値化して、容易に文字画像領域
を分離することができる。(2): The image binarization means selects a low gradation representative value included in the input multi-valued image and performs binarization processing.
In order to separate the created binary image into a character image region containing a large number of character images and a background image region other than the character image region, only the black portion having a low gradation is binarized to easily separate the character image region. be able to.

【００６４】（３）：画像２値化手段で、分離した文字
画像領域内が全て文字画像の単一階調である場合、該単
一階調の文字画像領域に背景画像が含まれるように該文
字画像領域を変更して、２値化処理を行うため、文字画
像領域内が全て文字画像の場合でも、文字画像が細く出
力されることを防止し、適切な２値化を行うことができ
る。(3): When the image data is binarized by the image binarizing means, when the character image area of the separated character image has a single gradation, the background image is included in the character image area of the single gradation. Since the character image area is changed and the binarization process is performed, even if the entire character image area is a character image, it is possible to prevent the character image from being output thinly and perform appropriate binarization. it can.

【００６５】（４）：画像２値化手段で、分離した各文
字画像領域に対して行った２値化処理時の各２値化しき
い値を求め、前記２値化処理された領域を含むように拡
大した矩形内の領域を前記求めた２値化しきい値で２値
化処理するため、文字画像領域として、抽出されなかっ
た文字画像部分を適切に２値化処理することができる。(4): The image binarization means finds each binarization threshold value in the binarization processing performed on each of the separated character image areas, and includes the binarized area. Since the area within the rectangle thus enlarged is subjected to the binarization processing using the obtained binarization threshold value, the character image area that has not been extracted can be appropriately binarized as the character image area.

【００６６】（５）：入力多値画像を文字画像領域と背
景画像領域とに分離し、該分離した各文字画像領域に対
して２値化処理を行い、該２値化処理時の２値化しきい
値から前記背景画像領域の２値化しきい値を決定して前
記背景画像領域の２値化処理を行う画像２値化手段と、
該２値化処理で作成した２値画像の認識を行う画像認識
手段と、してコンピュータを機能させるためのプログラ
ムを記録したコンピュータ読み取り可能な記録媒体とす
るため、この記録媒体のプログラムをコンピュータにイ
ンストールすることで、文字画像の背景に色が付いてい
る画像であっても高精度に２値化でき、かつ高速に２値
化することができる文書画像認識装置を容易に提供する
ことができる。(5): The input multi-valued image is separated into a character image area and a background image area, and each of the separated character image areas is subjected to binarization processing. Image binarization means for determining a binarization threshold value of the background image region from a binarization threshold value and performing binarization processing of the background image region;
An image recognizing means for recognizing the binary image created by the binarization process and a computer-readable recording medium on which a program for causing a computer to function are recorded. By installing the document image recognition device, it is possible to easily provide a document image recognition device that can binarize a character image with high accuracy even if the image has a colored background, and that can binarize it at high speed. .

[Brief description of the drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】実施の形態における文書画像認識装置の構成の
説明図である。FIG. 2 is an explanatory diagram of a configuration of a document image recognition device according to an embodiment.

【図３】実施の形態における文字画像領域検出部の処理
の説明図である。FIG. 3 is an explanatory diagram of processing of a character image area detection unit according to the embodiment.

【図４】実施の形態における２値画像領域分離手段のイ
メージによる説明図である。FIG. 4 is an explanatory diagram based on an image of a binary image area separating unit in the embodiment.

【図５】実施の形態における文字画像領域２値化部の説
明図である。FIG. 5 is an explanatory diagram of a character image area binarizing unit according to the embodiment.

【図６】実施の形態における判別分析における判別基準
を用いた２値化処理例の説明図である。FIG. 6 is an explanatory diagram of an example of a binarization process using a discriminant criterion in discriminant analysis according to the embodiment;

【図７】実施の形態における背景画像領域２値化部の処
理の説明図である。FIG. 7 is an explanatory diagram of processing of a background image area binarizing unit according to the embodiment;

【図８】実施の形態における背景画像領域２値化の例１
の説明図である。FIG. 8 is an example 1 of binarization of a background image area in the embodiment.
FIG.

【図９】実施の形態における背景画像領域２値化の例２
の説明図である。FIG. 9 illustrates an example 2 of binarization of a background image area according to the embodiment.
FIG.

[Explanation of symbols]

１入力多値画像１０ａ画像２値化手段１４ａ画像認識手段 DESCRIPTION OF SYMBOLS 1 Input multivalued image 10a Image binarization means 14a Image recognition means

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5B029 AA01 CC29 DD07 EE17 5C077 LL18 MP05 PP27 PP28 PQ08 RR02 RR16 ──────────────────────────────────────────────────続き Continued on the front page F term (reference) 5B029 AA01 CC29 DD07 EE17 5C077 LL18 MP05 PP27 PP28 PQ08 RR02 RR16

Claims

[Claims]

1. A document image recognizing apparatus comprising: an image binarizing unit for generating a binary image from an input multi-valued image; and an image recognizing unit for recognizing the generated binary image. The binarizing means separates the input multi-valued image into a character image region and a background image region, performs a binarization process on each of the separated character image regions, and performs a binarization threshold in the binarization process. A document image recognition apparatus, wherein a threshold value for binarizing the background image area is determined from a value, and a binarization process is performed on the background image area.

2. The image binarizing means selects a low gradation representative value included in the input multi-valued image and performs a binarizing process.
2. The document image recognition apparatus according to claim 1, wherein the generated binary image is separated into a character image region including many character images and a background image region other than the character image region.

3. The image binarizing means according to claim 1, wherein when the separated character image area is entirely a single tone of the character image, the character image area of the single tone includes a background image. 2. The document image recognition apparatus according to claim 1, wherein the character image area is changed to perform a binarization process.

4. The image binarizing means obtains each binarization threshold value at the time of the binarization process performed on each of the separated character image regions, and includes the binarized region. 2. The document image recognition apparatus according to claim 1, wherein the area within the rectangle thus enlarged is binarized by the obtained binarization threshold value.

5. An input multi-valued image is separated into a character image region and a background image region, a binarization process is performed on each of the separated character image regions, and a binarization threshold in the binarization process. Image binarization means for determining a binarization threshold value of the background image area from the values and performing binarization processing of the background image area; and an image for recognizing the binary image created by the binarization processing. A computer-readable recording medium that stores a program for causing a computer to function as a recognition unit.