JP2002032713A

JP2002032713A - Character recognition processing method

Info

Publication number: JP2002032713A
Application number: JP2000215562A
Authority: JP
Inventors: Yoshiko Hozumi; 芳子穂積
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2000-07-17
Filing date: 2000-07-17
Publication date: 2002-01-31

Abstract

PROBLEM TO BE SOLVED: To provide a character recognizing method which can recognize characters of a character image with high precision even if the image has luminance spots or is distorted. SOLUTION: In this character recognizing method which recognizes characters of character image data, the character image data are divided into blocks, a binarization threshold for separating a character part and a background part is found by the blocks, and then the luminance is distributed over the whole character image data from the binarized threshold to perform the character recognition.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、デジタルカメラや
ビデオカメラ、スキャナ等で入力した画像に含まれる文
字を認識する文字認識方法に関するものである。[0001] 1. Field of the Invention [0002] The present invention relates to a character recognition method for recognizing characters contained in an image input by a digital camera, a video camera, a scanner or the like.

【０００２】[0002]

【従来の技術】従来より、文字認識方法については、多
くの方法が提案されている。例えば、入力データと辞書
のイメージデータとのパターンマッチングを行って、入
力データの文字を認識したり、或いは文字の背景特徴や
ストローク特徴を用いて、入力データの文字認識を行う
方法がある。しかし、いずれの文字認識方法も文書をス
キャナー等で入力した比較的高品質、高解像度の画像を
文字認識の対象としている。これまでは、文書をスキャ
ナーで入力する方法が主流であったが、デジタルスチル
カメラやビデオカメラの普及に伴い、これらのカメラを
画像入力装置として使用する場合が急速に増え、そのア
プリケーションとして、これらの画像入力装置で入力し
た画像に対して文字認識を行うことが生じてきた。2. Description of the Related Art Conventionally, many character recognition methods have been proposed. For example, there is a method of performing pattern matching between input data and image data of a dictionary to recognize characters of the input data, or performing character recognition of the input data using background characteristics and stroke characteristics of the characters. However, in any of the character recognition methods, a relatively high-quality and high-resolution image obtained by inputting a document with a scanner or the like is targeted for character recognition. Until now, the main method was to input documents with a scanner, but with the spread of digital still cameras and video cameras, the use of these cameras as image input devices has increased rapidly, Character recognition is performed on an image input by an image input device.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、従来よ
り行われている文字認識方法は、高品質、高解像度の文
字画像を対象としているため、この方法をそのままデジ
タルカメラやビデオカメラ等で入力した低解像度で輝度
斑や歪のある文字画像に適用すると、文字認識率が著し
く低下していた。その原因としては、低解像度のために
認識に必要な文字サイズが得られないことによる、また
カメラ固有の画像の輝度斑やノイズ、撮影位置による歪
等が挙げられる。また、従来の文字認識方法では、文書
をスキャナーで入力する場合にもサイズの小さい文字は
認識できないことが多かった。However, since the conventional character recognition method is intended for high-quality, high-resolution character images, this method is directly used by a digital camera, a video camera, or the like. When applied to a character image having luminance unevenness or distortion at a resolution, the character recognition rate has been significantly reduced. The causes include the inability to obtain a character size required for recognition due to low resolution, luminance unevenness and noise of an image unique to a camera, distortion due to a shooting position, and the like. Further, in the conventional character recognition method, even when a document is input by a scanner, small characters cannot be recognized in many cases.

【０００４】そこで、本発明は、上記のような問題点を
解消するためになされたもので、輝度斑や歪のある文字
画像でも高い精度で文字認識できる文字認識方法を提供
することを目的とする。The present invention has been made in order to solve the above-mentioned problems, and an object of the present invention is to provide a character recognition method capable of character recognition with high accuracy even for a character image having luminance unevenness or distortion. I do.

【０００５】[0005]

【課題を解決するための手段】本発明の文字認識方法の
第１の発明は、文字画像データを文字認識する文字認識
方法において、前記文字画像データを複数の分割ブロッ
クに分割した後、この分割ブロック毎に文字部分と背景
部分とを分ける２値化しきい値を求めた後、この２値化
しきい値から前記文字画像データ全体の輝度分布が均一
になるように輝度補正を行って文字認識を行うことを特
徴とする。第２の発明は、請求項１記載の文字認識方法
において、前記２値化しきい値を求めた後、平滑化処理
を行うことを特徴とする。第３の発明は、文字画像デー
タを文字認識する文字認識方法において、前記文字画像
データを複数の分割ブロックに分割した後、この分割ブ
ロック毎に文字部分と背景部分とを分ける２値化しきい
値を求め、この２値化しきい値の平滑化処理を行って輝
度補正マップを作成し、次に、この輝度補正マップの輝
度補正処理を行って前記文字画像データ全体の輝度分布
を均一にし、更に、各文字の輪郭線の抽出を行ない、こ
の輪郭線が互いにオーバーラップするように拡大した
後、一行毎に文字を塗りつぶして塗りつぶし領域を作成
し、次に、この塗りつぶし領域から１行ずつ取り出し
て、これを元画像に重ねる行抽出を行って文字認識前処
理を終了した後、この行抽出された文字と予め登録され
た辞書データとを照合して文字認識を行うことを特徴と
する。According to a first aspect of the present invention, there is provided a character recognition method for character recognition of character image data, comprising the step of dividing the character image data into a plurality of divided blocks, After obtaining a binarization threshold value for separating the character portion and the background portion for each block, character recognition is performed by performing luminance correction based on the binarization threshold value so that the luminance distribution of the entire character image data becomes uniform. It is characterized by performing. According to a second aspect, in the character recognition method according to the first aspect, a smoothing process is performed after obtaining the binarization threshold. According to a third aspect of the present invention, in the character recognition method for character recognizing character image data, after dividing the character image data into a plurality of divided blocks, a binary threshold value for dividing a character portion and a background portion for each of the divided blocks Is obtained, a brightness correction map is created by performing a smoothing process of the binarization threshold value, and then a brightness correction process of the brightness correction map is performed to uniform the brightness distribution of the entire character image data. After extracting the outline of each character and enlarging the outline so that they overlap each other, the character is filled line by line to create a filled area, and then the line is taken out from the filled area one line at a time. After performing the character extraction pre-processing by performing line extraction on which this is superimposed on the original image, it is assumed that character recognition is performed by comparing the extracted characters with dictionary data registered in advance. And butterflies.

【０００６】[0006]

【発明の実施の形態】以下、本発明の実施形態の文字認
識方法について、図１乃至図１２を用いて詳細に説明す
る。図１は、本発明の文字認識方法に用いられる文字認
識装置を示すブロック図である。図２は、図１に示した
認識処理装置による文字認識処理手順を示すフローチャ
ートである。図３は、ブロック毎に分割された文字画像
を示す図である。図４は、ブロック毎に分割された文字
画像の２値化しきい値を示す図である。図５は、文字画
像にノイズがある場合に、周辺の２値化しきい値との平
均をとって求めた文字画像の２値化しきい値を示す図で
ある。図６は、２値化しきい値分布を元の画像の大きさ
に拡大して得られた輝度補正マップを示す図である。図
７は、輝度補正した文字画像データを示す図である。図
８は、図７に示す文字画像データを２値化した図であ
る。図９は、抽出した２値化した文字領域を示す図であ
る。図１０は、抽出した２値化文字領域の外形矩形座標
を横方向に拡大した図である。図１１は、行抽出を行っ
た文字領域を示し、（ａ）は１行目、（ｂ）は１７行目
を示す図である。図１２は、２倍に拡大した後に２値化
した画像を示す図である。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a character recognition method according to an embodiment of the present invention will be described in detail with reference to FIGS. FIG. 1 is a block diagram showing a character recognition device used in the character recognition method of the present invention. FIG. 2 is a flowchart illustrating a character recognition processing procedure performed by the recognition processing device illustrated in FIG. FIG. 3 is a diagram showing a character image divided for each block. FIG. 4 is a diagram illustrating a binarization threshold value of a character image divided for each block. FIG. 5 is a diagram illustrating a binarization threshold value of a character image obtained by averaging the binarization threshold value of a character image when the noise is present in the character image. FIG. 6 is a diagram illustrating a luminance correction map obtained by enlarging the binarized threshold distribution to the size of the original image. FIG. 7 is a diagram illustrating character image data subjected to luminance correction. FIG. 8 is a diagram in which the character image data shown in FIG. 7 is binarized. FIG. 9 is a diagram showing the extracted binarized character area. FIG. 10 is a diagram in which the outer shape rectangular coordinates of the extracted binarized character area are enlarged in the horizontal direction. FIGS. 11A and 11B show a character area from which a line has been extracted, wherein FIG. 11A shows the first line and FIG. 11B shows the seventeenth line. FIG. 12 is a diagram illustrating an image binarized after being enlarged twice.

【０００７】図１に示すように、文字認識装置は、スキ
ャナー、デジタルスチルカメラ、ビデオカメラにキャプ
チャーを組み合わせた装置等よりなる画像入力装置１
と、この画像入力装置１に接続され、画像認識を行う認
識処理装置２と、認識処理装置２に接続され、ユーザか
らの指示などを入力するマウス、キーボード等の入力装
置３と、認識処理装置２に接続され、辞書データおよび
プログラムを記憶する記憶装置４と、認識処理装置２で
処理された画像の表示を行う画像表示装置５と、認識処
理装置２から画像等を出力する出力装置６とからなる。
なお、上記した文字認識装置は、例えば、パーソナルコ
ンピュータとその周辺機器により構成することができ
る。As shown in FIG. 1, the character recognition device is an image input device 1 including a scanner, a digital still camera, a video camera combined with a capture device, and the like.
A recognition processing device 2 connected to the image input device 1 for performing image recognition; an input device 3 such as a mouse and a keyboard connected to the recognition processing device 2 for inputting an instruction from a user; 2, a storage device 4 for storing dictionary data and programs, an image display device 5 for displaying images processed by the recognition processing device 2, and an output device 6 for outputting images and the like from the recognition processing device 2. Consists of
The above-described character recognition device can be composed of, for example, a personal computer and its peripheral devices.

【０００８】次に、本発明の実施形態の文字認識方法に
ついて、図２乃至図１２を用いて説明する。本発明の実
施形態の文字認識方法は、ステップ３０１〜ステップ３
０５における文字認識前処理を行った後、文字認識を行
うことを特徴とする。まず始めに、図２に示すように、
ステップ２０１にて、標準文字の画像データから輝度等
高線を抽出して輪郭線データを作成し、この輪郭線デー
タをステップ２０２にて記憶装置４に登録して辞書デー
タとする。なお、文字フォントデータとして既に作成さ
れているアウトラインフォントデータを辞書データとし
て使用しても良い。Next, a character recognition method according to an embodiment of the present invention will be described with reference to FIGS. The character recognition method according to the embodiment of the present invention includes steps 301 to 3
The character recognition is performed after performing the character recognition preprocessing in step 05. First, as shown in FIG.
In step 201, contour lines are created by extracting luminance contour lines from the image data of the standard characters, and the contour data is registered in the storage device 4 in step 202 to become dictionary data. Note that outline font data already created as character font data may be used as dictionary data.

【０００９】上記した辞書データを用いて、図１に示し
た認識処理装置２が文字認識する方法について、図２に
示すステップ３０１〜ステップ３０７のフローチャート
に従って説明する。ステップ３０１にて、デジタルスチ
ルカメラ、スキャナー等の画像入力装置１によって入力
された文字画像データを記憶装置４に保存する。次に、
ステップ３０２にて、文字画像データの輝度補正マップ
作成を以下のようにして行う。まず、図３に示すよう
に、入力した文字画像データを適当なブロックに分割す
る。この際、ブロックは、文字部分と背景部分とが含ま
れるようにして、例えば、４０ピクセル×４０ピクセル
にする。次に、図４に示すように、文字画像データは、
文字部分と背景部分にピークがあるという特徴を生かし
て、文字部分と背景部分とを分ける２値化しきい値を各
ブロック毎に求めて、文字画像の並び順に並べた２値化
しきい値分布を作成する。A method of recognizing characters by the recognition processing device 2 shown in FIG. 1 using the above-described dictionary data will be described with reference to a flowchart of steps 301 to 307 shown in FIG. In step 301, character image data input by the image input device 1 such as a digital still camera or a scanner is stored in the storage device 4. next,
In step 302, a brightness correction map for character image data is created as follows. First, as shown in FIG. 3, the input character image data is divided into appropriate blocks. At this time, the block is set to, for example, 40 pixels × 40 pixels so as to include a character portion and a background portion. Next, as shown in FIG.
Taking advantage of the feature that the character portion and the background portion have peaks, a binarization threshold value for separating the character portion and the background portion is obtained for each block, and a binarization threshold distribution arranged in the order of the character images is obtained. create.

【００１０】この２値化しきい値は、一般的な判別分析
法によって求められ、輝度値ヒストグラムから分散が最
大になるように文字部分と背景部分とを分割する値から
求められる。こうして得られる２値化しきい値は、文字
画像の周辺部では、ブロック内の輝度が低いので、低く
なり、この中心部では、ブロック内の輝度が高いので、
高くなる。The binarization threshold is obtained by a general discriminant analysis method, and is obtained from a value obtained by dividing a character portion and a background portion from a luminance value histogram so that the variance is maximized. The binarization threshold obtained in this manner is low in the peripheral portion of the character image because the luminance in the block is low, and is low in the central portion because the luminance in the block is high.
Get higher.

【００１１】文字画像による局所的なノイズが現れる場
合には、図５に示すように、周辺の２値化しきい値との
平均をとって平滑化処理するようにしても良い。図６に
示すように、２値化しきい値分布を元の画像の大きさに
戻して、輝度補正マップを作成する。When local noise due to a character image appears, as shown in FIG. 5, a smoothing process may be performed by averaging the binarized threshold value in the vicinity. As shown in FIG. 6, a luminance correction map is created by returning the binarized threshold distribution to the original image size.

【００１２】次に、ステップ３０３にて、上記の輝度補
正マップから文字画像データを以下のようにして得る。
輝度補正マップの輝度補正処理は、以下の（１）式を用
いて行うことができる。元画像の輝度値をＹ、輝度の最
大値をＹ_max、輝度補正マップにおける２値化しきい値
をＴとすると、補正後の輝度値Ｙ₁は、Ｙ₁＝（Ｙ＋（Ｙ_max−Ｔ））／２…（１）の関係式によって求めることができる。Next, in step 303, character image data is obtained from the brightness correction map as follows.
The brightness correction processing of the brightness correction map can be performed using the following equation (1). Assuming that the luminance value of the original image is Y, the maximum value of the luminance is Y _max , and the binarization threshold in the luminance correction map is T, the corrected luminance value Y ₁ is Y ₁ = (Y + (Y _max −T) ) / 2 (1).

【００１３】（１）式から、補正後の輝度値は、輝度補
正マップにおける２値化しきい値Ｔが大きいところで
は、小さくなり、２値化しきい値Ｔが小さいところで
は、大きくなるので、文字画像データ全体としての輝度
分布は、均一になる。このようにして、（１）式の演算
を行って、輝度補正マップによる輝度補正処理が可能と
なる。この結果、図７に示すように、輝度補正された文
字画像データを得ることができる。従来の方法を用い
て、これを２値化すると図８に示すようになる。From equation (1), the corrected luminance value becomes smaller when the binarization threshold T is large in the luminance correction map, and becomes large when the binarization threshold T is small. The luminance distribution of the entire image data becomes uniform. In this way, the calculation of the expression (1) is performed, and the luminance correction processing using the luminance correction map can be performed. As a result, as shown in FIG. 7, character image data whose luminance has been corrected can be obtained. When this is binarized using a conventional method, the result is as shown in FIG.

【００１４】次に、ステップ３０４にて、図９に示すよ
うに、図８に示す２値化した文字画像データから各文字
の輪郭線を抽出して文字領域抽出を行う。図９は、図８
に示す２値化した文字画像データの一部を示している。
次に、図１０に示すように、抽出した文字領域の外形矩
形の座標を左右にオフセットして横方向に拡大する。こ
の際、拡大幅は、隣り合った文字がオーバーラップする
ように決める。Next, in step 304, as shown in FIG. 9, the outline of each character is extracted from the binarized character image data shown in FIG. 8 to extract a character region. FIG. 9 shows FIG.
2 shows a part of the binarized character image data.
Next, as shown in FIG. 10, the coordinates of the extracted outer shape rectangle of the character area are offset to the left and right and enlarged in the horizontal direction. At this time, the enlargement width is determined so that adjacent characters overlap.

【００１５】次に、ステップ３０５にて、図１１に示す
ように、この抽出した文字領域から一行毎に文字を塗り
つぶして各行が塗りつぶされた塗りつぶし領域７を作成
し、この塗りつぶし領域７から１行ずつ取り出してマス
クを作成し、これを元画像に重ねて行抽出を行う。こう
して、文字認識前処理を完了する。Next, in step 305, as shown in FIG. 11, characters are painted line by line from the extracted character region to create a painted region 7 in which each line is painted, and one line is drawn from the painted region 7. Then, a mask is created by taking out each of the masks, and the mask is superimposed on the original image to perform row extraction. Thus, the character recognition pre-processing is completed.

【００１６】以上のように、文字認識前処理は、元画像
の大きさのままで行っても良いが、元画像を縮小して行
っても良い。縮小した元画像を使用した場合には、使用
メモリ量や処理量を減らすことができるので、携帯型端
末に搭載することが可能となる。この後、ステップ３０
６にて、前記した文字認識前処理により行抽出された文
字領域から文字抽出を行う。As described above, the character recognition pre-processing may be performed with the size of the original image, or may be performed with the original image reduced. When the reduced original image is used, the amount of memory used and the amount of processing can be reduced, so that the image can be mounted on a portable terminal. After this, step 30
At 6, the character is extracted from the character area where the line is extracted by the above-described character recognition preprocessing.

【００１７】次に、ステップ３０７にて、認識処理装置
２は、ステップ３０６で抽出された文字を前記したステ
ップ２０２にて記憶装置４に登録された辞書データと順
次照合して文字認識を行った後、その結果を画像表示装
置５又は、出力装置６に出力する。前記したように、ス
テップ３０３にて得られた文字画像データをそのまま２
値化して文字認識を行っても良いが、デジタルカメラ等
で撮影した画像の場合には解像度が不足することが多い
ので、この画像を何倍かに拡大した後に２値化すれば高
品質な２値化画像が得られる。この場合、画像の解像
度、文字の大きさなどの状況により必要に応じて画像の
拡大を行うと良い。こうして、２倍に拡大された後に２
値化された画像を図１２に示す。Next, in step 307, the recognition processing device 2 performs character recognition by sequentially collating the characters extracted in step 306 with the dictionary data registered in the storage device 4 in step 202 described above. Thereafter, the result is output to the image display device 5 or the output device 6. As described above, the character image data obtained in step 303 is
Character recognition may be performed by converting the image into a value. However, in the case of an image photographed by a digital camera or the like, the resolution is often insufficient. A binary image is obtained. In this case, it is preferable to enlarge the image as necessary depending on the situation such as the resolution of the image and the size of characters. Thus, after being doubled, 2
FIG. 12 shows the binarized image.

【００１８】この文字認識前処理を通常行われている一
行単位の拡大処理を行った場合には、拡大幅だけの反復
処理が必要であるが、これに対して、輪郭線により座標
値に変換して、拡大処理を行った場合には、座標値のオ
フセットだけでよく、拡大幅に関係なく処理時間を一定
にすることができる。When the character recognition pre-processing is performed in a line-by-line enlargement process which is usually performed, it is necessary to repeat the enlargement width only. When the enlargement processing is performed, only the offset of the coordinate value is required, and the processing time can be constant regardless of the enlargement width.

【００１９】また、単独文字での文字認識照合の結果に
加えて、単語や熟語の情報を用いると、文字認識照合結
果を補正して認識率を向上させることができる。そのた
めに文字認識照合結果を１つだけ出力するのではなく、
複数の文字認識照合結果を出力し、これを辞書データと
照合させて正解を得る等の文字認識方法をとることがで
きる。本発明によって文字認識前処理を行った文字画像
について文字認識方式の１つである拡張セル方式で文字
認識を行った結果を図１３に示す。図１３は、拡張セル
方式で文字認識を行った結果を示し、（ａ）は入力画像
を示す図、（ｂ）は文字認識結果を示す図である。When information on words and idioms is used in addition to the result of character recognition and collation with a single character, the result of character recognition and collation can be corrected to improve the recognition rate. Therefore, instead of outputting only one character recognition collation result,
A character recognition method such as outputting a plurality of character recognition collation results and collating the result with dictionary data to obtain a correct answer can be adopted. FIG. 13 shows the result of character recognition performed on the character image subjected to the character recognition preprocessing according to the present invention by the extended cell method, which is one of the character recognition methods. FIGS. 13A and 13B show the result of character recognition performed by the extended cell method. FIG. 13A is a diagram showing an input image, and FIG. 13B is a diagram showing a character recognition result.

【００２０】本発明の実施形態によれば、デジタルスチ
ルカメラ等から入力された画像データ等の文字画像デー
タを文字部分と背景部分とを含む複数のブロックに分割
した後、文字部分と背景部分とを分ける２値化しきい値
を各ブロック毎に求め、この２値化しきい値の平滑化処
理を行って、輝度補正マップを作成し、次に、この輝度
補正マップの輝度補正処理を行なって、文字画像データ
全体としての輝度分布を均一にし、引き続いて、各文字
の輪郭線の抽出を行ない、更に、この輪郭線が互いにオ
ーバーラップするように拡大した後、一行後に文字を塗
りつぶして、各行が塗りつぶされた塗りつぶし領域７を
作成し、次に、この塗りつぶし領域７から１行ずつ取り
出して、これを元画像に重ねる行抽出を行って文字認識
前処理を終了した後、この抽出された行から文字抽出を
行い、この抽出された文字と予め登録された辞書データ
とを照合して文字認識を行うので、輝度斑や歪のある文
字画像でも高い精度で文字認識を行うことができる。According to the embodiment of the present invention, character image data such as image data input from a digital still camera or the like is divided into a plurality of blocks including a character portion and a background portion, and then divided into a plurality of blocks including a character portion and a background portion. Is obtained for each block, a smoothing process of the binary threshold value is performed, a brightness correction map is created, and then a brightness correction process of the brightness correction map is performed. The luminance distribution of the entire character image data is made uniform, the outline of each character is subsequently extracted, and the outline is enlarged so that these outlines overlap with each other. A filled area 7 is created, and then one line is taken out from the filled area 7 and a line is extracted by superimposing the line on the original image, and the character recognition preprocessing is completed. Characters are extracted from the extracted lines, and character recognition is performed by comparing the extracted characters with pre-registered dictionary data, so that character recognition can be performed with high accuracy even for character images having luminance unevenness or distortion. It can be carried out.

【００２１】また、傾きや歪みのある入力画像について
も高精度に行を抽出することが可能であるので、文字認
識の精度を上げることができる。更に、輪郭線を用いる
ことによって抽出にかかる処理量を減らすことが可能で
あるので、携帯型端末など計算能力の少ない機器への搭
載が可能である。Further, since it is possible to extract a line with high accuracy even from an input image having an inclination or distortion, the accuracy of character recognition can be improved. Furthermore, since the amount of processing required for extraction can be reduced by using a contour line, it can be mounted on a device having a low calculation capability such as a portable terminal.

【００２２】[0022]

【発明の効果】本発明によれば、文字画像データを文字
認識する文字認識方法において、前記文字画像データを
複数の分割ブロックに分割した後、この分割ブロック毎
に文字部分と背景部分とを分ける２値化しきい値を求め
た後、この２値化しきい値から前記文字画像データ全体
の輝度分布が均一になるように輝度補正を行って文字認
識を行うので、輝度斑や歪のある文字画像でも高い精度
で文字認識を行うことができる。また、文字画像データ
を文字認識する文字認識方法において、前記文字画像デ
ータを複数の分割ブロックに分割した後、この分割ブロ
ック毎に文字部分と背景部分とを分ける２値化しきい値
を求め、この２値化しきい値の平滑化処理を行って輝度
補正マップを作成し、次に、この輝度補正マップの輝度
補正処理を行って前記文字画像データ全体の輝度分布を
均一にし、更に、各文字の輪郭線の抽出を行ない、この
輪郭線が互いにオーバーラップするように拡大した後、
一行毎に文字を塗りつぶして塗りつぶし領域を作成し、
次に、この塗りつぶし領域から１行ずつ取り出して、こ
れを元画像に重ねる行抽出を行って文字認識前処理を終
了した後、この行抽出された文字と予め登録された辞書
データとを照合して文字認識を行うので、傾きや歪みの
ある入力画像についても高精度に行を抽出することが可
能となり、文字認識の精度を上げることができる。更
に、輪郭線を用いることによって抽出にかかる処理量を
減らすことが可能であるので、携帯型端末など計算能力
の少ない機器への搭載が可能である。According to the present invention, in the character recognition method for character recognition of character image data, after dividing the character image data into a plurality of divided blocks, a character portion and a background portion are separated for each of the divided blocks. After obtaining the binarization threshold, character recognition is performed by performing luminance correction based on the binarization threshold so that the luminance distribution of the entire character image data becomes uniform. However, character recognition can be performed with high accuracy. In the character recognition method for character recognition of character image data, after dividing the character image data into a plurality of divided blocks, a binarization threshold for dividing a character portion and a background portion for each of the divided blocks is obtained. A brightness correction map is created by performing a smoothing process of the binarization threshold, and then a brightness correction process of the brightness correction map is performed to make the brightness distribution of the entire character image data uniform. After extracting the outlines and expanding them so that they overlap each other,
Fill the text line by line to create a filled area,
Next, one line is taken out from the filled area one line at a time, and a line is extracted by superimposing the line on the original image, and the character recognition preprocessing is completed. Then, the extracted character is collated with dictionary data registered in advance. Therefore, it is possible to extract a line with high accuracy even for an input image having an inclination or distortion, thereby improving the accuracy of character recognition. Furthermore, since the amount of processing required for extraction can be reduced by using a contour line, it can be mounted on a device having a low calculation capability such as a portable terminal.

[Brief description of the drawings]

【図１】本発明の文字認識方法に用いられる文字認識装
置を示すブロック図である。FIG. 1 is a block diagram showing a character recognition device used in the character recognition method of the present invention.

【図２】図１に示した認識処理装置による文字認識処理
手順を示すフローチャートである。FIG. 2 is a flowchart illustrating a character recognition processing procedure performed by the recognition processing device illustrated in FIG. 1;

【図３】ブロック毎に分割された文字画像を示す図であ
る。FIG. 3 is a diagram showing a character image divided for each block.

【図４】ブロック毎に分割された文字画像の２値化しき
い値を示す図である。FIG. 4 is a diagram showing a binarization threshold value of a character image divided for each block.

【図５】文字画像にノイズがある場合に、周辺の２値化
しきい値との平均をとって求めた文字画像の２値化しき
い値を示す図である。FIG. 5 is a diagram illustrating a binarization threshold value of a character image obtained by averaging the binarization threshold value in the vicinity when a noise is present in the character image;

【図６】２値化しきい値分布を元の画像の大きさに拡大
して得られた輝度補正マップを示す図である。FIG. 6 is a diagram showing a luminance correction map obtained by enlarging a binarized threshold distribution to the size of an original image.

【図７】輝度補正した文字画像データを示す図である。FIG. 7 is a diagram showing character image data subjected to luminance correction.

【図８】図７に示す文字画像データを２値化した図であ
る。8 is a diagram in which the character image data shown in FIG. 7 is binarized.

【図９】２値化した文字画像データの文字領域抽出を行
った図である。FIG. 9 is a diagram in which a character area is extracted from binarized character image data.

【図１０】文字領域抽出した外形矩形の座標を横方向に
拡大した図である。FIG. 10 is a diagram in which the coordinates of the external rectangle extracted from the character area are enlarged in the horizontal direction.

【図１１】行抽出を行った様子を示し、（ａ）は１行
目、（ｂ）は１７行目を示す図である。11A and 11B show a state in which row extraction is performed, wherein FIG. 11A is a view showing a first row, and FIG. 11B is a view showing a 17th row.

【図１２】２倍に拡大した後に２値化した様子を示す図
である。FIG. 12 is a diagram illustrating a state where the image is binarized after being enlarged twice.

【図１３】拡張セル方式で文字認識を行った結果を示
し、（ａ）は入力画像を示す図、（ｂ）は認識結果を示
す図である。13A and 13B show the results of character recognition performed by the extended cell method, FIG. 13A shows an input image, and FIG. 13B shows recognition results.

[Explanation of symbols]

１…画像入力装置、２…認識処理装置、３…入力装置、
４…記憶装置、５…画像表示装置、６…出力装置、７…
塗りつぶし領域DESCRIPTION OF SYMBOLS 1 ... Image input device, 2 ... Recognition processing device, 3 ... Input device,
4 storage device 5 image display device 6 output device 7
Fill area

Claims

[Claims]

1. A character recognition method for character recognition of character image data, comprising: dividing the character image data into a plurality of divided blocks; and setting a binary threshold for dividing a character portion and a background portion for each of the divided blocks. A character recognition method comprising: performing a luminance correction based on the binarization threshold so that the luminance distribution of the entire character image data becomes uniform;

2. The character recognition method according to claim 1, wherein a smoothing process is performed after obtaining the binarization threshold.

3. A character recognition method for character recognizing character image data, comprising: dividing the character image data into a plurality of divided blocks; and setting a binarization threshold for dividing a character portion and a background portion for each of the divided blocks. Then, a brightness correction map is created by performing a smoothing process of the binarization threshold value, and then a brightness correction process of the brightness correction map is performed to make the brightness distribution of the entire character image data uniform. After extracting the outline of each character and enlarging the outline so that they overlap each other, the character is filled line by line to create a filled area, and then the lines are taken out of the filled area one line at a time. After performing line extraction to overlap this with the original image and ending the character recognition preprocessing, character recognition is performed by collating the extracted character with dictionary data registered in advance. Character recognition method to be used.