JPH05282493A

JPH05282493A - Roman letter recognizing device

Info

Publication number: JPH05282493A
Application number: JP4076685A
Authority: JP
Inventors: Michiaki Nobuoka; 道明信岡
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1992-03-31
Filing date: 1992-03-31
Publication date: 1993-10-29

Abstract

PURPOSE: To provide a Roman letter recognizing device that improves the operability of letter input by recognizing, at high speed, low-quality documents that contain touching letters and blurred letters. CONSTITUTION: The device consists of: a means that segments a row area after segmenting the document area and then estimates, from the height of the row, the width of the widest letter; a means that segments a letter area of approximately the estimated letter width and recognizes it; a means 11 that, when the recognition result is acceptable, stores the height of the letter area relative to a reference letter and its position relative to a reference line (the letter position feature), together with the image information; and a means 10 that, when the recognition result is rejected, limits the candidate letters for re-segmentation in the letter area according to a letter information dictionary 9 containing the letter position features of all letters, superposes the stored letter images, segments the letter area again, and performs recognition.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to an English character recognition device that recognizes characters in general English documents.

[0002]

[Prior Art] In recent years, demand has grown for using character recognition devices as input devices for computers and the like, and a character input device that can efficiently obtain stable recognition results has become indispensable for improving the performance of computer systems.

[0003] A conventional English character recognition device, when cutting out character areas from a document area, relied on the fact that an English character is not split into left and right parts: it cut out each area bounded by blank space on both sides as a character area and performed recognition on it.

[0004]

[Problems to be Solved by the Invention] With the conventional configuration, however, in a document whose print quality has been degraded by copying or the like, two adjacent characters may touch each other (hereinafter, touching characters), or a single character may break and split into two parts (hereinafter, broken characters). In such cases the conventional method cannot cut out the character areas correctly.

[0005] In addition, because the recognition accuracy is low, recognition cannot be performed at high speed, and the workability of character input is poor.

[0006] The present invention solves these problems. Its object is to provide an English character recognition device that, in character recognition of general English documents, can recognize low-quality documents containing touching and broken characters at high speed and thereby markedly improves the workability of character input.

[0007]

[Means for Solving the Problems] To achieve this object, the English character recognition device of the present invention cuts out a document area, then cuts out line areas, and estimates the width of the widest character from the line height. It then cuts out a character area at around the estimated character width and performs recognition. If the result is acceptable as a recognition result, the device stores the height of the character area relative to a reference character and its position relative to a reference line (hereinafter, this height and position information is called the character position feature), together with the image information. If the result is rejected, the device limits the candidate characters for re-segmentation of that character area on the basis of a character information dictionary that stores the character position features of all characters in advance, superimposes the stored character images, cuts out the area again, and performs recognition. This configuration improves both the accuracy of character area segmentation and the processing speed for documents that contain touching and broken characters.

Specifically, the device comprises: an image input unit that inputs the document to be recognized; a text area cutout unit that outputs a text area from the input document image; a line area cutout unit that outputs line areas from the text area; a character area cutout unit that outputs character areas from a line area; a recognition dictionary that stores the graphic features of all recognition target characters in advance; a character recognition unit that recognizes the image of a cut-out character area on the basis of the recognition dictionary; a re-cutout area storage unit that stores the positions of areas for which, as a result of recognition by the character recognition unit, character area cutout and character recognition must be performed again; a character information storage unit that stores, for each character judged acceptable by the character recognition unit, the recognized character code, the height of the character relative to a reference character and the position of the character relative to a reference line (hereinafter, the character position feature), and the character image; a character information dictionary that stores the character position features of all recognition target characters in advance; an overlay processing unit that, for each area stored in the re-cutout area storage unit, limits the candidate characters for re-segmentation using the character information dictionary and superimposes the images stored in the character information storage unit to perform cutout and recognition; a recognition result storage unit that stores the results and positions of character recognition; and a recognition result output unit that outputs the recognition results stored in the recognition result storage unit.

[0008]

[Operation] With this configuration, estimating the maximum character width before cutting out character areas allows a broken character to be cut out without being split. When the recognition result is rejected, the character position feature is used to limit the candidate characters for re-segmentation of that character area, and images of characters in the same document that were recognized with high confidence are superimposed on it, so that broken and touching characters can be cut out and recognized at high speed.

[0009]

[Embodiment] An embodiment of the present invention is described below with reference to the drawings.

[0010] FIG. 1 is an overall configuration diagram of the English character recognition device of this embodiment. 1 is an image input unit that inputs the document to be recognized as a document image; 2 is a text area cutout unit that finds clusters of character strings in the input document image and outputs a text area; 3 is a line area cutout unit that finds the line breaks in the text area and outputs the extent of one line as a line area; 4 is a character area cutout unit that finds the character breaks in a line area and outputs the extent of one character as a character area; 5 is a recognition dictionary that stores the graphic features of all recognition target characters in advance; 6 is a character recognition unit that compares the graphic features in the character area cut out by the character area cutout unit 4 with the graphic features in the recognition dictionary 5, determines their similarity, and performs recognition; 7 is a re-cutout area storage unit that stores the positions of areas that, as a result of recognition by the character recognition unit 6, need to be cut out again; 8 is a character information storage unit that stores, for characters recognized with high confidence, the recognized character code, the height of the character relative to a reference character and the position of the character relative to a reference line (hereinafter, this height and position are called the character position feature), and the image; 9 is a character information dictionary that stores the character position features of all recognition target characters in advance; 10 is an overlay processing unit that, for each area stored in the re-cutout area storage unit 7, limits the candidate characters for re-segmentation on the basis of the character information dictionary 9, superimposes the image information stored in the character information storage unit 8, and cuts out and recognizes character areas; 11 is a recognition result storage unit that stores recognition results and positions; 12 is a recognition result output unit that outputs the recognition results held in the recognition result storage unit; 13 is an internal bus connecting units 1 to 5, 7 to 9, and 11; and 14, 15, and 16 are internal buses connecting 5 and 6, 9 and 10, and 11 and 12, respectively.

[0011] The operation of the English character recognition device configured as described above is explained below with reference to the drawings.

[0012] FIG. 2 is the overall flowchart, FIG. 3 is a flowchart of the character area cutout and recognition processing, FIG. 4 is a flowchart of the character area re-cutout and recognition processing, FIG. 5 is a flowchart of the overlay processing, FIG. 6 is an explanatory diagram of the character position features, and FIG. 7 is an explanatory diagram showing the contents of the character information dictionary.

[0013] First, in FIG. 2, the document to be recognized is input to the image input unit 1 as a document image (S1).

[0014] The input document image is sent to the text area cutout unit 2, which computes histograms of black pixels in the vertical and horizontal directions of the document image and extracts the text area from them. The position information of the text area is stored as internal data (S2).
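The histogram step in S2 can be illustrated with a minimal Python sketch. The patent does not say how the text area is derived from the two histograms, so the single bounding box, the 0/1 pixel encoding, the `threshold` parameter, and the function names below are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Image = List[List[int]]  # assumed encoding: 1 = black pixel, 0 = white pixel


def projection_histograms(image: Image) -> Tuple[List[int], List[int]]:
    """Count black pixels per row and per column."""
    row_counts = [sum(row) for row in image]
    col_counts = [sum(col) for col in zip(*image)] if image else []
    return row_counts, col_counts


def text_region(image: Image, threshold: int = 0) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right), inclusive, of the rows and columns
    whose black-pixel count exceeds `threshold`; None for a blank page.
    Treating the whole page as one text area is an assumption of this sketch."""
    rows, cols = projection_histograms(image)
    ys = [y for y, count in enumerate(rows) if count > threshold]
    xs = [x for x, count in enumerate(cols) if count > threshold]
    if not ys or not xs:
        return None
    return ys[0], xs[0], ys[-1], xs[-1]
```

A page with several separate text blocks would need the histograms to be split at wide gaps first; that refinement is left out here.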

[0015] The position information of the text area is sent to the line area cutout unit 3, which cuts out line areas within the text area. The line area cutout unit 3 computes the horizontal histogram within the text area and cuts out line areas using local minima close to 0 as line breaks. The position information of every line area in the text area found by the text area cutout unit 2 is obtained and stored as internal data (S3).
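As a rough illustration of S3, the sketch below splits a per-row black-pixel histogram into line areas. Treating every row whose count is at or below `gap_threshold` as a line break is a simplification of "local minima close to 0"; the threshold and the function name are assumptions, not values from the patent.

```python
from typing import List, Tuple


def split_lines(row_counts: List[int], gap_threshold: int = 0) -> List[Tuple[int, int]]:
    """Return inclusive (top, bottom) row ranges, one per detected line."""
    lines: List[Tuple[int, int]] = []
    start = None  # first row of the line currently being scanned
    for y, count in enumerate(row_counts):
        if count > gap_threshold:
            if start is None:
                start = y
        elif start is not None:
            lines.append((start, y - 1))
            start = None
    if start is not None:  # line runs to the last row
        lines.append((start, len(row_counts) - 1))
    return lines
```

Raising `gap_threshold` above 0 tolerates a little noise between lines, at the risk of clipping thin strokes at the top or bottom of a line.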

[0016] Next, the character area cutout and character recognition processing (S4) is described with reference to the flowchart in FIG. 3.

[0017] The position information of a line area is sent to the character area cutout unit 4, which cuts out character areas. On the basis of the received line area position information, the character area cutout unit 4 estimates the maximum character width as follows (S9).

[0018] Maximum character width = line height × 1.2

The circumscribed rectangles of the connected black-pixel components in the image data of the line area are extracted (S10). If a very small rectangle exists in the upper third or lower third of the line and another rectangle exists above or below it, the two are regarded as parts of a separated character such as "i" or "j", and the circumscribed rectangles are merged (S11). The resulting rectangles are hereinafter called basic rectangles. If the basic rectangle to be processed is at the beginning of the line, the cutout position is the X coordinate of the left side of the leftmost rectangle in the line area; otherwise, it is the X coordinate of the left side of the next circumscribed rectangle to be processed. The basic rectangles to the right of the cutout position are merged in order from the left, as long as the maximum character width is not exceeded. The merged area is cut out as a character area, and its position information is stored as internal data. If a single circumscribed rectangle alone is wider than the maximum character width, that rectangle is cut out as a character area by itself (S12).
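The width estimate (S9) and the greedy merging of basic rectangles (S12) might be sketched as follows. The `Rect` type, the inclusive pixel coordinates, and the function names are assumptions for illustration; the extraction of connected components (S10) and the merging of dots with their stems (S11) are taken as already done.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Rect:
    left: int
    top: int
    right: int   # inclusive
    bottom: int  # inclusive

    @property
    def width(self) -> int:
        return self.right - self.left + 1


def union(a: Rect, b: Rect) -> Rect:
    """Smallest rectangle containing both a and b."""
    return Rect(min(a.left, b.left), min(a.top, b.top),
                max(a.right, b.right), max(a.bottom, b.bottom))


def max_char_width(line_height: int, factor: float = 1.2) -> float:
    """S9: maximum character width = line height x 1.2."""
    return line_height * factor


def group_into_char_regions(basic_rects: List[Rect], line_height: int) -> List[Rect]:
    """S12: from each cutout position, merge basic rectangles left to right
    while the merged width stays within the maximum character width."""
    limit = max_char_width(line_height)
    rects = sorted(basic_rects, key=lambda r: r.left)
    regions: List[Rect] = []
    i = 0
    while i < len(rects):
        region = rects[i]  # a lone rectangle wider than the limit is kept as is
        i += 1
        while i < len(rects) and union(region, rects[i]).width <= limit:
            region = union(region, rects[i])
            i += 1
        regions.append(region)
    return regions
```

Because the merge is greedy, two narrow neighbouring characters can end up in one region. In the patent's flow such a region is expected to be rejected by recognition and handled later by the re-cutout processing.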

[0019] The position information of the character area is sent to the character recognition unit 6, which performs recognition. On the basis of the received position information, the character recognition unit 6 extracts the graphic features of the image in the character area, determines the similarity between the extracted features and the graphic features of every character in the recognition dictionary 5 (S13), and judges whether the result is acceptable as a recognition result (S14). If it is acceptable, the unit checks whether the character position feature and image of the recognized character are already stored in the character information storage unit 8 (S15). If they are not, the recognized character code, character position feature, and image are stored in the character information storage unit 8 (S16). Whatever the outcome of S15, the recognized character code and character position information are sent to and stored in the recognition result storage unit 11 (S17). If the result in S14 is not acceptable, the unit checks whether the immediately preceding character area is a re-cutout area (S18). If it is, that area and the current area are combined into a single area (S19). The position information of the current character area (or of the combined area, if the areas were merged) is then stored in the re-cutout area storage unit 7 (S20).
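The bookkeeping of S13 to S20 can be sketched as below. This is a simplified model under stated assumptions: a character area is reduced to its (left, right) X extent, `recognize` is a hypothetical callable that returns a character code or `None` on rejection, and the three containers merely stand in for storage units 8, 11, and 7 (the real unit 8 also holds the character position feature and the image).

```python
from typing import Callable, Dict, List, Optional, Tuple

Region = Tuple[int, int]  # assumed: (left, right) X extent of a character area


def first_pass(
    regions: List[Region],
    recognize: Callable[[Region], Optional[str]],
) -> Tuple[Dict[str, Region], List[Tuple[str, Region]], List[Region]]:
    char_info: Dict[str, Region] = {}        # stands in for storage unit 8
    results: List[Tuple[str, Region]] = []   # stands in for storage unit 11
    recut: List[Region] = []                 # stands in for storage unit 7
    previous_rejected = False
    for region in regions:
        code = recognize(region)                 # S13, S14
        if code is not None:
            char_info.setdefault(code, region)   # S15, S16: keep the first sample only
            results.append((code, region))       # S17
            previous_rejected = False
        else:
            if previous_rejected:                # S18: previous area was rejected too
                last = recut.pop()
                region = (last[0], region[1])    # S19: combine the two areas
            recut.append(region)                 # S20
            previous_rejected = True
    return char_info, results, recut
```

Combining consecutive rejected areas means that a run of touching or broken characters reaches the re-cutout stage as one area rather than as fragments.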

[0020] The processing of S4 above is performed for all characters in the text area (S5). Next, the character area re-cutout and recognition processing (S6) is described with reference to the flowchart of FIG. 4 and the flowchart of the overlay processing in FIG. 5.

[0021] The position information of a re-cutout character area stored in the re-cutout area storage unit 7 is sent to the character area cutout unit 4 (S21). The character area cutout unit 4 stores the X coordinate of the left side of the received rectangle in internal data as the cutout position (S22). The position information of the re-cutout area and the cutout position are then sent to the overlay processing unit 10.

[0022] The character position features and the character information dictionary 9 used in the overlay processing are described with reference to the explanatory diagram of character position features in FIG. 6 and the explanatory diagram of the character information dictionary contents in FIG. 7.

[0023] The baseline is the virtual reference line on which the lower edges of most characters, such as "a", "b", and "A", sit. The x-height is the height of the lowercase letter "x" (h2 in FIG. 6). A descender is the part of a character below the baseline, and an ascender is the part above the x-height. The height of capital letters (h1 in FIG. 6) is also used as the reference character for height information. For each character type to be recognized, the character information dictionary 9 stores the height measured from the baseline and whether the character has a descender. The height information classifies the height from the baseline as capital-letter height, lowercase-letter height, or undeterminable, coded 1, 2, and 3 respectively. The descender information is coded 1 for no descender, 2 for a descender, and 3 when it depends on the typeface.

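One possible in-memory shape for the character information dictionary is sketched below. The numeric codes follow the paragraph above; the type names and the three sample entries are illustrative assumptions and do not reproduce the contents of FIG. 7.

```python
from enum import IntEnum
from typing import Dict, NamedTuple


class Height(IntEnum):
    CAPITAL = 1       # height from the baseline equals capital-letter height
    LOWERCASE = 2     # height from the baseline equals lowercase-letter height
    UNDETERMINED = 3  # cannot be assigned to either class


class Descender(IntEnum):
    ABSENT = 1
    PRESENT = 2
    TYPEFACE_DEPENDENT = 3


class PositionFeature(NamedTuple):
    height: Height
    descender: Descender


# Hypothetical sample entries, chosen only to show the layout.
CHAR_INFO_DICTIONARY: Dict[str, PositionFeature] = {
    "A": PositionFeature(Height.CAPITAL, Descender.ABSENT),
    "x": PositionFeature(Height.LOWERCASE, Descender.ABSENT),
    "y": PositionFeature(Height.LOWERCASE, Descender.PRESENT),
}
```

Using `IntEnum` keeps the values comparable with the plain integers 1, 2, and 3 used in the text.
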
[0024] The overlay processing unit 10 checks whether the height of the received re-cutout area equals the x-height (S38). If it does, the characters in the re-cutout area are regarded as x-height characters only; from the images stored in the character information storage unit 8, the images of character types whose height code in the character information dictionary is 1 or 3 and whose descender code is 1 or 3 are taken out in descending order of character width and superimposed in turn starting at the cutout position (S39). If the height of the re-cutout area does not equal the x-height, the first position at which a descender or an ascender appears, scanning horizontally from the cutout position, is located (S40). If neither a descender nor an ascender exists (S41), the characters in the area are regarded as x-height characters and the same overlay as in S39 is performed.

If the first feature to appear after the cutout position is a descender (S42), the unit checks whether the horizontal distance from the cutout position to the position where the descender first appears is no more than the width of the lowercase "y" (S43). If it is, a character with a descender may be present, so the images of character types whose descender code in the character information dictionary 9 is 2 or 3 are first taken out of the character information storage unit 8 in descending order of character width and superimposed in turn starting at the cutout position (S44). The unit then checks whether any image matches (S45). If none matches, an x-height character is assumed to lie immediately after the cutout position, and the same overlay as in S39 is performed (S49).

If the first feature to appear after the cutout position is an ascender, the unit checks whether the horizontal distance from the cutout position to the position where the ascender first appears is no more than the width of the capital "W" (S46). If it is, a character with an ascender may be present, so the images of character types whose height code in the character information dictionary 9 is 1 or 3 are first taken out of the character information storage unit 8 in descending order of character width and superimposed in turn starting at the cutout position (S47). The unit then checks whether any image matches (S48). If none matches, an x-height character is assumed to lie immediately after the cutout position, and the same overlay as in S39 is performed (S49).
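The candidate narrowing used in S44 and S47 can be sketched as a filter over the stored characters followed by a widest-first sort. The `StoredChar` type, the tuple layout of the dictionary, and the sample data are assumptions for illustration; the image comparison itself is not modelled because the text does not specify the matching criterion.

```python
from typing import Dict, Iterable, List, NamedTuple, Optional, Set, Tuple


class StoredChar(NamedTuple):
    code: str   # recognized character code
    width: int  # width of the stored character image, in pixels


def select_candidates(
    stored: Iterable[StoredChar],
    dictionary: Dict[str, Tuple[int, int]],  # code -> (height code, descender code)
    heights: Optional[Set[int]] = None,
    descenders: Optional[Set[int]] = None,
) -> List[StoredChar]:
    """Keep stored characters whose dictionary codes fall in the allowed sets
    (None means no restriction) and return them widest first."""
    selected = []
    for item in stored:
        height, descender = dictionary[item.code]
        if heights is not None and height not in heights:
            continue
        if descenders is not None and descender not in descenders:
            continue
        selected.append(item)
    return sorted(selected, key=lambda c: c.width, reverse=True)


# Hypothetical sample data.
STORED = [StoredChar("y", 10), StoredChar("g", 12), StoredChar("x", 9), StoredChar("W", 20)]
DICTIONARY = {"y": (2, 2), "g": (2, 2), "x": (2, 1), "W": (1, 1)}

descender_first = select_candidates(STORED, DICTIONARY, descenders={2, 3})  # as in S44
ascender_first = select_candidates(STORED, DICTIONARY, heights={1, 3})      # as in S47
```

Trying the widest candidates first follows the order given in the text; only characters already recognized in the same document are tried, which keeps the number of overlays small.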

[0025] After the overlay processing, the device checks whether a matching image exists (S24). If one exists, the recognition result and position information are sent to and stored in the recognition result storage unit 11 (S36). If none exists, the number of basic rectangles in the re-cutout area is counted (S25).

If there are two or more basic rectangles, the rightmost basic rectangle in the area is removed and the remaining basic rectangles are merged. The merged area becomes the new character area, and its position information is stored in internal data (S31). The position information of the new character area is sent to the character recognition unit 6 again, the same recognition processing as in S13 is performed (S32), and the result is judged. If it is not acceptable, the processing from S25 onward is repeated (S33). If it is acceptable, the device checks whether the character position feature and image of the recognized character are already stored (S34). If they are not stored in the character information storage unit 8, the recognized character code, character position feature, and image are stored there (S35). Whatever the outcome of S34, the recognized character code and character position information are sent to and stored in the recognition result storage unit 11 (S36).

If there is only one basic rectangle, the device checks whether re-cutout has already been performed at least once on this re-cutout area (S26). If it has, the same recognition processing as in S13 is performed (S27) and the result is judged (S28). If the result is acceptable, the character information storage unit 8 is updated as described above (S34, S35), and the recognized character code and character position information are sent to and stored in the recognition result storage unit 11 (S36).

If no re-cutout has yet been performed in this area (S26), or if the result in S28 is not acceptable, a touching character is assumed to exist in the area. The area is cut at a concavity in the contour of the black pixels to form character areas, and the position information is stored in internal data (S29). The position information is sent to the character recognition unit 6, which performs the same recognition processing as in S13 (S30). If the result is acceptable, the recognized character code and character position information are sent to and stored in the recognition result storage unit 11; if not, a reject code and the position information are sent and stored instead (S36).
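The loop formed by S25 and S31 to S33 can be sketched as follows: the rightmost basic rectangle is dropped and the remainder re-recognized until a result is accepted or one rectangle is left. Rectangles are reduced to (left, right) spans, and `recognize` is a hypothetical callable that returns a character code or `None` on rejection; both are assumptions for illustration.

```python
from typing import Callable, List, Optional, Tuple

Span = Tuple[int, int]  # assumed: (left, right) X extent, inclusive


def shrink_until_recognized(
    basic_rects: List[Span],
    recognize: Callable[[Span], Optional[str]],
) -> Tuple[Optional[str], int]:
    """Return (code, number of basic rectangles used). The code is None when
    every attempt was rejected, in which case the caller would go on to the
    single-rectangle branch (S26 onward)."""
    rects = sorted(basic_rects)
    count = len(rects)
    while count >= 2:                                   # S25: two or more left
        count -= 1                                      # S31: drop the rightmost
        merged = (rects[0][0], max(r[1] for r in rects[:count]))
        code = recognize(merged)                        # S32
        if code is not None:
            return code, count
        # rejected: back to S25 (S33)
    return None, count
```

The returned count tells the caller where the next cutout position lies, namely at the left edge of the first basic rectangle that was not used.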

[0026] If the basic rectangle just processed is not the rightmost basic rectangle in the re-cutout area, the cutout position is set to the X coordinate of the left side of the basic rectangle located to the right of the character area that was cut out (S22), this position information is sent to the character area cutout unit 4, and the processing from S23 onward is repeated. If it is the rightmost basic rectangle in the re-cutout area, the re-cutout and recognition processing for this re-cutout area ends (S34).

[0027] The character area re-cutout and recognition processing (S6) is performed for all re-cutout areas (S7).

[0028] The recognized character codes and position information are sent from the recognition result storage unit 11 to the recognition result output unit 12, which outputs the recognized character codes on the basis of the received position information (S8).

[0029] Through the processing of S1 to S8 above, character recognition is performed on the given document image.

[0030]

[Effects of the Invention] As described above, when cutting out character areas from a general English document, the present invention estimates the maximum character width before cutting out each character area, so that broken characters can be cut out without being split. Furthermore, when a recognition result is rejected, the character position feature is used to limit the candidate characters for re-segmentation of that character area, and the area is overlaid with images of characters in the same document that were recognized with high confidence. Broken and touching characters can thus be cut out and recognized at high speed, realizing an English character recognition device with excellent workability for character input.

[Brief description of drawings]

FIG. 1 is an overall configuration diagram of an English character recognition device according to an embodiment of the present invention.

FIG. 2 is an overall flowchart of the English character recognition device of the embodiment.

FIG. 3 is a flowchart of the character area cutout and recognition process of the English character recognition device of the embodiment.

FIG. 4 is a flowchart of the character area re-cutout and recognition process of the English character recognition device of the embodiment.

FIG. 5 is a flowchart of the superposition process of the English character recognition device of the embodiment.

FIG. 6 is an explanatory diagram of the character position features of the English character recognition device of the embodiment.

FIG. 7 is an explanatory diagram of the contents of the character information dictionary of the English character recognition device of the embodiment.

[Explanation of symbols]

1 image input unit
2 text area cutout unit
3 line area cutout unit
4 character area cutout unit
5 recognition dictionary
6 character recognition unit
7 re-cutout area storage unit
8 character information storage unit
9 character information dictionary
10 superposition processing unit
11 recognition result storage unit
12 recognition result output unit

Claims

[Claims]

1. An English character recognition device comprising: an image input unit for inputting a document to be recognized; a text area cutout unit for cutting out a text area from the input document image; a line area cutout unit for cutting out a line area from the text area; a character area cutout unit for cutting out a character area from the line area; a recognition dictionary that stores in advance the graphic features of all recognition target characters; a character recognition unit that recognizes the image of the cut-out character area based on the recognition dictionary; a re-cutout area storage unit that stores the position of any area for which, as a result of recognition by the character recognition unit, the character area needs to be cut out and recognized again; a character information storage unit that stores, for a character whose recognition result the character recognition unit has judged to be acceptable, the recognized character code, the height of the character relative to a reference character and the position of the character relative to a reference line (hereinafter, character position features), and the character image; a character information dictionary that stores in advance the character position features of all recognition target characters; a superposition processing unit that, for an area stored in the re-cutout area storage unit, limits the re-cutout target characters using the character information dictionary, superimposes the images stored in the character information storage unit, and performs cutting out and recognition; a recognition result storage unit that stores the results and positions of character recognition; and a recognition result output unit that outputs the recognition results stored in the recognition result storage unit.