JP2000181989A - Character recognition device, method therefor and computer readable recording medium recording the method as program

Info

Publication number: JP2000181989A
Application number: JP10360407A
Authority: JP
Inventors: Minoru Takakura; 穂高倉; Mariko Takenouchi; 磨理子竹之内; Ichiro Nakao; 一郎中尾; Yoshikatsu Ito; 好克井藤
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1998-12-18
Filing date: 1998-12-18
Publication date: 2000-06-30

Abstract

PROBLEM TO BE SOLVED: To provide a character recognition device that finds a horizontally written character string embedded in a vertically written character row of a character image and recognizes its characters correctly. SOLUTION: A binary picture input part 101 inputs the document image to be recognized as a binary picture, and a character constituting element detection part 102 detects character constituting elements from the binary picture. A character row extraction part 103 connects the character constituting elements and extracts a character row. A character row direction judgement part 104 judges the direction of the row, a character extraction part 105 extracts a character area from the character row, and a character recognition part 107 collates the area with a recognition dictionary 106 and recognizes the character. A horizontally written character string judgement part 108 judges, based on the recognized character, whether the character image may be a horizontally written character string. When it judges that this is possible, a horizontally written character string division recognition part 110 divides the character image in the horizontal direction and recognizes the new character images by collating them with the recognition dictionary 106. A recognized result output part 111 selects whichever recognition result has the higher similarity.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

[Technical Field of the Invention] The present invention relates to a character recognition device, and a method therefor, that recognizes characters in document images such as mail items and forms and converts them into character codes.

【０００２】[0002]

[Prior Art] Optical character recognition (OCR) devices are highly effective at recognizing destination addresses on mail. In general, an OCR device first extracts character lines from an optically read character image, separates each extracted line along the line direction to extract individual characters, and then recognizes each extracted character. As a result, when a horizontally written digit string such as "40" appears within a vertically written character line, the string is extracted as a single character and cannot be recognized correctly.

[0003] To solve this problem, a first device, the character line recognition device described in Japanese Unexamined Patent Publication No. H5-81463, handles the case where the vertically written character line is an address. It prepares in advance a table of the recognition results obtained when a horizontally written digit string in a vertical line is recognized as a single kanji character. When the recognition result for a character in a vertical line matches an entry in the table, the device converts that result into the corresponding digit string, and in this way recognizes horizontally written digit strings within vertical lines.

[0004] A second device is the character recognition device described in Japanese Unexamined Patent Publication No. H8-16719. This device treats wide characters in a vertically written line, and characters whose recognition results have poor evaluation values, as possible horizontally written character strings. It segments them horizontally on the basis of a projection and runs character recognition again, thereby recognizing horizontally written digit strings within vertical lines.

【０００５】[0005]

[Problems to Be Solved by the Invention] The first device, however, is used only for recognizing destination addresses on mail, and it must consult an address dictionary to decide whether a character listed in the table is in fact one that should be converted into a digit string. In the second device, when the characters of a horizontally written string within a vertical line touch each other, the projection-based segmentation cannot separate them correctly, so the characters cannot be recognized correctly.

[0006] Accordingly, an object of the present invention is to provide a character recognition device, and a method therefor, that can accurately recognize a horizontally written character string contained in a vertically written character line, even when the horizontally written characters touch each other.

【０００７】[0007]

[Means for Solving the Problems] To solve the above problems, the present invention comprises: character component detection means for detecting character components from an input binary character image; character line extraction means for connecting the detected character components and extracting a character line; vertical/horizontal determination means for determining, from the longitudinal direction of the character line, whether the extracted character line is written vertically or horizontally; character image area detection means for detecting, within the character line, a first character image area that constitutes a character image as one character; first character recognition means for recognizing the detected first character image area as one character; horizontal character string determination means for determining, when the vertical/horizontal determination means has determined that the character line is written vertically, that the character image in the first character image area is a horizontally written character string, based on the recognition result of the first character recognition means; second character recognition means for dividing the first character image area in the horizontal direction into a plurality of second character image areas when the horizontal character string determination means has determined that the image is a horizontally written character string, and for recognizing a character from the character image of each second character image area; and recognition result selection means for selecting, as the recognition result, either the character recognized by the first character recognition means or the character string recognized by the second character recognition means.

【０００８】[0008]

[Embodiments of the Invention] An embodiment of the character recognition device according to the present invention is described below with reference to the drawings. (Embodiment) FIG. 1 is a configuration diagram of one embodiment of the character recognition device according to the present invention. The device comprises a binary image input unit 101, a character component detection unit 102, a character line extraction unit 103, a character line direction determination unit 104, a character extraction unit 105, a recognition dictionary 106, a character recognition unit 107, a horizontal character string determination unit 108, a loose character component integration unit 109, a horizontal character string division recognition unit 110, and a recognition result output unit 111.

[0009] The binary image input unit 101 includes a scanner or the like and inputs the document image to be recognized as a binary image. FIG. 2 shows an input binary image. In the binary image 201, character components (including characters and character elements) are represented by black pixels and the background by white pixels. The binary image 201 is assumed to be input in the orientation in which the characters read normally when FIG. 2 is viewed from the front.

[0010] The character component detection unit 102 has a storage area. It detects character components, each a connected block of black pixels, from the binary image 201 input by the binary image input unit 101, and stores them. FIG. 3 shows character components 301, 302, 303, ... detected from the binary image 201. At this stage the character component detection unit 102 also merges character components whose bounding rectangles overlap to a large extent; the character "府" 305 is an example of such a merged component. The character component detection unit 102 notifies the character line extraction unit 103 and the character extraction unit 105 of the coordinate values of the bounding rectangles of the character components.
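The detection step in paragraph [0010] can be illustrated with a minimal sketch. The function names, the 8-connectivity, the overlap measure (intersection area divided by the area of the smaller rectangle), and the 0.5 threshold are all assumptions made for illustration; the text only says that components with heavily overlapping bounding rectangles are merged.

```python
from collections import deque

def find_components(image):
    """Return inclusive bounding boxes (left, top, right, bottom) of
    8-connected blobs of black pixels. `image` is a list of rows of 0/1."""
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1 or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(x, y)])
            left, top, right, bottom = x, y, x, y
            while queue:  # breadth-first flood fill of one blob
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            boxes.append((left, top, right, bottom))
    return boxes

def overlap_ratio(a, b):
    """Intersection area divided by the area of the smaller box (assumed measure)."""
    w = min(a[2], b[2]) - max(a[0], b[0]) + 1
    h = min(a[3], b[3]) - max(a[1], b[1]) + 1
    if w <= 0 or h <= 0:
        return 0.0
    area = lambda r: (r[2] - r[0] + 1) * (r[3] - r[1] + 1)
    return (w * h) / min(area(a), area(b))

def merge_overlapping(boxes, min_ratio=0.5):
    """Repeatedly merge boxes whose overlap ratio reaches min_ratio (assumed threshold)."""
    boxes = list(boxes)
    merged = True
    while merged:
        merged = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                if overlap_ratio(boxes[i], boxes[j]) >= min_ratio:
                    a, b = boxes[i], boxes[j]
                    boxes[i] = (min(a[0], b[0]), min(a[1], b[1]),
                                max(a[2], b[2]), max(a[3], b[3]))
                    del boxes[j]
                    merged = True
                    break
            if merged:
                break
    return boxes
```

A stroke enclosed by a frame, as in "府", has a bounding rectangle lying entirely inside the frame's rectangle, so under this measure the two are merged into one component.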

[0011] In FIG. 3 the bounding rectangles are drawn without touching the character components, but this is only a drawing limitation; in practice each rectangle touches its character component. The same applies to the subsequent drawings. On receiving the coordinate values of the bounding rectangles from the character component detection unit 102, the character line extraction unit 103 extracts character lines by merging, step by step, neighboring character components 301, 302, ... whose bounding rectangles are separated by no more than a predetermined distance. As shown in FIG. 4, character lines 401 and 402 are extracted from the binary image 201. The coordinate values of the bounding rectangles of the extracted character lines 401 and 402 are notified to the character line direction determination unit 104 and the character extraction unit 105.
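The stepwise merging described in paragraph [0011] might look like the following sketch. The gap measure (the larger of the horizontal and vertical clearances between two rectangles) and the name `max_gap` for the "predetermined value" are assumptions; the text does not define how the distance between bounding rectangles is measured.

```python
def box_gap(a, b):
    """Clearance between two boxes (left, top, right, bottom); 0 if they overlap."""
    dx = max(a[0] - b[2], b[0] - a[2], 0)
    dy = max(a[1] - b[3], b[1] - a[3], 0)
    return max(dx, dy)

def union(a, b):
    """Smallest box containing both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))

def extract_lines(boxes, max_gap):
    """Merge boxes step by step while any two groups are within max_gap.

    Returns a list of (line_box, member_indices)."""
    groups = [(box, [i]) for i, box in enumerate(boxes)]
    merged = True
    while merged:
        merged = False
        for i in range(len(groups)):
            for j in range(i + 1, len(groups)):
                if box_gap(groups[i][0], groups[j][0]) <= max_gap:
                    groups[i] = (union(groups[i][0], groups[j][0]),
                                 groups[i][1] + groups[j][1])
                    del groups[j]
                    merged = True
                    break
            if merged:
                break
    return groups
```

Keeping the member indices alongside each line makes it possible to tell later which stored components were left out of every line.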

[0012] The character line direction determination unit 104 takes the longitudinal direction of each of the character lines 401 and 402, derived from the notified coordinate values, as the direction of that line. The character lines 401 and 402 shown in FIG. 4 are long in the vertical direction and are therefore determined to be vertically written. The character line direction determination unit 104 notifies the character extraction unit 105 of the line direction. On receiving the line direction (vertical writing) from the character line direction determination unit 104, the character extraction unit 105 uses the coordinate values of the character component bounding rectangles notified by the character component detection unit 102 and the coordinate values of the character line bounding rectangles notified by the character line extraction unit 103 to determine, within each of the character lines 401 and 402, the character areas that each constitute one character, and notifies their coordinate values one after another to the character recognition unit 107.
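The direction test amounts to comparing the two sides of the line's bounding rectangle. A minimal sketch, in which the function name and the handling of a square rectangle (treated as horizontal) are assumptions not stated in the text:

```python
def line_direction(line_box):
    """Classify a line box (left, top, right, bottom) by its longer side."""
    left, top, right, bottom = line_box
    width = right - left + 1
    height = bottom - top + 1
    # A square box is treated as horizontal here; the text does not cover this case.
    return "vertical" if height > width else "horizontal"
```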

[0013] The character area constituting one character is determined according to the following rules.
(1) Based on the arrangement of the character components within the character line, a reference character height and a reference character spacing are determined as shown in FIG. 5(a).
(a) The reference character height H is the maximum height among the character components whose height does not exceed the character line width W.

[0014] (b) The reference character spacing D is the minimum among those spacings between vertically adjacent character components that exceed the reference character height H.
(2) Based on the reference character height H and the reference character spacing D, neighboring character components are merged to determine the area of each character.
(a) Character components adjacent to each other on the left and right are merged.

[0015] (b) Character components adjacent to each other above and below are merged within a range that does not exceed the reference character spacing D.
FIG. 5(b) shows the character images that the character extraction unit 105 extracts from the vertically written character line 401 under these rules. The postal code mark "〒" 501, the digit string "588" 502, the vertical hyphen "｜" 503, the digit string "511" 505 (part of the digit string "5111" 504), ..., the digit string "40" 506, ..., and the digit string "14" 507 are each extracted as a single-character image.
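The rules in paragraphs [0013] to [0015] leave some room for interpretation, so the following is only one possible reading, not the patent's definitive procedure. It assumes that the "spacing" between vertically adjacent components is the pitch from the top of one component to the top of the next, and that rule (2)(b) allows a vertical merge only while the merged height stays within D. Both assumptions, and all names, are illustrative.

```python
def reference_metrics(boxes, line_width):
    """Return (H, D) for component boxes (left, top, right, bottom) of a vertical line.

    H: largest component height not exceeding the line width W.
    D: smallest top-to-top pitch that exceeds H (assumed reading), or None."""
    heights = [b[3] - b[1] + 1 for b in boxes]
    candidates = [h for h in heights if h <= line_width]
    H = max(candidates) if candidates else line_width
    ordered = sorted(boxes, key=lambda b: b[1])
    pitches = [nxt[1] - prev[1] for prev, nxt in zip(ordered, ordered[1:])]
    larger = [p for p in pitches if p > H]
    D = min(larger) if larger else None
    return H, D

def group_characters(boxes, D):
    """Merge components into single-character areas, scanning top to bottom."""
    chars = []
    for box in sorted(boxes, key=lambda b: (b[1], b[0])):
        if chars:
            cur = chars[-1]
            merged = (min(cur[0], box[0]), min(cur[1], box[1]),
                      max(cur[2], box[2]), max(cur[3], box[3]))
            # Rule (2)(a): vertical extents overlap, so the boxes sit side by side.
            side_by_side = box[1] <= cur[3] and box[3] >= cur[1]
            # Rule (2)(b), assumed reading: merged height must stay within D.
            fits = D is not None and (merged[3] - merged[1] + 1) <= D
            if side_by_side or fits:
                chars[-1] = merged
                continue
        chars.append(box)
    return chars
```

Under this reading, the two digits of a horizontally written "40" are joined by rule (2)(a) into a single character area, which is exactly why the later re-division step is needed.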

[0016] For a horizontally written character line, character areas are determined in the same way with the vertical and horizontal directions interchanged. The recognition dictionary 106 consists of an ordinary recognition dictionary part and a two-digit cluster dictionary part. The ordinary part registers pairs of a standard feature, that is, the feature of a character's standard character image, and the character code of that character. The two-digit cluster dictionary part registers pairs of the standard feature of a concatenated two-digit image and the character code of that concatenated two-digit number. The concatenated two-digit images are two-digit numbers printed in several typefaces (four in this example), as shown in FIG. 6. Their standard features are obtained by cluster analysis of the features extracted from images of every two-digit combination, taking the mean feature of each cluster.

[0017] On receiving the coordinate values of a character area to be recognized from the character extraction unit 105, the character recognition unit 107 extracts a feature from the character components of that area stored in the character component detection unit 102, computes the similarity between the extracted feature and each standard feature in the recognition dictionary 106, and takes as the recognition result the character code stored as a pair with the standard feature of highest similarity.
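The matching step in paragraph [0017] is a best-match search over the dictionary. The sketch below assumes cosine similarity and a dictionary represented as a list of `(code, standard_feature)` pairs; the patent specifies neither the feature nor the similarity measure, so both are placeholders.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (assumed similarity measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def recognize(feature, dictionary):
    """Return (code, similarity) of the dictionary entry most similar to `feature`.

    `dictionary` is a list of (code, standard_feature) pairs; entries from the
    ordinary part and the two-digit cluster part can share one list."""
    scored = ((code, cosine_similarity(feature, standard))
              for code, standard in dictionary)
    return max(scored, key=lambda pair: pair[1])
```

Returning the similarity together with the code matters because the recognition result output unit 111 later compares similarities to choose between results.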

[0018] FIG. 7 shows the recognition results for the character images in the vertically written character line 401. The character recognition unit 107 notifies these recognition results to the horizontal character string determination unit 108 and the recognition result output unit 111. The recognition result output unit 111 is notified of the similarity as well as the character code. Based on the character codes notified by the character recognition unit 107, the horizontal character string determination unit 108 determines whether a character image in the vertically written character line 401 may be a horizontally written character string.

[0019] The following rules are applied as the criteria for judging that a character image may be a horizontally written character string.
Rule (a): a concatenated two-digit number recognized through the two-digit cluster dictionary part.
Rule (b): a character recognized as something other than a digit that follows a character recognized as one of the various postal code marks or as a vertical or horizontal hyphen.
Rule (c): a character recognized as a letter of the alphabet.
In addition to these rules, the horizontal character string determination unit 108 may store a table (not shown) of candidate characters that are easily confused, such as "00" and "∞", and judge a character image recognized as a character listed in the table to be a horizontally written character string.
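Rules (a) to (c) can be expressed as a small classifier over the sequence of recognition results. In this sketch each result is a `(code, from_cluster_dictionary)` pair; the symbol sets, the restriction of rule (c) to ASCII letters, and the order in which the rules are tried are assumptions chosen so that the worked example of FIG. 7 comes out as described in the text.

```python
POSTAL_MARKS = {"〒"}          # assumed set of postal code marks
HYPHENS = {"｜", "-"}          # assumed vertical and horizontal hyphen codes

def candidate_rule(results, index):
    """Return "a", "b", "c", or None for the character at `index`.

    `results` is a list of (code, from_cluster_dictionary) pairs in line order."""
    code, from_cluster_dictionary = results[index]
    if from_cluster_dictionary:
        return "a"  # recognized through the two-digit cluster dictionary part
    if index > 0:
        previous = results[index - 1][0]
        if (previous in POSTAL_MARKS or previous in HYPHENS) and not code.isdigit():
            return "b"  # a non-digit right after a postal mark or hyphen
    if len(code) == 1 and code.isascii() and code.isalpha():
        return "c"  # recognized as a letter of the alphabet
    return None
```

For example, with the results `〒, m, ｜, 馴, 大, 40, M`, the sketch flags "m" and "馴" under rule (b), "40" under rule (a), and "M" under rule (c).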

[0020] In FIG. 7, a "○" mark next to a recognition result indicates that the horizontal character string determination unit 108 judged the image to be a possible horizontally written character string. The character image of the digit string "588" 502 was recognized as the letter "m", which is a non-digit following a character recognized as a postal code mark, so rule (b) applies. For the character image of the digit string "511" 505, the character following the vertical hyphen was recognized as the kanji "馴", so rule (b) applies. The character image of the digit string "40" 506 was recognized as the concatenated two-digit number "40", so rule (a) applies. The character image of the digit string "14" 507 was recognized as the letter "M", so rule (c) applies.

[0021] When the horizontal character string determination unit 108 judges that a character image may be a horizontally written character string, it notifies the loose character component integration unit 109 and the horizontal character string division recognition unit 110 of the coordinate values of the character image area and of the rule on which the judgment was based. When the document image input from the binary image input unit 101 is a horizontally written character string as shown in FIG. 8, the character string 801 is determined to be horizontally written by the character line direction determination unit 104 and is recognized by the character recognition unit 107 as shown in FIG. 9. In this case the horizontal character string determination unit 108 applies rules (a) and (b) to judge whether a character image may need to be re-divided.

[0022] In FIG. 9, the character images judged to be possibly re-divisible are marked with "○" alongside their recognition results. The character images 802, 803, and 804 in the character string 801 fall under rules (b), (a), and (b) respectively, are judged to be possibly re-divisible, and the coordinate values of their areas are notified to the horizontal character string division recognition unit 110. On being notified by the horizontal character string determination unit 108 that an image may be a horizontally written character string, the loose character component integration unit 109 searches the storage area of the character component detection unit 102 for loose character components, that is, components that the character line extraction unit 103 left out of the character line 401 and that lie within a predetermined distance of the character image area. If such a component exists, the unit includes it in the character image area and notifies the horizontal character string division recognition unit 110 of the coordinate values of the enlarged area.
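A sketch of the search performed by the loose character component integration unit 109 follows. It assumes that the stored components are available as a list of bounding boxes, that the indices of components already assigned to the line are known, and that distance is measured as rectangle clearance; `max_distance` stands in for the unspecified "predetermined value".

```python
def box_gap(a, b):
    """Clearance between two boxes (left, top, right, bottom); 0 if they overlap."""
    dx = max(a[0] - b[2], b[0] - a[2], 0)
    dy = max(a[1] - b[3], b[1] - a[3], 0)
    return max(dx, dy)

def integrate_loose_components(char_box, all_boxes, line_members, max_distance):
    """Enlarge char_box with components outside the line that lie within max_distance."""
    result = char_box
    for index, box in enumerate(all_boxes):
        if index in line_members:
            continue  # already part of the extracted character line
        if box_gap(char_box, box) <= max_distance:
            result = (min(result[0], box[0]), min(result[1], box[1]),
                      max(result[2], box[2]), max(result[3], box[3]))
    return result
```

This corresponds to the situation of FIG. 10, where a trailing digit dropped from the line is pulled back into the image of "511".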

[0023] FIG. 10 illustrates how the loose character component integration unit 109 integrates the loose character component 1001 into the character image 505. When the character line extraction unit 103 extracts the character line 401, it can easily happen that the character component 1001, which belongs in the character image 505, is left out. The loose character component integration unit 109 exists to remedy this.

[0024] On being notified by the horizontal character string determination unit 108 that an image may be a horizontally written character string, the horizontal character string division recognition unit 110 divides the notified character image area in the horizontal direction according to the rule on which the judgment was based. It extracts a feature from each new character image, computes the similarity to the standard features in the recognition dictionary 106, and notifies the recognition result output unit 111 of the character code of the most similar character, together with the similarity value.

[0025] Take, for example, the character image of the digit string "588" 502, which the character recognition unit 107 recognized as the letter "m". The horizontal character string division recognition unit 110 has been notified by the horizontal character string determination unit that rule (b) was applied. Because the character immediately preceding the image is a postal code mark, the unit divides the character image of the digit string "588" 502 horizontally into three parts by projection. It first generates an image 1101 (FIG. 11(b)) in which the white pixels lying between the upper and lower black pixels of the digit string "588" 502 shown in FIG. 11(a) are filled with black pixels, and then performs a projection in the vertical direction to obtain a projection image 1102 (FIG. 11(c)). It detects the valleys 1103 and 1104 of the projection and divides the image at those valleys to obtain new character images 1105, 1106, and 1107 (FIG. 11(d)). In this way the character images can be separated even when the characters 1105, 1106, and 1107 form a touching string. For each of the resulting character images 1105, 1106, and 1107, the unit extracts a feature, computes the similarity to the standard features in the recognition dictionary 106, and notifies the recognition result output unit 111 of the character code of the most similar character.
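The fill, project, and split sequence of paragraph [0025] can be sketched as follows. Filling each column between its topmost and bottommost black pixel follows the text; the way valleys are located (the lowest projection value inside a window around each equal-division position) is an assumption, since the text only says that valleys are detected.

```python
def fill_columns(image):
    """Fill white pixels lying between the topmost and bottommost black pixel of each column."""
    height, width = len(image), len(image[0])
    filled = [row[:] for row in image]
    for x in range(width):
        ys = [y for y in range(height) if image[y][x] == 1]
        if ys:
            for y in range(ys[0], ys[-1] + 1):
                filled[y][x] = 1
    return filled

def vertical_projection(image):
    """Number of black pixels in each column."""
    return [sum(column) for column in zip(*image)]

def split_at_valleys(image, parts):
    """Split `image` into `parts` pieces at projection valleys.

    Returns (pieces, cut_columns); each cut column starts the piece to its right."""
    profile = vertical_projection(fill_columns(image))
    width = len(profile)
    window = max(1, width // (4 * parts))  # assumed search window around each cut
    cuts = []
    for k in range(1, parts):
        centre = k * width // parts
        lo, hi = max(1, centre - window), min(width - 1, centre + window)
        cuts.append(min(range(lo, hi + 1), key=lambda x: profile[x]))
    edges = [0] + cuts + [width]
    pieces = [[row[a:b] for row in image] for a, b in zip(edges, edges[1:])]
    return pieces, cuts
```

Filling the columns first removes the dips that holes inside a glyph such as "8" would otherwise create, so the remaining valleys tend to mark the thin joins between touching characters.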

[0026] In the case of the concatenated two-digit number "40" 506, the horizontal character string division recognition unit 110 has been notified of rule (a) by the horizontal character string determination unit 108, so it divides the character image of "40" 506 into equal parts in the horizontal direction to obtain new character images 1201 and 1202 (FIG. 12). It extracts a feature from each of the character images 1201 and 1202, collates it with the recognition dictionary 106, and notifies the recognition result output unit 111 of the recognition results.
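Equal division needs no projection at all. A minimal sketch, assuming the image is a list of pixel rows and that an odd width gives the extra column to the right-hand piece (the text does not say how odd widths are handled):

```python
def split_equally(image, parts=2):
    """Cut `image` (a list of rows) into `parts` pieces of near-equal width."""
    width = len(image[0])
    edges = [k * width // parts for k in range(parts + 1)]
    return [[row[a:b] for row in image] for a, b in zip(edges, edges[1:])]
```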

[0027] In the character image examples of FIGS. 11 and 12, the number of horizontal divisions is fixed in advance by the rule or other basis on which the image was judged to be a horizontally written character string. The following methods can also be used to divide a horizontally written character string.
(a) When the character image consists of several character components, the width of the narrowest character component plus the gap to its adjacent character component is taken as the reference width, the number of characters is estimated from it, a projection is performed, and the image is divided into the estimated number of character images.
(b) When the character image consists of a single character component, the number of characters is estimated using half the height of the character component as the reference, and the image is divided into new character images in the same way as in (a).
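The two estimation methods in paragraph [0027] can be sketched as a single function. The text gives the reference width but not the arithmetic, so dividing the total width by the reference and rounding to the nearest integer is an assumption, as is using the right-hand neighbor (or the left-hand one for the last component) to measure the gap.

```python
def estimate_char_count(boxes):
    """Estimate how many characters a horizontally written image contains.

    `boxes` holds the component boxes (left, top, right, bottom) of the image."""
    ordered = sorted(boxes, key=lambda b: b[0])
    total_width = max(b[2] for b in ordered) - ordered[0][0] + 1
    if len(ordered) == 1:
        # Method (b): half the component height is the reference width.
        left, top, right, bottom = ordered[0]
        reference = (bottom - top + 1) / 2
    else:
        # Method (a): narrowest component width plus the gap to its neighbor.
        i = min(range(len(ordered)), key=lambda k: ordered[k][2] - ordered[k][0])
        width = ordered[i][2] - ordered[i][0] + 1
        if i + 1 < len(ordered):
            gap = ordered[i + 1][0] - ordered[i][2] - 1
        else:
            gap = ordered[i][0] - ordered[i - 1][2] - 1
        reference = width + max(gap, 0)
    return max(1, round(total_width / reference))
```

Method (b) relies on digits being roughly half as wide as they are tall, which is why half the height serves as the reference.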

[0028] The horizontal character string division recognition unit 110 applies the above methods to obtain new character images 1301, 1302, 1303, and 1304 in either of two cases: when rule (b) has been notified by the horizontal character string determination unit 108 and the preceding character is a vertical hyphen; or, as shown in FIG. 13, when rule (b) has been notified by the horizontal character string determination unit 108, the preceding character is a vertical hyphen, and in addition the area of the integrated character image 505 has been notified by the loose character component integration unit 109. It recognizes each of the resulting character images 1301 to 1304 and notifies the recognition result output unit 111.

[0029] FIG. 14 shows the recognition results obtained by the horizontal character string division recognition unit 110 for the characters in the character line 401, together with their distances (similarities). For reference, the recognition results of the character recognition unit 107 are shown as well. For a horizontally written character string 801 such as that in FIG. 8, the horizontal character string division recognition unit 110 likewise takes each character image that the horizontal character string determination unit 108 judged to be possibly re-divisible, determines or estimates the number of characters according to predetermined rules, divides touching characters and the like, and recognizes characters from the new character images.

[0030] For each character image judged by the horizontal character string determination unit 108 to be a horizontally written character string, the recognition result output unit 111 compares the similarity behind the recognition result of the character recognition unit 107 with the similarity behind the recognition result of the horizontal character string division recognition unit 110, selects the result with the higher similarity as the recognized characters for that image, and displays it on a display screen.

[0031] Because the horizontal character string division recognition unit 110 recognizes the image as a horizontally written string of several characters, a similarity is computed for each character. The character recognition unit 107 normally recognizes the image as one character, so it yields a single similarity. When the similarities are compared, therefore, the similarity from the character recognition unit 107 may be compared with the lowest of the similarities from the horizontal character string division recognition unit 110 and the higher of the two selected, or the comparison may use the average of the similarities from the horizontal character string division recognition unit 110.
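The selection performed by the recognition result output unit 111 can be sketched as below, with both comparison options from paragraph [0031] exposed through a `mode` parameter. The names and the tie-breaking choice (keeping the single-character result on a tie) are assumptions. If scores are distances, as in FIG. 14, the comparison would be reversed, because a shorter distance means a higher similarity.

```python
def select_result(single, split, mode="min"):
    """Choose between a one-character result and a divided-string result.

    single: (code, similarity) from the character recognition unit.
    split:  list of (code, similarity) from the division recognition unit.
    mode:   "min" compares against the lowest split similarity,
            "mean" against the average of the split similarities."""
    similarities = [s for _, s in split]
    if mode == "min":
        combined = min(similarities)
    else:
        combined = sum(similarities) / len(similarities)
    if combined > single[1]:
        return "".join(code for code, _ in split)
    return single[0]  # ties keep the single-character result (assumption)
```

The "min" option is the stricter of the two: one poorly matched piece is enough to reject the divided reading, whereas "mean" lets strong pieces compensate for a weak one.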

[0032] In FIG. 14 the similarity is shown as a distance, so a high similarity means a short distance. Next, the operation of the present embodiment is described with reference to the flowcharts of FIGS. 15, 16, and 17. First, the binary image input unit 101 inputs the document image to be recognized as a binary image (S1502).

[0033] The character component detection unit 102 detects character components (black pixels) from the binary image (S1504), and the character line extraction unit 103 connects the detected black pixels and extracts character lines (the coordinate values of the circumscribed rectangle of each character line) (S1506). The character line direction determination unit 104 checks, from the coordinate values of the extracted character line, whether its longitudinal direction is vertical or horizontal, and thereby determines whether the character line is written vertically (S1508). If it is not, the process moves to S1702.
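
The direction test at S1508 relies only on the circumscribed rectangle of the character line. A minimal sketch, assuming rectangles are given as `(left, top, right, bottom)` pixel coordinates and that a square rectangle is treated as horizontal (the source does not say how ties are handled; the function name is hypothetical):

```python
def line_direction(rect):
    """Classify a character line by the longer side of its bounding rectangle.

    rect: (left, top, right, bottom) in pixel coordinates (assumed layout).
    Returns "vertical" when the rectangle is taller than it is wide,
    otherwise "horizontal" (ties fall to "horizontal" by assumption).
    """
    left, top, right, bottom = rect
    width = right - left
    height = bottom - top
    return "vertical" if height > width else "horizontal"
```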

[0034] When the result is affirmative, the character extraction unit 105 extracts from the extracted character line a character image for each single character (the coordinate values of the circumscribed rectangle of the character image) (S1510). The character recognition unit 107 determines whether any unrecognized character remains in the extracted character line (S1512). If none remains, the process ends; if one remains, the unit recognizes the character in that character image area (S1514).

[0035] The horizontally written character string determination unit 108 determines whether the recognized character is a two-digit connected numeral (S1516). If it is, the horizontally written character string division recognition unit 110 divides the character image equally in the left-right direction (S1518) and re-recognizes a character from each of the two new character images (S1520). The recognition result output unit 111 compares the similarity of the character recognized by the character recognition unit 107 with that of the character string recognized by the horizontally written character string division recognition unit 110, selects the character or character string with the higher similarity, and displays it on the display screen (S1522). The process then returns to S1512.

[0036] When the determination in S1516 is negative, the horizontally written character string determination unit 108 determines whether the previously recognized character is a postal code mark (S1602). If it is, the horizontally written character string division recognition unit 110 divides the area of the character image that follows the character recognized as the postal code mark into three parts, left to right, by a projection operation (S1604). It then re-recognizes a character from each of the three new character images (S1606).

[0037] As in S1522, the recognition result output unit 111 selects the recognition result with the higher similarity from the results of the character recognition unit 107 and the horizontally written character string division recognition unit 110 (S1608), and the process returns to S1512. When the determination in S1602 is negative, the horizontally written character string determination unit 108 determines whether the preceding character is a vertically written hyphen (S1610). If it is not, the process returns to S1512.

[0038] If it is, the horizontally written character string division recognition unit 110 estimates the number of characters in the character image that follows the character recognized as a hyphen, using the reference width of the character components or half the height as the reference, performs the division (S1612), and proceeds to S1606. When the character line direction determination unit 104 determines in S1508 that the character line is written horizontally, the character extraction unit 105 extracts a character image for each single character (S1702).
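
The branching at S1516, S1602, and S1610 for a vertically written line can be summarized in a small sketch. The label strings, function names, and rounding rule below are hypothetical and only illustrate the order of the tests. S1516 is read here as leading to the equal split when the character is a two-digit connected numeral, as paragraphs [0036] and [0040] imply.

```python
def choose_split(recognized, previous):
    """Return the division strategy for the current character image.

    recognized: class label of the current character (hypothetical labels).
    previous:   class label of the character recognized just before it.
    """
    if recognized == "two_digit":       # S1516 affirmative -> S1518
        return "equal_halves"
    if previous == "postal_mark":       # S1602 affirmative -> S1604
        return "projection_into_three"
    if previous == "hyphen":            # S1610 affirmative -> S1612
        return "estimated_count"
    return None                         # no re-division; back to S1512


def estimate_char_count(image_width, reference_width):
    """Estimate how many characters fit in the image (used for S1612).

    Assumption: the count is the image width divided by a reference
    character width, rounded to the nearest integer, and at least 1.
    """
    return max(1, int(image_width / reference_width + 0.5))
```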

[0039] The character recognition unit 107 determines whether a character is present (S1704); if none is present, the process ends. If a character is present, the unit extracts its features and recognizes the character by matching against the recognition dictionary 106 (S1706). The horizontally written character string determination unit 108 determines whether the recognized character is a two-digit connected numeral (S1708). If it is not, the process returns to S1704.

[0040] If it is a two-digit connected numeral, the horizontally written character string division recognition unit 110 divides the character image into two equal parts in the left-right direction, extracts the features of each character image, and recognizes the characters by matching against the recognition dictionary 106 (S1712). The recognition result output unit 111 selects, from the character or character string recognized by the character recognition unit 107 and by the horizontally written character string division recognition unit 110, the one with the higher similarity (S1714), and the process returns to S1704.
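
The equal left-right division used for two-digit connected numerals can be sketched as below, assuming a binary image stored as a list of rows of 0/1 values. The function name is hypothetical, and giving the extra column to the right half when the width is odd is an assumption.

```python
def split_equal_halves(image):
    """Split a binary image (list of rows) into left and right halves.

    For an odd width the right half receives the extra column (assumption).
    """
    width = len(image[0])
    mid = width // 2
    left = [row[:mid] for row in image]
    right = [row[mid:] for row in image]
    return left, right
```

Each half would then be passed to the same feature extraction and dictionary matching as any single-character image.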

[0041] When the determination is affirmative in S1604 and S1610, the loose character component integration unit 109 checks whether any loose character component lies outside the character image to be recognized, to its left or right (in the horizontal direction), within a predetermined distance of the character line. If one exists, it is integrated into the character image to be recognized. The horizontally written character string division recognition unit 110 then performs the projection operation on the integrated character image and divides it.
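
The integration of loose components can be illustrated with bounding rectangles. This is a sketch under stated assumptions: rectangles are `(left, top, right, bottom)`, the distance is measured as the horizontal gap from the target character rectangle (the source measures it from the character line), a component must overlap the target vertically to qualify, and the function and parameter names are hypothetical.

```python
def integrate_loose_components(char_rect, components, max_gap):
    """Merge nearby left/right components into the character rectangle.

    char_rect:  (left, top, right, bottom) of the character image.
    components: iterable of rectangles for loose character components.
    max_gap:    largest horizontal gap, in pixels, that still qualifies.
    """
    left, top, right, bottom = char_rect
    merged = [left, top, right, bottom]
    for c_left, c_top, c_right, c_bottom in components:
        overlaps_vertically = c_top < bottom and c_bottom > top
        # Positive gap: the component lies wholly to the right or to the left.
        gap = max(c_left - right, left - c_right)
        if overlaps_vertically and 0 <= gap <= max_gap:
            merged[0] = min(merged[0], c_left)
            merged[1] = min(merged[1], c_top)
            merged[2] = max(merged[2], c_right)
            merged[3] = max(merged[3], c_bottom)
    return tuple(merged)
```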

[0042] Although the character recognition device according to the present invention is shown as a configuration diagram in FIG. 1 in the above embodiment, it can readily be implemented on another, independent computer system by recording a program that performs the functions of each component on a computer-readable recording medium such as a floppy disk and transporting it, or by transferring the program over a communication line.

[0043]

[Effects of the Invention] As described above, the character recognition device according to the present invention comprises: character component detection means for detecting character components from an input binary character image; character line extraction means for connecting the detected character components and extracting a character line; vertical/horizontal determination means for determining, according to the longitudinal direction of the character line, whether the extracted character line is written vertically or horizontally; character image area detection means for detecting a first character image area that forms a character image as a single character within the character line; first character recognition means for recognizing the detected first character image area as a single character; horizontally written character string determination means for determining, when the vertical/horizontal determination means has determined that the character line is written vertically, that the character image in the first character image area is a horizontally written character string, based on the recognition result of the first character recognition means; second character recognition means for dividing, when the horizontally written character string determination means has determined a horizontally written character string, the first character image area horizontally into a plurality of second character image areas and recognizing a character from the character image in each second character image area; and recognition result selection means for selecting, as the recognition result, either the character recognized by the first character recognition means or the character string recognized by the second character recognition means. With this configuration, characters in a horizontally written character string contained in a vertically written character line can be recognized correctly.

[0044] Further, the horizontally written character string determination means has a postal code determination unit that, when a character recognized by the first character recognition means is a postal code mark and the character recognized by the first character recognition means following that postal code mark has been recognized as something other than a numeral, determines that the character image in the first character image area recognized as a non-numeral is a horizontally written character string. The second character recognition means has a first division unit that divides the determined first character image area horizontally into three when the postal code determination unit has determined a horizontally written character string. With this configuration, the three-digit horizontally written numeral string that follows a postal code mark can be recognized correctly.

[0045] Further, the horizontally written character string determination means has a hyphen determination unit that, when a character recognized by the first character recognition means is a vertically written hyphen and the character recognized by the first character recognition means following that hyphen has been recognized as something other than a numeral, determines that the character image in the first character image area recognized as a non-numeral is a horizontally written character string. The second character recognition means has a second division unit that divides the determined first character image area horizontally when the hyphen determination unit has determined a horizontally written character string. With this configuration, the horizontally written numeral string that follows a hyphen can be recognized correctly.

[0046] Further, the horizontally written character string determination means has an alphabet determination unit that determines that the character image in the first character image area is a horizontally written character string when the character recognized by the first character recognition means is an alphabetic letter. With this configuration, a character image recognized as an alphabetic letter, which appears infrequently in vertically written character lines, can be recognized correctly.

[0047] Further, the second character recognition means has an integration unit that, when a character component not included in the first character image area lies within a predetermined distance of it in the horizontal direction, integrates that character component into the first character image area. With this configuration, a commonly occurring character line extraction error is corrected and characters can be recognized correctly.

[0048] Further, the second character recognition means has a valley division unit that converts any area enclosed by character components in the first character image area into the same element as the character components, performs projection processing in the vertical direction, and divides the first character image area at the valleys of the projected profile. With this configuration, character images in which a horizontally written character string has been divided accurately can be obtained, so characters can be recognized correctly.
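
The valley division described above can be sketched in three steps: fill the enclosed regions, project vertically, and cut at the valleys. This is a minimal illustration assuming a binary image stored as a list of rows (1 for a black pixel), 4-connectivity for deciding what counts as enclosed, and a valley defined as a run of columns whose projection is at or below a threshold. The function names and the `threshold` parameter are hypothetical.

```python
from collections import deque


def fill_enclosed(image):
    """Set background pixels that cannot reach the image border to 1."""
    h, w = len(image), len(image[0])
    outside = [[False] * w for _ in range(h)]
    queue = deque()
    # Seed the flood fill with every background pixel on the border.
    for y in range(h):
        for x in range(w):
            if (y in (0, h - 1) or x in (0, w - 1)) and image[y][x] == 0:
                outside[y][x] = True
                queue.append((y, x))
    while queue:
        y, x = queue.popleft()
        for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
            if 0 <= ny < h and 0 <= nx < w and image[ny][nx] == 0 and not outside[ny][nx]:
                outside[ny][nx] = True
                queue.append((ny, nx))
    return [[0 if outside[y][x] else 1 for x in range(w)] for y in range(h)]


def vertical_projection(image):
    """Count black pixels in each column."""
    return [sum(column) for column in zip(*image)]


def valley_cuts(projection, threshold=0):
    """Return one cut column per interior valley (run of low columns)."""
    cuts, start = [], None
    # A sentinel above the threshold closes any valley at the right edge.
    for x, value in enumerate(projection + [threshold + 1]):
        if value <= threshold:
            if start is None:
                start = x
        elif start is not None:
            # Skip valleys that touch the left or right image edge.
            if start > 0 and x < len(projection):
                cuts.append((start + x - 1) // 2)
            start = None
    return cuts
```

Filling enclosed regions first keeps the hollow interior of a character such as "0" from producing a dip in the projection that could be mistaken for a gap between characters.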

[0049] Further, the first character recognition means has a recognition dictionary for recognizing a single character from the character image in the first image area, and two-digit connected numerals covering every combination of two digits are registered in that dictionary. The horizontally written character string determination means has a two-digit connected numeral determination unit that determines that the character image in the first character image area is a horizontally written character string when the first character recognition means has recognized a two-digit connected numeral. The second character recognition means has a third division unit that divides the first image area determined to be a horizontally written character string by the two-digit connected numeral determination unit into two equal parts horizontally. With this configuration, touching two-digit connected numerals contained in a vertically written character line can be recognized correctly.

[0050] Further, the recognition result selection means has a selection unit that selects whichever has the higher similarity of the character recognized by the first character recognition means and the character string recognized by the second character recognition means. With this configuration, the more reliable of the recognized characters can be taken as the recognition result. In addition, the character recognition method according to the present invention comprises: a character component detection step of detecting character components from an input binary character image; a character line extraction step of connecting the detected character components and extracting a character line; a vertical/horizontal determination step of determining, according to the longitudinal direction of the character line, whether the extracted character line is written vertically or horizontally; a character image area detection step of detecting a first character image area that forms a character image as a single character within the character line; a first character recognition step of recognizing the detected first character image area as a single character; a horizontally written character string determination step of determining, when the character line has been determined in the vertical/horizontal determination step to be written vertically, that the character image in the first character image area is a horizontally written character string, based on the recognition result of the first character recognition step; a second character recognition step of dividing, when a horizontally written character string has been determined in the horizontally written character string determination step, the first character image area horizontally into a plurality of second character image areas and recognizing a character from the character image in each second character image area; and a recognition result selection step of selecting, as the recognition result, either the character recognized in the first character recognition step or the character string recognized in the second character recognition step. With this configuration, characters in a horizontally written character string contained in a vertically written character line can be recognized correctly.

[0051] Furthermore, the present invention is a computer-readable recording medium on which is recorded, as a program, a method comprising: a character component detection step of detecting character components from an input binary character image; a character line extraction step of connecting the detected character components and extracting a character line; a vertical/horizontal determination step of determining, according to the longitudinal direction of the character line, whether the extracted character line is written vertically or horizontally; a character image area detection step of detecting a first character image area that forms a character image as a single character within the character line; a first character recognition step of recognizing the detected first character image area as a single character; a horizontally written character string determination step of determining, when the character line has been determined in the vertical/horizontal determination step to be written vertically, that the character image in the first character image area is a horizontally written character string, based on the recognition result of the first character recognition step; a second character recognition step of dividing, when a horizontally written character string has been determined in the horizontally written character string determination step, the first character image area horizontally into a plurality of second character image areas and recognizing a character from the character image in each second character image area; and a recognition result selection step of selecting, as the recognition result, either the character recognized in the first character recognition step or the character string recognized in the second character recognition step. With this configuration, applying the medium to a character recognition device that lacks the ability to correctly recognize characters in a horizontally written character string contained in a vertically written character line gives that device this ability.

[Brief description of the drawings]

FIG. 1 is a configuration diagram of an embodiment of the character recognition device according to the present invention.

FIG. 2 shows a binary document image to be recognized, as input by the binary image input unit of the embodiment.

FIG. 3 shows the character components detected by the character component detection unit of the embodiment.

FIG. 4 shows the character lines extracted by the character line extraction unit of the embodiment.

FIG. 5(a) illustrates the rules applied by the character extraction unit of the embodiment. FIG. 5(b) illustrates the character image areas extracted by the character extraction unit of the embodiment.

FIG. 6 shows an example of the two-digit connected numeral images registered in the two-digit connected numeral cluster dictionary portion of the recognition dictionary of the embodiment.

FIG. 7 shows the characters (recognition results) recognized from character line 401 by the character recognition unit of the embodiment, and the characters that the horizontally written character string determination unit judged to be possible horizontally written character strings.

FIG. 8 illustrates the character image areas extracted by the character extraction unit of the embodiment when the input document image is a horizontally written character string.

FIG. 9 shows the recognition results of the character recognition unit for the horizontally written character string of FIG. 8, and the characters that the horizontally written character string determination unit judged to be re-dividable.

FIG. 10 illustrates the integration of loose character components by the loose character component integration unit of the embodiment.

FIG. 11 illustrates how the horizontally written character string division recognition unit of the embodiment divides a character image by a projection operation.

FIG. 12 illustrates the division of a two-digit connected numeral image by the horizontally written character string division recognition unit of the embodiment.

FIG. 13 illustrates how the horizontally written character string division recognition unit divides a character image integrated by the loose character component integration unit of the embodiment.

FIG. 14 shows recognition results of the horizontally written character string division recognition unit of the embodiment.

FIG. 15 is a flowchart illustrating the operation of the embodiment.

FIG. 16 is a flowchart illustrating the operation of the embodiment.

FIG. 17 is a flowchart illustrating the operation of the embodiment.

[Explanation of symbols]

101 Binary image input unit
102 Character component detection unit
103 Character line extraction unit
104 Character line direction determination unit
105 Character extraction unit
106 Recognition dictionary
107 Character recognition unit
108 Horizontally written character string determination unit
109 Loose character component integration unit
110 Horizontally written character string division recognition unit
111 Recognition result output unit

Continuation of front page
(72) Inventor: Ichiro Nakao, c/o Matsushita Electric Industrial Co., Ltd., 1006 Oaza Kadoma, Kadoma-shi, Osaka
(72) Inventor: Yoshikatsu Ito, c/o Matsushita Electric Industrial Co., Ltd., 1006 Oaza Kadoma, Kadoma-shi, Osaka
F-term (reference): 5B029 AA01 BB02 CC20 CC28 EE08; 5B064 AA01 AA04 AB13 AB16 AB19 CA08 DC08 DC09 DC18 EA26 EA36

Claims

[Claims]

1. A character recognition device comprising: character component detection means for detecting character components from an input binary character image; character line extraction means for connecting the detected character components and extracting a character line; vertical/horizontal determination means for determining, according to the longitudinal direction of the character line, whether the extracted character line is written vertically or horizontally; character image area detection means for detecting a first character image area that forms a character image as a single character within the character line; first character recognition means for recognizing the detected first character image area as a single character; horizontally written character string determination means for determining, when the vertical/horizontal determination means has determined that the character line is written vertically, that the character image in the first character image area is a horizontally written character string, based on the recognition result of the first character recognition means; second character recognition means for dividing, when the horizontally written character string determination means has determined a horizontally written character string, the first character image area horizontally into a plurality of second character image areas and recognizing a character from the character image in each second character image area; and recognition result selection means for selecting, as the recognition result, either the character recognized by the first character recognition means or the character string recognized by the second character recognition means.

2. The character recognition device according to claim 1, wherein the horizontally written character string determination means has a postal code determination unit that, when a character recognized by the first character recognition means is a postal code mark and the character recognized by the first character recognition means following that postal code mark has been recognized as something other than a numeral, determines that the character image in the first character image area recognized as a non-numeral is a horizontally written character string; and the second character recognition means has a first division unit that divides the determined first character image area horizontally into three when the postal code determination unit has determined a horizontally written character string.

3. The character recognition device according to claim 1, wherein the horizontally written character string determination means has a hyphen determination unit that, when a character recognized by the first character recognition means is a vertically written hyphen and the character recognized by the first character recognition means following that hyphen has been recognized as something other than a numeral, determines that the character image in the first character image area recognized as a non-numeral is a horizontally written character string; and the second character recognition means has a second division unit that divides the determined first character image area horizontally when the hyphen determination unit has determined a horizontally written character string.

4. The character recognition device according to claim 1, wherein the horizontally written character string determination means has an alphabet determination unit that determines that the character image in the first character image area is a horizontally written character string when the character recognized by the first character recognition means is an alphabetic letter.

5. The character recognition device according to claim 1, wherein the second character recognition means has an integration unit that, when a character component not included in the first character image area lies within a predetermined distance of it in the horizontal direction, integrates that character component into the first character image area.

6. The character recognition device according to claim 2, wherein the second character recognition means has a valley division unit that converts any area enclosed by character components in the first character image area into the same element as the character components, performs projection processing in the vertical direction, and divides the first character image area at the valleys of the projected profile.

7. The character recognition device according to claim 1, wherein the first character recognition means has a recognition dictionary for recognizing a single character from the character image in the first image area, two-digit connected numerals covering every combination of two digits being registered in the recognition dictionary; the horizontally written character string determination means has a two-digit connected numeral determination unit that determines that the character image in the first character image area is a horizontally written character string when the first character recognition means has recognized a two-digit connected numeral; and the second character recognition means has a third division unit that divides the first image area determined to be a horizontally written character string by the two-digit connected numeral determination unit into two equal parts horizontally.

8. The character recognition device according to any one of claims 1 to 7, wherein the recognition result selection means has a selection unit that selects whichever has the higher similarity of the character recognized by the first character recognition means and the character string recognized by the second character recognition means.

9. A character recognition method comprising: a character component detecting step of detecting character components from an input binary character image; a character line extracting step of connecting the detected character components and extracting a character line; a vertical/horizontal determining step of determining whether the extracted character line is vertically written or horizontally written according to the longitudinal direction of the character line; a character image area detecting step of detecting a first character image area that forms a character image as one character in the character line; a first character recognition step of recognizing a character as one character from the detected first character image area; a horizontally written character string determining step of, when the character line is determined to be vertically written in the vertical/horizontal determining step, determining the character image in the first character image area to be a horizontally written character string based on the recognition result of the first character recognition step; a second character recognition step of dividing the first character image area determined to be a horizontally written character string in the horizontally written character string determining step into a plurality of second character image areas in the horizontal direction, and recognizing characters from the character images in the second character image areas; and a recognition result selecting step of selecting, as a recognition result, one of the character recognized in the first character recognition step and the character string recognized in the second character recognition step.
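The per-area control flow of the method in claim 9 can be sketched as below. Every callable is a hypothetical placeholder supplied by the caller: `recognize` stands for dictionary matching, `is_horizontal_candidate` for the horizontally written character string determination, `split_horizontally` for the horizontal division, and `select` for the recognition result selection. The sketch covers only the steps after line extraction and direction determination.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

Area = TypeVar("Area")
Recognition = Tuple[str, float]  # (recognized character, similarity)


def recognize_area(
    area: Area,
    line_is_vertical: bool,
    recognize: Callable[[Area], Recognition],
    is_horizontal_candidate: Callable[[str], bool],
    split_horizontally: Callable[[Area], Sequence[Area]],
    select: Callable[[Recognition, List[Recognition]], str],
) -> str:
    """Recognize one first character image area following the claimed step order."""
    # First character recognition step: treat the whole area as one character.
    first = recognize(area)
    # Horizontal string determination applies only inside vertically written lines.
    if not line_is_vertical or not is_horizontal_candidate(first[0]):
        return first[0]
    # Second character recognition step: divide horizontally, recognize each part.
    second = [recognize(sub) for sub in split_horizontally(area)]
    # Recognition result selecting step.
    return select(first, second)
```

Passing the steps in as callables keeps the sketch independent of any particular dictionary, division rule or similarity measure.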

10. A computer-readable recording medium recording, as a program, a character recognition method comprising: a character component detecting step of detecting character components from an input binary character image; a character line extracting step of connecting the detected character components and extracting a character line; a vertical/horizontal determining step of determining whether the extracted character line is vertically written or horizontally written according to the longitudinal direction of the character line; a character image area detecting step of detecting a first character image area that forms a character image as one character in the character line; a first character recognition step of recognizing a character as one character from the detected first character image area; a horizontally written character string determining step of, when the character line is determined to be vertically written in the vertical/horizontal determining step, determining the character image in the first character image area to be a horizontally written character string based on the recognition result of the first character recognition step; a second character recognition step of dividing the first character image area determined to be a horizontally written character string in the horizontally written character string determining step into a plurality of second character image areas in the horizontal direction, and recognizing characters from the character images in the second character image areas; and a recognition result selecting step of selecting, as a recognition result, one of the character recognized in the first character recognition step and the character string recognized in the second character recognition step.