JPH0433080A - Method for recognizing character in table - Google Patents

Method for recognizing character in table

Info

Publication number
JPH0433080A
JPH0433080A JP2134876A JP13487690A
Authority
JP
Japan
Prior art keywords
frame
character
line
scanning direction
main scanning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2134876A
Other languages
Japanese (ja)
Other versions
JP2931041B2 (en)
Inventor
Goro Bessho
吾朗 別所
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd filed Critical Ricoh Co Ltd
Priority to JP2134876A priority Critical patent/JP2931041B2/en
Publication of JPH0433080A publication Critical patent/JPH0433080A/en
Application granted granted Critical
Publication of JP2931041B2 publication Critical patent/JP2931041B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Landscapes

  • Character Input (AREA)

Abstract

PURPOSE: To enable accurate character recognition by extracting both horizontally written and vertically written character lines in a table, by discriminating the length of each frame in the main scanning and sub-scanning directions and switching the character line extraction method for each frame according to the result of the discrimination.
CONSTITUTION: Frames surrounded by line segments in the main scanning and sub-scanning directions are extracted from the table area of a document image. The lengths of each frame in the main scanning and sub-scanning directions are then discriminated, and the method for extracting the character lines in each frame is switched according to the result. That is, whether the characters printed in each frame of the table are written horizontally or vertically is discriminated automatically, and a character line extraction method suited to the discriminated direction is applied. Thus, even in a table in which horizontally and vertically written frames are mixed, the character lines in each frame are extracted accurately, and the accuracy of character recognition is improved.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は、文字認識装置における文書の表内文字認識方
法に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a method for recognizing characters within tables of a document in a character recognition device.

〔従来の技術〕[Conventional technology]

文字認識装置においては、文書画像を文字領域、写真や図形などのイメージ領域、表領域などに分割し、それぞれの領域に別の処理を行うことが多い。
In character recognition devices, a document image is often divided into character areas, image areas such as photographs and figures, table areas, and so on, and different processing is performed on each area.

表領域に関しては、表を構成する罫線の位置を認識し、
罫線で囲まれた枠内の画像に対して連結した黒画素の追
跡を行い、黒画素連結の外接矩形を求め、それを統合し
て文字行を抽出し、文字認識している。
For the table area, the positions of the ruled lines that make up the table are recognized, connected black pixels are traced in the image within each frame surrounded by ruled lines, the circumscribed rectangles of the connected black pixels are found, and these rectangles are integrated to extract character lines, which are then recognized.

〔発明が解決しようとする課題〕 しかし従来は、表の各枠内の文字が横書きまたは縦書きのいずれか一方で印字されていることを前提に文字行抽出処理をしているため、例えば第3図に示すような横書きの文字行と縦書きの文字行が混在した表の場合、横書き(縦書き)を前提としているときには縦書き文字行(横書き文字行)の抽出が正確に行われず、結果として文字認識が正確に行われないことがある。
[Problem to be Solved by the Invention] Conventionally, however, character line extraction is performed on the assumption that the characters in each frame of a table are printed either all horizontally or all vertically. In a table where horizontal and vertical character lines are mixed, as shown in Fig. 3, vertical character lines (or horizontal character lines) are therefore not extracted accurately when horizontal (or vertical) writing is assumed, and as a result character recognition may not be performed accurately.

本発明の目的は、表内の横書き文字行も縦書き文字行も
正確に抽出して文字認識することができる表内文字認識
方法を提供することにある。
An object of the present invention is to provide a table character recognition method that can accurately extract and recognize both horizontally written character lines and vertically written character lines in a table.

〔課題を解決するための手段〕[Means to solve the problem]

本発明は、文書画像の表領域より、主走査方向及び副走
査方向の線分で囲まれた枠を抽出し、各枠内の文字行を
抽出して文字認識する表内文字認識方法において、各枠
の主走査方向の長さ及び副走査方向の長さによって各枠
内の文字行が横書きであるか縦書きであるかを判別し、
この判別の結果に応じて各枠内の文字行の抽出方法を切
り替えることを特徴とする。
The present invention provides a table character recognition method in which frames surrounded by line segments in the main scanning direction and the sub-scanning direction are extracted from the table area of a document image, and the character lines within each frame are extracted and recognized. The method is characterized in that whether the character line in each frame is written horizontally or vertically is determined from the length of each frame in the main scanning direction and its length in the sub-scanning direction, and the method for extracting the character lines within each frame is switched according to the result of this determination.

〔作 用〕[Operation]

本発明によれば、表中の各枠内に印字された文字が横書
きであるか縦書きであるかが自動的に判別され、判別さ
れた方向に適した文字行抽出方法が適用されることによ
り、横書きの枠と縦書きの枠が混在した表においても、
各枠内の文字行が正確に抽出され、したがって各枠内の
文字の切り出し及び文字認識の精度が上がる。
According to the present invention, it is automatically determined whether the characters printed in each frame in the table are written horizontally or vertically, and a character line extraction method suitable for the determined direction is applied. Therefore, even in tables with a mixture of horizontal and vertical frames,
The character lines within each frame are accurately extracted, thus increasing the accuracy of character extraction and character recognition within each frame.

〔実施例〕[Embodiment]

第1図は本発明の一実施例を示すブロック図。 FIG. 1 is a block diagram showing one embodiment of the present invention.

第2図は処理のフローチャートである。FIG. 2 is a flowchart of the process.

スキャナーなどの2値画像入力部101によって文書を
読取り、その2値画像を2値イメージメモリ102に格
納する(処理ステップ201)。
A document is read by a binary image input unit 101 such as a scanner, and the binary image is stored in the binary image memory 102 (processing step 201).

この文書画像に対して、表領域認識部103はランレン
グス分布などを利用して表領域を自動的に認識するか、
あるいはマウスなどを用いて操作者から指定された領域
を表領域として認識し、表領域のイメージを表領域イメ
ージメモリ104に格納する(処理ステップ202)。
For this document image, the table area recognition unit 103 automatically recognizes the table area using run length distribution or the like.
Alternatively, an area specified by the operator using a mouse or the like is recognized as a table area, and an image of the table area is stored in the table area image memory 104 (processing step 202).

この表領域のイメージに対し、主走査方向線分抽出部1
05において、主走査方向に連結した黒画素を追跡して
主走査方向の線分を抽出し、その始点及び終点の座標を
主走査方向線分座標メモリ106に格納する(処理ステ
ップ203)。同様に副走査方向線分抽出部107にお
いて、表領域イメージ内の副走査方向に連結した黒画素
を追跡して副走査方向の線分を抽出し、その始点及び終
点の座標を副走査方向線分座標メモリ108に格納する
(処理ステップ204)。
For this image of the table area, the main scanning direction line segment extraction unit 105 traces black pixels connected in the main scanning direction, extracts line segments in the main scanning direction, and stores the coordinates of their start and end points in the main scanning direction line segment coordinate memory 106 (processing step 203). Similarly, the sub-scanning direction line segment extraction unit 107 traces black pixels connected in the sub-scanning direction within the table area image, extracts line segments in the sub-scanning direction, and stores the coordinates of their start and end points in the sub-scanning direction line segment coordinate memory 108 (processing step 204).
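The run tracing in processing steps 203 and 204 can be sketched as follows — a minimal illustration assuming the binary image is given as nested lists of 0/1 pixels. The function name and the `min_len` threshold that separates ruled-line segments from ordinary character strokes are hypothetical, not from the patent:

```python
# Hypothetical sketch of processing step 203: trace runs of connected
# black (1) pixels along each scan line to extract horizontal line segments.
def extract_horizontal_segments(image, min_len=3):
    """Return (row, start_col, end_col) for each horizontal run of 1s
    at least min_len pixels long (end point inclusive)."""
    segments = []
    for y, row in enumerate(image):
        x = 0
        while x < len(row):
            if row[x] == 1:
                start = x
                while x < len(row) and row[x] == 1:
                    x += 1
                if x - start >= min_len:
                    segments.append((y, start, x - 1))
            else:
                x += 1
    return segments

# A tiny frame border: long horizontal ruled lines in rows 0 and 2.
img = [
    [1, 1, 1, 1, 1, 1],
    [1, 0, 0, 0, 0, 1],
    [1, 1, 1, 1, 1, 1],
]
```

Sub-scanning direction segments (step 204) would be extracted the same way over columns instead of rows.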

次に枠認識部109において、各メモリ106、108
に格納された線分座標を参照し、主走査方向線分と副走
査方向線分の組合せにより表の枠を認識し、枠の座標例
えば対角頂点の座標を枠座標メモリ110に格納する(
処理ステップ205)。
Next, the frame recognition unit 109 refers to the line segment coordinates stored in the memories 106 and 108, recognizes the frames of the table from combinations of main scanning direction and sub-scanning direction line segments, and stores the frame coordinates, for example the coordinates of the diagonal vertices, in the frame coordinate memory 110 (processing step 205).
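For the simple case of a regular grid, the combination step can be sketched as below; the patent does not give the exact pairing rule, so this assumes the extracted segments have already been reduced to sorted horizontal and vertical ruled-line coordinates:

```python
# Hypothetical sketch of processing step 205: form frames (cells) from the
# grid defined by horizontal and vertical ruled-line positions, returning
# the diagonal-vertex coordinates (top, left, bottom, right) of each frame.
def frames_from_grid(h_lines, v_lines):
    frames = []
    for top, bottom in zip(h_lines, h_lines[1:]):      # adjacent horizontal rules
        for left, right in zip(v_lines, v_lines[1:]):  # adjacent vertical rules
            frames.append((top, left, bottom, right))
    return frames
```

A real table may have merged cells, so a production version would check that all four bounding segments actually exist before emitting a frame.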

また枠領域抽出部111において、枠座標メモリ110
内の枠座標を参照することにより、表領域イメージメモ
リ104より枠の領域の画像を抽出して枠領域画像メモ
リ112に格納する(処理ステップ206)。
By referring to the frame coordinates in the frame coordinate memory 110, the frame area extraction unit 111 extracts the image of each frame area from the table area image memory 104 and stores it in the frame area image memory 112 (processing step 206).
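Processing step 206 amounts to cropping each frame's rectangle out of the table area image. A minimal sketch, assuming the same nested-list image representation and inclusive diagonal-vertex coordinates (both conventions are assumptions):

```python
# Hypothetical sketch of processing step 206: crop the frame region
# (top, left, bottom, right inclusive) out of the table area image.
def crop_frame(image, frame):
    top, left, bottom, right = frame
    return [row[left:right + 1] for row in image[top:bottom + 1]]

table = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
]
```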

次に行方向判定部113において、枠座標メモリ110を参照して全ての枠に対して主走査方向及び副走査方向の長さのヒストグラムを作成する(処理ステップ207、208)。そして、最大度数の副走査方向の長さを持つ枠は全て行方向が横書きの枠であると判別し(処理ステップ209、210)、その長さと同じ主走査方向の長さを持つ枠は行方向が縦書きの枠であると判別し(処理ステップ211、212)、残った枠はそれまでに判別された枠数が多いほうの行方向の枠であると判別する(処理ステップ213)。なお、処理ステップ207、208でヒストグラムを求める際には各走査方向の長さにある程度の幅を持たせる。
Next, the row direction determination unit 113 refers to the frame coordinate memory 110 and creates histograms of the lengths in the main scanning direction and the sub-scanning direction for all frames (processing steps 207, 208). All frames whose sub-scanning direction length has the maximum frequency are determined to be frames whose row direction is horizontal (processing steps 209, 210); frames whose main scanning direction length equals that length are determined to be frames whose row direction is vertical (processing steps 211, 212); and each remaining frame is given the row direction of whichever class contains more of the frames determined so far (processing step 213). When the histograms are created in processing steps 207 and 208, a certain tolerance is allowed in the length in each scanning direction.
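The decision rule of steps 207–213 can be sketched as follows. Frames are represented as (main-scan length, sub-scan length) pairs, and the `tol` parameter stands in for the "certain tolerance" mentioned above; both the representation and the exact tie-breaking are assumptions:

```python
from collections import Counter

# Hypothetical sketch of processing steps 207-213: classify each frame as
# horizontally ('h') or vertically ('v') written from its side lengths.
def classify_frames(frames, tol=0):
    """frames: list of (main_len, sub_len) pairs. Returns 'h'/'v' per frame."""
    # Steps 207-208: histogram of sub-scanning direction lengths.
    sub_hist = Counter(sub for _, sub in frames)
    mode_sub = sub_hist.most_common(1)[0][0]
    labels = [None] * len(frames)
    for i, (main, sub) in enumerate(frames):
        if abs(sub - mode_sub) <= tol:        # steps 209-210: horizontal frame
            labels[i] = 'h'
        elif abs(main - mode_sub) <= tol:     # steps 211-212: vertical frame
            labels[i] = 'v'
    # Step 213: remaining frames take the majority label found so far.
    majority = 'h' if labels.count('h') >= labels.count('v') else 'v'
    return [lab if lab is not None else majority for lab in labels]
```

With two ordinary row cells of height 20, a tall narrow cell of width 20, and one ambiguous cell, the two row cells come out horizontal, the narrow cell vertical, and the ambiguous cell follows the majority.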

同様に処理ステップ211で長さを判別する際にも、比
較する長さの差がある幅の範囲内のときは一致すると判
定する。求められた行方向の情報は外接矩形抽出部11
4を経由して行画像抽出部116へ伝えられる。
Similarly, when lengths are compared in processing step 211, they are judged to match if their difference falls within a certain tolerance. The obtained row direction information is conveyed to the row image extraction unit 116 via the circumscribed rectangle extraction unit 114.

例えば第3図に示した表の場合、横書きの枠の副走査方向の長さは全て同一(あるいは、ほぼ同一)であるので、その頻度は最大である。したがって、この表の横書きの枠はすべて処理ステップ210で横書きと判別される。また、この表の縦書き文字列"データ"が印刷された枠の主走査方向の長さは、最大頻度の副走査方向の長さとほぼ同一である(差が一定の幅の範囲である)ので、処理ステップ212で縦書きの枠と判別される。
For example, in the table shown in Fig. 3, the sub-scanning direction lengths of the horizontally written frames are all the same (or nearly the same), so that length has the maximum frequency. Therefore, all horizontally written frames in this table are determined to be horizontal in processing step 210. The main scanning direction length of the frame in which the vertical character string "データ" ("data") is printed is nearly equal to the most frequent sub-scanning direction length (the difference is within the tolerance), so it is determined to be a vertically written frame in processing step 212.

次に外接矩形抽出部114において、枠領域画像メモリ
112を参照し、各枠内の画像に対して連結した黒画素
を追跡し、黒画素連結の外接矩形を抽出して、その対角
頂点の座標を外接矩形座標メモリ115に格納する(処
理ステップ214)。
Next, the circumscribed rectangle extraction unit 114 refers to the frame area image memory 112, traces connected black pixels in the image within each frame, extracts the circumscribed rectangle of each group of connected black pixels, and stores the coordinates of its diagonal vertices in the circumscribed rectangle coordinate memory 115 (processing step 214).
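Tracing connected black pixels and taking their circumscribed rectangles (step 214) is connected component labeling. A minimal sketch using 4-connectivity and a flood fill — the patent does not specify the connectivity or traversal, so these are assumptions:

```python
# Hypothetical sketch of processing step 214: find the circumscribed
# rectangle (min_y, min_x, max_y, max_x) of each 4-connected group of 1s.
def bounding_boxes(image):
    h, w = len(image), len(image[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y0 in range(h):
        for x0 in range(w):
            if image[y0][x0] == 1 and not seen[y0][x0]:
                stack, box = [(y0, x0)], [y0, x0, y0, x0]
                seen[y0][x0] = True
                while stack:  # flood fill one connected component
                    y, x = stack.pop()
                    box = [min(box[0], y), min(box[1], x),
                           max(box[2], y), max(box[3], x)]
                    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            stack.append((ny, nx))
                boxes.append(tuple(box))
    return boxes
```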

次に行画像抽出部116において、外接矩形座標メモリ115を参照し、枠領域画像メモリ112内の各枠領域画像に対して黒画素連結の外接矩形を、行方向判定部113により判別された行方向へ統合することにより、枠内の文字行画像(文字列画像)を抽出し行画像メモリ117に格納する(処理ステップ215、216)。このように各枠毎に行方向すなわち横書き・縦書きのいずれであるかの判別を行い、判別した行方向に適した方法により文字行画像抽出を行うため、横書きの枠と縦書きの枠が混在した表領域において、いずれの行方向の文字行画像も正確に抽出することが可能となる。
Next, the row image extraction unit 116 refers to the circumscribed rectangle coordinate memory 115 and, for each frame area image in the frame area image memory 112, merges the circumscribed rectangles of connected black pixels along the row direction determined by the row direction determination unit 113, thereby extracting the character line images (character string images) within the frame and storing them in the row image memory 117 (processing steps 215, 216). In this way, the row direction of each frame, that is, horizontal or vertical, is determined, and character line images are extracted by a method suited to the determined direction, so character line images in either row direction can be extracted accurately even in a table area where horizontal and vertical frames coexist.
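The direction-dependent integration of steps 215 and 216 can be sketched as grouping character rectangles whose extents overlap on the axis perpendicular to the writing direction. The overlap rule and single-pass grouping are assumed simplifications of the integration described in the patent:

```python
# Hypothetical sketch of processing steps 215-216: merge character bounding
# boxes (min_y, min_x, max_y, max_x) into one rectangle per text line.
def merge_into_lines(boxes, direction):
    """direction 'h': boxes with overlapping vertical extents share a line;
    direction 'v': boxes with overlapping horizontal extents share a line."""
    a, b = (0, 2) if direction == 'h' else (1, 3)  # axis perpendicular to line
    lines = []
    for box in sorted(boxes):
        for line in lines:
            if box[a] <= line[b] and line[a] <= box[b]:  # extents overlap
                for i in range(4):  # grow the line rectangle to cover the box
                    line[i] = min(line[i], box[i]) if i < 2 else max(line[i], box[i])
                break
        else:
            lines.append(list(box))
    return [tuple(line) for line in lines]
```

A production version would also re-merge lines that become overlapping after growing, which this single pass omits.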

次に文字認識部118において、行画像メモリ117内の各枠の文字行画像より文字画像を切り出すが、前段の文字行画像抽出が正確であるため、この文字画像切出しも正確に行うことができる。
Next, the character recognition unit 118 cuts out character images from the character line image of each frame in the row image memory 117; since the character line image extraction in the preceding stage is accurate, this character segmentation can also be performed accurately.

そして、切り出した文字画像の特徴を抽出し、認識辞書
とのマツチングを行って認識し、認識結果を外部へ出力
する(処理ステップ217)。
Then, the features of the cut out character image are extracted, recognized by matching with the recognition dictionary, and the recognition result is output to the outside (processing step 217).
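The patent does not detail the features or the matching method of step 217, so the following is only a toy stand-in: nearest-neighbour matching of raw binary pixels against a dictionary of templates, with pixel disagreement count as the distance:

```python
# Toy stand-in for processing step 217: match a cut-out binary character
# image against a recognition dictionary of same-size templates.
def recognize(char_img, dictionary):
    def distance(a, b):  # number of disagreeing pixels
        return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return min(dictionary, key=lambda label: distance(char_img, dictionary[label]))

templates = {
    'I': [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    'L': [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
}
```

A real recognizer would extract normalized features (e.g. directional strokes) rather than compare raw pixels, but the dictionary-matching structure is the same.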

〔発明の効果〕[Effect of the Invention]

以上説明した如く、本発明によれば、表中の各枠内に印字された文字行が横書きであるか縦書きであるかを自動的に判別し、判別した方向に応じた方法により文字行抽出を行うので、横書き枠と縦書き枠が混在した表においても、各枠内の文字行の切出しを精度良く行うことができ、したがって枠内文字の認識精度を上げることができる。
As explained above, according to the present invention, whether the character lines printed in each frame of a table are written horizontally or vertically is determined automatically, and character lines are extracted by a method corresponding to the determined direction. Therefore, even in a table where horizontal and vertical frames coexist, the character lines within each frame can be extracted with high precision, and the recognition accuracy of the characters within the frames is improved.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例を示すブロック図、第2図は処理のフローチャート、第3図は横書きと縦書きが混在した表の例を示す図である。
101・・・2値画像入力部、102・・・2値イメージメモリ、103・・・表領域認識部、104・・・表領域イメージメモリ、105・・・主走査方向線分抽出部、106・・・主走査方向線分座標メモリ、107・・・副走査方向線分抽出部、108・・・副走査方向線分座標メモリ、109・・・枠認識部、110・・・枠座標メモリ、111・・・枠領域抽出部、112・・・枠領域画像メモリ、113・・・行方向判定部、114・・・外接矩形抽出部、115・・・外接矩形座標メモリ、116・・・行画像抽出部、117・・・行画像メモリ、118・・・文字認識部。
第1図
FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is a flowchart of the processing, and FIG. 3 shows an example of a table in which horizontal and vertical writing are mixed. 101: binary image input unit; 102: binary image memory; 103: table area recognition unit; 104: table area image memory; 105: main scanning direction line segment extraction unit; 106: main scanning direction line segment coordinate memory; 107: sub-scanning direction line segment extraction unit; 108: sub-scanning direction line segment coordinate memory; 109: frame recognition unit; 110: frame coordinate memory; 111: frame area extraction unit; 112: frame area image memory; 113: row direction determination unit; 114: circumscribed rectangle extraction unit; 115: circumscribed rectangle coordinate memory; 116: row image extraction unit; 117: row image memory; 118: character recognition unit. Figure 1

Claims (1)

【特許請求の範囲】[Claims] (1)文書画像の表領域より、主走査方向及び副走査方
向の線分で囲まれた枠を抽出し、各枠内の文字行を抽出
して文字認識する表内文字認識方法において、各枠の主
走査方向の長さ及び副走査方向の長さによって各枠内の
文字行が横書きであるか縦書きであるかを判別し、この
判別の結果に応じて各枠内の文字行の抽出方法を切り替
えることを特徴とする表内文字認識方法。
(1) A table character recognition method in which frames surrounded by line segments in the main scanning direction and the sub-scanning direction are extracted from a table area of a document image, and character lines within each frame are extracted and recognized, characterized in that whether the character line in each frame is written horizontally or vertically is determined from the length of each frame in the main scanning direction and its length in the sub-scanning direction, and the method for extracting the character lines within each frame is switched according to the result of this determination.
JP2134876A 1990-05-24 1990-05-24 Character recognition method in table Expired - Lifetime JP2931041B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2134876A JP2931041B2 (en) 1990-05-24 1990-05-24 Character recognition method in table

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2134876A JP2931041B2 (en) 1990-05-24 1990-05-24 Character recognition method in table

Publications (2)

Publication Number Publication Date
JPH0433080A true JPH0433080A (en) 1992-02-04
JP2931041B2 JP2931041B2 (en) 1999-08-09

Family

ID=15138558

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2134876A Expired - Lifetime JP2931041B2 (en) 1990-05-24 1990-05-24 Character recognition method in table

Country Status (1)

Country Link
JP (1) JP2931041B2 (en)

Also Published As

Publication number Publication date
JP2931041B2 (en) 1999-08-09

Similar Documents

Publication Publication Date Title
JP2940936B2 (en) Tablespace identification method
JPH03122773A (en) Image forming device
JP2812982B2 (en) Table recognition method
JP3058791B2 (en) Method of extracting figure of image recognition device
JPH0433080A (en) Method for recognizing character in table
JP3095470B2 (en) Character recognition device
JPH03160582A (en) Method for separating ruled line and character in document picture data
JPH05128305A (en) Area dividing method
JPH0433079A (en) Table processing system
JP3140079B2 (en) Ruled line recognition method and table processing method
JPH05274472A (en) Image recognizing device
JP2001236464A (en) Method and device for character extraction and storage medium
JP3162414B2 (en) Ruled line recognition method and table processing method
JP2918363B2 (en) Character classification method and character recognition device
JPH11306282A (en) Method and device for extracting character string area
JPH0266681A (en) Drawing processor
JPH04303277A (en) Method for recognizing table
JP2004240500A (en) Device and program for processing image, and storing medium
JPH1049602A (en) Method for recognizing document
JPH05135202A (en) Document picture reader
JPH04156694A (en) Character recognition system
JPH0728933A (en) Character recognition device
JPH103517A (en) Device for detecting tilt angle of document picture
JPH02166583A (en) Character recognizing device
JPH04346189A (en) Character string type identification device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080521

Year of fee payment: 9

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090521

Year of fee payment: 10

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20100521

Year of fee payment: 11

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110521

Year of fee payment: 12

EXPY Cancellation because of completion of term