JPH06187489A

JPH06187489A - Character recognizing device

Info

Publication number: JPH06187489A
Application number: JP4337330A
Authority: JP
Inventors: Yumiko Ikemure; 由美子池牟▲禮▼
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1992-12-17
Filing date: 1992-12-17
Publication date: 1994-07-08

Abstract

PURPOSE:To exactly extract an area even in the case of a document for which the character intervals of binary data fetched from a scanner are wide. CONSTITUTION:After image data fetched from the scanner are reduced by an image data reduction part 8, a position where black picture elements are linked is detected by a circumscribed rectangle extraction part 9, and the information of the circumscribed rectangle of linked black picture elements is stored in a memory. When a character rectangle identification part 10 identifies the rectangle of character candidates based on the size of the circumscribed rectangle, the number of picture elements and the density of black picture elements, a character interval/line interval detection part 11 detects the character interval/ line interval both from an inter-rectangle distance between the rectangles adjacent to the up, own, right and left sides of the character candidate rectangle and from the number of times of appearance. On the condition of (character interval < line interval < area interval), a line merging part 12 extracts a line by the merging processing of rectangles with the valid merging distance of character candidate rectangles as the line interval. Then, a character area merging part 13 extracts the character area based on the extracted line, and a character recognition part 14 recognizes characters.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、印刷文書のデータベー
ス化や文書の再利用のために、スキャナ等の光学的手段
を用いて文書画像を取り込んだ上、取り込んだ画像デー
タから文字，図形，表等の属性毎に領域を抽出して、各
属性に応じた認識処理を行う文字認識装置に関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention captures a document image by using an optical means such as a scanner in order to make a database of a printed document and reuse the document, and then, a character, a graphic, The present invention relates to a character recognition device that extracts a region for each attribute such as a table and performs a recognition process according to each attribute.

【０００２】[0002]

【従来の技術】従来の文字認識装置は、スキャナにより
取り込んだ二値データの黒画素の連結状態を調べ、黒画
素が連結しているかたまりに外接する矩形(以下「外接
矩形」という)の座標を検出して、メモリに格納した
上、予め定められた閾値以下の大きさの外接矩形を文字
候補として抽出して、この文字候補の矩形(以下「文字
候補矩形」という)を統合することにより、行を抽出す
る。そして、基準となる文字候補矩形(以下「基準矩
形」という)に対して、上下左右の４方向で最も近い文
字候補矩形(以下「最短距離矩形」という)を検出した
上、図９のように文字領域の間隔が狭い文書に対して、
図10のような文字領域間にまたがった行統合を防ぐた
め、基準矩形と検出された文字候補矩形との距離が基準
矩形の矩形サイズの半分未満であれば、２つの矩形は同
一文字領域内に存在するとみなして、矩形の統合を行
う。このとき、基準矩形と検出された最短距離矩形が上
下に位置していれば縦組み文字列、左右に位置していれ
ば横組み文字列と決定する。そこで、検出された行情報
から領域の抽出を行って、同一方向に隣り合う行の幅が
等しく行間の距離が閾値以下であれば、２つの行は同一
領域内にあるとして領域の統合を行った後、抽出した文
字領域について、文字の切り出し及び認識処理を行う。2. Description of the Related Art A conventional character recognition device checks the connection state of black pixels of binary data captured by a scanner, and coordinates of a rectangle circumscribing a block in which black pixels are connected (hereinafter referred to as "circumscribing rectangle"). By detecting the, storing in memory, extracting a circumscribing rectangle of a size equal to or smaller than a predetermined threshold value as a character candidate, and integrating the character candidate rectangles (hereinafter referred to as “character candidate rectangles”). , To extract a row. Then, with respect to the reference character candidate rectangle (hereinafter referred to as "reference rectangle"), the closest character candidate rectangle (hereinafter referred to as "shortest distance rectangle") in four directions of up, down, left and right is detected, and as shown in FIG. For documents with narrow character areas,
To prevent line integration across character areas as shown in Fig. 10, if the distance between the reference rectangle and the detected character candidate rectangle is less than half the rectangle size of the reference rectangle, the two rectangles are within the same character area. Rectangles are merged, assuming that they exist in. At this time, if the shortest distance rectangle detected as the reference rectangle is located vertically, it is determined to be a vertically-assembled character string, and if it is located to the left or right, it is determined to be a horizontally-assembled character string. Therefore, an area is extracted from the detected row information, and if the widths of adjacent rows in the same direction are equal and the distance between the rows is equal to or less than a threshold value, it is determined that the two rows are in the same area and the areas are integrated. After that, character extraction and recognition processing is performed on the extracted character area.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、従来の
方式では、矩形統合の有効距離を基準矩形の矩形サイズ
の半分未満としているため、図３のように文字間隔が文
字サイズの半分以上あるような文書では領域が複数に分
かれてしまうといった課題を有していた。However, in the conventional method, since the effective distance of rectangle integration is set to less than half of the rectangle size of the reference rectangle, the character interval is more than half the character size as shown in FIG. The document has a problem that the area is divided into a plurality of areas.

【０００４】本発明は、このような課題に鑑みてなされ
たもので、文字間隔が広い文書でも正確に領域抽出でき
る文字認識装置を提供することを目的としている。The present invention has been made in view of the above problems, and an object thereof is to provide a character recognition device capable of accurately extracting a region even in a document having a wide character space.

【０００５】[0005]

【課題を解決するための手段】本発明は、文字統合処理
を軽減するために、スキャナより取り込んだ画像データ
を縮小する手段と、縮小データに対して外接矩形を検出
して、外接矩形の大きさが予め定められた閾値以下の矩
形を文字候補矩形として識別する手段と、識別した全文
字候補矩形の最頻サイズを認識対象文書の基準文字サイ
ズとする手段と、各文字候補矩形に対して左右に隣接す
る矩形を検出して、矩形間の距離を算出した上、最も出
現回数の多い距離を認識対象文書の水平方向の矩形間距
離として、出現回数と距離とを記憶させる手段と、各文
字候補矩形に対して上下に隣接する矩形を検出して、矩
形間の距離を算出した上、最も出現回数の多い距離を認
識対象文書の垂直方向の矩形間距離として、出現回数と
距離とを記憶させる手段と、検出した水平方向の矩形間
距離及び垂直方向の矩形間距離とそれらの出現回数か
ら、文字組み方向，文字間，行間を検出する手段と、
(文字間＜行間＜領域間)の条件を用いて、文字候補矩形
の統合有効距離を行間として矩形の統合処理を行って、
行を抽出する手段と、抽出された行を基に文字領域を抽
出して、それぞれの文字領域について文字の切り出し及
び認識を行う手段とからなるものである。SUMMARY OF THE INVENTION The present invention, in order to reduce the character integration process, a means for reducing image data captured by a scanner and a circumscribed rectangle for the reduced data to detect the size of the circumscribed rectangle. Means for identifying a rectangle having a predetermined threshold value or less as a character candidate rectangle, means for determining the mode size of all identified character candidate rectangles as the reference character size of the recognition target document, and for each character candidate rectangle Detecting adjacent rectangles on the left and right, calculating the distance between the rectangles, and storing the number of appearances and the distance as the distance between the rectangles having the largest number of appearances as the horizontal distance between the rectangles of the recognition target document. The rectangles that are vertically adjacent to the character candidate rectangle are detected, the distance between the rectangles is calculated, and the number of appearances and the distance are defined as the distance between the rectangles with the largest number of appearances, which is the vertical distance between the rectangles of the recognition target document. Memorize Means, from the rectangular distance and the number of occurrences of their rectangular distance and vertical detected horizontal, mojikumi direction, between characters, and means for detecting the line space,
Using the condition (between characters <line spacing <between areas), the rectangle integration processing is performed with the integrated effective distance of the character candidate rectangle as the line spacing.
It is composed of means for extracting a line and means for extracting a character region based on the extracted line and cutting out and recognizing a character for each character region.

【０００６】[0006]

【作用】本発明によれば、矩形を統合する有効距離を行
間とすることにより、文字間隔が広い文書でも領域抽出
を正確に行うことができる。According to the present invention, the effective distance for integrating rectangles is set to the line spacing, so that the region extraction can be accurately performed even in a document having a wide character space.

【０００７】又、行統合処理を水平方向と垂直方向とで
それぞれ個別に行うことによって、縦組みと、横組みと
の混在文書にも対応できる。By separately performing the line integration processing in the horizontal direction and the vertical direction, respectively, it is possible to deal with a mixed document of vertical writing and horizontal writing.

【０００８】[0008]

【実施例】以下、本発明の一実施例について、図面を参
照しながら説明する。図１は本発明の一実施例における
領域分割を実行する装置の構成を示したもので、１は領
域抽出を行う中央処理装置(以下「ＣＰＵ」という)で、
このＣＰＵ１は、図２に示すように、画像データ取込部
７，画像データ縮小部８，外接矩形抽出部９，文字矩形
識別部10，文字間・行間検出部11，行統合部12，文字領
域統合部13，文字認識部14からなる。２は文字認識プロ
グラムを格納したリードオンリーメモリ(以下「ＲＯ
Ｍ」という)、３はスキャナ４で読み取った画像データ
及び認識プログラムのデータを格納するランダムアクセ
スメモリ(以下「ＲＡＭ」という)、５は外部からＣＰＵ
１に対して指令を与えるためのキーボード、６はＣＰＵ
１が認識した認識結果を表示するＣＲＴである。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 shows the configuration of an apparatus for executing area division in one embodiment of the present invention. Reference numeral 1 denotes a central processing unit (hereinafter referred to as “CPU”) for area extraction,
As shown in FIG. 2, the CPU 1 includes an image data capturing unit 7, an image data reducing unit 8, a circumscribing rectangle extracting unit 9, a character rectangle identifying unit 10, a character spacing / line spacing detecting unit 11, a line integrating unit 12, and a character. It consists of a region integration unit 13 and a character recognition unit 14. 2 is a read-only memory (hereinafter referred to as “RO
3) Random access memory (hereinafter referred to as “RAM”) for storing the image data read by the scanner 4 and the data of the recognition program. 5 is an external CPU.
Keyboard for giving commands to 1, 6 is CPU
1 is a CRT that displays the recognition result recognized by 1.

【０００９】以下、図３及び図４のフローチャートと図
５の画像データ例とを基に文字領域抽出処理について説
明する。The character area extracting process will be described below with reference to the flowcharts of FIGS. 3 and 4 and the image data example of FIG.

【００１０】画像データ取込部７は、認識対象文書をス
キャナ４により２値画像データとして取り込んだ上〔ス
テップ１(以下各ステップを「Ｓ１」の如く表す)〕、そ
の取り込んだ二値画像データの文字領域抽出処理を画像
データ縮小部８において高速で行うために、例えば解像
度100DPI程度にＯＲ縮小してＲＡＭ３に格納する(Ｓ
２)。The image data capturing section 7 captures the document to be recognized as binary image data by the scanner 4 [step 1 (each step is represented as "S1" below)] and the captured binary image data. In order to perform the character area extraction processing of (1) at high speed in the image data reducing unit 8, for example, OR reduction is performed to a resolution of about 100 DPI and the result is stored in the RAM 3 (S
2).

【００１１】外接矩形抽出部９は、縮小された画像デー
タから黒画像が連結している箇所を検出して、連結して
いる黒画像の外接矩形を抽出した上、全ての外接矩形の
座標情報をＲＡＭ３に格納する(Ｓ３)。例えば、図５に
示した画像データは、図６に示したＫ１〜Ｋ８の８個の
外接矩形として抽出される。この外接矩形の座標系は左
上が原点で、水平方向の座標をｘで、垂直方向の座標を
ｙで表し、又、外接矩形は左上の座標(ｘ１，ｙ１)と右
下の座標(ｘ２，ｙ２)とで表す。The circumscribing rectangle extraction unit 9 detects a portion where the black images are connected from the reduced image data, extracts the circumscribing rectangle of the connected black images, and coordinate information of all the circumscribing rectangles. Is stored in the RAM 3 (S3). For example, the image data shown in FIG. 5 is extracted as eight circumscribing rectangles K1 to K8 shown in FIG. In the coordinate system of this circumscribed rectangle, the upper left is the origin, the horizontal coordinate is x, and the vertical coordinate is y, and the circumscribed rectangle is the upper left coordinate (x1, y1) and the lower right coordinate (x2, It is represented by y2).

【００１２】文字矩形識別部10は外接矩形の大きさ，黒
画素の数及び黒画素の密度に基づいて文字か文字以外か
に識別して、文字候補の矩形を抽出する(Ｓ４)。この場
合、文字以外の属性矩形として識別するのは、次の５つ
の条件の内のいずれか１つを満たせばよい。即ち、 1) 外接矩形の黒画素数が閾値のとき 2) 矩形の縦横比が閾値ＦＤ＿ＲＡＴＩＯ以上のとき 3) 外接矩形の短辺が閾値ＣＨＡＲ＿ＭＡＸ以上のとき 4) 外接矩形の長辺が閾値ＣＨＡＲ＿ＭＡＸ以上で、外
接矩形の黒画素密度が閾値ＰＥＲ＿ＤＩＡＧ＿ＭＩＮ以
下のとき 5) 外接矩形の長辺が閾値ＣＨＡＲ＿ＭＡＸ以上で、外
接矩形の黒画素密度が閾値ＰＥＲ＿ＤＩＡＧ＿ＭＡＸ以
上のとき又、この条件に当てはまらなかったものは文字候補矩形
として識別される。例えば、図６のＫ１〜Ｋ８の矩形は
すべて文字候補矩形として識別される。The character rectangle identifying unit 10 identifies a character or non-character based on the size of the circumscribing rectangle, the number of black pixels, and the density of black pixels, and extracts a character candidate rectangle (S4). In this case, to identify as an attribute rectangle other than a character, any one of the following five conditions may be satisfied. That is, 1) When the number of black pixels of the circumscribed rectangle is the threshold value 2) When the aspect ratio of the rectangle is the threshold value FD_RATIO or higher 3) When the short side of the circumscribed rectangle is the threshold value CHAR_MAX or higher 4) The long side of the circumscribed rectangle is the threshold value CHAR_MAX or higher When the black pixel density of the circumscribed rectangle is less than or equal to the threshold value PER_DIAG_MIN 5) When the long side of the circumscribed rectangle is greater than or equal to the threshold value CHAR_MAX and the black pixel density of the circumscribed rectangle is greater than or equal to the threshold value PER_DIAG_MAX. Identified as a candidate rectangle. For example, the rectangles K1 to K8 in FIG. 6 are all identified as character candidate rectangles.

【００１３】文字間・行間検出部11は、文字候補矩形を
基に、Ｓ５〜Ｓ12の処理を行う。The character-to-character / line-to-line detecting unit 11 performs the processing of S5 to S12 based on the character candidate rectangle.

【００１４】先ず、文字矩形識別部10で抽出した文字候
補矩形から各文字候補矩形の幅と高さを求めた上、その
中で出現回数が最も多いものを認識対象文書の基準文字
サイズとする(Ｓ５)。ただし、文字候補矩形の文字サイ
ズが閾値ＰＯＩＮＴ＿０６に満たない場合には、基準文
字サイズをＰＯＩＮＴ＿０６にし、又、出現回数が同数
の文字サイズが複数ある場合には、その中の最大サイズ
にする。例えば、図６の文字候補矩形の文字サイズが
５，８，９で、その出現回数はいずれも４回であるの
で、基準文字サイズは最も大きい数の９とする。First, the width and height of each character candidate rectangle are calculated from the character candidate rectangles extracted by the character rectangle identifying unit 10, and the one having the largest number of appearances is set as the reference character size of the recognition target document. (S5). However, when the character size of the character candidate rectangle is less than the threshold value POINT_06, the reference character size is set to POINT_06, and when there are a plurality of character sizes having the same number of appearances, the reference character size is set to the maximum size. For example, the character size of the character candidate rectangle in FIG. 6 is 5, 8 and 9, and the number of appearances thereof is 4, so the reference character size is set to the largest number of 9.

【００１５】次に、水平方向の矩形間の最頻距離を検出
する。各文字候補矩形に対して隣接する矩形を検出し
て、その距離を求め、矩形間の距離の出現回数が最も多
かったものを認識対象文書の水平方向の矩形間距離とし
て、その出現回数をＲＡＭ３のｈＣｏｕｎｔに、その距
離をＲＡＭ３のｈＤｉｓにそれぞれ記憶させる(Ｓ５)。
例えば、図６の外接矩形の水平矩形間の距離ｈＤｉｓは
５で、その出現回数ｈＣｏｕｎｔは３である。Next, the most frequent distance between the horizontal rectangles is detected. A rectangle adjacent to each of the character candidate rectangles is detected, the distance is calculated, and the number of appearances of the distance between the rectangles is set as the horizontal inter-rectangle distance of the recognition target document, and the appearance count is determined by the RAM 3 The distances are stored in the hCount of the RAM 3 and in the hDis of the RAM 3 (S5).
For example, the distance hDis between the horizontal rectangles of the circumscribing rectangle in FIG. 6 is 5 and the number of appearances hCount thereof is 3.

【００１６】同様にして、垂直方向の矩形間の最頻距離
を検出して、垂直矩形間の距離をＲＡＭ３のｖＤｉｓ
に、その出現回数をＲＡＭ３のｖＣｏｕｎｔに記憶させ
る(Ｓ５)。例えば、図６の外接矩形の垂直矩形間の距離
ｖＤｉｓは８で、その出現回数ｖＣｏｕｎｔは４であ
る。Similarly, the mode distance between the vertical rectangles is detected, and the distance between the vertical rectangles is calculated as vDis of the RAM 3.
Then, the number of appearances is stored in vCount of the RAM 3 (S5). For example, the distance vDis between the vertical rectangles of the circumscribing rectangle in FIG. 6 is 8, and the number of appearances vCount thereof is 4.

【００１７】このようにして検出した水平及び垂直矩形
間の距離と出現回数とから文字間・行間の決定を行う
(Ｓ６)。水平方向の矩形間距離の出現回数ｈＣｏｕｎｔ
と垂直方向の矩形間距離の出現回数ｖＣｏｕｎｔとに差
がある場合はＳ７の処理を行い、差がない場合はＳ10の
処理を行う。このとき、ｈＣｏｕｎｔとｖＣｏｕｎｔと
の差については、｜ｈＣｏｕｎｔ−ｖＣｏｕｎｔ｜÷(ｈＣｏｕｎｔ＋ｖＣｏｕｎｔ)×100 ＞閾値ＣＴ＿ＤＩＦＦの条件を満たせば差があると判定する。例えば、図６で
は計算結果が14％で、閾値ＣＴ＿ＤＩＦＦより小さいの
で、出現個数には差がないと判定して、Ｓ10の処理を行
う。Character spacing and line spacing are determined from the distance between the horizontal and vertical rectangles thus detected and the number of appearances.
(S6). Number of occurrences of horizontal distance between rectangles hCount
And the number of times vCount of appearance of the distance between the rectangles in the vertical direction are different, the process of S7 is performed, and if there is no difference, the process of S10 is performed. At this time, regarding the difference between hCount and vCount, it is determined that there is a difference if the condition of | hCount−vCount | ÷ (hCount + vCount) × 100> threshold CT_DIFF is satisfied. For example, in FIG. 6, the calculation result is 14%, which is smaller than the threshold value CT_DIFF. Therefore, it is determined that there is no difference in the number of appearances, and the process of S10 is performed.

【００１８】又、出現回数に差があった場合には、水平
方向の矩形間距離の出現回数ｈＣｏｕｎｔと垂直方向の
矩形間距離の出現回数ｖＣｏｕｎｔとを比較して(Ｓ
７)、ｖＣｏｕｎｔが多ければ認識対象文書は縦組み文
書であると判定して、ＲＡＭ３の矩形マージ方向フラグ
ｄｉｒＦｌａｇに垂直方向情報を、文字ピッチｃｈＰｉ
ｔｃｈに垂直方向の矩形間の距離ｖＤｉｓを、行間ピッ
チｌｎＰｉｔｃｈに水平方向の矩形間の距離ｈＤｉｓを
それぞれセットする(Ｓ８)。ｈＣｏｕｎｔが多ければ認
識対象文書は横組み文書であると判定して、矩形マージ
方向フラグｄｉｒＦｌａｇに水平方向情報を、文字ピッ
チｃｈＰｉｔｃｈに水平方向の矩形間の距離ｈＤｉｓ
を、行間ピッチｌｎＰｉｔｃｈに垂直方向の矩形間の距
離ｖＤｉｓをそれぞれセットする(Ｓ９)。When there is a difference in the number of appearances, the number of appearances hCount of the distance between rectangles in the horizontal direction and the number of appearances vCount of the distance between rectangles in the vertical direction are compared (S
7) If vCount is large, it is determined that the document to be recognized is a vertically aligned document, and the vertical direction information is set to the rectangular merge direction flag dirFlag of the RAM 3 and the character pitch chPi is set.
The distance vDis between the rectangles in the vertical direction is set to tch, and the distance hDis between the rectangles in the horizontal direction is set to the line pitch lnPitch (S8). If there are many hCounts, it is determined that the document to be recognized is a horizontal layout document, horizontal direction information is set in the rectangle merge direction flag dirFlag, and the horizontal distance hDis between rectangles in the character pitch chPitch.
Is set to the line pitch lnPitch, and the distance vDis between the rectangles in the vertical direction is set (S9).

【００１９】又、出現回数に差がなかった場合には、水
平方向の矩形間距離ｈＤｉｓと垂直方向の矩形間距離ｖ
Ｄｉｓを比較して(Ｓ10)、ｖＤｉｓがｈＤｉｓよりも短
ければ、認識対象文書は縦組み文書と判定して、矩形マ
ージ方向フラグｄｉｒＦｌａｇに垂直方向情報を、文字
ピッチｃｈＰｉｔｃｈに垂直方向の矩形間の距離ｖＤｉ
ｓを、行間ピッチｌｎＰｉｔｃｈに水平方向の矩形間の
距離ｈＤｉｓをそれぞれセットする(Ｓ11)。そして、ｈ
ＤｉｓがｖＤｉｓよりも短ければ、横組み文書と判定し
て、矩形マージ方向フラグｄｉｒＦｌａｇに水平方向情
報を、文字ピッチｃｈＰｉｔｃｈに水平方向の矩形間の
距離ｈＤｉｓを、行間ピッチｌｎＰｉｔｃｈに垂直方向
の矩形間の距離ｖＤｉｓをそれぞれセットする(Ｓ12)。When there is no difference in the number of appearances, the horizontal inter-rectangle distance hDis and the vertical inter-rectangle distance v
When Dis is compared (S10), and vDis is shorter than hDis, the recognition target document is determined to be a vertically-assembled document, vertical direction information is set in the rectangular merge direction flag dirFlag, and vertical rectangles are set in the character pitch chPitch. Distance vDi
s is set to the line pitch lnPitch, and the horizontal distance hDis between rectangles is set (S11). And h
If Dis is shorter than vDis, it is determined that the document is a horizontal document, and horizontal information is set in the rectangle merge direction flag dirFlag, the character pitch chPitch is the distance hDis between the horizontal rectangles, and the line pitch lnPitch is the vertical interval. The distance vDis of each is set (S12).

【００２０】例えば、図６では出現回数に差がなく、ｈ
Ｄｉｓが５で、ｖＤｉｓの８よりも距離が短いので、Ｓ
12の処理を行って、ｄｉｒＦｌａｇ＝水平方向，ｃｈＰ
ｉｔｃｈ＝５，ｌｎ＿Ｐｉｔｃｈ＝８がセットされる。For example, in FIG. 6, there is no difference in the number of appearances, and h
Since Dis is 5 and the distance is shorter than 8 of vDis, S
After performing 12 processes, dirFlag = horizontal direction, chP
Itch = 5, ln_Pitch = 8 are set.

【００２１】行統合部12は、このようにして検出した文
字間・行間から行統合を行い(Ｓ13)、矩形マージ方向フ
ラグｄｉｒＦｌａｇが垂直の場合は、垂直方向の矩形統
合(Ｓ14)、水平方向の矩形統合(Ｓ15)の順序で、ｄｉｒ
Ｆｌａｇが水平の場合は、水平方向の矩形統合(Ｓ16)、
垂直方向の矩形統合(Ｓ17)の順序で矩形の統合を行っ
て、行を抽出する。このとき、(文字間＜行間＜領域間)
の条件から、行統合の有効距離を文字間・行間検出部11
で検出した行間とする。例えば、図６では、ｄｉｒＦｌ
ａｇが水平方向となっているので、水平方向の矩形統合
処理を行った後、垂直方向の矩形統合処理を行う。その
際の行統合の有効距離はｌｎＰｉｔｃｈ＝８となり、矩
形統合の結果は図５のようになる。ここで、水平方向の
矩形の統合と垂直方向の矩形の統合をそれぞれ行うの
は、縦横混在文書に対応するためである。The line integration unit 12 performs line integration from the characters and lines detected in this way (S13). If the rectangle merge direction flag dirFlag is vertical, vertical rectangle integration (S14), horizontal direction. In the order of rectangle integration (S15) of
If the Flag is horizontal, horizontal rectangle integration (S16),
Rectangle integration is performed in the order of vertical rectangle integration (S17) to extract rows. At this time, (character spacing <line spacing <region spacing)
From the condition of, the effective distance of line integration is determined by the character / line spacing detection unit 11
It is the line spacing detected in. For example, in FIG. 6, dirFl
Since ag is in the horizontal direction, horizontal rectangle integration processing is performed, and then vertical rectangle integration processing is performed. The effective distance of line integration at that time is lnPitch = 8, and the result of rectangle integration is as shown in FIG. Here, the integration of the horizontal rectangles and the integration of the vertical rectangles are carried out in order to deal with a vertically and horizontally mixed document.

【００２２】文字領域統合部13は、行統合結果から文字
領域の抽出を行う(Ｓ18)。例えば、図７の行情報から抽
出した文字領域は図８のようになる。The character area integrating unit 13 extracts a character area from the line integration result (S18). For example, the character area extracted from the line information of FIG. 7 is as shown in FIG.

【００２３】そして、最後に抽出された文字領域に対し
て、文字認識処理を行う(Ｓ19)。Then, character recognition processing is performed on the finally extracted character area (S19).

【００２４】尚、本実施例では、ＮＯＩＳＥ＿ＭＩＮ ≦ ２ＦＤ＿ＲＡＴＩＯ ≦ 30 ＣＨＡＲ＿ＭＡＸ ≦ 100 ＰＥＲ＿ＤＩＡＧ＿ＭＩＮ ≦ 15 ＰＥＲ＿ＤＩＡＧ＿ＭＡＸ ≦ 80 ＰＯＩＮＴ＿０６ ≦ ８ＣＴ＿ＦＩＦＦ ≦ 20 の値とした。In this embodiment, NOISE_MIN ≤ 2 FD_RATIO ≤ 30 CHAR_MAX ≤ 100 PER_DIAG_MIN ≤ 15 PER_DIAG_MAX ≤ 80 POINT_06 ≤ 8 CT_FIFF ≤ 20.

【００２５】[0025]

【発明の効果】以上説明したように、本発明によれば、
矩形を統合する有効距離を行間とすることにより、文字
間隔が広い文書でも正確に領域抽出することができると
いう効果を奏する。As described above, according to the present invention,
By setting the effective distance for integrating the rectangles to be the line spacing, it is possible to accurately extract the region even in a document having a wide character spacing.

【００２６】又、行統合処理を水平方向と垂直方向とで
それぞれ個別に行うことによって、縦組みと横組みとの
混在文書にも対応できるという効果を奏する。Further, by performing the line integration processing individually in the horizontal direction and the vertical direction, it is possible to deal with a mixed document of vertical writing and horizontal writing.

[Brief description of drawings]

【図１】本発明の一実施例における領域分割を実行する
装置のブロック図である。FIG. 1 is a block diagram of an apparatus for performing area division according to an embodiment of the present invention.

【図２】本発明の一実施例のブロック図である。FIG. 2 is a block diagram of an embodiment of the present invention.

【図３】本発明の一実施例のフローチャートである。FIG. 3 is a flowchart of an embodiment of the present invention.

【図４】図３のフローチャートの続きである。FIG. 4 is a continuation of the flowchart of FIG.

【図５】本発明の一実施例の説明に使用する画像データ
の一例である。FIG. 5 is an example of image data used to describe an embodiment of the present invention.

【図６】本発明の一実施例において図５の画像データの
外接矩形の抽出結果を示すものである。FIG. 6 shows a result of extracting a circumscribing rectangle of the image data of FIG. 5 in an embodiment of the present invention.

【図７】本発明の一実施例において図６の外接矩形から
の行の抽出結果を示すものである。FIG. 7 shows a result of extracting rows from the circumscribed rectangle of FIG. 6 in an embodiment of the present invention.

【図８】本発明の一実施例において図７の行からの領域
の抽出結果を示すものである。FIG. 8 shows a result of extracting regions from the row of FIG. 7 in an embodiment of the present invention.

【図９】文字の領域の間隔が狭い画像データの例を示す
ものである。FIG. 9 shows an example of image data in which character regions are closely spaced.

【図１０】図９の画像データの外接矩形の抽出結果を示
すものである。FIG. 10 shows a result of extracting a circumscribing rectangle of the image data shown in FIG.

[Explanation of symbols]

１…中央処理装置(ＣＰＵ)、２…リードオンリーメモ
リ(ＲＯＭ)、３…ランダムアクセスメモリ(ＲＡＭ)、
４…スキャナ、５…キーボード、６…ＣＲＴ、
７…画像データ取込部、８…画像データ縮小部、９
…外接矩形抽出部、 10…文字矩形識別部、 11…文字
間・行間検出部、 12…行統合部、 13…文字領域統合
部、 14…文字認識部。1 ... Central processing unit (CPU), 2 ... Read only memory (ROM), 3 ... Random access memory (RAM),
4 ... Scanner, 5 ... Keyboard, 6 ... CRT,
7 ... Image data importing unit, 8 ... Image data reducing unit, 9
... circumscribed rectangle extraction unit, 10 ... character rectangle identification unit, 11 ... character / line spacing detection unit, 12 ... line integration unit, 13 ... character area integration unit, 14 ... character recognition unit.

Claims

[Claims]

1. A means for reducing image data of a binarized character recognition target document, a portion where black pixels are connected to each other is detected, and information on a circumscribed rectangle of the connected black pixels is stored in a memory. And a means for identifying a character candidate rectangle from the size of the circumscribing rectangle, the aspect ratio of the circumscribing rectangle, and the black pixel density, and a horizontal rectangular interval and a vertical rectangular interval of the character candidate rectangle. And a unit for detecting a character space and a line space, a unit for extracting the line by integrating the character candidate rectangles based on the line space, and a unit for integrating the line and extracting a character region. A character recognition device characterized in that