JPH06187489A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPH06187489A
JPH06187489A JP4337330A JP33733092A JPH06187489A JP H06187489 A JPH06187489 A JP H06187489A JP 4337330 A JP4337330 A JP 4337330A JP 33733092 A JP33733092 A JP 33733092A JP H06187489 A JPH06187489 A JP H06187489A
Authority
JP
Japan
Prior art keywords
character
rectangle
line
rectangles
interval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP4337330A
Other languages
Japanese (ja)
Inventor
Yumiko Ikemure
由美子 池牟▲禮▼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP4337330A priority Critical patent/JPH06187489A/en
Publication of JPH06187489A publication Critical patent/JPH06187489A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)

Abstract

PURPOSE:To exactly extract an area even in the case of a document for which the character intervals of binary data fetched from a scanner are wide. CONSTITUTION:After image data fetched from the scanner are reduced by an image data reduction part 8, a position where black picture elements are linked is detected by a circumscribed rectangle extraction part 9, and the information of the circumscribed rectangle of linked black picture elements is stored in a memory. When a character rectangle identification part 10 identifies the rectangle of character candidates based on the size of the circumscribed rectangle, the number of picture elements and the density of black picture elements, a character interval/line interval detection part 11 detects the character interval/ line interval both from an inter-rectangle distance between the rectangles adjacent to the up, own, right and left sides of the character candidate rectangle and from the number of times of appearance. On the condition of (character interval < line interval < area interval), a line merging part 12 extracts a line by the merging processing of rectangles with the valid merging distance of character candidate rectangles as the line interval. Then, a character area merging part 13 extracts the character area based on the extracted line, and a character recognition part 14 recognizes characters.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、印刷文書のデータベー
ス化や文書の再利用のために、スキャナ等の光学的手段
を用いて文書画像を取り込んだ上、取り込んだ画像デー
タから文字,図形,表等の属性毎に領域を抽出して、各
属性に応じた認識処理を行う文字認識装置に関するもの
である。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention captures a document image by using an optical means such as a scanner in order to make a database of a printed document and reuse the document, and then, a character, a graphic, The present invention relates to a character recognition device that extracts a region for each attribute such as a table and performs a recognition process according to each attribute.

【0002】[0002]

【従来の技術】従来の文字認識装置は、スキャナにより
取り込んだ二値データの黒画素の連結状態を調べ、黒画
素が連結しているかたまりに外接する矩形(以下「外接
矩形」という)の座標を検出して、メモリに格納した
上、予め定められた閾値以下の大きさの外接矩形を文字
候補として抽出して、この文字候補の矩形(以下「文字
候補矩形」という)を統合することにより、行を抽出す
る。そして、基準となる文字候補矩形(以下「基準矩
形」という)に対して、上下左右の4方向で最も近い文
字候補矩形(以下「最短距離矩形」という)を検出した
上、図9のように文字領域の間隔が狭い文書に対して、
図10のような文字領域間にまたがった行統合を防ぐた
め、基準矩形と検出された文字候補矩形との距離が基準
矩形の矩形サイズの半分未満であれば、2つの矩形は同
一文字領域内に存在するとみなして、矩形の統合を行
う。このとき、基準矩形と検出された最短距離矩形が上
下に位置していれば縦組み文字列、左右に位置していれ
ば横組み文字列と決定する。そこで、検出された行情報
から領域の抽出を行って、同一方向に隣り合う行の幅が
等しく行間の距離が閾値以下であれば、2つの行は同一
領域内にあるとして領域の統合を行った後、抽出した文
字領域について、文字の切り出し及び認識処理を行う。
2. Description of the Related Art A conventional character recognition device checks the connection state of black pixels of binary data captured by a scanner, and coordinates of a rectangle circumscribing a block in which black pixels are connected (hereinafter referred to as "circumscribing rectangle"). By detecting the, storing in memory, extracting a circumscribing rectangle of a size equal to or smaller than a predetermined threshold value as a character candidate, and integrating the character candidate rectangles (hereinafter referred to as “character candidate rectangles”). , To extract a row. Then, with respect to the reference character candidate rectangle (hereinafter referred to as "reference rectangle"), the closest character candidate rectangle (hereinafter referred to as "shortest distance rectangle") in four directions of up, down, left and right is detected, and as shown in FIG. For documents with narrow character areas,
To prevent line integration across character areas as shown in Fig. 10, if the distance between the reference rectangle and the detected character candidate rectangle is less than half the rectangle size of the reference rectangle, the two rectangles are within the same character area. Rectangles are merged, assuming that they exist in. At this time, if the shortest distance rectangle detected as the reference rectangle is located vertically, it is determined to be a vertically-assembled character string, and if it is located to the left or right, it is determined to be a horizontally-assembled character string. Therefore, an area is extracted from the detected row information, and if the widths of adjacent rows in the same direction are equal and the distance between the rows is equal to or less than a threshold value, it is determined that the two rows are in the same area and the areas are integrated. After that, character extraction and recognition processing is performed on the extracted character area.

【0003】[0003]

【発明が解決しようとする課題】しかしながら、従来の
方式では、矩形統合の有効距離を基準矩形の矩形サイズ
の半分未満としているため、図3のように文字間隔が文
字サイズの半分以上あるような文書では領域が複数に分
かれてしまうといった課題を有していた。
However, in the conventional method, since the effective distance of rectangle integration is set to less than half of the rectangle size of the reference rectangle, the character interval is more than half the character size as shown in FIG. The document has a problem that the area is divided into a plurality of areas.

【0004】本発明は、このような課題に鑑みてなされ
たもので、文字間隔が広い文書でも正確に領域抽出でき
る文字認識装置を提供することを目的としている。
The present invention has been made in view of the above problems, and an object thereof is to provide a character recognition device capable of accurately extracting a region even in a document having a wide character space.

【0005】[0005]

【課題を解決するための手段】本発明は、文字統合処理
を軽減するために、スキャナより取り込んだ画像データ
を縮小する手段と、縮小データに対して外接矩形を検出
して、外接矩形の大きさが予め定められた閾値以下の矩
形を文字候補矩形として識別する手段と、識別した全文
字候補矩形の最頻サイズを認識対象文書の基準文字サイ
ズとする手段と、各文字候補矩形に対して左右に隣接す
る矩形を検出して、矩形間の距離を算出した上、最も出
現回数の多い距離を認識対象文書の水平方向の矩形間距
離として、出現回数と距離とを記憶させる手段と、各文
字候補矩形に対して上下に隣接する矩形を検出して、矩
形間の距離を算出した上、最も出現回数の多い距離を認
識対象文書の垂直方向の矩形間距離として、出現回数と
距離とを記憶させる手段と、検出した水平方向の矩形間
距離及び垂直方向の矩形間距離とそれらの出現回数か
ら、文字組み方向,文字間,行間を検出する手段と、
(文字間<行間<領域間)の条件を用いて、文字候補矩形
の統合有効距離を行間として矩形の統合処理を行って、
行を抽出する手段と、抽出された行を基に文字領域を抽
出して、それぞれの文字領域について文字の切り出し及
び認識を行う手段とからなるものである。
SUMMARY OF THE INVENTION The present invention, in order to reduce the character integration process, a means for reducing image data captured by a scanner and a circumscribed rectangle for the reduced data to detect the size of the circumscribed rectangle. Means for identifying a rectangle having a predetermined threshold value or less as a character candidate rectangle, means for determining the mode size of all identified character candidate rectangles as the reference character size of the recognition target document, and for each character candidate rectangle Detecting adjacent rectangles on the left and right, calculating the distance between the rectangles, and storing the number of appearances and the distance as the distance between the rectangles having the largest number of appearances as the horizontal distance between the rectangles of the recognition target document. The rectangles that are vertically adjacent to the character candidate rectangle are detected, the distance between the rectangles is calculated, and the number of appearances and the distance are defined as the distance between the rectangles with the largest number of appearances, which is the vertical distance between the rectangles of the recognition target document. Memorize Means, from the rectangular distance and the number of occurrences of their rectangular distance and vertical detected horizontal, mojikumi direction, between characters, and means for detecting the line space,
Using the condition (between characters <line spacing <between areas), the rectangle integration processing is performed with the integrated effective distance of the character candidate rectangle as the line spacing.
It is composed of means for extracting a line and means for extracting a character region based on the extracted line and cutting out and recognizing a character for each character region.

【0006】[0006]

【作用】本発明によれば、矩形を統合する有効距離を行
間とすることにより、文字間隔が広い文書でも領域抽出
を正確に行うことができる。
According to the present invention, the effective distance for integrating rectangles is set to the line spacing, so that the region extraction can be accurately performed even in a document having a wide character space.

【0007】又、行統合処理を水平方向と垂直方向とで
それぞれ個別に行うことによって、縦組みと、横組みと
の混在文書にも対応できる。
By separately performing the line integration processing in the horizontal direction and the vertical direction, respectively, it is possible to deal with a mixed document of vertical writing and horizontal writing.

【0008】[0008]

【実施例】以下、本発明の一実施例について、図面を参
照しながら説明する。図1は本発明の一実施例における
領域分割を実行する装置の構成を示したもので、1は領
域抽出を行う中央処理装置(以下「CPU」という)で、
このCPU1は、図2に示すように、画像データ取込部
7,画像データ縮小部8,外接矩形抽出部9,文字矩形
識別部10,文字間・行間検出部11,行統合部12,文字領
域統合部13,文字認識部14からなる。2は文字認識プロ
グラムを格納したリードオンリーメモリ(以下「RO
M」という)、3はスキャナ4で読み取った画像データ
及び認識プログラムのデータを格納するランダムアクセ
スメモリ(以下「RAM」という)、5は外部からCPU
1に対して指令を与えるためのキーボード、6はCPU
1が認識した認識結果を表示するCRTである。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 shows the configuration of an apparatus for executing area division in one embodiment of the present invention. Reference numeral 1 denotes a central processing unit (hereinafter referred to as “CPU”) for area extraction,
As shown in FIG. 2, the CPU 1 includes an image data capturing unit 7, an image data reducing unit 8, a circumscribing rectangle extracting unit 9, a character rectangle identifying unit 10, a character spacing / line spacing detecting unit 11, a line integrating unit 12, and a character. It consists of a region integration unit 13 and a character recognition unit 14. 2 is a read-only memory (hereinafter referred to as “RO
3) Random access memory (hereinafter referred to as “RAM”) for storing the image data read by the scanner 4 and the data of the recognition program. 5 is an external CPU.
Keyboard for giving commands to 1, 6 is CPU
1 is a CRT that displays the recognition result recognized by 1.

【0009】以下、図3及び図4のフローチャートと図
5の画像データ例とを基に文字領域抽出処理について説
明する。
The character area extracting process will be described below with reference to the flowcharts of FIGS. 3 and 4 and the image data example of FIG.

【0010】画像データ取込部7は、認識対象文書をス
キャナ4により2値画像データとして取り込んだ上〔ス
テップ1(以下各ステップを「S1」の如く表す)〕、そ
の取り込んだ二値画像データの文字領域抽出処理を画像
データ縮小部8において高速で行うために、例えば解像
度100DPI程度にOR縮小してRAM3に格納する(S
2)。
The image data capturing section 7 captures the document to be recognized as binary image data by the scanner 4 [step 1 (each step is represented as "S1" below)] and the captured binary image data. In order to perform the character area extraction processing of (1) at high speed in the image data reducing unit 8, for example, OR reduction is performed to a resolution of about 100 DPI and the result is stored in the RAM 3 (S
2).

【0011】外接矩形抽出部9は、縮小された画像デー
タから黒画像が連結している箇所を検出して、連結して
いる黒画像の外接矩形を抽出した上、全ての外接矩形の
座標情報をRAM3に格納する(S3)。例えば、図5に
示した画像データは、図6に示したK1〜K8の8個の
外接矩形として抽出される。この外接矩形の座標系は左
上が原点で、水平方向の座標をxで、垂直方向の座標を
yで表し、又、外接矩形は左上の座標(x1,y1)と右
下の座標(x2,y2)とで表す。
The circumscribing rectangle extraction unit 9 detects a portion where the black images are connected from the reduced image data, extracts the circumscribing rectangle of the connected black images, and coordinate information of all the circumscribing rectangles. Is stored in the RAM 3 (S3). For example, the image data shown in FIG. 5 is extracted as eight circumscribing rectangles K1 to K8 shown in FIG. In the coordinate system of this circumscribed rectangle, the upper left is the origin, the horizontal coordinate is x, and the vertical coordinate is y, and the circumscribed rectangle is the upper left coordinate (x1, y1) and the lower right coordinate (x2, It is represented by y2).

【0012】文字矩形識別部10は外接矩形の大きさ,黒
画素の数及び黒画素の密度に基づいて文字か文字以外か
に識別して、文字候補の矩形を抽出する(S4)。この場
合、文字以外の属性矩形として識別するのは、次の5つ
の条件の内のいずれか1つを満たせばよい。即ち、 1) 外接矩形の黒画素数が閾値のとき 2) 矩形の縦横比が閾値FD_RATIO以上のとき 3) 外接矩形の短辺が閾値CHAR_MAX以上のとき 4) 外接矩形の長辺が閾値CHAR_MAX以上で、外
接矩形の黒画素密度が閾値PER_DIAG_MIN以
下のとき 5) 外接矩形の長辺が閾値CHAR_MAX以上で、外
接矩形の黒画素密度が閾値PER_DIAG_MAX以
上のとき 又、この条件に当てはまらなかったものは文字候補矩形
として識別される。例えば、図6のK1〜K8の矩形は
すべて文字候補矩形として識別される。
The character rectangle identifying unit 10 identifies a character or non-character based on the size of the circumscribing rectangle, the number of black pixels, and the density of black pixels, and extracts a character candidate rectangle (S4). In this case, to identify as an attribute rectangle other than a character, any one of the following five conditions may be satisfied. That is, 1) When the number of black pixels of the circumscribed rectangle is the threshold value 2) When the aspect ratio of the rectangle is the threshold value FD_RATIO or higher 3) When the short side of the circumscribed rectangle is the threshold value CHAR_MAX or higher 4) The long side of the circumscribed rectangle is the threshold value CHAR_MAX or higher When the black pixel density of the circumscribed rectangle is less than or equal to the threshold value PER_DIAG_MIN 5) When the long side of the circumscribed rectangle is greater than or equal to the threshold value CHAR_MAX and the black pixel density of the circumscribed rectangle is greater than or equal to the threshold value PER_DIAG_MAX. Identified as a candidate rectangle. For example, the rectangles K1 to K8 in FIG. 6 are all identified as character candidate rectangles.

【0013】文字間・行間検出部11は、文字候補矩形を
基に、S5〜S12の処理を行う。
The character-to-character / line-to-line detecting unit 11 performs the processing of S5 to S12 based on the character candidate rectangle.

【0014】先ず、文字矩形識別部10で抽出した文字候
補矩形から各文字候補矩形の幅と高さを求めた上、その
中で出現回数が最も多いものを認識対象文書の基準文字
サイズとする(S5)。ただし、文字候補矩形の文字サイ
ズが閾値POINT_06に満たない場合には、基準文
字サイズをPOINT_06にし、又、出現回数が同数
の文字サイズが複数ある場合には、その中の最大サイズ
にする。例えば、図6の文字候補矩形の文字サイズが
5,8,9で、その出現回数はいずれも4回であるの
で、基準文字サイズは最も大きい数の9とする。
First, the width and height of each character candidate rectangle are calculated from the character candidate rectangles extracted by the character rectangle identifying unit 10, and the one having the largest number of appearances is set as the reference character size of the recognition target document. (S5). However, when the character size of the character candidate rectangle is less than the threshold value POINT_06, the reference character size is set to POINT_06, and when there are a plurality of character sizes having the same number of appearances, the reference character size is set to the maximum size. For example, the character size of the character candidate rectangle in FIG. 6 is 5, 8 and 9, and the number of appearances thereof is 4, so the reference character size is set to the largest number of 9.

【0015】次に、水平方向の矩形間の最頻距離を検出
する。各文字候補矩形に対して隣接する矩形を検出し
て、その距離を求め、矩形間の距離の出現回数が最も多
かったものを認識対象文書の水平方向の矩形間距離とし
て、その出現回数をRAM3のhCountに、その距
離をRAM3のhDisにそれぞれ記憶させる(S5)。
例えば、図6の外接矩形の水平矩形間の距離hDisは
5で、その出現回数hCountは3である。
Next, the most frequent distance between the horizontal rectangles is detected. A rectangle adjacent to each of the character candidate rectangles is detected, the distance is calculated, and the number of appearances of the distance between the rectangles is set as the horizontal inter-rectangle distance of the recognition target document, and the appearance count is determined by the RAM 3 The distances are stored in the hCount of the RAM 3 and in the hDis of the RAM 3 (S5).
For example, the distance hDis between the horizontal rectangles of the circumscribing rectangle in FIG. 6 is 5 and the number of appearances hCount thereof is 3.

【0016】同様にして、垂直方向の矩形間の最頻距離
を検出して、垂直矩形間の距離をRAM3のvDis
に、その出現回数をRAM3のvCountに記憶させ
る(S5)。例えば、図6の外接矩形の垂直矩形間の距離
vDisは8で、その出現回数vCountは4であ
る。
Similarly, the mode distance between the vertical rectangles is detected, and the distance between the vertical rectangles is calculated as vDis of the RAM 3.
Then, the number of appearances is stored in vCount of the RAM 3 (S5). For example, the distance vDis between the vertical rectangles of the circumscribing rectangle in FIG. 6 is 8, and the number of appearances vCount thereof is 4.

【0017】このようにして検出した水平及び垂直矩形
間の距離と出現回数とから文字間・行間の決定を行う
(S6)。水平方向の矩形間距離の出現回数hCount
と垂直方向の矩形間距離の出現回数vCountとに差
がある場合はS7の処理を行い、差がない場合はS10の
処理を行う。このとき、hCountとvCountと
の差については、 |hCount−vCount|÷(hCount+vCount)×100 > 閾値CT_DIFF の条件を満たせば差があると判定する。例えば、図6で
は計算結果が14%で、閾値CT_DIFFより小さいの
で、出現個数には差がないと判定して、S10の処理を行
う。
Character spacing and line spacing are determined from the distance between the horizontal and vertical rectangles thus detected and the number of appearances.
(S6). Number of occurrences of horizontal distance between rectangles hCount
And the number of times vCount of appearance of the distance between the rectangles in the vertical direction are different, the process of S7 is performed, and if there is no difference, the process of S10 is performed. At this time, regarding the difference between hCount and vCount, it is determined that there is a difference if the condition of | hCount−vCount | ÷ (hCount + vCount) × 100> threshold CT_DIFF is satisfied. For example, in FIG. 6, the calculation result is 14%, which is smaller than the threshold value CT_DIFF. Therefore, it is determined that there is no difference in the number of appearances, and the process of S10 is performed.

【0018】又、出現回数に差があった場合には、水平
方向の矩形間距離の出現回数hCountと垂直方向の
矩形間距離の出現回数vCountとを比較して(S
7)、vCountが多ければ認識対象文書は縦組み文
書であると判定して、RAM3の矩形マージ方向フラグ
dirFlagに垂直方向情報を、文字ピッチchPi
tchに垂直方向の矩形間の距離vDisを、行間ピッ
チlnPitchに水平方向の矩形間の距離hDisを
それぞれセットする(S8)。hCountが多ければ認
識対象文書は横組み文書であると判定して、矩形マージ
方向フラグdirFlagに水平方向情報を、文字ピッ
チchPitchに水平方向の矩形間の距離hDis
を、行間ピッチlnPitchに垂直方向の矩形間の距
離vDisをそれぞれセットする(S9)。
When there is a difference in the number of appearances, the number of appearances hCount of the distance between rectangles in the horizontal direction and the number of appearances vCount of the distance between rectangles in the vertical direction are compared (S
7) If vCount is large, it is determined that the document to be recognized is a vertically aligned document, and the vertical direction information is set to the rectangular merge direction flag dirFlag of the RAM 3 and the character pitch chPi is set.
The distance vDis between the rectangles in the vertical direction is set to tch, and the distance hDis between the rectangles in the horizontal direction is set to the line pitch lnPitch (S8). If there are many hCounts, it is determined that the document to be recognized is a horizontal layout document, horizontal direction information is set in the rectangle merge direction flag dirFlag, and the horizontal distance hDis between rectangles in the character pitch chPitch.
Is set to the line pitch lnPitch, and the distance vDis between the rectangles in the vertical direction is set (S9).

【0019】又、出現回数に差がなかった場合には、水
平方向の矩形間距離hDisと垂直方向の矩形間距離v
Disを比較して(S10)、vDisがhDisよりも短
ければ、認識対象文書は縦組み文書と判定して、矩形マ
ージ方向フラグdirFlagに垂直方向情報を、文字
ピッチchPitchに垂直方向の矩形間の距離vDi
sを、行間ピッチlnPitchに水平方向の矩形間の
距離hDisをそれぞれセットする(S11)。そして、h
DisがvDisよりも短ければ、横組み文書と判定し
て、矩形マージ方向フラグdirFlagに水平方向情
報を、文字ピッチchPitchに水平方向の矩形間の
距離hDisを、行間ピッチlnPitchに垂直方向
の矩形間の距離vDisをそれぞれセットする(S12)。
When there is no difference in the number of appearances, the horizontal inter-rectangle distance hDis and the vertical inter-rectangle distance v
When Dis is compared (S10), and vDis is shorter than hDis, the recognition target document is determined to be a vertically-assembled document, vertical direction information is set in the rectangular merge direction flag dirFlag, and vertical rectangles are set in the character pitch chPitch. Distance vDi
s is set to the line pitch lnPitch, and the horizontal distance hDis between rectangles is set (S11). And h
If Dis is shorter than vDis, it is determined that the document is a horizontal document, and horizontal information is set in the rectangle merge direction flag dirFlag, the character pitch chPitch is the distance hDis between the horizontal rectangles, and the line pitch lnPitch is the vertical interval. The distance vDis of each is set (S12).

【0020】例えば、図6では出現回数に差がなく、h
Disが5で、vDisの8よりも距離が短いので、S
12の処理を行って、dirFlag=水平方向,chP
itch=5,ln_Pitch=8がセットされる。
For example, in FIG. 6, there is no difference in the number of appearances, and h
Since Dis is 5 and the distance is shorter than 8 of vDis, S
After performing 12 processes, dirFlag = horizontal direction, chP
Itch = 5, ln_Pitch = 8 are set.

【0021】行統合部12は、このようにして検出した文
字間・行間から行統合を行い(S13)、矩形マージ方向フ
ラグdirFlagが垂直の場合は、垂直方向の矩形統
合(S14)、水平方向の矩形統合(S15)の順序で、dir
Flagが水平の場合は、水平方向の矩形統合(S16)、
垂直方向の矩形統合(S17)の順序で矩形の統合を行っ
て、行を抽出する。このとき、(文字間<行間<領域間)
の条件から、行統合の有効距離を文字間・行間検出部11
で検出した行間とする。例えば、図6では、dirFl
agが水平方向となっているので、水平方向の矩形統合
処理を行った後、垂直方向の矩形統合処理を行う。その
際の行統合の有効距離はlnPitch=8となり、矩
形統合の結果は図5のようになる。ここで、水平方向の
矩形の統合と垂直方向の矩形の統合をそれぞれ行うの
は、縦横混在文書に対応するためである。
The line integration unit 12 performs line integration from the characters and lines detected in this way (S13). If the rectangle merge direction flag dirFlag is vertical, vertical rectangle integration (S14), horizontal direction. In the order of rectangle integration (S15) of
If the Flag is horizontal, horizontal rectangle integration (S16),
Rectangle integration is performed in the order of vertical rectangle integration (S17) to extract rows. At this time, (character spacing <line spacing <region spacing)
From the condition of, the effective distance of line integration is determined by the character / line spacing detection unit 11
It is the line spacing detected in. For example, in FIG. 6, dirFl
Since ag is in the horizontal direction, horizontal rectangle integration processing is performed, and then vertical rectangle integration processing is performed. The effective distance of line integration at that time is lnPitch = 8, and the result of rectangle integration is as shown in FIG. Here, the integration of the horizontal rectangles and the integration of the vertical rectangles are carried out in order to deal with a vertically and horizontally mixed document.

【0022】文字領域統合部13は、行統合結果から文字
領域の抽出を行う(S18)。例えば、図7の行情報から抽
出した文字領域は図8のようになる。
The character area integrating unit 13 extracts a character area from the line integration result (S18). For example, the character area extracted from the line information of FIG. 7 is as shown in FIG.

【0023】そして、最後に抽出された文字領域に対し
て、文字認識処理を行う(S19)。
Then, character recognition processing is performed on the finally extracted character area (S19).

【0024】尚、本実施例では、 NOISE_MIN ≦ 2 FD_RATIO ≦ 30 CHAR_MAX ≦ 100 PER_DIAG_MIN ≦ 15 PER_DIAG_MAX ≦ 80 POINT_06 ≦ 8 CT_FIFF ≦ 20 の値とした。In this embodiment, NOISE_MIN ≤ 2 FD_RATIO ≤ 30 CHAR_MAX ≤ 100 PER_DIAG_MIN ≤ 15 PER_DIAG_MAX ≤ 80 POINT_06 ≤ 8 CT_FIFF ≤ 20.

【0025】[0025]

【発明の効果】以上説明したように、本発明によれば、
矩形を統合する有効距離を行間とすることにより、文字
間隔が広い文書でも正確に領域抽出することができると
いう効果を奏する。
As described above, according to the present invention,
By setting the effective distance for integrating the rectangles to be the line spacing, it is possible to accurately extract the region even in a document having a wide character spacing.

【0026】又、行統合処理を水平方向と垂直方向とで
それぞれ個別に行うことによって、縦組みと横組みとの
混在文書にも対応できるという効果を奏する。
Further, by performing the line integration processing individually in the horizontal direction and the vertical direction, it is possible to deal with a mixed document of vertical writing and horizontal writing.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例における領域分割を実行する
装置のブロック図である。
FIG. 1 is a block diagram of an apparatus for performing area division according to an embodiment of the present invention.

【図2】本発明の一実施例のブロック図である。FIG. 2 is a block diagram of an embodiment of the present invention.

【図3】本発明の一実施例のフローチャートである。FIG. 3 is a flowchart of an embodiment of the present invention.

【図4】図3のフローチャートの続きである。FIG. 4 is a continuation of the flowchart of FIG.

【図5】本発明の一実施例の説明に使用する画像データ
の一例である。
FIG. 5 is an example of image data used to describe an embodiment of the present invention.

【図6】本発明の一実施例において図5の画像データの
外接矩形の抽出結果を示すものである。
FIG. 6 shows a result of extracting a circumscribing rectangle of the image data of FIG. 5 in an embodiment of the present invention.

【図7】本発明の一実施例において図6の外接矩形から
の行の抽出結果を示すものである。
FIG. 7 shows a result of extracting rows from the circumscribed rectangle of FIG. 6 in an embodiment of the present invention.

【図8】本発明の一実施例において図7の行からの領域
の抽出結果を示すものである。
FIG. 8 shows a result of extracting regions from the row of FIG. 7 in an embodiment of the present invention.

【図9】文字の領域の間隔が狭い画像データの例を示す
ものである。
FIG. 9 shows an example of image data in which character regions are closely spaced.

【図10】図9の画像データの外接矩形の抽出結果を示
すものである。
FIG. 10 shows a result of extracting a circumscribing rectangle of the image data shown in FIG.

【符号の説明】[Explanation of symbols]

1…中央処理装置(CPU)、 2…リードオンリーメモ
リ(ROM)、 3…ランダムアクセスメモリ(RAM)、
4…スキャナ、 5…キーボード、 6…CRT、
7…画像データ取込部、 8…画像データ縮小部、 9
…外接矩形抽出部、 10…文字矩形識別部、 11…文字
間・行間検出部、 12…行統合部、 13…文字領域統合
部、 14…文字認識部。
1 ... Central processing unit (CPU), 2 ... Read only memory (ROM), 3 ... Random access memory (RAM),
4 ... Scanner, 5 ... Keyboard, 6 ... CRT,
7 ... Image data importing unit, 8 ... Image data reducing unit, 9
... circumscribed rectangle extraction unit, 10 ... character rectangle identification unit, 11 ... character / line spacing detection unit, 12 ... line integration unit, 13 ... character area integration unit, 14 ... character recognition unit.

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 二値化された文字認識対象文書の画像デ
ータを縮小する手段と、 黒画素が連結している箇所を検出して、連結している前
記黒画素の外接矩形の情報をメモリに格納する手段と、 前記外接矩形の大きさ,前記外接矩形の縦横比及び黒画
素密度から文字候補矩形を識別する手段と、 前記文字候補矩形の水平方向の矩形間隔及び垂直方向の
矩形間隔を検出して、文字間及び行間を検出する手段
と、 前記行間を基に前記文字候補矩形を統合して、行を抽出
する手段と、 前記行を統合して文字領域を抽出する手段とを具備した
ことを特徴とする文字認識装置。
1. A means for reducing image data of a binarized character recognition target document, a portion where black pixels are connected to each other is detected, and information on a circumscribed rectangle of the connected black pixels is stored in a memory. And a means for identifying a character candidate rectangle from the size of the circumscribing rectangle, the aspect ratio of the circumscribing rectangle, and the black pixel density, and a horizontal rectangular interval and a vertical rectangular interval of the character candidate rectangle. And a unit for detecting a character space and a line space, a unit for extracting the line by integrating the character candidate rectangles based on the line space, and a unit for integrating the line and extracting a character region. A character recognition device characterized in that
JP4337330A 1992-12-17 1992-12-17 Character recognizing device Pending JPH06187489A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP4337330A JPH06187489A (en) 1992-12-17 1992-12-17 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP4337330A JPH06187489A (en) 1992-12-17 1992-12-17 Character recognizing device

Publications (1)

Publication Number Publication Date
JPH06187489A true JPH06187489A (en) 1994-07-08

Family

ID=18307614

Family Applications (1)

Application Number Title Priority Date Filing Date
JP4337330A Pending JPH06187489A (en) 1992-12-17 1992-12-17 Character recognizing device

Country Status (1)

Country Link
JP (1) JPH06187489A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1063776A (en) * 1996-08-16 1998-03-06 Fujitsu Ltd Method and device for estimating character string direction
US7660014B2 (en) 2006-01-17 2010-02-09 Konica Minolta Business Technologies, Inc. Image processing apparatus capable of extracting rule from document image with high precision
US8208744B2 (en) 2006-01-23 2012-06-26 Konica Minolta Business Technologies, Inc. Image processing apparatus capable of accurately and quickly determining character part included in image
US8213748B2 (en) 2008-02-26 2012-07-03 Fuji Xerox Co., Ltd. Generating an electronic document with reference to allocated font corresponding to character identifier from an image
US8411955B2 (en) 2007-02-21 2013-04-02 Fuji Xerox Co., Ltd. Image processing apparatus, image processing method and computer-readable medium
JP2019515374A (en) * 2016-08-31 2019-06-06 バイドゥ オンライン ネットワーク テクノロジー (ベイジン) カンパニー リミテッド Method and apparatus for recognizing character areas in an image

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1063776A (en) * 1996-08-16 1998-03-06 Fujitsu Ltd Method and device for estimating character string direction
US7660014B2 (en) 2006-01-17 2010-02-09 Konica Minolta Business Technologies, Inc. Image processing apparatus capable of extracting rule from document image with high precision
US8208744B2 (en) 2006-01-23 2012-06-26 Konica Minolta Business Technologies, Inc. Image processing apparatus capable of accurately and quickly determining character part included in image
US8411955B2 (en) 2007-02-21 2013-04-02 Fuji Xerox Co., Ltd. Image processing apparatus, image processing method and computer-readable medium
US8213748B2 (en) 2008-02-26 2012-07-03 Fuji Xerox Co., Ltd. Generating an electronic document with reference to allocated font corresponding to character identifier from an image
JP2019515374A (en) * 2016-08-31 2019-06-06 バイドゥ オンライン ネットワーク テクノロジー (ベイジン) カンパニー リミテッド Method and apparatus for recognizing character areas in an image
US10803338B2 (en) 2016-08-31 2020-10-13 Baidu Online Network Technology (Beijing) Co., Ltd. Method and device for recognizing the character area in a image

Similar Documents

Publication Publication Date Title
US5502777A (en) Method and apparatus for recognizing table and figure having many lateral and longitudinal lines
JP3411472B2 (en) Pattern extraction device
JPH06187489A (en) Character recognizing device
JPH0950527A (en) Frame extracting device and rectangle extracting device
JP2845370B2 (en) Character recognition method
JP2868134B2 (en) Image processing method and apparatus
JP2890306B2 (en) Table space separation apparatus and table space separation method
CN1983302A (en) Program, method and device for determining line direction
JP3476595B2 (en) Image area division method and image binarization method
JP3276555B2 (en) Format recognition device and character reader
JPH07160810A (en) Character recognizing device
JP2917427B2 (en) Drawing reader
JPH06180771A (en) English letter recognizing device
JPH02116987A (en) Character recognizing device
JP3095470B2 (en) Character recognition device
JP3437296B2 (en) High-speed character string extraction device
JP7043667B1 (en) Information processing equipment, programs, and information processing methods
JPH07168911A (en) Document recognition device
JPH0628520A (en) Character recognition device
JP2917394B2 (en) Character recognition device and character segmentation method
JPH0589244A (en) Method and device for pattern matching
JPH0721309A (en) Document recognizing device
JPH0449148B2 (en)
JPH05274472A (en) Image recognizing device
JP2978801B2 (en) Character input method for handwritten character recognition