JPH07160811A

JPH07160811A - Character recognition device

Info

Publication number: JPH07160811A
Application number: JP5304012A
Authority: JP
Inventors: Yumiko Ikemure; 由美子池牟禮
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1993-12-03
Filing date: 1993-12-03
Publication date: 1995-06-23

Abstract

(57) [Abstract] (Amended)

[Purpose] To make it possible to correct the size and attribute of a region.

[Constitution] The device comprises: means for extracting regions, each tagged with the attribute character, table, figure, image, or ruled line, from binary image data obtained by binarizing a document to be recognized; means for displaying the extracted regions superimposed on the binary image data; means for displaying region frames and region attributes superimposed on each other; means for enlarging or reducing the size of a region; means for changing the attribute of a region; and means for performing recognition processing on a region according to its attribute.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a character recognition device that, for the purpose of building databases from printed documents and reusing documents, extracts regions by attribute (characters, figures, tables, images, ruled lines, and so on) from binary image data captured by optical means such as a scanner, and performs recognition processing appropriate to each attribute.

[0002]

[Prior Art] The flow of recognition processing in a conventional character recognition device is as follows.

(1) A scanner captures binary image data of the document to be recognized.

(2) Regions with the attributes character, figure, table, image (photograph), or ruled line are extracted from the captured binary image data, and a frame indicating the extent of each extracted region is displayed together with the region's attribute.

(3) From the displayed region results, the region to be recognized is selected with a pointing device such as a mouse.

(4) Recognition processing appropriate to the region's attribute is performed on the selected region.

[0003] When a character region is selected, character recognition is performed. When a figure region is selected, the figure is vectorized. When a table region is selected, the table structure is recognized and then the characters in each cell are recognized. For an image region, the image data is compressed to reduce the data volume.

(5) Character recognition results are displayed. For results with low recognition confidence, character editing such as choosing among multiple candidate characters or inserting and deleting characters is performed to obtain the desired data.

[0004]

[Problem to Be Solved by the Invention] In the conventional character recognition device described above, however, something that is actually an image is sometimes classified as characters. Character recognition is then applied to the image, and the recognition result is nonsense. In such cases the operator has to correct the nonsensical recognition results one by one, which places a very heavy burden on the operator.

[0005] The object of the present invention is therefore to provide a character recognition device that reduces the burden on the operator.

[0006]

[Means for Solving the Problem] The present invention comprises: means for extracting regions, each tagged with the attribute character, table, figure, image, or ruled line, from binary image data obtained by binarizing a document to be recognized; means for displaying the extracted regions superimposed on the binary image data; means for displaying region frames and region attributes superimposed on each other; means for enlarging or reducing the size of a region; means for changing the attribute of a region; and means for performing recognition processing on a region according to its attribute.

[0007]

[Operation] With the above configuration, the region correction means changes the attribute or size of a region, so the operator can set the correct attribute and size before recognition processing. This improves the accuracy of the subsequent recognition and reduces the operator's correction effort.

[0008]

[Embodiment] A character recognition device according to one embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram of the character recognition device according to the embodiment. Reference numeral 1 denotes a central processing unit (hereinafter CPU) that executes a character recognition program stored in read-only memory (hereinafter ROM) 2. Random access memory (hereinafter RAM) 3 stores the image data of the document to be recognized, captured from scanner 4, and the data used by the character recognition program. Keyboard 5 and mouse 6 are devices through which the operator issues confirmation and correction commands for the recognition results. CRT 7 is a display device that displays the results of the recognition executed by CPU 1.

[0009] FIG. 2 is a flowchart of the character recognition device according to one embodiment of the present invention. FIG. 3 is a display example in which frames representing regions are superimposed on the binary image data, and FIG. 4 is a display example in which regions and their attributes are superimposed. The character recognition processing of the device is described below with reference to FIGS. 2 to 4.

[0010] Scanner 4 captures binary image data of the document to be recognized (S1). In this embodiment, to speed up region extraction, the captured binary image data is OR-reduced to a resolution of about 100 DPI and stored in RAM 3.
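The OR reduction mentioned in S1 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `or_reduce`, the list-of-lists image representation, and the integer reduction factor are all assumptions (the source gives only the target resolution of about 100 DPI, not the scan resolution). In OR reduction, an output pixel is black if any pixel in the corresponding input block is black, so thin strokes and ruled lines are not lost when the image is shrunk.

```python
def or_reduce(image, factor):
    """Shrink a binary image (rows of 0/1 values) by `factor` in each direction.

    Each output pixel is the logical OR of a factor x factor block of input
    pixels. `factor` is an assumed parameter, e.g. 4 for a hypothetical
    400 DPI scan reduced to about 100 DPI.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    reduced = []
    for top in range(0, height, factor):
        row = []
        for left in range(0, width, factor):
            # Blocks at the right and bottom edges may be smaller than factor.
            block = (
                image[y][x]
                for y in range(top, min(top + factor, height))
                for x in range(left, min(left + factor, width))
            )
            row.append(1 if any(block) else 0)
        reduced.append(row)
    return reduced
```

A plain subsampling reduction (keeping every fourth pixel) would be faster still, but it could drop one-pixel-wide ruled lines entirely, which is presumably why an OR reduction is used before region extraction.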

[0011] To let the operator confirm whether recognition processing should be performed on the captured image data, the captured image data is displayed on CRT 7 (S2). The image data displayed here has been reduced so that the whole image fits on the display.

[0012] Next, input from keyboard 5 or mouse 6 determines whether processing of the captured image data continues (S3). If the operator chooses to end processing, control jumps to end.

[0013] If processing continues, regions with the attributes character, table, figure, image (photograph), or ruled line are extracted from the 100 DPI image data prepared for region extraction (S4).

[0014] As shown in FIG. 3, frames indicating the extent of each region extracted in S4 are superimposed on the image data and displayed on CRT 7 (S5). The device then waits for an external command (input) from the operator via keyboard 5 or mouse 6 (S6).
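The superimposed display of S5 can be pictured with a small sketch. Everything named here is hypothetical: the `Region` record with inclusive pixel coordinates, the attribute stored as a string, and the `overlay_frames` helper are illustrative assumptions, since the source does not describe how regions are stored or drawn.

```python
from dataclasses import dataclass


@dataclass
class Region:
    """An extracted region: inclusive pixel bounds plus an attribute label."""
    left: int
    top: int
    right: int
    bottom: int
    attribute: str  # e.g. "character", "table", "figure", "image", "ruled_line"


def overlay_frames(image, regions, frame_value=1):
    """Return a copy of `image` with each region's rectangular frame drawn on it.

    The input image is left untouched so the display can be redrawn after a
    region is corrected.
    """
    canvas = [row[:] for row in image]
    for r in regions:
        for x in range(r.left, r.right + 1):
            canvas[r.top][x] = frame_value      # top edge
            canvas[r.bottom][x] = frame_value   # bottom edge
        for y in range(r.top, r.bottom + 1):
            canvas[y][r.left] = frame_value     # left edge
            canvas[y][r.right] = frame_value    # right edge
    return canvas
```

On a real display the frame would be drawn in a separate plane or with an XOR pen rather than written into the image; the copy here stands in for that separation.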

[0015] When input arrives from keyboard 5 or mouse 6, the device determines whether a region to be recognized has been designated (S7). If a region to be recognized has been selected, control jumps to S16. Otherwise it proceeds to S8.

[0016] In S8, the device determines whether enlarged display has been selected. If so, control jumps to the enlarged display processing of S13. Otherwise it proceeds to S9.

[0017] In S9, the device determines whether reduced display has been selected. If so, control jumps to the reduced display processing of S14. Otherwise it proceeds to S10.

[0018] In S10, the device determines whether region correction has been selected. If so, control jumps to the region correction processing of S15. Otherwise it proceeds to S11.

[0019] In S11, the device determines whether switching of the region extraction result display has been selected. If so, control proceeds to S12. For any other input, it returns to the input wait state of S6.
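The chain of tests S7 to S11 amounts to a dispatch on the operator's command. The sketch below follows the order of the tests in the flowchart description; the command names (`"select_region"`, `"zoom_in"`, and so on) are invented for illustration and do not appear in the source.

```python
def next_step(command):
    """Return the flowchart step reached from the input wait state S6.

    The branches are tested in the order S7, S8, S9, S10, S11. Any command
    that matches none of them leads back to S6.
    """
    branches = [
        ("select_region", "S16"),   # S7: region to recognize selected
        ("zoom_in", "S13"),         # S8: enlarged display selected
        ("zoom_out", "S14"),        # S9: reduced display selected
        ("correct_region", "S15"),  # S10: region correction selected
        ("toggle_view", "S12"),     # S11: display switching selected
    ]
    for name, target in branches:
        if command == name:
            return target
    return "S6"
```

Because S12 to S15 all return to S6 when they finish, the operator can zoom, switch views, and correct regions any number of times before selecting a region for recognition.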

[0020] S12 is the processing performed when display switching is selected with keyboard 5 or mouse 6. If the current display shows the image data and region frames, as in FIG. 3, it changes to show the region frames and region attributes, as in FIG. 4. Conversely, if the current display shows the region frames and region attributes, as in FIG. 4, it changes to show the image data and region frames, as in FIG. 3. In other words, display switching toggles between the states of FIG. 3 and FIG. 4. The state of FIG. 3 is provided for checking whether the extent of each extracted region has been detected accurately against the image data, and the state of FIG. 4 for checking whether the attribute of each region has been detected correctly. Attributes could instead be distinguished by the line pattern of the frame or by color, but such distinctions cannot be made out on a monochrome CRT 7, so this display-switching scheme was adopted. When the display switching finishes, control returns to the input wait state of S6.

[0021] S13 is the processing performed when enlarged display is selected with keyboard 5 or mouse 6. The magnification of the currently displayed image data is raised by one step and the image is redrawn. This function allows detected regions to be corrected with high precision. When the enlarged display processing finishes, control returns to the input wait state of S6.

[0022] S14 is the processing performed when reduced display is selected with keyboard 5 or mouse 6. The magnification of the currently displayed image data is lowered by one step and the image is redrawn. This function is selected when the operator wants to see the whole document again after enlarging the image data. When the reduced display processing finishes, control returns to the input wait state of S6.
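The three display operations S12, S13, and S14 only change how the current result is shown, so they can be modeled as a small view state. This is a sketch under assumptions: the class name `ViewState`, the particular magnification steps in `ZOOM_LEVELS`, and the clamping at the smallest and largest step are all hypothetical, since the source says only that the magnification moves by one step.

```python
from dataclasses import dataclass

# Assumed magnification steps; the source does not list the actual values.
ZOOM_LEVELS = (0.25, 0.5, 1.0, 2.0, 4.0)


@dataclass
class ViewState:
    show_attributes: bool = False  # False: FIG. 3 view, True: FIG. 4 view
    zoom_index: int = 0            # index into ZOOM_LEVELS

    def toggle(self):
        """S12: switch between the FIG. 3 and FIG. 4 displays."""
        self.show_attributes = not self.show_attributes

    def zoom_in(self):
        """S13: raise the magnification by one step (clamped, by assumption)."""
        self.zoom_index = min(self.zoom_index + 1, len(ZOOM_LEVELS) - 1)

    def zoom_out(self):
        """S14: lower the magnification by one step (clamped, by assumption)."""
        self.zoom_index = max(self.zoom_index - 1, 0)

    @property
    def magnification(self):
        return ZOOM_LEVELS[self.zoom_index]
```

Keeping the view state separate from the region list means that none of these operations can alter the extraction result; only the region correction of S15 does that.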

[0023] S15 is the processing performed when region correction is selected with keyboard 5 or mouse 6. This function is selected when a detected region needs to be corrected, for example when the region extraction result contains an error. The available corrections are changing a region's coordinate values, changing its attribute, and deleting, adding, splitting, and merging regions, so the operator can obtain the desired region result. When the region correction processing finishes, control returns to the input wait state of S6.
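The correction operations listed for S15 can be sketched on a simple region record. The `Region` type, the attribute strings, and every function name below are assumptions made for illustration; the source names the operations but not their interfaces. Adding and deleting a region are ordinary list operations and are omitted.

```python
from dataclasses import dataclass, replace

ATTRIBUTES = {"character", "table", "figure", "image", "ruled_line"}


@dataclass(frozen=True)
class Region:
    left: int
    top: int
    right: int   # inclusive
    bottom: int  # inclusive
    attribute: str


def resize(region, left, top, right, bottom):
    """Change the coordinate values of a region, keeping its attribute."""
    return replace(region, left=left, top=top, right=right, bottom=bottom)


def change_attribute(region, attribute):
    """Change the attribute, e.g. from "character" to "image"."""
    if attribute not in ATTRIBUTES:
        raise ValueError(f"unknown attribute: {attribute!r}")
    return replace(region, attribute=attribute)


def split_horizontally(region, y):
    """Split into an upper part ending at row y and a lower part from y + 1."""
    if not region.top <= y < region.bottom:
        raise ValueError("split row must lie inside the region")
    return replace(region, bottom=y), replace(region, top=y + 1)


def merge(a, b):
    """Merge two regions into their bounding box, keeping a's attribute."""
    return Region(
        min(a.left, b.left),
        min(a.top, b.top),
        max(a.right, b.right),
        max(a.bottom, b.bottom),
        a.attribute,
    )
```

For the failure case described in paragraph [0004], a photograph detected as characters, the operator's fix corresponds to a single `change_attribute(region, "image")` before recognition runs.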

[0024] S16 is the processing reached from S7, that is, when a region to be recognized has been selected with keyboard 5 or mouse 6. Recognition processing appropriate to the region's attribute is performed on the selected region. When a character region is selected, character recognition is performed. When a figure region is selected, the figure is vectorized. When a table region is selected, the table structure is recognized and then the characters in each cell are recognized. For an image region, the image data is compressed to reduce the data volume. Character recognition results are displayed, and for results with low recognition confidence, character editing such as choosing among multiple candidate characters or inserting and deleting characters is performed to obtain the desired data.
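The attribute-dependent processing of S16 is essentially a lookup from attribute to processing steps. The sketch below records only what the text states; the table name, the function name, and the step descriptions as strings are illustrative assumptions. The source does not say what processing a ruled-line region receives, so that attribute is deliberately left out rather than guessed.

```python
# Processing applied per attribute, as described for S16.
PROCESSING_BY_ATTRIBUTE = {
    "character": ["character recognition"],
    "figure": ["vectorization"],
    "table": ["table structure recognition", "character recognition in each cell"],
    "image": ["image data compression"],
}


def processing_steps(attribute):
    """Return the ordered processing steps for a region with this attribute."""
    try:
        return PROCESSING_BY_ATTRIBUTE[attribute]
    except KeyError:
        raise ValueError(
            f"no processing described for attribute {attribute!r}"
        ) from None
```

The lookup makes the value of the S15 correction concrete: changing a region's attribute changes which entry is used, so a mislabeled photograph is compressed instead of being passed to character recognition.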

[0025]

[Effect of the Invention] Because the present invention is configured as described above, the size and attribute of a region can be corrected before recognition processing. This improves recognition accuracy and lightens the operator's burden.

[Brief description of drawings]

[FIG. 1] Block diagram of a character recognition device according to one embodiment of the present invention.

[FIG. 2] Flowchart of the character recognition device according to one embodiment of the present invention.

[FIG. 3] Display example in which frames representing regions are superimposed on the binary image data in the character recognition device according to one embodiment of the present invention.

[FIG. 4] Display example in which regions and their attributes are superimposed in the character recognition device according to one embodiment of the present invention.

[Explanation of symbols]

1 Central processing unit (CPU)
2 Read-only memory (ROM)
3 Random access memory (RAM)
4 Scanner
5 Keyboard
6 Mouse
7 CRT

Claims

[Claims]

1. A character recognition device comprising: means for extracting regions, each tagged with the attribute character, table, figure, image, or ruled line, from binary image data obtained by binarizing a document to be recognized; means for displaying the extracted regions superimposed on the binary image data; means for displaying region frames and region attributes superimposed on each other; means for enlarging or reducing the size of a region; means for changing the attribute of a region; and means for performing recognition processing on a region according to its attribute.