JPH11250177A

JPH11250177A - Document reader

Info

Publication number: JPH11250177A
Application number: JP10049673A
Authority: JP
Inventors: Motomitsu Kikuchi; 基充菊地
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1998-03-02
Filing date: 1998-03-02
Publication date: 1999-09-17

Abstract

PROBLEM TO BE SOLVED: To provide a document reader capable of easily executing correction processing for a recognition result. SOLUTION: The input image data S1 of a document P are sent from a scanner part 1 to an image input part 22. The data S1 are entered into the input part 22 and stored in a memory 40. An image data recognition part 23 executes character recognition processing for the data S1 stored in the memory 40 and outputs a recognition result. The recognition result is sent to a monitor 51 through a picture display part 26 and displayed on the monitor 51. An operator displays the data S1 on the monitor 51 as a guide for correcting the recognition result. In this case, a character frame extraction part 25 extracts a character frame from the data S1 and a character frame conversion part 25 deletes the character frame. Then an image obtained by removing the character frame from the data S1 is displayed on the monitor 51. The operator executes correction processing by collating each character displayed on the monitor 51 with the recognition result.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、例えば、所定のフ
ォーマットの伝票等のような文字枠中に文字が記載され
た帳票の文書画像を認識し、その認識結果に対する修正
処理が容易にできるようにした文書読取装置に関するも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention recognizes a document image of a form in which characters are described in a character frame such as a slip of a predetermined format, and makes it possible to easily correct the recognition result. The present invention relates to a document reading apparatus described above.

【０００２】[0002]

【従来の技術】図２は、文字枠中に文字が記載された帳
票の文書の例を示す図である。この図では、例えば、所
定のフォーマットの伝票等のように、文字枠Ｆ中に文字
（例えば、「０、１、２、…」）が記載されている。更
に、図３は、図２の文書を従来の文書読取装置で読取っ
たときの表示部の表示画面の一例を示す図である。この
図では、図２の文書を従来の文書読取装置で取込んだと
きの入力画像Ｖ１と、入力画像Ｖ１中の文字の認識結果
Ｒ１とが表示されている。2. Description of the Related Art FIG. 2 is a diagram showing an example of a form document in which characters are described in a character frame. In this figure, characters (for example, “0, 1, 2,...”) Are described in a character frame F, such as a slip in a predetermined format. FIG. 3 is a diagram showing an example of a display screen of the display unit when the document of FIG. 2 is read by a conventional document reading apparatus. In this figure, an input image V1 when the document shown in FIG. 2 is read by a conventional document reading apparatus and a recognition result R1 of characters in the input image V1 are displayed.

【０００３】従来の文書読取装置では、図２の文書の入
力画像データが画像入力部に取込まれ、この入力画像デ
ータ中の文字パターンが画像データ認識部で認識され
る。又、入力画像データ中の文字枠Ｆは、文字枠抽出部
で抽出される。入力画像データ中の文字、文字枠Ｆ及び
文字の認識結果は、表示部で表示される。この表示部で
は、図３に示すように、入力画像Ｖ１中に各文字と共に
文字枠Ｆが一緒に表示されている。認識結果Ｒ１中に
は、修正処理の対象になる文字の位置にカーソルＣが表
示されている。オペレータは、入力画像Ｖ１中の文字と
認識結果Ｒ１中の文字とをそれぞれ照合し、不一致の場
合には入力部を操作してカーソルＣを不一致の文字の位
置に移動させ、修正信号を入力する。修正信号が入力さ
れたとき、不一致の文字を入力画像Ｖ１中の文字に一致
させるように修正処理が行われる。In a conventional document reading apparatus, input image data of the document shown in FIG. 2 is taken into an image input section, and a character pattern in the input image data is recognized by an image data recognition section. The character frame F in the input image data is extracted by the character frame extracting unit. The recognition result of the character, the character frame F, and the character in the input image data is displayed on the display unit. In this display section, as shown in FIG. 3, a character frame F is displayed together with each character in the input image V1. In the recognition result R1, a cursor C is displayed at a position of a character to be corrected. The operator compares the character in the input image V1 with the character in the recognition result R1, and in the case of a mismatch, operates the input unit to move the cursor C to the position of the mismatched character and inputs a correction signal. . When a correction signal is input, a correction process is performed so that a mismatched character matches a character in the input image V1.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、従来の
文書読取装置では、認識結果に対する修正作業を行う場
合、図３に示すように、表示画面中の入力画像Ｖ１に各
文字と共に文字枠Ｆが一緒に表示される。そのため、オ
ペレータが入力画像Ｖ１中の文字と認識結果Ｒ１中の文
字とを照合する場合、入力画像Ｖ１が非常に見にくいの
で、修正作業の効率が低下するという課題があった。However, in the conventional document reading apparatus, when correcting the recognition result, as shown in FIG. 3, a character frame F is put together with each character in the input image V1 on the display screen. Will be displayed. Therefore, when the operator checks the characters in the input image V1 and the characters in the recognition result R1, the input image V1 is very difficult to see, and there is a problem that the efficiency of the correction work is reduced.

【０００５】[0005]

【課題を解決するための手段】前記課題を解決するため
に、本発明のうちの請求項１に係る発明は、文書読取装
置において、文字枠中に文字が記載された文書の入力画
像データを取込む画像入力手段と、前記画像入力手段で
取込まれた入力画像データに含まれる１つ又は複数の文
字パターンをそれぞれ囲む文字領域を切出し、該文字パ
ターンを認識して認識結果を出力する画像データ認識手
段と、前記画像入力手段で取込まれた入力画像データか
ら前記文字枠を抽出する文字枠抽出手段と、前記抽出さ
れた文字枠の全部又は任意の部分を該文字枠とは異なる
色の照合用文字枠に変換するか又は該部分を削除して変
換・削除結果を出力する文字枠変換・削除手段と、前記
入力画像データ中の文字、変換・削除結果及び認識結果
を表示する表示手段と、前記表示手段で表示された入力
画像データ中の文字と認識結果とが不一致の場合、オペ
レータの操作に基づいて修正信号を入力する修正入力手
段と、前記修正信号が入力されたとき、前記表示手段で
表示された認識結果を前記入力画像データ中の文字に一
致させる修正処理を行う制御手段とを、備えている。According to a first aspect of the present invention, there is provided a document reading apparatus for converting input image data of a document in which characters are described in a character frame. An image input unit to be captured, and an image to cut out a character area surrounding each of one or more character patterns included in the input image data captured by the image input unit, recognize the character pattern, and output a recognition result Data recognizing means, character frame extracting means for extracting the character frame from the input image data captured by the image input means, and all or any part of the extracted character frame having a color different from the character frame A character frame conversion / deletion means for converting or deleting the part to output a conversion / deletion result, and a display for displaying characters in the input image data, conversion / deletion result, and recognition result hand When the character in the input image data displayed on the display means and the recognition result do not match, a correction input means for inputting a correction signal based on an operation of an operator, and when the correction signal is input, And control means for performing a correction process for matching the recognition result displayed on the display means with characters in the input image data.

【０００６】このような構成を採用したことにより、文
字枠中に文字が記載された文書の入力画像データが画像
入力手段に取込まれる。この入力画像データは、画像デ
ータ認識手段で文字領域が切出され、文字パターンが認
識されて認識結果が出力される。又、前記画像入力手段
で取込まれた入力画像データは、文字枠抽出手段で文字
枠が抽出される。この抽出された文字枠は、文字枠変換
・削除手段でその全部又は任意の部分が該文字枠とは異
なる色の照合用文字枠に変換されるか又は該部分が削除
され、該文字枠変換・削除手段から変換・削除結果が出
力される。前記入力画像データ中の文字、変換・削除結
果及び認識結果は、表示手段に表示される。オペレータ
は、表示手段に表示された入力画像データ中の文字と認
識結果とを照合し、不一致の場合、修正入力手段を操作
して修正信号を制御手段へ入力する。制御手段では、修
正信号が入力されると、表示手段に表示された認識結果
が入力画像データ中の文字に一致するように修正処理が
行われる。By adopting such a configuration, input image data of a document in which a character is described in a character frame is taken into the image input means. A character area is cut out from the input image data by an image data recognizing means, a character pattern is recognized, and a recognition result is output. Further, a character frame is extracted from the input image data captured by the image input means by a character frame extracting means. The extracted character frame is converted by the character frame conversion / deletion means into a collation character frame of a color different from that of the character frame in its entirety or an arbitrary part, or the part is deleted. -The conversion / deletion result is output from the deletion means. The characters in the input image data, the conversion / deletion result, and the recognition result are displayed on a display unit. The operator compares the character in the input image data displayed on the display means with the recognition result, and when they do not match, operates the correction input means to input a correction signal to the control means. When the correction signal is input, the control unit performs a correction process so that the recognition result displayed on the display unit matches a character in the input image data.

【０００７】請求項２に係る発明では、文字枠中に文字
が記載された文書の入力画像データを取込む画像入力手
段と、前記入力画像データを記憶する第１の記憶手段
と、前記第１の記憶手段に記憶された入力画像データに
含まれる１つ又は複数の文字パターンをそれぞれ囲む文
字領域を切出し、該文字パターンを認識して認識結果を
出力する画像データ認識手段と、前記認識結果を記憶す
る第２の記憶手段と、前記第１の記憶手段に記憶された
入力画像データから前記文字枠を抽出する文字枠抽出手
段と、前記抽出された文字枠の全部又は任意の部分を該
文字枠とは異なる色の照合用文字枠に変換するか又は該
部分を削除して変換・削除結果を出力する文字枠変換・
削除手段と、前記第１の記憶手段に記憶された入力画像
データ中の文字、変換・削除結果及び前記第２の記憶手
段に記憶された認識結果を表示する表示手段と、前記表
示手段で表示された入力画像データ中の文字と認識結果
とが不一致の場合、オペレータの操作に基づいて修正信
号を入力する修正入力手段と、前記修正信号が入力され
たとき、前記表示手段で表示された認識結果を入力画像
データ中の文字に一致させる修正処理を行う制御手段と
を、備えている。In the invention according to claim 2, image input means for inputting input image data of a document in which characters are described in a character frame, first storage means for storing the input image data, Image data recognizing means for extracting a character area surrounding one or more character patterns included in the input image data stored in the storage means, recognizing the character pattern and outputting a recognition result, and recognizing the recognition result. Second storage means for storing; character frame extraction means for extracting the character frame from the input image data stored in the first storage means; and all or any part of the extracted character frame being represented by the character Convert to a collation character frame of a color different from the frame, or delete the part and output the conversion / deletion result.
Deletion means, display means for displaying characters in the input image data stored in the first storage means, conversion / deletion results, and recognition results stored in the second storage means, and display by the display means A correction input unit for inputting a correction signal based on an operation of an operator when the character in the input image data does not match the recognition result, and a recognition unit displayed on the display unit when the correction signal is input. And control means for performing a correction process for matching the result with characters in the input image data.

【０００８】このような構成を採用したことにより、文
字枠中に文字が記載された文書の入力画像データが画像
入力手段に取込まれる。この入力画像データは、第１の
記憶手段に記憶される。第１の記憶手段に記憶された入
力画像データは、画像データ認識手段で文字領域が切出
され、文字パターンが認識されて認識結果が出力され
る。この認識結果は、第２の記憶手段に記憶される。第
１の記憶手段に記憶された入力画像データは、文字枠抽
出手段で文字枠が抽出される。この抽出された文字枠
は、文字枠変換・削除手段でその全部又は任意の部分が
該文字枠とは異なる色の照合用文字枠に変換されるか又
は該部分が削除され、該文字枠変換・削除手段から変換
・削除結果が出力される。第１の記憶手段に記憶された
入力画像データ中の文字、変換・削除結果及び第２の記
憶手段に記憶された認識結果は、表示手段に表示され
る。オペレータは、表示手段に表示された入力画像デー
タ中の文字と認識結果とを照合し、不一致の場合、修正
入力手段を操作して修正信号を制御手段へ入力する。制
御手段では、修正信号が入力されると、表示手段に表示
された認識結果が入力画像データ中の文字に一致するよ
うに修正処理が行われる。請求項３に係る発明では、請
求項１又は２に係る発明の画像入力手段は、文字枠中に
文字が記載された帳票を走査するスキャナから入力画像
データを取込む構成にしている。By adopting such a configuration, input image data of a document in which characters are described in a character frame is taken into the image input means. This input image data is stored in the first storage means. From the input image data stored in the first storage means, a character area is cut out by the image data recognition means, the character pattern is recognized, and the recognition result is output. This recognition result is stored in the second storage means. A character frame is extracted from the input image data stored in the first storage means by the character frame extraction means. The extracted character frame is converted by the character frame conversion / deletion means into a collation character frame of a color different from that of the character frame in its entirety or an arbitrary part, or the part is deleted. -The conversion / deletion result is output from the deletion means. Characters in the input image data stored in the first storage means, conversion / deletion results, and recognition results stored in the second storage means are displayed on the display means. The operator compares the character in the input image data displayed on the display means with the recognition result, and when they do not match, operates the correction input means to input a correction signal to the control means. When the correction signal is input, the control unit performs a correction process so that the recognition result displayed on the display unit matches a character in the input image data. According to a third aspect of the present invention, the image input means of the first or second aspect of the present invention is configured to take in input image data from a scanner that scans a form in which characters are described in a character frame.

【０００９】このような構成を採用したことにより、入
力画像データはスキャナから出力されて画像入力手段に
取込まれ、請求項１又は２に係る発明と同様の処理が行
われる。請求項４に係る発明では、請求項１又は２に係
る発明の画像入力手段は、通信網に接続されたファクシ
ミリ装置（以下、ＦＡＸという）から該通信網を介して
文字枠中に文字が記載された文書の入力画像データを取
込む構成にしている。このような構成を採用したことに
より、入力画像データはＦＡＸから出力され、通信網を
介して画像入力手段に取込まれる。その後、請求項１又
は２に係る発明と同様の処理が行われる。By adopting such a configuration, the input image data is output from the scanner and taken into the image input means, and the same processing as in the invention according to claim 1 or 2 is performed. According to a fourth aspect of the present invention, the image input means according to the first or second aspect of the present invention includes a facsimile apparatus (hereinafter, referred to as a facsimile) connected to a communication network for writing characters in a character frame via the communication network. It is configured to take in input image data of a written document. By adopting such a configuration, input image data is output from a facsimile and taken into an image input unit via a communication network. After that, the same processing as in the invention according to claim 1 or 2 is performed.

【００１０】[0010]

【発明の実施の形態】第１の実施形態図１は、本発明の第１の実施形態を示す文書読取装置の
構成図である。この文書読取装置は、スキャナ部１を備
えている。スキャナ部１は、例えば、電荷結合素子（Ｃ
ＣＤ）センサやアナログ／ディジタル変換回路等からな
る光電変換部を有し、文字枠中に文字が記載された帳票
Ｐに光を照射して走査し、その反射光を電気信号の入力
画像データＳ１に変換して出力する機能を有している。
スキャナ部１の出力側には、入力画像データＳ１を解析
して文字の読取りを行う読取装置本体１０が接続されて
いる。読取装置本体１０は、入力画像データＳ１に対す
る認識処理を行うプロセッサ２０と、このプロセッサ２
０にデータバス３０を介して接続された第１及び第２の
記憶手段（例えば、メモリ）４０とを有している。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS First Embodiment FIG. 1 is a block diagram of a document reading apparatus according to a first embodiment of the present invention. This document reading device includes a scanner unit 1. The scanner unit 1 includes, for example, a charge-coupled device (C
CD) a photoelectric conversion unit including a sensor, an analog / digital conversion circuit, and the like, irradiates and scans a form P on which characters are described in a character frame, and uses the reflected light to input image data S1 of an electric signal. It has the function of converting to and outputting.
The output side of the scanner unit 1 is connected to a reading apparatus main body 10 that analyzes input image data S1 and reads characters. The reading device main body 10 includes a processor 20 for performing a recognition process on the input image data S1,
0 and a first storage means (for example, a memory) 40 connected via a data bus 30.

【００１１】プロセッサ２０は、文書読取プログラムに
従って該プロセッサ２０全体を制御する制御部２１と、
入力画像データＳ１を取込み、データバス３０を介して
メモリ４０の画像記憶領域に送出する画像入力手段（例
えば、画像入力部）２２と、画像入力部２２で取込まれ
た入力画像データＳ１に含まれる文字パターンを囲む文
字領域を切出し、この文字パターンを認識してコード化
した認識結果をデータバス３０を介してメモリ４０の認
識結果記憶領域に送出する画像データ認識手段（例え
ば、画像データ認識部）２３とを有している。又、プロ
セッサ２０には、画像入力部２２で取込まれた入力画像
データＳ１から文字枠を抽出する文字枠抽出手段（例え
ば、文字枠抽出部）２４と、文字枠抽出部２４で抽出さ
れた文字枠の全部又は任意の部分を削除して変換・削除
結果を出力する文字枠変換・削除手段（例えば、文字枠
変換部）２５と、入力画像データＳ１中の文字、文字枠
変換部２５から出力された変換・削除結果及び画像デー
タ認識部２３から出力された認識結果を表示するための
表示信号を生成する表示手段（例えば、画像表示部）２
６とが設けられている。The processor 20 includes a control unit 21 for controlling the entire processor 20 according to a document reading program,
Image input means (for example, an image input unit) 22 which takes in the input image data S1 and sends it to the image storage area of the memory 40 via the data bus 30, and is included in the input image data S1 taken in by the image input unit 22 A character area surrounding the character pattern to be extracted is cut out, and image data recognizing means (for example, an image data recognizing unit) for recognizing the character pattern and sending the coded recognition result to the recognition result storage area of the memory 40 via the data bus 30 ) 23. Further, the processor 20 includes a character frame extracting unit (for example, a character frame extracting unit) 24 for extracting a character frame from the input image data S1 captured by the image input unit 22, and a character frame extracting unit 24. A character frame conversion / deletion unit (for example, a character frame conversion unit) 25 that deletes all or an arbitrary part of the character frame and outputs a conversion / deletion result, and a character in the input image data S1 and a character frame conversion unit 25 A display unit (for example, an image display unit) 2 for generating a display signal for displaying the output conversion / deletion result and the recognition result output from the image data recognition unit 23
6 are provided.

【００１２】プロセッサ２０及びメモリ４０には、デー
タバス３０を介して、画像表示部２６で生成された表示
信号に基づいた画像を表示するモニタ５１、オペレータ
の操作によって制御部２１に入力信号を送出するマウス
５２とキーボード５３、メモリ４０の認識結果記憶領域
に格納された認識結果を保存するフロッピーディスク５
４、制御部２６を動作させるためのプログラムを格納す
るハードディスク５５が接続されている。A monitor 51 for displaying an image based on the display signal generated by the image display unit 26 to the processor 20 and the memory 40 via the data bus 30, and an input signal is sent to the control unit 21 by an operator's operation. Mouse 52 and keyboard 53, and a floppy disk 5 for storing the recognition results stored in the recognition result storage area of the memory 40.
4. A hard disk 55 for storing a program for operating the control unit 26 is connected.

【００１３】次に、図１の文書読取装置における読取動
作（１）、及び修正処理動作（２）を説明する（１）読取動作図４は、図１の文書読取装置における読取動作を説明す
るためのフローチャートである。Next, a reading operation (1) and a correction processing operation (2) in the document reading apparatus of FIG. 1 will be described. (1) Reading Operation FIG. 4 describes a reading operation in the document reading apparatus of FIG. It is a flowchart for the.

【００１４】読取動作では、先ず、スキャナ部１で帳票
Ｐに光が照射され、その反射光から帳票Ｐの画像が読み
込まれ、入力画像データＳ１に変換されてプロセッサ２
０に送出される（ステップＳＴ１）。プロセッサ２０に
おいて、入力画像データＳ１は画像入力部２２に取込ま
れ、メモリ４０の画像記憶領域に格納される（ステップ
ＳＴ２）。メモリ４０に格納された入力画像データＳ１
は、画像データ認識部２３で文字領域が切出され、文字
認識処理が行われて認識結果が出力される（ステップＳ
Ｔ３）。文字認識処理が終了した後、認識結果に対する
修正処理が行われる（ステップＳＴ４）。修正処理が終
了した後、帳票Ｐの画像の読み込みが終了したか否かが
判定され、終了していれば、この文書読取装置における
処理動作を終了し、終了していなければ、ステップＳＴ
１に戻る。In the reading operation, first, the form P is irradiated with light by the scanner unit 1, an image of the form P is read from the reflected light, converted into input image data S 1, and
0 (step ST1). In the processor 20, the input image data S1 is taken into the image input unit 22, and stored in the image storage area of the memory 40 (step ST2). Input image data S1 stored in memory 40
Indicates that a character area is cut out by the image data recognizing unit 23, character recognition processing is performed, and a recognition result is output (step S).
T3). After the end of the character recognition process, a correction process for the recognition result is performed (step ST4). After the correction processing is completed, it is determined whether the reading of the image of the form P is completed. If the reading is completed, the processing operation in the document reading apparatus is completed.
Return to 1.

【００１５】（２）修正処理動作図５は、図４中の認識結果修正処理動作（ステップＳＴ
４）を説明するためのフローチャートである。図６は、
図１中のモニタ５１の表示画面の例を示す図であり、図
２の文書を図１の文書読取装置で取込んだときの入力画
像Ｖ２と、入力画像Ｖ２中の文字の認識結果Ｒ２とが示
されている。(2) Correction processing operation FIG. 5 shows the recognition result correction processing operation (step ST) in FIG.
It is a flowchart for demonstrating 4). FIG.
FIG. 3 is a diagram illustrating an example of a display screen of a monitor 51 in FIG. 1, showing an input image V2 when the document in FIG. 2 is captured by the document reading device in FIG. 1, and a recognition result R2 of characters in the input image V2; It is shown.

【００１６】修正処理動作では、図５において、画像デ
ータ認識部２３から出力された認識結果は画像表示部２
６に送出され、画像表示部２６で表示信号が生成され
る。この表示信号はモニタ５１に送出され、認識結果が
表示される（ステップＳＴ１１）。オペレータは、認識
結果を修正する為のガイドとして、入力画像データＳ１
をモニタ５１に表示するか否かを決定する。又は、入力
画像データＳ１をモニタ５１に自動的に表示するように
予め設定しておく（ステップＳＴ１２）。入力画像デー
タＳ１をモニタ５１に表示する場合、入力画像データＳ
１から文字枠を抽出する。この場合、文字枠抽出部２４
において、例えば入力画像データＳ１の縦方向及び横方
向の画素数のヒストグラムが算出され、このヒストグラ
ムが予め設定された閾値以上になった場合に文字枠とし
て抽出される（ステップＳＴ１３）。文字枠変換部２５
において、入力画像データＳ１中の文字枠Ｆが削除され
る。この場合、例えば、抽出された文字枠の画像データ
をモニタ５１に送出しないようにする（ステップＳＴ１
４）。図６に示すように、入力画像Ｖ２中には各文字の
みが表示され、文字枠は表示されていない。認識結果Ｒ
２中には修正処理の対象になる文字の位置にカーソルＣ
が表示されている。In the correction processing operation, in FIG. 5, the recognition result output from the image data
6 and a display signal is generated by the image display unit 26. This display signal is sent to the monitor 51, and the recognition result is displayed (step ST11). The operator operates the input image data S1 as a guide for correcting the recognition result.
Is displayed on the monitor 51. Alternatively, it is set in advance so that the input image data S1 is automatically displayed on the monitor 51 (step ST12). When displaying the input image data S1 on the monitor 51, the input image data S1
Extract a character frame from 1. In this case, the character frame extracting unit 24
In, for example, a histogram of the number of pixels in the vertical direction and the horizontal direction of the input image data S1 is calculated, and when the histogram is equal to or larger than a preset threshold, it is extracted as a character frame (step ST13). Character frame conversion unit 25
In, the character frame F in the input image data S1 is deleted. In this case, for example, the image data of the extracted character frame is not sent to the monitor 51 (step ST1).
4). As shown in FIG. 6, only each character is displayed in the input image V2, and no character frame is displayed. Recognition result R
During cursor 2, place the cursor C at the position of the character to be corrected.
Is displayed.

【００１７】入力画像データＳ１から文字枠が除去され
た画像（即ち、入力画像データＳ１中の文字）がモニタ
５１に表示される（ステップＳＴ１５）。オペレータ
は、モニタ５１に表示された入力画像Ｖ２中の文字と認
識結果Ｒ２とを照合し、不一致の場合、マウス５２又は
キーボード５３を操作することによって修正信号を制御
部２１へ入力する。制御部２１では、修正信号が入力さ
れると、認識結果Ｒ２が入力画像Ｖ２中の文字に一致す
るように修正処理が行われる（ステップＳＴ１６）。こ
の修正処理の結果は画像表示部２６を経てモニタ５１へ
送出される。オペレータは、モニタ５１の表示画面を見
ることによって修正処理が終了したか否かを判定し（ス
テップＳＴ１７）、終了していなければステップＳＴ１
６に戻り、終了していれば図４中のステップＳ５へ移
る。前記ステップＳＴ１２において、入力画像データＳ
１をモニタ５１に表示しない場合は、ステップＳＴ１６
へ移る。以上のように、この第１の実施形態では、文字
枠変換部２５において入力画像データＳ１から文字枠Ｆ
を削除し、モニタ５１で文字枠Ｆが表示されていない入
力画像Ｖ２を表示するようにしたので、修正作業におけ
るオペレータの負担を軽くできる。An image in which the character frame has been removed from the input image data S1 (ie, the characters in the input image data S1) is displayed on the monitor 51 (step ST15). The operator compares the character in the input image V2 displayed on the monitor 51 with the recognition result R2, and inputs a correction signal to the control unit 21 by operating the mouse 52 or the keyboard 53 if they do not match. When the correction signal is input, the control unit 21 performs a correction process so that the recognition result R2 matches a character in the input image V2 (step ST16). The result of the correction process is sent to the monitor 51 via the image display unit 26. The operator determines whether or not the correction process has been completed by looking at the display screen of the monitor 51 (step ST17).
Returning to step S6, if the processing has been completed, the processing moves to step S5 in FIG. In step ST12, the input image data S
If 1 is not displayed on the monitor 51, step ST16
Move to As described above, in the first embodiment, the character frame conversion unit 25 converts the input image data S1 into the character frame F.
Is deleted and the input image V2 without the character frame F is displayed on the monitor 51, so that the burden on the operator in the correction work can be reduced.

【００１８】第２の実施形態本実施形態の文書読取装置では、図１中の文字枠変換部
２５に代えて、図示しない異なる構成の文字枠変換部２
５Ａが設けられている。この文字枠変換部２５Ａは、文
字枠抽出部２４で抽出された文字枠の全部又は任意の部
分を該文字枠とは異なる色の照合用文字枠に変換して変
換・削除結果を出力する構成になっている。 Second Embodiment In the document reading apparatus of this embodiment, a character frame conversion unit 2 having a different configuration (not shown) is used instead of the character frame conversion unit 25 shown in FIG.
5A is provided. The character frame conversion unit 25A is configured to convert all or any part of the character frame extracted by the character frame extraction unit 24 into a collation character frame of a color different from the character frame and output a conversion / deletion result. It has become.

【００１９】図７は、本発明の第２の実施形態を示す修
正処理動作のフローチャートであり、第１の実施形態を
示す図５中の要素と共通の要素には共通の符号が付され
ている。図８は、図７のモニタ５１の表示画面の例を示
す図である。この文書読取装置では、読取動作が第１の
実施形態と同様に行われる。修正処理動作では、図７に
示すように、ステップＳＴ１４Ａにおいて、文字枠変換
部２５Ａで、文字枠抽出部２４で抽出された文字枠の全
部又は任意の部分が、例えばカラーパレット等の色変換
手段によって該文字枠とは異なる色の照合用文字枠に変
換される。この照合用文字枠の画像データは、画像表示
部２６を経てモニタ５１に送出される。図８に示すよう
に、入力画像Ｖ３中には、各文字と共に色が変更された
照合用文字枠ＦＡが一緒に表示されている。認識結果Ｒ
３中には、修正処理の対象になる文字の位置にカーソル
Ｃが表示されている。他は、図５と同様の処理が行われ
る。FIG. 7 is a flowchart of a correction processing operation according to the second embodiment of the present invention. Elements common to those in FIG. 5 according to the first embodiment are denoted by the same reference numerals. I have. FIG. 8 is a diagram showing an example of a display screen of the monitor 51 of FIG. In this document reading apparatus, the reading operation is performed in the same manner as in the first embodiment. In the correction processing operation, as shown in FIG. 7, in step ST14A, in the character frame conversion unit 25A, all or any part of the character frame extracted by the character frame extraction unit 24 is converted to a color conversion unit such as a color palette. Is converted to a collation character frame of a different color from the character frame. The image data of the collation character frame is transmitted to the monitor 51 via the image display unit 26. As shown in FIG. 8, in the input image V3, a collation character frame FA whose color has been changed is displayed together with each character. Recognition result R
In 3, the cursor C is displayed at the position of the character to be corrected. Otherwise, the same processing as in FIG. 5 is performed.

【００２０】以上のように、この第２の実施形態では、
文字枠変換部２５Ａにおいて入力画像データＳ１中の文
字枠の色を変更し、モニタ５１でこの文字枠を含む入力
画像Ｖ３を表示するようにしたので、図６中の入力画像
Ｖ２よりも見やすくなり、修正作業におけるオペレータ
の負担をより軽くできる。As described above, in the second embodiment,
Since the color of the character frame in the input image data S1 is changed in the character frame conversion unit 25A and the input image V3 including the character frame is displayed on the monitor 51, it is easier to see than the input image V2 in FIG. Thus, the burden on the operator in the correction work can be reduced.

【００２１】第３の実施形態本実施形態の文書読取装置では、図１中の文字枠変換部
２５に代えて、図示しない異なる構成の文字枠変換部２
５Ｂが設けられている。この文字枠変換部２５Ｂは、文
字枠抽出部２４で抽出された文字枠の全部又は任意の部
分を該文字枠とは異なる色の照合用文字枠に変換する
か、該部分を削除するか、又は変更せずに出力するか
を、オペレータの操作によって選択する構成になってい
る。 Third Embodiment In the document reading apparatus of this embodiment, a character frame conversion unit 2 having a different configuration (not shown) is used instead of the character frame conversion unit 25 shown in FIG.
5B is provided. The character frame conversion unit 25B converts all or any part of the character frame extracted by the character frame extraction unit 24 into a collation character frame of a color different from the character frame, deletes the part, Alternatively, whether to output without changing is selected by an operation of the operator.

【００２２】図９は、本発明の第３の実施形態を示す修
正処理動作のフローチャートであり、図５及び図７中の
要素と共通の要素には共通の符号が付されている。この
文書読取装置でも、読取動作が第１の実施形態と同様に
行われる。修正処理動作では、図９に示すように、ステ
ップＳＴ２１において、オペレータによって文字枠を非
表示にするか否かが判定され、非表示にする場合はステ
ップＳＴ２１ａへ進み、文字枠変換部２５Ｂで文字枠が
削除される。その後、ステップＳＴ１５へ進む。文字枠
を表示する場合はステップＳＴ２２へ進み、該文字枠の
色を変更するか否かが判定される。文字枠の色を変更す
る場合はステップＳＴ２２ａへ進み、文字枠変換部２５
Ｂで文字枠の色が変更される。その後、ステップＳＴ１
５へ進む。文字枠の色を変更しない場合はステップＳＴ
２３へ進み、文字枠がそのまま表示される。他は、図５
と同様の処理が行われる。FIG. 9 is a flowchart of a correction processing operation according to the third embodiment of the present invention. Elements common to those in FIGS. 5 and 7 are denoted by the same reference numerals. In this document reading apparatus, the reading operation is performed in the same manner as in the first embodiment. In the correction processing operation, as shown in FIG. 9, in step ST21, it is determined whether or not the character frame is hidden by the operator. The frame is deleted. Thereafter, the process proceeds to step ST15. When displaying the character frame, the process proceeds to step ST22, and it is determined whether or not to change the color of the character frame. When changing the color of the character frame, the process proceeds to step ST22a, where the character frame conversion unit 25
B changes the color of the character frame. Then, step ST1
Go to 5. Step ST when not changing the color of the character frame
Proceeding to 23, the character frame is displayed as it is. The other is FIG.
Is performed.

【００２３】以上のように、この第３の実施形態では、
文字枠変換部２５Ｂにおいて入力画像データＳ１中の文
字枠の色を変更するか又は削除するかを選択できるよう
にしたので、修正作業におけるオペレータの負担を更に
軽くできる。As described above, in the third embodiment,
Since the character frame conversion unit 25B can select whether to change or delete the color of the character frame in the input image data S1, the burden on the operator in the correction work can be further reduced.

【００２４】第４の実施形態図１０は、本発明の第４の実施形態を示す文書読取装置
の構成図であり、第１の実施形態を示す図１中の要素と
共通の要素には共通の符号が付されている。この文書読
取装置では、図１中の読取装置本体１０に代えて、異な
る構成の読取装置本体１０Ａが設けられている。読取装
置本体１０Ａでは、図１中の画像入力部２２がＦＡＸ受
信部２２Ａに変更されたプロセッサ２０Ａが設けられて
いる。ＦＡＸ受信部２２Ａには、通信網（例えば、公衆
網）ＮＷを介してＦＡＸ６０が接続されている。他は、
図１と同様の構成である。 Fourth Embodiment FIG. 10 is a block diagram of a document reading apparatus showing a fourth embodiment of the present invention, and is common to the elements in FIG. 1 showing the first embodiment and common elements. Are given. In this document reading apparatus, a reading apparatus main body 10A having a different configuration is provided instead of the reading apparatus main body 10 in FIG. In the main body 10A of the reading apparatus, a processor 20A in which the image input section 22 in FIG. 1 is replaced with a FAX receiving section 22A is provided. The FAX 60 is connected to the FAX receiving unit 22A via a communication network (for example, a public network) NW. Others
The configuration is similar to that of FIG.

【００２５】この文書読取装置では、ＦＡＸ６０から送
信された送信データＳ６０が、公衆網ＮＷを介してＦＡ
Ｘ受信部２２Ａに取込まれ、メモリ４０の画像記憶領域
に格納される。その後、第１、第２又は第３の実施形態
と同様の読取動作及び修正処理動作が行われる。以上の
ように、この第４の実施形態では、第１、第２又は第３
の実施形態と同様に、ＦＡＸ６０から送信された送信デ
ータＳ６０の認識結果に対する修正作業におけるオペレ
ータの負担を軽くできる。In this document reading apparatus, the transmission data S60 transmitted from the FAX 60 is transmitted to the FA via the public network NW.
The image is taken into the X receiving unit 22A and stored in the image storage area of the memory 40. After that, the same reading operation and correction processing operation as in the first, second, or third embodiment are performed. As described above, in the fourth embodiment, the first, second, or third
As in the embodiment, the burden on the operator in correcting the recognition result of the transmission data S60 transmitted from the FAX 60 can be reduced.

【００２６】尚、本発明は上記実施形態に限定されず、
種々の変形が可能である。その変形例としては、例えば
次のようなものがある。（ａ）図１では、入力画像データＳ１はスキャナ部１
から出力されるようになっているが、他の装置で生成し
た入力画像データを例えばフロッピーディスク等の記憶
手段に保存し、この保存された入力画像データを該記憶
手段の駆動装置で読出すことによって出力するようにし
てもよい。（ｂ）図６では、抽出された文字枠の画像データをモ
ニタ５１に送出しないようにしたが、この文字枠の画像
データを表示画面の背景と同一の色にしてもよい。これ
により、文字枠が表示画面に表示されなくなり、第１の
実施形態と同様の効果が得られる。The present invention is not limited to the above embodiment,
Various modifications are possible. For example, there are the following modifications. (A) In FIG. 1, the input image data S1 is the scanner unit 1
The input image data generated by another device is stored in a storage device such as a floppy disk, and the stored input image data is read out by a driving device of the storage device. May be output. (B) In FIG. 6, the extracted image data of the character frame is not sent to the monitor 51. However, the image data of the character frame may have the same color as the background of the display screen. Thereby, the character frame is not displayed on the display screen, and the same effect as in the first embodiment can be obtained.

【００２７】（ｃ）図８中の照合用文字枠ＦＡは、実
線で表示されているが、例えば破線や一点鎖線等で表示
するようにしてもよい。この場合、照合用文字枠ＦＡの
一部を削除する処理を行う。（ｄ）実施形態では、第１及び第２の記憶手段をメモ
リ４０で構成したが、それぞれ独立したメモリで構成し
てもよい。（ｅ）図１０中の公衆網ＮＷは、他の通信網で構成し
てもよい。(C) Although the collating character frame FA in FIG. 8 is displayed by a solid line, it may be displayed by, for example, a dashed line or a dashed line. In this case, a process of deleting a part of the collation character frame FA is performed. (D) In the embodiment, the first and second storage units are configured by the memory 40, but may be configured by independent memories. (E) The public network NW in FIG. 10 may be configured by another communication network.

【００２８】[0028]

【発明の効果】以上詳細に説明したように、請求項１に
係る発明によれば、文字枠変換・削除手段において入力
画像データ中の文字枠の全部又は任意の部分を該文字枠
とは異なる色の照合用文字枠に変換するか又は該部分を
削除し、その結果を表示手段で表示するようにしたの
で、修正作業におけるオペレータの負担を軽くできる。
請求項２に係る発明によれば、画像入力手段で取込まれ
た入力画像データを第１の記憶手段で記憶し、画像デー
タ認識手段から出力された認識結果を第２の記憶手段で
記憶するようにしたので、任意の時間で修正処理を行う
ことができる。そのため、請求項１に係る発明の効果に
加え、修正作業におけるオペレータの負担をより軽くで
きる。As described in detail above, according to the first aspect of the present invention, the character frame conversion / deletion means makes all or any part of the character frame in the input image data different from the character frame. Since the result is converted into a character frame for color comparison or the part is deleted and the result is displayed on the display means, the burden on the operator in the correction work can be reduced.
According to the invention of claim 2, the input image data captured by the image input means is stored in the first storage means, and the recognition result output from the image data recognition means is stored in the second storage means. Thus, the correction process can be performed at an arbitrary time. Therefore, in addition to the effect of the invention according to claim 1, the burden on the operator in the correction work can be further reduced.

【００２９】請求項３に係る発明によれば、画像入力手
段をスキャナで構成したので、このスキャナで取込まれ
た入力画像データに対して請求項１又は２に係る発明と
同様の効果がある。請求項４に係る発明によれば、画像
入力手段をＦＡＸで構成したので、このＦＡＸで取込ま
れた入力画像データに対して請求項１又は２に係る発明
と同様の効果がある。According to the third aspect of the present invention, since the image input means is constituted by a scanner, the same effect as that of the first or second aspect of the present invention can be obtained for input image data taken in by the scanner. . According to the fourth aspect of the present invention, since the image input means is constituted by a facsimile, the same effect as that of the first or second aspect of the invention can be obtained for the input image data captured by the facsimile.

[Brief description of the drawings]

【図１】本発明の第１の実施形態の文書読取装置の構成
図である。FIG. 1 is a configuration diagram of a document reading device according to a first embodiment of the present invention.

【図２】文字枠中に文字が記載された文書例を示す図で
ある。FIG. 2 is a diagram illustrating an example of a document in which characters are described in a character frame.

【図３】図２に対応する表示画面を示す図である。FIG. 3 is a view showing a display screen corresponding to FIG. 2;

【図４】図１の読取動作のフローチャートである。FIG. 4 is a flowchart of a reading operation of FIG. 1;

【図５】図４中の修正処理動作のフローチャートであ
る。FIG. 5 is a flowchart of a correction processing operation in FIG. 4;

【図６】図１中の表示画面例を示す図である。FIG. 6 is a diagram showing an example of a display screen in FIG. 1;

【図７】本発明の第２の実施形態の修正処理動作のフロ
ーチャートである。FIG. 7 is a flowchart of a correction processing operation according to the second embodiment of the present invention.

【図８】図７の表示画面例を示す図である。FIG. 8 is a diagram showing an example of the display screen of FIG.

【図９】本発明の第３の実施形態の修正処理動作のフロ
ーチャートである。FIG. 9 is a flowchart of a correction processing operation according to the third embodiment of the present invention.

【図１０】本発明の第４の実施形態の文書読取装置の構
成図である。FIG. 10 is a configuration diagram of a document reading device according to a fourth embodiment of the present invention.

[Explanation of symbols]

１スキャナ部１０，１０Ａ読取装置本体２１制御部２２画像入力部２２ＡＦＡＸ受信部２３画像データ認識部２４文字枠抽出部２５文字枠変換部２６画像表示部４０メモリ５１モニタ５２マウス５３キーボード６０ＦＡＸ DESCRIPTION OF SYMBOLS 1 Scanner part 10, 10A Reading device main body 21 Control part 22 Image input part 22A FAX receiving part 23 Image data recognition part 24 Character frame extraction part 25 Character frame conversion part 26 Image display part 40 Memory 51 Monitor 52 Mouse 53 Keyboard 60 FAX

Claims

[Claims]

1. An image input unit for inputting input image data of a document in which characters are described in a character frame, and one or more character patterns included in the input image data input by the image input unit. An image data recognition unit that cuts out each of the surrounding character regions, recognizes the character pattern and outputs a recognition result, and a character frame extraction unit that extracts the character frame from the input image data captured by the image input unit. A character frame conversion / deletion means for converting all or any part of the extracted character frame to a collation character frame of a color different from the character frame or deleting the part and outputting a conversion / deletion result; A display unit for displaying characters in the input image data, a conversion / deletion result, and a recognition result; and an operation by an operator when the characters in the input image data displayed by the display unit do not match the recognition result. Correction input means for inputting a correction signal based on the correction signal, and when the correction signal is input, a control means for performing correction processing for matching the recognition result displayed on the display means to characters in the input image data, A document reading device comprising:

2. An image input unit for receiving input image data of a document in which characters are described in a character frame, a first storage unit for storing the input image data, and a first storage unit for storing the input image data. Image data recognizing means for extracting a character area surrounding each of one or a plurality of character patterns included in the input image data, recognizing the character pattern and outputting a recognition result, and a second storage for storing the recognition result Means, character frame extracting means for extracting the character frame from the input image data stored in the first storage means, all or any part of the extracted character frame having a different color from the character frame. Character frame conversion / deletion means for converting to a collation character frame or deleting the part and outputting a conversion / deletion result; characters in the input image data stored in the first storage means; Results and said Display means for displaying the recognition result stored in the second storage means; and if a character in the input image data displayed by the display means does not match the recognition result, a correction signal is inputted based on an operation of the operator. Correction input means, and when the correction signal is input, control means for performing correction processing for matching the recognition result displayed on the display means to characters in the input image data, Document reading device.

3. A document reading apparatus according to claim 1, wherein said image input means takes in input image data from a scanner that scans a form in which characters are described in a character frame. .

4. The apparatus according to claim 1, wherein said image input means takes in input image data of a document in which a character is written in a character frame from a facsimile apparatus connected to a communication network via the communication network. The document reading device according to claim 1 or 2, wherein