JPH1011530A

JPH1011530A - Image input and output method and device therefor

Info

Publication number: JPH1011530A
Application number: JP8164635A
Authority: JP
Inventors: Eiji Ohara; 栄治大原
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1996-06-25
Filing date: 1996-06-25
Publication date: 1998-01-16

Abstract

PROBLEM TO BE SOLVED: To easily obtain the correct result of recognition when a character part is extracted from an input document image and displayed after its recognition. SOLUTION: A document image is read by a scanner circuit 1 as the digital data and stored in a memory circuit 3. An area division circuit 4 decides a character part from an image part, and this decision result is stored in an area memory 5. At the same time, a recognition circuit 6 recognizes the characters of the character part and displays them on the screen of a display device 10. An information processing circuit 9 reconfigures the recognition result of the character part and displays both images and characters.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文書画像を入出力
する装置、特に文書画像中の文字部を認識してその結果
を修正可能な画像入出力装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a device for inputting / outputting a document image, and more particularly to an image input / output device capable of recognizing a character portion in a document image and correcting the result.

【０００２】[0002]

【従来の技術】従来、この種の画像入出力装置として
は、例えば文字認識を利用した全文検索が可能なファイ
リング装置が知られている。これは、文字認識したコー
ド情報と文書画像を対にしてファイリングしたり、ある
いは文字認識したコード情報のみファイリングしたりす
るものであるが、上記コード情報すべてが検索する際の
キーワードとなるものである。2. Description of the Related Art Conventionally, as this type of image input / output device, for example, a filing device capable of performing full-text search using character recognition is known. This is to file a pair of the character information and the document image that have been recognized as characters, or to file only the code information that has been recognized as characters. However, all of the above-mentioned code information is used as a keyword when searching. .

【０００３】このような全文検索として文字認識を利用
する装置の場合、認識結果が必ずしも完全である必要は
なく、多少認識結果に違いがあっても実用上問題はない
が、その認識結果を他のアプリケーションソフトに出力
しコード情報として再利用する場合には、文書を完全に
する必要があるため、文字認識の結果と文書画像とを比
較することにより差を確認して修正するようにしてい
る。[0003] In the case of such an apparatus utilizing character recognition for full-text search, the recognition result does not necessarily have to be perfect. Even if there is a slight difference in the recognition result, there is no practical problem. In order to output to the application software and reuse it as code information, it is necessary to complete the document. Therefore, the difference is confirmed by comparing the result of character recognition with the document image and correcting it. .

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記の
ような従来の画像入出力装置にあっては、誤認識があっ
た場合には比較的容易にみつけることができるが、例え
ば新聞のように文書の段落が複雑な場合や文字部が複雑
に配置されている場合には、文字部の認識処理を行う順
番を誤ることも多く、それにより認識結果の文脈が崩れ
てしまい、これを元に復元するのは大変な作業であり、
また認識結果を大量に利用する場合は、このわずらわし
い作業を繰り返さなければならないという問題点があっ
た。However, in the above-described conventional image input / output device, if an erroneous recognition is made, it can be found relatively easily. If the paragraph is complicated or the character part is arranged in a complicated manner, the order of performing the recognition process of the character part is often wrong, which breaks the context of the recognition result and restores it based on this It is a difficult task to do
In addition, when a large number of recognition results are used, there is a problem that this troublesome work must be repeated.

【０００５】本発明は、上記のような問題点に着目して
なされたもので、入力した文書画像中の文字部を認識し
たときにより正しい認識結果を容易に得ることができる
画像入出力方法及び装置を提供することを目的としてい
る。SUMMARY OF THE INVENTION The present invention has been made in view of the above-described problems. An image input / output method and a method for easily obtaining a correct recognition result when a character portion in an input document image is recognized. It is intended to provide a device.

【０００６】[0006]

【課題を解決するための手段】本発明に係る画像入出力
装置は、次のように構成したものである。The image input / output device according to the present invention is configured as follows.

【０００７】（１）入力した文書画像から文字部を抽出
する抽出手段と、抽出された文字部を表示する表示手段
と、その文字部の中から選択された任意の文字部の認識
結果を再構成する処理手段とを備えた。(1) Extraction means for extracting a character part from an input document image, display means for displaying the extracted character part, and re-recognition result of an arbitrary character part selected from the character parts And processing means to be configured.

【０００８】（２）上記（１）の装置において、処理手
段は、選択された複数の文字部の認識結果の順序を入れ
替えるようにした。(2) In the apparatus of the above (1), the processing means changes the order of recognition results of the selected plural character parts.

【０００９】（３）上記（１）または（２）の装置にお
いて、処理手段は、選択された複数の文字部を結合して
一つの文字部とするようにした。(3) In the apparatus according to the above (1) or (2), the processing means combines a plurality of selected character parts into one character part.

【００１０】（４）上記（１）ないし（３）何れかの装
置において、表示手段は、文字部を表す領域を画像に合
成して表示するようにした。(4) In any one of the above-mentioned devices (1) to (3), the display means combines the area representing the character portion with the image and displays it.

【００１１】（５）上記（１）ないし（４）何れかの装
置において、表示手段は、文字部を表す領域に認識処理
した順序を表示するようにした。(5) In any one of the above-mentioned devices (1) to (4), the display means displays the order of the recognition processing in the area representing the character portion.

【００１２】（６）上記（４）または（５）の装置にお
いて、文字部を表す領域は、文字部分に外接する矩形領
域とした。(6) In the device of (4) or (5) above, the region representing the character portion is a rectangular region circumscribing the character portion.

【００１３】また本発明に係る画像入出力方法は、次の
ように構成したものである。An image input / output method according to the present invention is configured as follows.

【００１４】（７）入力した文書画像から文字部を抽出
する抽出工程と、抽出された文字部を表示する表示工程
と、その文字部の中から選択された任意の文字部の認識
結果を再構成する処理工程とを備えた。(7) An extraction step of extracting a character part from the input document image, a display step of displaying the extracted character part, and a recognizing result of an arbitrary character part selected from the character part. And processing steps to be constituted.

【００１５】（８）上記（７）の方法において、処理工
程は、選択された複数の文字部の認識結果の順序を入れ
替えるようにした。(8) In the method of the above (7), in the processing step, the order of the recognition results of the selected plural character parts is changed.

【００１６】（９）上記（７）または（８）の方法にお
いて、処理工程は、選択された複数の文字部を結合して
一つの文字部とするようにした。(9) In the method of the above (7) or (8), in the processing step, a plurality of selected character portions are combined into one character portion.

【００１７】（１０）上記（７）ないし（９）何れかの
方法において、表示工程は、文字部を表す領域を画像に
合成して表示する工程を含むようにした。(10) In any one of the above methods (7) to (9), the display step includes a step of combining an area representing a character portion with an image and displaying the image.

【００１８】（１１）上記（７）ないし（１０）何れか
の方法において、表示工程は、文字部を表す領域に認識
処理した順序を表示する工程を含むようにした。(11) In any one of the above methods (7) to (10), the displaying step includes a step of displaying the order of the recognition processing in the area representing the character portion.

【００１９】（１２）上記（１０）または（１１）の方
法において、文字部を表す領域は、文字部分に外接する
矩形領域であるという表示工程を含むようにした。(12) In the method of (10) or (11), a display step is provided in which the region representing the character portion is a rectangular region circumscribing the character portion.

【００２０】[0020]

【発明の実施の形態】以下、本発明を実施例により詳し
く説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail with reference to embodiments.

【００２１】図１は本発明に係る画像入出力装置の構成
を示すブロック図である。同図において、１は文書画像
を取り込むスキャナー回路（入力手段）、２はその文書
画像を記憶するメモリ回路（記憶手段）３を制御するメ
モリコントロール回路、４は入力した文書画像の特徴に
より文字部の領域等を分割する領域分割回路（抽出手
段）、５はその領域分割結果を記憶する領域メモリ、６
は画像の特徴を抽出、比較することにより文字，数字，
記号等を認識する認識回路（認識手段）、７はその認識
結果を記憶する認識メモリ、８は辞書メモリ、９は入力
情報の処理を行う情報処理回路（処理手段）で、任意の
複数の文字部を選択し、選択した複数の文字部の認識結
果を再構成する。FIG. 1 is a block diagram showing the configuration of an image input / output device according to the present invention. In FIG. 1, 1 is a scanner circuit (input means) for taking in a document image, 2 is a memory control circuit for controlling a memory circuit (storage means) 3 for storing the document image, and 4 is a character portion according to the characteristics of the input document image. An area dividing circuit (extracting means) for dividing an area or the like, 5 an area memory for storing the area dividing result, 6
Extracts and compares the features of the image,
A recognition circuit (recognition means) for recognizing symbols, etc., 7 a recognition memory for storing the recognition result, 8 a dictionary memory, 9 an information processing circuit (processing means) for processing input information, and a plurality of arbitrary characters. A part is selected, and the recognition results of the selected character parts are reconstructed.

【００２２】１０は文字部や画像を表示するモニター等
の表示装置（表示手段）、１１はＭＯドライブあるいは
ハードディスク等の外部記憶装置、１２は本画像入出力
装置のシステム全体の制御を司るＣＰＵ等からなるシス
テム制御回路であり、ＲＯＭ１３に記憶されたプログラ
ムに従って全体の制御を行う。ＲＯＭ１３は後述するフ
ローチャート等本発明に係る処理の制御プログラムを記
憶している。Reference numeral 10 denotes a display device (display means) such as a monitor for displaying a character portion or an image, 11 an external storage device such as an MO drive or a hard disk, and 12 a CPU or the like which controls the entire system of the image input / output device. And controls the entire system according to a program stored in the ROM 13. The ROM 13 stores a control program for processing according to the present invention, such as a flowchart described later.

【００２３】次に、図２のフローチャートにより動作を
説明する。このフローチャートの動作は、システム制御
回路１２により実行されるものである。Next, the operation will be described with reference to the flowchart of FIG. The operation of this flowchart is executed by the system control circuit 12.

【００２４】まず、システム制御回路１２の制御の下
で、スキャナー回路１に文書画像読み込み指示が与えら
れると、スキャナー回路１は文書画像をデジタル画像デ
ータ（以下イメージデータという）として読み取る（ス
テップＳ１）。この読み取ったイメージデータは、メモ
リコントロール回路２の制御により、メモリ回路３に記
憶される（ステップＳ２）。First, when a document image reading instruction is given to the scanner circuit 1 under the control of the system control circuit 12, the scanner circuit 1 reads the document image as digital image data (hereinafter referred to as image data) (step S1). . The read image data is stored in the memory circuit 3 under the control of the memory control circuit 2 (step S2).

【００２５】そして、文書画像１枚分のイメージデータ
がメモリ回路３に記憶されると、システム制御回路１２
はメモリコントロール回路２に指示し、読み取ったイメ
ージデータを領域分割回路４に出力させるとともに、領
域分割回路４に対して動作指示を与える。この結果、領
域分割回路４は、入力されたイメージデータ中のイメー
ジの種別を判定し（ステップＳ３）、その結果を領域メ
モリ５に記憶させる（ステップＳ４）。When the image data for one document image is stored in the memory circuit 3, the system control circuit 12
Instructs the memory control circuit 2 to output the read image data to the area dividing circuit 4 and gives an operation instruction to the area dividing circuit 4. As a result, the area dividing circuit 4 determines the type of the image in the input image data (step S3), and stores the result in the area memory 5 (step S4).

【００２６】ここで、領域判定処理としては、例えば入
力したイメージデータに対して処理速度を上げるために
間引き処理が行われたり、黒画素の連続性を検出するた
めに輪郭線追跡が行われる。そして、黒画素が連続する
領域に外接する矩形を抽出し、その矩形の大きさや、そ
の中に存在する黒画素の占める割合などでもってその領
域が文字からなるイメージか、写真などの非文字部分か
を判定する。Here, as the area determination processing, for example, thinning processing is performed on input image data in order to increase the processing speed, or contour tracing is performed to detect continuity of black pixels. Then, a rectangle circumscribing the region where the black pixels are continuous is extracted, and the size of the rectangle and the proportion of the black pixels present in the rectangle are determined by the size of the region. Is determined.

【００２７】一般に、文字画像は縦書きと横書きの場合
があるが、これは上記の文字と判定された矩形領域との
距離を調べ、水平方向との距離が小さい場合には横書
き、垂直方向の距離が小さい場合には縦書きと判定する
ことができる。In general, a character image may be written vertically or horizontally. The distance between the character and the rectangular area determined to be the character is checked. If the distance between the character image and the horizontal direction is small, the character image is written horizontally and vertically. If the distance is short, it can be determined that the document is written vertically.

【００２８】また、写真等の場合には、矩形の大きさが
ある程度以上になり、且つ黒画素数が多い場合であり、
表の場合には、矩形が整然と並んでいて、且つ矩形内の
黒画素数が占める割合が低いので容易に判断できる。In the case of a photograph or the like, the size of the rectangle is a certain size or more and the number of black pixels is large.
In the case of the table, the rectangles are arranged neatly, and the ratio of the number of black pixels in the rectangle is low, so that it can be easily determined.

【００２９】本実施形態では以上のようにして領域分割
するものとするが、これ以外のアルゴリズムに従っても
良いのは勿論である。要するに、本実施形態では少なく
とも文字イメージ部分と非文字部分とを識別できれば良
い。In the present embodiment, the region is divided as described above, but it goes without saying that other algorithms may be used. In short, in the present embodiment, it is sufficient that at least the character image portion and the non-character portion can be identified.

【００３０】また、場合によっては、個々の領域をユー
ザーが表示画面を見て、ポインティングデバイスでもっ
て領域を設定し、種別を指定或は修正するようにしても
良い。In some cases, the user may view the display screen of each area, set the area using a pointing device, and specify or correct the type.

【００３１】以上の結果、領域分割回路４は領域メモリ
５に例えば図３に示すようなテーブルを構築する。As a result, the area dividing circuit 4 constructs a table as shown in FIG. 3 in the area memory 5, for example.

【００３２】図３に示すテーブルにおいて、第１フィー
ルドは個々の領域に付けられた領域番号であり、第２フ
ィールドはその領域（矩形）を特定する情報（左上隅と
右下隅の座標）である。また、第３フィールドは個々の
領域の種別が格納され、本実施形態では文字イメージで
あることを示す“テキスト”と、写真画像等の非文字で
あることを示す“非テキスト”の２種類としている。In the table shown in FIG. 3, the first field is an area number assigned to each area, and the second field is information (coordinates of the upper left corner and the lower right corner) for specifying the area (rectangle). . In the third field, the type of each area is stored. In the present embodiment, “text” indicating a character image and “non-text” indicating a non-character such as a photographic image are used. I have.

【００３３】次に、上述の領域分割回路４による領域分
割処理が完了すると、システム制御回路１２は領域メモ
リ５を参照して、テキスト領域として判定された領域の
イメージデータをメモリ回路３より読み出し（ステップ
Ｓ５）、そのデータを認識回路６に出力させる。このと
き、領域分割回路４は単にバイパスするのみである。Next, when the area dividing process by the above-described area dividing circuit 4 is completed, the system control circuit 12 refers to the area memory 5 and reads out the image data of the area determined as the text area from the memory circuit 3 ( Step S5), the data is output to the recognition circuit 6. At this time, the area dividing circuit 4 simply bypasses.

【００３４】認識回路６は、与えられたイメージデータ
から文字イメージを切り出し、その切り出した文字イメ
ージから特徴情報を抽出し、辞書メモリ８内の標準特徴
情報と比較することで（ステップＳ６）、公知の文字認
識処理を行う。The recognition circuit 6 cuts out a character image from the given image data, extracts feature information from the cut-out character image, and compares it with the standard feature information in the dictionary memory 8 (step S6). Of character recognition processing.

【００３５】一般に、このような文字認識処理では、認
識対象の文字イメージから得られる特徴情報（特徴ベク
トル）と、認識辞書（辞書メモリ８）に記憶されている
標準的な特徴情報との距離を求めていき、最も距離が近
い標準特徴情報を有する文字（文字コード）を第１候補
とし、以下第２候補、第３候補・・・として出力する。
このとき、最大候補数はＮ個とし、それ以上の候補は出
力しないようにしている。In general, in such a character recognition process, the distance between feature information (feature vector) obtained from a character image to be recognized and standard feature information stored in a recognition dictionary (dictionary memory 8) is determined. The character (character code) having the standard feature information having the shortest distance is determined as the first candidate, and is output as the second candidate, the third candidate, and so on.
At this time, the maximum number of candidates is set to N, and no more candidates are output.

【００３６】次に、認識回路６は、上記認識処理して得
られた第１候補とその他の候補を認識メモリ７に出力す
る。このとき、システム制御回路１２は、認識された文
字イメージの位置情報（文書画像中のどの位置に存在し
た文字かを示す情報）を認識回路６に渡し、その情報も
認識メモリ７に格納させる（ステップＳ７）。この結
果、認識メモリ７には図４に示すようなテーブルが構築
される。Next, the recognition circuit 6 outputs the first candidate and other candidates obtained by the above-described recognition processing to the recognition memory 7. At this time, the system control circuit 12 transfers the position information of the recognized character image (information indicating the position of the character in the document image) to the recognition circuit 6 and stores the information in the recognition memory 7 ( Step S7). As a result, a table as shown in FIG.

【００３７】図４に示すテーブルにおいて、第１フィー
ルドにはどの領域番号（図３参照）の文字かを示す情報
が、第２フィールドには文字イメージの位置情報が、第
３フィールドには文字コード（認識結果得られた候補文
字で第１候補，第２候補，・・・）がそれぞれ格納され
る。In the table shown in FIG. 4, information indicating which area number (see FIG. 3) the character is is in the first field, the position information of the character image in the second field, and the character code in the third field. (A first candidate, a second candidate,... With candidate characters obtained as a result of recognition) are respectively stored.

【００３８】なお、ここでは個々の文字の大きさがほぼ
一定であるものとしているが、異なるサイズ文字を有す
る文書画像を認識する場合には、図４のテーブルにおけ
る座標位置情報は二つ必要になる。すなわち、例えば左
上隅と右下隅の座標データである。Although the size of each character is assumed to be substantially constant here, when recognizing a document image having characters of different sizes, two pieces of coordinate position information in the table of FIG. Become. That is, for example, the coordinate data of the upper left corner and the lower right corner.

【００３９】次に、上記の如く、文書画像中の文字領域
に対する文字認識が完了すると、入力した文書画像中の
文字以外と判定された領域についてはそのビットマップ
データを、文字認識された部分に関しては認識メモリ７
に記憶されたテーブルの内容をそれぞれ外部記憶装置１
１にファイリングする。また、文書画像の種別や位置を
表している、領域メモリ５に記憶されたテーブルの内容
も同時にファイリングする（ステップＳ８）。Next, as described above, when the character recognition for the character area in the document image is completed, the bitmap data for the area determined to be other than the character in the input document image is replaced by the bit map data for the character recognized part. Is the recognition memory 7
The contents of the table stored in the external storage device 1
File 1 In addition, the contents of the table stored in the area memory 5 indicating the type and position of the document image are also simultaneously filed (step S8).

【００４０】ここで、文書画像が複数枚ある場合には、
例えばスキャナー回路１に設けられた自動原稿搬送機の
原稿載置センサーにより文書画像があるか否かの判断を
行い、もしあれば以上述べた動作を繰り返し行う（ステ
ップＳ９）。Here, when there are a plurality of document images,
For example, it is determined whether or not there is a document image by a document placement sensor of the automatic document feeder provided in the scanner circuit 1, and if so, the operation described above is repeated (step S9).

【００４１】また、不図示のキーボード等の指示を受
け、他のアプリケーション等により認識した文字コード
情報をプレーンテキストとして利用する場合は、外部記
憶装置１１より任意に選択した画像を読み出し、これを
領域分割結果に基づいた画像として表示装置１０に表示
させる（ステップＳ１０）。これらのファイリング処理
及び表示処理は、システム制御回路１２の制御の下で情
報処理回路９が行う。When an instruction from a keyboard or the like (not shown) is received and character code information recognized by another application or the like is used as plain text, an image arbitrarily selected from the external storage device 11 is read, and the read image is stored in the area. The image is displayed on the display device 10 based on the division result (step S10). The filing process and the display process are performed by the information processing circuit 9 under the control of the system control circuit 12.

【００４２】次に、表示処理について説明する。ここで
は説明を簡単にするため、本実施形態における表示装置
１０は汎用モニターとし、情報処理回路９内には表示画
像を格納するバッファメモリが備えられているものとす
る。Next, the display processing will be described. Here, for simplicity of explanation, it is assumed that the display device 10 in the present embodiment is a general-purpose monitor, and the information processing circuit 9 includes a buffer memory for storing a display image.

【００４３】非文字イメージ（写真など）についてはそ
のまま上記のバッファメモリに展開し、文字イメージに
対しては認識結果の第１候補の文字コードに基づいて不
図示のフォントメモリよりフォントデータを取りだして
文字パターンを発生させ、認識メモリ７に格納されてい
る図４に示すテーブルの位置情報に従って対応する位置
に展開する。A non-character image (such as a photograph) is directly developed in the above buffer memory, and font data is extracted from a font memory (not shown) based on the first candidate character code as a result of recognition for the character image. A character pattern is generated and developed at a corresponding position according to the position information of the table shown in FIG.

【００４４】このとき、図３のテーブルの第３フィール
ドに個々の領域の種別が格納されているので、テキスト
領域についてのみ、第１フィールドに格納された領域番
号及び第２フィールドに格納された領域（矩形）を特定
する情報により、バッファメモリに展開された画像に認
識処理が行われた順序を表す番号を付加するとともに、
文字イメージ領域を表す枠を付加した後、表示装置１０
に表示する。At this time, since the type of each area is stored in the third field of the table of FIG. 3, only the text area has the area number stored in the first field and the area stored in the second field. According to the information specifying the (rectangle), a number indicating the order in which the recognition processing was performed is added to the image developed in the buffer memory,
After adding a frame representing the character image area, the display device 10
To be displayed.

【００４５】図５は原稿画像と表示画像の関係の具体例
を示したものである。同図中、５０は文書画像（もしく
はメモリ回路３に記憶された画像データ）を示してお
り、図示の如く、文書画像は写真画像と文字とが混在し
たものとなっている。FIG. 5 shows a specific example of the relationship between a document image and a display image. In the figure, reference numeral 50 denotes a document image (or image data stored in the memory circuit 3). As shown in the figure, the document image is a mixture of a photographic image and characters.

【００４６】また、５１は表示装置１０の表示内容を示
し、５２はこのとき利用可能なプレーンテキストを示し
ており、認識処理の順序を誤ったために文脈が前後して
いる。Reference numeral 51 denotes the display content of the display device 10, and reference numeral 52 denotes a plain text that can be used at this time. The context is shifted because the order of the recognition processing is incorrect.

【００４７】そこで、不図示のキーボード等により、５
３の表示内容のように、表示された認識処理の順序を変
更指示することによって、システム制御回路１２の制御
の下で情報処理回路９は外部記憶装置１１に記憶された
文字コードの順序を変更し、５４に示すようなプレーン
テキストを形成し、外部記憶装置１１に別ファイルとし
て記憶する。Therefore, a keyboard (not shown) or the like is used to
By instructing to change the order of the displayed recognition processing as in the display contents of 3, the information processing circuit 9 changes the order of the character codes stored in the external storage device 11 under the control of the system control circuit 12. Then, a plain text as shown in 54 is formed and stored in the external storage device 11 as a separate file.

【００４８】ここで、文字コードの順序を変更すると
き、例えば不図示の表の中に複数の文字部が含まれてい
る場合など、表全体の挿入位置を変更したい場合がある
が、その際、表内の複数の文字部を結合させて一つの文
字部として扱うことが可能であり、扱う文字部の数を削
減することができる。Here, when changing the order of the character codes, there are cases where it is desired to change the insertion position of the entire table, for example, when a table (not shown) includes a plurality of character portions. , A plurality of character parts in the table can be combined and treated as one character part, and the number of character parts to be handled can be reduced.

【００４９】このようにして、外部記憶装置１１に記憶
された別ファイルを他のアプリケーションより読み込む
ことにより、文脈を正しく訂正した文字コード情報がプ
レーンテキストとして利用される。In this way, by reading another file stored in the external storage device 11 from another application, the character code information whose context has been correctly corrected is used as plain text.

【００５０】図６を用い、もう少し詳しく説明する。A more detailed description will be given with reference to FIG.

【００５１】まず、前記した表示装置１０の表示内容に
付加された認識処理が行われた順序を表す番号について
であるが、以下のようにして作成している。First, regarding the numbers added to the display contents of the display device 10 and indicating the order in which the recognition processing has been performed, the numbers are created as follows.

【００５２】図６ａ）には５１の表示内容に相当する５
の領域メモリに構築されたテーブル内容を示している
が、これより第１フィールドの領域番号から非テキスト
領域を除いて、テキスト領域のみの領域番号を選択し、
各々の順序関係を保存しながら空き番号がなくなるよう
新規領域番号を設定し図６ｂ）に示すような対応表を作
成する。FIG. 6A) shows 5 corresponding to the display content of 51.
The table contents constructed in the area memory are shown, but the area number of the text area only is selected by excluding the non-text area from the area number of the first field,
A new area number is set such that there is no empty number while preserving the respective order relations, and a correspondence table as shown in FIG. 6B) is created.

【００５３】この対応表において、第１フィールドは図
６ａ）の第１フィールドにおけるテキスト領域のみの領
域番号であり、第２フィールドは前記した新規領域番号
であり、これが５１の表示内容に付加した認識処理が行
われた順序を表す番号に相当する。In this correspondence table, the first field is the area number of only the text area in the first field of FIG. 6A, and the second field is the above-mentioned new area number. This corresponds to a number indicating the order in which the processing was performed.

【００５４】従って、前記５３の表示内容で説明したよ
うに認識処理の順番を変更し、文脈を修正する場合は、
例えば、不図示のキーボード等を用いて指示することに
より認識処理が行われた順序を表す新規領域番号を図６
ｃ）に示す用に修正することにより行うことが可能であ
る。ここにおいては、２番目と３番目に行われた認識処
理の順番を逆転させているが、対応表の第２フィールド
に対応する第１フィールドの領域番号から、図４のテー
ブルを参照して領域番号に対応する文字コードを抽出す
る事ができる。このようにして、修正した新規領域番号
の順番で文字コードを外部記憶装置１１より読み出し、
５４に示すようなプレーンテキストが形成されることに
なる。また、前記した表内の複数の文字部を結合させて
１つの文字部として扱う場合を図７により説明する。Therefore, when the order of the recognition processing is changed and the context is corrected as described in the display contents of 53,
For example, a new area number indicating the order in which the recognition processing was performed by giving an instruction using a keyboard (not shown) or the like is shown in FIG.
This can be done by modifying as shown in c). Here, the order of the second and third recognition processes is reversed, but the area number of the first field corresponding to the second field of the correspondence table is referred to by referring to the table of FIG. The character code corresponding to the number can be extracted. In this way, the character codes are read from the external storage device 11 in the order of the corrected new area numbers,
A plain text as shown at 54 will be formed. A case where a plurality of character parts in the above-described table are combined and treated as one character part will be described with reference to FIG.

【００５５】図７ａ）は５の領域メモリに構築されたテ
ーブル内容を示していて、領域番号の４〜７に対応する
文字部が表内の文字部とする。FIG. 7A shows the contents of the table constructed in the area memory of area 5, and the character parts corresponding to the area numbers 4 to 7 are the character parts in the table.

【００５６】図７ｂ）は修正前、図７ｃ）は修正後の対
応表であるが、これらを比較すると第１フィールドの４
〜７に対応する文字部における第２フィールドの値が等
しく変更され、しかも第１フィールドの２に対応する文
字部における第２フィールドの値と順序関係が逆転して
いる。これは表内の文字部が一括して第１フィールドの
１と２の文字部の間に挿入されることを示している。FIG. 7B) shows the correspondence table before correction, and FIG. 7C) shows the correspondence table after correction.
The value of the second field in the character portion corresponding to .about.7 is changed equally, and the order of the second field in the character portion corresponding to 2 of the first field is reversed. This indicates that the character part in the table is inserted between the character parts 1 and 2 of the first field all together.

【００５７】このようにして表内の複数の文字部に対し
て第２フィールドの値を等しくしておき、扱いを連動さ
せることができる。In this way, the value of the second field can be made equal for a plurality of character parts in the table, and the handling can be linked.

【００５８】ただし、連動させる文字部の順序を入れ替
える場合も考えられ、図７ｃ）の対応表に第３フィール
ドを拡張し、表内の文字部の認識処理の順序を記述して
いる。この拡張部分の順序を変更することで順序を変更
する。However, it is conceivable that the order of the character parts to be linked is changed, and the third field is extended to the correspondence table shown in FIG. 7C) to describe the order of the recognition processing of the character parts in the table. The order is changed by changing the order of the extensions.

【００５９】以上の結果、例えばファイリングされた文
字認識結果を利用したい場合には、他のアプリケーショ
ンに移動する前に、文字部の認識結果が文脈通りになっ
ているかを容易に認識することができ、それが満足でき
ないものであれば、不図示のキーボード等で適宜認識結
果を訂正することが可能である。As described above, for example, when it is desired to use the filed character recognition result, it is possible to easily recognize whether or not the recognition result of the character part is in context before moving to another application. If it is not satisfactory, the recognition result can be corrected appropriately using a keyboard (not shown) or the like.

【００６０】なお、上記の例では、認識結果の文字コー
ドに基づいて文字パターンを発生させて表示したが、全
画像のビットマップをファイリングすることにより、そ
のまま表示するようにしても良い。但し、この場合に
も、認識処理が行われた順序を表す番号を付加するとと
もに、文字部ごとに文字イメージ領域を表す枠を付加す
る。In the above example, a character pattern is generated and displayed based on the character code of the recognition result. However, it may be displayed as it is by filing the bit map of all images. However, also in this case, a number indicating the order in which the recognition processing has been performed is added, and a frame indicating a character image area is added for each character portion.

【００６１】また、上記の例では、認識処理が行われた
順序を数字により表し、また文字イメージ領域を表すの
に枠により囲んだが、これに限らず、順序を表すのにア
ルファベット等を使用したり、枠のかわりに網掛けや表
示色を変えたりするなどしても良い。すなわち、区別で
きさえすれば、いかなる表示形態にしても良い。In the above example, the order in which the recognition processing was performed is represented by a numeral, and the character image area is surrounded by a frame. However, the present invention is not limited to this, and alphabets and the like are used to represent the order. Alternatively, shading or display color may be changed instead of the frame. That is, any display form may be used as long as it can be distinguished.

【００６２】また、上記の例では、領域分割結果に基づ
く画像の表示及び確認をすることにより、文脈を訂正し
た後、プレーンテキストを別ファイルとして記憶してい
るが、その際、同時に既にファイリングされた各テーブ
ル内容を前記対応表を作成・記憶することにより修正す
るようにしても良い。In the above example, the plain text is stored as a separate file after the context is corrected by displaying and confirming the image based on the area division result. The contents of each table may be modified by creating and storing the correspondence table.

【００６３】また、上記の例では、領域分割結果に基づ
く画像を表示する例を示したが、プリント出力装置によ
りプリントしても良い。この場合、複数枚出力すること
により、後で一括して訂正処理を行うことが可能であ
る。Further, in the above example, an example in which an image based on the region division result is displayed has been described, but the image may be printed by a print output device. In this case, by outputting a plurality of sheets, it is possible to perform correction processing collectively later.

【００６４】更に、本実施形態で示した装置を汎用の情
報処理装置に適用しても良い。この場合の装置構成とし
ては、例えば図８に示すような構成となる。Further, the apparatus shown in this embodiment may be applied to a general-purpose information processing apparatus. The device configuration in this case is, for example, as shown in FIG.

【００６５】図８において、１００は装置あるいはシス
テム全体を制御するＣＰＵ、１０１はブートプログラム
やＢＩＯＳ等を記憶するＲＯＭ、１０２はＣＰＵ１００
のワーク領域として使用するＲＡＭであり、図１のメモ
リ回路３に対応するイメージ格納エリア１０３、領域メ
モリ５に対応する領域種別エリア１０４、認識メモリ７
に対応する１０５の認識結果エリア１０５をはじめ、以
下のデータあるいはプログラムが展開もしくは確保され
ている。Referring to FIG. 8, reference numeral 100 denotes a CPU for controlling the apparatus or the entire system; 101, a ROM for storing a boot program and a BIOS;
The image storage area 103 corresponding to the memory circuit 3 of FIG. 1, the area type area 104 corresponding to the area memory 5, the recognition memory 7
The following data or programs are developed or secured, including the recognition result area 105 of 105 corresponding to.

【００６６】すなわち、１０６は一連の処理手順に対応
するプログラム、１０７は印刷する時に使用する出力バ
ッファエリア、１０８は制御プログラム１０６をはじめ
として本装置全体の制御を行うオペレーティングシステ
ム（ＯＳ）である。That is, 106 is a program corresponding to a series of processing procedures, 107 is an output buffer area used for printing, and 108 is an operating system (OS) for controlling the entire apparatus including the control program 106.

【００６７】また、１０９は原稿画像を読み取るための
イメージスキャナー、１１０はハードディスク装置や光
磁気ディスク装置などの大容量記憶装置であり、この中
にはオペレーティングシステム１０８等も記憶されてい
る。１１１は可搬性の記憶媒体からのデータのリードを
行うための装置であって、図示ではディスクドライブを
有したフロッピーディスク装置である。このフロッピー
ディスク装置１１１にセットするフロッピーディスク、
もしくはハードディスク装置等の記憶装置１１０のいず
れか一方に、上述の制御プログラム１０６が格納されて
いる。Reference numeral 109 denotes an image scanner for reading a document image, and 110 denotes a large-capacity storage device such as a hard disk device or a magneto-optical disk device, in which an operating system 108 and the like are also stored. An apparatus 111 for reading data from a portable storage medium is a floppy disk apparatus having a disk drive in the figure. A floppy disk set in the floppy disk device 111;
Alternatively, the control program 106 is stored in one of the storage devices 110 such as a hard disk device.

【００６８】１１２は表示画面に表示するビットマップ
イメージを展開するためのＶＲＡＭ、１１３はこのＶＲ
ＡＭ１１２に展開されたビットマップイメージを表示す
る表示器、１１４は認識結果を修正するための各種情報
を入力するためのキーボード、１１５は表示器１１３の
表示画面の中で所望の位置を指定したり、各種メニュー
の中の一つを選択したりするためポインティングデバイ
ス、１１６はプリンタである。Reference numeral 112 denotes a VRAM for expanding a bitmap image to be displayed on a display screen, and 113 denotes a VR.
A display for displaying the bitmap image developed on the AM 112, a keyboard 114 for inputting various information for correcting the recognition result, and a 115 for designating a desired position on the display screen of the display 113 And a pointing device 116 for selecting one of various menus.

【００６９】上記構成の装置における動作は、先に説明
した実施形態の動作とほぼ同じであるが、メモリコント
ロール回路２や領域分割回路４に相当する処理は、ＣＰ
Ｕ１００が制御プログラム１０６に従って行うことにな
る。The operation of the device having the above configuration is almost the same as that of the embodiment described above, but the processing corresponding to the memory control circuit 2 and the area dividing circuit 4 is the same as that of the CP.
U100 is performed according to the control program 106.

【００７０】このように、本発明は、汎用の情報処理装
置あるいはシステムに、先に説明した処理を実現するた
めのプログラムを外部から供給することでも達成でき
る。As described above, the present invention can also be achieved by externally supplying a program for implementing the above-described processing to a general-purpose information processing apparatus or system.

【００７１】したがって、本発明は、ホストコンピュー
タ，インターフェース，プリンタ等の複数の機器から構
成されるシステムに適用しても、複写機等の一つの機器
からなる装置に適用しても良い。Therefore, the present invention may be applied to a system including a plurality of devices such as a host computer, an interface, and a printer, or to an apparatus including a single device such as a copying machine.

【００７２】また、本発明は、システムあるいは装置に
記憶媒体に格納されたプログラムを供給することによっ
て達成されることは言うまでもない。It is needless to say that the present invention can be attained by supplying a program stored in a storage medium to a system or an apparatus.

【００７３】[0073]

【発明の効果】以上説明したように、本発明によれば、
入力した文書画像中の文字部を認識したときに、より正
しい認識結果を容易に得ることができるという効果があ
る。As described above, according to the present invention,
When a character portion in an input document image is recognized, a more correct recognition result can be easily obtained.

[Brief description of the drawings]

【図１】本発明に係る画像入出力装置の構成を示すブ
ロック図FIG. 1 is a block diagram illustrating a configuration of an image input / output device according to the present invention.

【図２】図１の装置の動作を示すフローチャートFIG. 2 is a flowchart showing the operation of the apparatus of FIG. 1;

【図３】図１の領域メモリに構築されるテーブルの内
容を示す図FIG. 3 is a view showing contents of a table constructed in an area memory of FIG. 1;

【図４】図１の認識メモリに構築されるテーブルの内
容を示す図FIG. 4 is a diagram showing contents of a table constructed in the recognition memory of FIG. 1;

【図５】実施形態における原稿画像と表示画像との関
係を示す説明図FIG. 5 is an explanatory diagram illustrating a relationship between a document image and a display image according to the embodiment;

【図６】実施形態におけるテーブル内容と対応表の関
係を示す図FIG. 6 is a diagram illustrating a relationship between table contents and a correspondence table in the embodiment.

【図７】実施形態におけるテーブル内容と対応表の関
係を示す図FIG. 7 is a diagram illustrating a relationship between table contents and a correspondence table according to the embodiment;

【図８】装置の他の構成を示すブロック図FIG. 8 is a block diagram showing another configuration of the device.

[Explanation of symbols]

１スキャナー回路（入力手段）２メモリコントロール回路３メモリ回路（記憶手段）４領域分割回路（抽出手段）５領域メモリ６認識回路（認識手段）７認識メモリ８辞書メモリ９情報処理回路（処理手段）１０表示装置（表示手段）１１外部記憶装置１２システム制御回路１３ＲＯＭ REFERENCE SIGNS LIST 1 scanner circuit (input means) 2 memory control circuit 3 memory circuit (storage means) 4 area dividing circuit (extraction means) 5 area memory 6 recognition circuit (recognition means) 7 recognition memory 8 dictionary memory 9 information processing circuit (processing means) Reference Signs List 10 display device (display means) 11 external storage device 12 system control circuit 13 ROM

Claims

[Claims]

1. An extracting means for extracting a character part from an input document image, a display means for displaying the extracted character part,
Processing means for reconstructing a recognition result of an arbitrary character portion selected from the character portions.

2. The method according to claim 1, wherein the processing unit changes the order of the recognition results of the plurality of selected character parts.
The image input / output device according to the above.

3. The method according to claim 1, wherein the processing unit combines the plurality of selected character parts into one character part.
Or the image input / output device according to 2.

4. The image input / output apparatus according to claim 1, wherein the display unit combines and displays an area representing a character portion with the image.

5. The image input / output apparatus according to claim 1, wherein the display means displays the order of the recognition processing in an area representing a character portion.

6. The image input / output device according to claim 4, wherein the region representing the character portion is a rectangular region circumscribing the character portion.

7. An extracting step of extracting a character part from an input document image, a displaying step of displaying the extracted character part,
Reconstructing a recognition result of an arbitrary character portion selected from the character portions.

8. The processing step according to claim 7, wherein the order of the recognition results of the selected plurality of character parts is changed.
Image input / output method described.

9. The processing step of combining a plurality of selected character parts into one character part.
Or the image input / output method according to 8.

10. The image input / output method according to claim 7, wherein the displaying step includes a step of combining and displaying an area representing a character portion with the image.

11. The image input / output method according to claim 7, wherein the displaying step includes a step of displaying an order of the recognition processing in an area representing a character portion.

12. The image input / output method according to claim 10, further comprising a display step of indicating that the region representing the character portion is a rectangular region circumscribing the character portion.