JPH1063869A

JPH1063869A - Document processor

Info

Publication number: JPH1063869A
Application number: JP8215639A
Authority: JP
Inventors: Hiroaki Kubota; 浩明久保田; Shigeyoshi Shimotsuji; 成佳下辻; Toshiaki Ishii; 利明石井; Tameo Yamamoto; 為雄山本
Original assignee: Toshiba Corp; Tokyo Electric Power Co Inc
Current assignee: Toshiba Corp; Tokyo Electric Power Co Holdings Inc
Priority date: 1996-08-15
Filing date: 1996-08-15
Publication date: 1998-03-06

Abstract

PROBLEM TO BE SOLVED: To reduce user's correcting operation in total by showing a correction place by similar area extraction and making a user confirm whether or not a correcting process is correct. SOLUTION: A similar area extraction part 205 extracts a similar area having similar structure from a registered document stored in a structure storage part according to frame information extracted by a structure analysis part 204. A structure substitution part 206 extracts differences between the structure having the similar area extracted by the extraction part 205 and the part corresponding to the similar area extracted by the analysis part 204 by making them correspond to each other and corrects the parts where the differences are present by substitution processing, etc. Further, a structure correction part 207 shows a corrected document obtained by the substitution part 206, receives user's confirmation input or indication input for manual structure correction by the user, and makes correction according to the indication input. Thus, the correcting process can be performed by the user's simple procedure.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、各種のドキュメン
トの画像を取り込んでデータ化するためのドキュメント
処理装置に関する。[0001] 1. Field of the Invention [0002] The present invention relates to a document processing apparatus for capturing images of various documents and converting the images into data.

【０００２】[0002]

【従来の技術】従来のドキュメント処理装置には、ドキ
ュメントを画像入力し、画像処理を施して罫線や文字領
域を推定することによって、ドキュメントの構造を認識
するものがある。この種のドキュメント処理装置では、
まず、紙の状態であるドキュメントをスキャナなどの画
像入力装置を利用して画像データとして入力する。入力
された画像データに対しては、２値化、細線化、線分近
似などの画像処理を施して、線分として抽出を行う。そ
して、抽出された線分の長さ、方向性、互いの距離など
の特徴量を利用して、罫線と文字領域とその他の成分に
分類する。2. Description of the Related Art Some conventional document processing apparatuses recognize a document structure by inputting a document image and performing image processing to estimate ruled lines and character areas. In this type of document processing device,
First, a document in a paper state is input as image data using an image input device such as a scanner. The input image data is subjected to image processing such as binarization, thinning, and line segment approximation, and is extracted as a line segment. Then, it classifies into ruled lines, character regions, and other components by using feature amounts such as the length, directionality, and distance between the extracted line segments.

【０００３】また、従来のドキュメント処理装置には、
罫線を組合わせて枠として抽出を行い、その枠内や枠の
近くに文字領域がある場合には、その文字領域に対して
文字認識を行うことによって文字コードに変換し、その
枠の意味的な構造を理解するものがある。[0003] Conventional document processing devices include:
When a ruled line is combined and extracted as a frame, and there is a character area within or near the frame, the character area is converted to a character code by performing character recognition, and the meaning of the frame is determined. There is something that understands the structure.

【０００４】また、従来のドキュメント処理装置には、
文字領域のサイズや互いの距離を考慮して、複数の文字
領域を統合し、一連の文字領域として抽出するものがあ
る。さらに、各領域の相互の位置関係を考慮して、段落
領域を定義し、ドキュメント全体のレイアウト構造をド
キュメントの構造情報として獲得するものがある。[0004] Conventional document processing devices include:
In some cases, a plurality of character regions are integrated in consideration of the size of the character regions and their distance from each other, and are extracted as a series of character regions. Further, there is a method in which a paragraph region is defined in consideration of a mutual positional relationship between the regions, and a layout structure of the entire document is obtained as document structure information.

【０００５】[0005]

【発明が解決しようとする課題】ところが、上述したよ
うな従来のドキュメント処理装置では、そのほとんどの
処理において誤認識あるいは誤判定が存在し、得られる
ドキュメントの構造情報に、間違って設定された部分が
数多く見受けられる。これは、入力されたドキュメント
の画像に雑音成分が加えられていたり、ドキュメントの
物理的な状況だけでは複数の構造解釈が存在し、曖昧な
箇所の判定が不可能であるからである。したがって、ユ
ーザによるかなりの量の修正作業を必要とする。However, in the conventional document processing apparatus as described above, erroneous recognition or erroneous determination exists in most of the processing, and the erroneously set portion is included in the obtained document structure information. Many are seen. This is because a noise component is added to the input document image or a plurality of structural interpretations exist only in the physical state of the document, and it is impossible to determine an ambiguous part. Therefore, it requires a considerable amount of correction work by the user.

【０００６】特開平７−２２００２３号公報には、入力
時の傾きに対応して正確にフォーマット情報を得ようと
する技術が開示されているが、この技術では、罫線の位
置ずれ補正を行うのみであり、帳票等に代表される記入
部分を含むドキュメントの記入領域に関する詳細設定を
修正することは不可能である。Japanese Unexamined Patent Publication No. Hei 7-220023 discloses a technique for accurately obtaining format information corresponding to a tilt at the time of input. However, this technique only corrects misregistration of a ruled line. Therefore, it is impossible to correct the detailed setting regarding the entry area of the document including the entry portion represented by the form or the like.

【０００７】本発明は、上記事情を考慮してなされたも
ので、ドキュメントの構造情報の認識において誤認識さ
れた箇所を自動修正し、ユーザによる修正作業の量を大
幅に削減することの可能なドキュメント処理装置を提供
することを目的とする。[0007] The present invention has been made in consideration of the above circumstances, and can automatically correct a misrecognized portion in recognizing document structure information, thereby greatly reducing the amount of correction work by a user. It is an object to provide a document processing device.

【０００８】[0008]

【課題を解決するための手段】本発明は、ドキュメント
を画像入力して所定の処理を行うドキュメント処理装置
において、ドキュメントの構造情報を格納する構造格納
手段と、画像入力された入力ドキュメントの構造情報を
設定する構造情報設定手段と、設定された前記入力ドキ
ュメントの構造情報に類似する領域を前記構造格納手段
に格納されているドキュメントから検索する類似領域検
出手段と、前記入力ドキュメントの構造情報と検索され
た類似領域の構造情報との差異に基づいて、前記入力ド
キュメントの構造情報を修正する構造修正手段とを備え
たことを特徴とする。SUMMARY OF THE INVENTION According to the present invention, there is provided a document processing apparatus for performing predetermined processing by inputting an image of a document, comprising: structure storage means for storing the structure information of the document; A similar area detecting means for searching a document stored in the structure storing means for an area similar to the set structure information of the input document; and a search for the structure information of the input document. And a structure correcting means for correcting the structure information of the input document based on the difference from the obtained similar region structure information.

【０００９】好ましくは、前記構造修正手段は、前記入
力ドキュメントの構造情報のうち、該入力ドキュメント
の構造情報と検索された類似領域の構造情報との間に差
異の存在する箇所を修正箇所として設定する修正箇所設
定手段と、前記修正箇所設定手段によって設定された修
正箇所に対応する構造情報を、前記類似領域の持つ構造
情報で置換することによって修正を行う構造置換手段と
を有することを特徴とする。Preferably, the structure correcting means sets, as a corrected portion, a portion of the input document structure information in which a difference exists between the structure information of the input document and the structure information of the searched similar area. Correction location setting means for performing the correction by replacing the structural information corresponding to the correction location set by the correction location setting means with the structural information of the similar area. I do.

【００１０】好ましくは、前記構造修正手段は、前記修
正箇所設定手段によって設定された修正箇所を呈示する
修正箇所呈示手段と、呈示された修正箇所に対する修正
の良否の確認入力を受け付ける修正確認手段とをさらに
有し、前記構造置換手段は、前記修正確認手段からの修
正すべき旨の確認入力を受けた場合に修正を行うことを
特徴とする。[0010] Preferably, the structure correcting means includes a correction point presenting means for presenting the correction point set by the correction point setting means, and a correction confirming means for accepting a confirmation input as to whether or not the correction is correct for the presented correction point. Wherein the structure replacement unit performs the correction when receiving a confirmation input from the correction confirmation unit that the structure should be corrected.

【００１１】好ましくは、前記構造修正手段は、前記類
似領域の構造情報を利用して修正された構造情報を修正
案として作成する修正構造作成手段と、作成された修正
案としての構造情報を呈示する修正構造表示手段と、表
示された修正案としての構造情報に対して修正の良否の
確認入力を受け付ける修正確認手段とを有し、前記修正
確認手段からの修正すべき旨の確認入力を受けた場合に
修正を行うことを特徴とする。Preferably, the structure modifying means presents modified structure creating means for creating structure information modified using the structure information of the similar area as a modification plan, and presents the structure information as the created modification plan. And a correction confirmation means for accepting a confirmation input on whether or not modification of the displayed structural information is correct, and receiving a confirmation input to the effect that modification is required from the modification confirmation means. It is characterized in that a correction is made in the event of a failure.

【００１２】好ましくは、前記ドキュメントをブロック
単位に分割するブロック抽出手段をさらに備え、前記類
似領域検出手段は、ブロック単位に類似領域を検索する
ことを特徴とする。Preferably, the apparatus further comprises block extracting means for dividing the document into blocks, wherein the similar area detecting means searches for similar areas in block units.

【００１３】好ましくは、入力されたドキュメントの画
像から手書き入力されるデータ記入領域を抽出するデー
タ記入領域抽出手段と、入力されたドキュメントの画像
から文字列を抽出する文字認識手段と、抽出された前記
データ記入領域と、抽出された文字列のうち該データ記
入領域を説明する文字列とを関連付ける関連文字列抽出
手段とをさらに備え、前記ブロック抽出手段は、関連付
けられた文字列をもとして前記データ記入領域を統合す
ることによって前記ブロックを抽出することを特徴とす
る。Preferably, data entry area extraction means for extracting a data entry area to be input by handwriting from the input document image, character recognition means for extracting a character string from the input document image, The data entry area further includes a related character string extraction unit that associates a character string that describes the data entry area among the extracted character strings, and the block extraction unit includes the associated character string as the The block is extracted by integrating data entry areas.

【００１４】なお、以上の各装置に係る発明は、方法に
係る発明としても成立する。本発明によれば、認識にお
いて間違った箇所を容易に修正でき、ユーザによる修正
作業の量を削減することができる。The invention relating to each of the above-described devices is also valid as an invention relating to a method. ADVANTAGE OF THE INVENTION According to this invention, the incorrect part in recognition can be easily corrected, and the amount of correction work by a user can be reduced.

【００１５】また、本発明によれば、ユーザに呈示した
修正箇所の修正が正しいものかどうか確認させることに
よって、ユーザの簡単な手続きで正確に修正を行うこと
が可能となり、全体的なユーザの修正作業を軽減するこ
とが可能となる。Further, according to the present invention, it is possible to make a correct correction by a simple procedure of the user by asking the user whether the correction of the correction portion presented is correct or not, and to improve the user's entirety. Correction work can be reduced.

【００１６】[0016]

【発明の実施の形態】以下、図面を参照しながら発明の
実施の形態を説明する。まず、本発明の第１の実施形態
に係るドキュメント処理装置について説明する。Embodiments of the present invention will be described below with reference to the drawings. First, a document processing apparatus according to a first embodiment of the present invention will be described.

【００１７】本実施形態では、各ドキュメントが持つフ
ォーマット（例えばデータ枠にデータが記入されていな
いブランク帳票）をデータベースに格納しておき、ドキ
ュメント（例えばデータ枠にデータを記入したデータ帳
票）の読み取りを行う場合には、そのフォーマットに応
じて予め登録しておいた処理手順を行うドキュメント処
理装置において本発明をフォーマット登録処理を行う部
分（後述する構造登録部１０２）に適用したものを例と
して説明する。In this embodiment, the format (for example, a blank form in which data is not entered in the data frame) of each document is stored in a database, and the document (for example, a data form in which data is entered in the data frame) is read. In this case, a document processing apparatus that performs a processing procedure registered in advance according to the format, in which the present invention is applied to a part (a structure registration unit 102 described later) that performs a format registration process, will be described as an example. I do.

【００１８】図１は、本実施形態に係るドキュメント処
理装置の構成を示す図である。本ドキュメント装置は、
画像入力部１０１、構造登録部１０２、構造格納部１０
３、手続設定部１０４、構造識別部１０５、手続処理部
１０６、データ出力部１０７を備えている。FIG. 1 is a diagram showing a configuration of a document processing apparatus according to the present embodiment. This document device
Image input unit 101, structure registration unit 102, structure storage unit 10
3, a procedure setting unit 104, a structure identification unit 105, a procedure processing unit 106, and a data output unit 107.

【００１９】画像入力部１０１は、ドキュメントの画像
をデジタルデータとして取り込むためのものであり、例
えばスキャナ装置などを用いて構成される。構造登録部
１０２は、入力されたドキュメント（ブランク帳票）の
フォーマットを画像処理、図形認識を利用して構造の認
識を行うとともに、認識できなかった箇所（認識結果が
既登録の構造と相違する箇所）の修正を行うことによっ
て、構造情報の設定を行う。The image input unit 101 is for taking in an image of a document as digital data, and is constituted by using, for example, a scanner. The structure registration unit 102 recognizes the format of the input document (blank form) using image processing and graphic recognition, and recognizes a part that cannot be recognized (a part whose recognition result is different from the registered structure). The structure information is set by performing the correction in ()).

【００２０】構造格納部１０３は、ドキュメントのフォ
ーマットの構造情報をデータベースとして格納するため
のものである。手続設定部１０４は、データベースとし
て格納されたドキュメント（ブランク帳票）と同じ種類
のドキュメント（データ帳票）が入力された場合にいか
なる処理を行うべきかを設定しておく。この設定は、例
えば登録された各ドキュメント単位あるいは、一定のド
キュメント群単位に行われる。The structure storage unit 103 stores structure information of a document format as a database. The procedure setting unit 104 sets what processing should be performed when a document (data form) of the same type as a document (blank form) stored as a database is input. This setting is performed, for example, for each registered document or for a certain document group.

【００２１】構造識別部１０５は、処理すべきドキュメ
ント（データ帳票）が画像入力部１０１において入力さ
れた場合に、入力されたドキュメントと同種のドキュメ
ントを、構造格納部１０３内のデータベースに既に登録
されているものから識別する。When a document (data form) to be processed is input in the image input unit 101, the structure identification unit 105 registers a document of the same type as the input document in a database in the structure storage unit 103. From what you are doing.

【００２２】手続処理部１０６は、識別されたドキュメ
ントの種類に応じて手続設定部１０４で設定された手続
を行う。データ出力部１０７は、手続処理部１０６によ
って処理されたデータを画面やファイルなどに出力す
る。The procedure processing unit 106 performs the procedure set by the procedure setting unit 104 according to the type of the identified document. The data output unit 107 outputs the data processed by the procedure processing unit 106 to a screen or a file.

【００２３】図２は、本実施形態に係る構造登録部１０
２の内部構成の一例を示す図である。本構造登録部１０
２は、画像処理部２０１、図形処理部２０２、文字認識
部２０３、構造解析部２０４、類似領域抽出部２０５、
構造置換部２０６、構造修正部２０７を備えている。FIG. 2 shows a structure registration unit 10 according to this embodiment.
2 is a diagram illustrating an example of the internal configuration of FIG. Main structure registration unit 10
2 is an image processing unit 201, a graphic processing unit 202, a character recognition unit 203, a structure analysis unit 204, a similar region extraction unit 205,
A structure replacement unit 206 and a structure correction unit 207 are provided.

【００２４】画像処理部２０１は、画像入力部１０１に
よって入力されたドキュメント（ブランク帳票）の画像
から線分の抽出、文字領域の抽出を行う。図形処理部２
０２は、抽出された線分を分類し、枠の抽出、文字領域
の抽出を行う。The image processing unit 201 extracts a line segment and a character area from an image of a document (blank form) input by the image input unit 101. Figure processing unit 2
02 classifies the extracted line segment, extracts a frame, and extracts a character area.

【００２５】文字認識部２０３は、抽出された文字領域
内の文字画像を文字コードに変換する。構造解析部２０
４は、抽出された枠の詳細情報を獲得する。The character recognition unit 203 converts the character image in the extracted character area into a character code. Structural analysis unit 20
4 acquires the detailed information of the extracted frame.

【００２６】類似領域抽出部２０５は、構造解析部２０
４によって抽出された枠情報に基づいて、構造格納部１
０３に格納されている既登録ドキュメントから類似する
構造を持つ類似領域を抽出する。The similar area extracting unit 205
4 based on the frame information extracted by
A similar area having a similar structure is extracted from the registered documents stored in the storage area 03.

【００２７】構造置換部２０６は、類似領域抽出部２０
５によって抽出された類似領域が持つ構造と、構造解析
部２０４によって抽出された構造のうち該類似領域に対
応する部分との対応付けを行うことによって、それらの
間の差異を抽出し、差異の存在する部分を置換処理等に
よって修正する。The structure replacement unit 206 includes the similar area extraction unit 20
5 and a portion corresponding to the similar region in the structure extracted by the structure analysis unit 204, thereby extracting a difference between the structures. The existing part is corrected by a replacement process or the like.

【００２８】構造修正部２０７は、構造置換部２０６に
よる修正後のドキュメントを提示して、ユーザの確認入
力を受け（正しく認識された場合）、あるいはユーザに
よるマニュアルでの構造修正のための指示入力（追加、
削除、置換等）を受け付け、この指示入力に基づいて修
正を行う（誤った部分が存在する場合）。The structure correcting unit 207 presents the document after the correction by the structure replacing unit 206, receives a user's confirmation input (when it is correctly recognized), or inputs an instruction for manual structure correction by the user. (add to,
Deletion, replacement, etc.) are accepted, and correction is performed based on the instruction input (when an erroneous portion exists).

【００２９】以下、図３を参照しながらフォーマット登
録処理について説明する。図３は、本実施形態の構造登
録部１０２で行われる各ドキュメントのフォーマット登
録処理の一例を示すフローチャートである。Hereinafter, the format registration processing will be described with reference to FIG. FIG. 3 is a flowchart illustrating an example of a format registration process of each document performed by the structure registration unit 102 according to the present embodiment.

【００３０】まず、フォーマット登録のためにドキュメ
ントの画像が入力されると（ステップＳ２１０）、画像
処理部２０１において線分抽出を行う（ステップＳ２１
１）。ここでは、入力された２値画像に対して細線化処
理あるいは輪郭線追跡処理を行うことによって線素を抽
出し、線分近似処理を行うことによって全ての画像成分
を線分に変換する。First, when a document image is input for format registration (step S210), a line segment is extracted in the image processing unit 201 (step S21).
1). Here, line elements are extracted by performing thinning processing or contour tracing processing on the input binary image, and all image components are converted into line segments by performing line segment approximation processing.

【００３１】次いで、図形処理部２０２において、抽出
された線分から長方形の枠を抽出する（ステップＳ２１
２）。長方形の枠の抽出の一例として、特願平７−２８
３３３に開示されている方法がある。この方法では、互
いに垂直に交差する線分より交点を計測し、これらの交
点より４点を抽出した場合にこれらの４点によって長方
形を形成するかを検査することにより長方形枠を抽出す
る方法である。同時に長方形枠に使われなかった残りの
線分から、その長さ、方向性を利用して罫線、下線を抽
出する。Next, the graphic processing unit 202 extracts a rectangular frame from the extracted line segment (step S21).
2). As an example of extracting a rectangular frame, Japanese Patent Application No. 7-28
333. In this method, a rectangular frame is extracted by measuring intersections from line segments that intersect perpendicularly to each other, and examining whether a rectangle is formed by these four points when four points are extracted from these intersections. is there. At the same time, ruled lines and underlines are extracted from the remaining line segments not used for the rectangular frame by using their length and directionality.

【００３２】また、枠や罫線に使われなかった長さの短
い線分は、文字を形成する線分と判断し、残された線分
の分布状況から文字領域を抽出する（ステップＳ２１
３）。そして、文字認識部２０３により、抽出された文
字領域内に存在する活字文字の画像をパターン認識技術
を用いて文字コードに変換する（ステップＳ２１４）。A short line segment not used for a frame or a ruled line is determined as a line segment forming a character, and a character region is extracted from the distribution state of the remaining line segments (step S21).
3). Then, the character recognition unit 203 converts the image of the printed characters present in the extracted character area into a character code by using a pattern recognition technique (step S214).

【００３３】次に、構造解析部２０４において、抽出さ
れた枠について、周辺の枠や文字情報を利用して、枠に
記入部分が含まれるかどうか、枠に含まれる記入部分に
はどのような種類の文字が記入されるか、あるいはどの
ような意味の言葉が記入されるか、といった属性を抽出
することにより、このドキュメントの構造解析処理を行
う（ステップＳ２１５）。この構造解析処理としては例
えば特願平７−２８３３４に開示されたものを用いるこ
とができる。Next, in the structural analysis unit 204, the extracted frame is determined by using surrounding frames and character information to determine whether the frame includes an entry portion, and what kind of entry portion is included in the frame. The structure analysis process of this document is performed by extracting attributes such as the type of character to be entered or the meaning of words to be entered (step S215). As the structure analysis processing, for example, the one disclosed in Japanese Patent Application No. 7-28334 can be used.

【００３４】構造解析処理が終了すると、入力されたド
キュメントのフォーマット情報の認識処理が一旦終了し
たことになる。ところが、入力されたドキュメントにお
いて認識結果に誤りが存在する場合が一般的である。こ
れは、画像の情報からだけでは構造解析の詳細な判断が
不可能な箇所が存在したり、元々の画像にノイズなどの
影響で欠損が生じていたりするためである。When the structural analysis process is completed, the process of recognizing the format information of the input document has been completed. However, it is common that there is an error in the recognition result in the input document. This is because there are places where detailed determination of the structural analysis cannot be made only from the information of the image, or the original image is missing due to the influence of noise or the like.

【００３５】本実施形態では、これに対応するために、
既登録のドキュメントのフォーマット情報をフォーマッ
ト設定時の辞書として利用する。これを行うために、ま
ず、構造格納部１０３に登録されている既登録のドキュ
メントから現在入力設定中のドキュメントの類似領域を
抽出する（ステップＳ２１６）。In this embodiment, in order to cope with this,
The format information of the registered document is used as a dictionary at the time of format setting. In order to do this, first, a similar area of the document whose input is currently set is extracted from the registered documents registered in the structure storage unit 103 (step S216).

【００３６】ここでは、ドキュメントのページ単位に類
似の情報を獲得するのではなく、ドキュメントを内容に
依存した領域に分割し、その領域単位に類似の情報を抽
出する方法を説明する。Here, a method will be described in which, instead of acquiring similar information in page units of the document, the document is divided into regions depending on the contents and similar information is extracted in units of regions.

【００３７】なお、ドキュメントを内容に依存した領域
に分割するために、ドキュメント内にブロックを定義す
るものとし、これをドキュメント登録時に抽出し、登録
しておくものとする。ドキュメントからブロックを抽出
し、データベースとして登録する処理の一例を図４を用
いて後に説明する。また、ブロック単位に類似領域を抽
出する処理の一例を図６を用いて後に説明する。In order to divide the document into regions depending on the contents, it is assumed that a block is defined in the document, which is extracted and registered at the time of document registration. An example of a process of extracting a block from a document and registering the block as a database will be described later with reference to FIG. An example of a process for extracting a similar region in block units will be described later with reference to FIG.

【００３８】さて、ドキュメントの類似領域が抽出され
た場合は、抽出された類似領域との差異を検出し、差異
が生じている部分について類似領域の情報を置換するこ
とによって、自動修正を行う（ステップＳ２１７）。こ
こで行われる修正処理の一例を図７を用いて後に説明す
る。When a similar region of a document is extracted, a difference from the extracted similar region is detected, and information of the similar region is replaced with respect to a portion where the difference occurs, thereby performing automatic correction ( Step S217). An example of the correction process performed here will be described later with reference to FIG.

【００３９】また、自動で修正しきれなかった部分があ
れば、ユーザによりマニュアル修正することができる。
修正が終了したフォーマット情報は、新規のフォーマッ
トとしてデータベースに格納される（ステップＳ２１
８）。If there is any part that cannot be automatically corrected, the user can manually correct it.
The corrected format information is stored in the database as a new format (step S21).
8).

【００４０】以上により、ドキュメントのフォーマット
登録処理を終了する。次に、図４を参照しながら、ドキ
ュメントからブロックを抽出し、データベースとして登
録する処理について説明する。図４は、設定されたドキ
ュメントのフォーマット情報を利用してブロックを抽出
して、データベースの一部として登録するための処理の
一例を示すフローチャートである。Thus, the document format registration processing is completed. Next, a process of extracting a block from a document and registering it as a database will be described with reference to FIG. FIG. 4 is a flowchart illustrating an example of a process for extracting a block using the set document format information and registering the extracted block as a part of a database.

【００４１】この処理は、図１の構造格納部１０３にお
いて、入力されたドキュメントに対してフォーマットの
設定が全て終了した段階で行われる。まず、設定が完了
したフォーマットの情報を入力する（ステップＳ３０
１）。This processing is performed at the stage when all the format settings have been completed for the input document in the structure storage unit 103 of FIG. First, information on the format for which setting has been completed is input (step S30).
1).

【００４２】ブロックの抽出に用いられる情報は、フォ
ーマット設定において抽出された各々の枠に関連付けさ
れた文字列の情報である。同じ文字列を関連付けている
枠同士を結合することで小ブロックの抽出を行う（ステ
ップＳ３０２）。このとき、抽出されたブロックを以下
に説明する特別な場合を除いてレベル０のブロックとし
ておく。The information used for extracting a block is information on a character string associated with each frame extracted in the format setting. Small blocks are extracted by combining frames that associate the same character string (step S302). At this time, the extracted block is set to a level 0 block except for a special case described below.

【００４３】次に、抽出された小ブロックを複数個組み
合わせることによって、四辺形を構成するものを、より
大きなブロックとして抽出する（ステップＳ３０３）。
ブロックを結合させる度にブロックのレベルをあげて、
それぞれが親子関係を持つようにする。これによって、
ブロックに階層構造を作ることができる。抽出された大
小の全てのブロックに対して、ブロックが占める領域の
座標、ブロック内に存在する枠、文字列に関する情報を
計測する（ステップＳ３０４）。Next, by combining a plurality of the extracted small blocks, those forming a quadrilateral are extracted as larger blocks (step S303).
Each time you combine blocks, raise the level of the block,
Make each have a parent-child relationship. by this,
You can create a hierarchical structure for blocks. With respect to all the extracted large and small blocks, the coordinates of the area occupied by the blocks, information on the frames existing in the blocks, and the character strings are measured (step S304).

【００４４】最後に、これらのブロックの情報にブロッ
クのレベルと親子関係のリンクを付け加えてデータベー
スに登録する（ステップＳ３０５）。この際、同一の構
造を持つブロックが二重登録されないように、同一ブロ
ックの存在を枠の相対位置と文字列の内容によって検査
したうえで登録を行う。Finally, the information of these blocks is added to the link of the block level and the parent-child relationship and registered in the database (step S305). At this time, the existence of the same block is checked based on the relative position of the frame and the content of the character string so that the block having the same structure is not registered twice.

【００４５】ここで、図５を参照しながら、小ブロック
の抽出方法を具体例を用いて説明する。小ブロックは、
前述した通り、同じ文字列を関連付けされている枠同士
を結合することで抽出する。Here, a method of extracting a small block will be described using a specific example with reference to FIG. The small block is
As described above, the same character string is extracted by combining the associated frames.

【００４６】例えば図５（ａ）に示すように、枠５０１
に文字列５０３、枠５０２に文字列５０３が関連付けさ
れているものとする。このとき、領域５０４で示される
範囲をブロックとして抽出する。このブロックのレベル
は０である。For example, as shown in FIG.
Is associated with the character string 503, and the character string 503 is associated with the frame 502. At this time, the range indicated by the area 504 is extracted as a block. The level of this block is 0.

【００４７】また、図５（ｂ）に示すように、枠５０５
に対して文字列５０８，５１０が、枠５０６に対して文
字列５０９，５１０が、枠５０７に対して文字列５０
９，５１０がそれぞれ関連付けられているものとする。
全てを囲む領域５１１の内部には、領域５１２と領域５
１３の２つのブロックが存在すると考え、領域５１１を
レベル１のブロック、領域５１２と領域５１３をレベル
０のブロックとして抽出し、両者に親子関係を定義す
る。Further, as shown in FIG.
Character strings 508 and 510 for the frame, character strings 509 and 510 for the frame 506, and a character string 50 for the frame 507.
9 and 510 are associated with each other.
The region 512 and the region 5
Considering that there are two blocks 13, the area 511 is extracted as a level 1 block and the areas 512 and 513 are extracted as level 0 blocks, and a parent-child relationship is defined between them.

【００４８】次に、図６を参照しながら、ステップＳ２
１６における類似領域抽出処理の一例について説明す
る。まず、抽出された枠、文字列に関する構造情報を入
力し（ステップＳ４０１）、最小ブロックの抽出を行う
（ステップＳ４０２）。最小ブロックの抽出方法として
は、図５を用いて説明したレベル０の小ブロックを抽出
する方法をとるものとする。枠に関連付けられた文字列
の情報を用いて枠を統合することによりブロックを抽出
し、これを最小ブロックとする。Next, referring to FIG.
An example of the similar region extraction processing in No. 16 will be described. First, structural information relating to the extracted frame and character string is input (step S401), and a minimum block is extracted (step S402). As a method of extracting the minimum block, the method of extracting the small block of level 0 described with reference to FIG. 5 is taken. Blocks are extracted by integrating the frames using the information on the character strings associated with the frames, and the extracted blocks are defined as the minimum blocks.

【００４９】抽出された最小ブロックの全てに対して類
似領域が抽出されるように、ブロックの検索処理を繰り
返して行う（ステップＳ４０３）。類似領域の抽出は、
抽出された最小ブロックに対するレベル０のブロックの
検索処理から始める。データベースに登録されているレ
ベル０のブロックから同一ブロックを検出し、これを類
似ブロックとして抽出する（ステップＳ４０４）。同一
ブロックの判定は、ブロック内の枠の接続状態と文字列
の内容から判断する。Block search processing is repeatedly performed so that similar regions are extracted from all of the extracted minimum blocks (step S403). Extraction of similar regions
The process starts with a level 0 block search process for the extracted minimum block. The same block is detected from the level 0 blocks registered in the database, and is extracted as a similar block (step S404). The determination of the same block is made based on the connection state of the frame in the block and the content of the character string.

【００５０】もし、類似ブロックが抽出されなかった場
合は、次の最小ブロックに対してステップＳ４０３の処
理から行う（ステップＳ４０５）。類似ブロックが抽出
された場合は、ブロックの親子関係を利用して１つレベ
ルの高いブロックの情報を取り出す（ステップＳ４０
６）。If a similar block has not been extracted, the process from step S403 is performed on the next smallest block (step S405). If a similar block is extracted, the information of the block one level higher is extracted using the parent-child relationship of the block (step S40).
6).

【００５１】上位ブロックが検出できなかった場合（ス
テップＳ４０７でＮＯの場合）は、類似領域の抽出処理
を終了し、これまでに見つかった領域を類似領域とする
（ステップＳ４１０）。If the upper block has not been detected (NO in step S407), the extraction processing of the similar area is terminated, and the area found so far is set as a similar area (step S410).

【００５２】上位ブロックが検出された場合は（ステッ
プＳ４０７でＹＥＳの場合）、検出された上位ブロック
内に存在する枠、文字列を入力ドキュメントの枠、文字
列にそれぞれ対応付けを行って、対応付けの成功数から
類似度を計算する（ステップＳ４０８）。類似度は、例
えば、（対応付け成功数）／（枠と文字列の総数）で定
義する。類似度があるしきい値α（例えば、０．７）よ
りも大きい場合は、類似領域が見つかったとして、さら
にこれよりも大きい類似領域を抽出するために、ステッ
プＳ４０６に戻って繰り返し処理する（ステップＳ４０
９）。If an upper block is detected (YES in step S407), the frame and character string existing in the detected upper block are associated with the input document frame and character string, respectively. The similarity is calculated from the number of successful attachments (step S408). The similarity is defined by, for example, (number of successful associations) / (total number of frames and character strings). If the similarity is larger than a certain threshold value α (for example, 0.7), it is determined that a similar region has been found, and the process returns to step S406 and is repeatedly performed to extract a similar region larger than this. Step S40
9).

【００５３】類似度がしきい値αに至らなかった場合
は、これまでに見つかった領域を類似領域とする（ステ
ップＳ４１０）。類似領域が見つからなかった場合は、
現在対象の最小ブロックについては類似領域が抽出でき
なかったものとする。すなわち、類似のフォーマットを
持つドキュメントが登録されていなかったことを意味す
る。If the similarity does not reach the threshold value α, the area found so far is set as a similar area (step S410). If no similar area is found,
It is assumed that a similar area could not be extracted for the current target minimum block. That is, it means that a document having a similar format has not been registered.

【００５４】なお、上記では、類似領域が複数存在し得
る場合でも、最初に抽出されたものを類似領域として決
定しているが、その代わりに、データベースに登録され
ている構造情報から類似領域をすべて抽出し、そのうち
レベル数が最高かつ類似度が最高のものを類似領域とし
て決定するようにしても良い。In the above description, even when a plurality of similar regions may exist, the first extracted one is determined as the similar region. Instead, the similar region is determined from the structural information registered in the database. All of them may be extracted, and the one having the highest number of levels and the highest similarity may be determined as a similar area.

【００５５】次に、図７を参照しながらステップＳ２１
７における構造修正処理の一例について説明する。ま
ず、修正対象範囲に対応する類似領域内のプリミティブ
（枠、文字列、文字コード、記入領域など）の情報から
（ステップＳ４１１）、対応付けリストの作成を行う
（ステップＳ４１２）。Next, referring to FIG. 7, step S21 will be described.
7 will be described. First, an association list is created (step S412) from information on primitives (frames, character strings, character codes, entry areas, etc.) in the similar area corresponding to the correction target range (step S411).

【００５６】この対応付けリストにしたがって、修正対
象範囲の各プリミティブと類似領域内のプリミティブを
対応付ける（ステップＳ４１３）。対応付けができなか
ったプリミティブ、および対応付けが一対一でなかった
プリミティブを修正対象として、対応付けの結果とプリ
ミティブの種類に応じて、置換、削除、追加等の修正方
法を選択する（ステップＳ４１４）。According to this association list, each primitive in the correction target range is associated with a primitive in the similar area (step S413). A correction method such as replacement, deletion, or addition is selected according to the result of the association and the type of the primitive, with the primitives that could not be associated and the primitives that were not in one-to-one correspondence selected as correction targets (step S414). ).

【００５７】最後に、選択された修正方法にしたがって
データを修正する（ステップＳ４１５）。例えば、対応
付けができなかったプリミティブのうち、修正対象にの
み存在するプリミティブに関してはそのプリミティブの
データを削除し、類似領域にのみ存在するプリミティブ
に関しては、そのプリミティブのデータを追加する。ま
た、多対一で対応づいたプリミティブはデータを統合す
る修正を行い、一対多で対応づいたプリミティブはデー
タを分割する修正を行う。また、サイズに関するものは
均等割りなどの処置を施す。Finally, the data is corrected according to the selected correction method (step S415). For example, among the primitives that could not be associated, the primitive data that exists only in the correction target is deleted, and the primitive data that exists only in the similar area is added to the primitive data. In addition, primitives that correspond in many-to-one correspondence make corrections to integrate data, and primitives that correspond in one-to-many make corrections to divide data. In addition, for the size, measures such as equal division are applied.

【００５８】また、例えば、一対一で対応づいたプリミ
ティブの中で、文字コードなどに差異が認められるもの
は、文字コードを置換することによって変更を行う。こ
のとき、文字認識時の類似度を利用して、類似度が低い
ものに絞って修正を行うように修正に制限を設けること
ができる。For example, among primitives that are associated one-to-one, if a difference is found in the character code, the change is made by replacing the character code. At this time, by using the similarity at the time of character recognition, it is possible to limit the correction so that the correction is performed only on the one with the low similarity.

【００５９】以上のようにして、フォーマット認識の結
果を使ってデータベースから類似の領域を検索し、認識
結果と検索された類似領域を比較することによって、既
に設定されたフォーマット情報にしたがって自動修正を
行うことができる。As described above, the similar area is searched from the database using the format recognition result, and the recognition result is compared with the searched similar area, whereby the automatic correction is performed according to the already set format information. It can be carried out.

【００６０】次に、本発明の第２の実施形態に係るドキ
ュメント処理装置について説明する。本実施形態は、自
動修正を行うのに先だって、ユーザに修正案を提示し、
その確認入力を受けた後に自動修正するようにしたもの
である。Next, a document processing apparatus according to a second embodiment of the present invention will be described. This embodiment presents a correction plan to the user before performing the automatic correction,
The automatic correction is made after receiving the confirmation input.

【００６１】本実施形態のドキュメント処理装置の構成
は、「構造登録部」に関する部分以外は第１の実施形態
と同様であるので、それらの詳細な説明は省略し、ここ
では第１の実施形態と相違する「構造登録部」に関する
部分を主として説明する。The configuration of the document processing apparatus of this embodiment is the same as that of the first embodiment except for the part relating to the "structure registration unit", and therefore, detailed description thereof is omitted, and the first embodiment is described here. The part related to the “structure registration unit”, which is different from the above, will be mainly described.

【００６２】本実施形態のドキュメント処理装置の構成
は、基本的には図１と同様であるが、構造登録部の内部
構成に付加する部分が存在する。また、第１の実施形態
で説明した構造修正処理以外の各処理についても同様で
ある。例えば、図３、図４、図６の各フローチャートを
用いることができる。The configuration of the document processing apparatus of this embodiment is basically the same as that of FIG. 1, but there is a portion added to the internal configuration of the structure registration unit. The same applies to each processing other than the structure modification processing described in the first embodiment. For example, the flowcharts of FIGS. 3, 4, and 6 can be used.

【００６３】図８に、本実施形態に係るドキュメント処
理装置の構造登録部１０２の内部構成例を示す。この構
造登録部１０２は、画像処理部２０１、図形処理部２０
２、文字認識部２０３、構造解析部２０４、類似領域抽
出部２０５、構造置換部２０６、構造修正部２０７、構
造表示部２０８、修正確認部２０９を備えている。すな
わち、第１の実施形態における構造登録部に構造表示部
２０８と修正確認部２０９を付加した構成になってい
る。FIG. 8 shows an example of the internal configuration of the structure registration unit 102 of the document processing apparatus according to this embodiment. The structure registration unit 102 includes an image processing unit 201, a graphic processing unit 20
2, a character recognition unit 203, a structure analysis unit 204, a similar area extraction unit 205, a structure replacement unit 206, a structure correction unit 207, a structure display unit 208, and a correction confirmation unit 209. That is, the configuration is such that the structure display unit 208 and the correction confirmation unit 209 are added to the structure registration unit in the first embodiment.

【００６４】画像処理部２０１は、画像入力部１０１に
よって入力されたドキュメントの画像から線分の抽出、
文字領域の抽出を行う。図形処理部２０２は、抽出され
た線分を分類し、枠の抽出、文字領域の抽出を行う。The image processing unit 201 extracts line segments from a document image input by the image input unit 101,
Extract the character area. The graphic processing unit 202 classifies the extracted line segment, extracts a frame, and extracts a character area.

【００６５】文字認識部２０３は、抽出された文字領域
内の文字画像を文字コードに変換する。構造解析部２０
４は、抽出された枠の詳細情報を獲得する。The character recognition section 203 converts the character image in the extracted character area into a character code. Structural analysis unit 20
4 acquires the detailed information of the extracted frame.

【００６６】類似領域抽出部２０５は、構造解析部２０
４によって抽出された枠情報に基づいて構造格納部１０
３に格納されている既登録ドキュメントから類似の領域
を抽出する。The similar area extracting unit 205 is configured to
4, the structure storage unit 10 based on the frame information extracted by
3. A similar area is extracted from the registered document stored in No.3.

【００６７】構造置換部２０６は、類似領域抽出部２０
５によって抽出された類似領域が持つ構造と該当する領
域の構造解析部２０４によって抽出された構造との対応
付けを行うことによって差異を抽出し、（ユーザによる
自動修正をすべき旨の入力を受けた後に）差異の存在す
る部分を置換処理等によって修正を行う。The structure replacing section 206 is similar to the similar area extracting section 20.
The difference is extracted by associating the structure of the similar region extracted by step 5 with the structure of the corresponding region extracted by the structure analysis unit 204, and the difference is extracted. After that, the part where the difference exists is corrected by a replacement process or the like.

【００６８】構造修正部２０７は、ユーザによるマニュ
アルでの構造修正のための指示入力（追加、削除、置換
等）を受け付け、この指示入力に基づいて修正を行う。
構造表示部２０８は、設定された構造、自動修正対称箇
所の構造、自動修正された構造あるいは自動修正案を表
示する。The structure correction unit 207 receives a manual instruction input (addition, deletion, replacement, etc.) for the structure correction by the user, and performs correction based on the instruction input.
The structure display unit 208 displays the set structure, the structure of the automatically corrected symmetric portion, the automatically corrected structure, or the automatic correction plan.

【００６９】修正確認部２０９は、ユーザによる自動修
正の確認の入力あるいはキャンセル等、自動修正に関す
る指示入力を受け付ける。次に、図９を参照しながらス
テップＳ２１７における構造修正処理について説明す
る。The correction confirming unit 209 accepts an instruction input relating to automatic correction, such as an input of confirmation of automatic correction or cancellation by the user. Next, the structure correcting process in step S217 will be described with reference to FIG.

【００７０】図９は、本実施形態のドキュメント処理装
置における構造修正処理の一例を示すフローチャートで
ある。修正対象範囲に対応する類似領域内のプリミティ
ブ（枠、文字列、文字コード、記入領域など）の情報か
ら（ステップＳ４１１）、対応付けリストの作成を行う
（ステップＳ４１２）。FIG. 9 is a flowchart showing an example of the structure correcting process in the document processing apparatus according to the present embodiment. A correspondence list is created (step S412) from information on primitives (frames, character strings, character codes, entry areas, etc.) in the similar area corresponding to the correction target range (step S411).

【００７１】この対応付けリストにしたがって、修正対
象範囲の各プリミティブと類似領域内のプリミティブを
対応付ける（ステップＳ４１３）。前述のように、対応
付けができなかったプリミティブおよび対応付けが一対
一でなかったプリミティブを修正対象として、対応付け
の結果とプリミティブの種類に応じて、置換、削除、追
加等の修正方法を選択する（ステップＳ４１４）。According to this association list, each primitive in the correction target range is associated with a primitive in the similar area (step S413). As described above, for the primitives that could not be mapped and the primitives for which the mapping was not one-to-one, correction methods such as replacement, deletion, and addition were selected according to the mapping result and the type of primitive. (Step S414).

【００７２】次に、本実施形態では、この修正方法にし
たがって修正を施したものをワークエリアに生成し、構
造表示部２０８に表示する（ステップＳ４１６）。ユー
ザに対して表示された修正フォーマットが有効であるか
の確認を修正確認部２０９において行った上で（ステッ
プＳ４１７）、もし修正を行う確認［ＯＫ］がとれた場
合は（ステップＳ４１８）、実際のデータに対して修正
処理を行う。Next, in the present embodiment, an object modified according to this modification method is generated in the work area and displayed on the structure display unit 208 (step S416). After confirming whether the modification format displayed to the user is valid or not in the modification confirmation unit 209 (step S417), if confirmation [OK] for modification is obtained (step S418), the actual Correction processing is performed on the data of.

【００７３】もし修正がキャンセルされた場合は（ステ
ップＳ４１８）、修正を行わず次の類似領域について同
様の処理を続ける。修正処理の具体例は、図７で説明し
た通りである。If the correction has been canceled (step S418), the same processing is continued for the next similar area without performing the correction. A specific example of the correction processing is as described with reference to FIG.

【００７４】なお、上記の手順では、類似領域が複数存
在し得る場合でも、最初に抽出されたものを類似領域と
して修正案を提示しているが、その代わりに、データベ
ースに登録されている構造情報から類似領域をすべて抽
出し、そのうちレベル数が最高かつ類似度が最高のもの
を類似領域として修正案を作成し、これを提示するよう
にしても良い。In the above procedure, even when a plurality of similar regions may exist, the first extracted one is presented as a similar region, and a correction plan is presented. It is also possible to extract all similar regions from the information, create a correction plan with the highest number of levels and the highest similarity as a similar region, and present it.

【００７５】また、類似領域が複数存在する場合、いく
つかの修正案を提示するようにしても良い。この場合、
レベル数が最高かつ類似度が最高の類似領域から順に幾
つかを選んで夫々修正案を作成し、これらを順次提示す
るようにしても良い。When there are a plurality of similar regions, some correction plans may be presented. in this case,
Some of the similar areas having the highest number of levels and the highest similarity may be selected in order from the similar area, and correction plans may be respectively created, and these may be sequentially presented.

【００７６】以下では、上記処理の具体例をいくつか説
明する。図１０は、構造表示部２０８に表示されるフォ
ーマット修正例の画面の一例である。なお、図中の６０
０は画面全体を、６０１は画面中に表示された表示ウィ
ンドウを示す。この例では、認識した構造情報に類似領
域との差異が見つかった場合に表示ウィンドウ６０１を
発生させ、認識した構造情報を認識結果６０２に、この
部分に対する置換処理等を施した修正例を修正結果６０
３に夫々表示させ、ユーザはボタン６０４，６０５によ
ってこの部分の修正の良否を選択することができる。修
正がＯＫである場合はボタン６０４、ＮＧである場合は
ボタン６０５をクリックすることによって、実際のデー
タへの修正処理を制御することができる。Hereinafter, some specific examples of the above processing will be described. FIG. 10 is an example of a format correction example screen displayed on the structure display unit 208. Note that 60 in FIG.
0 indicates the entire screen, and 601 indicates a display window displayed on the screen. In this example, when a difference from the similar region is found in the recognized structure information, a display window 601 is generated, and the corrected result in which the recognized structure information is subjected to a replacement process or the like on the recognition result 602 is obtained by modifying the correction result. 60
3 are displayed, and the user can select whether to correct this portion by using buttons 604 and 605. By clicking the button 604 when the correction is OK and clicking the button 605 when the correction is NG, the correction processing to the actual data can be controlled.

【００７７】図１１は、フォーマット修正案を複数用意
した場合の構造表示部２０８に表示されるフォーマット
修正確認画面（表示ウィンドウ）の一例である。フォー
マット修正案を複数用意することによって、ユーザは複
数の修正案から選択して修正を行うことができる。例え
ば、図１１（ｂ）のような類似領域抽出部２０５におい
て抽出された類似領域が持つフォーマット情報６１１に
対して、構造修正部２０６において認識結果６０２との
差異を抽出し、差異が複数抽出された場合には、それぞ
れを修正するか否かで複数の修正案を作成することがで
きる。FIG. 11 is an example of a format correction confirmation screen (display window) displayed on the structure display unit 208 when a plurality of format correction plans are prepared. By preparing a plurality of format amendments, the user can select and amend the plurality of amendments. For example, with respect to the format information 611 of the similar area extracted by the similar area extracting unit 205 as shown in FIG. 11B, a difference from the recognition result 602 is extracted by the structure correcting unit 206, and a plurality of differences are extracted. In such a case, a plurality of correction plans can be created depending on whether or not each is to be corrected.

【００７８】まず、図１１（ａ）のように、表示ウィン
ドウ６１２には、最初の修正案６１３が表示される。別
の修正案を表示させたい場合は、ボタン６１４あるいは
ボタン６１５をクリックする。ボタン６１４は、以前に
表示された修正案の前候補の表示を行う。ボタン６１５
は、修正案の次候補の表示を行う。もし、現在表示され
ている修正案でＯＫである場合は、ＯＫボタン６０４を
クリックすることによって、現表示の修正案による実際
のデータへの修正処理を行う。全ての修正案による修正
がＮＧである場合は、ボタン６０５をクリックすること
によって、その箇所の修正をキャンセルすることができ
る。前候補ボタン６１４、次候補ボタン６１５が押され
た場合は、ウィンドウ６１２は修正案６１３の表示が別
の修正案６１７に切り替わる。First, as shown in FIG. 11A, a first modification 613 is displayed in the display window 612. To display another correction plan, the user clicks the button 614 or the button 615. A button 614 displays a previous candidate for a previously displayed revision. Button 615
Displays the next candidate of the amendment proposal. If the currently proposed amendment is OK, the user clicks an OK button 604 to perform an amendment process to the actual data using the amendment currently displayed. If all the corrections are NG, the user can click the button 605 to cancel the correction at that point. When the previous candidate button 614 and the next candidate button 615 are pressed, the display of the modification plan 613 is switched to another modification plan 617 in the window 612.

【００７９】この例では、認識結果６０２と抽出された
類似領域６１１との差異として、文字コード「受注伝
票」、文字コード「コート」、枠（コートの右）の３個
所抽出され、３個の差異の組み合わせで修正案を作成す
ることができる。In this example, as a difference between the recognition result 602 and the extracted similar area 611, three places of a character code “order slip”, a character code “coat”, and a frame (to the right of the coat) are extracted. An amendment can be created by combining the differences.

【００８０】図１２は、フォーマット修正箇所を表示し
た場合の構造表示部２０８に表示されるフォーマット修
正確認画面（表示ウィンドウ）の一例である。（ａ）の
画面６１２は認識されたフォーマット構造が表示された
状態であり、修正箇所が抽出された時点で、（ｂ）のよ
うに修正対象箇所となる位置のプリミティブの表示が強
調される。強調の方法としては、例えば、色表示した
り、あるいはブリンク表示したりする。FIG. 12 shows an example of a format correction confirmation screen (display window) displayed on the structure display section 208 when a format correction portion is displayed. The screen 612 of (a) shows a state in which the recognized format structure is displayed, and when the correction portion is extracted, the display of the primitive at the position to be corrected as shown in (b) is highlighted. As a method of emphasis, for example, color display or blink display is performed.

【００８１】また、修正案のイメージを作成し、認識結
果と修正案のイメージを交互に表示させることも可能で
ある。この場合、認識結果を黒常時、修正案を赤色表示
させて現在表示されているイメージがどちらのものであ
るかを分かりやすくすることができる。It is also possible to create an image of a correction plan and display the recognition result and the image of the correction plan alternately. In this case, the recognition result is always displayed in black and the correction plan is displayed in red, so that it is easy to understand which image is currently displayed.

【００８２】また、複数の修正対象箇所が存在する場合
に、修正すべき箇所を、例えばユーザが画面上の該当箇
所をマウスでクリックするなどして選択指示するように
しても良い。図１２の例で、文字コード「受注伝票」は
選択されず、文字コード「コート」と枠（コートの右）
が選択された場合、（ｃ）のような修正結果が得られ
る。When there are a plurality of portions to be corrected, a portion to be corrected may be selected and instructed by, for example, a user clicking a corresponding portion on the screen with a mouse. In the example of FIG. 12, the character code "order slip" is not selected, and the character code "coat" and a frame (right of the coat) are selected.
Is selected, a correction result as shown in (c) is obtained.

【００８３】以上のようにして、フォーマット認識の結
果を使ってデータベースから類似の領域を検索し、認識
結果と検索された類似領域を比較することによって、そ
の修正方法が正しいものかを、画面に表示されたものを
確認することによって、スムーズな修正作業を行うこと
ができる。本発明は、上述した実施の形態に限定される
ものではなく、その技術的範囲において種々変形して実
施することができる。As described above, the similar area is searched from the database using the format recognition result, and the recognition result is compared with the searched similar area to determine whether the correction method is correct. By checking what is displayed, a smooth correction operation can be performed. The present invention is not limited to the above-described embodiment, and can be implemented with various modifications within the technical scope.

【００８４】[0084]

【発明の効果】本発明によれば、ドキュメントのフォー
マット情報を登録する場合において、既登録のドキュメ
ントのフォーマット情報を辞書として利用することによ
り、フォーマットの認識処理における認識誤りを自動的
に修正することが可能であり、間違った箇所をユーザが
修正するための作業量を削減することができる。According to the present invention, when registering the format information of a document, the recognition error in the format recognition processing is automatically corrected by using the format information of the registered document as a dictionary. It is possible to reduce the amount of work required for the user to correct a wrong part.

【００８５】また、類似領域抽出による修正箇所を呈示
し、修正処理が正しいものかをユーザによって確認させ
ることにより、ユーザの簡単な手続で正確に修正処理を
行うことが可能となり、全体的なユーザの修正作業を軽
減することが可能となる。Further, by presenting the corrected portion by the similar region extraction and allowing the user to confirm whether or not the corrected process is correct, the corrected process can be accurately performed by a simple procedure of the user, and the entire user can be corrected. Correction work can be reduced.

[Brief description of the drawings]

【図１】本発明の一実施形態に係るドキュメント管理装
置の構成を示す図FIG. 1 is a diagram showing a configuration of a document management apparatus according to an embodiment of the present invention.

【図２】同実施の形態に係るドキュメント管理装置の構
造登録部の構成の一例を示す図FIG. 2 is a view showing an example of a configuration of a structure registration unit of the document management apparatus according to the embodiment;

【図３】フォーマット登録処理の手順を示すフローチャ
ートFIG. 3 is a flowchart showing a procedure of a format registration process;

【図４】ブロック登録処理の手順を示すフローチャートFIG. 4 is a flowchart showing a procedure of a block registration process;

【図５】小ブロック抽出方法を説明するための図FIG. 5 is a diagram for explaining a small block extraction method;

【図６】類似領域抽出処理の手順を示すフローチャートFIG. 6 is a flowchart illustrating a procedure of a similar region extraction process.

【図７】構造修正処理の手順を示すフローチャートFIG. 7 is a flowchart showing the procedure of a structure modification process;

【図８】同実施の形態に係るドキュメント管理装置の構
造登録部の構成の他の例を示す図FIG. 8 is a diagram showing another example of the configuration of the structure registration unit of the document management device according to the embodiment;

【図９】修正確認を採り入れた構造修正処理の手順を示
すフローチャートFIG. 9 is a flowchart showing a procedure of a structure correction process adopting a correction confirmation;

【図１０】フォーマット修正確認のための表示画面の一
例を示す図FIG. 10 is a diagram showing an example of a display screen for confirming format correction.

【図１１】フォーマット修正確認のための表示画面の一
例を示す図FIG. 11 is a diagram showing an example of a display screen for confirming format correction.

【図１２】フォーマット修正箇所の表示画面の一例を示
す図FIG. 12 is a diagram showing an example of a display screen of a format correction location.

[Explanation of symbols]

１０１…画像入力部１０２…構造登録部１０３…構造格納部１０４…手続設定部１０５…構造識別部１０６…手続処理部１０７…データ出力部２０１…画像処理部２０２…図形処理部２０３…文字認識部２０４…構造解析部２０５…類似領域抽出部２０６…構造置換部２０７…構造修正部２０８…構造表示部２０９…修正確認部 Reference Signs List 101 image input unit 102 structure registration unit 103 structure storage unit 104 procedure setting unit 105 structure identification unit 106 procedure processing unit 107 data output unit 201 image processing unit 202 graphic processing unit 203 character recognition unit 204: structure analysis unit 205: similar area extraction unit 206: structure replacement unit 207: structure correction unit 208: structure display unit 209: correction confirmation unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者石井利明東京都千代田区内幸町１丁目１番３号東京電力株式会社内 (72)発明者山本為雄東京都千代田区内幸町１丁目１番３号東京電力株式会社内 ──────────────────────────────────────────────────続き Continued on the front page (72) Inventor Toshiaki Ishii 1-3-1 Uchisaiwaicho, Chiyoda-ku, Tokyo Tokyo Electric Power Company (72) Inventor Takuo Yamamoto 1-3-1 Uchisaiwaicho, Chiyoda-ku, Tokyo Tokyo Electric Power Company

Claims

[Claims]

1. A document processing apparatus for performing predetermined processing by inputting an image of a document, comprising: a structure storage unit configured to store structure information of the document; and a structure information setting unit configured to set the structure information of the input document input by the image. A similar area detecting means for searching an area similar to the set structure information of the input document from a document stored in the structure storage means; and a structure information of the input document and a structure information of the searched similar area. And a structure correcting means for correcting the structure information of the input document based on the difference between the two.

2. The structure correcting means sets, as a corrected part, a part of the structure information of the input document where a difference exists between the structure information of the input document and the structure information of the searched similar area. A modification portion setting unit; and a structure replacement unit that performs modification by replacing the structure information corresponding to the modification portion set by the modification region setting unit with the structure information of the similar region. The document processing device according to claim 1.

3. The structure correcting means includes: a correcting part presenting means for presenting the correcting part set by the correcting part setting means; 3. The document processing apparatus according to claim 2, further comprising: when receiving a confirmation input from the modification confirming unit that the modification is to be performed, the structure replacing unit performs modification.

4. The modified structure creating means for creating modified structure information as a modified plan using the structural information of the similar area as a modified plan, and presenting the structured information as the modified modified plan. Correction structure display means, and correction confirmation means for accepting a confirmation input of the correctness of the modification with respect to the displayed structural information as a modification plan, and receiving a confirmation input to the effect that the modification confirmation means requires modification. 2. The document processing apparatus according to claim 1, wherein correction is performed in the case.

5. The document processing apparatus according to claim 1, further comprising block extracting means for dividing the document into blocks, and wherein the similar area detecting means searches for similar areas in block units.

6. A data entry area extracting means for extracting a data entry area to be handwritten from an input document image, a character recognizing means for extracting a character string from an input document image, and A data entry area, and associated character string extraction means for associating a character string describing the data entry area among the extracted character strings, the block extraction means further comprising: The document processing apparatus according to claim 5, wherein the block is extracted by integrating entry areas.