JP4881722B2

JP4881722B2 - Form identification device and form identification program

Info

Publication number: JP4881722B2
Application number: JP2006354585A
Authority: JP
Inventors: 健永崎; 勝美丸川; 広新庄; 和樹中島; 憲志柴田
Original assignee: Hitachi Computer Peripherals Co Ltd
Current assignee: Hitachi Information and Telecommunication Engineering Ltd
Priority date: 2006-12-28
Filing date: 2006-12-28
Publication date: 2012-02-22
Anticipated expiration: 2026-12-28
Also published as: JP2008165506A

Description

本発明は、帳票識別処理において識別した帳票の種類を視覚的に表示する帳票識別技術に関する。 The present invention relates to a form identification technique for visually displaying the type of form identified in a form identification process.

一般に、帳票の自動識別を行う場合、帳票内の特徴量を用いて帳票の種類を識別することを行う。例えば、帳票内に記載されている文字コード、文字行、罫線、枠等を抽出し、それら特徴量と、事前にサンプルとして登録した帳票の特徴量とを照合して、帳票の種類を識別する。例えば、特許文献１に開示された技術は、帳票の特徴データを帳票毎に辞書として登録しておき、識別対象となる帳票画像が入力された場合、入力画像より罫線２値画像を作成し、予め登録しておいた辞書とでマッチングを実施する方法である。辞書には、罫線から得られる黒画素の位置などが、二次元座標の集合として登録される。また、特許文献２に開示された技術では識別対象となる入力帳票の特徴に点座標を付与した点（以下、入力点）と、辞書作成のために予め登録をした登録帳票の特徴に点座標を付与した点（以下、辞書点）とを設けて、前記入力点と辞書点の距離を算出し、前記距離がある距離基準内であれば入力点に対応付けられた辞書点が存在すると判定し、対応付けられた点数、入力点数及び辞書点数を基に照合の一致の程度を求める方法である。この手法では、入力点または辞書点として、罫線の端点や交点、並びに上下左右を罫線で囲まれる最小の領域（セル）の中心座標などを特徴点として用いる。
このように識別対象となる帳票に対して様々な識別手法が存在する。ここで問題となるのが帳票識別の結果を表示装置に提示する方法である。帳票種類の名称を表示することが一般的であるが、扱う帳票が多種に渡る場合、名称だけで当該識別結果が正しいかどうかを判断することは難しい。 In general, when performing automatic identification of a form, the type of form is identified using the feature amount in the form. For example, character codes, character lines, ruled lines, frames, etc. described in the form are extracted, and the feature type is compared with the feature quantity of the form registered in advance as a sample to identify the type of form . For example, the technique disclosed in Patent Literature 1 registers feature data of a form as a dictionary for each form, and creates a ruled line binary image from an input image when a form image to be identified is input, This is a method of performing matching with a dictionary registered in advance. In the dictionary, the positions of black pixels obtained from ruled lines and the like are registered as a set of two-dimensional coordinates. In the technique disclosed in Patent Literature 2, point coordinates are added to the features of the input form to be identified (hereinafter referred to as input points), and the features of the registered form registered in advance for creating the dictionary are point coordinates. And a distance between the input point and the dictionary point is calculated. If the distance is within a certain distance criterion, it is determined that a dictionary point associated with the input point exists. In this method, the degree of matching is obtained based on the associated score, input score, and dictionary score. In this method, as input points or dictionary points, the end points and intersections of ruled lines, and the center coordinates of the smallest area (cell) surrounded by the ruled lines are used as feature points.
As described above, there are various identification methods for the forms to be identified. The problem here is how to present the result of form identification on the display device. It is common to display the name of the form type. However, when there are various forms to be handled, it is difficult to determine whether the identification result is correct only by the name.

特開平１０−２２２５８７号公報Japanese Patent Laid-Open No. 10-222587 特開２００２−３２４２３６号公報JP 2002-324236 A

従来の帳票識別処理においては、識別の結果として得た帳票種類の名称を提示するだけで、その識別結果が正しいかどうかを区別することが困難となる課題があった。 In the conventional form identification processing, there is a problem that it is difficult to distinguish whether the identification result is correct only by presenting the name of the form type obtained as a result of identification.

本発明は、多種類の帳票を取り扱う場合に、帳票識別結果を視覚的に提示すると共に、視覚的表示の際に、帳票画像にずれが存在する場合や類似した帳票が混在するような場合でも、差異ある個所を強調して表示する帳票識別装置、及びそのプログラムを提供することを目的とする。更に、本発明は、辞書に登録された見本帳票画像あるいは入力帳票画像に記載された個人情報等を保護した表示が可能な帳票識別装置を提供することを目的とする。 The present invention visually presents a form identification result when handling various types of forms, and even when there are deviations in the form image or when similar forms are mixed in the visual display. It is an object of the present invention to provide a form identification device that highlights and displays differences and a program thereof. It is another object of the present invention to provide a form identification device that can display a sample form image registered in a dictionary or protect personal information described in an input form image.

本発明は、上記目的を達成するため、画像撮像部と処理部と記憶部と表示部とを備え、画像撮像部は帳票を撮像することによって、当該帳票の画像データを取得し、処理部は、記憶部に蓄えられた辞書を元に帳票画像の種類を識別し、当該帳票識別の結果を表示する表示データを表示部に提示する帳票識別装置とそのプログラムを、帳票識別結果を表示する際に、入力帳票画像と、帳票識別処理の結果によって得られた最も一致した特徴を持つ帳票種類の見本帳票画像を重ね合わせて提示する構成とする。 In order to achieve the above object, the present invention includes an image capturing unit, a processing unit, a storage unit, and a display unit. The image capturing unit captures a form to acquire image data of the form, and the processing unit When identifying the form image type based on the dictionary stored in the storage unit and displaying the form identification result and the program for displaying the form identification result on the display unit and displaying the form identification result on the display unit In addition, the input form image and the sample form image of the form type having the most matched characteristic obtained by the result of the form identification process are displayed in a superimposed manner.

また、入力帳票画像と見本帳票画像間でずれがあることを検知し、ずれが存在すると判断した場合は、帳票特徴点マッピングを用いてずれを吸収して表示する帳票識別装置とそのプログラムを提供する。 In addition, a form identification device and a program for detecting and detecting a shift between the input form image and the sample form image and absorbing the shift using the form feature point mapping are provided. To do.

更に、本発明においては、差異のある個所を強調表示する、あるいは個人情報が記載されうる箇所にマスクを掛けて表示する等の処理を行う帳票識別装置とそのプログラムを提供する。 Furthermore, the present invention provides a form identification apparatus and program for performing processing such as highlighting a portion having a difference or displaying a portion where personal information can be written with a mask.

本発明により、識別した帳票の種類に応じて、帳票識別結果の表示を入力画像と見本画像を重ね合わせて行うことができる。更に、帳票画像間でずれが存在する場合にも、帳票特徴点マッピングを用いて、位置ずれを吸収して表示し、更に差異のある個所を強調表示する等の表示処理が可能となるため、帳票の振分け処理の効率化が見込める。 According to the present invention, the form identification result can be displayed by superimposing the input image and the sample image in accordance with the identified form type. Furthermore, even when there is a deviation between the form images, it is possible to perform display processing such as absorbing and displaying the position deviation by using the form feature point mapping, and further highlighting the difference. The efficiency of form distribution processing can be expected.

以下、本発明の実施の形態を図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

初めに、本実施例の前提とするハードウェア構成について述べる。図１は、帳票読取装置の一実施例を示すハードウェア構成図である。高速帳票読取装置では、画像撮像部である画像撮像装置０１０１により紙文書を電子データに変換し、それを記憶部である外部記憶装置０１０５及びメモリ０１０６に蓄えて、中央処理部（ＣＰＵ）である中央演算装置０１０７により読取を行う。本実施例に係わる帳票識別・表示プログラムは、外部記憶装置０１０５またはメモリ０１０６に蓄えられているか、場合によっては、通信装置０１０９を介して帳票読取装置に導入されこれら記憶部に記憶される。 First, the hardware configuration assumed in this embodiment will be described. FIG. 1 is a hardware configuration diagram illustrating an embodiment of a form reading apparatus. In the high-speed form reading device, a paper document is converted into electronic data by an image pickup device 0101 which is an image pickup unit, which is stored in an external storage device 0105 and a memory 0106 which are storage units, and is a central processing unit (CPU). Reading is performed by the central processing unit 0107. The form identification / display program according to the present embodiment is stored in the external storage device 0105 or the memory 0106, or is introduced into the form reading device via the communication device 0109 and stored in these storage units depending on circumstances.

この帳票識別・表示プログラムは、撮像された電子データ画像に対して、中央演算装置０１０７が帳票識別処理や、識別結果の表示制御を行う。帳票の種類を特定する場合は、記憶部である外部記憶装置０１０５またはメモリ０１０６に蓄えられた帳票辞書を参照し、帳票種類を判別する。これらの処理は、操作端末装置０１０２を通して操作者である人間が操作可能であり、処理結果は表示端末装置０１０３を通して表示される。処理結果などの情報は、必要に応じて外部記憶装置０１０５に蓄積または通信装置０１０９を通して外部装置にデータとして送信される。上記装置群は内部バス０１０８によって繋がっている。帳票識別結果を反映して、入力された伝票類はソータ装置０１０４によって、定義された箱に分配・集積される。言い換えるなら、画像撮像装置０１０１とソータ装置０１０４を除けば、通常のパーソナルコンピュータ（ＰＣ）などのコンピュータシステムで構成されうる。なお通常のＰＣにおいては、表示制御用のチップをＣＰＵとは別に有している場合もあるが、本実施例においても、グラフィックボードを別構成とし、表示制御を分担しても良いことは言うまでもなく、どちらの構成であっても、帳票識別と表示用のプログラムを実行する構成を処理部と呼ぶことがある。 In this form identification / display program, the central processing unit 0107 performs form identification processing and identification result display control for the captured electronic data image. When specifying the form type, the form type is determined by referring to the form dictionary stored in the external storage device 0105 or the memory 0106 as a storage unit. These processes can be operated by a human operator through the operation terminal device 0102, and the processing results are displayed through the display terminal device 0103. Information such as processing results is stored in the external storage device 0105 or transmitted as data to the external device through the communication device 0109 as necessary. The above devices are connected by an internal bus 0108. Reflecting the form identification result, the entered slips are distributed and collected by the sorter device 0104 in a defined box. In other words, except for the image capturing device 0101 and the sorter device 0104, the image capturing device 0101 and the sorter device 0104 can be configured by a computer system such as a normal personal computer (PC). Although a normal PC may have a display control chip separately from the CPU, it goes without saying that in this embodiment, the graphic board may be configured separately to share the display control. Regardless of the configuration, the configuration for executing the form identification and display program may be referred to as a processing unit.

さて、図２は、上述した帳票読取装置における帳票読取処理、すなわち帳票識別から振分けに至る処理の機能ブロック図を表す。初めに、画像撮像装置０１０１からなる画像撮像部０２０１で、入力の紙伝票が撮像され、電子（データ）画像となる。この電子画像は通常は外部記憶装置０１０５やメモリ０１０６などの記憶部に一旦記憶され、以降の帳票識別処理は、この撮像された電子画像を元に行う。続いて、帳票識別部０２０２において帳票識別を行う。帳票識別を行う場合は、帳票を識別するための特徴を辞書に蓄えた帳票識別特徴辞書０２０５を用いる。識別された帳票種類は、識別結果表示部０２０３によって表示端末装置０１０３上に視覚的に表示されるよう制御される。この表示制御によって表示される画像としては、見本帳票画像辞書０２０６に蓄えられた画像、または帳票識別特徴辞書０２０５から計算された画像が使われる。最後に、確定した帳票識別結果に従って、帳票振分辞書０２０７に定義された振分定義に応じて、ソータ装置０１０４からなるソータ部０２０４の該当スタックに識別対象であった帳票が搬送される。 FIG. 2 is a functional block diagram of the form reading process in the form reading apparatus described above, that is, the process from form identification to distribution. First, an input paper slip is picked up by an image pickup unit 0201 including the image pickup device 0101 to be an electronic (data) image. This electronic image is normally temporarily stored in a storage unit such as the external storage device 0105 or the memory 0106, and subsequent form identification processing is performed based on the captured electronic image. Subsequently, the form identification unit 0202 performs form identification. When form identification is performed, a form identification feature dictionary 0205 in which features for identifying a form are stored in a dictionary is used. The identified form type is controlled to be visually displayed on the display terminal device 0103 by the identification result display unit 0203. As an image displayed by this display control, an image stored in the sample form image dictionary 0206 or an image calculated from the form identification feature dictionary 0205 is used. Finally, according to the confirmed form identification result, according to the distribution definition defined in the form distribution dictionary 0207, the form that is the identification target is conveyed to the corresponding stack of the sorter unit 0204 including the sorter device 0104.

ここで、帳票識別部０２０２と識別結果表示部０２０３は，処理部である中央演算装置０１０７などが実行するプログラムによるソフトエア処理によって構成されることは、上述の通りである。次に、まず帳票識別処理部０２０２と帳票識別特徴辞書０２０５についてその概略を述べる。 Here, the form identification unit 0202 and the identification result display unit 0203 are configured by software processing by a program executed by the central processing unit 0107, which is a processing unit, as described above. Next, the outline of the form identification processing unit 0202 and the form identification feature dictionary 0205 will be described first.

図３は、処理部である中央演算装置０１０７で実行される本実施例の帳票識別部０２０２における帳票の識別過程を示すフロー図である。判別対象となる入力帳票画像の電子画像は当該過程を経て識別が行われる。先ず、判別対象となる帳票の電子画像は、画像入力部である画像撮像装置０１０１より取得されており、これを直接、或いは記憶部に一旦記憶してから入力画像とする（０３０１）。次に、この入力画像より特徴量の抽出範囲を設定するために、帳票用紙の四隅を検出する紙領域基準抽出を行う（０３０２）。検出した帳票用紙の四隅の座標より用紙サイズの算出を行い、判別対象となる帳票の用紙サイズを判定する（０３０３）。入力画像より上述の過程で決定した用紙サイズ内より入力画像の特徴量を抽出し、帳票の学習サンプル画像より作成した帳票識別特徴辞書０２０５を用いて特徴量の一次照合を行う（０３０４）。 FIG. 3 is a flowchart showing a form identifying process in the form identifying unit 0202 of the present embodiment, which is executed by the central processing unit 0107 as a processing unit. The electronic image of the input form image to be discriminated is identified through this process. First, an electronic image of a form to be discriminated is acquired from an image capturing device 0101 which is an image input unit, which is directly or directly stored in a storage unit and used as an input image (0301). Next, in order to set the feature value extraction range from the input image, paper area reference extraction for detecting the four corners of the form paper is performed (0302). The paper size is calculated from the coordinates of the four corners of the detected form paper, and the paper size of the form to be determined is determined (0303). The feature quantity of the input image is extracted from the paper size determined in the above process from the input image, and the feature quantity is firstly collated using the form identification feature dictionary 0205 created from the form learning sample image (0304).

ここでの特徴量とは、判別対象となる帳票から抽出した入力画像において、上述した辞書作成の際の特徴量同様に、例えば、画像の輝度が顕著に表れる箇所のことを指し、入力画像から格子状に特徴点を抽出して使用する。そして、入力画像より抽出した特徴量と、サンプル画像より作成した辞書とを比較し、両者の特徴量の距離計算値がある距離基準値内である場合、入力画像とサンプル画像とが一致、符合する（ＹＥＳ）と判定し、処理を終了する（０３０５）。距離計算手法としては，ユークリッド距離や、動的記入マスク距離を用いる。入力帳票が回転しているケースに対応する場合は、帳票を１８０度または９０度ずつ回転させた上で特徴量を生成し、これとの距離計算を行う。この特徴量と、辞書との距離計算値が上記の距離基準値を上まわっている場合、入力画像より帳票用紙内に記載されている領域（記載領域という）を検出する記載領域基準抽出を行う（０３０６）。検出した記載領域の範囲内より入力画像の特徴量を新たに抽出し、特徴量の二次照合を行い、入力画像とサンプル画像とが一致するか否かを判定して（０３０７）、帳票識別過程を終了する。 The feature amount here refers to a portion where the luminance of the image appears remarkably in the input image extracted from the form to be discriminated, for example, as in the case of the dictionary creation described above. Feature points are extracted and used in a grid pattern. Then, the feature quantity extracted from the input image is compared with the dictionary created from the sample image. If the distance calculation value of both feature quantities is within a certain distance reference value, the input image and the sample image match and match. Yes (YES), and the process ends (0305). As the distance calculation method, Euclidean distance or dynamic entry mask distance is used. When the case corresponds to the case where the input form is rotating, the form is rotated 180 degrees or 90 degrees at a time, and the feature amount is generated, and the distance is calculated. When the calculated distance between the feature amount and the dictionary exceeds the distance reference value, a description area reference extraction for detecting an area (referred to as a description area) described in the form sheet from the input image is performed. (0306). The feature quantity of the input image is newly extracted from the range of the detected description area, the feature quantity is secondarily collated, and it is determined whether or not the input image and the sample image match (0307), and the form is identified. End the process.

図４は、本実施例における、帳票識別結果を表示端末装置０１０３に提示する表示処理の過程を示すフロー図である。処理部である中央演算装置０１０７で実行される識別結果表示部０２０３のフローにおいて、入力される情報としては、入力帳票画像と、帳票識別結果がある（０４０１）。入力帳票画像は先の電子画像である。帳票識別結果には、認識した帳票種類、その識別尤度、回転方向、ずれ度合いが最低含まれるとする。まず、帳票識別結果に含まれるずれ度合いが閾値を超えるか、または、識別尤度が閾値を超えるかした場合（０４０２）、入力帳票画像と見本帳票画像の間でずれがあると判断し、これらの画像を適切に重ね合わせるための計算を行う。まず入力帳票画像、及び見本帳票画像から特徴点を抽出する（０４０３）。次に特徴点の照合を行い（０４０４）、入力帳票画像と見本帳票画像を最適に重ね合わせるためのパラメータ計算（ずれ量、回転量、伸縮量の計算）を行う。 FIG. 4 is a flowchart showing the process of display processing for presenting the form identification result to the display terminal device 0103 in the present embodiment. In the flow of the identification result display unit 0203 executed by the central processing unit 0107 as the processing unit, input information includes an input form image and a form identification result (0401). The input form image is the previous electronic image. The form identification result includes at least the recognized form type, its identification likelihood, the rotation direction, and the degree of deviation. First, if the degree of deviation included in the form identification result exceeds a threshold or the identification likelihood exceeds the threshold (0402), it is determined that there is a deviation between the input form image and the sample form image. Perform calculations to properly overlay the images. First, feature points are extracted from the input form image and the sample form image (0403). Next, feature points are collated (0404), and parameter calculation (calculation of shift amount, rotation amount, expansion / contraction amount) for optimally overlaying the input form image and the sample form image is performed.

以上の結果を元に、重ね合わせ表示に用いる見本帳票画像の作成を行う（０４０５）。例えば回転があれば、見本帳票画像に対して回転を掛けた場合の画像を計算する。最初にずれがないと判断された場合は、上記の特徴点計算の過程を省くことができる。この特徴点間の対応を取り、両画像間の最適に近似する線形変換パラメータを求めるアルゴリズムを、特徴点マッピングと総称する。ずれ計算の過程であるステップ０４０３から０４０４の計算処理については、後で説明を行う。次に、見本画像計算（０４０５）が終わった後に、入力帳票画像と見本帳票画像を、どのように表示するかの表示画像計算を行う（０４０６）。そして、計算された画像を表示端末装置０１０３のディスプレイ上に表示し（０４０７）、識別結果表示部０２０３の処理を終える。 Based on the above result, a sample form image used for overlay display is created (0405). For example, if there is rotation, an image when the sample form image is rotated is calculated. If it is first determined that there is no deviation, the above feature point calculation process can be omitted. Algorithms that take correspondence between feature points and obtain a linear transformation parameter that optimally approximates both images are collectively referred to as feature point mapping. The calculation processing of steps 0403 to 0404, which is a process of deviation calculation, will be described later. Next, after the sample image calculation (0405) is completed, display image calculation of how to display the input form image and the sample form image is performed (0406). Then, the calculated image is displayed on the display of the display terminal device 0103 (0407), and the processing of the identification result display unit 0203 is finished.

図５は上記のフローにおいてステップ４０３で抽出、計算した特徴点の具体例を示す図である。特徴点としては、例えば、帳票の黒罫線を抽出し、上下左右の４組の黒罫線から構成される枠の中心点を特徴点とする、あるいはハリス（Ｈａｒｒｉｓ）特徴のように画像の局所的な濃度変動方向を計算し、その極大値を特徴点とする方法などがある。本図で示した方法は輪郭の解析の例である。ここでは２値画像が入力されたとして（図５上段）、そこから連結成分の輪郭を抽出して、これを直線分で近似し（図５中段）、その直線分の端点（×印）を特徴点としたものである（図５下段）。先に説明した特徴点抽出０４０３、特徴点照合０４０４において、例えば、図５に示すような特徴点抽出を入力帳票画像と、見本帳票画像に対して行い、次に特徴点間の対応を取ることで、２つの画像にどれほどのずれや回転が存在するのかを性格に計算することができる。 FIG. 5 is a diagram showing a specific example of the feature points extracted and calculated in step 403 in the above flow. As the feature points, for example, a black ruled line of a form is extracted, and the center point of a frame composed of four sets of upper, lower, left, and right black ruled lines is used as a feature point, or a local feature of an image such as a Harris feature is used. There is a method of calculating a concentration variation direction and using the maximum value as a feature point. The method shown in this figure is an example of contour analysis. Here, assuming that a binary image has been input (upper part of FIG. 5), the outline of the connected component is extracted therefrom and approximated by a straight line (middle part of FIG. 5). This is a feature point (lower part of FIG. 5). In the feature point extraction 0403 and the feature point matching 0404 described above, for example, feature point extraction as shown in FIG. 5 is performed on the input form image and the sample form image, and then correspondence between the feature points is taken. Thus, it is possible to calculate exactly how much deviation or rotation exists between the two images.

正しい特徴点と誤った特徴点群が混ざり合った中から、正しいデータのみを取り出す代表的な手法にＲＡＮＳＡＣ(Random Sample Consensus)がある。これは、全データから必要最低限の個数のデータをランダムに選択し、それから拘束条件のパラメータ（線形変換）を計算する。そして、残りのデータからその値の妥当性を評価し、これを十分多数回行って最大の評価を得た値とそれを支持したデータを取り出す。これによって正しいパラメータと正しいデータの両方が同時に推定できる。ＲＡＮＳＡＣアルゴリズムの計算では３点必要だが、線形変換の種類として、回転、シフト、伸縮（かつ、ＸＹで等倍）というものを考えるならば、実は２点の対応関係を選ぶだけで足りる。この性質を利用することで帳票識別の計算量の削減を図ることができる。これを帳票特徴点マッピングと称する。 RANSAC (Random Sample Consensus) is a typical method for extracting only correct data from a mixture of correct feature points and incorrect feature points. In this case, a necessary minimum number of data are randomly selected from all data, and then a constraint condition parameter (linear transformation) is calculated. Then, the validity of the value is evaluated from the remaining data, and this is performed a sufficient number of times to obtain the value obtained with the maximum evaluation and the data supporting it. This allows both correct parameters and correct data to be estimated simultaneously. In the calculation of the RANSAC algorithm, three points are required. However, when considering the types of linear transformation such as rotation, shift, and expansion / contraction (and XY equal magnification), it is actually sufficient to select the correspondence between two points. By utilizing this property, it is possible to reduce the amount of calculation for form identification. This is called form feature point mapping.

つまり、図６に示すように、点｛ｘ１、ｘ２｝が見本帳票から得られた特徴点、点｛ｐ１、ｐ２｝が入力帳票から得られた特徴点であるとして、点ｘ１を基準点として、点ｘ２の座標を伸縮、回転して、ベクトルｘ２−ｘ１がベクトルｐ２−ｐ１と一致するようにした上で、点ｘ１を点ｐ１へと移動する作業を行うという変換を意味する。計算量を削減した帳票特徴点マッピングの擬似プログラムコードを図７に示す。擬似コード中の線形変換は、次の式で表される。すなわち、まず、線形変換を That is, as shown in FIG. 6, assuming that the point {x1, x2} is a feature point obtained from the sample form and the point {p1, p2} is a feature point obtained from the input form, the point x1 is used as a reference point. , The coordinate of the point x2 is expanded / contracted and rotated so that the vector x2-x1 coincides with the vector p2-p1, and then the operation of moving the point x1 to the point p1 is performed. FIG. 7 shows a pseudo program code for form feature point mapping with a reduced amount of calculation. The linear transformation in the pseudo code is expressed by the following formula. That is, first, the linear transformation

とモデル化する。このとき、 And model. At this time,

と置くならば、伸縮、回転、移動の各変換量は次のように求まる。すなわち、伸縮量は、 Then, the conversion amounts of expansion / contraction, rotation, and movement can be obtained as follows. That is, the amount of expansion and contraction is

と表される。回転量は、 It is expressed. The amount of rotation is

と表される。また、移動量は、 It is expressed. The amount of movement is

と導かれる。 It is guided.

図８は第一の実施例により、入力帳票画像と見本帳票画像を重ね合わせた表示の具体例である。ここでは、帳票識別特徴辞書０２０５に含まれるメッシュの粗い画像（粗画像と呼ぶ）を辞書として、２つを重ね合わせている。この場合、図２に示した見本画像辞書０２０６は不要となり、辞書容量が削減できる。粗画像表示の利点は、帳票識別を運用する現場において入力帳票等に記入された個人情報の流出を避け得ること、また、画像処理の計算量を削減することなどである。 FIG. 8 is a specific example of the display in which the input form image and the sample form image are superimposed according to the first embodiment. Here, a coarse mesh image (referred to as a coarse image) included in the form identification feature dictionary 0205 is used as a dictionary, and the two are superimposed. In this case, the sample image dictionary 0206 shown in FIG. 2 becomes unnecessary, and the dictionary capacity can be reduced. The advantage of the coarse image display is that it is possible to avoid the leakage of personal information entered in the input form or the like at the site where the form identification is operated, and the amount of calculation for image processing is reduced.

続いて、図９は表示処理の他の実施例として、入力帳票画像と見本帳票画像を重ね合わせ、更に差異のある差異箇所を色表示やグレー表示９０によって強調した表示例である。 Next, FIG. 9 shows a display example in which the input form image and the sample form image are overlaid as another embodiment of the display process, and further, the different differences are highlighted by color display or gray display 90.

差異箇所の計算は次のようにして行う。差異箇所の計算段階では、入力帳票画像の帳票種を識別する処理と、当該帳票種を代表する見本帳票画像との帳票特徴点マッピングが済んでいるとする。ここで、帳票種の識別は、図３に示す０３０２から０３０７の処理によって行われる。また、帳票特徴点マッピングは、図４に示す０４０２から０４０４の処理によって行われる。 The difference is calculated as follows. It is assumed that the process of identifying the form type of the input form image and the form feature point mapping between the sample form image representing the form type have been completed at the difference portion calculation stage. Here, the form type is identified by the processes 0302 to 0307 shown in FIG. Form feature point mapping is performed by the processes 0402 to 0404 shown in FIG.

上述した帳票特徴点マッピングにより、両画像間を重ね合わせる際に行うべき、回転、シフト、伸縮量が判明する。そこで入力帳票画像に対して当該パラメータを用いた変換を行い、二次元格子状配置された特徴点を用いて特徴ベクトルを計算する。線形変換を行った入力帳票画像から得られたベクトルをＩ、見本帳票画像から得られたベクトルをＭとする。各ベクトルは二次元格子状の特徴点を用いて抽出しているため、これらのベクトルを、帳票を荒く近似した画像と見なすことができる。そこで、入力帳票画像に対応するベクトルをＩ（ｘ，ｙ）、見本帳票画像に対応するベクトルをＭ（ｘ，ｙ）で表す。このとき、２つの画像間の違いを表す差異指標Ｄ（ｘ，ｙ）は、 By the above-described form feature point mapping, the amount of rotation, shift, and expansion / contraction to be performed when the two images are overlaid is determined. Therefore, the input form image is converted using the parameter, and the feature vector is calculated using the feature points arranged in a two-dimensional grid. Let I be a vector obtained from the input form image subjected to linear transformation, and M be a vector obtained from the sample form image. Since each vector is extracted using two-dimensional grid-like feature points, these vectors can be regarded as an image that roughly approximates a form. Therefore, a vector corresponding to the input form image is represented by I (x, y), and a vector corresponding to the sample form image is represented by M (x, y). At this time, the difference index D (x, y) representing the difference between the two images is

で計算できる。すなわちＤ（ｘ，ｙ）の大きい箇所は、両帳票画像間で何らかの差異が存在する所である。ただし、この差異指標Ｄ値をそのまま用いると局所的な変動に影響され易いという問題があるため、安定した差異を表す尺度として差異尤度Ｌ（ｘ，ｙ）を用いる。差異尤度Ｌは、差異指標の部分領域における平均値であり、次式に従って計算する。 It can be calculated with That is, the part where D (x, y) is large is a place where there is some difference between the two form images. However, if this difference index D value is used as it is, there is a problem that it is easily affected by local fluctuations, and therefore, the difference likelihood L (x, y) is used as a scale representing a stable difference. The difference likelihood L is an average value in the partial area of the difference index, and is calculated according to the following equation.

この式中のＲｘ、Ｒｙは部分平均化のサイズを表す。次に、差異尤度の高い方から順に、指定されたＮ個の点（ｘ１，ｙ１）・・・（ｘＮ，ｙＮ）を選択する。これらの点に相当する箇所が、画像上の差異箇所と判断され、当該箇所に色を強調して表示することとなる。 Rx and Ry in this formula represent the size of partial averaging. Next, the designated N points (x1, y1)... (XN, yN) are selected in order from the highest difference likelihood. A part corresponding to these points is determined as a difference part on the image, and the color is emphasized and displayed at the part.

差異強調表示の利点は、類似した帳票群がある場合において、帳票識別の結果を成否を判断をサポートし、迅速な処理を実現し得ることである。なお、これらの差異箇所の計算は、先に説明したＣＰＵなどで構成される中央演算装置０１０７におけるプログラム処理で実現できることは言うまでもない。また、画像表示の際、画面上の特定されたエリアである差異箇所９０を、色表示、グレー表示によって強調表示する技術も、中央演算装置０１０７などで実行される表示制御プログラムで実現可能である。 The advantage of the difference highlighting is that, in the case where there are similar form groups, it is possible to support the judgment of success or failure of the result of form identification and to realize quick processing. Needless to say, the calculation of these differences can be realized by the program processing in the central processing unit 0107 constituted by the CPU described above. In addition, a technique for highlighting the difference portion 90 that is a specified area on the screen by color display or gray display when displaying an image can be realized by a display control program executed by the central processing unit 0107 or the like. .

次に、図１０は表示処理の他の実施例として、帳票画像上に書かれた特定情報（記載内容）の部分（特定エリアと呼ぶ）にマスク１００を掛けて表示する表示例である。マスクの掛け方としては２通りある。１つは、見本帳票画像が未記入の帳票画像として、見本帳票画像と異なる部分で、かつ黒画素が閾値以上の所を記載エリアと判断し、この記載エリアをマスクして表示する方法である。もう一つは、見本帳票画像が記入帳票である場合で、連結成分のサイズや輪郭成分の周波数解析などにより、手書き記入と見なされたエリア（記入エリア）にマスクを掛けて表示する方法である。マスク表示の利点は、帳票識別の運用現場において、入力帳票等に記入された個人情報の流出を避け得ることである。 Next, FIG. 10 shows a display example in which a specific information (description content) written on a form image (referred to as a specific area) is displayed with a mask 100 as another embodiment of the display process. There are two ways to apply the mask. One is a method in which a sample form image is an unfilled form image, a portion different from the sample form image and a black pixel having a threshold value or more is determined as a description area, and the description area is masked and displayed. . The other is a method in which the sample form image is an entry form, and the area (entry area) considered to be handwritten entry is masked and displayed by analysis of the size of the connected component and frequency analysis of the outline component. . The advantage of the mask display is that it is possible to avoid the leakage of personal information entered in the input form or the like at the form identification operation site.

なお、記入帳票上に書かれている文字が手書き文字か、あるいは元々帳票に印刷されているような活字文字であるかを見分ける手段としては、本件出願人が先に出願した特開２００３−３０８４８３公報「文字種判定プログラム、文字識別プログラム及び文字識別装置」にて開示した方法を用いることができる。これらのマスク表示のための特定エリア検出は、先に説明したＣＰＵなどで構成される中央演算装置０１０７におけるプログラム処理で実現できることは言うまでもない。また、画像表示の際、画面上の特定エリアをマスク表示する技術も、中央演算装置０１０７などで実行される表示制御プログラムで実現可能である。 Incidentally, as means for discriminating whether the character written on the entry form is a handwritten character or a type character originally printed on the form, Japanese Patent Application Laid-Open No. 2003-308483 filed earlier by the present applicant. The method disclosed in the publication “Character Type Determination Program, Character Identification Program, and Character Identification Device” can be used. It goes without saying that the specific area detection for mask display can be realized by the program processing in the central processing unit 0107 constituted by the CPU described above. In addition, a technique for mask-displaying a specific area on the screen when displaying an image can be realized by a display control program executed by the central processing unit 0107 or the like.

さらに、上述した表示手法は、単独で行われるのみでなく、組み合わせての運用も可能である。例えば、ぼかし表示と差異強調表示を組み合わせる、マスク表示と差異強調表示を組み合わせるなどである。 Furthermore, the display methods described above are not only performed independently, but also can be used in combination. For example, a combination of blur display and difference emphasis display is combined, and mask display and difference emphasis display are combined.

また、図１１は表示処理の他の実施例として、最も単純な場合である、入力帳票と見本帳票画像をそのまま重ね合わせて表示する表示例である。 FIG. 11 shows a display example in which the input form and the sample form image are superimposed and displayed as they are in the simplest case as another embodiment of the display process.

図１２は、上述した表示手法を用いた場合の、帳票識別結果をユーザに示す場合の画面構成の一例である。１２０１には識別対象となった未知帳票の番号が表示されており、１２０２に帳票識別の結果として候補に挙げられた帳票種の名前が並んでいる。また、帳票種の名前を並べると同時に、その縮小画像を並べて表示することも可能である。１２０３には、そのなかのどれを画像上で重ね合わせて表示するか、を選択するためのボタンが並んでいる。この場合、第一候補である帳票Dがユーザによって選択されているので、帳票Dの見本帳票画像と、入力帳票画像とが重ね合わさり、１２０４に表示される。この表示形態により、ユーザは帳票画像を見比べながら、正しい帳票種を選択することが可能となる。また、識別候補の中に正しい帳票種が見つからない場合は、ユーザが帳票種名を入力できるように、１２０５に入力ボックスがある。選択された帳票種が正しいとユーザが判断した場合は、１２０６にある決定ボタンを押して、帳票の識別結果を確定する。これら表示画面に関する処理もまた、中央演算装置０１０７で実行される。 FIG. 12 is an example of a screen configuration when the form identification result is shown to the user when the above-described display method is used. 1201 displays the number of the unknown form as an identification target, and 1202 lists the names of the form types listed as candidates as a result of the form identification. It is also possible to display the reduced images side by side at the same time the form types are arranged. In 1203, buttons for selecting which of them to be displayed on the image are arranged. In this case, since the form D which is the first candidate is selected by the user, the sample form image of the form D and the input form image are superimposed and displayed on 1204. With this display form, the user can select the correct form type while comparing the form images. If a correct form type is not found among the identification candidates, an input box 1205 is provided so that the user can input the form type name. If the user determines that the selected form type is correct, the user presses a decision button 1206 to confirm the form identification result. The processing related to these display screens is also executed by the central processing unit 0107.

第一の実施例に用いられる高速帳票読取装置のハード構成図。The hardware block diagram of the high-speed form reading apparatus used for a 1st Example. 第一の実施例における帳票読取処理のブロック構成図。The block block diagram of the form reading process in a 1st Example. 第一の実施例における帳票識別処理の処理フロー図。The processing flowchart of the form identification process in a 1st Example. 第一の実施例における帳票識別結果表示処理の処理フロー図。The processing flowchart of the form identification result display process in a 1st Example. 第一の実施例における帳票画像からの特徴点抽出例を示す図。The figure which shows the example of feature point extraction from the form image in a 1st Example. 第一の実施例における特徴点間の対応をイメージした図。The figure which imaged the correspondence between the feature points in a 1st Example. 第一の実施例における帳票特徴点マッピングの擬似コードを示した図。The figure which showed the pseudo code of the form feature point mapping in a 1st Example. 第一の実施例における入力帳票画像と見本帳票画像の重ね合わせ表示例１を示す図。The figure which shows the example 1 of a superimposition display of the input form image and sample form image in a 1st Example. 他の実施例における入力帳票画像と見本帳票画像の重ね合わせ表示例２を示す図。The figure which shows the example 2 of superimposition display of the input form image and sample form image in another Example. 他の実施例における入力帳票画像と見本帳票画像の重ね合わせ表示例３を示す図。The figure which shows the example 3 of an overlay display of the input form image and sample form image in another Example. 他の実施例における入力帳票画像と見本帳票画像の重ね合わせ表示例４を示す図。The figure which shows the example 4 of a superimposed display of the input form image and sample form image in another Example. 帳票識別結果の確認画面構成例を示す図。The figure which shows the example of a confirmation screen structure of a form identification result.

Explanation of symbols

０１０１…画像撮像装置、０１０２…操作端末装置、０１０３…表示端末装置、０１０４…ソータ装置、０１０５…外部記憶装置、０１０６…メモリ、０１０７…中央演算装置、０１０８…内部バス、０１０９…通信装置、０２０１…画像撮像部、０２０２…帳票識別部、０２０３…識別結果表示部、０２０４…ソータ部、０２０５…帳票識別特徴辞書、０２０６…見本帳票画像辞書、０２０７…帳票振分辞書、０３０１…帳票画像入力部、０３０２…紙領域基準抽出部、０３０３…サイズ判定部、０３０４…一次照合部、０３０５…閾値判定部、０３０６…記載領域基準抽出部、０３０７…二次照合部、０４０１…入力データ、０４０２…ズレ判定部、０４０３…特徴点抽出部、０４０４…特徴点照合部、０４０５…見本画像計算部、０４０６…表示画像計算部、０４０７…画像表示部。 DESCRIPTION OF SYMBOLS 0101 ... Image pick-up device, 0102 ... Operation terminal device, 0103 ... Display terminal device, 0104 ... Sorter device, 0105 ... External storage device, 0106 ... Memory, 0107 ... Central processing unit, 0108 ... Internal bus, 0109 ... Communication device, 0201 Image capturing unit, 0202 ... Form identifying unit, 0203 ... Identification result display unit, 0204 ... Sorter unit, 0205 ... Form identifying feature dictionary, 0206 ... Sample form image dictionary, 0207 ... Form distribution dictionary, 0301 ... Form image input unit , 0302 ... Paper area reference extraction unit, 0303 ... Size determination part, 0304 ... Primary collation part, 0305 ... Threshold judgment part, 0306 ... Description area reference extraction part, 0307 ... Secondary collation part, 0401 ... Input data, 0402 ... Deviation Determination unit, 0403 ... feature point extraction unit, 0404 ... feature point collation unit, 0405 ... sample image calculation unit, 04 6 ... display image calculating unit, 0407 ... the image display unit.

Claims

A form identification device that reads and identifies forms,
An image capturing unit that reads the form to acquire an input form image, a storage unit that stores a form identification feature dictionary describing feature amounts of a plurality of sample form images, a processing unit that processes the input form image, and a display And
The processor is
Based on the dictionary stored in the storage unit, identify the type of the input form image,
When displaying the result of the form identification on the display unit, the input form image, the sample form image having the most consistent feature amount as a result of the form identification , the input form image and the sample form image When the degree of deviation exceeds a predetermined threshold, the feature point extraction of the input form image and the sample form image is performed, and the input form image and the A form identification device that calculates conversion parameters for superimposing sample form images, converts the sample form images according to the conversion parameters, and controls the display so as to display them superimposed.

The form identification device according to claim 1,
The processor is
A form identifying apparatus that controls to display a coarse image in a superimposed manner when the input form image and the sample form image are displayed in a superimposed manner.

The form identification device according to claim 1,
The processor is
A form identification device that controls to display a difference portion between the input form image and the sample form image in an emphasized manner when displaying the input form image and the sample form image.

The form identification device according to claim 1,
The processor is
A form identification device that controls to display a specific area of the input form image with a mask when the input form image and the sample form image are displayed in a superimposed manner.

A form identification program for use in a form identification apparatus including a storage unit for storing a form identification feature dictionary describing feature quantities of a plurality of sample form images, and a processing unit for processing an input form image obtained by reading an input form,
The processing unit is
An identification step for identifying the input form image and which of the plurality of sample form images matches;
When displaying the identification result obtained by the identification step , the degree of deviation between the input form image and the sample form image is determined in advance for the sample form image having the feature quantity that most closely matches the input form image. If the threshold value exceeds the threshold value, the feature points of the input form image and the sample form image are extracted, and the input form image and the sample form image are superimposed based on the extracted matching result of the feature points. A form identification program for controlling to execute a display control step for controlling to display a superimposed parameter after calculating a conversion parameter and converting the sample form image with the conversion parameter .

The form identification program according to claim 5,
In the display control step, the processing unit is
When the input form image and the sample form image are overlaid, either a rough image is overlaid, a difference portion is highlighted, or a specific area is masked or displayed. A form identification program that controls to execute combinations.