JP6487100B1

JP6487100B1 - Form processing apparatus and form processing method

Info

Publication number: JP6487100B1
Application number: JP2018099935A
Authority: JP
Inventors: 正暁杉山
Original assignee: Toshiba Corp; Toshiba Infrastructure Systems and Solutions Corp
Current assignee: Toshiba Corp; Toshiba Infrastructure Systems and Solutions Corp
Priority date: 2018-05-24
Filing date: 2018-05-24
Publication date: 2019-03-20
Anticipated expiration: 2038-05-24
Also published as: JP2019204363A

Abstract

【課題】帳票モデルの特定に優れた帳票処理装置を提供すること。
【解決手段】実施形態に係る情報認識装置は、入力部と、検出部と、識別部とを備える。前記入力部は、識別対象帳票の帳票イメージを入力する。前記検出部は、前記帳票イメージの入力から入力特徴点を検出する。前記識別部は、複数種類の帳票に対応する各帳票モデルのモデル特徴点間と前記入力特徴点間との成す角度に基づき各帳票モデルと前記帳票イメージとの類似度を判定し、類似度の判定結果に基づき前記帳票イメージに対応する所定の帳票モデルを識別する。
【選択図】図１To provide a form processing apparatus excellent in specifying a form model.
An information recognition apparatus according to an embodiment includes an input unit, a detection unit, and an identification unit. The input unit inputs a form image of a form to be identified. The detection unit detects an input feature point from the input of the form image. The identification unit determines the similarity between each form model and the form image based on an angle formed between model feature points of each form model corresponding to a plurality of types of forms and between the input feature points, A predetermined form model corresponding to the form image is identified based on the determination result.
[Selection] Figure 1

Description

本発明の実施形態は、帳票処理装置及び帳票処理方法に関する。 Embodiments described herein relate generally to a form processing apparatus and a form processing method.

光学文字認識（Optical Character Recognition）の応用として、帳票画像に文字認識処理を行うことで、帳票内の記載内容をデータ化する業務（データエントリ業務）が知られている。このような技術において、様々な帳票の文字認識をするために、入力された帳票の種類を特定する帳票識別技術が必要とされている。 As an application of optical character recognition (Optical Character Recognition), a task (data entry task) is known in which the contents of a form are converted into data by performing a character recognition process on the form image. In such a technique, a form identification technique for specifying the type of the input form is required in order to perform character recognition of various forms.

帳票識別においては、帳票内に帳票固有の文字列が印字された、識別キーとなり得る位置をモデルとして事前定義しておき、未識別帳票が入力された場合には事前定義された箇所を文字認識して該当するモデルを特定する方法が一般的に知られている。この場合の識別キーとなり得る文字列は帳票内の一般的な文字認識対象（例：請求書における請求先、日付、金額等）とは必ずしも一致しない。 In form identification, a position that can be used as an identification key, in which a character string unique to the form is printed, is pre-defined as a model, and when an unidentified form is input, the pre-defined part is recognized as a character. In general, a method for identifying a corresponding model is known. A character string that can be an identification key in this case does not necessarily match a general character recognition target (eg, billing address, date, amount, etc. in an invoice).

特開２００１−２２９３４１号公報JP 2001-229341 A

しかしながら、上記した手法には、以下の課題が存在し、これら課題の少なくとも一つを解決したいという要望がある。
・モデル毎に帳票固有の文字列が存在しない場合は識別キーを定義することができず、帳票識別ができない。
・識別キーに対する文字認識処理を、定義済みのモデル数分実行する必要があり、モデル数が膨大な場合、処理に時間がかかる。
・識別キーが帳票の伸縮や印字ずれによって定義位置からずれた箇所に印字されていた場合、正確な文字認識結果が得られず、帳票識別に失敗する場合がある。 However, the above-described methods have the following problems, and there is a desire to solve at least one of these problems.
-If there is no form-specific character string for each model, the identification key cannot be defined and the form cannot be identified.
-It is necessary to execute the character recognition process for the identification key for the number of defined models. If the number of models is enormous, the process takes time.
-If the identification key is printed at a location deviated from the defined position due to expansion or contraction of the form or printing deviation, an accurate character recognition result may not be obtained and form identification may fail.

本発明の目的は、帳票モデルの特定に優れた帳票処理装置及び帳票処理方法を提供することにある。 An object of the present invention is to provide a form processing apparatus and a form processing method excellent in specifying a form model.

実施形態に係る情報認識装置は、入力部と、検出部と、識別部とを備える。前記入力部は、識別対象帳票の帳票イメージを入力する。前記検出部は、前記帳票イメージの入力から入力特徴点を検出する。前記識別部は、複数種類の帳票に対応する各帳票モデルのモデル特徴点間と前記入力特徴点間との成す角度に基づき各帳票モデルと前記帳票イメージとの類似度を判定し、類似度の判定結果に基づき前記帳票イメージに対応する所定の帳票モデルを識別する。 The information recognition apparatus according to the embodiment includes an input unit, a detection unit, and an identification unit. The input unit inputs a form image of a form to be identified. The detection unit detects an input feature point from the input of the form image. The identification unit determines the similarity between each form model and the form image based on an angle formed between model feature points of each form model corresponding to a plurality of types of forms and between the input feature points, A predetermined form model corresponding to the form image is identified based on the determination result.

図１は、実施形態に係る帳票処理装置の概略構成の一例を示すブロック図である。FIG. 1 is a block diagram illustrating an example of a schematic configuration of a form processing apparatus according to an embodiment. 図２は、実施形態に係る帳票処理装置の応用例を示すブロック図である。FIG. 2 is a block diagram illustrating an application example of the form processing apparatus according to the embodiment. 図３は、実施形態に係る帳票処理装置によるモデル定義処理の一例を示すフローチャートである。FIG. 3 is a flowchart illustrating an example of model definition processing by the form processing apparatus according to the embodiment. 図４は、実施形態に係る帳票処理装置による帳票識別処理及び文字認識処理の一例を示すフローチャートである。FIG. 4 is a flowchart illustrating an example of a form identification process and a character recognition process performed by the form processing apparatus according to the embodiment. 図５Ａは、モデル定義部により生成された読み取り定義情報の一例を示す図である。FIG. 5A is a diagram illustrating an example of read definition information generated by the model definition unit. 図５Ｂは、モデル定義部により検出された罫線交点情報の一例を示す図である。FIG. 5B is a diagram illustrating an example of ruled line intersection information detected by the model definition unit. 図６は、帳票イメージに対応する罫線交点の一例を示す図である。FIG. 6 is a diagram illustrating an example of a ruled line intersection corresponding to a form image. 図７は、定義情報表示領域の一例を示す図である。FIG. 7 is a diagram illustrating an example of the definition information display area. 図８Ａは、入力された帳票イメージの一例を示す図である。FIG. 8A is a diagram illustrating an example of an input form image. 図８Ｂは、入力された帳票イメージの一例を示す図である。FIG. 8B is a diagram illustrating an example of an input form image. 図９Ａは、複数種類の帳票に対応する各帳票モデル（モデル定義情報）の概念を示す図である。FIG. 9A is a diagram illustrating the concept of each form model (model definition information) corresponding to a plurality of types of forms. 図９Ｂは、複数種類の帳票に対応する各帳票モデル（モデル定義情報）の概念を示す図である。FIG. 9B is a diagram illustrating the concept of each form model (model definition information) corresponding to a plurality of types of forms. 図１０は、モデル特徴点間と入力特徴点間との成す角度に基づき各帳票モデルと帳票イメージとの類似度を判定し、判定結果に基づき帳票モデルを識別（特定）する概念を説明するための図である。FIG. 10 illustrates the concept of determining the similarity between each form model and the form image based on the angle formed between the model feature points and the input feature points, and identifying (specifying) the form model based on the determination result. FIG.

以下、図面を参照して実施形態について説明する。 Hereinafter, embodiments will be described with reference to the drawings.

図１は、実施形態に係る帳票処理装置の概略構成の一例を示すブロック図である。
図１に示すように、帳票処理装置１は、帳票イメージ取り込み部２、帳票イメージ保存部３、モデル定義部４、モデル保存部５、帳票識別部６、文字認識部７、及び文字認識結果保存部８を備える。例えば、保存部Ｓが、帳票イメージ保存部３、モデル保存部５、及び文字認識結果保存部８を含み、保存部Ｓが、１以上のハードディスクドライブ等で構成されてもよい。 FIG. 1 is a block diagram illustrating an example of a schematic configuration of a form processing apparatus according to an embodiment.
As shown in FIG. 1, the form processing apparatus 1 includes a form image capturing unit 2, a form image storage unit 3, a model definition unit 4, a model storage unit 5, a form identification unit 6, a character recognition unit 7, and a character recognition result storage. Part 8 is provided. For example, the storage unit S may include a form image storage unit 3, a model storage unit 5, and a character recognition result storage unit 8, and the storage unit S may be configured with one or more hard disk drives.

帳票イメージ取り込み部２は、帳票モデルとなる帳票Ｄｏｃの帳票イメージを入力する入力部であり、また、識別対象となる帳票Ｄｏｃ（未識別の帳票Ｄｏｃ）の帳票イメージを入力する入力部でもある。帳票イメージ取り込み部２は、帳票Ｄｏｃの画像を光学的に読み取るスキャナを備える。帳票イメージ取り込み部２は、スキャナにより請求書又は明細書等の帳票Ｄｏｃを光学的に読み取り、帳票Ｄｏｃに対応する帳票イメージを出力する。なお、帳票イメージ取り込み部２は、外部のスキャナ等で読み取られた帳票イメージを入力するものであってもよく、その場合、帳票イメージ取り込み部２は、スキャナを備えなくてもよい。帳票イメージ保存部３は、帳票イメージ取り込み部２から出力される帳票イメージを保存する。 The form image capturing unit 2 is an input unit that inputs a form image of a form Doc that is a form model, and is also an input unit that inputs a form image of a form Doc (unidentified form Doc) to be identified. The form image capturing unit 2 includes a scanner that optically reads an image of the form Doc. The form image capturing unit 2 optically reads a form Doc such as an invoice or a statement by a scanner and outputs a form image corresponding to the form Doc. Note that the form image capturing unit 2 may input a form image read by an external scanner or the like. In this case, the form image capturing unit 2 may not include a scanner. The form image storage unit 3 stores the form image output from the form image capture unit 2.

モデル定義部４は、１以上のプロセッサで構成される。モデル定義部４は、帳票イメージ保存部３に保存される複数種類の帳票イメージに基づき、複数種類の帳票Ｄｏｃに対応する複数の帳票モデルを定義（登録）する。モデル保存部５は、モデル定義部４で定義される複数の帳票モデルを保存する。 The model definition unit 4 is composed of one or more processors. The model definition unit 4 defines (registers) a plurality of form models corresponding to a plurality of types of form Doc based on a plurality of types of form images stored in the form image storage unit 3. The model storage unit 5 stores a plurality of form models defined by the model definition unit 4.

例えば、ユーザーは、タッチパネル又はマウス等の情報入力機器を介して、ディスプレイ等に表示される複数種類の帳票イメージから、モデル対象となる帳票イメージを選択する。さらに、ユーザーは、情報入力機器を介して、表示されるモデル対象の帳票イメージに対して、読み取り位置と読み取り属性（数字、氏名、又は住所等の属性）、及び文字等の認識位置等を指定する。モデル定義部４は、モデル対象の帳票イメージから画像処理等により罫線交点等の特徴点を検出し、検出された特徴点を含む帳票モデルを定義する。また、モデル定義部は、指定された読み取り位置において、指定された読み取り属性を示す情報を特徴点として設定し、この設定された特徴点を含む帳票モデルを定義する。さらに、モデル定義部は、指定された文字等の認識位置を含む帳票モデルを定義する。つまり、各帳票モデルは、罫線交点等の特徴点、指定された読み取り位置において指定された読み取り属性を示す特徴点、及び認識位置等を対応付けて帳票モデルを定義する。 For example, the user selects a form image to be modeled from a plurality of types of form images displayed on a display or the like via an information input device such as a touch panel or a mouse. In addition, the user specifies the reading position and reading attributes (attributes such as numbers, names, or addresses) and the recognition position of characters, etc., for the displayed model target form image via the information input device. To do. The model definition unit 4 detects feature points such as ruled line intersections by image processing or the like from the model image of the model object, and defines a form model including the detected feature points. The model definition unit sets information indicating the specified reading attribute as a feature point at the specified reading position, and defines a form model including the set feature point. Further, the model definition unit defines a form model including a recognition position of a designated character or the like. That is, each form model defines a form model by associating a feature point such as a ruled line intersection, a feature point indicating a reading attribute designated at a designated reading position, a recognition position, and the like.

帳票識別部６は、１以上のプロセッサにより構成され、帳票イメージ取り込み部２から出力される帳票イメージがどの帳票モデルに対応するかを識別（特定）する。つまり、帳票識別部６は、帳票イメージ取り込み部２から出力される帳票イメージがどのような種類の帳票かを識別（特定）する。帳票識別部６は、識別対象帳票の帳票イメージから入力特徴点を検出し、複数種類の帳票に対応する各帳票モデルのモデル特徴点間と入力特徴点間との成す角度を閾値と比較し、比較結果に基づき各帳票モデルと帳票イメージとの類似度を判定し、類似度の判定結果に基づき帳票イメージに対応する所定の帳票モデルを識別（特定）する。例えば、成す角度が閾値より大きいほど類似度は低くなり、成す角度が閾値より小さいほど類似度は高くなる。例えば、最も類似度の高い帳票モデルを特定する。 The form identifying unit 6 includes one or more processors, and identifies (specifies) which form model the form image output from the form image capturing unit 2 corresponds to. That is, the form identifying unit 6 identifies (specifies) what type of form the form image output from the form image capturing unit 2 is. The form identifying unit 6 detects input feature points from the form image of the form to be identified, compares the angle between the model feature points of each form model corresponding to a plurality of types of forms and the input feature points with a threshold, The similarity between each form model and the form image is determined based on the comparison result, and a predetermined form model corresponding to the form image is identified (specified) based on the similarity determination result. For example, the similarity is lower as the formed angle is larger than the threshold, and the similarity is higher as the formed angle is smaller than the threshold. For example, the form model having the highest similarity is specified.

文字認識部７は、１以上のプロセッサにより構成され、帳票識別部６から出力される帳票イメージに含まれる文字、数字、及び記号等の各種情報を認識する。文字認識部７は、帳票モデル（帳票種別）に応じた文字認識処理により、効率的に、帳票イメージに含まれる文字等の各種情報を認識する。文字認識部７は、帳票識別部６で識別された所定の帳票モデルに対応付けられた文字等の認識位置を用いて、帳票イメージに対して認識処理を実行する。所定の帳票モデルに対応付けられた文字等の認識位置に対応する位置を対象とすることにより、効率的に認識処理を実行することができる。なお、所定の帳票モデルに対応付けられた文字等の認識位置を所定サイズだけ拡大して、拡大された文字等の認識位置を対象として認識処理を実行するようにしてもよい。文字認識結果保存部８は、文字認識部７による文字認識結果を保存する。 The character recognition unit 7 includes one or more processors, and recognizes various information such as characters, numbers, and symbols included in the form image output from the form identification unit 6. The character recognition unit 7 efficiently recognizes various information such as characters included in the form image by character recognition processing according to the form model (form type). The character recognition unit 7 performs a recognition process on the form image using the recognition position of the character or the like associated with the predetermined form model identified by the form identification unit 6. By targeting a position corresponding to a recognition position of a character or the like associated with a predetermined form model, the recognition process can be executed efficiently. Note that the recognition position of a character or the like associated with a predetermined form model may be enlarged by a predetermined size, and the recognition process may be executed on the recognition position of the enlarged character or the like. The character recognition result storage unit 8 stores the character recognition result by the character recognition unit 7.

なお、モデル定義部４、帳票識別部６、及び文字認識部７による処理を１又は複数のプロセッサが統括的に実行するようにしてもよい。なお、モデル定義部４、帳票識別部６、及び文字認識部７による処理を必ずしも分散させる必要はない。 In addition, you may make it a 1 or several processor perform a process by the model definition part 4, the form identification part 6, and the character recognition part 7 collectively. Note that the processing by the model definition unit 4, the form identification unit 6, and the character recognition unit 7 is not necessarily distributed.

また、保存部Ｓは、各部のプロセッサの動作を制御する１以上のプログラムを記憶する。例えば、保存部Ｓは、モデル定義部４によるモデル定義処理、帳票識別部６による帳票識別処理、文字認識部７による文字認識処理等の動作を制御する１以上のプログラムを記憶する。言い換えれば、モデル定義部４、帳票識別部６、及び文字認識部７は、保存部Ｓに記憶された１以上のプログラムに基づき動作する。 The storage unit S stores one or more programs that control the operation of the processor of each unit. For example, the storage unit S stores one or more programs that control operations such as model definition processing by the model definition unit 4, form identification processing by the form identification unit 6, and character recognition processing by the character recognition unit 7. In other words, the model definition unit 4, the form identification unit 6, and the character recognition unit 7 operate based on one or more programs stored in the storage unit S.

例えば、コンピュータ読み取り可能な記憶媒体を通じて上記プログラムを帳票処理装置１へインストールして実行するだけで、上記処理を容易に実現することができる。例えば、帳票処理装置１は、上記プログラムをネットワーク経由でダウンロードし、ダウンロードしたプログラムを記憶し、プログラムのインストールを完了することができる。或いは、帳票処理装置１は、上記プログラムを情報記憶媒体から読み取り、読み取ったプログラムを記憶し、プログラムのインストールを完了することができる。 For example, the above-described processing can be easily realized simply by installing and executing the above-described program in the form processing apparatus 1 through a computer-readable storage medium. For example, the form processing apparatus 1 can download the program via a network, store the downloaded program, and complete the installation of the program. Alternatively, the form processing apparatus 1 can read the program from the information storage medium, store the read program, and complete the installation of the program.

図２は、実施形態に係る帳票処理装置の応用例を示すブロック図である。
図２に示すように、帳票処理装置１は、帳票イメージ取り込み部２、帳票イメージ保存部３、モデル定義部４、モデル保存部５、帳票識別部６、文字認識部７、及び文字認識結果保存部８、画面制御部９、及びユーザインタフェース部１０を備える。 FIG. 2 is a block diagram illustrating an application example of the form processing apparatus according to the embodiment.
As shown in FIG. 2, the form processing apparatus 1 includes a form image capturing unit 2, a form image storage unit 3, a model definition unit 4, a model storage unit 5, a form identification unit 6, a character recognition unit 7, and a character recognition result storage. Unit 8, screen control unit 9, and user interface unit 10.

帳票イメージ取り込み部２は、スキャナ２１及び帳票イメージ書込部２２を備える。スキャナ２は請求書等の帳票Ｄｏｃを光学的に読み取り、帳票Ｄｏｃに対応する帳票イメージを出力する。帳票イメージ書込部２２は、スキャナ２から出力される帳票イメージにＩＤ（識別情報）を割り当てて、ＩＤに対応付けられた帳票イメージを帳票イメージ保存部３に書き込む。例えば、帳票イメージ保存部３は、ＩＤに対応付けて帳票イメージを保存する。 The form image capturing unit 2 includes a scanner 21 and a form image writing unit 22. The scanner 2 optically reads a form Doc such as an invoice and outputs a form image corresponding to the form Doc. The form image writing unit 22 assigns an ID (identification information) to the form image output from the scanner 2 and writes the form image associated with the ID in the form image storage unit 3. For example, the form image storage unit 3 stores the form image in association with the ID.

帳票イメージ保存部３、モデル定義部４、モデル保存部５、及び帳票識別部６については、既に説明した通りであり、詳細説明は省略する。 The form image storage unit 3, the model definition unit 4, the model storage unit 5, and the form identification unit 6 are as described above, and detailed description thereof is omitted.

画面制御部９は、帳票イメージ入出力部９１、及び画面生成部９２を備える。帳票イメージ入出力部９１は、帳票イメージ保存部３に保存された帳票イメージを読み出し、画面生成部９２へ出力する。画面生成部９２は、読み出された帳票イメージに基づく画面を生成し、ユーザインタフェース部１０（ディスプレイ１０１）へ出力する。 The screen control unit 9 includes a form image input / output unit 91 and a screen generation unit 92. The form image input / output unit 91 reads out the form image stored in the form image storage unit 3 and outputs it to the screen generation unit 92. The screen generation unit 92 generates a screen based on the read form image and outputs it to the user interface unit 10 (display 101).

ユーザインタフェース部１０は、ディスプレイ１０１、タッチパネル１０２、キーボード１０３、及びマウス１０４等を備える。ディスプレイ１０１は、生成された帳票イメージに基づく画面を表示する。タッチパネル１０２、キーボード１０３、マウス１０４は、ユーザーからの入力に応じて、ディスプレイ１０１の画面上に表示されるボタンを選択したり、画面上で領域を指定したりする。 The user interface unit 10 includes a display 101, a touch panel 102, a keyboard 103, a mouse 104, and the like. The display 101 displays a screen based on the generated form image. The touch panel 102, the keyboard 103, and the mouse 104 select a button displayed on the screen of the display 101 or specify an area on the screen in accordance with an input from the user.

文字認識部７は、辞書データベース７１、及びＯＣＲエンジン７２を備える。ＯＣＲエンジン７２は、辞書データベース７１に登録された辞書データに基づき、帳票識別部６から出力される帳票イメージに含まれる文字、数字、及び記号等の各種情報を認識する。ＯＣＲエンジン７２は、帳票モデル（帳票種別）に応じた文字認識処理により、効率的に、帳票イメージに含まれる文字等の各種情報を認識する。帳票モデルに応じた文字認識処理とは、帳票モデルに応じて文字認識位置が指定された文字認識処理、帳票モデルに応じて文字サイズが指定された文字認識処理、及び帳票モデルに応じて認識対象の種別が指定された文字認識処理である。なお、３つの文字認識処理を例として挙げたが、少なくとも一つの文字認識処理を帳票モデルに応じた文字認識処理とすることができる。文字認識結果保存部８は、文字認識部７による文字認識結果を保存する。 The character recognition unit 7 includes a dictionary database 71 and an OCR engine 72. Based on the dictionary data registered in the dictionary database 71, the OCR engine 72 recognizes various information such as characters, numbers, and symbols included in the form image output from the form identifying unit 6. The OCR engine 72 efficiently recognizes various information such as characters included in the form image by the character recognition process according to the form model (form type). Character recognition processing according to the form model is a character recognition process with a character recognition position specified according to the form model, a character recognition process with a character size specified according to the form model, and a recognition target according to the form model. This is a character recognition process in which the type of is specified. Although three character recognition processes are given as examples, at least one character recognition process can be a character recognition process corresponding to a form model. The character recognition result storage unit 8 stores the character recognition result by the character recognition unit 7.

図３は、実施形態に係る帳票処理装置によるモデル定義処理の一例を示すフローチャートである。 FIG. 3 is a flowchart illustrating an example of model definition processing by the form processing apparatus according to the embodiment.

帳票イメージ取り込み部２は、モデル定義モードの実行に対応して、モデル定義の対象となる帳票Ｄｏｃの帳票イメージを取り込み（ＳＴＪ１）、帳票イメージ保存部３に対して帳票Ｄｏｃの帳票イメージを保存する。 In response to execution of the model definition mode, the form image capturing unit 2 captures the form image of the form Doc to be model defined (STJ1), and stores the form image of the form Doc in the form image storage unit 3. .

モデル定義部４は、モデル定義モードの実行に対応して、帳票イメージ保存部３から帳票イメージを受け取り、また、帳票イメージ保存部３の帳票イメージを画面制御部９を介してディスプレイ１０１に表示させる。図６は、ディスプレイに表示される帳票イメージ（イメージ表示領域）の一例を示すとともに、特徴点及び読み取り位置指定等を説明するための図である。 In response to execution of the model definition mode, the model definition unit 4 receives a form image from the form image storage unit 3 and causes the display 101 to display the form image of the form image storage unit 3 via the screen control unit 9. . FIG. 6 is a diagram for explaining an example of a form image (image display area) displayed on the display and for explaining feature points and reading position designation.

モデル定義部４は、帳票イメージに基づきモデル定義を実施する（ＳＴＪ２）。モデル定義部４は、ユーザインタフェース部１０からのユーザー入力に基づき、特徴点としての読み取り定義情報を生成する。また、モデル定義部４は、帳票イメージから、特徴点としての罫線交点情報を検出する。図５Ａは、モデル定義部により生成された読み取り定義情報の一例を示す図である。また、図５Ｂは、モデル定義部により検出された罫線交点情報の一例を示す図である。モデル定義部４は、帳票イメージの位置情報に対応付けられた特徴点としての読み取り定義情報を生成し、帳票イメージの位置情報に対応付けられた特徴点としての罫線交点情報を検出する。 The model definition unit 4 performs model definition based on the form image (STJ2). The model definition unit 4 generates read definition information as a feature point based on a user input from the user interface unit 10. Further, the model definition unit 4 detects ruled line intersection information as a feature point from the form image. FIG. 5A is a diagram illustrating an example of read definition information generated by the model definition unit. FIG. 5B is a diagram illustrating an example of ruled line intersection information detected by the model definition unit. The model definition unit 4 generates read definition information as feature points associated with the position information of the form image, and detects ruled line intersection information as feature points associated with the position information of the form image.

図６は、帳票イメージに対応する罫線交点の一例を示す図である。図５Ｂ及び図６に示すように、帳票イメージから複数の罫線交点が検出され、これら罫線交点には位置情報（Ｘ座標及びＹ座標）が対応付けられている。 FIG. 6 is a diagram illustrating an example of a ruled line intersection corresponding to a form image. As shown in FIGS. 5B and 6, a plurality of ruled line intersections are detected from the form image, and position information (X coordinate and Y coordinate) is associated with the ruled line intersections.

ユーザーは、図６の帳票イメージ（ディスプレイ１０１）を参照しながら、タッチパネル１０２又はマウス１０４等を介して読み取り位置及び文字等の認識対象位置を指定し、モデル定義部４は、位置情報に対応付けられた読み取り位置及び文字等の認識対象位置を検出する。さらに、ユーザーは、図７の定義情報表示領域（ディスプレイ１０１）を参照しながら、キーボード１０３等を介して、読み取り位置に対応する金額、氏名、住所、又は備考欄などの項目名を入力し、さらに、読み取り位置に対応する読み取り属性をプルダウンリストより選択する。例えば、読み取り属性として、数字、氏名、住所、又は文字を選択する。 The user designates a reading position and a recognition target position such as a character via the touch panel 102 or the mouse 104 while referring to the form image (display 101) in FIG. 6, and the model definition unit 4 associates with the position information. The read position and the recognition target position such as characters are detected. Further, the user inputs an item name such as an amount, name, address, or remarks column corresponding to the reading position via the keyboard 103 or the like while referring to the definition information display area (display 101) in FIG. Further, a reading attribute corresponding to the reading position is selected from the pull-down list. For example, numbers, names, addresses, or characters are selected as reading attributes.

モデル定義部４は、帳票イメージの位置情報に対応付けられた罫線交点、項目名及び属性が対応付けられた読み取り定義情報、及び認識対象位置等を含む帳票モデルを生成し、帳票モデルをモデル保存部５へ保存する（ＳＴＪ３）。 The model definition unit 4 generates a form model including ruled line intersections associated with the position information of the form image, read definition information associated with item names and attributes, and recognition target positions, and stores the form model as a model. Stored in part 5 (STJ3).

複数種類の帳票が存在する場合、ＳＴＪ１〜ＳＴＪ３の処理を繰り返し、これら複数種類の帳票に対応する帳票モデルをモデル保存部５へ保存する。 When there are a plurality of types of forms, the processes of STJ1 to STJ3 are repeated, and the form models corresponding to these types of forms are stored in the model storage unit 5.

図４は、実施形態に係る帳票処理装置による帳票識別処理及び文字認識処理の一例を示すフローチャートである。 FIG. 4 is a flowchart illustrating an example of a form identification process and a character recognition process performed by the form processing apparatus according to the embodiment.

帳票イメージ取り込み部２は、帳票識別及び文字認識モードの実行に対応して、識別対象となる帳票Ｄｏｃの帳票イメージを取り込み（ＳＴＪ４）、帳票識別部６へ帳票イメージを入力する。或いは、帳票イメージ取り込み部２は、帳票識別及び文字認識モードの実行に対応して、帳票イメージ保存部３に保存されている帳票イメージを取り出して、帳票識別部６へ帳票イメージを入力する。 The form image capture unit 2 captures the form image of the form Doc to be identified in response to the execution of the form identification and character recognition mode (STJ4), and inputs the form image to the form identification unit 6. Alternatively, the form image capture unit 2 takes out the form image stored in the form image storage unit 3 and inputs the form image to the form identification unit 6 in response to the execution of the form identification and character recognition mode.

帳票識別部６は、帳票イメージを受け取り、モデル保存部５から定義済みの帳票モデルを受け取る。図８Ａ及び図８Ｂは、入力された帳票イメージの一例を示す図である。図９Ａ及び図９Ｂは、複数種類の帳票に対応する各帳票モデル（定義情報）の概念を示す図である。 The form identification unit 6 receives a form image and receives a defined form model from the model storage unit 5. 8A and 8B are diagrams illustrating an example of an input form image. 9A and 9B are diagrams showing the concept of each form model (definition information) corresponding to a plurality of types of forms.

帳票識別部６は、図８Ａ及び図８Ｂに示す帳票イメージの入力特徴点と、図９Ａ及び図９Ｂに示す各帳票モデルにおけるモデル特徴点とを比較し、この帳票イメージに対して最も類似度の高い帳票モデルを特定する。例えば、図１０に示すように、帳票識別部６は、複数種類の帳票に対応する各帳票モデルのモデル特徴点間と帳票イメージの入力特徴点間との成す角度に基づき、各帳票モデルと帳票イメージとの類似度を判定し、類似度の判定結果に基づき帳票イメージに対応する所定の帳票モデルを識別（特定）する（ＳＴＪ５）。例えば、罫線交点に対応するモデル特徴点間と罫線交点に対応する入力特徴点間との成す角度を比較して類似度を判定してもよいし、読み取り定義情報に対応するモデル特徴点間と読み取り定義情報に対応する入力特徴点間との成す角度とを比較して類似度を判定してもよい。また、罫線交点及び読み取り定義情報の全てに対応するモデル特徴点間と罫線交点及び読み取り定義情報の全てに対応する入力特徴点間との成す角度とを比較して類似度を判定してもよい。 The form identification unit 6 compares the input feature point of the form image shown in FIGS. 8A and 8B with the model feature point in each form model shown in FIGS. 9A and 9B, and has the highest similarity to this form image. Identify high form models. For example, as shown in FIG. 10, the form identifying unit 6 uses each form model and form based on the angle formed between the model feature points of each form model corresponding to a plurality of types of forms and the input feature points of the form image. The similarity with the image is determined, and a predetermined form model corresponding to the form image is identified (specified) based on the determination result of the similarity (STJ5). For example, the degree of similarity may be determined by comparing the angle formed between the model feature points corresponding to the ruled line intersection and the input feature points corresponding to the ruled line intersection, or between the model feature points corresponding to the reading definition information The similarity may be determined by comparing the angle formed between the input feature points corresponding to the reading definition information. Further, the similarity may be determined by comparing the angle formed between the model feature points corresponding to all of the ruled line intersection and the read definition information and the input feature point corresponding to all of the ruled line intersection and the read definition information. .

文字認識部７は、ＳＴＪ５で識別された所定の帳票モデルに対応付けられた文字等の認識対象位置情報に基づき、帳票イメージに含まれる情報を認識する（ＳＴＪ６）。文字認識部７は、ＳＴＪ６で得られた文字認識結果を文字認識結果保存部８に保存する（ＳＴＪ７）。 The character recognition unit 7 recognizes information included in the form image based on recognition target position information such as characters associated with the predetermined form model identified in STJ5 (STJ6). The character recognition unit 7 stores the character recognition result obtained in STJ6 in the character recognition result storage unit 8 (STJ7).

ここで、特徴点の検出、及び特徴点間の成す角度に基づく類似度判定の詳細について説明する。 Here, details of detection of feature points and determination of similarity based on angles formed between feature points will be described.

帳票識別部６は、帳票モデルに含まれる読み取り位置、及び読み取り位置に対応付けられた読み取り属性に基づき、帳票イメージ（未識別）に対して文字認識を実行し、読み取り属性に該当する情報が認識された場合に、この認識結果を特徴点として検出する。例えば、読み取り属性が数字であれば、読み取り位置から、数字が認識された場合に、この認識結果を特徴点として検出する。また、帳票識別部６は、帳票イメージ（未識別）に対して画像処理を実行し、罫線交点を特徴点として検出する。 The form identifying unit 6 performs character recognition on the form image (unidentified) based on the reading position included in the form model and the reading attribute associated with the reading position, and information corresponding to the reading attribute is recognized. In such a case, the recognition result is detected as a feature point. For example, if the reading attribute is a number, when the number is recognized from the reading position, the recognition result is detected as a feature point. In addition, the form identification unit 6 performs image processing on the form image (unidentified) and detects ruled line intersection points as feature points.

帳票識別部６は、各帳票モデルからのモデル特徴点と、帳票イメージ（未識別）からの入力特徴点とを比較照合し、帳票イメージに対応する所定の帳票モデルを識別する。例えば、帳票識別部６は、位置情報に対応付けられた各モデル特徴点と、位置情報に対応付けられた各入力特徴点に基づき各モデル特徴点と各入力特徴点との対応関係を判定し、対応関係にあるモデル特徴点間と入力特徴点間との成す角度とに基づき各帳票モデルと帳票イメージとの類似度を判定する。 The form identifying unit 6 compares the model feature points from each form model with the input feature points from the form image (unidentified), and identifies a predetermined form model corresponding to the form image. For example, the form identifying unit 6 determines the correspondence between each model feature point and each input feature point based on each model feature point associated with the position information and each input feature point associated with the position information. The similarity between each form model and the form image is determined based on the angle formed between the model feature points and the input feature points that are in a correspondence relationship.

例えば、帳票識別部６は、各モデル特徴点の位置情報に基づき各モデル特徴点を順にソートし、各入力特徴点の位置情報に基づき各入力特徴点を順にソートし、各モデル特徴点と各入力特徴点の対応関係を判定する。 For example, the form identifying unit 6 sorts each model feature point sequentially based on the position information of each model feature point, sorts each input feature point sequentially based on the position information of each input feature point, A correspondence relationship between input feature points is determined.

例えば、帳票識別部６は、座標情報に基づきモデル特徴点を順にソートし、同様に、座標情報に基づき入力特徴点を順にソートし、順番に辿った場合の、モデル特徴点と入力特徴点との成す角度を内積により計算する。角度差が小さければ加点、大きければ減点することで帳票モデルと帳票イメージの類似度を計算し、類似度が最も高い帳票モデルを識別結果として採用する。 For example, the form identifying unit 6 sorts the model feature points sequentially based on the coordinate information, and similarly sorts the input feature points sequentially based on the coordinate information, and the model feature points and the input feature points when traced in order. The angle formed by is calculated by the inner product. The degree of similarity between the form model and the form image is calculated by adding points if the angle difference is small and subtracting if the angle difference is large, and the form model having the highest degree of similarity is adopted as the identification result.

例えば、帳票モデル毎の類似度Ｐは、以下の式により算出される。 For example, the similarity P for each form model is calculated by the following equation.

式中の定義は、以下の通り。
Ｖａ：入力特徴点が成すベクトル
Ｖｂ：モデル特徴点が成すベクトル
θ：内積の閾値
Ｓ：ＶａとＶｂの内積が閾値よりも大きい場合に１、そうでない場合に０
Ｐ：Ｓの合計値をベクトルの数ｎで割った値
Ｐの取り得る範囲は０〜１ The definition in the formula is as follows.
Va: vector formed by input feature points Vb: vector formed by model feature points θ: inner product threshold S: 1 if inner product of Va and Vb is larger than threshold, 0 otherwise
P: A value obtained by dividing the total value of S by the number n of vectors.

本実施形態の帳票処理装置によれば、以下の作用効果が得られる。
例えば、帳票に固有の文字列が存在しない場合でも、罫線が存在すれば識別することができる。或いは、帳票に固有の文字列が存在しない場合でも、数字等の文字認識対象が存在すれば識別することができる。或いは、帳票に固有の文字列が存在しない場合でも、罫線及び数字等の文字認識対象が存在すれば識別することができる。 According to the form processing apparatus of the present embodiment, the following operational effects can be obtained.
For example, even if there is no unique character string in the form, it can be identified if there is a ruled line. Alternatively, even if there is no unique character string in the form, it can be identified if there is a character recognition target such as a number. Alternatively, even if there is no unique character string in the form, it can be identified if there are character recognition targets such as ruled lines and numbers.

また、帳票の識別に文字認識処理を実行せず、複数種類の帳票に対応する各帳票モデルのモデル特徴点間と入力特徴点間との成す角度と閾値との比較に基づき類似度を求めるため、帳票モデルの数が膨大であっても処理時間の影響を受け難い。例えば、対応する特徴点の内積計算により類似度を求めることにより、帳票モデルの数が膨大であっても処理時間の影響を受け難くすることができる。 In addition, character recognition processing is not executed for form identification, and the similarity is obtained based on a comparison between an angle formed between model feature points and input feature points of each form model corresponding to a plurality of types of forms and a threshold value. Even if the number of form models is enormous, it is hardly affected by the processing time. For example, by obtaining the similarity by calculating the inner product of the corresponding feature points, it is possible to make it less susceptible to the processing time even if the number of form models is enormous.

また、複数種類の帳票に対応する各帳票モデルのモデル特徴点間と入力特徴点間との成す角度と閾値との比較に基づき類似度を計算するため、伸縮や印字ずれによる影響に頑健である。また内積計算によって伸縮や印字ずれを求めることができるため、帳票識別後の文字認識実行時に読取位置を補正して伸縮や印字ずれによる誤読を防ぐことができる。 In addition, the similarity is calculated based on a comparison between the threshold value and the angle between the model feature points and the input feature points of each form model corresponding to multiple types of forms, so it is robust against the effects of expansion and contraction and printing misalignment. . Further, since expansion / contraction and printing misalignment can be obtained by inner product calculation, the reading position can be corrected when character recognition is performed after form identification to prevent misreading due to expansion / contraction or printing misalignment.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。
以下、本願の出願当初の特許請求の範囲に記載された発明を付記する。
［Ｃ１］
識別対象帳票の帳票イメージを入力する入力部と、
前記帳票イメージの入力から入力特徴点を検出する検出部と、
複数種類の帳票に対応する各帳票モデルのモデル特徴点間と前記入力特徴点間との成す角度に基づき各帳票モデルと前記帳票イメージとの類似度を判定し、類似度の判定結果に基づき前記帳票イメージに対応する所定の帳票モデルを識別する識別部と、
を備える帳票処理装置。
［Ｃ２］
前記識別部は、位置情報に対応付けられた各モデル特徴点と位置情報に対応付けられた各入力特徴点に基づき各モデル特徴点と各入力特徴点との対応関係を判定し、対応関係にある前記モデル特徴点間と前記入力特徴点間との成す角度に基づき各帳票モデルと前記帳票イメージとの類似度を判定する［Ｃ１］の帳票処理装置。
［Ｃ３］
前記識別部は、各モデル特徴点の位置情報に基づき各モデル特徴点を順にソートし、各入力特徴点の位置情報に基づき各入力特徴点を順にソートし、各モデル特徴点と各入力特徴点の対応関係を判定する［Ｃ２］の帳票処理装置。
［Ｃ４］
前記モデル特徴点及び前記入力特徴点は罫線の交点である［Ｃ１］乃至［Ｃ３］の何れか１つの帳票処理装置。
［Ｃ５］
前記所定の帳票モデルに対応付けられた認識対象位置情報に基づき、前記帳票イメージに含まれる情報を認識する認識部を備える［Ｃ１］乃至［Ｃ４］の何れか１つの帳票処理装置。
［Ｃ６］
前記認識部は、文字、数字、及び記号のうちの少なくとも一つを認識対象の情報として、前記帳票イメージに含まれる情報を認識する［Ｃ５］の帳票処理装置。
［Ｃ７］
識別対象帳票の帳票イメージを入力し、
前記帳票イメージの入力から入力特徴点を検出し、
複数種類の帳票に対応する各帳票モデルのモデル特徴点間と前記入力特徴点間との成す角度に基づき各帳票モデルと前記帳票イメージとの類似度を判定し、類似度の判定結果に基づき前記帳票イメージに対応する所定の帳票モデルを識別する帳票処理方法。 Although several embodiments of the present invention have been described, these embodiments are presented by way of example and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.
Hereinafter, the invention described in the scope of claims at the beginning of the application of the present application will be added.
[C1]
An input part for inputting a form image of the form to be identified;
A detection unit for detecting an input feature point from the input of the form image;
The similarity between each form model and the form image is determined based on the angle formed between the model feature points of each form model corresponding to a plurality of forms and the input feature points, and based on the determination result of the similarity An identification unit for identifying a predetermined form model corresponding to the form image;
A form processing apparatus comprising:
[C2]
The identification unit determines a correspondence between each model feature point and each input feature point based on each model feature point associated with the position information and each input feature point associated with the position information, The form processing apparatus of [C1] that determines the similarity between each form model and the form image based on an angle formed between the model feature points and the input feature points.
[C3]
The identification unit sequentially sorts each model feature point based on position information of each model feature point, sorts each input feature point sequentially based on the position information of each input feature point, and each model feature point and each input feature point [C2] form processing apparatus for determining the correspondence relationship.
[C4]
The form processing device according to any one of [C1] to [C3], wherein the model feature point and the input feature point are intersections of ruled lines.
[C5]
The form processing apparatus according to any one of [C1] to [C4], comprising a recognition unit that recognizes information included in the form image based on recognition target position information associated with the predetermined form model.
[C6]
[C5] The form processing apparatus according to [C5], wherein the recognition unit recognizes information included in the form image using at least one of characters, numbers, and symbols as information to be recognized.
[C7]
Enter the form image of the form to be identified,
An input feature point is detected from the input of the form image,
The similarity between each form model and the form image is determined based on the angle formed between the model feature points of each form model corresponding to a plurality of forms and the input feature points, and based on the determination result of the similarity A form processing method for identifying a predetermined form model corresponding to a form image.

１…帳票処理装置
２…スキャナ
３…帳票イメージ保存部
４…モデル定義部
５…モデル保存部
６…帳票識別部
７…文字認識部
８…文字認識結果保存部
９…画面制御部
１０…ユーザインタフェース部
２１…スキャナ
２２…帳票イメージ書込部
７１…辞書データベース
７２…エンジン
９１…帳票イメージ入出力部
９２…画面生成部
１０１…ディスプレイ
１０２…タッチパネル
１０３…キーボード
１０４…マウス DESCRIPTION OF SYMBOLS 1 ... Form processing apparatus 2 ... Scanner 3 ... Form image storage part 4 ... Model definition part 5 ... Model storage part 6 ... Form identification part 7 ... Character recognition part 8 ... Character recognition result storage part 9 ... Screen control part 10 ... User interface Unit 21 ... Scanner 22 ... Form image writing unit 71 ... Dictionary database 72 ... Engine 91 ... Form image input / output unit 92 ... Screen generation unit 101 ... Display 102 ... Touch panel 103 ... Keyboard 104 ... Mouse

Claims

An input unit for inputting a model target form image and an identification target form image corresponding to a plurality of types of forms;
A model definition unit that defines each form model corresponding to the plurality of types of forms based on the model target form image;
A detection unit for detecting an input feature point from the input of the identification target form image;
The similarity between each form model and the identification target form image is determined based on the angle formed between the model feature points of each form model and the input feature points, and the identification target form image is determined based on the similarity determination result. An identification unit for identifying a corresponding predetermined form model;
Equipped with a,
The identification unit is
In determining the correspondence between each model feature point and each input feature point based on each model feature point associated with the location information and each input feature point associated with the location information, the location information of each model feature point The model feature points are sorted in order based on the input feature points, the input feature points are sorted in order based on the position information of the input feature points, the correspondence between each model feature point and each input feature point is determined,
A form processing apparatus that determines the degree of similarity between each form model and the form image to be identified based on an angle formed between the model feature points and the input feature points in correspondence .

The form processing apparatus according to claim 1 , wherein the model feature point and the input feature point are intersections of ruled lines.

It said predetermined based on the recognized position information associated with the form model, the identification document processing apparatus according to claim 1 or 2 comprising a recognition unit recognizes the information included in the target document image.

The form processing apparatus according to claim 3 , wherein the recognition unit recognizes information included in the identification target form image using at least one of characters, numbers, and symbols as information to be recognized.

The form processing apparatus according to claim 1, further comprising an interface for designating a reading position for the model target form image in response to an input from a user.

The form processing apparatus according to claim 5, wherein the interface is a touch panel or a mouse.

The model definition unit is configured to specify the reading position, the reading attribute, and the model feature point based on designation of a reading position and a reading attribute for the model target form image and detection of the model feature point from the model target form image. one of form processing apparatus of claims 1 to 5 defining the form model that associates.

Enter the model target form image corresponding to multiple types of forms,
Define each form model corresponding to the plurality of types of forms based on the model target form image,
Enter the form image to be identified,
Detecting input feature points from the input of the identification target form image,
Determining the similarity between each form model and the form image to be identified based on the angle formed between the model feature points of each form model corresponding to a plurality of types of forms and the input feature points ;
A method for identifying a predetermined form model corresponding to the identification target form image based on a similarity determination result ,
Determining the similarity is:
In determining the correspondence between each model feature point and each input feature point based on each model feature point associated with the location information and each input feature point associated with the location information, the location information of each model feature point Sorting each model feature point in order, sorting each input feature point sequentially based on the position information of each input feature point, determining the correspondence between each model feature point and each input feature point;
A form processing method comprising: determining a similarity between each form model and the form image to be identified based on an angle formed between the model feature points and the input feature points in a correspondence relationship .