JP2005128865A - Data recognition device and program

Info

Publication number: JP2005128865A
Application number: JP2003364812A
Authority: JP
Inventors: Junji Yamamoto; 淳次山本
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2003-10-24
Filing date: 2003-10-24
Publication date: 2005-05-19

Abstract

PROBLEM TO BE SOLVED: To provide a data recognition device that enhances a recognition rate for data recognized from slips or forms and that is capable of accurately recognizing data written on slips or forms, by correcting misrecognition fields. SOLUTION: Slip image data are recognized and computational relationships included in recognized detail lines are predefined by a definition tool 12. Defined computations are performed on the result of recognition on the field of each detail line and the recognition result is compared with the computation result to specify the field where a misrecognition has occurred. The defined computations are performed on the result of recognition on the field supposed to have been correctly recognized, so as to calculate a correct value for the field where the misrecognition is supposed to have occurred. The value for the field where the misrecognition is supposed to have occurred is corrected to provide the correct value. COPYRIGHT: (C)2005,JPO&NCIPI

Description

The present invention relates to a data recognition device and program that read characters, numerals, and symbols from the light reflected off an illuminated document and correct numerical values that belong to an arithmetic expression.

In recent years, technology for capturing characters written on paper into a computer as character codes has advanced dramatically. Typically, an image scanner reads the characters and figures on the paper, and dedicated software converts the character portions of the resulting image data into character codes. This dedicated software is called OCR software and is widely used in both business and personal settings. The quality of OCR software is judged by its recognition rate.

There are two broad approaches to improving the recognition rate of OCR software. One is to improve the recognition rate of each individual character; the other is to derive more probable recognition candidates from the surrounding context and field attributes. A typical example of the latter is address recognition. For example, as shown in FIG. 24, a customer database or product database is held in advance, and a more probable result is derived by comparing the recognized character string against that database.

There is also a method that raises the recognition rate by exploiting relationships among multiple fields. Specifically, the certainty of the recognition result for the telephone number field is compared with that for the address field, and the less certain result is corrected on the basis of the more certain one. This improves the recognition rate of the OCR software.

A further known method detects misrecognition in the detail lines of a slip by performing a computation on recognition fields that are arithmetically related and comparing the computation result with the recognition result (see, for example, Patent Document 1). As shown in FIG. 25, this method computes quantity × unit price and compares the result with the recognized amount field to detect a misrecognition. It also sums the amount fields and compares the sum with the recognized total amount, and then determines the misrecognized field from the detection results. Specifically, in the pineapple row it detects "2 (quantity) × 300 (unit price) ≠ 500", and for the total amount it detects "500 + 100 + 500 + 2000 + 6000 ≠ 9200". From these results it determines that the field most likely to have been misrecognized is the pineapple amount. In this way, the fields whose recognition results are more reliable and the fields that were probably misrecognized can both be identified accurately.

Patent Document 1: JP 2002-56354 A
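
The detection step described above can be sketched in a few lines. This is a minimal illustration only, not the method of Patent Document 1 as implemented: the `DetailLine` type, the function name, and every row other than the pineapple row (names, quantities, unit prices, and row order) are hypothetical values chosen so that the amounts match the figures quoted in the text.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class DetailLine:
    name: str
    quantity: int
    unit_price: int
    amount: int  # value as recognized by OCR


def detect_misrecognition(lines: List[DetailLine], total: int) -> Tuple[List[str], bool]:
    """Return the names of rows failing quantity x unit price == amount,
    and whether the sum of the amounts disagrees with the recognized total."""
    row_errors = [ln.name for ln in lines if ln.quantity * ln.unit_price != ln.amount]
    total_error = sum(ln.amount for ln in lines) != total
    return row_errors, total_error


# Only the pineapple row (2 x 300, recognized amount 500) and the amount
# column come from the text; the other rows are hypothetical filler.
sample_lines = [
    DetailLine("pineapple", 2, 300, 500),
    DetailLine("item B", 1, 100, 100),
    DetailLine("item C", 5, 100, 500),
    DetailLine("item D", 2, 1000, 2000),
    DetailLine("item E", 3, 2000, 6000),
]
sample_total = 9200
```

Calling `detect_misrecognition(sample_lines, sample_total)` flags the pineapple row and reports that the total check also fails, which is the pair of signals the text uses to single out the pineapple amount.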

However, the conventional optical character reader described above (for example, Patent Document 1) can detect a misrecognition but cannot correct the misrecognized field. Whenever the reader detects a misrecognition, an operator therefore has to make the correction manually. When there are many places to correct, the work becomes laborious, and erroneous corrections caused by human factors such as operator error are not uncommon.

An object of the present invention is to provide a data recognition device and program that improve the recognition rate of data recognized from slips and forms and, by correcting misrecognized fields, recognize the data written on slips and forms accurately.

To solve the above problem, the invention according to claim 1 is a data recognition device that reads information entered on a form as image information and performs character recognition on that image information, the device comprising:
a definition file that stores setting information defining the fields contained in the image information;
recognition means for recognizing the fields contained in the image information based on the setting information;
input means for inputting an arithmetic relationship between the recognized fields;
discrimination means for performing a computation between the recognized fields based on the arithmetic relationship, comparing the recognition results of the fields with the computation result, and determining whether a misrecognition has occurred; and
computation means for calculating, when the recognition result of a field is determined to contain a misrecognition, the value of that field from the recognition results of the fields determined to be free of misrecognition.

According to the invention of claim 1 or 4, the information entered on a form is read as image information, the fields contained in the image information are recognized based on the setting information, and the arithmetic relationship between the recognized fields is input. The computation between the recognized fields is then performed according to that relationship, and the recognition results are compared with the computation results to determine whether any field was misrecognized. For a field determined to be misrecognized, the computation is performed on the recognition results of the fields determined to be free of misrecognition, which yields the correct value. As a result, fields detected as misrecognized can be corrected easily and accurately.
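
The correction step can be illustrated for the relationship amount = quantity × unit price. The sketch below assumes the suspect field has already been identified; the function name, the dictionary keys, and the choice to return `None` when no exact integer value exists are assumptions made for illustration and are not taken from the patent.

```python
from typing import Dict, Optional


def recompute_field(row: Dict[str, int], suspect: str) -> Optional[int]:
    """Recompute the suspect field of one detail line from the other two,
    assuming amount = quantity * unit_price.

    Returns None when the trusted values do not yield an exact integer."""
    q, p, a = row["quantity"], row["unit_price"], row["amount"]
    if suspect == "amount":
        return q * p
    if suspect == "quantity":
        return a // p if p != 0 and a % p == 0 else None
    if suspect == "unit_price":
        return a // q if q != 0 and a % q == 0 else None
    raise ValueError(f"unknown field: {suspect}")
```

For the pineapple row quoted earlier (quantity 2, unit price 300, recognized amount 500), `recompute_field(row, "amount")` gives 600, which also makes the amounts sum to the recognized total of 9200.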

According to the invention of claim 2, the input arithmetic relationship is stored in the storage means as calculation information, so forms of the same format can be recognized using the arithmetic relationship defined previously. This greatly improves work efficiency.

According to the invention of claim 3, the fields whose recognition result differs from the computation result are stored together with their recognition certainty, and whether a recognized field is misrecognized is determined on the basis of that certainty. Correctly recognized fields and misrecognized fields can therefore be distinguished with high accuracy. This improves the recognition rate of the data recognition device and allows an accurate value to be calculated for each misrecognized field.
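
One simple way to use recognition certainty is to treat the least certain field in a failing relationship as the misrecognized one. The sketch below takes that reading as an assumption; the text above does not specify the decision rule, and the function name and confidence values are hypothetical.

```python
from typing import Dict


def pick_suspect(confidences: Dict[str, float]) -> str:
    """Among the fields of a relationship that failed verification,
    return the one with the lowest recognition certainty."""
    if not confidences:
        raise ValueError("no fields to choose from")
    return min(confidences, key=confidences.get)


# Hypothetical certainties for one detail line that failed the check.
example = {"quantity": 0.98, "unit_price": 0.95, "amount": 0.60}
```

With these example values `pick_suspect(example)` selects the amount field, which would then be recomputed from the quantity and unit price.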

The best mode for carrying out the present invention is described in detail below with reference to the drawings. The scope of the invention is not limited to the illustrated examples.

First, the configuration of the present embodiment is described.
FIG. 1 shows the schematic configuration of the data recognition device 1 of the present embodiment. As shown in FIG. 1, the data recognition device 1 comprises a reading device 11, a definition tool 12, a definition file 13, an OCR recognition engine 14, a business application 15, a storage device 16, an input device 17, a display device 18, and so on.

The reading device 11 acquires a slip or form as image data and outputs the acquired image data to the definition tool 12 and the OCR recognition engine 14. FIG. 2 is a block diagram of the reading device 11. As shown in FIG. 2, the reading device 11 comprises a reading unit 11a, an image processing unit 11b, a memory unit 11c, a scanner control unit 11d, and an interface unit 11e.

The reading unit 11a includes a light source unit 111 that illuminates the slip, a CCD (Charge Coupled Device) 112 that reads the light reflected from the slip, and an A/D conversion unit 113 that converts the image signal read by the CCD 112 from analog to digital. The reading unit 11a illuminates the slip with light from the light source unit 111 and detects the reflected light with the CCD 112, thereby acquiring the image data of the slip.

The image processing unit 11b applies image processing such as binarization, smoothing, and noise removal to the image data read by the reading unit 11a. The memory unit 11c stores the slip image data processed by the image processing unit 11b.

The scanner control unit 11d reads the various control programs stored in the storage device 16, loads them into RAM, and controls each part of the reading device 11 according to those programs. Under the control of the scanner control unit 11d, the interface unit 11e outputs the slip image data stored in the memory unit 11c to the definition tool 12, the OCR recognition engine 14, and the display device 18.

Returning to FIG. 1, the definition tool 12 is used to define setting information such as the positions of the recognition fields scattered over the image data of the read slip, the number and type of characters in each recognition field, and the arithmetic relationships between recognition fields. Specifically, the definition tool 12 reads various processing programs from the storage device 16 and executes the main process, image reading process, field setting process, and calculation setting process. Each process is described in detail later.

The information defined by the definition tool 12 is described below with reference to FIGS. 3 and 4. FIG. 3(a) shows a setting screen 181 for setting the position, number of characters, and character type of each recognition field on the read slip. As shown in FIG. 3(a), the upper part of the setting screen 181 contains, for example, a field 181a for the customer name, a field 181b for the address, and a field 181c for the telephone number. When the extent of a field (for example, a rectangular area) is specified with a mouse or similar device of the input device 17, the definition tool 12 acquires the position information of the specified rectangle (for example, its upper-left and lower-right coordinates) and sets the rectangle as a recognition field.

When the rectangular area of a field has been specified, the setting screen 182 shown in FIG. 3(b) is displayed, and the number of characters and character type of the corresponding recognition field are set on it. That is, the number of characters and character type entered on the setting screen 182 are acquired in association with the specified field extent and stored in the definition file 13 as setting information.
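
The setting information gathered on screens 181 and 182 amounts to a rectangle plus character constraints per field. The following is a minimal sketch of such a record; the class name, attribute names, and the normalization of the two mouse corners are assumptions for illustration, and the actual layout of the definition file 13 is described later in the text.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class FieldDefinition:
    name: str
    left: int
    top: int
    right: int
    bottom: int
    max_chars: int
    char_type: str  # e.g. "numeric" or "kanji"; the labels are hypothetical


def define_field(name: str, corner_a: Tuple[int, int], corner_b: Tuple[int, int],
                 max_chars: int, char_type: str) -> FieldDefinition:
    """Build a field definition from two opposite corners of a dragged
    rectangle, normalizing them to upper-left and lower-right."""
    (x1, y1), (x2, y2) = corner_a, corner_b
    return FieldDefinition(name, min(x1, x2), min(y1, y2),
                           max(x1, x2), max(y1, y2), max_chars, char_type)
```

Normalizing the corners means the operator can drag the rectangle in any direction and still obtain the same stored coordinates.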

The lower part of the setting screen 181 contains a field 181d for the product name, a field 181e for the quantity, a field 181f for the unit price, a field 181g for the amount, and a field 181h for the total amount. For fields 181d to 181h as well, the position, number of characters, and character type of each recognition field are set on the setting screens 181 and 182 in the same way.

In addition, arithmetic relationships hold among the fields 181d to 181h shown in the lower part of the setting screen 181, and these relationships are defined with the definition tool 12. The arithmetic relationships defined with the definition tool 12 are described with reference to FIG. 4.

FIG. 4(a) shows an example of the data layout of a slip that has two kinds of arithmetic relationship. The first relationship, shown in FIG. 4(b), is
amount field f(4) = quantity field f(2) × unit price field f(3).
As calculation information defining "f(4)·1 = f(2)·1 × f(3)·1", this relationship is converted into a data format that can be stored in the definition file 13 (described later) and is stored there. For example, the calculation information is converted into a stack format using reverse Polish notation, as indicated by data 181i in FIG. 5(b). The data format is not limited to this.
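
Reverse Polish notation places each operator after its operands, so an expression can be evaluated with a single stack. The sketch below converts a flat infix token list to RPN and evaluates it. It is illustrative only: the token spelling `f(2).1`, the helper names, and the restriction to the four basic operators without parentheses are assumptions, and the actual layout of data 181i is not described in the text.

```python
import operator
from typing import Dict, List

OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul, "/": operator.truediv}
PRECEDENCE = {"+": 1, "-": 1, "*": 2, "/": 2}


def to_rpn(tokens: List[str]) -> List[str]:
    """Convert infix tokens (left-associative operators, no parentheses) to RPN."""
    output, pending = [], []
    for tok in tokens:
        if tok in OPS:
            # Emit pending operators that bind at least as tightly.
            while pending and PRECEDENCE[pending[-1]] >= PRECEDENCE[tok]:
                output.append(pending.pop())
            pending.append(tok)
        else:
            output.append(tok)
    while pending:
        output.append(pending.pop())
    return output


def eval_rpn(rpn: List[str], values: Dict[str, float]) -> float:
    """Evaluate an RPN token list, looking operands up in `values`."""
    stack = []
    for tok in rpn:
        if tok in OPS:
            right = stack.pop()
            left = stack.pop()
            stack.append(OPS[tok](left, right))
        else:
            stack.append(values[tok])
    return stack.pop()
```

For the right-hand side of the first relationship, `to_rpn(["f(2).1", "*", "f(3).1"])` yields `["f(2).1", "f(3).1", "*"]`, and evaluating it with quantity 2 and unit price 300 gives 600, the value to compare with the recognized amount f(4)·1.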

The second relationship, shown in FIG. 4(c), is
total field sum(1) = amount field f(4)·1 + amount field f(4)·2 + amount field f(4)·3 + amount field f(4)·4 + amount field f(4)·5.
As calculation information defining "sum(1) = f(4)·1 + f(4)·2 + f(4)·3 + f(4)·4 + f(4)·5", this relationship is converted into a data format that can be stored in the definition file 13 (described later) and is stored there.
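
Taken together, the two relationships make it possible to narrow a failure down to a single field: if exactly one row fails the product check, and substituting the computed product for its amount makes the total check pass, the amount of that row is the likely culprit. The sketch below encodes that reasoning under stated assumptions. The function name and the single-error restriction are illustrative choices, and the row order in the example is hypothetical; the amounts, the pineapple product 2 × 300, and the total 9200 are the figures quoted in the background section.

```python
from typing import List, Optional


def locate_wrong_amount(products: List[int], amounts: List[int], total: int) -> Optional[int]:
    """Return the index of the row whose recognized amount is probably wrong.

    products[i] is quantity * unit price for row i, amounts[i] the recognized
    amount, total the recognized total. Returns None unless exactly one row
    fails the product check and replacing its amount repairs the sum."""
    bad_rows = [i for i, (p, a) in enumerate(zip(products, amounts)) if p != a]
    if len(bad_rows) != 1 or sum(amounts) == total:
        return None
    i = bad_rows[0]
    if sum(amounts) - amounts[i] + products[i] == total:
        return i
    return None


# Row 0 stands for the pineapple row; its position is an assumption.
example_products = [600, 100, 500, 2000, 6000]
example_amounts = [500, 100, 500, 2000, 6000]
example_total = 9200
```

Here the amounts sum to 9100, and replacing the first amount with 600 gives 9200, so `locate_wrong_amount` returns index 0.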

The definition file 13 stores the setting information and calculation information defined with the definition tool 12 described above. Examples of the data layout of each kind of information are described with reference to FIGS. 5 and 6. As shown in FIGS. 5 and 6, the definition file 13 builds hierarchically linked structures and stores each piece of information in them.

As shown in FIG. 5, a slip information structure 12a is built as the top-level structure. The slip information structure 12a stores basic information, header information, and detail information. Below the basic information, a basic information structure 12b is built, which stores the width and height of the read slip.

Below the header information, a header information structure 12c is built. The header information structure 12c stores the issuer company code, issuer company code display information, issuer company name, display information, issuer telephone number, issuer telephone number display information, issuer fax number, and issuer fax number display information.

As shown in FIG. 6, corresponding display information structures 12i to 12l are built below the items of the header information structure 12c. Each of the display information structures 12i to 12l stores, as information indicating where the corresponding field is shown on the display screen, the lower-left coordinate (horizontal), lower-left coordinate (vertical), lower-right coordinate (horizontal), and lower-right coordinate (vertical).

Returning to FIG. 5, a detail information structure 12d is built below the detail information. The detail information structure 12d stores the number of detail lines, the number of fields, field information, field calculation information, the number of total fields, total field information, and total field calculation information.

Below the detail information structure 12d, the following structures are built: a field information structure 12m (FIG. 6) corresponding to the field information, a field calculation information structure 12e (FIG. 5) corresponding to the field calculation information, a total field information structure 12q (FIG. 6) corresponding to the total field information, and a total field calculation information structure 12g (FIG. 6) corresponding to the total field calculation information.

As shown in FIG. 6, the field information structure 12m stores the number of characters, character type, field display information, recognition certainty, recognition candidates, recognition result, and next field information. A display information structure 12n is built below the field display information, and a field information structure 12o is built below the next field information.

Similarly, the total field information structure 12q stores the number of characters, character type, field display information, recognition certainty, recognition candidates, recognition result, and next total field information. A display information structure 12r is built below the field display information, and a total field information structure 12s is built below the next total field information.

As shown in FIG. 5, the field calculation information structure 12e stores calculation information and next calculation information, and the total field calculation information structure 12g stores total calculation information and next total calculation information.
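
Because each field information structure holds a reference to the next one, the fields of a detail line form a singly linked list. The sketch below models that chaining; the class name, attribute names, and helper functions are hypothetical and only loosely mirror the items listed for structure 12m.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass
class FieldInfo:
    num_chars: int
    char_type: str
    confidence: float = 0.0                      # recognition certainty
    candidates: List[str] = field(default_factory=list)
    result: str = ""                             # recognition result
    next_field: Optional["FieldInfo"] = None     # "next field information"


def build_chain(infos: List[FieldInfo]) -> Optional[FieldInfo]:
    """Link the given structures in order and return the head of the chain."""
    head = None
    for info in reversed(infos):
        info.next_field = head
        head = info
    return head


def iter_fields(head: Optional[FieldInfo]) -> Iterator[FieldInfo]:
    """Walk the chain from the head until the next reference is empty."""
    node = head
    while node is not None:
        yield node
        node = node.next_field
```

A consumer such as the verification step can then traverse a detail line with `for f in iter_fields(head)` without knowing the number of fields in advance.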

Returning to FIG. 1, the OCR recognition engine 14 comprises a data processing unit 14a, a data management unit 14b, and so on. It acquires the information defined with the definition tool 12 from the definition file 13, performs the various recognition processes on the slip image data input from the reading device 11, and manages the processed information.

The data processing unit 14a acquires the slip image data from the reading device 11, recognizes the fields in that image data based on the information defined with the definition tool 12, and stores the recognized fields in the data management unit 14b. The data processing unit 14a then acquires various programs from the storage device 16 and executes the main process, verification process, misrecognized-field detection process, and misrecognized-field correction process on the information stored in the data management unit 14b. Each process is described in detail later.

The data management unit 14b classifies the fields recognized by the data processing unit 14a into field information structures and stores them, and it also stores and manages, in the respective structures, the information generated by each process executed by the data processing unit 14a. Examples of the data layout of the information stored in the data management unit 14b are described with reference to FIGS. 7 and 8. As shown in FIGS. 7 and 8, the data management unit 14b stores each piece of information in hierarchically linked structures.

FIG. 7 shows an example of the data layout of the information about the fields recognized from the slip image data. As shown in FIG. 7, a slip information structure 14c is built as the top-level structure and stores basic information, header information, and detail information. Below the slip information structure 14c, a detail information structure 14d is built corresponding to the detail information. The detail information structure 14d stores the number of detail lines, the number of fields, field information, field calculation information, the number of total fields, total field information, and total field calculation information.

Below the detail information structure 14d, a field information structure 14e is built corresponding to the field information. The field information structure 14e stores the number of characters, character type, field display information, recognition certainty, recognition candidates, recognition result, and next field information. A field calculation information structure 14h is built corresponding to the field calculation information and stores calculation information and next calculation information.

FIG. 8 shows an example of the data layout of the data management information used to identify the misrecognized fields among the recognized fields. Verification management information 14j is built as the top-level structure, and a verification management structure 14k is built below it. The verification management structure 14k stores abnormal field information and normal field information.

Below the verification management structure 14k, an abnormal field management structure 14l is built corresponding to the abnormal field information. The abnormal field management structure 14l stores invalid field information and the next management structure. When invalid field information is stored in the abnormal field management structure 14l as a result of the verification process described later, it points to the field information structure (FIG. 7) of the field whose verification result was abnormal.

Likewise, below the verification management structure 14k, a normal field management structure 14h is built corresponding to the normal field information. The normal field management structure 14h stores normal field information and the next management structure. When normal field information is stored in the normal field management structure 14h as a result of the verification process described later, it points to the field information structure (FIG. 7) of the field whose verification result was normal.
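
The verification management structure keeps two collections of references, one to fields that failed verification and one to fields that passed, with each entry pointing back at the field information structure rather than copying it. The sketch below shows that idea with Python lists in place of the chained management structures described above; all names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class RecognizedField:
    label: str
    result: str


@dataclass
class VerificationManagement:
    abnormal: List[RecognizedField] = field(default_factory=list)
    normal: List[RecognizedField] = field(default_factory=list)

    def record(self, fld: RecognizedField, passed: bool) -> None:
        """File a reference to the field under normal or abnormal."""
        (self.normal if passed else self.abnormal).append(fld)
```

Because the lists hold references, a later correction written to the field object is visible through the management structure as well, which matches the pointer-based description in the text.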

Returning to FIG. 1, the business application 15 is an application that performs various business processes using the data processed by the OCR recognition engine 14.

The storage device 16 has a recording medium (not shown) on which programs, data, and the like are stored in advance. The recording medium is a magnetic or optical recording medium or a semiconductor memory, and is either fixed in the storage device 16 or removably mounted. It stores the system program, the various application programs that run on that system, and the data processed by the various processing programs.

Some or all of the programs, data, and the like stored on the recording medium may instead be received from another device such as a server or client via the transmission control unit 14 over a network transmission medium such as a WAN (Wide Area Network) or LAN (Local Area Network) and then stored. The recording medium may also be the recording medium of a server on the network. Furthermore, the programs may be transmitted to a server or client over a transmission medium such as a network line and installed on those devices.

The input device 17 includes a keyboard with cursor keys, numeric keys, and various function keys, and outputs a signal corresponding to each pressed key to the reading device 11, the definition tool 12, or the data processing unit 14a. The input device 17 may also include a pointing device such as a mouse or touch panel, or other input devices, as needed.

The display device 18 has a display screen such as a CRT (Cathode Ray Tube) or an LCD (Liquid Crystal Display), and displays on that screen information such as the image data read by the reading device 11 and the input instructions entered from the input device 17.

Next, the operation of the present embodiment will be described.
The programs that implement the functions described in the flowcharts below are stored in the storage device 16 of the data recognition device 1 in the form of readable program code, and the definition tool 12 or the data processing unit 14a sequentially executes operations according to that program code.

First, the processing executed by the definition tool 12 will be described with reference to FIGS. 9 to 18. FIG. 9 is a flowchart showing the main process executed by the definition tool 12. As shown in FIG. 9, when a process to execute is selected via the input device 17 (step S1), the definition tool 12 determines whether end was selected. If not (step S2; NO), it determines whether the image reading process was selected (step S3). If the image reading process was selected (step S3; YES), the definition tool 12 reads the image reading process program from the storage device 16 and executes the image reading process (FIG. 10) described below (step S6). The image reading process is described with reference to FIG. 10.

FIG. 10 is a flowchart showing the image reading process executed as part of the main process. As shown in FIG. 10, when the image data (image) of a slip is read from the reading device 11 (step S11), the definition tool 12 stores the read image data at a predetermined address in the storage device 16 (main storage) (step S12). The read image data is then displayed on the display device 18 (step S13), and when the image reading process ends, processing returns to the main process.

Returning to FIG. 9, if the field setting process was selected in step S1 (step S4; YES), the definition tool 12 reads the field setting process program from the storage device 16 and executes the field setting process (FIG. 11) described below. The field setting process is described with reference to FIG. 11.

FIG. 11 is a flowchart showing the field setting process executed as part of the main process. As shown in FIG. 11, the field setting process begins by executing an initialization process (FIG. 12). The initialization process is described with reference to FIG. 12.

As shown in FIG. 12, the registers are initialized to "CNT=0, SX=0, SY=0, EX=0, EY=0, CN=0, CK=0" (step S41). Next, a field information structure is allocated and its address is set in "Mlinep→Fieldp" and in the register FP (step S42). A display information structure is then allocated and its address is set in "Mlinep→Fieldp→FieldDispp" and in the register FDP (step S43). When the initialization process ends, processing returns to the field setting process.

Returning to FIG. 11, when a process to execute is selected via the input device 17 (step S22), the definition tool 12 determines whether end was selected. If not (step S23; NO), it determines whether a position and size setting instruction was selected (step S24).

If a position and size setting instruction was entered (step S24; YES), that is, if a rectangular area was specified with a mouse or the like on the setting screen 181 shown in FIG. 3(a), the upper-left coordinates (SX, SY) and lower-right coordinates (EX, EY) of the specified rectangle are acquired (step S25). Next, the setting screen 182 (FIG. 3(b)) for setting the maximum number of characters and the character type of the specified rectangle is displayed on the display device 18 (step S26), and the maximum number of characters (CN) and the character type (CK) entered on the setting screen 182 are acquired (steps S27 and S28).

The acquired information is then stored at predetermined addresses (step S28). Specifically, the acquired maximum number of characters (CN) is stored in the character count (Chanum) of the field information structure (FP), and the acquired character type (CK) is stored in its character type (Charkind). The acquired upper-left coordinates (SX, SY) and lower-right coordinates (EX, EY) are stored in the upper-left horizontal (SX), upper-left vertical (SY), lower-right horizontal (EX), and lower-right vertical (EY) members of the display information structure (FDP), respectively.

Next, to define the next field, an area allocation and register update process is executed (step S30). FIG. 13 shows this process, which is executed as part of the field setting process. A field information structure is allocated (step S45) and its address is set in "FP→Nextp" (step S45). "FP" is then set to "FP→Nextp" (step S46). A display information structure is also allocated and its address is set in "FP→FieldDispp" (step S47). "FDP" is then set to "FP→FieldDispp" (step S46). When the area allocation and register update process ends, processing returns to the field setting process.
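
The field definition steps of FIGS. 11 to 13 amount to appending one node per specified rectangle to a singly linked list. The following is a minimal Python sketch of that idea, not the patent's implementation: the class and function names (`FieldDisp`, `FieldInfo`, `define_fields`) are hypothetical, the attribute names only mirror Chanum, Charkind, FieldDispp, and Nextp, and the sketch allocates a node when a rectangle is specified instead of pre-allocating an empty one.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass
class FieldDisp:
    """Display information: rectangle of the field on the slip image."""
    sx: int  # upper-left x
    sy: int  # upper-left y
    ex: int  # lower-right x
    ey: int  # lower-right y


@dataclass
class FieldInfo:
    """Field information: recognition constraints plus a link to the next field."""
    charnum: int                         # maximum number of characters (CN)
    charkind: str                        # character type (CK)
    disp: FieldDisp                      # display information (FDP)
    nextp: Optional["FieldInfo"] = None  # next field in the chain


def define_fields(
    rects: Iterable[Tuple[int, int, int, int, int, str]]
) -> Tuple[Optional[FieldInfo], int]:
    """Build the field chain from (SX, SY, EX, EY, CN, CK) tuples.

    Returns the head of the chain and the number of fields (the counter CNT).
    """
    head: Optional[FieldInfo] = None
    tail: Optional[FieldInfo] = None
    count = 0
    for sx, sy, ex, ey, cn, ck in rects:
        node = FieldInfo(charnum=cn, charkind=ck, disp=FieldDisp(sx, sy, ex, ey))
        if head is None:
            head = node
        else:
            tail.nextp = node  # corresponds to setting FP→Nextp
        tail = node            # corresponds to FP = FP→Nextp
        count += 1             # counter increment (step S31)
    return head, count
```

Because nodes are created on demand here, an empty definition simply yields `(None, 0)`, so no release step like the post-processing of FIG. 14 is needed.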

Returning to FIG. 11, the register counter is incremented (+1) (step S31) and processing moves to step S22. When the next process is selected (step S22), steps S24 to S31 are repeated. If end was selected (step S23; YES), post-processing for ending the field setting process is executed (step S32). The post-processing is described with reference to FIG. 14.

As shown in FIG. 14, it is first determined whether the counter is 0. If the counter is 0 (step S51; YES), the memory pointed to by "Mlinep→Fieldp→FieldDispp" is released (step S52). The memory pointed to by "Mlinep→Fieldp" is then released (step S53), and the field setting process ends.

Returning to FIG. 9, if the calculation setting process was selected in step S1 (step S5; YES), the definition tool 12 reads the calculation setting process program from the storage device 16 and executes the calculation setting process described below (step S8). The calculation setting process is described with reference to FIG. 15.

FIG. 15 is a flowchart showing the calculation setting process executed as part of the main process. As shown in FIG. 15, the calculation setting process begins by executing an initialization process (FIG. 16) (step S61). The initialization process is described with reference to FIG. 16.

FIG. 16 is a flowchart showing the initialization process executed as part of the calculation setting process. As shown in FIG. 16, the registers are initialized and the counter is set to 0 (step S71). Next, a field information structure is allocated and its address is set in "Mlinep→FieldCulp" and in the register FCP (step S72). A display information structure is then allocated and its address is set in "FCP→FieldCulp" and in the register FCP (step S73). When the initialization process ends, processing returns to the calculation setting process.

Returning to FIG. 15, when a process to execute is selected via the input device 17 (step S62), the definition tool 12 determines whether end was selected. If not (step S63; NO), it determines whether a recognition field was selected (step S64).

If a recognition field was selected (step S64; YES), the field information structure of the selected field is identified (step S65). "FP" is then added to the FCP stack (step S66), followed by "=" (step S67). The definition tool 12 then reads the calculation information setting process program from the storage device 16 and executes the calculation information setting process (step S68). The calculation information setting process is described with reference to FIG. 17.

FIG. 17 is a flowchart showing the calculation information setting process executed as part of the calculation setting process. As shown in FIG. 17, when a recognition field is selected via the input device 17 (step S82; YES), the field information structure (FP) of the selected recognition field is identified (step S83) and "FP" is added to the FCP stack (step S84).

Next, it is determined whether an operator was entered (step S85). If an operator was entered (step S85; YES), the entered operator is added to the FCP stack (step S86). It is then determined whether an end instruction was selected. If no end instruction was entered (step S81; NO), steps S82 to S86 are repeated. If an end instruction was selected (step S81; YES), the calculation information setting process ends and processing returns to the calculation setting process.
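
The stack built in steps S66, S67, and S82 to S86 holds the result field, an "=" marker, and then the fields and operators in the order the user enters them. The sketch below illustrates that recording step in Python under stated assumptions: `build_relation` and `OPERATORS` are hypothetical names, fields are represented by plain strings rather than structure addresses, and the check that fields and operators alternate is an addition of this sketch that the flowchart does not describe.

```python
OPERATORS = {"+", "-", "*", "/"}


def build_relation(result_field, inputs):
    """Record one calculation relation as a token stack.

    result_field: name of the field that should equal the computed value.
    inputs: fields and operators in entry order, e.g. ["qty", "*", "price"].
    """
    stack = [result_field, "="]  # steps S66 and S67
    expect_field = True
    for token in inputs:
        is_operator = token in OPERATORS
        if is_operator == expect_field:
            # Assumption of this sketch: fields and operators must alternate.
            raise ValueError(f"unexpected token {token!r}")
        stack.append(token)      # step S84 (field) or step S86 (operator)
        expect_field = not expect_field
    if expect_field:
        raise ValueError("a relation must end with a field")
    return stack
```

For example, defining "amount equals quantity times unit price" yields `["amount", "=", "qty", "*", "price"]`.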

Returning to FIG. 15, when the calculation information setting process ends (step S68), the register counter is incremented (step S69), processing moves to step S62, and the next process is selected (step S62). If the next recognition field is selected, steps S65 to S69 are repeated. If end is selected (step S63; YES), post-processing for ending the calculation setting process is executed (step S70). The post-processing is described with reference to FIG. 18.

As shown in FIG. 18, it is first determined whether the counter is 0. If the counter is 0 (step S87; YES), the memory pointed to by "Mlinep→FieldCulp→Culinfop" is released (step S88). The memory pointed to by "Mlinep→FieldCulp" is then released (step S89), and the calculation setting process ends.

Returning to FIG. 9, when the image reading process (FIG. 10), the field setting process (FIG. 11), the calculation setting process (FIG. 15), or another such process ends, processing moves to step S1 and the next process is selected. When end is selected (step S2), the main process executed by the definition tool 12 ends.

Next, the processes executed by the data processing unit 14a will be described with reference to FIGS. 19 to 23. FIG. 19 is a flowchart showing the main process executed by the data processing unit 14a. As shown in FIG. 19, "FCP" is set to "Mlinep→FieldCulp" (step S91). The data processing unit 14a then allocates a normal field management structure area, and its address is set in "KMNP→rightp" and in the register Rightp (step S92).

An abnormal field management structure area is also allocated, and its address is set in "KMNP→badp" and in the register Badp (step S92). It is then determined whether FCP is 0 (step S94). If FCP is not 0 (step S94; NO), the verification process program is read from the storage device 16 and the verification process (FIG. 20) described below is executed. The verification process is described with reference to FIG. 20.

FIG. 20 is a flowchart showing the verification process executed as part of the main process. As shown in FIG. 20, "SP" is set to "FCP→Culinfop" (step S111). The calculation stored on the stack pointed to by SP, stacked in reverse Polish notation for example, is then executed (step S112), and it is determined whether the calculation result is correct (step S113).

If the calculation result is correct (step S113; YES), the field management structure is constructed with "Rightp" as the argument (step S114). If the calculation result is wrong (step S113; NO), the field management structure is constructed with "Badp" as the argument (step S114). The field management structure construction process is described with reference to FIG. 21.
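
The verification of steps S112 and S113 can be illustrated by evaluating a relation in reverse Polish notation and comparing the computed value with the recognized value of the result field. This is a minimal sketch, assuming the relation is already available in postfix order and that recognized values are numbers keyed by field name; `eval_rpn` and `verify` are hypothetical names, not taken from the patent.

```python
import operator

OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def eval_rpn(tokens, values):
    """Evaluate a postfix expression whose operands are field names."""
    stack = []
    for token in tokens:
        if token in OPS:
            right = stack.pop()
            left = stack.pop()
            stack.append(OPS[token](left, right))
        else:
            stack.append(values[token])  # look up the recognized value
    if len(stack) != 1:
        raise ValueError("malformed expression")
    return stack[0]


def verify(result_field, rpn_tokens, values):
    """Return True when the recognized result matches the computed value."""
    return eval_rpn(rpn_tokens, values) == values[result_field]


recognized = {"qty": 3, "price": 120, "amount": 360}
consistent = verify("amount", ["qty", "price", "*"], recognized)
```

The exact comparison is adequate for integer amounts; relations that use division or decimal values would need a tolerance or a decimal type.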

FIG. 21 is a flowchart showing the field management structure construction process executed as part of the verification process. As shown in FIG. 21, the data processing unit 14a extracts the entry at the stack pointer SP and stores it in the register "RR" (step S121). It is then determined whether "RR" is "=" (step S122). If "RR" is not "=" (step S122; NO), it is determined whether "RR" is an operator (step S123). If "RR" is an operator (step S123; YES), processing moves to step S121.

If "RR" is not an operator (step S123; NO), "PP→fp" is set to "RR" (step S124). A field management structure is then allocated and its address is set in "PP→Nextp" and in "PP" (step S124). The entry at the stack pointer SP is extracted again, and it is determined whether "RR" is "=" (step S122). If "RR" is "=" (step S122), the field management structure construction process ends.

In the field management structure construction process described above, when the calculation result in the verification process is correct, the addresses of the field information structures of the fields that make up that calculation are stored in the normal field management structure. When the calculation result is wrong, the addresses of the field information structures of the fields that make up that calculation are stored in the abnormal field management structure.

As a result of the field management structure construction process, the fields listed in the normal field management structure contain no misrecognized fields, whereas the fields listed in the abnormal field management structure also include fields that were recognized correctly.
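
The construction process and its consequence can be sketched as follows. The Python below walks a relation stack of the form `[result, "=", field, operator, field, ...]` from the top, skips operators, and stops at "=", as in FIG. 21. Appending the result field that remains under "=" is an assumption of this sketch, since the flowchart description ends at "="; the names `fields_of` and `classify_fields` are likewise hypothetical.

```python
OPERATORS = {"+", "-", "*", "/"}


def fields_of(stack):
    """Collect the field names of one relation stack, top of stack first."""
    pending = list(stack)
    fields = []
    while pending:
        token = pending.pop()      # step S121: take the top entry
        if token == "=":           # step S122: stop at the marker
            break
        if token not in OPERATORS:  # step S123: operators are skipped
            fields.append(token)
    fields.extend(pending)         # assumption: also list the result field
    return fields


def classify_fields(stacks, passed):
    """Split fields into a normal list and an abnormal list.

    stacks: relation stacks; passed: one bool per stack (verification result).
    """
    right, bad = [], []
    for stack, ok in zip(stacks, passed):
        target = right if ok else bad
        for name in fields_of(stack):
            if name not in target:
                target.append(name)
    return right, bad


stacks = [
    ["amount1", "=", "qty1", "*", "price1"],      # verified correctly
    ["total", "=", "amount1", "+", "amount2"],    # verification failed
]
right, bad = classify_fields(stacks, [True, False])
```

Here `amount1` lands in both lists: it belongs to a failed relation even though another relation confirms it, which is exactly why the abnormal list needs the filtering step that follows.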

Returning to FIG. 19, when the verification process ends, "FCP" is set to "FCP→FieldNextp" and processing moves to step S94. If FCP is determined to be 0 in step S94 (step S94; YES), "FCP" is set to "Mlinep→FieldCulp" (step S97). It is then determined whether "KMNP→badp" is 0 (step S98). If "KMNP→badp" is 0 (step S98; YES), the main process ends.

If "KMNP→badp" is not 0 (step S98; NO), the data processing unit 14a reads the misrecognized field detection process program from the storage device 16 and executes the misrecognized field detection process described below (step S99). This process removes the correctly recognized fields from the fields listed in the abnormal field management structure by the field management structure construction process described above. The misrecognized field detection process is described with reference to FIG. 22.

FIG. 22 is a flowchart showing the misrecognized field detection process executed as part of the main process. As shown in FIG. 22, "Rightp" is set to "KMNP→rightp", "Badp" to "KMNP→badp", the register "B" to "Badp", and the register "R" to "Rightp" (steps S131 to S134). It is then determined whether "R" is 0 (step S135). If "R" is 0 (step S135; YES), "B" is set to "B→Nextp" and processing moves to step S134.

If "R" is not 0 (step S135; NO), it is determined whether "B" is 0 (step S137). If "B" is 0 (step S137; YES), the misrecognized field detection process ends. If "B" is not 0 (step S137; NO), it is determined whether "R→fp" and "B→fp" are equal (step S138).

If "R→fp" and "B→fp" are equal (step S138; YES), "B→fp" is set to 0 (step S139). If they are not equal (step S138; NO), "R" is set to "R→Nextp" (step S140). Processing then returns to step S135, and the above is repeated until "B" becomes 0.

Returning to FIG. 19, when the misrecognized field detection process ends (step S99), the abnormal field management structures are re-chained (step S100). Specifically, the abnormal field management structures chained from "KMNP→badp" are sorted in ascending order of the recognition confidence stored in "KMNP→badp→fp→RecLebel" and re-chained in that order.
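
The detection and re-chaining steps can be summarized as a filter followed by a sort. The sketch below is an illustration only: `detect_and_rank` is a hypothetical name, Python lists stand in for the chained structures, entries are dropped rather than having their field pointer set to 0, and the confidence values are invented for the example.

```python
def detect_and_rank(right, bad, confidence):
    """Return the suspect fields, least confident first.

    right: fields that appear in at least one relation that verified correctly.
    bad: fields that appear in at least one relation that failed.
    confidence: recognition confidence per field (higher means more certain).
    """
    trusted = set(right)
    # Detection (FIG. 22): drop every field that a correct relation confirms.
    suspects = [name for name in bad if name not in trusted]
    # Re-chaining (step S100): order by ascending recognition confidence.
    return sorted(suspects, key=lambda name: confidence[name])


ranked = detect_and_rank(
    right=["total", "amount1", "amount2"],
    bad=["amount1", "qty1", "price1"],
    confidence={"amount1": 0.90, "qty1": 0.41, "price1": 0.97},
)
```

With these inputs `amount1` is cleared by the relation that confirmed it, and `qty1` is tried before `price1` because its confidence is lower.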

After the re-chaining, the data processing unit 14a reads the misrecognized field correction process program from the storage device 16 and executes the misrecognized field correction process (FIG. 23) described below. The misrecognized field correction process is described with reference to FIG. 23.

FIG. 23 is a flowchart showing the misrecognized field correction process executed as part of the main process. As shown in FIG. 23, "Ilp" is set to "KMNP→FieldCulp", "Badp" to "KMNP→badp", the register "I" to "Ilp", and the register "B" to "Badp" (steps S141 to S144). It is then determined whether "B" is 0 (step S145). If "B" is 0 (step S145; YES), "I" is set to "I→FieldNextp" (step S146) and processing moves to step S144.

If "B" is not 0 (step S145; NO), it is determined whether "I" is 0 (step S147). If "I" is 0 (step S147; YES), the misrecognized field correction process ends. If "I" is not 0 (step S147; NO), it is determined whether the same address as "R→fp" exists in the stack pointed to by "I→Culinfop" (step S148).

If the same address as "R→fp" exists in the stack pointed to by "I→Culinfop" (step S148; YES), the correct value of the input field managed by "B→fp" is computed from the formula held in the "I→Culinfop" stack and stored in "B→fp→RecReslt" (step S149). In other words, the formula stored in the stack is used to calculate the value of the field with the lowest confidence from the recognition results of the other fields.

If the same address as "R→fp" does not exist in the stack pointed to by "I→Culinfop" (step S148; NO), "B" is set to "B→Nextp" (step S150) and processing moves to step S145. The above is repeated until "I" becomes 0.
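
Computing the correct value in step S149 means solving the stored formula for the suspect field. The patent does not spell out how the formula is inverted, so the following Python is a sketch under a simplifying assumption: each relation uses a single operator, either "+" (a total) or "*" (quantity times unit price). The function name `solve_for` and its parameters are hypothetical.

```python
def solve_for(field, result, operands, op, values):
    """Recompute `field` from the other fields of one relation.

    The relation is: result = operands[0] op operands[1] op ...
    Only op "+" and op "*" are supported in this sketch.
    """
    if op not in ("+", "*"):
        raise ValueError("only '+' and '*' relations are supported")
    if field != result and field not in operands:
        raise ValueError(f"{field!r} is not part of this relation")

    def combine(names):
        acc = 0 if op == "+" else 1
        for name in names:
            acc = acc + values[name] if op == "+" else acc * values[name]
        return acc

    if field == result:
        # The suspect is the result itself: just evaluate the formula.
        return combine(operands)
    # The suspect is an operand: undo the contribution of the other operands.
    rest = combine([name for name in operands if name != field])
    if op == "+":
        return values[result] - rest
    return values[result] / rest  # raises ZeroDivisionError if rest is 0


# qty was misread as 8; amount and price are trusted.
fixed_qty = solve_for(
    "qty", "amount", ["qty", "price"], "*",
    {"qty": 8, "price": 120, "amount": 360},
)
```

The recomputed value replaces the recognition result of the suspect field, which corresponds to storing it in "B→fp→RecReslt".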

Returning to FIG. 19, when the misrecognized field correction process ends (step S101), processing moves to step S91 and steps S91 to S101 are repeated. When "KMNP→badp" becomes 0 (step S98; YES), the main process ends.
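
The outer loop of FIG. 19 (verify every relation, isolate the suspects, correct the least confident one, and repeat) can be sketched end to end as follows. This is an illustrative Python sketch, not the patent's implementation: relations are assumed to be `(result, op, operands)` tuples with a single "+" or "*" operator, values are integers, the loop stops when no suspect remains or after `max_rounds` passes, and all names and sample values are hypothetical.

```python
def evaluate(operands, op, values):
    """Fold the operand values with a single operator ("+" or "*")."""
    acc = 0 if op == "+" else 1
    for name in operands:
        acc = acc + values[name] if op == "+" else acc * values[name]
    return acc


def solve(target, relation, values):
    """Recompute `target` from the other fields; None if not possible."""
    result, op, operands = relation
    if target == result:
        return evaluate(operands, op, values)
    rest = evaluate([n for n in operands if n != target], op, values)
    if op == "+":
        return values[result] - rest
    if rest == 0:
        return None
    quotient, remainder = divmod(values[result], rest)
    return quotient if remainder == 0 else None


def correct_slip(relations, values, confidence, max_rounds=10):
    """Repeat verification, detection, and correction until consistent."""
    values = dict(values)  # work on a copy of the recognition results
    for _ in range(max_rounds):
        right, bad, failed = set(), [], []
        for relation in relations:
            result, op, operands = relation
            names = [result, *operands]
            if evaluate(operands, op, values) == values[result]:
                right.update(names)          # normal list
            else:
                failed.append(relation)
                for name in names:           # abnormal list, no duplicates
                    if name not in bad:
                        bad.append(name)
        suspects = [n for n in bad if n not in right]
        if not suspects:
            break
        target = min(suspects, key=lambda n: confidence[n])
        relation = next(r for r in failed if target == r[0] or target in r[2])
        fixed = solve(target, relation, values)
        if fixed is None:
            break
        values[target] = fixed
    return values


relations = [
    ("amount1", "*", ["qty1", "price1"]),
    ("amount2", "*", ["qty2", "price2"]),
    ("total", "+", ["amount1", "amount2"]),
]
recognized = {"qty1": 8, "price1": 120, "amount1": 360,
              "qty2": 2, "price2": 120, "amount2": 240, "total": 600}
confidence = {name: 0.95 for name in recognized}
confidence["qty1"] = 0.41  # the misread digit has the lowest confidence
corrected = correct_slip(relations, recognized, confidence)
```

In this example the first pass fails only for `amount1`, the total relation clears `amount1`, and `qty1` is recomputed as 360 / 120 before the second pass finds every relation consistent.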

As described above, the image data of a slip is recognized, and the calculation relationships contained in the recognized detail lines are defined in advance with the definition tool 12. The defined calculations are applied to the recognition results of the fields of each detail line, and the recognition results are compared with the calculation results to identify the fields in which misrecognition has occurred. The defined calculations are also applied to the recognition results of the fields presumed to be correctly recognized in order to calculate the correct value of a field presumed to be misrecognized. The value of that field is then corrected to the calculated value.

This makes it possible not only to detect misrecognition in the results recognized from the image data, but also to calculate the correct value of a field presumed to be misrecognized from the recognition results of the fields presumed to be correctly recognized. Misrecognized fields can therefore be corrected easily and accurately, and the recognition rate of the data recognition device 1 can be improved.

In addition, the confidence of each recognized field is stored on the basis of multiple calculation relationships, and the value of a field presumed to be misrecognized is calculated from the recognition results of the fields presumed to be correctly recognized. The correction is therefore based on more reliable values, which further improves the recognition rate of the data recognition device 1.

Furthermore, because the definition tool 12 can identify fields in the acquired slip image data and define the calculation relationships between those fields individually, the present invention can be applied to slips, forms, and similar documents containing a variety of calculation relationships, providing a highly versatile data recognition device 1.

The description of the present embodiment is one example of a preferred data recognition device 1 according to the present invention, and the invention is not limited to it. For example, the present embodiment describes the case in which the data recognition device 1 reads the image data of a slip and recognizes the information written on it, but other forms, estimates, design documents, and the like may be used instead.

The configurations of the data stored in the definition file 13 or the data management unit 14b are likewise examples and are not limited to those described above. For example, the calculation information was described as being stored on a stack in reverse Polish notation, but other methods may be used. Also, in the present embodiment the total field is processed as one item of field calculation information, but the total field and the field calculation information may be handled separately.

The detailed configuration and detailed operation of the data recognition device 1 in the present embodiment can of course be modified as appropriate without departing from the spirit of the present invention.

FIG. 1 is a block diagram showing the functional configuration of the data recognition device 1 in an embodiment to which the present invention is applied.
FIG. 2 is a block diagram showing the functional configuration of the reading device 11 shown in FIG. 1.
FIG. 3(a) is a diagram showing an example of the setting screen 181 displayed on the display device 18 of FIG. 1. FIG. 3(b) is a diagram showing an example of the setting screen 182 displayed on the display device 18 of FIG. 1.
FIG. 4 is a diagram explaining how a calculation relationship is defined with the definition tool 12.
FIG. 5 is a diagram showing an example of part of the data configuration stored in the definition file 13.
FIG. 6 is a diagram showing an example of part of the data configuration stored in the definition file 13.
FIG. 7 is a diagram showing an example of part of the data configuration managed by the data management unit 14b.
FIG. 8 is a diagram showing an example of part of the data configuration managed by the data management unit 14b.
FIG. 9 is a flowchart showing the main process executed by the definition tool 12.
FIG. 10 is a flowchart showing the image reading process executed by the definition tool 12.
FIG. 11 is a flowchart showing the field setting process executed by the definition tool 12.
FIG. 12 is a flowchart showing the initialization process executed by the definition tool 12.
FIG. 13 is a flowchart showing the area allocation and register update process executed by the definition tool 12.
FIG. 14 is a flowchart showing the post-processing executed by the definition tool 12.
FIG. 15 is a flowchart showing the calculation setting process executed by the definition tool 12.
FIG. 16 is a flowchart showing the initialization process executed by the definition tool 12.
FIG. 17 is a flowchart showing the calculation information setting process executed by the definition tool 12.
FIG. 18 is a flowchart showing the post-processing executed by the definition tool 12.
FIG. 19 is a flowchart showing the main process executed by the data processing unit 14a.
FIG. 20 is a flowchart showing the verification process executed by the data processing unit 14a.
FIG. 21 is a flowchart showing the field management structure construction process executed by the data processing unit 14a.
FIG. 22 is a flowchart showing the misrecognized field detection process executed by the data processing unit 14a.
FIG. 23 is a flowchart showing the misrecognized field correction process executed by the data processing unit 14a.
FIG. 24 is a diagram explaining a method of improving the recognition rate by referring to a database in a conventional data recognition device.
FIG. 25 is a diagram explaining a method of improving the recognition rate by applying calculations to recognition results in a conventional data recognition device.

Explanation of symbols

1 Data recognition apparatus
11 Reading apparatus
12 Definition tool
13 Definition file
14 OCR recognition engine
14a Data processing unit
14b Data management unit
15 Business application
16 Storage apparatus
17 Input apparatus
18 Display apparatus

Claims

1. A data recognition apparatus that reads information entered on a form as image information and performs character recognition on the image information, the apparatus comprising:
a definition file that stores setting information defining the fields included in the image information;
recognition means for recognizing the fields included in the image information based on the setting information;
input means for inputting an arithmetic relationship between the recognized fields;
determination means for performing an operation between the recognized fields based on the arithmetic relationship, comparing the recognition result of a field with the operation result, and determining whether a misrecognition has occurred; and
calculation means for calculating, when the recognition result of a field is determined to contain a misrecognition, the value of that field based on the recognition results of the fields determined to contain no misrecognition.
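The check-and-recompute behavior recited in the claim above can be pictured with a small sketch. It is only an illustration under stated assumptions, not the patented implementation: the relation `quantity * unit_price = amount`, the field names, and the functions `has_misrecognition` and `recompute` are all hypothetical, and the choice of which field is suspect is passed in by the caller because the claim itself does not say how that field is selected.

```python
# Hypothetical arithmetic relationship for one detail line:
#   quantity * unit_price = amount

def has_misrecognition(line):
    """Return True when the recognized amount disagrees with the computed one."""
    return line["quantity"] * line["unit_price"] != line["amount"]


def recompute(line, suspect):
    """Return a copy of the line with the suspect field recalculated
    from the two fields assumed to be recognized correctly."""
    fixed = dict(line)
    if suspect == "amount":
        fixed["amount"] = line["quantity"] * line["unit_price"]
        return fixed
    if suspect not in ("quantity", "unit_price"):
        raise ValueError(f"unknown field: {suspect}")
    # Solve the relation for the suspect factor using the other factor.
    other = "unit_price" if suspect == "quantity" else "quantity"
    value, remainder = divmod(line["amount"], line[other])
    if remainder:
        # The trusted fields cannot satisfy the relation with an integer value.
        raise ValueError("trusted fields are inconsistent with the relation")
    fixed[suspect] = value
    return fixed


# Example: the amount was read as 380 although 3 * 120 = 360.
line = {"quantity": 3, "unit_price": 120, "amount": 380}
if has_misrecognition(line):
    line = recompute(line, suspect="amount")
print(line)
```

The sketch uses integer values so that the comparison is exact; a real form would also need a rule for fields that take part in more than one relationship.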

2. The data recognition apparatus according to claim 1, wherein the definition file stores, as calculation information, the arithmetic relationship between fields that is input by the input means.

3. The data recognition apparatus according to claim 1, further comprising storage means for storing, based on the result of comparing the recognition result of a field with the operation result, each field whose recognition result differs from the operation result together with the probability of its recognition,
wherein the determination means compares the recognition probabilities of the fields stored in the storage means whose recognition results differ from the operation results, and determines whether the recognized field contains a misrecognition.
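The probability comparison in the claim above could look roughly like the following sketch. The claim states only that the stored recognition probabilities are compared; treating the field with the lowest probability as the misrecognized one is an assumption made here for illustration, as are the relation `quantity * unit_price = amount`, the field names, and the function `judge_misrecognized`.

```python
# Hypothetical relation for one detail line: quantity * unit_price = amount

RELATION_FIELDS = ("quantity", "unit_price", "amount")


def judge_misrecognized(values, probability):
    """Return the name of the field judged to be misrecognized,
    or None when the recognized values satisfy the relation."""
    if values["quantity"] * values["unit_price"] == values["amount"]:
        return None
    # Store every field involved in the mismatch with its recognition probability.
    stored = {name: probability[name] for name in RELATION_FIELDS}
    # Assumption: the least certain field is the one judged to be misrecognized.
    return min(stored, key=stored.get)


values = {"quantity": 8, "unit_price": 120, "amount": 360}
probability = {"quantity": 0.41, "unit_price": 0.97, "amount": 0.93}
print(judge_misrecognized(values, probability))
```

Once a field has been singled out this way, its value can be recalculated from the remaining fields.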

4. A program for causing a computer to realize:
a function of storing setting information defining the fields included in image information;
a function of recognizing the fields included in the image information based on the setting information;
a function of inputting an arithmetic relationship between the recognized fields;
a function of performing an operation between the recognized fields based on the arithmetic relationship, comparing the recognition result of a field with the operation result, and determining whether a misrecognition has occurred; and
a function of calculating, when the recognition result of a field is determined to contain a misrecognition, the value of that field based on the recognition results of the fields determined to contain no misrecognition.