JP4347677B2

JP4347677B2 - Form OCR program, method and apparatus

Info

Publication number: JP4347677B2
Application number: JP2003409481A
Authority: JP
Inventors: 雅健栗原; 正博上野; 昭夫安達
Original assignee: Fujifilm Corp
Current assignee: Fujifilm Corp
Priority date: 2003-12-08
Filing date: 2003-12-08
Publication date: 2009-10-21
Anticipated expiration: 2023-12-08
Also published as: JP2005173730A

Description

本発明は、帳票などの文書を光学的にイメージデータとして読み取り、読み取ったイメージデータから文字認識を行う帳票ＯＣＲプログラム、方法及び装置に関するものである。 The present invention relates to a form OCR program, method and apparatus for optically reading a document such as a form as image data and performing character recognition from the read image data.

伝票や給与報告書などの帳票を、光学的にイメージデータとして読み取り、読み取ったイメージデータから帳票に記載された文字を認識するＯＣＲ（Optical Characterize Recognition）装置（例えば、下記特許文献１参照）が知られている。ここで、文字には、数字や記号も含まれる。帳票には、氏名，受給者番号，給与所得の額などの複数の項目に関して、それぞれの項目の個別具体的な文字が記入される記入欄が設けられており、この記入欄の近傍には、それぞれの項目の名称（項目名）が予めプリント（プレプリント）されている。記入欄や項目名は、それぞれ罫線によって構成された枠によって区画されて配列されている。ＯＣＲ装置によって最終的に抽出したい情報は、記入欄内に記入された文字であり、ＯＣＲ装置においては、この記入欄を特定しその欄内の文字を精度良く読み取ることが主要な課題となっている。 An OCR (Optical Characterize Recognition) device that optically reads a form such as a slip or a salary report as image data and recognizes characters written on the form from the read image data (for example, see Patent Document 1 below) is known. It has been. Here, the characters include numbers and symbols. The form has an entry field where individual specific characters for each item are entered for multiple items such as name, beneficiary number, salary income, etc. The name (item name) of each item is printed (preprinted) in advance. The entry fields and item names are divided and arranged by a frame constituted by ruled lines. The information that is finally extracted by the OCR device is the characters entered in the entry column. In the OCR device, it is a major issue to identify this entry column and accurately read the characters in the entry column. Yes.

特許文献１記載のＯＣＲ装置は、予め帳票の種類毎に、前記項目名と前記記入欄との相対的な位置関係を示す論理レイアウト情報を用意しておき、帳票種別コードによって所望の帳票に対応する論理レイアウト情報を選択し、選択された論理レイアウト情報を参照しながら記入欄内の文字を認識する。論理レイアウト情報は、帳票上の項目名及び記入欄のレイアウトをテキストデータで表現したものであり、項目名と記入欄との相対的な位置関係は、各項目名を取り囲む項目名枠と記入欄とを表すテキストの記述順序によって示される。 The OCR device described in Patent Document 1 prepares logical layout information indicating the relative positional relationship between the item name and the entry field in advance for each form type, and corresponds to a desired form by a form type code. The logical layout information to be selected is selected, and the characters in the entry column are recognized while referring to the selected logical layout information. The logical layout information is the textual representation of the item name and entry field layout on the form. The relative positional relationship between the item name and entry field is the item name frame and entry field surrounding each item name. It is shown by the description order of the text representing

例えば、帳票上、項目名「氏名」の右隣に記入欄がある場合には、ファイルには、”項目名「氏名」＆記入欄”というように、氏名の項目名枠の右隣に＆記号を挟んで記述される。この記述順序により項目名枠の右には記入欄があることが示される。また、項目名枠の下に記入欄がある場合には、１行目に”項目名「氏名」”を記述し、改行して項目名「氏名」の下に”記入欄”と記述することで、項目名枠の下に記入欄があることが示される。論理レイアウト情報には、こうした情報が帳票の全項目分収録される。 For example, if there is an entry field to the right of the item name “Name” on the form, the file will have an entry field to the right of the item name box of “Name” “Name” & entry field ”. The description order indicates that there is an entry field to the right of the item name frame, and if there is an entry field below the item name frame, By describing the name “name” and then writing “entry column” under the item name “name” after a line break, it is indicated that there is an entry column below the item name frame. Such information is recorded in the logical layout information for all items of the form.

論理レイアウト情報は、絶対的な座標位置によって記入欄や項目名枠の位置を記述するものではなく、両者の相対的位置関係のみを記述したものである。このため、論理レイアウト情報を記入欄の特定に使用することにより、帳票をイメージデータ化する際の読み取り倍率を正確に合わせる必要もなく、帳票イメージデータの基準位置が基準座標位置からずれているといった場合でも、記入欄が特定不能になるということがなくなるので、記入欄の認識率が向上し、その結果文字認識率も向上する。 The logical layout information does not describe the position of the entry field or the item name frame by the absolute coordinate position, but describes only the relative positional relationship between the two. For this reason, by using the logical layout information for specifying the entry field, it is not necessary to accurately adjust the reading magnification when converting the form into image data, and the reference position of the form image data is deviated from the reference coordinate position. Even in this case, the entry column is not unspecified, so the recognition rate of the entry column is improved, and as a result, the character recognition rate is also improved.

特開平１０−２０７９８１号公報Japanese Patent Laid-Open No. 10-207981

しかしながら、例えば、給与支払報告書など、公的な機関によって記入項目が規定されている帳票でも、その帳票を作成するメーカーによって、記入欄や項目名のレイアウトは異なる。上記特許文献１記載のＯＣＲ装置では、項目のレイアウトが異なる場合には、認識精度が著しく低下してしまうという問題がある。もちろん、レイアウトが異なる帳票毎に論理レイアウト情報を予め作成し、これを登録しておけば、認識精度の低下を防止することはできる。しかし、レイアウトが異なる帳票の数は膨大であるため、すべての帳票の論理レイアウト情報を用意することは事実上不可能に近い。 However, even for a form whose entry items are regulated by a public institution such as a salary payment report, the layout of entry fields and item names differs depending on the manufacturer that creates the form. The OCR device described in Patent Document 1 has a problem that the recognition accuracy is significantly lowered when the layout of items is different. Of course, if logical layout information is created in advance for each form having a different layout and registered, it is possible to prevent the recognition accuracy from being lowered. However, since the number of forms with different layouts is enormous, it is virtually impossible to prepare logical layout information for all forms.

本発明は、項目のレイアウトが異なる帳票に対して柔軟に対応して認識精度の低下を防止することができる帳票ＯＣＲプログラム、方法及び装置を提供することを目的とする。 An object of the present invention is to provide a form OCR program, method, and apparatus that can flexibly cope with forms having different item layouts and prevent a reduction in recognition accuracy.

本発明の帳票ＯＣＲプログラムは、複数の項目に関して、その記入欄と予めプリントされた項目名とが配列され、前記記入欄と項目名とがそれぞれ罫線によって区画された帳票を読み取った帳票イメージから、前記記入欄内の文字を認識する帳票ＯＣＲ処理をコンピュータに実行させる帳票ＯＣＲプログラムにおいて、前記帳票イメージの全面に対してＯＣＲ処理を実行し、前記記入欄を区画する記入枠の位置，前記項目名を取り囲む項目名枠の位置，及びこれらの枠内の文字列を認識するとともに、認識した情報を枠毎に１つのレコードとしてまとめる全面ＯＣＲ処理ステップと、各項目名毎に再度ＯＣＲ処理が必要か否かを予め定義した再ＯＣＲ指定情報を参照して、再度ＯＣＲ処理が必要な項目名に対応する前記レコードを読み出すとともに、各項目名毎に対応する記入欄との相対的な位置関係を予め定義した記入欄の位置情報を参照して、読み出した前記レコードに含まれる項目名枠の位置から再度ＯＣＲ処理をすべき記入欄を特定する記入欄特定処理ステップと、予め定義された各項目の文字属性情報に基づいて、対象となる記入欄の属性に適合した辞書データを使用し、前記記入欄特定処理ステップで特定された記入欄に対して部分的にＯＣＲ処理を実行する部分ＯＣＲ処理ステップとからなることを特徴とする。 In the form OCR program of the present invention, for a plurality of items, entry fields and pre-printed item names are arranged, and from the form image obtained by reading the form in which the entry field and the item names are partitioned by ruled lines, In a form OCR program that causes a computer to execute a form OCR process for recognizing characters in the entry field, the OCR process is performed on the entire surface of the form image, and the position of the entry frame that defines the entry field, the item name Recognize the position of the item name frame that surrounds and the character string in these frames, and collect the recognized information as one record for each frame, and whether OCR processing is required again for each item name When the record corresponding to the item name that needs OCR processing again is read with reference to the re-OCR designation information that defines whether or not In addition, the OCR process is performed again from the position of the item name frame included in the read record by referring to the position information of the entry field in which the relative positional relationship with the entry field corresponding to each item name is defined in advance. An entry field specifying process step for specifying an entry field to be used, and using the dictionary data suitable for the attribute of the entry field based on the character attribute information of each item defined in advance, the entry field specifying process step And a partial OCR processing step for partially executing the OCR processing on the entry field specified in (1).

また、本発明の帳票ＯＣＲ方法は、複数の項目に関して、その記入欄と予めプリントされた項目名とが配列され、前記記入欄と項目名とがそれぞれ罫線によって区画された帳票を読み取った帳票イメージから、前記記入欄内の文字を認識する帳票ＯＣＲ方法において、前記帳票イメージの全面に対してＯＣＲ処理を実行し、前記記入欄を区画する記入枠の位置，前記項目名を取り囲む項目名枠の位置，及びこれらの枠内の文字列を認識するとともに、認識した情報を枠毎に１つのレコードとしてまとめる全面ＯＣＲ処理ステップと、各項目名毎に再度ＯＣＲ処理が必要か否かを予め定義した再ＯＣＲ指定情報を参照して、再度ＯＣＲ処理が必要な項目名に対応する前記レコードを読み出すとともに、各項目名毎に対応する記入欄との相対的な位置関係を予め定義した記入欄の位置情報を参照して、読み出した前記レコードに含まれる項目名枠の位置から再度ＯＣＲ処理をすべき記入欄を特定する記入欄特定処理ステップと、予め定義された各項目の文字属性情報に基づいて、対象となる記入欄の属性に適合した辞書データを使用し、前記記入欄特定処理ステップで特定された記入欄に対して部分的にＯＣＲ処理を実行する部分ＯＣＲ処理ステップとからなることを特徴とする。 In the form OCR method of the present invention, a form image obtained by reading a form in which entry fields and pre-printed item names are arranged for a plurality of items, and the entry field and the item names are partitioned by ruled lines, respectively. In the form OCR method for recognizing characters in the entry field, OCR processing is performed on the entire surface of the form image, and the position of the entry frame that divides the entry field, the item name frame surrounding the item name, Recognize positions and character strings in these frames, and pre-define whether or not OCR processing is necessary again for each item name, and the entire OCR processing step for collecting the recognized information as one record for each frame With reference to the re-OCR designation information, the record corresponding to the item name that needs the OCR processing again is read out, and relative to the entry field corresponding to each item name While referring to the positional information of the entry fields that predefined location relationship, the entry column specifying process step of identifying the answer column should again OCR processing from the position of the item name frame included in the record read out, predefined Based on the character attribute information of each item, the dictionary data suitable for the attribute of the target entry field is used , and the OCR process is partially executed on the entry field specified in the entry field specifying process step. It consists of a partial OCR processing step.

また、本発明の帳票ＯＣＲ装置は、複数の項目に関して、その記入欄と予めプリントされた項目名とが配列され、前記記入欄と項目名とがそれぞれ罫線によって区画された帳票を読み取った帳票イメージから、前記記入欄内の文字を認識する帳票ＯＣＲ装置において、前記帳票イメージの全面に対してＯＣＲ処理を実行し、前記記入欄を区画する記入枠の位置，前記項目名を取り囲む項目名枠の位置，及びこれらの枠内の文字列を認識するとともに、認識した情報を枠毎に１つのレコードとしてまとめる全面ＯＣＲ処理部と、各項目名毎に再度ＯＣＲ処理が必要か否かを予め定義した再ＯＣＲ指定情報を参照して、再度ＯＣＲ処理が必要な項目名に対応する前記レコードを読み出すとともに、各項目名毎に対応する記入欄との相対的な位置関係を予め定義した記入欄の位置情報を参照して、読み出した前記レコードに含まれる項目名枠の位置から再度ＯＣＲ処理をすべき記入欄を特定する記入欄特定処理部と、予め定義された各項目の文字属性情報に基づいて、対象となる記入欄の属性に適合した辞書データを使用し、前記記入欄特定処理部で特定された記入欄に対して部分的にＯＣＲ処理を実行する部分ＯＣＲ処理部とを備えたことを特徴とする帳票ＯＣＲ装置。 Further, the form OCR device of the present invention has a form image obtained by reading a form in which entry fields and pre-printed item names are arranged for a plurality of items, and the entry field and the item names are partitioned by ruled lines, respectively. In the form OCR device for recognizing characters in the entry field, the OCR process is executed on the entire surface of the form image, and the position of the entry frame that divides the entry field, the item name frame surrounding the item name, Recognizes the position and character strings in these frames, and pre-defines whether or not the OCR processing is necessary again for each item name, and the entire OCR processing unit that collects the recognized information as one record for each frame With reference to the re-OCR designation information, the record corresponding to the item name that needs OCR processing again is read, and the relative position with the entry column corresponding to each item name is read. With reference to the predefined position information of the entry field and a entry column identifying unit for identifying the answer column to be re-OCR processing from the position of the item name frame included in the record read out, the predefined Partial OCR that partially executes OCR processing on the entry field specified by the entry field specifying processing unit using dictionary data suitable for the attribute of the entry field to be processed based on the character attribute information of the item A form OCR apparatus comprising a processing unit.

本発明は、複数の項目に関して、その記入欄と予めプリントされた項目名とが配列され、前記記入欄と項目名とがそれぞれ罫線によって区画された帳票を読み取った帳票イメージから、前記記入欄内の文字を認識する帳票ＯＣＲ処理をコンピュータに実行させる帳票ＯＣＲプログラムにおいて、前記帳票イメージの全面に対してＯＣＲ処理を実行し、前記記入欄を区画する記入枠の位置，前記項目名を取り囲む項目名枠の位置，及びこれらの枠内の文字列を認識するとともに、認識した情報を枠毎に１つのレコードとしてまとめる全面ＯＣＲ処理ステップと、各項目名毎に再度ＯＣＲ処理が必要か否かを予め定義した再ＯＣＲ指定情報を参照して、再度ＯＣＲ処理が必要な項目名に対応する前記レコードを読み出すとともに、各項目名毎に対応する記入欄との相対的な位置関係を予め定義した記入欄の位置情報を参照して、読み出した前記レコードに含まれる項目名枠の位置から再度ＯＣＲ処理をすべき記入欄を特定する記入欄特定処理ステップと、予め定義された各項目の文字属性情報に基づいて、対象となる記入欄の属性に適合した辞書データを使用し、前記記入欄特定処理ステップで特定された記入欄に対して部分的にＯＣＲ処理を実行する部分ＯＣＲ処理ステップとからなるので、予め帳票毎に項目のレイアウト情報を準備することなく、項目のレイアウトが異なる帳票に対して柔軟に対応することが可能となり、認識精度の低下を防止することができる。 In the present invention, for a plurality of items, the entry fields and preprinted item names are arranged, and from the form image obtained by reading the form in which the entry fields and the item names are partitioned by ruled lines, In a form OCR program that causes a computer to execute a form OCR process for recognizing the characters of the form, the OCR process is executed on the entire surface of the form image, and the position of the entry frame that divides the entry field and the item names surrounding the item names Recognize the position of the frames and the character strings in these frames, and collect the recognized information as one record for each frame, and whether or not OCR processing is necessary again for each item name in advance. With reference to the defined re-OCR designation information, the record corresponding to the item name that needs OCR processing again is read out, and for each item name, An entry field that specifies the entry field to be subjected to OCR processing again from the position of the item name frame included in the read record with reference to the position information of the entry field in which the relative positional relationship with the entry field to be defined is defined in advance Using the dictionary data suitable for the attribute of the target entry field based on the character processing information of each item defined in advance, and for the entry field identified in the entry field identification process step Since it consists of partial OCR processing steps that partially execute OCR processing, it is possible to flexibly cope with forms with different item layouts without preparing item layout information for each form in advance. A reduction in accuracy can be prevented.

図１に示す帳票ＯＣＲシステム１０は、メインユニット１１，イメージスキャナ１２，イメージデータサーバ１３とからなり、これらは、例えば、ＬＡＮ１４などの通信ネットワーク１４によって接続されている。イメージスキャナ１２の給紙トレイ１２ａには、例えば、数百枚という単位で給与報告書などの帳票１６がセットされる。イメージスキャナ１２は、これらの帳票１６をＣＣＤイメージセンサでスキャンして、１枚の帳票に対して１つの帳票イメージデータ３５（図２参照）を出力する。帳票イメージデータ３５は、画素データの集合であるビットマップデータとして生成される。イメージデータサーバ１３は、ＨＤＤ（ハードディスクドライブ）などのデータストレージデバイスを備えており、イメージスキャナ１２から出力された数千枚分の帳票イメージデータを蓄積する。 A form OCR system 10 shown in FIG. 1 includes a main unit 11, an image scanner 12, and an image data server 13, which are connected by a communication network 14 such as a LAN 14. On the paper feed tray 12a of the image scanner 12, for example, a form 16 such as a salary report is set in units of several hundred sheets. The image scanner 12 scans these forms 16 with a CCD image sensor, and outputs one form image data 35 (see FIG. 2) for one form. The form image data 35 is generated as bitmap data that is a set of pixel data. The image data server 13 includes a data storage device such as an HDD (hard disk drive), and accumulates thousands of form image data output from the image scanner 12.

メインユニット１１は、イメージデータサーバ１３にアクセスして、帳票イメージデータ３５を１つずつ読み取り、読み取った帳票イメージデータ３５に対してＯＣＲ処理を施す。メインユニット１１が認識した文字のデータは、例えば、課税計算システム等に引き渡されて処理される。 The main unit 11 accesses the image data server 13 to read the form image data 35 one by one, and performs OCR processing on the read form image data 35. The character data recognized by the main unit 11 is transferred to, for example, a taxation calculation system and processed.

メインユニット１１は、例えば、パーソナルコンピュータやワークステーションをベースにして、これに帳票ＯＣＲプログラム２８をインストールしたものであり、ＣＰＵ２１，ＲＡＭ２２，操作部２４，ディスプレイ２６，ハードディスクドライブ（ＨＤＤ）２７からなる。これらメインユニット１１の各部は、データバス２３によって接続されている。 The main unit 11 is based on, for example, a personal computer or a workstation and has a form OCR program 28 installed therein, and includes a CPU 21, a RAM 22, an operation unit 24, a display 26, and a hard disk drive (HDD) 27. Each part of the main unit 11 is connected by a data bus 23.

ＣＰＵ２１は、オペレーティングシステムを実行してメインユニット１１の各部を制御するとともに、帳票ＯＣＲプログラム２８を実行する。ＲＡＭ２２は、ＣＰＵ２１がプログラムを実行する際に使用される作業用メモリである。帳票ＯＣＲプログラム２８が実行される際には、ＲＡＭ２２に帳票ＯＣＲプログラム２８や定義データなどがロードされる。操作部２４は、キーボードやマウスなどの入力デバイスからなり、ＣＰＵ２１に対してコマンドを入力したり、処理条件の入力を行う。ディスプレイ２６には、帳票ＯＣＲプログラム２８の操作画面が表示される。ＨＤＤ２７は、データストレージデバイスであり、オペレーティングシステム，帳票ＯＣＲプログラム２８の他、後述する各種の定義データ２９，ＯＣＲ処理で参照する辞書データなどを記憶する。 The CPU 21 executes the operating system to control each part of the main unit 11 and executes the form OCR program 28. The RAM 22 is a working memory used when the CPU 21 executes a program. When the form OCR program 28 is executed, the form OCR program 28 and definition data are loaded into the RAM 22. The operation unit 24 includes an input device such as a keyboard and a mouse, and inputs commands and inputs processing conditions to the CPU 21. On the display 26, an operation screen of the form OCR program 28 is displayed. The HDD 27 is a data storage device and stores an operating system, a form OCR program 28, various definition data 29 described later, dictionary data to be referred to in OCR processing, and the like.

図２は、帳票１６の説明図である。本例においては、給与所得報告書を帳票１６の具体例として説明する。帳票１６は、外枠３１内に、「支払を受ける者」，「住所」，「氏名」，「受給者番号」，「フリガナ」などといった項目名がプレプリントされており、各項目名の近傍には、記入欄が設けられている。各項目名及び記入欄は、罫線によって区画されている。例えば、「受給者番号」や「支払金額」という項目名は、それぞれ枠３２ａ，３３ａによって区画されており、「受給者番号」の項目名枠３２ａの右隣に隣接する枠３２ｂは、受給者番号そのものが記入される記入欄を構成する記入枠であり、「支払金額」の項目名枠３３ａの下に隣接する記入欄も枠３３ｂによって区画されている。 FIG. 2 is an explanatory diagram of the form 16. In this example, a salary income report will be described as a specific example of the form 16. The form 16 is pre-printed with item names such as “payee”, “address”, “name”, “recipient number”, “reading”, etc. in the outer frame 31, and in the vicinity of each item name. Has an entry field. Each item name and entry field are partitioned by ruled lines. For example, the item names “recipient number” and “payment amount” are divided by frames 32a and 33a, respectively, and a frame 32b adjacent to the right of the item name frame 32a of “recipient number” is a recipient. This is an entry frame that constitutes an entry field in which the number itself is entered. An entry field adjacent to the “payment amount” item name box 33a is also divided by a frame 33b.

図３は、帳票ＯＣＲプログラム２８の帳票ＯＣＲ処理手順の全体を示すフローチャートである。帳票ＯＣＲプログラム２８の処理ステップは、帳票イメージ取り込み処理，全面ＯＣＲ処理，記入欄特定処理，部分ＯＣＲ処理からなる。記入欄特定処理は、項目名枠特定処理と再ＯＣＲエリア設定処理からなる。 FIG. 3 is a flowchart showing the overall procedure of the form OCR processing of the form OCR program 28. The processing steps of the form OCR program 28 include a form image capturing process, a full OCR process, an entry field specifying process, and a partial OCR process. The entry field specifying process includes an item name frame specifying process and a re-OCR area setting process.

帳票イメージ取り込み処理は、イメージデータサーバ１３から、帳票イメージデータ３５を１帳票分ずつ読み出す。この帳票イメージデータ３５に対して全面ＯＣＲ処理が実行される。 In the form image capturing process, the form image data 35 is read from the image data server 13 for each form. A full OCR process is performed on the form image data 35.

全面ＯＣＲ処理は、帳票１６の全面に対してＯＣＲ処理を実行するとともに、外枠３１内に存在するすべての項目名枠と記入枠とを認識するとともに、各項目の項目名や記入欄内の文字列を認識する。認識された枠は、座標情報で表現されるベクトルデータに変換され、文字は、テキストデータに変換される。 In the full OCR process, the OCR process is executed on the entire surface of the form 16, and all the item name frames and entry frames existing in the outer frame 31 are recognized. Recognize character strings. The recognized frame is converted into vector data represented by coordinate information, and the character is converted into text data.

図４は、全面ＯＣＲ処理の手順を示すフローチャートである。全面ＯＣＲ処理は、外枠３１の左上に設定された原点Ｏを起点として、左端から右端に向かって順に行われ、最終的に右下の頂点に至る。全画面ＯＣＲ処理では、１つの枠を検出すると、その枠情報，枠内の行情報，枠内の文字情報を認識する。 FIG. 4 is a flowchart showing the procedure of the entire OCR process. The entire OCR process is performed in order from the left end to the right end with the origin O set at the upper left of the outer frame 31 as the starting point, and finally reaches the lower right vertex. In the full screen OCR process, when one frame is detected, the frame information, line information in the frame, and character information in the frame are recognized.

図５（Ａ）に示すように、これら認識した情報は、枠毎に１つの認識情報レコードとしてまとめられ、当該認識情報レコードには、枠番号として、認識した順序でシーケンス番号が付与される。行情報には、枠内の文字が記入される行数及びその行の座標情報が含まれる。枠内の文字情報には、各行毎の文字数や、認識した文字そのもの、各文字の座標情報が含まれる。そして、全面分の認識情報レコードをまとめて、全面ＯＣＲ結果ファイル３６として出力する。出力された全面ＯＣＲ結果ファイル３６は、ＲＡＭ２２や、ＨＤＤ２７に設定されたワーク領域に一時的に記憶される。 As shown in FIG. 5A, the recognized information is collected as one recognition information record for each frame, and sequence numbers are assigned to the recognition information records in the order of recognition as frame numbers. The line information includes the number of lines in which characters in the frame are entered and the coordinate information of the lines. The character information in the frame includes the number of characters for each line, the recognized character itself, and the coordinate information of each character. Then, the recognition information records for the entire surface are collected and output as the entire OCR result file 36. The output entire OCR result file 36 is temporarily stored in the work area set in the RAM 22 or the HDD 27.

図５（Ｂ）は、全面ＯＣＲ結果ファイル３６の内容のより具体的な説明図である。帳票１６において、一番左上の枠は最初に認識されるので、枠番号として「１」が付与される。その枠には、「支払を受ける者」という文字がプレプリントされており、このプレプリントされた文字を認識した文字情報が、正確に認識されると「支払を受ける者」という認識文字となる。文字認識率は１００％ではないので、正確に認識できない場合もある。その場合には、誤認識した文字情報が、そのまま認識文字となる。また、この枠内の行数は、１行目が「支払」，２行目が「を受け」，３行目が「る者」というように、３行に渡っているので、枠内の行数は「３」となる。各文字の座標は、１文字毎にその左上と右下のそれぞれのＸＹ座標が抽出される。 FIG. 5B is a more specific explanatory diagram of the contents of the entire OCR result file 36. In the form 16, since the upper left frame is recognized first, “1” is assigned as the frame number. The frame is pre-printed with the characters “Payee”, and if the character information recognizing this preprinted character is correctly recognized, it becomes the recognition character “Payee”. . Since the character recognition rate is not 100%, it may not be recognized correctly. In that case, the misrecognized character information becomes the recognized character as it is. In addition, the number of lines in this frame is 3 lines, such as “Payment” for the first line, “Received” for the second line, and “Ru” for the third line. The number of rows is “3”. As for the coordinates of each character, the XY coordinates of the upper left and lower right are extracted for each character.

帳票ＯＣＲプログラム２８は、辞書データ３０（図１参照）を参照してＯＣＲ処理を実行する。この辞書データ３０には、システム辞書とユーザー辞書とがある。システム辞書は、英数字，記号，かな，カタカナ，漢字など複数の文字の属性に関わらず汎用的に使用される辞書であるのに対して、ユーザー辞書は、文字の各属性に特化した専用の辞書であり、英数字用のユーザー辞書，記号用のユーザー辞書など、各属性毎に複数の種類がある。ユーザー辞書は、該当する属性の文字認識率は、システム辞書に比較してはるかに高いが、他の属性の文字認識には使用できない。これらシステム辞書とユーザー辞書とは、ＨＤＤ２７に記憶されており、ＣＰＵ２１が帳票ＯＣＲプログラム２７を実行する際に適宜使用される。全面ＯＣＲ処理においては、異なる属性の項目が複数混在する全面がＯＣＲ対象エリアなので、システム辞書が選択される。 The form OCR program 28 executes OCR processing with reference to the dictionary data 30 (see FIG. 1). The dictionary data 30 includes a system dictionary and a user dictionary. The system dictionary is a dictionary that is used universally regardless of the attributes of multiple characters such as alphanumeric characters, symbols, kana, katakana, and kanji, whereas the user dictionary is dedicated to each character attribute. There are several types for each attribute, such as an alphanumeric user dictionary and a symbol user dictionary. The user dictionary has a character recognition rate of the corresponding attribute much higher than that of the system dictionary, but cannot be used for character recognition of other attributes. The system dictionary and the user dictionary are stored in the HDD 27 and are used as appropriate when the CPU 21 executes the form OCR program 27. In the entire OCR process, since the entire area where a plurality of items having different attributes are mixed is the OCR target area, the system dictionary is selected.

全面ＯＣＲ処理が終了すると、項目名枠特定処理が実行される。項目名枠特定処理は、項目定義ファイル３７に基づいて、全面ＯＣＲ結果ファイル３６内のすべての認識情報レコードのうち、項目名枠の認識情報レコードを特定する。 When the entire OCR process is completed, an item name frame specifying process is executed. In the item name frame specifying process, the recognition information record of the item name frame is specified among all the recognition information records in the entire OCR result file 36 based on the item definition file 37.

図６（Ａ）に示すように、項目定義ファイル３７は、帳票１６に記載される項目名毎の複数の定義レコードからなり、各定義レコードには、項目名と、各項目名毎に再ＯＣＲが必要か否かを指定する再ＯＣＲ指定情報と、各項目名とそれらに対応する記入欄との相対的な位置関係を示す記入欄の位置情報と、各項目名の近傍の項目名との相対位置情報とが含まれている。また、図示しないが、この項目定義ファイル３７には、後述するように、再度ＯＣＲ処理を実行するエリアの項目番号である再ＯＣＲ項目番号が含まれている。 As shown in FIG. 6A, the item definition file 37 includes a plurality of definition records for each item name described in the form 16, and each definition record includes an item name and a re-OCR for each item name. Re-OCR designation information for designating whether or not the item name is necessary, position information in the entry column indicating the relative positional relationship between each item name and the entry column corresponding to each item name, and item names in the vicinity of each item name Relative position information. Although not shown, the item definition file 37 includes a re-OCR item number that is an item number of an area in which OCR processing is executed again, as will be described later.

記入欄の位置情報は、各項目名のどの方向に隣接して記入欄が存在するかを示す情報である。この記入欄の位置情報は、数字で規定されており、それぞれの数字には、図６（Ｂ）に示すように、「１」は、「項目名の右に位置する枠が記入欄」、「２」は、「項目名の下に位置する枠が記入欄」というように、それぞれの意味が定義されている。例えば、支払金額の項目は、その項目名の下に記入欄が位置するので、記入欄の位置情報は、「２」と指定される。 The position information of the entry column is information indicating in which direction of each item name the entry column is adjacent. The position information in this entry field is defined by numbers. As shown in FIG. 6 (B), “1” is “the box located to the right of the item name is the entry field”, The meaning of “2” is defined as “the frame positioned under the item name is an entry field”. For example, since an entry column is located under the item name for the item of payment amount, the position information of the entry column is designated as “2”.

近傍の項目名との相対位置情報は、具体的には、「受給者番号」という項目名の左には「氏名」という項目名があり、下には「フリガナ」という項目名があるという形で記述される。この近傍の項目名との相対位置情報は、後述するように、項目名枠特定処理において、ある項目名をキーに、それに対応する項目名枠を特定できなかった場合に使用される。 Specifically, the relative position information with the nearby item name is such that the item name “name” is on the left of the item name “recipient number”, and the item name “phonetic” is below. It is described by. The relative position information with the neighboring item names is used when an item name frame corresponding to a certain item name cannot be specified in the item name frame specifying process as described later.

図７に示すフローチャートは、項目名枠特定処理の具体的な手順を示す。まず、項目定義ファイル３７から項目名枠を特定すべき１つの項目名を読み出し、その項目名と、全面ＯＣＲ結果ファイル３６に含まれる認識文字とを照合することにより、前記項目名に対応する項目名枠の枠番号をサーチする。そして、認識文字と項目名とが一致した場合には、図８に示すように、全面ＯＣＲ結果ファイル３６からその枠番号を読み出し、これを項目定義ファイル３７の対応する項目名のレコードに追加して、項目名枠特定データファイル３８を生成する。 The flowchart shown in FIG. 7 shows a specific procedure of the item name frame specifying process. First, one item name for which the item name frame is to be specified is read from the item definition file 37, and the item name and the recognition character included in the full-scale OCR result file 36 are collated, thereby the item corresponding to the item name. Search for the frame number of the name frame. If the recognized character matches the item name, the frame number is read from the full OCR result file 36 and added to the corresponding item name record in the item definition file 37 as shown in FIG. Thus, the item name frame specifying data file 38 is generated.

例えば、「受給者番号」の項目名枠を特定する場合には、項目定義ファイル３７から、「受給者番号」を読み出し、この「受給者番号」をキーに、全面ＯＣＲ結果データファイル３６内の認識文字と照合を行うことにより、特定対象となる項目名枠の枠番号をサーチする。サーチできた場合、すなわち、全面ＯＣＲ結果データファイル３６内に「受給者番号」という文字列が存在した場合には、その認識文字に対応する枠番号を項目定義ファイル３６の受給者番号のレコードに追加して、項目名枠特定データファイル３８を作成する。 For example, when the item name frame of “recipient number” is specified, “recipient number” is read from the item definition file 37, and this “recipient number” is used as a key in the entire OCR result data file 36. By matching with the recognized character, the frame number of the item name frame to be specified is searched. If the search can be performed, that is, if the character string “recipient number” exists in the full OCR result data file 36, the frame number corresponding to the recognized character is stored in the record of the recipient number in the item definition file 36. In addition, the item name frame specifying data file 38 is created.

しかし、全面ＯＣＲ処理において、文字列を誤認識していたり認識不能だった場合には、当然ながら全面ＯＣＲ結果ファイル３６内に「受給者番号」という文字列は存在しない。このように特定すべき項目名枠の枠番号をサーチできなかった場合には、特定すべき項目名の近傍に位置する項目名をキーにサーチ処理を実行する。例えば、「受給者番号」という文字列が存在しない場合には、項目定義データファイル３７内の相対位置情報を参照して、「氏名」や「フリガナ」といった、「受給者番号」の近傍に位置する項目名を調べ、その項目名をキーにサーチ処理を実行する。そして、全面ＯＣＲ結果データファイル３６内に「氏名」という文字列が見つかった場合には、前記相対位置情報（「氏名」の右側に「受給者番号」が存在する）に基づいて、「受給者番号」の項目名枠の枠番号を推定する。この推定した枠番号を、検索対象となる項目名枠の枠番号として項目定義データに追加する。 However, if the character string is misrecognized or cannot be recognized in the full OCR process, the character string “recipient number” does not exist in the full OCR result file 36 as a matter of course. When the frame number of the item name frame to be specified cannot be searched in this way, the search process is executed using the item name located in the vicinity of the item name to be specified as a key. For example, when the character string “recipient number” does not exist, the relative position information in the item definition data file 37 is referred to, and a position in the vicinity of “recipient number” such as “name” or “phonetic” is displayed. The item name to be checked is checked, and search processing is executed using the item name as a key. When the character string “name” is found in the entire OCR result data file 36, based on the relative position information (“receiver number” exists on the right side of “name”), The frame number of the item name frame of “number” is estimated. The estimated frame number is added to the item definition data as the frame number of the item name frame to be searched.

このように、所望の項目名をキーにそれに対応する項目名枠の特定ができなかった場合に、所望の項目名の近傍に位置する項目名をキーに前記項目名枠を推定することにより、全面ＯＣＲ処理において文字列を誤認識したり認識不能であった場合でも、所望の項目名枠を特定することが可能になる。 As described above, when the item name frame corresponding to the desired item name cannot be specified, the item name frame is estimated using the item name located in the vicinity of the desired item name as a key, Even when the character string is misrecognized or cannot be recognized in the entire OCR process, a desired item name frame can be specified.

こうした近傍の項目名によるサーチは、全面ＯＣＲ結果ファイル３６の全データに渡って実行してもよい。例えば、「受給者番号」の項目名枠を特定する際に、まずはじめに、「受給者番号」をキーにサーチを行い、それでサーチが不能な場合には、その近傍にある「氏名」や「フリガナ」といった項目名がサーチキーとして使用され、それでも見つからない場合には、「氏名」や「フリガナ」の近傍の項目名をキーにサーチを行うというように、全データに渡ってサーチを実行することも可能である。しかし、こうすると、サーチ処理の負荷が増大して、サーチ時間も非常に大きくなる。 Such a search based on item names in the vicinity may be executed over all data in the full OCR result file 36. For example, when specifying the item name frame of “recipient number”, first, a search is performed using “recipient number” as a key, and if the search is impossible, the “name” and “ If an item name such as “Reading” is used as a search key and still cannot be found, a search is performed across all data, such as searching using the item name in the vicinity of “Name” or “Reading”. It is also possible. However, this increases the load of the search process, and the search time becomes very long.

そこで、帳票ＯＣＲプログラム２８では、推定処理の際のサーチ範囲を規定することで、サーチ範囲を限定している。図２に示すバンド（バンド１〜５）とは、それぞれサーチ範囲を示し、項目定義ファイル３７（図６参照）にはそれぞれの項目がどのバンドに属するかを示すバンドＮｏが含まれている。バンドは、帳票１６の筆記方向、すなわち本例においては帳票１６は横書きなので、横方向に延びた帯状のエリアとして定義される。本例では、１番左上の「支払を受ける者」の項目の幅をバンド１とし、「種別」，「支払金額」，「給与所得控除後の金額」，「所得控除の額の合計額」，「源泉徴収額」の各項目が並ぶ幅をバンド２というようにバンドを定義している。このバンドの定義は、座標情報などの物理的な位置情報によってなされるのではなく、項目名によって論理的に定義される。すなわち、バンド１の定義は、バンド１の範囲を座標情報によって定義するのではなく、バンド１内に含まれる複数の項目名（氏名，フリガナなど）を記述することによって行われる。 Therefore, the form OCR program 28 limits the search range by defining the search range in the estimation process. The bands (bands 1 to 5) shown in FIG. 2 indicate search ranges, respectively, and the item definition file 37 (see FIG. 6) includes a band number indicating which band each item belongs to. The band is defined as a band-shaped area extending in the horizontal direction since the writing direction of the form 16, that is, in this example, the form 16 is written horizontally. In this example, the width of the “payee” item at the top left is band 1 and “type”, “payment amount”, “amount after deduction of salary income”, “total amount of income deduction” , “Band 2” is defined as the width in which each item of “withholding amount” is arranged. This band is not defined by physical position information such as coordinate information, but is logically defined by item names. That is, the definition of band 1 is not performed by defining the range of band 1 by coordinate information, but by describing a plurality of item names (name, reading, etc.) included in band 1.

このように、サーチ範囲を限定したことで、例えば、「種別」という項目名枠を特定する場合には、推定処理に使用されるサーチキーが、「支払金額」，「給与所得控除後の金額」，「所得控除の額の合計額」，「源泉徴収額」の４つの項目名に限定される。これにより、サーチ処理の負荷が軽減されサーチ時間が短くなる。 In this way, by limiting the search range, for example, when specifying the item name frame “type”, the search key used for the estimation process is “payment amount”, “amount after deduction of salary income” ”,“ Total amount of deduction for income ”, and“ Withholding amount ”. This reduces the load of search processing and shortens the search time.

また、近傍の項目名をキーにサーチ処理を実行しても、所望の項目名枠を特定できない場合にはエラーとする。エラーの場合には、項目名枠特定データファイル３８の枠番号欄は空白となる。こうした項目名枠特定処理によって、項目定義ファイル３７の各項目名のレコードと、全面ＯＣＲ結果ファイル３６の各レコードとが対応付けられる。 Further, if a desired item name frame cannot be specified even if the search process is executed using a nearby item name as a key, an error occurs. In the case of an error, the frame number field of the item name frame specifying data file 38 is blank. By such an item name frame specifying process, the record of each item name in the item definition file 37 is associated with each record in the entire OCR result file 36.

項目名枠特定処理の後には、再ＯＣＲエリア設定処理が実行される。再ＯＣＲエリア設定処理は、項目名枠特定データファイル３８と、再ＯＣＲ項目定義ファイル３９とに基づいて、再ＯＣＲすべきエリアを指定する再ＯＣＲエリアデータファイル４１を出力する。 After the item name frame specifying process, a re-OCR area setting process is executed. The re-OCR area setting process outputs a re-OCR area data file 41 for designating an area to be re-OCR based on the item name frame specifying data file 38 and the re-OCR item definition file 39.

図９に示すように、再ＯＣＲ項目定義ファイル３９は、再度ＯＣＲすべき項目の記入欄に関する情報を、項目毎に定義したファイルである。再ＯＣＲ項目は、例えば、「受給者番号」，「フリガナ」，「種別」，「支払金額」，「給与所得控除後の金額」，「所得控除の額の合計額」，「源泉徴収額」などである。これらの項目は、項目定義ファイル３７において、再ＯＣＲ指定がなされる。再ＯＣＲ項目定義ファイル３９は、各項目毎に、再ＯＣＲ項目番号，項目名，属性とを含む情報が１レコードになっている。属性情報は、英字，数字，カナ，かな漢字など記入される項目の文字属性の情報である。再ＯＣＲ処理をする際には、この属性情報に基づいて、属性に適合するユーザー辞書が選択される。 As shown in FIG. 9, the re-OCR item definition file 39 is a file in which information related to the entry column of items to be OCR again is defined for each item. The re-OCR item includes, for example, “recipient number”, “phonetic”, “type”, “payment amount”, “amount after deduction of salary income”, “total amount of income deduction”, “withholding amount” Etc. These items are designated for re-OCR in the item definition file 37. In the re-OCR item definition file 39, information including a re-OCR item number, an item name, and an attribute is one record for each item. The attribute information is information on character attributes of items to be entered such as English letters, numbers, kana, kana and kanji. When the re-OCR process is performed, a user dictionary that matches the attribute is selected based on the attribute information.

図１０は、再ＯＣＲエリア設定処理の手順を示すフローチャートである。再ＯＣＲエリア設定処理では、まず、項目名枠特定データファイル３８から、再ＯＣＲ指定がなされている項目名枠のレコードを１つ読み出す。そして、当該項目名枠に対応する記入欄の位置情報に基づいて、全面ＯＣＲ結果ファイル３６内の各認識情報レコードの中から、記入欄の認識情報レコードを特定し、抽出する。 FIG. 10 is a flowchart showing the procedure of the re-OCR area setting process. In the re-OCR area setting process, first, one item name frame record for which re-OCR is specified is read from the item name frame specifying data file 38. Then, based on the position information of the entry field corresponding to the item name frame, the recognition information record of the entry field is identified and extracted from among the recognition information records in the entire OCR result file 36.

そして、再ＯＣＲ項目定義ファイル３９から該当する項目の属性情報を読み出し、その属性情報に基づいて、記入欄内の不要な認識文字の情報を除去する。例えば、図１１に示すように、支払金額の記入欄４６には、その欄内に、金額を示す文字列「５，６００，０００」の他、その上の行に、「内」，「円」といった文字列がプレプリントされている。全面ＯＣＲ結果ファイル３６の認識情報レコードには、記入欄４６の枠情報とその枠内の行情報や文字情報がすべて格納されている。 Then, the attribute information of the corresponding item is read from the re-OCR item definition file 39, and unnecessary recognition character information in the entry column is removed based on the attribute information. For example, as shown in FIG. 11, the payment amount entry field 46 includes a character string “5,600,000” indicating the amount in the field, and “inside” and “yen” in the line above it. "Is preprinted. The recognition information record of the entire OCR result file 36 stores all the frame information in the entry field 46 and the line information and character information in the frame.

再ＯＣＲ処理をすべきエリアは、記入欄４６内のうち、金額そのもの（「５，６００，０００」）が記述されたエリアのみでよい。その他の文字列は、認識不要であるばかりでなく、必要な文字列を認識する際のノイズになるおそれもある。このため、帳票ＯＣＲプログラム２８では、再ＯＣＲエリア設定処理において、前記認識情報レコードから、こうした不要文字に関する情報を除去している。不要文字か否かの判断は、属性情報に基づいて行われる。例えば、支払金額の属性は数字であるので、文字列の中から数字以外のものが不要文字と判断される。不要文字が除去されると、認識情報レコードには、記入欄４６の枠情報と、その欄内の金額の位置を示す文字座標４６ａとが残る。 The area where the re-OCR processing is to be performed is only the area in which the amount of money (“5,600,000”) is described in the entry field 46. Other character strings need not be recognized, but may cause noise when recognizing necessary character strings. For this reason, the form OCR program 28 removes information on such unnecessary characters from the recognition information record in the re-OCR area setting process. The determination of whether or not the character is an unnecessary character is made based on the attribute information. For example, since the attribute of the payment amount is a number, a character other than a number is determined as an unnecessary character from the character string. When the unnecessary character is removed, the frame information of the entry field 46 and the character coordinates 46a indicating the position of the amount of money in the field remain in the recognition information record.

こうして不要な認識文字情報が除去された認識情報レコードと、その記入欄の属性情報とを結合したデータが、再ＯＣＲエリアデータとして出力される。こうした処理が、再ＯＣＲ処理を実行する全項目に対して行われ、全項目分のデータをまとめた再ＯＣＲエリアデータファイル４１が生成される。 Data obtained by combining the recognition information record from which unnecessary recognition character information is removed in this way and the attribute information in the entry column is output as re-OCR area data. Such processing is performed for all items for which re-OCR processing is executed, and a re-OCR area data file 41 in which data for all items are collected is generated.

このように、項目枠特定処理と再ＯＣＲエリア設定処理とを行うことにより、再度ＯＣＲすべき記入欄の特定が行われる。 In this way, by performing the item frame specifying process and the re-OCR area setting process, the entry field to be OCR again is specified.

部分ＯＣＲ処理は、再ＯＣＲエリアデータファイル４１を参照して、帳票イメージの再ＯＣＲ指定されたエリアに対して部分的にＯＣＲ処理を実行する。まず、再ＯＣＲエリアデータファイル４１から、記入欄のエリア情報を１項目分読み出す。次に、読み出した項目の属性に対応するユーザー辞書を設定する。例えば、支払金額の記入欄の場合には、属性が数字であるので、数字用のユーザー辞書を設定する。そして、エリア情報の座標情報から、再ＯＣＲエリアを特定し、そのエリアのＯＣＲ処理を実行する。こうした処理を全項目分繰り返す。認識された文字列は、再ＯＣＲ項目番号，項目名とともに、部分ＯＣＲ結果ファイル４２に出力される。このように、文字属性に応じたユーザー辞書を使用して部分ＯＣＲ処理が行われるので、精度が高い文字認識を行うことができる。 The partial OCR process refers to the re-OCR area data file 41 and partially executes the OCR process for the area of the form image designated as re-OCR. First, one item of area information in the entry column is read from the re-OCR area data file 41. Next, a user dictionary corresponding to the attribute of the read item is set. For example, in the payment amount entry field, the attribute is a number, so a user dictionary for numbers is set. Then, the re-OCR area is identified from the coordinate information of the area information, and the OCR process for that area is executed. This process is repeated for all items. The recognized character string is output to the partial OCR result file 42 together with the re-OCR item number and the item name. In this way, partial OCR processing is performed using a user dictionary corresponding to the character attribute, so that character recognition with high accuracy can be performed.

以下、上記構成による作用について説明する。オペレータが、イメージスキャナ１２に帳票１６をセットして、読み取り指示を与えると、イメージスキャナ１２が帳票１６をイメージデータに変換し、そのイメージデータがイメージデータサーバ１３に蓄積される。次に、オペレータがメインユニット１１から、帳票ＯＣＲ処理実行指示を与えると、帳票ＯＣＲプログラム２８が起動する。帳票ＯＣＲプログラムは、イメージデータサーバ１３から帳票イメージデータを１ファイルずつ取り込み、帳票ＯＣＲ処理を実行する。 Hereinafter, the operation of the above configuration will be described. When the operator sets the form 16 in the image scanner 12 and gives a reading instruction, the image scanner 12 converts the form 16 into image data, and the image data is stored in the image data server 13. Next, when the operator gives a form OCR process execution instruction from the main unit 11, the form OCR program 28 is activated. The form OCR program takes the form image data from the image data server 13 one file at a time and executes the form OCR process.

まず、全面ＯＣＲ処理が実行されて、外枠３１内の全項目分の項目名枠，記入枠及び枠内の文字列が認識され、これらの認識情報が枠毎に認識情報レコードとしてまとめられ、全面ＯＣＲ結果ファイル３６として出力される。この全面ＯＣＲ処理により、帳票１６の各項目の枠のレイアウトが認識される。 First, the entire OCR process is executed to recognize item name frames, entry frames, and character strings in the frames for all items in the outer frame 31, and these recognition information are grouped into recognition information records for each frame. The entire OCR result file 36 is output. By this entire OCR process, the frame layout of each item of the form 16 is recognized.

次に、この全面ＯＣＲ結果ファイル３６と、項目定義ファイル３７とに基づいて、項目名枠特定処理が実行されて、項目名枠が特定される。この項目名枠特定処理においては、所望の項目名をキーに、対応する項目名枠が特定されるが、その項目名で特定ができない場合には、近傍の項目名をキーとして、所望の項目名枠が推定される。このため、全面ＯＣＲ処理において、所望の項目名を誤認識していたり、認識不能であった場合でも、項目名枠の特定が可能になるので、記入欄の特定率が向上する。項目名枠特定処理の結果は、項目名枠特定データファイル３８として出力される。 Next, an item name frame specifying process is executed based on the entire OCR result file 36 and the item definition file 37 to specify an item name frame. In this item name frame identification process, the corresponding item name frame is identified using the desired item name as a key. If the item name frame cannot be identified, the desired item name is identified using the neighboring item name as a key. Name slots are estimated. For this reason, in the full-screen OCR process, even when a desired item name is erroneously recognized or cannot be recognized, the item name frame can be specified, so that the specification rate of the entry column is improved. The result of the item name frame specifying process is output as an item name frame specifying data file 38.

この前記項目名枠特定データファイル３８と再ＯＣＲ項目定義ファイル３９とに基づいて、再ＯＣＲエリア設定処理が実行されて、再ＯＣＲエリア（再度ＯＣＲすべき記入欄）が特定される。この再ＯＣＲエリア設定処理によって、記入欄の位置及びその属性の特定，及び不要文字情報の除去が行われ、その結果情報として再ＯＣＲエリアデータファイル４１が出力される。 Based on the item name frame specifying data file 38 and the re-OCR item definition file 39, a re-OCR area setting process is executed to specify a re-OCR area (an entry field to be OCR again). By this re-OCR area setting process, the position of the entry field and its attribute are specified, and unnecessary character information is removed, and as a result, the re-OCR area data file 41 is output.

部分ＯＣＲ処理は、この再ＯＣＲエリアデータファイル４１に基づいて、指定された記入欄に対して再度ＯＣＲ処理を実行する。この部分ＯＣＲ処理では、属性情報に基づいて、対象となる記入欄に適合したユーザー辞書が選択されるから、精度の高い文字認識が可能となる。 In the partial OCR process, based on the re-OCR area data file 41, the OCR process is executed again for the designated entry field. In this partial OCR process, a user dictionary suitable for the target entry field is selected based on the attribute information, so that highly accurate character recognition is possible.

このように、帳票ＯＣＲプログラム２８は、まず、全面ＯＣＲ処理により、帳票１６の各項目のレイアウトを認識した後、その結果情報と項目定義データに基づいて記入欄を特定している。このため、項目のレイアウトが異なる場合でも、予め帳票毎のレイアウト情報を準備することなく、必要な項目名を含む項目定義データを準備するだけで済むので、柔軟な対応が可能となり、認識精度の低下がなくなる。 As described above, the form OCR program 28 first recognizes the layout of each item of the form 16 by the full OCR process, and then specifies the entry field based on the result information and the item definition data. For this reason, even if the layout of the items is different, it is only necessary to prepare item definition data including necessary item names without preparing layout information for each form in advance. No decrease.

上記実施形態では、帳票ＯＣＲシステムのメインユニットとして、汎用的なパーソナルコンピュータやワークステーションをベースに帳票ＯＣＲプログラムをインストールした形態の帳票ＯＣＲ装置を使用し、各処理ステップのすべてをコンピュータがソフトウエアを実行することにより実現する例で説明しているが、もちろん、メインユニットとしては、各処理ステップのうち少なくとも一部を専用のハードウエアによって実行する処理部を備えた専用の帳票ＯＣＲ装置を使用してもよい。 In the above-described embodiment, a form OCR apparatus in which a form OCR program is installed on the basis of a general-purpose personal computer or workstation is used as the main unit of the form OCR system. Although explained in the example realized by executing, of course, as the main unit, a dedicated form OCR device having a processing unit for executing at least a part of each processing step by dedicated hardware is used. May be.

帳票ＯＣＲシステムの全体構成図である。It is a whole block diagram of a form OCR system. 帳票の説明図である。It is explanatory drawing of a form. 帳票ＯＣＲ処理の全体の手順を示すフローチャートである。It is a flowchart which shows the procedure of the whole form OCR process. 全面ＯＣＲ処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a whole surface OCR process. 全面ＯＣＲ結果ファイルの説明図である。It is explanatory drawing of a whole surface OCR result file. 項目定義ファイルの説明図である。It is explanatory drawing of an item definition file. 項目名枠特定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of an item name frame specific process. 項目名枠特定処理の説明図である。It is explanatory drawing of an item name frame specific process. 再ＯＣＲエリア設定処理の説明図である。It is explanatory drawing of a re-OCR area setting process. 再ＯＣＲエリア設定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a re-OCR area setting process. 不要文字除去処理の説明図である。It is explanatory drawing of an unnecessary character removal process. 部分ＯＣＲ処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a partial OCR process.

Explanation of symbols

１０帳票ＯＣＲシステム
１１メインユニット
１６帳票
２１ＣＰＵ
２２ＲＡＭ
２７ＨＤＤ
３５帳票イメージデータ
３６全面ＯＣＲ結果ファイル
３７項目定義ファイル
３８項目名枠特定データファイル
３９再ＯＣＲ項目定義ファイル
４１再ＯＣＲエリアデータファイル
４２部分ＯＣＲ結果ファイル 10 Form OCR system 11 Main unit 16 Form 21 CPU
22 RAM
27 HDD
35 Form image data 36 Full OCR result file 37 Item definition file 38 Item name frame specific data file 39 Re-OCR item definition file 41 Re-OCR area data file 42 Partial OCR result file

Claims

For a plurality of items, the entry fields and preprinted item names are arranged, and characters in the entry fields are recognized from a form image obtained by reading a form in which the entry fields and the item names are partitioned by ruled lines. In a form OCR program that causes a computer to execute form OCR processing
OCR processing is performed on the entire surface of the form image to recognize the position of the entry frame that divides the entry field, the position of the item name frame surrounding the item name, and the character string in these frames. Complete OCR processing step to collect the information as one record per frame,
With reference to re-OCR designation information that defines in advance whether or not OCR processing is necessary again for each item name, the record corresponding to the item name that requires OCR processing is read out again, and corresponding to each item name Specify the entry field to identify the entry field to be subjected to the OCR process again from the position of the item name frame included in the read record with reference to the position information of the entry field in which the relative positional relationship with the entry field is defined in advance. Processing steps;
Based on the character attribute information of each item defined in advance, the dictionary data suitable for the attribute of the target entry field is used , and the OCR process is partially performed on the entry field specified in the entry field specifying process step. A form OCR program characterized by comprising a partial OCR processing step for executing.

For a plurality of items, the entry fields and preprinted item names are arranged, and characters in the entry fields are recognized from a form image obtained by reading a form in which the entry fields and the item names are partitioned by ruled lines. In the form OCR method
OCR processing is performed on the entire surface of the form image to recognize the position of the entry frame that divides the entry field, the position of the item name frame surrounding the item name, and the character string in these frames. Complete OCR processing step to collect the information as one record per frame,
With reference to re-OCR designation information that defines in advance whether or not OCR processing is necessary again for each item name, the record corresponding to the item name that requires OCR processing is read out again, and corresponding to each item name Specify the entry field to identify the entry field to be subjected to the OCR process again from the position of the item name frame included in the read record with reference to the position information of the entry field in which the relative positional relationship with the entry field is defined in advance. Processing steps;
Based on the character attribute information of each item defined in advance, the dictionary data suitable for the attribute of the target entry field is used , and the OCR process is partially performed on the entry field specified in the entry field specifying process step. A form OCR method characterized by comprising: a partial OCR processing step for executing

For a plurality of items, the entry fields and preprinted item names are arranged, and characters in the entry fields are recognized from a form image obtained by reading a form in which the entry fields and the item names are partitioned by ruled lines. In the form OCR device
OCR processing is performed on the entire surface of the form image to recognize the position of the entry frame that divides the entry field, the position of the item name frame surrounding the item name, and the character string in these frames. A full-screen OCR processing unit that collects the information as one record for each frame,
With reference to re-OCR designation information that defines in advance whether or not OCR processing is necessary again for each item name, the record corresponding to the item name that requires OCR processing is read again, and corresponding to each item name Specify the entry field to identify the entry field to be subjected to the OCR process again from the position of the item name frame included in the read record with reference to the position information of the entry field in which the relative positional relationship with the entry field is defined in advance. A processing unit;
Based on the character attribute information of each item defined in advance, the dictionary data suitable for the attribute of the target entry field is used , and the OCR process is partially performed on the entry field specified by the entry field specifying processing unit. A form OCR apparatus comprising a partial OCR processing unit for executing