JP2013030040A

JP2013030040A - Information processing program, information processor, and character recognition method

Info

Publication number: JP2013030040A
Application number: JP2011166451A
Authority: JP
Inventors: Masaaki Ozawa; 昌昭小澤; Shohei Hasegawa; 将平長谷川; Akira Takada; 亮高田; Hirotaka Inoue; 博貴井上; Kazuo Nakamura; 一夫中村
Original assignee: Fujitsu Frontech Ltd
Current assignee: Fujitsu Frontech Ltd
Priority date: 2011-07-29
Filing date: 2011-07-29
Publication date: 2013-02-07
Anticipated expiration: 2031-07-29
Also published as: JP5566971B2

Abstract

PROBLEM TO BE SOLVED: To efficiently perform character recognition processing.SOLUTION: A detection section 1a detects identification information 2a showing that the layout of a form is changed from image information 2 obtained by imaging the form. When the detection section 1a detects the identification information 2a, a region specifying section 1b detects region identification information 2b for identifying a processing object region 3 for performing prescribed character recognition processing which is included in the image information 2 to specify the processing object region 3 on the basis of the region identification information 2b. A processing section 1c performs prescribed character recognition processing on the specified processing object region 3.

Description

本件は文字認識を行う情報処理プログラム、情報処理装置および文字認識方法に関する。 The present invention relates to an information processing program, an information processing apparatus, and a character recognition method for performing character recognition.

従来、紙面に記入された文字（あるいは文字列）の認識（文字認識）を行い、データとして取得する情報処理装置が利用されている。例えば、情報処理装置は次のようにして文字認識を行う。まず、紙面の記入面を撮像装置により撮像して、紙面の画像情報を取得する。次に、該紙面の画像情報に含まれる文字の画像情報を、予め定義された文字パターンと照合し、該文字パターンとの一致度に基づいて、文字の画像情報が何れの文字パターンに対応するか決定する。そして、決定した文字パターンに対応する文字コードを、該紙面に記入された文字のデータとして取得する。各文字につき、この処理を繰り返し行い、文字列のデータを取得する。 2. Description of the Related Art Conventionally, information processing apparatuses that perform recognition (character recognition) of characters (or character strings) entered on paper and acquire them as data have been used. For example, the information processing apparatus performs character recognition as follows. First, a paper entry surface is imaged by an imaging device to obtain image information on the paper surface. Next, the character image information included in the image information on the paper is compared with a predefined character pattern, and the character image information corresponds to any character pattern based on the degree of coincidence with the character pattern. To decide. Then, the character code corresponding to the determined character pattern is acquired as the character data entered on the page. This process is repeated for each character to obtain character string data.

文字認識では、紙面上に記入される文字列を所定のデータ項目に対応付けて取得することもある。例えば、金融機関で預金や出金などの取引に用いる帳票には、氏名、口座番号および取引金額などのデータ項目に対する文字列を、顧客に記入させる記入欄が設けられている。各記入欄は、帳票の種類ごとに定位置に配置される。この場合、データ項目に対応付けて記入欄の帳票上の位置（レイアウト）を予め定義しておけば、情報処理装置は帳票の画像情報に基づき、データ項目と文字列のデータとを対応付けて取得できる。このように、レイアウトの定義情報に基づいて、データ項目に対する文字列を抽出する文字認識の方法を、レイアウト認識による文字認識と呼ぶことがある。 In character recognition, a character string written on paper is sometimes obtained in association with a predetermined data item. For example, a form used for transactions such as deposits and withdrawals at a financial institution is provided with an entry field that allows a customer to enter character strings for data items such as name, account number, and transaction amount. Each entry field is arranged at a fixed position for each type of form. In this case, if the position (layout) of the entry field in the form is defined in advance in association with the data item, the information processing apparatus associates the data item with character string data based on the form image information. You can get it. As described above, a character recognition method for extracting a character string for a data item based on layout definition information may be referred to as character recognition by layout recognition.

ここで、帳票のレイアウトを変更することがある。その場合、レイアウト認識では、変更のたびにレイアウトの定義情報を更新するための作業負担が生じる。そこで、レイアウトの定義情報に依らずに記入された文字列とデータ項目との対応を判断して、レイアウト変更を容易にする文字認識の方法も考えられている。 Here, the form layout may be changed. In that case, in the layout recognition, a work load for updating the definition information of the layout every time a change occurs. Therefore, a method of character recognition that makes it easy to change the layout by determining the correspondence between the entered character string and the data item without depending on the layout definition information has been considered.

例えば、予め定義した項目名（「金額」や「振込金額」など）を示す文字列（キーワード）を帳票から抽出し、該キーワードの位置と記入された文字列との位置関係などに基づいて、該キーワードが示す項目名と記入された文字列とを対応付ける提案がある（例えば、特許文献１参照）。このように、項目名を示すキーワードを検出して、記入された文字列との対応を判断する文字認識の方法を、キーワード認識による文字認識と呼ぶことがある。 For example, a character string (keyword) indicating a predefined item name (such as “amount” or “transfer amount”) is extracted from a form, and based on the positional relationship between the position of the keyword and the entered character string, There is a proposal for associating an item name indicated by the keyword with a written character string (for example, see Patent Document 1). As described above, the character recognition method for detecting the keyword indicating the item name and determining the correspondence with the entered character string may be called character recognition by keyword recognition.

また、例えば、文字列を記入するフィールドが設けられた帳票上の、フィールドに対する所定位置に、項目に対応した識別コードを記載する提案がある（例えば、特許文献２参照）。この提案では、該識別コードを検出して該フィールドの項目を認識する。 In addition, for example, there is a proposal to describe an identification code corresponding to an item at a predetermined position with respect to a field on a form provided with a field for entering a character string (see, for example, Patent Document 2). In this proposal, the item of the field is recognized by detecting the identification code.

更に、例えば、項目ごとの文字列についての規定を含む構文ルール情報を定義しておく提案もある（例えば、特許文献３参照）。この提案では、認識した文字列を該構文ルール情報に基づいて解析し、認識した文字列と項目との対応付けを行う。 Further, for example, there is a proposal for defining syntax rule information including a rule for a character string for each item (see, for example, Patent Document 3). In this proposal, the recognized character string is analyzed based on the syntax rule information, and the recognized character string is associated with the item.

特開２０１０−３１５５号公報JP 2010-3155 A 特開２００４−１６４３７６号公報JP 2004-164376 A 特開２００４−１９９５２９号公報Japanese Patent Laid-Open No. 2004-199529

レイアウト変更後の帳票について所定の文字認識処理を行う際に、帳票上の領域全体を処理対象とすると処理効率が悪い場合がある。
例えば、帳票によっては、文字認識の対象としなくてもよい領域（例えば、事業者側で使用する欄や顧客に情報を伝えるためのお知らせ欄など）が含まれ得る。このような領域をも該文字認識処理の対象とすると、処理時間が余分にかかり、処理効率が悪い。 When a predetermined character recognition process is performed on a form after the layout is changed, processing efficiency may be poor if the entire area on the form is a processing target.
For example, depending on the form, an area that does not have to be a character recognition target (for example, a column used on the business side or a notification column for transmitting information to a customer) may be included. If such a region is also subject to character recognition processing, it takes extra processing time and processing efficiency is poor.

また、例えば、変更の非対象の領域は、既存の文字認識（例えば、レイアウト認識による文字認識）を行えばよい場合もある。その場合に、例えば変更の非対象の領域をも、上記特許文献１〜３に例示されるような文字認識処理の対象とすると、既存の文字認識で対応可能な領域を重複して処理することになる。すると、処理時間が余分にかかり、処理効率が悪い。 In addition, for example, there may be a case where existing character recognition (for example, character recognition by layout recognition) may be performed on a non-target area to be changed. In that case, for example, if a non-target area to be changed is also a target of character recognition processing as exemplified in Patent Documents 1 to 3, an area that can be handled by existing character recognition is overlapped. become. Then, it takes extra processing time and processing efficiency is poor.

そこで、帳票上のレイアウトが変更されたときに、所定の文字認識処理の対象領域を容易に指定可能として、該文字認識処理を効率的に行う仕組みをどのようにして実現するかが問題となる。 Therefore, when the layout on the form is changed, there is a problem of how to implement a mechanism for efficiently performing the character recognition process by making it possible to easily specify a target area for a predetermined character recognition process. .

本発明はこのような点に鑑みてなされたものであり、文字認識処理を効率的に行えるようにした情報処理プログラム、情報処理装置および文字認識方法を提供することを目的とする。 The present invention has been made in view of these points, and an object thereof is to provide an information processing program, an information processing apparatus, and a character recognition method capable of efficiently performing character recognition processing.

帳票を撮像して得られた画像情報から該帳票のレイアウトが変更された旨を示す識別情報を検出し、識別情報を検出すると、画像情報に含まれる、所定の文字認識処理を行う処理対象領域を識別するための領域識別情報を検出して、該領域識別情報に基づき処理対象領域を特定し、特定した処理対象領域に対して所定の文字認識処理を行う、処理をコンピュータに実行させる情報処理プログラムが提供される。 A processing target area for performing a predetermined character recognition process included in the image information when the identification information indicating that the layout of the form has been changed is detected from the image information obtained by capturing the form and the identification information is detected. Information processing for detecting a region identification information for identifying a region, specifying a processing target region based on the region identification information, performing a predetermined character recognition process on the specified processing target region, and causing a computer to execute the processing A program is provided.

また、検出部と領域特定部と処理部とを有する情報処理装置が提供される。検出部は、帳票を撮像して得られた画像情報から該帳票のレイアウトが変更された旨を示す識別情報を検出する。領域特定部は、識別情報を検出すると、画像情報に含まれる、所定の文字認識処理を行う処理対象領域を識別するための領域識別情報を検出して、該領域識別情報に基づき処理対象領域を特定する。処理部は、特定した処理対象領域に対して所定の文字認識処理を行う。 An information processing apparatus having a detection unit, a region specifying unit, and a processing unit is provided. The detection unit detects identification information indicating that the layout of the form has been changed from image information obtained by capturing the form. Upon detecting the identification information, the area specifying unit detects area identification information included in the image information for identifying a processing target area for performing a predetermined character recognition process, and determines the processing target area based on the area identification information. Identify. The processing unit performs a predetermined character recognition process on the specified processing target area.

また、情報処理装置が実行する文字認識方法が提供される。この文字認識方法では、帳票を撮像して得られた画像情報から該帳票のレイアウトが変更された旨を示す識別情報を検出すると、画像情報に含まれる、所定の文字認識処理を行う処理対象領域を識別するための領域識別情報を検出して、該領域識別情報に基づき処理対象領域を特定する。特定した処理対象領域に対して所定の文字認識処理を行う。 A character recognition method executed by the information processing apparatus is also provided. In this character recognition method, when identification information indicating that the layout of the form has been changed is detected from image information obtained by capturing the form, a processing target area for performing a predetermined character recognition process included in the image information The region identification information for identifying the region is detected, and the processing target region is specified based on the region identification information. A predetermined character recognition process is performed on the identified processing target area.

文字認識処理を効率的に行える。 Character recognition processing can be performed efficiently.

第１の実施の形態の情報処理装置を示す図である。It is a figure which shows the information processing apparatus of 1st Embodiment. 第２の実施の形態の情報処理システムを示す図である。It is a figure which shows the information processing system of 2nd Embodiment. 第２の実施の形態の帳票読取装置のハードウェアを示す図である。It is a figure which shows the hardware of the form reading apparatus of 2nd Embodiment. 第２の実施の形態の改訂前の帳票の例を示す図である。It is a figure which shows the example of the form before revision of 2nd Embodiment. 第２の実施の形態の改訂後の帳票の例を示す図である。It is a figure which shows the example of the form after revision of 2nd Embodiment. 第２の実施の形態の帳票読取装置の機能構成を示すブロック図である。It is a block diagram which shows the function structure of the form reading apparatus of 2nd Embodiment. 第２の実施の形態のレイアウト定義テーブルの例を示す図である。It is a figure which shows the example of the layout definition table of 2nd Embodiment. 第２の実施の形態の帳票読取処理を示すフローチャートである。It is a flowchart which shows the form reading process of 2nd Embodiment. 第２の実施の形態のドロップアウト処理後の帳票画像の例を示す図である。It is a figure which shows the example of the form image after the dropout process of 2nd Embodiment. 第２の実施の形態のキーワード認識処理を示すフローチャートである。It is a flowchart which shows the keyword recognition process of 2nd Embodiment. 第２の実施の形態のキーワード認識対象領域の例を示す図である。It is a figure which shows the example of the keyword recognition object area | region of 2nd Embodiment. 第２の実施の形態の帳票ＩＤの例を示す図である。It is a figure which shows the example of form ID of 2nd Embodiment. 第３の実施の形態の改訂後の帳票の例を示す図である。It is a figure which shows the example of the form after revision of 3rd Embodiment. 第３の実施の形態のキーワード認識処理を示すフローチャートである。It is a flowchart which shows the keyword recognition process of 3rd Embodiment. 第４の実施の形態の改訂前の帳票の例を示す図である。It is a figure which shows the example of the form before revision of 4th Embodiment. 第４の実施の形態の改訂後の帳票の例を示す図である。It is a figure which shows the example of the form after revision of 4th Embodiment. 第４の実施の形態の帳票読取処理を示すフローチャートである。It is a flowchart which shows the form reading process of 4th Embodiment. 第４の実施の形態のキーワード認識処理を示すフローチャートである。It is a flowchart which shows the keyword recognition process of 4th Embodiment. 第４の実施の形態の各文字認識の対象領域の第１の例を示す図である。It is a figure which shows the 1st example of the object area | region of each character recognition of 4th Embodiment. 第４の実施の形態の各文字認識の対象領域の第２の例を示す図である。It is a figure which shows the 2nd example of the object area | region of each character recognition of 4th Embodiment. 第５の実施の形態のキーワード認識対象領域の例を示す図である。It is a figure which shows the example of the keyword recognition object area | region of 5th Embodiment. 第５の実施の形態のデータ部の特定処理を示すフローチャートである。It is a flowchart which shows the specific process of the data part of 5th Embodiment. 第５の実施の形態のデータ部の特定方法の例を示す図である。It is a figure which shows the example of the identification method of the data part of 5th Embodiment. 第５の実施の形態のデータ部の特定方法の他の例を示す図である。It is a figure which shows the other example of the identification method of the data part of 5th Embodiment.

以下、本実施の形態を図面を参照して説明する。
［第１の実施の形態］
図１は、第１の実施の形態の情報処理装置を示す図である。情報処理装置１は、所定のレイアウトが施された帳票を撮像して得られた画像情報２に対して文字認識処理を行う。レイアウトとは、例えば帳票上の記入欄の配置である。 Hereinafter, the present embodiment will be described with reference to the drawings.
[First Embodiment]
FIG. 1 is a diagram illustrating the information processing apparatus according to the first embodiment. The information processing apparatus 1 performs a character recognition process on the image information 2 obtained by imaging a form having a predetermined layout. The layout is an arrangement of entry fields on a form, for example.

画像情報２には、識別情報２ａおよび領域識別情報２ｂが含まれる。識別情報２ａは、帳票上のレイアウトが変更された旨を示す情報である。領域識別情報２ｂは、所定の文字認識処理を行う処理対象領域３を識別するための情報である。識別情報２ａおよび領域識別情報２ｂに対応する情報は、帳票上に予め印字される。 The image information 2 includes identification information 2a and area identification information 2b. The identification information 2a is information indicating that the layout on the form has been changed. The area identification information 2b is information for identifying the processing target area 3 for performing a predetermined character recognition process. Information corresponding to the identification information 2a and the area identification information 2b is printed in advance on the form.

情報処理装置１は、検出部１ａ、領域特定部１ｂおよび処理部１ｃを有する。
検出部１ａは、画像情報２から帳票上のレイアウトが変更された旨を示す識別情報２ａを検出する。例えば、識別情報２ａは、画像情報２の所定位置に配置される。配置位置は、検出部１ａに予め設定される。例えば、識別情報２ａは、数値、文字、記号、図形およびこれらの組合せなどにより表される。どのような情報が識別情報２ａに該当するかは、検出部１ａに予め設定される。 The information processing apparatus 1 includes a detection unit 1a, a region specifying unit 1b, and a processing unit 1c.
The detection unit 1a detects identification information 2a indicating that the layout on the form has been changed from the image information 2. For example, the identification information 2a is arranged at a predetermined position of the image information 2. The arrangement position is preset in the detection unit 1a. For example, the identification information 2a is represented by numerical values, characters, symbols, figures, combinations thereof, and the like. What information corresponds to the identification information 2a is preset in the detection unit 1a.

領域特定部１ｂは、検出部１ａが識別情報２ａを検出すると、画像情報２から領域識別情報２ｂを検出する。領域特定部１ｂは、領域識別情報２ｂに基づき処理対象領域３を特定する。例えば、領域識別情報２ｂは、所定領域を囲う線として表される。この場合、領域特定部１ｂは、領域識別情報２ｂの内側の領域を処理対象領域３と特定できる。なお、領域識別情報２ｂの線の太さや線の種別（実線、点線および一点鎖線など）など、どのような線が領域識別情報２ｂに該当するかは、領域特定部１ｂに予め設定される。領域特定部１ｂで示される図形は、任意の多角形や多角形に限らない任意の図形としてもよい。 The area specifying unit 1b detects the area identification information 2b from the image information 2 when the detection unit 1a detects the identification information 2a. The area specifying unit 1b specifies the processing target area 3 based on the area identification information 2b. For example, the area identification information 2b is represented as a line surrounding a predetermined area. In this case, the area specifying unit 1b can specify the area inside the area identification information 2b as the process target area 3. Note that what type of line corresponds to the area identification information 2b, such as the thickness of the line of the area identification information 2b and the type of line (eg, solid line, dotted line, and alternate long and short dash line), is preset in the area specifying unit 1b. The figure shown by the area specifying unit 1b may be an arbitrary polygon or an arbitrary figure that is not limited to a polygon.

なお、領域識別情報２ｂは、所定領域を囲う線以外の方法で表すこともできる。例えば、数値、文字、記号、図形およびこれらの組合せなどを用いてもよい。その場合、例えば、領域特定部１ｂには、処理対象領域３が長方形で形成されることを予め設定する。その場合、領域特定部１ｂは、長方形の１対の対角に設けられた２つの領域識別情報を検出することで処理対象領域３を特定できる。処理対象領域３（領域識別情報で示される図形）は、長方形に限らず任意の多角形または図形としてもよい。例えば、多角形の頂点に対応する各位置に、領域識別情報を設けておけば、領域特定部１ｂは、領域識別情報により頂点を特定し、頂点で囲われる多角形を特定できる。なお、この場合も、どのような情報が領域識別情報に該当するかは、領域特定部１ｂに予め設定される。 Note that the area identification information 2b can also be expressed by a method other than the line surrounding the predetermined area. For example, numerical values, characters, symbols, figures, and combinations thereof may be used. In that case, for example, it is set in advance in the region specifying unit 1b that the processing target region 3 is formed in a rectangular shape. In that case, the region specifying unit 1b can specify the processing target region 3 by detecting two pieces of region identification information provided at a pair of diagonal corners. The processing target area 3 (a graphic indicated by the area identification information) is not limited to a rectangle, and may be an arbitrary polygon or graphic. For example, if region identification information is provided at each position corresponding to a vertex of a polygon, the region identification unit 1b can identify a vertex by the region identification information and identify a polygon surrounded by the vertex. Also in this case, what information corresponds to the region identification information is preset in the region specifying unit 1b.

ここで、検出部１ａ、領域特定部１ｂおよび処理部１ｃは、ＣＰＵ（Central Processing Unit）およびＲＡＭ（Random Access Memory）を用いて実行されるプログラムとして実装してもよい。 Here, the detection unit 1a, the region specifying unit 1b, and the processing unit 1c may be implemented as programs that are executed using a CPU (Central Processing Unit) and a RAM (Random Access Memory).

情報処理装置１によれば、検出部１ａにより、画像情報２から識別情報２ａが検出される。すると、領域特定部１ｂにより、画像情報２に含まれる領域識別情報２ｂが検出され、領域識別情報２ｂに基づき処理対象領域３が特定される。処理部１ｃにより、処理対象領域３に対して所定の文字認識処理が行われる。 According to the information processing apparatus 1, the identification information 2 a is detected from the image information 2 by the detection unit 1 a. Then, the region identification information 1b included in the image information 2 is detected by the region identification unit 1b, and the processing target region 3 is identified based on the region identification information 2b. A predetermined character recognition process is performed on the processing target area 3 by the processing unit 1c.

これにより、文字認識処理を効率的に行える。具体的には、所定の文字認識処理を行う領域を処理対象領域３とし、それ以外の領域に対しては該所定の文字認識処理を行わない。すなわち、所定の文字認識処理が余計な領域に対して行われるのを抑止でき、余分な処理時間がかからずに済む。 Thereby, a character recognition process can be performed efficiently. Specifically, an area where a predetermined character recognition process is performed is set as a process target area 3, and the predetermined character recognition process is not performed for other areas. That is, it is possible to prevent the predetermined character recognition processing from being performed on an extra area, and an extra processing time is not required.

なお、図１の画像情報２では領域識別情報２ｂで１つの処理対象領域３を特定する場合を例示した。これに対し、複数の領域識別情報により、複数の処理対象領域を特定可能としてもよい。より具体的には、領域を囲う線として表される領域識別情報を複数設けて、複数の処理対象領域を特定可能としてもよい。あるいは、第１の処理対象領域の頂点位置に第１の領域識別情報を設けて該第１の処理対象領域を特定可能とし、第２の処理対象領域の頂点位置に、第１の領域識別情報とは異なる第２の領域識別情報を設けて該第２の処理対象領域を特定可能としてもよい。 In the image information 2 in FIG. 1, the case where one processing target area 3 is specified by the area identification information 2 b is illustrated. On the other hand, a plurality of process target areas may be specified by a plurality of area identification information. More specifically, a plurality of region identification information expressed as lines surrounding the region may be provided so that a plurality of processing target regions can be specified. Alternatively, the first area identification information is provided at the vertex position of the first processing target area so that the first processing target area can be specified, and the first area identification information is set at the vertex position of the second processing target area. Second region identification information different from that may be provided so that the second processing target region can be specified.

以下、金融機関の窓口などに設置され、顧客が記入した帳票の画像情報に対して文字認識処理を行う帳票読取装置に、情報処理装置１を適用する例を説明する。
［第２の実施の形態］
図２は、第２の実施の形態の情報処理システムを示す図である。この情報処理システムは、金融機関の窓口業務を支援する。この情報処理システムは、帳票読取装置１００とサーバ装置２００とを含む。帳票読取装置１００およびサーバ装置２００は、ネットワーク１０を介して接続される。ネットワーク１０は、該金融機関内に設けられたイントラネットである。帳票読取装置１００とサーバ装置２００とは、別個の拠点に設置されてもよい。ネットワーク１０の経路内には、この情報処理システムのために敷設された専用線のネットワーク、インターネットおよび通信事業者のＩＰ（Internet Protocol）網などを含んでもよい。インターネットやＩＰ網を含む場合、ＶＰＮ（Virtual Private Network）などを利用して通信のセキュリティが確保される。 Hereinafter, an example will be described in which the information processing apparatus 1 is applied to a form reading apparatus installed at a financial institution or the like and performing character recognition processing on image information of a form entered by a customer.
[Second Embodiment]
FIG. 2 illustrates an information processing system according to the second embodiment. This information processing system supports the window business of financial institutions. This information processing system includes a form reading device 100 and a server device 200. The form reading device 100 and the server device 200 are connected via the network 10. The network 10 is an intranet provided in the financial institution. The form reading device 100 and the server device 200 may be installed in separate bases. The route of the network 10 may include a dedicated line network laid for the information processing system, the Internet, and an IP (Internet Protocol) network of a communication carrier. When the Internet or IP network is included, communication security is ensured using a VPN (Virtual Private Network) or the like.

帳票読取装置１００は、金融機関の窓口に設置される情報処理装置である。帳票読取装置１００は、帳票を撮像して取得された帳票の画像情報（以下、帳票画像と呼ぶことがある）に対して文字認識処理を行い、顧客が帳票に記入した情報を取得してサーバ装置２００に送信する。 The form reading apparatus 100 is an information processing apparatus installed at a financial institution window. The form reading device 100 performs character recognition processing on image information of a form (hereinafter, also referred to as a form image) acquired by imaging a form, acquires information entered in the form by a customer, and serves as a server To device 200.

帳票読取装置１００が読み取る帳票の種別は、取引に応じて複数の種類が存在する。例えば、入金、出金および新規申込などの取引に応じた帳票が考えられる。帳票読取装置１００は、既存のレイアウトの帳票に対する文字認識処理にレイアウト認識を用いるものとする。 There are a plurality of types of forms read by the form reading apparatus 100 depending on the transaction. For example, forms corresponding to transactions such as deposits, withdrawals and new applications can be considered. It is assumed that the form reading apparatus 100 uses layout recognition for character recognition processing for a form having an existing layout.

サーバ装置２００は、顧客の口座情報を管理する情報処理装置である。サーバ装置２００は、帳票読取装置１００が読み取った帳票の情報を受信して、該情報に基づく取引の処理を実行する。例えば、現金の出金、現金による入金、ある口座から他の口座への預金の振替などの取引を確定するための処理である。 The server device 200 is an information processing device that manages customer account information. The server device 200 receives information on the form read by the form reading device 100 and executes a transaction process based on the information. For example, it is a process for confirming transactions such as cash withdrawal, cash deposit, transfer of deposits from one account to another.

なお、金融機関の窓口には、帳票読取装置１００以外にも複数の帳票読取装置が設けられてもよい。
金融機関は、帳票のレイアウトを変更すること（以下、帳票の改訂ということもある）がある。例えば、新たなデータ項目の記入欄を追加したり、既存のデータ項目の記入欄を削減したりする場合が考えられる。レイアウト認識では、レイアウト変更に際して帳票のレイアウト定義情報を変更するための作業負担が生じる。そこで、帳票読取装置１００は、改訂箇所につきキーワード認識による文字認識処理を行うことで、該レイアウト変更に容易に対応可能である。以下、この場合にキーワード認識による文字認識処理を効率的に行うための構成を説明する。 In addition to the form reading device 100, a plurality of form reading devices may be provided at the counter of the financial institution.
Financial institutions sometimes change the layout of forms (hereinafter, sometimes referred to as revision of forms). For example, it may be possible to add a new data item entry field or to reduce an existing data item entry field. In layout recognition, a work load for changing the layout definition information of a form occurs when the layout is changed. Therefore, the form reading apparatus 100 can easily cope with the layout change by performing character recognition processing by keyword recognition for the revised portion. Hereinafter, a configuration for efficiently performing character recognition processing by keyword recognition in this case will be described.

図３は、第２の実施の形態の帳票読取装置のハードウェアを示す図である。帳票読取装置１００は、ＣＰＵ１０１、ＲＯＭ（Read Only Memory）１０２、ＲＡＭ１０３、ＨＤＤ（Hard Disk Drive）１０４、グラフィックインタフェース１０５、入力インタフェース１０６、スキャナインタフェース１０７、ディスクドライブ１０８および通信インタフェース１０９を有する。 FIG. 3 is a diagram illustrating hardware of the form reading apparatus according to the second embodiment. The form reading apparatus 100 includes a CPU 101, a ROM (Read Only Memory) 102, a RAM 103, an HDD (Hard Disk Drive) 104, a graphic interface 105, an input interface 106, a scanner interface 107, a disk drive 108, and a communication interface 109.

ＣＰＵ１０１は、ＯＳ（Operating System）プログラムやアプリケーションプログラムを実行して、帳票読取装置１００全体を制御する。
ＲＯＭ１０２は、帳票読取装置１００の起動時に実行されるＢＩＯＳ（Basic Input / Output System）プログラムなどの所定のプログラムを記憶する。ＲＯＭ１０２は、書き換え可能な不揮発性メモリであってもよい。 The CPU 101 executes an OS (Operating System) program and an application program to control the entire form reading apparatus 100.
The ROM 102 stores a predetermined program such as a BIOS (Basic Input / Output System) program executed when the form reading apparatus 100 is activated. The ROM 102 may be a rewritable nonvolatile memory.

ＲＡＭ１０３は、ＣＰＵ１０１が実行するＯＳプログラムやアプリケーションプログラムの少なくとも一部を一時的に記憶する。また、ＲＡＭ１０３は、ＣＰＵ１０１の処理に用いられるデータの少なくとも一部を一時的に記憶する。 The RAM 103 temporarily stores at least part of an OS program and application programs executed by the CPU 101. The RAM 103 temporarily stores at least a part of data used for the processing of the CPU 101.

ＨＤＤ１０４は、ＯＳプログラムやアプリケーションプログラムを記憶する。また、ＨＤＤ１０４は、ＣＰＵ１０１の処理に用いられるデータを記憶する。なお、ＨＤＤ１０４に代えて（または、ＨＤＤ１０４と併せて）、ＳＳＤ（Solid State Drive）など他の種類の不揮発性の記憶装置を用いてもよい。 The HDD 104 stores an OS program and application programs. The HDD 104 stores data used for the processing of the CPU 101. Instead of the HDD 104 (or in combination with the HDD 104), other types of nonvolatile storage devices such as an SSD (Solid State Drive) may be used.

グラフィックインタフェース１０５は、モニタ１１に接続される。グラフィックインタフェース１０５は、ＣＰＵ１０１からの命令に従って、画像をモニタ１１に表示させる。
入力インタフェース１０６は、キーボード１２やマウス１３などの入力デバイスに接続される。入力インタフェース１０６は、入力デバイスから送られる入力信号をＣＰＵ１０１に出力する。 The graphic interface 105 is connected to the monitor 11. The graphic interface 105 displays an image on the monitor 11 in accordance with a command from the CPU 101.
The input interface 106 is connected to input devices such as the keyboard 12 and the mouse 13. The input interface 106 outputs an input signal sent from the input device to the CPU 101.

スキャナインタフェース１０７は、イメージスキャナ１４に接続される。イメージスキャナ１４は、帳票を撮像して帳票画像を生成する撮像装置である。スキャナインタフェース１０７は、イメージスキャナ１４から取得した帳票画像をＣＰＵ１０１、ＲＡＭ１０３およびＨＤＤ１０４などに出力する。 The scanner interface 107 is connected to the image scanner 14. The image scanner 14 is an imaging device that captures a form and generates a form image. The scanner interface 107 outputs the form image acquired from the image scanner 14 to the CPU 101, the RAM 103, the HDD 104, and the like.

ディスクドライブ１０８は、記録媒体１５に格納されたデータを読み取る読取装置である。記録媒体１５には、例えば、帳票読取装置１００に実行させるプログラムが記録されている。帳票読取装置１００は、例えば、記録媒体１５に記録されたプログラムを実行することで、後述するような機能を実現できる。すなわち、該プログラムはコンピュータ読み取り可能な記録媒体１５に記録して配布可能である。 The disk drive 108 is a reading device that reads data stored in the recording medium 15. For example, a program to be executed by the form reading apparatus 100 is recorded on the recording medium 15. The form reading apparatus 100 can realize functions as described later by executing a program recorded in the recording medium 15, for example. That is, the program can be recorded on a computer-readable recording medium 15 and distributed.

記録媒体１５としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリを使用できる。磁気記録装置には、ＨＤＤ、フレキシブルディスク（ＦＤ）、磁気テープなどがある。光ディスクには、ＣＤ（Compact Disc）、ＣＤ−Ｒ（Recordable）／ＲＷ（ReWritable）、ＤＶＤ（Digital Versatile Disc）、ＤＶＤ−Ｒ／ＲＷ／ＲＡＭなどがある。光磁気記録媒体には、ＭＯ（Magneto-Optical disk）などがある。半導体メモリには、ＵＳＢ（Universal Serial Bus）メモリなどのフラッシュメモリがある。 As the recording medium 15, for example, a magnetic recording device, an optical disk, a magneto-optical recording medium, and a semiconductor memory can be used. Examples of the magnetic recording device include an HDD, a flexible disk (FD), and a magnetic tape. Optical disks include CD (Compact Disc), CD-R (Recordable) / RW (ReWritable), DVD (Digital Versatile Disc), DVD-R / RW / RAM, and the like. Magneto-optical recording media include MO (Magneto-Optical disk). Semiconductor memory includes flash memory such as USB (Universal Serial Bus) memory.

通信インタフェース１０９は、ネットワーク１０に接続される。通信インタフェース１０９は、ネットワーク１０を介してサーバ装置２００とデータ通信を行える。
帳票読取装置１００は、文字認識により読み取った各データ項目の文字列データをサーバ装置２００に送信する。 The communication interface 109 is connected to the network 10. The communication interface 109 can perform data communication with the server device 200 via the network 10.
The form reading device 100 transmits the character string data of each data item read by character recognition to the server device 200.

なお、サーバ装置２００も帳票読取装置１００と同様のハードウェア構成により実現できる。
図４は、第２の実施の形態の改訂前の帳票の例を示す図である。帳票３００は、改訂前の（既存の）帳票を例示している。帳票３００は、例えば、規格などによってそのレイアウトが定められているものである。なお、規格などによりレイアウトが定められた帳票を制定帳票と呼ぶこともある。帳票３００には、帳票ＩＤ（IDentifier）３１０および記入欄群３２０，３３０が印字されている。 The server apparatus 200 can also be realized by the same hardware configuration as the form reading apparatus 100.
FIG. 4 is a diagram illustrating an example of a form before revision according to the second embodiment. The form 300 exemplifies the (existing) form before revision. The form 300 has a layout determined by, for example, a standard. A form whose layout is defined by a standard or the like may be called an enacted form. On the form 300, a form ID (IDentifier) 310 and entry column groups 320 and 330 are printed.

帳票ＩＤ３１０は、帳票の種別を識別するための識別情報である。帳票ＩＤ３１０として、“２００１００”が印字されている。帳票ＩＤ３１０の上４桁の数値“２００１”は、帳票３００が口座開設の申し込みを行ったり、自身の口座に入金する取引を行ったりするための帳票であることを示す。帳票ＩＤ３１０の下２桁の数値“００”は、帳票３００が改訂されたものではないことを示す。ここで、以下では、帳票ＩＤの下２桁の数値を、帳票ＩＤの枝番と呼ぶことがある。 The form ID 310 is identification information for identifying the type of form. “200100” is printed as the form ID 310. The first four-digit numerical value “2001” of the form ID 310 indicates that the form 300 is a form for making an application for opening an account or performing a transaction for depositing into its own account. The last two digits “00” of the form ID 310 indicate that the form 300 has not been revised. Here, in the following, the last two digits of the form ID may be referred to as a branch number of the form ID.

記入欄群３２０，３３０は、顧客に記入させる複数の記入欄を印字した領域である。ここで、記入欄とは、該記入欄の見出し部分（例えば、“おところ”や“おなまえ”といった文字列が予め印字される部分）と顧客が筆記具を用いて文字列を記入する部分とを含む欄である。 The entry column groups 320 and 330 are areas in which a plurality of entry columns to be entered by the customer are printed. Here, the entry field is a heading part of the entry field (for example, a part in which a character string such as “place” or “name” is printed in advance) and a part in which a customer enters a character string using a writing instrument. It is a column containing.

記入欄群３２０は、記入欄３２１，３２２，３２３，３２４，３２５，３２６，３２７，３２８，３２９を含む。記入欄３２１は、申込日（例えば、年月日）を記入させるための欄である。記入欄３２２は、金融機関の店舗を識別するための番号（店番）を記入させるための欄である。記入欄３２３は、口座番号を記入させるための欄である。記入欄３２４は、郵便番号および住所を記入させるための欄である。記入欄３２５は、口座名義人の氏名を記入させるための欄である。記入欄３２６は、取引の金額を記入させるための欄である。記入欄３２７は、金融機関が提供する金融商品の種類を選択させるための欄である。記入欄３２８は、利用する通帳の種類を選択させるための欄である。記入欄３２９は、利用するキャッシュカードの種類を選択させるための欄である。 The entry column group 320 includes entry columns 321, 322, 323, 324, 325, 326, 327, 328, and 329. The entry column 321 is a column for entering an application date (for example, date). The entry column 322 is a column for entering a number (store number) for identifying a store of a financial institution. The entry column 323 is a column for entering an account number. The entry column 324 is a column for entering a zip code and an address. The entry column 325 is a column for entering the name of the account holder. The entry column 326 is a column for entering the transaction amount. The entry column 327 is a column for selecting the type of financial product provided by the financial institution. The entry column 328 is a column for selecting the type of passbook to be used. The entry column 329 is a column for selecting the type of cash card to be used.

記入欄群３３０は、記入欄３３１，３３２，３３３を含む。記入欄３３１は、口座名義人の電話番号を記入させるための欄である。記入欄３３２は、口座名義人の性別を選択させるための欄である。記入欄３３３は、口座名義人の生年月日を記入させるための欄である。 The entry field group 330 includes entry fields 331, 332, and 333. The entry column 331 is a column for entering the account holder's telephone number. The entry column 332 is a column for selecting the gender of the account holder. The entry column 333 is a column for entering the date of birth of the account holder.

記入欄群３２０，３３０の各欄の枠線および帳票３００上の一部箇所には所定の色が付される。この色は、帳票３００のカラーイメージ（カラーの帳票画像）を取得後、該イメージに所定のドロップアウト処理を行うことで、該イメージからドロップアウト（消去）可能な色である。このような色を、ドロップアウト色と呼ぶことがある。 A predetermined color is given to a frame line of each column of the entry column groups 320 and 330 and a part of the form 300. This color is a color that can be dropped out (erased) from the image by obtaining a color image (colored form image) of the form 300 and performing a predetermined dropout process on the image. Such a color may be referred to as a dropout color.

図５は、第２の実施の形態の改訂後の帳票の例を示す図である。帳票３００ａは、帳票３００に対する改訂後の帳票を例示している。帳票３００ａには、帳票ＩＤ３１０ａ、記入欄群３２０ａ，３３０ａおよび領域識別情報３４０が印字されている。 FIG. 5 is a diagram illustrating an example of a revised form according to the second embodiment. The form 300a exemplifies a revised form for the form 300. A form ID 310a, entry column groups 320a and 330a, and area identification information 340 are printed on the form 300a.

帳票ＩＤ３１０ａは、帳票の種別を識別するための識別情報である。帳票ＩＤ３１０ａとして、“２００１０１”が印字されている。帳票ＩＤ３１０ａの上４桁の数値“２００１”は、帳票３００ａが口座開設の申し込みを行ったり、自身の口座に入金する取引を行ったりするための帳票であることを示す。帳票ＩＤ３１０ａの下２桁の数値“０１”は、帳票３００ａが改訂後の版数“０１”の帳票であることを示す。すなわち、該下２桁の数値が“００”以外のとき、該帳票は、改訂後のものであることを示し、該数値が改訂後の版数を示している。 The form ID 310a is identification information for identifying the type of form. “200101” is printed as the form ID 310a. The first four-digit numerical value “2001” of the form ID 310a indicates that the form 300a is a form for making an application for opening an account or performing a transaction for depositing into its own account. The last two digits “01” of the form ID 310a indicate that the form 300a is a revised version “01”. That is, when the numerical value of the last two digits is other than “00”, it indicates that the form is the revised version, and the numeric value indicates the revised version number.

記入欄群３２０ａ，３３０ａは、顧客に記入させる複数の記入欄を印字した領域である。
記入欄群３２０ａは、記入欄３２１，３２２，３２３，３２４，３２５，３２６，３２７ａ，３２８ａ，３２９ａを含む。帳票３００と比較すると、記入欄３２７ａ，３２８ａ，３２９ａが相違する。 The entry column groups 320a and 330a are areas in which a plurality of entry columns to be entered by the customer are printed.
The entry column group 320a includes entry columns 321, 322, 323, 324, 325, 326, 327a, 328a, and 329a. Compared with the form 300, entry fields 327a, 328a, and 329a are different.

記入欄３２７ａ，３２８ａ，３２９ａは、記入欄３２７，３２８，３２９と同様の内容を記入させるための欄である。記入欄３２７ａ，３２８ａ，３２９ａは、記入欄３２７，３２８，３２９に対して、新しい選択項目を追加している点が異なる。これにより、記入欄３２８ａ，３２９ａは、その欄の占める領域が、図５の紙面に向かって下方に拡張されている。 The entry fields 327a, 328a, and 329a are fields for entering the same contents as the entry fields 327, 328, and 329. The entry fields 327a, 328a, and 329a differ from the entry fields 327, 328, and 329 in that new selection items are added. As a result, in the entry fields 328a and 329a, the area occupied by the fields is expanded downward toward the paper surface of FIG.

記入欄群３３０ａは、記入欄３３１ａ，３３２ａ，３３３ａを含む。記入欄３３１ａ，３３２ａ，３３３ａは、記入欄３３１，３３２，３３３と同様の内容を記入させるための欄である。帳票３００と比較すると、記入欄３２８ａ，３２９ａの占める領域が拡張したことで、記入欄３３１ａ，３３２ａ，３３３ａは図５の紙面に向かって下方に移動している。 The entry column group 330a includes entry columns 331a, 332a, and 333a. The entry fields 331a, 332a, 333a are fields for entering the same contents as the entry fields 331, 332, 333. Compared with the form 300, the areas occupied by the entry fields 328a and 329a are expanded, so that the entry fields 331a, 332a and 333a are moved downward toward the paper surface of FIG.

領域識別情報３４０は、記入欄群３３０ａを囲う線であり、四角形の各辺を形成している。
記入欄群３２０ａ，３３０ａの各欄の枠線および帳票３００ａ上の一部箇所には、帳票３００と同様にドロップアウト色が付される。領域識別情報３４０には、ドロップアウト色以外の色が付される。帳票読取装置１００が領域識別情報３４０の検出処理を適切に行えるようにするためである（後述する）。 The area identification information 340 is a line surrounding the entry field group 330a and forms each side of a rectangle.
Similar to the form 300, dropout colors are given to the frame lines of the respective fields of the entry field groups 320a and 330a and a part of the form 300a. The region identification information 340 is assigned a color other than the dropout color. This is so that the form reading apparatus 100 can appropriately perform the detection processing of the area identification information 340 (described later).

図６は、第２の実施の形態の帳票読取装置の機能構成を示すブロック図である。帳票読取装置１００は、記憶部１１０、帳票ＩＤ読取部１２０、文字認識処理部１３０および領域特定部１４０を有する。これらの各機能は、例えばＣＰＵ１０１が所定のプログラムを実行することにより、帳票読取装置１００上に実現される。これらの各機能の全部または一部を専用のハードウェアで実装してもよい。 FIG. 6 is a block diagram illustrating a functional configuration of the form reading apparatus according to the second embodiment. The form reading apparatus 100 includes a storage unit 110, a form ID reading unit 120, a character recognition processing unit 130, and an area specifying unit 140. Each of these functions is realized on the form reading apparatus 100 by the CPU 101 executing a predetermined program, for example. All or some of these functions may be implemented by dedicated hardware.

記憶部１１０は、帳票のレイアウトを定義したレイアウト定義テーブルを記憶する。
帳票ＩＤ読取部１２０は、イメージスキャナ１４から受信したカラーの帳票画像に含まれる帳票ＩＤを読み取る。このとき、帳票ＩＤ読取部１２０は、該帳票画像に対してドロップアウト処理を行う。ドロップアウト色を何れの色とするかは、例えば、帳票ＩＤに対応付けて記憶部１１０に予め設定される。また、帳票ＩＤは、例えば、帳票上の所定の位置に印字される。よって、帳票ＩＤ読取部１２０に、その位置（例えば、帳票画像上の座標値）を予め設定しておけば、帳票ＩＤ読取部１２０は、帳票画像中の該座標位置から帳票ＩＤを読み取れる。あるいは、帳票ＩＤ読取部１２０は、帳票ＩＤのフォーマット（例えば、数値や桁数など）に基づいて、該フォーマットに合致する文字列を帳票ＩＤとして読み取ってもよい。帳票ＩＤ読取部１２０は、帳票ＩＤの読み取り結果を文字認識処理部１３０に出力する。 The storage unit 110 stores a layout definition table that defines the layout of a form.
The form ID reading unit 120 reads the form ID included in the color form image received from the image scanner 14. At this time, the form ID reading unit 120 performs a dropout process on the form image. Which color is used as the dropout color is preset in the storage unit 110 in association with the form ID, for example. The form ID is printed at a predetermined position on the form, for example. Therefore, if the position (for example, coordinate value on the form image) is set in advance in the form ID reading unit 120, the form ID reading unit 120 can read the form ID from the coordinate position in the form image. Alternatively, the form ID reading unit 120 may read a character string that matches the format as the form ID based on the format (for example, a numerical value or the number of digits) of the form ID. The form ID reading unit 120 outputs the result of reading the form ID to the character recognition processing unit 130.

なお、帳票ＩＤ読取部１２０は、イメージスキャナ１４から受信した帳票画像および該帳票画像にドロップアウト処理を行った後の画像情報をＲＡＭ１０３上の所定の領域に格納する。以降の処理において、文字認識処理部１３０および領域特定部１４０は、ＲＡＭ１０３上の該領域を参照して帳票画像（ドロップアウト処理後の画像情報を含む）に対する各部の処理を実行する。 The form ID reading unit 120 stores the form image received from the image scanner 14 and the image information after the dropout process is performed on the form image in a predetermined area on the RAM 103. In the subsequent processing, the character recognition processing unit 130 and the region specifying unit 140 refer to the region on the RAM 103 and execute processing of each unit for the form image (including image information after the dropout processing).

文字認識処理部１３０は、帳票に対してレイアウト認識による文字認識処理を実行する。文字認識処理部１３０は、帳票ＩＤが改訂された旨を示している場合、該帳票画像に対してレイアウト認識およびキーワード認識による文字認識を行う。文字認識処理部１３０は、文字認識の結果をモニタ１１に表示させる。オペレータに結果の正誤確認などを促すためである。オペレータは、読取結果（データ項目および文字列データの内容）が適正であれば、キーボード１２やマウス１３を操作して、該読取結果のサーバ装置２００への送信を文字認識処理部１３０に指示できる。文字認識処理部１３０は、この指示を受け付けると、読取結果をサーバ装置２００に送信して、取引に関する処理の実行を要求する。 The character recognition processing unit 130 executes character recognition processing by layout recognition on the form. When the form ID indicates that the form ID has been revised, the character recognition processing unit 130 performs character recognition by layout recognition and keyword recognition on the form image. The character recognition processing unit 130 displays the result of character recognition on the monitor 11. This is to prompt the operator to check the correctness of the result. If the reading result (data item and character string data contents) is appropriate, the operator can operate the keyboard 12 and the mouse 13 to instruct the character recognition processing unit 130 to transmit the reading result to the server device 200. . When the character recognition processing unit 130 receives this instruction, the character recognition processing unit 130 transmits the reading result to the server device 200 and requests execution of processing related to the transaction.

文字認識処理部１３０は、レイアウト認識処理部１３１およびキーワード認識処理部１３２を有する。
レイアウト認識処理部１３１は、帳票ＩＤと、記憶部１１０に記憶されたレイアウト定義テーブルと、に基づいて、帳票画像に対するレイアウト認識処理（レイアウト認識による文字認識）を行う。 The character recognition processing unit 130 includes a layout recognition processing unit 131 and a keyword recognition processing unit 132.
The layout recognition processing unit 131 performs layout recognition processing (character recognition by layout recognition) on the form image based on the form ID and the layout definition table stored in the storage unit 110.

キーワード認識処理部１３２は、帳票ＩＤと、記憶部１１０に記憶されたレイアウト定義テーブルと、に基づいて、帳票画像に対するキーワード認識処理（キーワード認識による文字認識処理）を行う。その際、キーワード認識処理部１３２は、帳票画像上の処理対象領域の特定を、領域特定部１４０に依頼する。 The keyword recognition processing unit 132 performs keyword recognition processing (character recognition processing by keyword recognition) on the form image based on the form ID and the layout definition table stored in the storage unit 110. At that time, the keyword recognition processing unit 132 requests the region specifying unit 140 to specify the processing target region on the form image.

領域特定部１４０は、キーワード認識処理部１３２の依頼に応じて、帳票画像から領域特定情報を読み取る。領域特定情報は、所定の太さおよび線の種別で帳票上に印字される。よって、領域特定部１４０にその太さおよび線の種別を予め設定しておけば、領域特定部１４０はその設定内容に基づいて、帳票画像から領域特定情報を読み取れる。 The area specifying unit 140 reads area specifying information from the form image in response to a request from the keyword recognition processing unit 132. The area specifying information is printed on the form with a predetermined thickness and line type. Therefore, if the thickness and line type are set in advance in the area specifying unit 140, the area specifying unit 140 can read the area specifying information from the form image based on the setting contents.

領域特定情報は、多角形の頂点などを示す情報としてもよい。その場合には、領域特定部１４０に、領域特定情報として認識すべき数値、文字、記号、図形およびこれらの組合せなどを予め設定しておけばよい。領域特定部１４０は、特定した領域を示す情報（例えば、該領域を示す座標値）をキーワード認識処理部１３２に出力する。 The area specifying information may be information indicating a vertex of the polygon. In that case, a numerical value, a character, a symbol, a figure, a combination thereof, and the like to be recognized as the area specifying information may be set in advance in the area specifying unit 140. The area specifying unit 140 outputs information indicating the specified area (for example, coordinate values indicating the area) to the keyword recognition processing unit 132.

図７は、第２の実施の形態のレイアウト定義テーブルの例を示す図である。レイアウト定義テーブル１１１は、記憶部１１０に記憶される。レイアウト定義テーブル１１１には、帳票ＩＤ、データ項目名、記入欄の座標、カテゴリおよび手活区分の項目を含む。各項目の横方向に並べられた情報同士が互いに関連付けられて、帳票上に記入される１つの内容を特定するための情報を示す。 FIG. 7 is a diagram illustrating an example of a layout definition table according to the second embodiment. The layout definition table 111 is stored in the storage unit 110. The layout definition table 111 includes items such as form ID, data item name, entry column coordinates, category, and manual activity classification. The information arranged in the horizontal direction of each item is associated with each other, and indicates information for specifying one content to be entered on the form.

帳票ＩＤの項目には、帳票ＩＤ（枝番を除く）が設定される。データ項目名の項目には、データ項目名が設定される。
記入欄の座標の項目には、記入欄の座標値が設定される。ここで、１つの記入欄の座標値は、図４で示した帳票３００を例にとると、該記入欄の図４の紙面に向かって左上側の座標値と、該記入欄の同右下側の座標と、を指定することで示される。座標値は、帳票３００の図４の紙面に向かって左上の頂点を原点として、同右向き方向をｘ座標の正方向、同下向き方向をｙ座標の正方向と定義するものとする。ただし、これとは異なる座標系で記入欄の位置を表してもよい。 A form ID (excluding branch numbers) is set in the form ID item. The data item name is set in the data item name item.
The coordinate value of the entry field is set in the coordinate field of the entry field. Here, taking the form 300 shown in FIG. 4 as an example, the coordinate value of one entry column is the upper left coordinate value of the entry column in FIG. 4 and the lower right side of the entry column. This is indicated by specifying the coordinates of. The coordinate values are defined by defining the upper left vertex of the form 300 in FIG. 4 as the origin, the right direction as the positive direction of the x coordinate, and the downward direction as the positive direction of the y coordinate. However, the position of the entry field may be expressed in a different coordinate system.

カテゴリの項目には、記入欄に記入される文字列が数値、カタカナ、漢字および記号などのうち、何れの種類であるかが設定される。手活区分の項目には、該記入欄に記入される文字列が手書きのものであるか活字として印字されるものであるかを示す情報が設定される。 In the category item, it is set which type of character string to be entered in the entry field is numeric, katakana, kanji, or symbol. Information indicating whether the character string to be entered in the entry field is handwritten or printed as a type is set in the item of hand type.

例えば、レイアウト定義テーブル１１１には、帳票ＩＤが“２００１”、データ項目名が“申込日”、記入欄の座標が“Ｓ（３５，２０），Ｅ（６０，３０）”、カテゴリが“数値”、手活区分が“手書き”という情報が設定される。なお、記入欄の座標の設定につき“Ｓ”の文字は記入欄の左上側の座標値に付されるものである。同様に、“Ｅ”の文字は記入欄の右下側の座標値に付されるものである。 For example, in the layout definition table 111, the form ID is “2001”, the data item name is “application date”, the coordinates of the entry column are “S (35, 20), E (60, 30)”, and the category is “numeric value”. ", The information that the hand activity classification is" handwritten "is set. Note that the character “S” is assigned to the coordinate value on the upper left side of the entry field for setting the coordinates in the entry field. Similarly, the letter “E” is attached to the coordinate value on the lower right side of the entry field.

よって、このレコードは、帳票ＩＤの上４桁が“２００１”の帳票３００の帳票画像につき、データ項目名“申込日”に対応する記入欄が帳票画像の（ｘ，ｙ）＝（３５，２０）および（ｘ，ｙ）＝（６０，３０）を対角にもつ長方形の領域であることを示す。また、この記入欄に記入される文字列が数値であり、該文字列が顧客により手書きで記入されるものであることを示す。更に、このレコードは、帳票ＩＤの上４桁が“２００１”の帳票３００ａの帳票画像についても、同様の記入欄が含まれ得ることを示している。 Therefore, in this record, the entry field corresponding to the data item name “application date” is the form image (x, y) = (35, 20) for the form image of the form 300 whose first four digits are “2001”. ) And (x, y) = (60, 30) indicates a rectangular region having diagonal lines. The character string entered in the entry field is a numerical value, and indicates that the character string is handwritten by the customer. Further, this record indicates that the same entry field can be included for the form image of the form 300a whose first four digits of the form ID are “2001”.

次に、以上の構成の帳票読取装置１００の処理手順を説明する。
図８は、第２の実施の形態の帳票読取処理を示すフローチャートである。以下、図８に示す処理をステップ番号に沿って説明する。 Next, a processing procedure of the form reading apparatus 100 having the above configuration will be described.
FIG. 8 is a flowchart illustrating a form reading process according to the second embodiment. In the following, the process illustrated in FIG. 8 will be described in order of step number.

（ステップＳ１１）帳票ＩＤ読取部１２０は、イメージスキャナ１４から受信したカラーの帳票画像をＲＡＭ１０３上の所定領域に格納する。帳票ＩＤ読取部１２０は、該帳票画像に対してドロップアウト処理を行い、ドロップアウト処理後の帳票画像をＲＡＭ１０３上の他の所定領域に格納する。 (Step S 11) The form ID reading unit 120 stores the color form image received from the image scanner 14 in a predetermined area on the RAM 103. The form ID reading unit 120 performs a dropout process on the form image, and stores the form image after the dropout process in another predetermined area on the RAM 103.

（ステップＳ１２）帳票ＩＤ読取部１２０は、ドロップアウト処理後の帳票画像から帳票ＩＤを認識する。例えば、帳票ＩＤ読取部１２０は、該帳票画像の所定位置から帳票ＩＤを読み取れる。帳票ＩＤ読取部１２０は、読み取った帳票ＩＤを文字認識処理部１３０に出力する。 (Step S12) The form ID reading unit 120 recognizes the form ID from the form image after the dropout process. For example, the form ID reading unit 120 can read the form ID from a predetermined position of the form image. The form ID reading unit 120 outputs the read form ID to the character recognition processing unit 130.

（ステップＳ１３）文字認識処理部１３０は、レイアウト認識処理部１３１に処理を委譲し、レイアウト認識処理を実行させる。レイアウト認識処理部１３１は、帳票ＩＤ読取部１２０から取得した帳票ＩＤに基づいて、記憶部１１０に記憶されたレイアウト定義テーブル１１１を参照し、ドロップアウト処理後の帳票画像に対してレイアウト認識による文字認識処理を行う。 (Step S 13) The character recognition processing unit 130 delegates the processing to the layout recognition processing unit 131 and causes the layout recognition processing to be executed. The layout recognition processing unit 131 refers to the layout definition table 111 stored in the storage unit 110 on the basis of the form ID acquired from the form ID reading unit 120, and performs character recognition by layout recognition on the form image after the dropout process. Perform recognition processing.

（ステップＳ１４）文字認識処理部１３０は、帳票ＩＤ読取部１２０から取得した帳票ＩＤの下２桁が“０１”以上であるか否かを判定する。“０１”以上である場合（すなわち、改訂された帳票である場合）、処理をステップＳ１５に進める。“０１”以上でない場合（すなわち、改訂された帳票でない場合）、処理をステップＳ１６に進める。 (Step S14) The character recognition processing unit 130 determines whether or not the last two digits of the form ID acquired from the form ID reading unit 120 is “01” or more. If it is “01” or more (that is, a revised form), the process proceeds to step S15. If it is not “01” or more (that is, if it is not a revised form), the process proceeds to step S16.

（ステップＳ１５）文字認識処理部１３０は、キーワード認識処理部１３２に処理を委譲し、キーワード認識処理を実行させる。該キーワード認識処理については後述する。
（ステップＳ１６）文字認識処理部１３０は、帳票画像に対する文字認識結果を出力し、モニタ１１に該結果を表示させる。 (Step S15) The character recognition processing unit 130 delegates the processing to the keyword recognition processing unit 132 and causes the keyword recognition processing to be executed. The keyword recognition process will be described later.
(Step S16) The character recognition processing unit 130 outputs a character recognition result for the form image and causes the monitor 11 to display the result.

このように、文字認識処理部１３０は、ドロップアウト処理後の帳票画像に対してレイアウト認識処理を実行する。更に、文字認識処理部１３０は、帳票画像に含まれる帳票ＩＤに応じて、キーワード認識処理の実行要否を判断する。文字認識処理部１３０は、改訂された帳票に対してキーワード認識処理を実行し、未改訂の帳票にはキーワード認識処理を実行しない。 As described above, the character recognition processing unit 130 executes the layout recognition process on the form image after the dropout process. Further, the character recognition processing unit 130 determines whether or not the keyword recognition process is necessary according to the form ID included in the form image. The character recognition processing unit 130 performs the keyword recognition process on the revised form, and does not execute the keyword recognition process on the unrevised form.

次に、ステップＳ１１における、帳票３００ａに関する、ドロップアウト処理後の帳票画像を例示する。
図９は、第２の実施の形態のドロップアウト処理後の帳票画像の例を示す図である。帳票画像３００ｂは、イメージスキャナ１４が撮像した帳票画像に対してドロップアウト処理を行って得られた画像情報である。 Next, the form image after the dropout process related to the form 300a in step S11 is illustrated.
FIG. 9 is a diagram illustrating an example of a form image after the dropout process according to the second embodiment. The form image 300b is image information obtained by performing dropout processing on the form image captured by the image scanner 14.

帳票画像３００ｂでは、ドロップアウト色で帳票３００ａに印字された枠線などが消去されている。顧客が枠線内に記入した文字列の色は、ドロップアウト色以外の色で記入されており、ドロップアウト対象にはならない（ドロップアウト処理によって消去されない）。領域識別情報３４０も同様である。 In the form image 300b, the frame line printed on the form 300a in the dropout color is deleted. The color of the character string entered by the customer in the frame line is entered in a color other than the dropout color and is not a dropout target (it is not erased by the dropout process). The same applies to the area identification information 340.

レイアウト認識処理部１３１は、レイアウト定義テーブル１１１を参照して、帳票画像３００ｂ（帳票ＩＤの上４桁が“２００１”）内の所定の座標に記入された文字列を、口座番号や氏名などのデータ項目に対応付けて取得できる。例えば、データ項目名“口座番号”のデータを帳票画像３００ｂから抽出する場合、レイアウト認識処理部１３１は次の処理を行う。 The layout recognition processing unit 131 refers to the layout definition table 111 and uses a character string written at predetermined coordinates in the form image 300b (the first four digits of the form ID is “2001”) as an account number, a name, and the like. Can be acquired in association with data items. For example, when extracting the data of the data item name “account number” from the form image 300b, the layout recognition processing unit 131 performs the following processing.

レイアウト認識処理部１３１は、レイアウト定義テーブル１１１に基づき、“口座番号”に対応する枠３２３ａを特定する。具体的には、帳票画像３００ｂの座標（ｘ，ｙ）＝（１４０，２０）を図９の紙面に向かって左上頂点とし、座標（ｘ，ｙ）＝（１６０，３０）を同右下頂点とする長方形が枠３２３ａである。なお、図９において点線で図示した枠３２３ａの枠線は、帳票画像３００ｂには含まれない。レイアウト認識処理部１３１は、枠３２３ａ内に記入された文字画像を取得する。レイアウト認識処理部１３１は、該文字画像と予め記憶部１１０に格納された文字パターンとの照合を行い、“９９９９９９９”の文字列を抽出する。このときレイアウト定義テーブル１１１によれば、この位置のデータのカテゴリが数値である。よって、レイアウト認識処理部１３１は、数値の文字パターンと、文字画像との照合を行えばよい。 The layout recognition processing unit 131 identifies a frame 323 a corresponding to “account number” based on the layout definition table 111. Specifically, the coordinates (x, y) = (140, 20) of the form image 300b are set as the upper left vertex toward the paper surface of FIG. 9, and the coordinates (x, y) = (160, 30) are set as the lower right vertex. A rectangle to be displayed is a frame 323a. Note that the frame line 323a illustrated by the dotted line in FIG. 9 is not included in the form image 300b. The layout recognition processing unit 131 obtains a character image entered in the frame 323a. The layout recognition processing unit 131 collates the character image with the character pattern stored in the storage unit 110 in advance, and extracts a character string “9999999”. At this time, according to the layout definition table 111, the category of the data at this position is a numerical value. Therefore, the layout recognition processing unit 131 only has to collate numerical character patterns with character images.

このようにして、レイアウト認識処理部１３１は、データ項目“口座番号”に対する文字列データ“９９９９９９９”を取得する。
また、領域識別情報３４０は、ドロップアウト色以外の色で印字されている。このため、領域識別情報３４０は、ドロップアウト対象にはならず、帳票画像３００ｂに含まれる。帳票画像３００ｂには、他の枠線はドロップアウト処理により消去されているので、領域特定部１４０は、領域識別情報３４０を容易に検出できる。 In this way, the layout recognition processing unit 131 acquires the character string data “9999999” for the data item “account number”.
The area identification information 340 is printed in a color other than the dropout color. Therefore, the area identification information 340 is not a dropout target and is included in the form image 300b. In the form image 300b, the other frame lines are erased by the dropout process, so that the area specifying unit 140 can easily detect the area identification information 340.

次に、図８のステップＳ１５で説明したキーワード認識処理の手順を説明する。以下、ＲＡＭ１０３上には次の情報が取得されているものとする。
（１）「帳票３００ａの帳票画像」
（２）「帳票３００ａの帳票画像」にドロップアウト処理を実行して得られた「帳票画像３００ｂ」
「帳票３００ａの帳票画像」という場合、帳票３００ａに含まれる全領域をカラーで取得した画像情報を示す。「帳票画像３００ｂ」という場合、「帳票３００ａの帳票画像」にドロップアウト処理を実行して得られた画像情報を示す。また、何れの帳票画像においても、領域識別情報３４０を同一の符号で指し示すものとする。 Next, the procedure of the keyword recognition process described in step S15 in FIG. 8 will be described. Hereinafter, it is assumed that the following information is acquired on the RAM 103.
(1) “Form image of form 300a”
(2) “Form image 300b” obtained by executing dropout processing on “Form image of form 300a”
The “form image of the form 300a” indicates image information obtained in color for all areas included in the form 300a. In the case of “form image 300b”, “form image of form 300a” indicates image information obtained by executing the dropout process. In any form image, the region identification information 340 is indicated by the same symbol.

図１０は、第２の実施の形態のキーワード認識処理を示すフローチャートである。以下、図１０に示す処理をステップ番号に沿って説明する。
（ステップＳ２１）キーワード認識処理部１３２は、レイアウト認識処理部１３１によるレイアウト認識処理の結果をＲＡＭ１０３またはＨＤＤ１０４上の所定領域に退避させる。 FIG. 10 is a flowchart illustrating keyword recognition processing according to the second embodiment. In the following, the process illustrated in FIG. 10 will be described in order of step number.
(Step S21) The keyword recognition processing unit 132 saves the result of the layout recognition processing by the layout recognition processing unit 131 in a predetermined area on the RAM 103 or the HDD 104.

（ステップＳ２２）キーワード認識処理部１３２は、キーワード認識処理の対象領域の特定を領域特定部１４０に依頼する。領域特定部１４０は、帳票画像３００ｂから領域識別情報３４０を検出する。例えば、領域特定部１４０には、領域識別情報３４０に関する情報（形状の種類、線の太さ、実線・破線といった線の種別など）が予め設定される。ここで、形状の種類とは、矩形の枠線である、楕円形の枠線である、多角形の頂点に付された記号（その記号がどのようなものかを含む）である、などの情報である。領域特定部１４０は、該情報に基づき、帳票画像３００ｂから領域識別情報３４０を検出できる。領域特定部１４０は、特定した処理対象領域を示す領域情報をキーワード認識処理部１３２に出力する。領域情報は、例えば、処理対象領域の各頂点の座標値である。領域特定部１４０は、処理対象領域を複数検出した場合には、複数の領域情報をキーワード認識処理部１３２に出力する。 (Step S22) The keyword recognition processing unit 132 requests the region specifying unit 140 to specify the target region for the keyword recognition processing. The area specifying unit 140 detects area identification information 340 from the form image 300b. For example, information related to the region identification information 340 (type of shape, line thickness, line type such as solid line / broken line, etc.) is preset in the region specifying unit 140. Here, the type of shape is a rectangular frame line, an elliptical frame line, a symbol attached to the vertex of a polygon (including what the symbol is), etc. Information. The area specifying unit 140 can detect the area identification information 340 from the form image 300b based on the information. The area specifying unit 140 outputs area information indicating the specified processing target area to the keyword recognition processing unit 132. The area information is, for example, the coordinate value of each vertex of the processing target area. When the region specifying unit 140 detects a plurality of processing target regions, the region specifying unit 140 outputs a plurality of region information to the keyword recognition processing unit 132.

（ステップＳ２３）キーワード認識処理部１３２は、領域特定部１４０から取得した領域情報に基づき、そのうちの１つの未処理領域のカラーイメージを帳票３００ａの帳票画像から抽出する。例えば、キーワード認識処理部１３２は、領域識別情報３４０内の領域（処理対象領域）のカラーイメージを抽出する。 (Step S23) The keyword recognition processing unit 132 extracts a color image of one unprocessed area from the form image of the form 300a based on the area information acquired from the area specifying unit 140. For example, the keyword recognition processing unit 132 extracts a color image of an area (processing target area) in the area identification information 340.

（ステップＳ２４）キーワード認識処理部１３２は、ステップＳ２３でカラーイメージを取得した処理対象領域からキーワードを１つ抽出する。例えば、キーワード認識処理部１３２は、処理対象領域に含まれる枠線を検出して、該枠線で囲われる枠ごとにキーワードの抽出を試みる。例えば、キーワード認識処理部１３２は、記入欄３３１ａに含まれるキーワード“電話番号”を抽出する。 (Step S24) The keyword recognition processing unit 132 extracts one keyword from the processing target area from which the color image has been acquired in Step S23. For example, the keyword recognition processing unit 132 detects a frame line included in the processing target region, and tries to extract a keyword for each frame surrounded by the frame line. For example, the keyword recognition processing unit 132 extracts the keyword “phone number” included in the entry field 331a.

（ステップＳ２５）キーワード認識処理部１３２は、帳票３００ａの帳票ＩＤおよび記憶部１１０に記憶されたレイアウト定義テーブル１１１を参照して、該キーワードが帳票３００ａに含まれ得るデータ項目名に一致するか否か判定する。一致する場合、処理をステップＳ２６に進める。一致しない場合、該キーワードを破棄して、処理をステップＳ２９に進める。例えば、キーワード認識処理部１３２は、キーワード“電話番号”が、帳票３００ａの帳票ＩＤ“２００１０１”の上４桁“２００１”に対応付けられて、レイアウト定義テーブル１１１に設定されていることを検知する。この場合、該キーワード“電話番号“が帳票３００ａに含まれ得るデータ項目名に一致すると判定する。また、該キーワードが存在するセルが見出し部分のセルである。ここで、セルは、ドロップアウト色の枠線で囲われた１つの枠を示す。 (Step S25) The keyword recognition processing unit 132 refers to the form ID of the form 300a and the layout definition table 111 stored in the storage unit 110, and determines whether or not the keyword matches a data item name that can be included in the form 300a. To determine. If they match, the process proceeds to step S26. If they do not match, the keyword is discarded and the process proceeds to step S29. For example, the keyword recognition processing unit 132 detects that the keyword “phone number” is set in the layout definition table 111 in association with the first four digits “2001” of the form ID “200101” of the form 300a. . In this case, it is determined that the keyword “phone number” matches the data item name that can be included in the form 300a. A cell in which the keyword exists is a heading cell. Here, the cell indicates one frame surrounded by a drop-out color frame line.

（ステップＳ２６）キーワード認識処理部１３２は、該データ項目に対する記入欄の位置（データ部）を特定する。例えば、データ部の位置は、「キーワードが存在するセルの右側に隣接するセル」のようにキーワード認識処理部１３２に予め設定される。あるいは、キーワードや、帳票３００ａ上の表構造に基づき、該キーワードに対応するデータ部を特定してもよい。キーワードとデータ部との対応付け方法としては、例えば、特開２０１０−３１５５号公報に記載された方法を用いることができる。 (Step S26) The keyword recognition processing unit 132 specifies the position (data part) of the entry field for the data item. For example, the position of the data part is set in advance in the keyword recognition processing unit 132 as “a cell adjacent to the right side of the cell in which the keyword exists”. Alternatively, the data portion corresponding to the keyword may be specified based on the keyword or the table structure on the form 300a. As a method for associating the keyword with the data portion, for example, a method described in JP 2010-3155 A can be used.

（ステップＳ２７）キーワード認識処理部１３２は、データ部の特徴を解析し、データ部のイメージに対して枠線やノイズなどを除去し、文字認識に適したイメージに加工する。 (Step S 27) The keyword recognition processing unit 132 analyzes the characteristics of the data part, removes frame lines, noise, and the like from the image of the data part, and processes the image into an image suitable for character recognition.

（ステップＳ２８）キーワード認識処理部１３２は、データ部に記入された文字画像を取得する。キーワード認識処理部１３２は、該文字画像と予め記憶部１１０に格納された文字パターンとの照合を行い、該データ部に記入された文字列データを取得する。例えば、キーワード認識処理部１３２は、データ項目“電話番号”に対して文字列データ“０００−００００−００００”を取得する。 (Step S 28) The keyword recognition processing unit 132 acquires a character image entered in the data portion. The keyword recognition processing unit 132 collates the character image with a character pattern stored in the storage unit 110 in advance, and acquires character string data entered in the data unit. For example, the keyword recognition processing unit 132 acquires character string data “000-0000-0000” for the data item “phone number”.

（ステップＳ２９）キーワード認識処理部１３２は、現在の処理対象領域内でキーワードを未抽出の箇所が存在するか否かを判定する。キーワード未抽出の箇所が存在する場合、処理をステップＳ２４に進める。キーワード未抽出の箇所が存在しない場合、処理をステップＳ３０に進める。 (Step S 29) The keyword recognition processing unit 132 determines whether or not there is a portion where no keyword has been extracted in the current processing target area. If there is a keyword-unextracted portion, the process proceeds to step S24. If there is no keyword-unextracted portion, the process proceeds to step S30.

（ステップＳ３０）キーワード認識処理部１３２は、現在の処理対象領域以外にも、未処理の処理対象領域が存在するか否かを判定する。未処理の処理対象領域が存在する場合、処理をステップＳ２３に進める。全ての処理対象領域につき処理済の場合、処理をステップＳ３１に進める。 (Step S30) The keyword recognition processing unit 132 determines whether there is an unprocessed processing target area other than the current processing target area. If there is an unprocessed processing target area, the process proceeds to step S23. If all the process target areas have been processed, the process proceeds to step S31.

（ステップＳ３１）キーワード認識処理部１３２は、一時退避させたレイアウト認識処理の結果とキーワード認識の結果とをマージする。具体的には、レイアウト定義テーブル１１１に定義された帳票３００ａのデータ項目のうち、レイアウト認識処理では取得できなかったものを、キーワード認識処理で取得した内容で補完する。 (Step S31) The keyword recognition processing unit 132 merges the temporarily saved layout recognition result and the keyword recognition result. Specifically, among the data items of the form 300a defined in the layout definition table 111, those that could not be acquired by the layout recognition process are supplemented with the contents acquired by the keyword recognition process.

このようにして、キーワード認識処理部１３２は、帳票３００ａの変更された領域につき、キーワード認識による文字認識を行う。
図１１は、第２の実施の形態のキーワード認識対象領域の例を示す図である。キーワード認識対象領域は、領域識別情報３４０の内側の領域である。該領域内には、キーワード３５１，３５２，３５３も示されている。 In this way, the keyword recognition processing unit 132 performs character recognition by keyword recognition for the changed area of the form 300a.
FIG. 11 is a diagram illustrating an example of a keyword recognition target area according to the second embodiment. The keyword recognition target area is an area inside the area identification information 340. In the area, keywords 351, 352, and 353 are also shown.

キーワード３５１は、文字列“電話番号”を示しており、記憶部１１０に記憶されたレイアウト定義テーブル１１１によれば、帳票３００ａの帳票画像内に含まれ得るデータ項目名“電話番号”に一致する。 The keyword 351 indicates the character string “telephone number”, and according to the layout definition table 111 stored in the storage unit 110, matches the data item name “telephone number” that can be included in the form image of the form 300a. .

キーワード３５２は、文字列“性別”を示しており、帳票３００ａの帳票画像内に含まれ得るデータ項目名“性別”に一致する。
キーワード３５３は、文字列“生年月日”を示しており、帳票３００ａの帳票画像内に含まれ得るデータ項目名“生年月日”に一致する。 The keyword 352 indicates the character string “sex” and matches the data item name “sex” that can be included in the form image of the form 300a.
The keyword 353 indicates the character string “date of birth” and matches the data item name “date of birth” that can be included in the form image of the form 300a.

キーワード認識処理部１３２は、キーワード３５１を検出すると、キーワード３５１が存在するセルの右側に隣接するセル３５１ａをデータ部と特定する。そして、キーワード認識処理部１３２は、データ項目“電話番号”に対して文字列データ“０００−００００−００００”を取得する。なお、キーワード３５１を含むセルおよびセル３５１ａが図５で示した記入欄３３１ａに対応する。 When the keyword recognition processing unit 132 detects the keyword 351, the keyword recognition processing unit 132 identifies the cell 351a adjacent to the right side of the cell in which the keyword 351 exists as the data unit. Then, the keyword recognition processing unit 132 acquires character string data “000-0000-0000” for the data item “phone number”. Note that the cell including the keyword 351 and the cell 351a correspond to the entry field 331a shown in FIG.

また、キーワード認識処理部１３２は、キーワード３５２を検出すると、キーワード３５２が存在するセルの右側に隣接するセル３５２ａをデータ部と特定する。そして、キーワード認識処理部１３２は、データ項目“性別”に対して、“男”を選択していることを示す選択記号（チェックマーク）を取得する。このような選択記号も文字列データに含まれる。なお、キーワード３５２を含むセルおよびセル３５２ａが図５で示した記入欄３３２ａに対応する。 Further, when the keyword recognition processing unit 132 detects the keyword 352, the keyword recognition processing unit 132 identifies the cell 352a adjacent to the right side of the cell in which the keyword 352 exists as the data unit. Then, the keyword recognition processing unit 132 acquires a selection symbol (check mark) indicating that “male” is selected for the data item “sex”. Such a selection symbol is also included in the character string data. Note that the cell including the keyword 352 and the cell 352a correspond to the entry field 332a shown in FIG.

更に、キーワード認識処理部１３２は、キーワード３５３を検出するとキーワード３５３が存在するセルの右側に隣接するセル３５３ａをデータ部と特定する。そして、キーワード認識処理部１３２は、データ項目“生年月日”に対して、文字列データ“１９８５”年“１１”月“１５”日を取得する。なお、キーワード３５３を含むセルおよびセル３５３ａが図５で示した記入欄３３３ａに対応する。 Furthermore, when the keyword recognition processing unit 132 detects the keyword 353, the cell 353a adjacent to the right side of the cell in which the keyword 353 exists is specified as the data unit. Then, the keyword recognition processing unit 132 acquires the character string data “1985” “11” month “15” day for the data item “birth date”. Note that the cell including the keyword 353 and the cell 353a correspond to the entry field 333a shown in FIG.

なお、該領域内には、“男”、“女”、“年”、“月”、“日”などの他の文字も含まれている。しかし、これらは何れもレイアウト定義テーブル１１１内で帳票３００ａ内のデータ項目名として定義されていない。よって、キーワード認識処理部１３２は、これら他の文字に関しては、データ項目名とはみなさない。 The area includes other characters such as “male”, “female”, “year”, “month”, and “day”. However, none of these are defined as data item names in the form 300a in the layout definition table 111. Therefore, the keyword recognition processing unit 132 does not regard these other characters as data item names.

図１２は、第２の実施の形態の帳票ＩＤの例を示す図である。帳票ＩＤ３１０ａの印字方法は、上記の方法に限られない。例えば、次のように印字してもよい。
図１２（Ａ）では、帳票ＩＤ３１０ｂが示されている。帳票ＩＤ３１０ｂは、帳票ＩＤ３１０と帳票ＩＤ３１０の下側に印字された枝番３１１との組み合せにより、帳票３００ａが改訂後のものであることを示している。枝番３１１は、“０１”などの数値により表されている。 FIG. 12 is a diagram illustrating an example of a form ID according to the second embodiment. The printing method of the form ID 310a is not limited to the above method. For example, you may print as follows.
In FIG. 12A, a form ID 310b is shown. The form ID 310b indicates that the form 300a has been revised by a combination of the form ID 310 and the branch number 311 printed below the form ID 310. The branch number 311 is represented by a numerical value such as “01”.

図１２（Ｂ）では、帳票ＩＤ３１０ｃが示されている。帳票ＩＤ３１０ｃは、帳票ＩＤ３１０と、帳票ＩＤ３１０の位置から紙面に向かって右下側に印字された枝番３１１ａとの組み合わせにより、帳票３００ａが改訂後のものであることを示している。枝番３１１ａは、“０１”などの数値により表されている。 In FIG. 12B, a form ID 310c is shown. The form ID 310c indicates that the form 300a has been revised by a combination of the form ID 310 and the branch number 311a printed on the lower right side from the position of the form ID 310 toward the paper surface. The branch number 311a is represented by a numerical value such as “01”.

図１２（Ａ）（Ｂ）では、該帳票が改訂されていない場合には、枝番３１１，３１１ａを“００”とするか、あるいは枝番３１１，３１１ａを印字しないようにすることが考えられる。 In FIGS. 12A and 12B, when the form is not revised, it is conceivable that the branch numbers 311 and 311a are set to “00” or the branch numbers 311 and 311a are not printed. .

図１２（Ｃ）では、帳票ＩＤ３１０ｄが示されている。帳票ＩＤ３１０ｄは、帳票ＩＤ３１０と、帳票ＩＤ３１０の位置から紙面に向かって右側に印字された枝番３１１ｂとの組み合わせにより、帳票３００ａが改訂後のものであることを示している。枝番３１１ｂは、“Ｒ”などの文字（または文字列）により表されている。 In FIG. 12C, a form ID 310d is shown. The form ID 310d indicates that the form 300a has been revised by a combination of the form ID 310 and the branch number 311b printed on the right side from the position of the form ID 310 toward the paper surface. The branch number 311b is represented by a character (or character string) such as “R”.

図１２（Ｃ）では、該帳票が改訂されていない場合には、枝番３１１ｂを他の文字（または文字列）とするか、あるいは枝番３１１ｂを印字しないようにすることが考えられる。 In FIG. 12C, when the form is not revised, it is conceivable that the branch number 311b is set to another character (or character string) or the branch number 311b is not printed.

このように、帳票ＩＤの表記方法には種々の方法を採れる。上記の例は、その一例であり、上記以外の位置、数値、記号および文字（または文字列）などで帳票ＩＤを表記してもよい。何れの位置または数値などにより、帳票ＩＤが表記されているかは、上述したように帳票ＩＤ読取部１２０に予め設定される。 As described above, various methods can be used for the form ID. The above example is one example, and the form ID may be expressed by a position, numerical value, symbol, character (or character string), etc. other than the above. Which position or numerical value indicates the form ID is preset in the form ID reading unit 120 as described above.

以上で説明したように、第２の実施の形態の帳票読取装置１００によれば、文字認識処理を効率的に行える。具体的には、改訂後の帳票３００ａにつき、キーワード認識処理を行う領域を領域識別情報３４０に基づいて特定する。帳票読取装置１００は、該領域に対してキーワード認識処理を行い、それ以外の領域に対してはキーワード認識処理を行わない。すなわち、余計な領域に対してキーワード認識処理が行われるのを抑止でき、余分な処理時間がかからずに済む。 As described above, according to the form reading apparatus 100 of the second embodiment, character recognition processing can be performed efficiently. Specifically, the area for performing the keyword recognition process is specified based on the area identification information 340 for the revised form 300a. The form reading apparatus 100 performs keyword recognition processing on the area and does not perform keyword recognition processing on the other areas. That is, it is possible to prevent the keyword recognition process from being performed on an extra area, and an extra processing time is not required.

また、キーワード認識処理を行わない領域に対しては、既存のレイアウト定義テーブル１１１に基づくレイアウト認識処理により、文字認識できる。よって、レイアウト認識と併用する場合に特に有効である。具体的には、レイアウト認識処理は、既定位置の情報を読み取るため、キーワード認識処理に比べて、処理時間は一般的に短い。したがって、帳票読取装置１００のように、キーワード認識処理の対象領域を絞り込むことで、レイアウト認識による処理効率の良さを享受しながら、帳票のレイアウトを柔軟に変更可能となる。 In addition, a region that is not subjected to keyword recognition processing can be recognized by layout recognition processing based on the existing layout definition table 111. Therefore, it is particularly effective when used in combination with layout recognition. Specifically, since the layout recognition process reads information at a predetermined position, the processing time is generally shorter than the keyword recognition process. Therefore, by narrowing down the target area for keyword recognition processing as in the form reading device 100, the layout of the form can be flexibly changed while enjoying the good processing efficiency by layout recognition.

ここで、金融機関では、キャンペーン時などの短期間の間だけ一時的に、所定の帳票の一部箇所のレイアウトの変更を行いたい場合がある。具体的には、キャンペーンのお知らせ欄を広く確保して既存レイアウトの配置に影響を与える場合や、キャンペーン時のみ顧客へ記入を要求する記入欄を設ける場合などが考えられる。その場合、恒久的なレイアウト変更を行うケースに比べて、システム変更の作業負担や文字認識の処理効率に与える影響などをできるだけ抑えて、レイアウト変更に容易に対応できることが特に望まれる。帳票読取装置１００によれば、上述のように改訂の対象となった一部箇所に絞ってキーワード認識処理が行われる。これにより、レイアウト定義の変更作業や文字認識の処理効率に与える影響を抑えた文字認識処理を実現できる。 Here, there is a case where a financial institution wants to change the layout of a part of a predetermined form temporarily only for a short period such as a campaign. Specifically, there may be a case where a wide campaign notification column is secured to affect the layout of an existing layout, or a case where a column for requesting a customer to fill in is provided only during the campaign. In that case, it is particularly desirable that the layout change can be easily handled by suppressing the influence of the system change on the work load and the processing efficiency of character recognition as much as possible as compared to the case of performing a permanent layout change. According to the form reading apparatus 100, the keyword recognition process is performed only on a part of the revision target as described above. As a result, it is possible to realize a character recognition process that suppresses the influence on the layout definition changing work and the character recognition processing efficiency.

［第３の実施の形態］
以下、第３の実施の形態を説明する。前述の第２の実施の形態との相違点を主に説明し、同様の事項の説明を省略する。 [Third Embodiment]
Hereinafter, a third embodiment will be described. Differences from the second embodiment will be mainly described, and description of similar matters will be omitted.

第２の実施の形態では、領域識別情報が帳票上に複数存在してもよいことを説明した。この場合、複数の領域識別情報で示される複数の処理対象領域内に、同一のデータ項目名（キーワード）が含まれることがある。このとき、同一のデータ項目名であっても、同一の用途であるとは限らない。例えば、１つの帳票で複数の取引に対応した帳票が利用されることがある。例えば、ある口座からの現金の出金と該口座から他口座への振替との２つの取引に対応可能な帳票が考えられる。より具体的には、（１）第１の口座番号で示される第１の口座からの現金の出金、および（２）該第１の口座から第２の口座番号で示される第２の口座への振替である。 In the second embodiment, it has been described that a plurality of area identification information may exist on a form. In this case, the same data item name (keyword) may be included in a plurality of processing target areas indicated by a plurality of area identification information. At this time, even the same data item name does not necessarily have the same use. For example, a form corresponding to a plurality of transactions may be used in one form. For example, a form that can handle two transactions, that is, cash withdrawal from an account and transfer from the account to another account, can be considered. More specifically, (1) withdrawing cash from a first account indicated by a first account number, and (2) a second account indicated by a second account number from the first account. It is a transfer to.

その場合、第１の口座番号と第２の口座番号が該帳票に記入され得るが、これらをキーワード認識で文字認識すると、何れの口座番号が出金元口座番号で、何れの口座番号が振替先口座番号であるか区別するのが困難な場合がある。具体的には、帳票上に印字されているキーワードが何れも“口座番号”である場合である。この場合、キーワードとレイアウト定義情報内のデータ項目名とを単に照合したとしても、各“口座番号”の用途を区別するのが困難となる。 In that case, the first account number and the second account number can be entered in the form, but when these characters are recognized by keyword recognition, which account number is the withdrawal source account number and which account number is transferred It may be difficult to distinguish whether it is a previous account number. Specifically, this is a case where all the keywords printed on the form are “account numbers”. In this case, even if the keyword is simply compared with the data item name in the layout definition information, it is difficult to distinguish the usage of each “account number”.

そこで、第３の実施の形態では、複数の処理対象領域内に同一のデータ項目名が含まれていたとしても、各データの用途を区別して文字列データを取得する機能を提供する。
ここで、第３の実施の形態の情報処理システムの構成は、図２で説明した第２の実施の形態の情報処理システムの構成と同様である。また、第３の実施の形態の帳票読取装置のハードウェアおよび機能構成は、図３，６で説明した第２の実施の形態の帳票読取装置１００のハードウェアおよび機能構成と同様である。以下、第３の実施の形態の帳票読取装置も帳票読取装置１００と同一の符号・名称を用いて各構成を指し示すものとする。 Therefore, in the third embodiment, even if the same data item name is included in a plurality of processing target areas, a function of acquiring character string data by distinguishing the use of each data is provided.
Here, the configuration of the information processing system of the third embodiment is the same as the configuration of the information processing system of the second embodiment described in FIG. Further, the hardware and functional configuration of the form reading apparatus according to the third embodiment is the same as the hardware and functional configuration of the form reading apparatus 100 according to the second embodiment described with reference to FIGS. Hereinafter, the form reading apparatus according to the third embodiment also indicates each component using the same reference numeral and name as the form reading apparatus 100.

図１３は、第３の実施の形態の改訂後の帳票の例を示す図である。帳票４００は、領域識別情報で示されるキーワード認識処理の対象領域が複数存在する場合を例示している。帳票４００には、帳票ＩＤ４１０、記入欄群４２０、記入欄群４３０、領域識別情報４４１，４４２，４４３，４４４，４６０および記入欄群４５０が印字されている。 FIG. 13 is a diagram illustrating an example of a revised form according to the third embodiment. The form 400 exemplifies a case where there are a plurality of target areas for keyword recognition processing indicated by the area identification information. On the form 400, a form ID 410, an entry column group 420, an entry column group 430, area identification information 441, 442, 443, 444, 460 and an entry column group 450 are printed.

帳票ＩＤ４１０は、帳票の種別を識別するための識別情報である。帳票ＩＤ４１０として、“３００１０１”が印字されている。帳票ＩＤ４１０の上４ケタの数値“３００１”は、帳票４００が現金の出金および口座間の振替の取引を行うための帳票であることを示す。帳票ＩＤ４１０の下２桁の数値“０１”は、帳票４００が改訂後の版数“０１”の帳票であることを示す。 The form ID 410 is identification information for identifying the type of form. “300101” is printed as the form ID 410. A numerical value “3001” in the first four digits of the form ID 410 indicates that the form 400 is a form for performing cash withdrawal and transfer transactions between accounts. The last two digits “01” of the form ID 410 indicate that the form 400 is a revised version “01”.

記入欄群４２０，４３０，４５０は、顧客に入力させる複数の記入欄を印字した領域である。
記入欄群４２０は、記入欄４２１，４２２，４２３を含む。記入欄４２１は、申込日を記入させるための欄である。記入欄４２２は、口座名義人の氏名を記入させるための欄である。記入欄４２３は、届け印を押印させるための欄である。 The entry column groups 420, 430, and 450 are areas in which a plurality of entry columns to be input by the customer are printed.
The entry column group 420 includes entry columns 421, 422, and 423. The entry column 421 is a column for entering the application date. The entry column 422 is a column for entering the name of the account holder. The entry column 423 is a column for imprinting a delivery seal.

記入欄群４３０には、出金元口座の情報を記入するための記入欄が設けられている。記入欄群４３０は、記入欄４３１，４３２，４３３，４３４を含む。記入欄４３１は、店番を記入させるための欄である。記入欄４３２は、出金元口座の口座番号を記入させるための欄である。記入欄４３３は、出金元口座の預金科目を選択させるための欄である。記入欄４３４は、出金する金額を記入させるための欄である。 The entry column group 430 is provided with an entry column for entering information of the withdrawal source account. The entry field group 430 includes entry fields 431, 432, 433, and 434. The entry column 431 is a column for entering a store number. The entry column 432 is a column for entering the account number of the withdrawal source account. The entry column 433 is a column for selecting a deposit item of the withdrawal source account. The entry column 434 is a column for entering the amount to be withdrawn.

領域識別情報４４１，４４２，４４３，４４４は、記入欄群４３０を囲うカギ型の領域識別情報である。４つの領域識別情報４４１，４４２，４４３，４４４が１セットでキーワード認識のための１つの処理対象領域を示している。具体的には、領域識別情報４４１，４４２，４４３，４４４を頂点とした四角形で囲われる領域内が、該処理対象領域である。 The area identification information 441, 442, 443, 444 is key-type area identification information surrounding the entry field group 430. Four region identification information 441, 442, 443, and 444 indicate one processing target region for keyword recognition in one set. Specifically, an area surrounded by a rectangle having the area identification information 441, 442, 443, 444 as a vertex is the processing target area.

記入欄群４５０には、振替先口座の情報を記入するための記入欄が設けられている。記入欄群４５０は、記入欄４５１，４５２，４５３，４５４を含む。記入欄４５１は、振替先口座の預金科目を選択させるための欄である。記入欄４５２は、店番を記入させるための欄である。記入欄４５３は、振替先口座の口座番号を記入させるための欄である。記入欄４５４は、振替先口座の口座名義人を記入させるための欄である。 The entry field group 450 is provided with an entry field for entering information of the transfer destination account. The entry column group 450 includes entry columns 451, 452, 453, and 454. The entry column 451 is a column for selecting a deposit item of the transfer destination account. The entry column 452 is a column for entering a store number. The entry column 453 is a column for entering the account number of the transfer destination account. The entry column 454 is a column for entering the account holder of the transfer destination account.

領域識別情報４６０は、記入欄群４５０を囲う線であり、四角形の各辺を形成している。
記入欄群４２０，４３０，４５０の各欄の枠線および帳票４００上の一部箇所には、ドロップアウト色が付される。領域識別情報４４１，４４２，４４３，４４４，４６０には、ドロップアウト色以外の色が付される。 The area identification information 460 is a line surrounding the entry column group 450 and forms each side of a rectangle.
Dropout colors are assigned to the frame lines of the fields of the entry field groups 420, 430, and 450 and a part of the form 400. The region identification information 441, 442, 443, 444, 460 is assigned a color other than the dropout color.

帳票４００によれば、記入欄群４２０，４３０のみに記入することで、記入欄群４３０で指定した口座から現金を出金できる。更に、記入欄群４５０に振替先の口座を指定することで、記入欄群４３０で指定した口座から、記入欄群４５０で指定した口座への預金の振替を行える。 According to the form 400, cash can be withdrawn from the account designated in the entry field group 430 by filling in only the entry field groups 420 and 430. Furthermore, by designating the transfer destination account in the entry column group 450, the deposit can be transferred from the account designated in the entry column group 430 to the account designated in the entry column group 450.

このように、帳票４００には、出金元口座用の記入欄群４３０に対しては、カギ型の領域識別情報４４１，４４２，４４３，４４４が予め印字される。また、帳票４００には、振替先口座用の記入欄群４５０には、（カギ型とは異なる形状の）四角形の各辺を形成する線である領域識別情報４６０が予め印字される。 In this way, key-type area identification information 441, 442, 443, and 444 is preprinted on the form 400 for the entry column group 430 for the withdrawal source account. Further, in the form 400, the area identification information 460, which is a line forming each side of a quadrilateral (having a shape different from the key shape), is preprinted in the entry field group 450 for the transfer destination account.

次に、以上の構成の帳票読取装置１００の処理手順を説明する。なお、第３の実施の形態の帳票読取処理の手順は、図８で説明した第２の実施の形態の帳票読取処理の手順と同様である。続いて、該帳票読取処理のステップＳ１５で説明したキーワード認識処理の手順を説明する。以下、ＲＡＭ１０３上には次の情報が取得されているものとする。 Next, a processing procedure of the form reading apparatus 100 having the above configuration will be described. Note that the procedure of the form reading process of the third embodiment is the same as the procedure of the form reading process of the second embodiment described with reference to FIG. Next, the procedure of the keyword recognition process described in step S15 of the form reading process will be described. Hereinafter, it is assumed that the following information is acquired on the RAM 103.

（１）「帳票４００の帳票画像」
（２）「帳票４００の帳票画像」にドロップアウト処理を実行して得られた「ドロップアウト処理後の帳票画像」
何れの帳票画像においても、領域識別情報４４１，４４２，４４３，４４４，４６０を同一の符号で指し示すものとする。 (1) “Form image of form 400”
(2) “Form image after dropout processing” obtained by executing dropout processing on “form image of form 400”
In any form image, the region identification information 441, 442, 443, 444, 460 is indicated by the same reference numeral.

図１４は、第３の実施の形態のキーワード認識処理を示すフローチャートである。以下、図１４に示す処理をステップ番号に沿って説明する。
（ステップＳ４１）キーワード認識処理部１３２は、レイアウト認識処理部１３１によるレイアウト認識処理の結果をＲＡＭ１０３またはＨＤＤ１０４上の所定領域に退避させる。 FIG. 14 is a flowchart illustrating keyword recognition processing according to the third embodiment. In the following, the process illustrated in FIG. 14 will be described in order of step number.
(Step S41) The keyword recognition processing unit 132 saves the result of the layout recognition processing by the layout recognition processing unit 131 in a predetermined area on the RAM 103 or the HDD 104.

（ステップＳ４２）キーワード認識処理部１３２は、キーワード認識処理の対象領域の特定を領域特定部１４０に依頼する。領域特定部１４０は、ドロップアウト処理後の帳票画像から領域識別情報４４１，４４２，４４３，４４４，４６０を検出する。例えば、領域特定部１４０には、領域識別情報４４１，４４２，４４３，４４４，４６０に関する情報（形状の種類、線の太さ、実線・破線といった線の種別など）が予め設定される。領域特定部１４０は、該情報に基づき、ドロップアウト処理後の帳票画像から領域識別情報４４１，４４２，４４３，４４４，４６０を検出できる。領域特定部１４０は、特定した処理対象領域を示す領域情報をキーワード認識処理部１３２に出力する。領域情報は、例えば、処理対象領域の各頂点の座標値である。また、領域特定部１４０は、領域情報と共に、検出した領域識別情報の種類（矩形かカギ型かなど）をキーワード認識処理部１３２に出力する。 (Step S42) The keyword recognition processing unit 132 requests the region specifying unit 140 to specify the target region for the keyword recognition processing. The area specifying unit 140 detects area identification information 441, 442, 443, 444, and 460 from the form image after the dropout process. For example, information related to the region identification information 441, 442, 443, 444, 460 (type of shape, line thickness, line type such as solid line / broken line, etc.) is preset in the region specifying unit 140. Based on this information, the area specifying unit 140 can detect the area identification information 441, 442, 443, 444, and 460 from the form image after the dropout process. The area specifying unit 140 outputs area information indicating the specified processing target area to the keyword recognition processing unit 132. The area information is, for example, the coordinate value of each vertex of the processing target area. In addition, the area specifying unit 140 outputs the type of detected area identification information (whether it is rectangular or keyed) together with the area information to the keyword recognition processing unit 132.

（ステップＳ４３）キーワード認識処理部１３２は、領域特定部１４０から取得した領域情報に基づき、そのうちの１つの未処理領域のカラーイメージを帳票４００の帳票画像から抽出する。例えば、キーワード認識処理部１３２は、領域識別情報４４１，４４２，４４３，４４４で囲われる領域内のカラーイメージを抽出する。 (Step S 43) The keyword recognition processing unit 132 extracts a color image of one unprocessed area from the form image of the form 400 based on the area information acquired from the area specifying unit 140. For example, the keyword recognition processing unit 132 extracts a color image in an area surrounded by the area identification information 441, 442, 443, 444.

（ステップＳ４４）キーワード認識処理部１３２は、ステップＳ４３でカラーイメージを取得した処理対象領域からキーワードを１つ抽出する。例えば、キーワード認識処理部１３２は、処理対象領域に含まれる枠線を検出して、該枠線で囲われる枠ごとにキーワードの抽出を試みる。例えば、キーワード認識処理部１３２は、記入欄４３２に対して、キーワード“口座番号”を抽出する。 (Step S44) The keyword recognition processing unit 132 extracts one keyword from the processing target area from which the color image has been acquired in Step S43. For example, the keyword recognition processing unit 132 detects a frame line included in the processing target region, and tries to extract a keyword for each frame surrounded by the frame line. For example, the keyword recognition processing unit 132 extracts the keyword “account number” from the entry field 432.

（ステップＳ４５）キーワード認識処理部１３２は、帳票４００の帳票ＩＤおよび記憶部１１０に記憶されたレイアウト定義テーブル１１１を参照して、該キーワードが帳票４００に含まれ得るデータ項目名に一致するか否か判定する。一致する場合、処理をステップＳ４６に進める。一致しない場合、該キーワードを破棄して、処理をステップＳ５２に進める。例えば、キーワード“口座番号”は、帳票４００に含まれ得るデータ項目名に一致するものとする。 (Step S 45) The keyword recognition processing unit 132 refers to the form ID of the form 400 and the layout definition table 111 stored in the storage unit 110, and determines whether or not the keyword matches a data item name that can be included in the form 400. To determine. If they match, the process proceeds to step S46. If they do not match, the keyword is discarded and the process proceeds to step S52. For example, it is assumed that the keyword “account number” matches a data item name that can be included in the form 400.

（ステップＳ４６）キーワード認識処理部１３２は、該データ項目に対する記入欄の位置（データ部）を特定する。例えば、データ部の位置は、「キーワードが存在するセルの右側に隣接するセル」のようにキーワード認識処理部１３２に予め設定される。あるいは、キーワードや、帳票４００上の表構造に基づき、該キーワードに対応するデータ部を特定してもよい。 (Step S46) The keyword recognition processing unit 132 specifies the position (data portion) of the entry field for the data item. For example, the position of the data part is set in advance in the keyword recognition processing unit 132 as “a cell adjacent to the right side of the cell in which the keyword exists”. Alternatively, based on the keyword and the table structure on the form 400, the data portion corresponding to the keyword may be specified.

（ステップＳ４７）キーワード認識処理部１３２は、データ部の特徴を解析し、データ部のイメージに対して枠線やノイズなどを除去し、文字認識に適したイメージに加工する。 (Step S 47) The keyword recognition processing unit 132 analyzes the characteristics of the data portion, removes frame lines and noise from the image of the data portion, and processes the image into an image suitable for character recognition.

（ステップＳ４８）キーワード認識処理部１３２は、データ部に記入された文字画像を取得する。キーワード認識処理部１３２は、該文字画像と予め記憶部１１０に格納された文字パターンとの照合を行い、該データ部に記入された文字列データを取得する。例えば、キーワード認識処理部１３２は、キーワード“口座番号”に対して、文字列データ“９９９９９９９”を取得する。 (Step S48) The keyword recognition processing unit 132 acquires a character image written in the data portion. The keyword recognition processing unit 132 collates the character image with a character pattern stored in the storage unit 110 in advance, and acquires character string data entered in the data unit. For example, the keyword recognition processing unit 132 acquires character string data “9999999” for the keyword “account number”.

（ステップＳ４９）キーワード認識処理部１３２は、現在の処理対象領域に対応する領域識別情報の種類を判定する。該領域識別情報の形状の種類が、カギ型の場合、処理をステップＳ５０に進める。該領域識別情報の形状の種類が矩形の場合、処理をステップＳ５１に進める。例えば、領域識別情報４４１，４４２，４４３，４４４で囲われた領域であれば、領域識別情報の種類は、「カギ型」となる。 (Step S49) The keyword recognition processing unit 132 determines the type of region identification information corresponding to the current processing target region. If the shape type of the area identification information is a key type, the process proceeds to step S50. If the shape type of the area identification information is rectangular, the process proceeds to step S51. For example, if the region is surrounded by the region identification information 441, 442, 443, 444, the type of the region identification information is “key type”.

（ステップＳ５０）キーワード認識処理部１３２は、ステップＳ４８で取得した文字列データを出金元のデータとして取得する。これにより、例えば、領域識別情報４４１，４４２，４４３，４４４で囲われた処理対象領域から取得した、“口座番号”“９９９９９９９”を出金元口座の口座番号として扱える。そして、処理をステップＳ５２に進める。 (Step S50) The keyword recognition processing unit 132 acquires the character string data acquired in step S48 as the data of the withdrawal source. Thereby, for example, “account number” “9999999” acquired from the processing target area surrounded by the area identification information 441, 442, 443, 444 can be handled as the account number of the withdrawal source account. Then, the process proceeds to step S52.

（ステップＳ５１）キーワード認識処理部１３２は、ステップＳ４８で取得した文字列データを振替先のデータとして取得する。これにより、例えば、領域識別情報４６０で囲われた処理対象領域から取得した“口座番号”“８８８８８８８”を振替先口座の口座番号として扱える。そして、処理をステップＳ５２に進める。 (Step S51) The keyword recognition processing unit 132 acquires the character string data acquired in step S48 as transfer destination data. Thereby, for example, “account number” “88888888” acquired from the processing target area surrounded by the area identification information 460 can be handled as the account number of the transfer destination account. Then, the process proceeds to step S52.

（ステップＳ５２）キーワード認識処理部１３２は、現在の処理対象領域内でキーワードを未抽出の箇所が存在するか否かを判定する。キーワード未抽出の箇所が存在する場合、処理をステップＳ４４に進める。キーワード未抽出の箇所が存在しない場合、処理をステップＳ５３に進める。 (Step S 52) The keyword recognition processing unit 132 determines whether or not there is a portion where no keyword has been extracted in the current processing target area. If there is a keyword-unextracted portion, the process proceeds to step S44. If there is no keyword-unextracted portion, the process proceeds to step S53.

（ステップＳ５３）キーワード認識処理部１３２は、現在の処理対象領域以外にも、未処理の処理対象領域が存在するか否かを判定する。未処理の処理対象領域が存在する場合、処理をステップＳ４３に進める。全ての処理対象領域につき処理済の場合、処理をステップＳ５４に進める。 (Step S53) The keyword recognition processing unit 132 determines whether there is an unprocessed processing target area other than the current processing target area. If there is an unprocessed processing target area, the process proceeds to step S43. If all the process target areas have been processed, the process proceeds to step S54.

（ステップＳ５４）キーワード認識処理部１３２は、一時退避させたレイアウト認識処理の結果とキーワード認識処理の結果とをマージする。具体的には、レイアウト定義テーブル１１１に定義された帳票４００のデータ項目のうち、レイアウト認識処理では取得できなかったものを、キーワード認識処理で取得した内容で補完する。 (Step S54) The keyword recognition processing unit 132 merges the temporarily recognized layout recognition result and the keyword recognition processing result. Specifically, among the data items of the form 400 defined in the layout definition table 111, those that could not be acquired by the layout recognition process are complemented with the contents acquired by the keyword recognition process.

このように、キーワード認識処理部１３２は、領域識別情報の種類に応じて、認識した文字列データを区別する。何れの領域識別情報が、何れの取引に対応するかは、キーワード認識処理部１３２に予め設定される。ステップＳ４９〜Ｓ５１で例示した領域識別情報の種類と取引（データの用途）との対応付けは、一例であり、他にも種々の対応付けが考えられる。例えば、領域識別情報を区別する方法としては、次の内容が考えられる。 In this manner, the keyword recognition processing unit 132 distinguishes recognized character string data according to the type of region identification information. Which area identification information corresponds to which transaction is set in the keyword recognition processing unit 132 in advance. The association between the type of region identification information exemplified in steps S49 to S51 and the transaction (use of data) is an example, and various other associations are conceivable. For example, the following contents can be considered as a method for distinguishing the region identification information.

（Ａ１）領域識別情報の線の種別（点線・破線・一点鎖線など）。
（Ａ２）領域識別情報の形状（矩形・楕円形など）。
（Ａ３）領域識別情報の線の太さ（太い・細いなど）。 (A1) Line type of area identification information (dotted line, broken line, one-dot chain line, etc.).
(A2) The shape of the area identification information (rectangle, ellipse, etc.).
(A3) Line thickness of area identification information (thick, thin, etc.).

（Ａ４）領域識別情報の色（濃い・薄いなど）。
（Ａ５）処理対象領域の頂点に配置させる領域識別情報の形状（カギ型など）。
上記（Ａ１）〜（Ａ５）の何れか１つに基づいて領域識別情報を区別してもよいし、（Ａ１）〜（Ａ５）を組み合わせて区別してもよい。更に、区別した各領域識別情報を以下に示す何れかの用途に対応付けることが考えられる。 (A4) Color of area identification information (dark, light, etc.).
(A5) The shape of the region identification information to be placed at the vertex of the processing target region (such as a key shape).
The area identification information may be distinguished based on any one of the above (A1) to (A5), or may be distinguished by combining (A1) to (A5). Further, it is conceivable that each identified area identification information is associated with one of the following uses.

（Ｂ１）現金や振替による入金をする場合の入金先口座に関する情報。
（Ｂ２）現金や振替による出金をする場合の出金元口座に関する情報。
キーワード認識処理部１３２は、領域識別情報を区別することで、文字列データが何れの取引に用いられるものであるか（すなわち、文字列データの用途）を区別できる。上記用途以外の用途に対応付けてもよい。 (B1) Information related to the deposit account when depositing by cash or transfer.
(B2) Information on the withdrawal source account when withdrawing money or transferring money.
The keyword recognition processing unit 132 can distinguish which transaction the character string data is used for (ie, use of the character string data) by distinguishing the region identification information. You may match with uses other than the said use.

このようにして、第３の実施の形態の帳票読取装置１００は、複数の取引に対応可能な帳票において、複数の処理対象領域内に同一のデータ項目名が含まれても、各データ項目の用途を区別して文字列データを取得できる。 In this way, the form reading apparatus 100 according to the third embodiment is capable of handling each data item even if the same data item name is included in a plurality of processing target areas in a form that can handle a plurality of transactions. Character string data can be acquired by distinguishing usage.

これにより、レイアウト変更のあった帳票に対する文字認識処理を一層効率的に行うことが可能となる。
［第４の実施の形態］
以下、第４の実施の形態を説明する。前述の第２，第３の実施の形態との相違点を主に説明し、同様の事項の説明を省略する。 This makes it possible to more efficiently perform character recognition processing for a form whose layout has been changed.
[Fourth Embodiment]
Hereinafter, a fourth embodiment will be described. Differences from the second and third embodiments will be mainly described, and description of similar matters will be omitted.

第２，第３の実施の形態では、レイアウト認識処理を行った後に、キーワード認識処理の要否を判定するものとした。一方、キーワード認識処理の対象となる領域に対しては、レイアウト認識処理を行わなくてもよい。このようにすれば、処理効率の一層の効率化を図れる。そこで、第４の実施の形態では、キーワード認識処理の対象となる領域に対して、レイアウト認識処理を抑止する機能を提供する。 In the second and third embodiments, the necessity of the keyword recognition process is determined after the layout recognition process. On the other hand, the layout recognition process does not have to be performed on the area that is the target of the keyword recognition process. In this way, the processing efficiency can be further improved. Therefore, in the fourth embodiment, a function of suppressing the layout recognition process is provided for the area that is the target of the keyword recognition process.

ここで、第４の実施の形態の情報処理システムの構成は、図２で説明した第２の実施の形態の情報処理システムの構成と同様である。また、第４の実施の形態の帳票読取装置のハードウェアおよび機能構成は、図３，６で説明した第２の実施の形態の帳票読取装置１００のハードウェアおよび機能構成と同様である。以下、第４の実施の形態の帳票読取装置も帳票読取装置１００と同一の符号・名称を用いて各構成を指し示すものとする。 Here, the configuration of the information processing system of the fourth embodiment is the same as the configuration of the information processing system of the second embodiment described in FIG. The hardware and functional configuration of the form reading apparatus according to the fourth embodiment is the same as the hardware and functional configuration of the form reading apparatus 100 according to the second embodiment described with reference to FIGS. Hereinafter, the form reading apparatus according to the fourth embodiment also indicates each component using the same reference numerals and names as the form reading apparatus 100.

図１５は、第４の実施の形態の改訂前の帳票の例を示す図である。帳票５００は、改訂前の（既存の）帳票を例示している。帳票５００には、帳票ＩＤ５１０および記入欄群５２０，５３０が印字されている。 FIG. 15 is a diagram illustrating an example of a form before revision according to the fourth embodiment. A form 500 illustrates an (existing) form before revision. A form ID 510 and entry column groups 520 and 530 are printed on the form 500.

帳票ＩＤ５１０は、帳票の種別を識別するための識別情報である。帳票ＩＤ５１０として、“１００１００”が印字されている。帳票ＩＤ５１０の上４桁の数値“１００１”は、帳票５００が口座に対して現金を入金する取引を行うための帳票であることを示す。帳票ＩＤ５１０の下２桁の数値“００”は、帳票５００が改訂されたものではないことを示す。 The form ID 510 is identification information for identifying the type of form. “100100” is printed as the form ID 510. The first four-digit numerical value “1001” of the form ID 510 indicates that the form 500 is a form for performing a transaction for depositing cash into the account. The last two digits “00” of the form ID 510 indicate that the form 500 has not been revised.

記入欄群５２０は、顧客に記入させる複数の記入欄を印字した領域である。記入欄群５２０は、記入欄５２１，５２２，５２３，５２４，５２５，５２６を含む。記入欄５２１は、申込日を記入させるための欄である。記入欄５２２は、口座名義人の氏名を記入させるための欄である。記入欄５２３は、店番を記入させるための欄である。記入欄５２４は、口座番号を記入させるための欄である。記入欄５２５は、入金先口座の預金科目を選択させるための欄である。記入欄５２６は、入金する金額を記入させるための欄である。 The entry column group 520 is an area in which a plurality of entry columns to be entered by the customer are printed. The entry column group 520 includes entry columns 521, 522, 523, 524, 525, 526. The entry column 521 is a column for entering the application date. The entry column 522 is a column for entering the name of the account holder. The entry column 523 is a column for entering a store number. The entry column 524 is a column for entering an account number. The entry column 525 is a column for selecting a deposit item of the deposit destination account. The entry column 526 is a column for entering the amount to be deposited.

記入欄群５３０は、金融機関の職員が業務に用いる情報を記入する欄である。記入欄群５３０は、レイアウト認識処理の非対象領域である。
記入欄群５２０，５３０の各欄の枠線および帳票５００上の一部箇所にはドロップアウト色が付される。 The entry column group 530 is a column for entering information used by the financial institution staff for business. The entry column group 530 is a non-target area for layout recognition processing.
Drop-out colors are assigned to the frame lines of each column of the entry column groups 520 and 530 and a part of the form 500.

図１６は、第４の実施の形態の改訂後の帳票の例を示す図である。帳票５００ａは、帳票５００に対する改訂後の帳票を例示している。帳票５００ａには、帳票ＩＤ５１０ａ、記入欄群５２０ａ，５３０ａ、通知欄５４０および領域識別情報５５０が印字されている。 FIG. 16 is a diagram illustrating an example of a revised form according to the fourth embodiment. The form 500a illustrates a revised form for the form 500. A form ID 510a, entry field groups 520a and 530a, a notification field 540, and area identification information 550 are printed on the form 500a.

帳票ＩＤ５１０ａは、帳票の種別を識別するための識別情報である。帳票ＩＤ５１０ａとして、“１００１０１”が印字されている。帳票ＩＤ５１０ａの上４桁の数値“１００１”は、帳票５００ａが口座に対して現金を入金する取引を行うための帳票であることを示す。帳票ＩＤ５１０ａの下２桁の数値“０１”は、帳票５００ａが改訂後の版数“０１”であることを示す。 The form ID 510a is identification information for identifying the type of form. “100101” is printed as the form ID 510a. The first four-digit numerical value “1001” of the form ID 510a indicates that the form 500a is a form for performing a transaction for depositing cash into the account. The last two digits “01” of the form ID 510a indicate that the form 500a is the revised version number “01”.

記入欄群５２０ａは、顧客に入力させる複数の記入欄を印字した領域である。記入欄群５２０ａは、記入欄５２１ａ，５２２ａ，５２３ａ，５２４ａ，５２５ａ，５２６ａを含む。記入欄５２１ａは、申込日を記入させるための欄である。記入欄５２２ａは、口座名義人の氏名を記入させるための欄である。記入欄５２３ａは、店番を記入させるための欄である。記入欄５２４ａは、入金先口座の口座番号を記入させるための欄である。記入欄５２５ａは、入金先口座の預金科目を選択させるための欄である。記入欄５２６ａは、入金する金額を記入させるための欄である。 The entry column group 520a is an area in which a plurality of entry columns to be input by the customer are printed. The entry column group 520a includes entry columns 521a, 522a, 523a, 524a, 525a, and 526a. The entry column 521a is a column for entering the application date. The entry column 522a is a column for entering the name of the account holder. The entry column 523a is a column for entering a store number. The entry column 524a is a column for entering the account number of the deposit destination account. The entry column 525a is a column for selecting a deposit item of the deposit destination account. The entry column 526a is a column for entering the amount to be deposited.

記入欄群５３０ａは、金融機関の職員が業務に用いる情報を記入する欄である。
通知欄５４０は、顧客に通知したいメッセージが印字される領域である。
記入欄群５３０ａおよび通知欄５４０は、レイアウト認識処理の非対象領域である。 The entry column group 530a is a column for entering information used by the financial institution staff for business.
The notification column 540 is an area where a message to be notified to the customer is printed.
The entry column group 530a and the notification column 540 are non-target areas for layout recognition processing.

記入欄群５２０ａ，５３０ａ，通知欄５４０の各欄の枠線および帳票５００ａ上の一部箇所には、ドロップアウト色が付される。領域識別情報５５０には、ドロップアウト色以外の色が付される。 Dropout colors are given to the frame lines of the fields of the entry field groups 520a and 530a and the notification field 540 and to some portions on the form 500a. The region identification information 550 is assigned a color other than the dropout color.

次に、以上の構成の帳票読取装置１００の処理手順を説明する。
図１７は、第４の実施の形態の帳票読取処理を示すフローチャートである。以下、図１７に示す処理をステップ番号に沿って説明する。 Next, a processing procedure of the form reading apparatus 100 having the above configuration will be described.
FIG. 17 is a flowchart illustrating a form reading process according to the fourth embodiment. In the following, the process illustrated in FIG. 17 will be described in order of step number.

（ステップＳ６１）帳票ＩＤ読取部１２０は、イメージスキャナ１４から受信した帳票画像をＲＡＭ１０３上の所定領域に格納する。帳票ＩＤ読取部１２０は、該帳票画像に対してドロップアウト処理を行い、ドロップアウト処理後の帳票画像をＲＡＭ１０３上の他の所定領域に格納する。 (Step S 61) The form ID reading unit 120 stores the form image received from the image scanner 14 in a predetermined area on the RAM 103. The form ID reading unit 120 performs a dropout process on the form image, and stores the form image after the dropout process in another predetermined area on the RAM 103.

（ステップＳ６２）帳票ＩＤ読取部１２０は、ドロップアウト処理後の帳票画像から帳票ＩＤを認識する。帳票ＩＤ読取部１２０は、読み取った帳票ＩＤを文字認識処理部１３０に出力する。 (Step S62) The form ID reading unit 120 recognizes the form ID from the form image after the dropout process. The form ID reading unit 120 outputs the read form ID to the character recognition processing unit 130.

（ステップＳ６３）文字認識処理部１３０は、帳票ＩＤ読取部１２０から取得した帳票ＩＤの下２桁が“０１”以上であるか否かを判定する。“０１”以上である場合（すなわち、改訂された帳票である場合）、処理をステップＳ６５に進める。“０１”以上でない場合（すなわち、改訂された帳票でない場合）、処理をステップＳ６４に進める。 (Step S63) The character recognition processing unit 130 determines whether or not the last two digits of the form ID acquired from the form ID reading unit 120 is “01” or more. If it is “01” or more (that is, a revised form), the process proceeds to step S65. If it is not “01” or more (that is, not a revised form), the process proceeds to step S64.

（ステップＳ６４）文字認識処理部１３０は、レイアウト認識処理部１３１に処理を委譲し、レイアウト認識処理を実行させる。レイアウト認識処理部１３１は、帳票ＩＤ読取部１２０から取得した帳票ＩＤに基づいて、記憶部１１０に記憶されたレイアウト定義テーブル１１１を参照し、ドロップアウト処理後の帳票画像に対してレイアウト認識による文字認識処理を行う。 (Step S 64) The character recognition processing unit 130 delegates the processing to the layout recognition processing unit 131 and causes the layout recognition processing to be executed. The layout recognition processing unit 131 refers to the layout definition table 111 stored in the storage unit 110 on the basis of the form ID acquired from the form ID reading unit 120, and performs character recognition by layout recognition on the form image after the dropout process. Perform recognition processing.

（ステップＳ６５）文字認識処理部１３０は、キーワード認識処理部１３２に処理を委譲し、キーワード認識処理を実行させる。該キーワード認識処理については後述する。
（ステップＳ６６）文字認識処理部１３０は、帳票画像に対する文字認識結果を出力し、モニタ１１に該結果を表示させる。 (Step S 65) The character recognition processing unit 130 delegates the processing to the keyword recognition processing unit 132 and causes the keyword recognition processing to be executed. The keyword recognition process will be described later.
(Step S66) The character recognition processing unit 130 outputs a character recognition result for the form image and causes the monitor 11 to display the result.

このように、文字認識処理部１３０は、レイアウト認識を行う前に帳票の改訂の有無を判断する。そして、ステップＳ６５におけるキーワード認識処理を次のように行う。
なお、以下、ＲＡＭ１０３上には次の情報が取得されているものとする。 As described above, the character recognition processing unit 130 determines whether the form is revised before performing layout recognition. Then, the keyword recognition process in step S65 is performed as follows.
Hereinafter, it is assumed that the following information is acquired on the RAM 103.

（１）「帳票５００ａの帳票画像」
（２）「帳票５００ａの帳票画像」にドロップアウト処理を実行して得られた「ドロップアウト処理後の帳票画像」
何れの帳票画像においても、領域識別情報５５０を同一の符号で指し示すものとする。 (1) “Form image of form 500a”
(2) “Form image after dropout processing” obtained by executing dropout processing on “form image of form 500a”
In any form image, the region identification information 550 is indicated by the same symbol.

図１８は、第４の実施の形態のキーワード認識処理を示すフローチャートである。以下、図１８に示す処理をステップ番号に沿って説明する。
（ステップＳ７１）キーワード認識処理部１３２は、キーワード認識処理の対象領域の特定を領域特定部１４０に依頼する。領域特定部１４０は、ドロップアウト処理後の帳票画像から領域識別情報５５０を検出する。例えば、領域特定部１４０には、領域識別情報５５０に関する情報（形状の種類、線の太さ、実線・破線といった線の種別など）が予め設定される。領域特定部１４０は、該情報に基づき、ドロップアウト処理後の帳票画像から領域識別情報５５０を検出できる。領域特定部１４０は、特定した処理対象領域を示す領域情報をキーワード認識処理部１３２に出力する。領域情報は、例えば、処理対象領域の各頂点の座標値である。 FIG. 18 is a flowchart illustrating keyword recognition processing according to the fourth embodiment. In the following, the process illustrated in FIG. 18 will be described in order of step number.
(Step S71) The keyword recognition processing unit 132 requests the region specifying unit 140 to specify the target region for the keyword recognition processing. The area specifying unit 140 detects the area identification information 550 from the form image after the dropout process. For example, information related to the region identification information 550 (shape type, line thickness, line type such as solid line / broken line, etc.) is preset in the region specifying unit 140. Based on the information, the area specifying unit 140 can detect the area identification information 550 from the form image after the dropout process. The area specifying unit 140 outputs area information indicating the specified processing target area to the keyword recognition processing unit 132. The area information is, for example, the coordinate value of each vertex of the processing target area.

（ステップＳ７２）キーワード認識処理部１３２は、領域特定部１４０から取得した領域情報と、記憶部１１０に記憶されたレイアウト定義テーブル１１１と、を参照して、レイアウト認識の対象領域が存在するか否かを判定する。存在する場合、処理をステップＳ７３に進める。存在しない場合、処理をステップＳ７５に進める。ここで、レイアウト認識の対象領域が存在するか否かは、例えば次のようにして判定できる。まず、キーワード認識処理部１３２は、領域情報に基づいて、キーワード認識の対象となっている領域を特定する。次に、キーワード認識処理部１３２は、レイアウト定義テーブル１１１に基づいて、帳票５００のレイアウト認識の対象領域を特定する。そして、キーワード認識処理部１３２は、キーワード認識の対象領域とレイアウト認識の対象領域とを重ね合わせて、レイアウト認識の対象領域がキーワード認識の対象領域からはみ出すか否かを判断する。はみ出せば、レイアウト認識の対象領域が存在すると判断できる。はみ出さなければ、レイアウト認識の対象領域がキーワード認識の対象領域に包含されるので、レイアウト認識の対象領域が存在しないと判断できる。 (Step S 72) The keyword recognition processing unit 132 refers to the region information acquired from the region specifying unit 140 and the layout definition table 111 stored in the storage unit 110 to determine whether or not there is a layout recognition target region. Determine whether. If it exists, the process proceeds to step S73. If not, the process proceeds to step S75. Here, whether or not a layout recognition target area exists can be determined as follows, for example. First, the keyword recognition processing unit 132 identifies a region that is a keyword recognition target based on the region information. Next, the keyword recognition processing unit 132 specifies a layout recognition target area of the form 500 based on the layout definition table 111. Then, the keyword recognition processing unit 132 superimposes the keyword recognition target area and the layout recognition target area, and determines whether or not the layout recognition target area protrudes from the keyword recognition target area. If it protrudes, it can be determined that there is a target area for layout recognition. If it does not protrude, the target area for layout recognition is included in the target area for keyword recognition, so that it can be determined that there is no target area for layout recognition.

（ステップＳ７３）キーワード認識処理部１３２は、はみ出した領域を示す情報（例えば、該領域の頂点の座標値）をレイアウト認識処理部１３１に出力し、レイアウト認識処理を実行させる。レイアウト認識処理部１３１は、レイアウト定義テーブル１１１を参照して、該領域に含まれる記入欄につきレイアウト認識処理を実行し、キーワード認識処理部１３２にその結果を出力する。 (Step S73) The keyword recognition processing unit 132 outputs information indicating the protruding region (for example, the coordinate value of the vertex of the region) to the layout recognition processing unit 131, and causes the layout recognition processing to be executed. The layout recognition processing unit 131 refers to the layout definition table 111, executes layout recognition processing for the entry fields included in the area, and outputs the result to the keyword recognition processing unit 132.

（ステップＳ７４）キーワード認識処理部１３２は、レイアウト認識処理部１３１によるレイアウト認識の処理結果をＲＡＭ１０３またはＨＤＤ１０４上の所定領域に退避させる。 (Step S 74) The keyword recognition processing unit 132 saves the layout recognition processing result by the layout recognition processing unit 131 in a predetermined area on the RAM 103 or the HDD 104.

（ステップＳ７５）キーワード認識処理部１３２は、領域特定部１４０から取得した領域情報に基づき、そのうちの１つの未処理領域のカラーイメージを帳票５００ａの帳票画像から抽出する。 (Step S75) The keyword recognition processing unit 132 extracts a color image of one unprocessed area from the form image of the form 500a based on the area information acquired from the area specifying unit 140.

（ステップＳ７６）キーワード認識処理部１３２は、ステップＳ７５でカラーイメージを取得した処理対象領域からキーワードを１つ抽出する。例えば、キーワード認識処理部１３２は、処理対象領域に含まれる枠線を検出して、該枠線で囲われる枠ごとにキーワードの抽出を試みる。 (Step S76) The keyword recognition processing unit 132 extracts one keyword from the processing target area from which the color image has been acquired in Step S75. For example, the keyword recognition processing unit 132 detects a frame line included in the processing target region, and tries to extract a keyword for each frame surrounded by the frame line.

（ステップＳ７７）キーワード認識処理部１３２は、帳票５００ａの帳票ＩＤおよびレイアウト定義テーブル１１１を参照して、該キーワードが帳票５００ａに含まれ得るデータ項目名に一致するか否か判定する。一致する場合、処理をステップＳ７８に進める。一致しない場合、処理をステップＳ８１に進める。 (Step S77) The keyword recognition processing unit 132 refers to the form ID of the form 500a and the layout definition table 111, and determines whether or not the keyword matches a data item name that can be included in the form 500a. If they match, the process proceeds to step S78. If not, the process proceeds to step S81.

（ステップＳ７８）キーワード認識処理部１３２は、該データ項目に対する記入欄の位置（データ部）を特定する。例えば、データ部の位置は、「キーワードが存在するセルの右側に隣接するセル」のようにキーワード認識処理部１３２に予め設定される。あるいは、キーワードや、帳票５００ａ上の表構造に基づき、該キーワードに対応するデータ部を特定してもよい。 (Step S78) The keyword recognition processing unit 132 specifies the position (data portion) of the entry field for the data item. For example, the position of the data part is set in advance in the keyword recognition processing unit 132 as “a cell adjacent to the right side of the cell in which the keyword exists”. Alternatively, based on the keyword and the table structure on the form 500a, the data portion corresponding to the keyword may be specified.

（ステップＳ７９）キーワード認識処理部１３２は、データ部の特徴を解析し、データ部のイメージに対して枠線やノイズなどを除去し、文字認識に適したイメージに加工する。 (Step S79) The keyword recognition processing unit 132 analyzes the characteristics of the data portion, removes frame lines and noise from the image of the data portion, and processes the image into an image suitable for character recognition.

（ステップＳ８０）キーワード認識処理部１３２は、データ部に記入された文字画像を取得する。キーワード認識処理部１３２は、該文字画像と予め記憶部１１０に格納された文字パターンとの照合を行い、該データ部に記入された文字列データを取得する。 (Step S80) The keyword recognition processing unit 132 acquires a character image entered in the data portion. The keyword recognition processing unit 132 collates the character image with a character pattern stored in the storage unit 110 in advance, and acquires character string data entered in the data unit.

（ステップＳ８１）キーワード認識処理部１３２は、現在の処理対象領域内でキーワードを未抽出の箇所が存在するか否かを判定する。キーワード未抽出の箇所が存在する場合、処理をステップＳ７６に進める。キーワード未抽出の箇所が存在しない場合、処理をステップＳ８２に進める。 (Step S 81) The keyword recognition processing unit 132 determines whether or not there is a portion where no keyword has been extracted in the current processing target area. If there is a keyword-unextracted portion, the process proceeds to step S76. If there is no keyword-unextracted portion, the process proceeds to step S82.

（ステップＳ８２）キーワード認識処理部１３２は、現在の処理対象領域以外にも、未処理の処理対象領域が存在するか否かを判定する。未処理の処理対象領域が存在する場合、処理をステップＳ７５に進める。全ての処理対象領域につき処理済の場合、処理をステップＳ８３に進める。 (Step S82) The keyword recognition processing unit 132 determines whether there is an unprocessed processing target area other than the current processing target area. If there is an unprocessed processing target area, the process proceeds to step S75. If processing has been completed for all processing target areas, the process proceeds to step S83.

（ステップＳ８３）キーワード認識処理部１３２は、一時退避させたレイアウト認識処理の結果とキーワード認識処理の結果とをマージする。具体的には、レイアウト定義テーブル１１１に定義された帳票５００ａのデータ項目のうち、レイアウト認識処理では取得できなかったものを、キーワード認識処理で取得した内容で補完する。 (Step S83) The keyword recognition processing unit 132 merges the temporarily saved layout recognition result and the keyword recognition result. Specifically, among the data items of the form 500a defined in the layout definition table 111, those that could not be acquired by the layout recognition process are complemented with the contents acquired by the keyword recognition process.

このように、レイアウト認識処理部１３１は、キーワード認識処理の対象外の領域に対して、レイアウト認識による文字認識を実行する。
図１９は、第４の実施の形態の各文字認識の対象領域の第１の例を示す図である。図１９（Ａ）は、改訂前の帳票５００を示している。図１９（Ｂ）は、改訂後の帳票５００ａを示している。 In this way, the layout recognition processing unit 131 performs character recognition by layout recognition on an area that is not subject to keyword recognition processing.
FIG. 19 is a diagram illustrating a first example of target areas for character recognition according to the fourth embodiment. FIG. 19A shows a form 500 before revision. FIG. 19B shows a revised form 500a.

帳票５００には、レイアウト認識対象領域５６０が含まれる。レイアウト認識対象領域５６０は、記入欄群５２０が占める領域と同一である。
帳票５００ａには、キーワード認識対象領域５５０ａが含まれる。キーワード認識対象領域５５０ａは、領域識別情報５５０で囲われる領域と同一である。ここで、帳票５００ａには、帳票５００におけるレイアウト認識対象領域５６０の枠線も図示されている。帳票５００ａでは、キーワード認識対象領域５５０ａにレイアウト認識対象領域５６０が包含されている。したがって、帳票読取装置１００は、帳票５００ａの帳票画像に対してレイアウト認識処理を実行しない。帳票読取装置１００は、帳票５００ａの帳票画像のうち、キーワード認識対象領域５５０ａに対してキーワード認識処理を実行する。 The form 500 includes a layout recognition target area 560. The layout recognition target area 560 is the same as the area occupied by the entry column group 520.
The form 500a includes a keyword recognition target area 550a. The keyword recognition target area 550a is the same as the area surrounded by the area identification information 550. Here, the form 500 a also shows a frame line of the layout recognition target area 560 in the form 500. In the form 500a, the layout recognition target area 560 is included in the keyword recognition target area 550a. Therefore, the form reading apparatus 100 does not execute the layout recognition process on the form image of the form 500a. The form reading apparatus 100 executes keyword recognition processing on the keyword recognition target area 550a in the form image of the form 500a.

図２０は、第４の実施の形態の各文字認識の対象領域の第２の例を示す図である。図２０（Ａ）は、改訂前の帳票６００を示している。図２０（Ｂ）は、帳票６００に対する改訂後の帳票６００ａを示している。 FIG. 20 is a diagram illustrating a second example of target areas for character recognition according to the fourth embodiment. FIG. 20A shows a form 600 before revision. FIG. 20B shows a revised form 600 a for the form 600.

帳票６００には、レイアウト認識対象領域６１０が含まれる。
帳票６００ａには、レイアウト認識対象領域６１０ａおよびキーワード認識対象領域６２０が含まれる。ここで、レイアウト認識対象領域６１０ａは、帳票６００，６００ａを重ねたときに、レイアウト認識対象領域６１０から、キーワード認識対象領域６２０と重なる領域を除いた領域に等しい。帳票読取装置１００は、レイアウト認識対象領域６１０のうち、キーワード認識対象領域６２０からはみ出るレイアウト認識対象領域６１０ａに対してレイアウト認識を実行する。更に、帳票読取装置１００は、キーワード認識対象領域６２０に対してキーワード認識を実行する。 The form 600 includes a layout recognition target area 610.
The form 600a includes a layout recognition target area 610a and a keyword recognition target area 620. Here, the layout recognition target area 610a is equal to the area obtained by excluding the area overlapping the keyword recognition target area 620 from the layout recognition target area 610 when the forms 600 and 600a are overlapped. The form reading apparatus 100 performs layout recognition on a layout recognition target area 610 a that protrudes from the keyword recognition target area 620 in the layout recognition target area 610. Further, the form reading apparatus 100 executes keyword recognition for the keyword recognition target area 620.

このように、第４の実施の形態の帳票読取装置１００によれば、キーワード認識の処理対象領域と被らない領域に対してレイアウト認識を行う。これにより、文字認識を一層効率的に行える。 As described above, according to the form reading apparatus 100 of the fourth embodiment, layout recognition is performed on the keyword recognition processing target area and the non-covered area. Thereby, character recognition can be performed more efficiently.

なお、改訂後にキーワード認識対象領域が複数存在する場合も考えられる。この場合、該複数のキーワード認識対象領域と、改訂前のレイアウト認識対象領域と、の重複の有無を判断する。そして、改訂後の帳票につき、重複しない領域に対してレイアウト認識処理を行えばよい。 Note that there may be a case where a plurality of keyword recognition target areas exist after the revision. In this case, it is determined whether or not there is an overlap between the plurality of keyword recognition target areas and the layout recognition target area before revision. Then, the layout recognition process may be performed on the non-overlapping areas for the revised form.

［第５の実施の形態］
以下、第５の実施の形態を説明する。前述の第２〜第４の実施の形態との相違点を主に説明し、同様の事項の説明を省略する。 [Fifth Embodiment]
Hereinafter, a fifth embodiment will be described. Differences from the second to fourth embodiments will be mainly described, and description of similar matters will be omitted.

第２〜第４の実施の形態では、キーワード認識処理の際、検出したキーワードに対するデータ部の位置を、所定位置と特定したり、帳票の表構造などに基づいて特定したりすることを説明した。 In the second to fourth embodiments, in the keyword recognition process, the position of the data part with respect to the detected keyword is specified as a predetermined position, or specified based on the table structure of the form, etc. .

一方、該データ部を任意に配置して、より自由度の高いレイアウト変更に対応できることが望ましい。例えば、キーワードの存在するセルの下側のセルをデータ部に対応させる記入欄と、キーワードの存在するセルの右下側のセルをデータ部に対応させる記入欄と、を混在させたい場合が考えられる。また、このようにキーワードの存在するセルに対する任意の位置にデータ部を配置したときに、データ部を容易に特定できることが望ましい。例えば、帳票の表構造などを解析する場合には、該解析の処理による負荷が大きくなることもあるからである。 On the other hand, it is desirable that the data part can be arbitrarily arranged to cope with layout change with a higher degree of freedom. For example, you may want to mix an entry field that associates the lower cell of the cell in which the keyword exists with the data part and an entry field that associates the lower right cell of the cell in which the keyword exists with the data part. It is done. Further, it is desirable that the data part can be easily specified when the data part is arranged at an arbitrary position with respect to the cell in which the keyword exists. This is because, for example, when analyzing the table structure of a form, the load due to the analysis process may increase.

そこで、第５の実施の形態では、データ部を容易に特定可能としながら、より自由度の高いレイアウト変更に対応可能とするための機能を提供する。
ここで、第５の実施の形態の情報処理システムの構成は、図２で説明した第２の実施の形態の情報処理システムの構成と同様である。また、第５の実施の形態の帳票読取装置のハードウェアおよび機能構成は、図３，６で説明した第２の実施の形態の帳票読取装置１００のハードウェアおよび機能構成と同様である。以下、第５の実施の形態の帳票読取装置も帳票読取装置１００と同一の符号・名称を用いて各構成を指し示すものとする。 Therefore, in the fifth embodiment, a function is provided for making it possible to cope with a layout change with a higher degree of freedom while making it possible to easily specify the data portion.
Here, the configuration of the information processing system of the fifth embodiment is the same as the configuration of the information processing system of the second embodiment described in FIG. The hardware and functional configuration of the form reading apparatus according to the fifth embodiment is the same as the hardware and functional configuration of the form reading apparatus 100 according to the second embodiment described with reference to FIGS. Hereinafter, the form reading apparatus according to the fifth embodiment also indicates each component using the same reference numeral and name as the form reading apparatus 100.

図２１は、第５の実施の形態のキーワード認識対象領域の例を示す図である。キーワード認識対象領域は、領域識別情報３４０の内側の領域である。該領域内には、記入欄３３１ｂ，３３２ｂ，３３３ｂが含まれる。記入欄３３１ｂ，３３２ｂ，３３３ｂは、図５で説明した記入欄３３１ａ，３３２ａ，３３３ａにそれぞれ対応する。 FIG. 21 is a diagram illustrating an example of a keyword recognition target area according to the fifth embodiment. The keyword recognition target area is an area inside the area identification information 340. The area includes entry fields 331b, 332b, and 333b. The entry fields 331b, 332b, and 333b correspond to the entry fields 331a, 332a, and 333a described in FIG.

記入欄３３１ｂの見出し部分のセルには、キーワード３５１および支援情報３５１ｂが印字されている。
キーワード３５１は、文字列“電話番号”を示している。これは、該帳票の帳票画像内に含まれ得るデータ項目名に一致するものとする。支援情報３５１ｂは、キーワード３５１が存在するセルに対するデータ部の位置を示す記号である。支援情報３５１ｂは、数字“１”であり、キーワード３５１が存在するセルに隣接する、図２１の紙面に向かって右側のセルがデータ部であることを示す。キーワード認識処理部１３２は、支援情報３５１ｂに基づいて、キーワード３５１に対するデータ部を特定できる。例えば、キーワード認識処理部１３２は、データ項目“電話番号”に対して右側に隣接するセルを参照し、文字列データ“０００−００００−００００”を取得する。 A keyword 351 and support information 351b are printed in the cell of the heading portion of the entry field 331b.
The keyword 351 indicates the character string “phone number”. This corresponds to the data item name that can be included in the form image of the form. The support information 351b is a symbol indicating the position of the data part with respect to the cell in which the keyword 351 exists. The support information 351b is the number “1”, and indicates that the right cell toward the page of FIG. 21 adjacent to the cell in which the keyword 351 exists is the data part. The keyword recognition processing unit 132 can specify the data part for the keyword 351 based on the support information 351b. For example, the keyword recognition processing unit 132 refers to a cell adjacent to the right side with respect to the data item “phone number”, and acquires character string data “000-0000-0000”.

記入欄３３２ｂの見出し部分のセルには、キーワード３５２および支援情報３５２ｂが印字されている。
キーワード３５２は、文字列“性別”を示している。これは、該帳票の帳票画像内に含まれ得るデータ項目名に一致するものとする。支援情報３５２ｂは、キーワード３５２が存在するセルに対するデータ部の位置を示す記号である。支援情報３５２ｂは、数字“２”であり、キーワード３５２が存在するセルに隣接する、図２１の紙面に向かって下側のセルがデータ部であることを示す。キーワード認識処理部１３２は、支援情報３５２ｂに基づいて、キーワード３５２に対するデータ部を特定できる。例えば、キーワード認識処理部１３２は、データ項目“性別”に対して下側に隣接するセルを参照し、“男”を選択していることを示す選択記号（チェックマーク）を取得する。 A keyword 352 and support information 352b are printed in the cell of the heading portion of the entry column 332b.
The keyword 352 indicates the character string “sex”. This corresponds to the data item name that can be included in the form image of the form. The support information 352b is a symbol indicating the position of the data part with respect to the cell in which the keyword 352 exists. The support information 352b is a number “2”, and indicates that the cell adjacent to the cell in which the keyword 352 exists and the lower cell toward the page of FIG. 21 is the data part. The keyword recognition processing unit 132 can specify the data part for the keyword 352 based on the support information 352b. For example, the keyword recognition processing unit 132 refers to a cell adjacent on the lower side with respect to the data item “sex”, and acquires a selection symbol (check mark) indicating that “male” is selected.

記入欄３３３ｂの見出し部分のセルには、キーワード３５３および支援情報３５３ｂが印字されている。
キーワード３５３は、文字列“生年月日”を示している。これは、該帳票の帳票画像内に含まれ得るデータ項目名に一致するものとする。支援情報３５３ｂの意味は、支援情報３５２ｂと同様である。キーワード認識処理部１３２は、支援情報３５３ｂに基づいて、キーワード３５３に対するデータ部を特定できる。これにより、キーワード認識処理部１３２は、データ項目“生年月日”に対して下側に隣接するセルを参照し、文字列データ“１９８５”年“１１”月“１５”日を取得する。 A keyword 353 and support information 353b are printed in the cell of the heading portion of the entry column 333b.
The keyword 353 indicates the character string “date of birth”. This corresponds to the data item name that can be included in the form image of the form. The meaning of the support information 353b is the same as that of the support information 352b. The keyword recognition processing unit 132 can specify the data part for the keyword 353 based on the support information 353b. Thereby, the keyword recognition processing unit 132 refers to the cell adjacent to the lower side with respect to the data item “birth date”, and acquires the character string data “1985” “11” month “15” day.

次に、以上の構成の帳票読取装置１００の処理手順を説明する。なお、第５の実施の形態の帳票読取処理の手順は図８で説明した第２の実施の形態の帳票読取処理の手順と同様である。 Next, a processing procedure of the form reading apparatus 100 having the above configuration will be described. Note that the procedure of the form reading process of the fifth embodiment is the same as the procedure of the form reading process of the second embodiment described with reference to FIG.

また、第５の実施の形態のキーワード認識処理の手順は、図１０で説明した第２の実施の形態のキーワード認識処理の手順と同様である。ただし、該キーワード認識処理のステップＳ２６におけるデータ部を特定するための処理が異なる。以下では、このデータ部を特定するための処理の手順を説明する。 Further, the procedure of the keyword recognition process of the fifth embodiment is the same as the procedure of the keyword recognition process of the second embodiment described with reference to FIG. However, the process for specifying the data part in step S26 of the keyword recognition process is different. Below, the procedure of the process for specifying this data part is demonstrated.

図２２は、第５の実施の形態のデータ部の特定処理を示すフローチャートである。以下、図２２に示す処理をステップ番号に沿って説明する。
（ステップＳ９１）キーワード認識処理部１３２は、抽出したキーワードの存在する付近に支援情報が存在するか否かを判定する。支援情報が存在する場合、処理をステップＳ９２に進める。支援情報が存在しない場合、処理をステップＳ９３に進める。ここで、キーワードの存在する付近に支援情報が存在するか否かは、例えば、次の方法により判断できる。具体的には、キーワード認識処理部１３２にキーワードに対する支援情報の相対的な位置と支援情報のフォーマットとを予め設定しておく。キーワード認識処理部１３２は、キーワードに対する所定位置に該フォーマットに合致する文字列等が印字されているかを判断することで、支援情報の存在の有無を判定する。 FIG. 22 is a flowchart illustrating the data portion specifying process according to the fifth embodiment. In the following, the process illustrated in FIG. 22 will be described in order of step number.
(Step S91) The keyword recognition processing unit 132 determines whether or not support information exists in the vicinity of the extracted keyword. If support information exists, the process proceeds to step S92. If support information does not exist, the process proceeds to step S93. Here, whether or not support information exists in the vicinity of the keyword can be determined by the following method, for example. More specifically, the relative position of the support information with respect to the keyword and the format of the support information are set in the keyword recognition processing unit 132 in advance. The keyword recognition processing unit 132 determines whether or not the support information exists by determining whether a character string or the like that matches the format is printed at a predetermined position with respect to the keyword.

（ステップＳ９２）キーワード認識処理部１３２は、支援情報に応じた位置をデータ部と特定する。図２１の例でいえば、支援情報３５１ｂに基づいて、キーワード３５１が存在するセルの右側に隣接するセルをデータ部と特定する。また、支援情報３５２ｂに基づいて、キーワード３５２が存在するセルの下側に隣接するセルをデータ部と特定する。何れの支援情報が、どのような位置を示すかは、キーワード認識処理部１３２に予め設定される。そして、処理を終了する。 (Step S92) The keyword recognition processing unit 132 specifies a position corresponding to the support information as a data unit. In the example of FIG. 21, based on the support information 351b, the cell adjacent to the right side of the cell in which the keyword 351 exists is specified as the data part. Further, based on the support information 352b, the cell adjacent to the lower side of the cell in which the keyword 352 exists is specified as the data part. Which support information indicates what position is set in the keyword recognition processing unit 132 in advance. Then, the process ends.

（ステップＳ９３）キーワード認識処理部１３２は、デフォルトの方法により、抽出したキーワードに対するデータ部を特定する。例えば、キーワード認識処理部１３２は、所定位置のセル（例えば、「該キーワードが存在するセルの右側に隣接するセル」）をデータ部と特定する。あるいは、キーワードや、帳票上の表構造に基づき、該キーワードに対応するデータ部を特定してもよい。そして、処理を終了する。 (Step S93) The keyword recognition processing unit 132 specifies a data part for the extracted keyword by a default method. For example, the keyword recognition processing unit 132 specifies a cell at a predetermined position (for example, “a cell adjacent to the right side of the cell in which the keyword exists”) as the data unit. Alternatively, the data portion corresponding to the keyword may be specified based on the keyword or the table structure on the form. Then, the process ends.

このようにして、キーワード認識処理部１３２は、キーワードとともに印字された支援情報を読み取ることで、該キーワードに対するデータ部を効率的に特定できる。
なお、図２１では、数値を支援情報とする場合を例示したが、他の文字（または文字列）、記号などを用いても構わない。 In this way, the keyword recognition processing unit 132 can efficiently identify the data part for the keyword by reading the support information printed together with the keyword.
21 illustrates the case where numerical values are used as support information, but other characters (or character strings), symbols, and the like may be used.

図２３は、第５の実施の形態のデータ部の特定方法の例を示す図である。図２３（Ａ）は、記入欄７００を例示している。記入欄７００は、セル７０１，７０２を含む。セル７０１は、見出し部分のセルである。セル７０１には、キーワード“振込金額”と支援情報“○”（丸印）が印字されている。支援情報“○”は、セル７０１の右側に隣接するセル７０２が、該キーワードに対するデータ部のセルであることを示す。よって、キーワード認識処理部１３２は、記入欄７００につき、支援情報“○”に基づいて、セル７０２を文字認識し、キーワード“振込金額”に対する文字列データ“￥１１０００”を取得する。 FIG. 23 is a diagram illustrating an example of a data part specifying method according to the fifth embodiment. FIG. 23A illustrates an entry field 700. The entry field 700 includes cells 701 and 702. A cell 701 is a heading cell. In the cell 701, a keyword “transfer amount” and support information “◯” (circle) are printed. The support information “◯” indicates that the cell 702 adjacent to the right side of the cell 701 is a cell of the data portion for the keyword. Therefore, the keyword recognition processing unit 132 performs character recognition on the cell 702 based on the support information “◯” for the entry field 700 and acquires character string data “¥ 11000” for the keyword “transfer amount”.

図２３（Ｂ）は、記入欄７１０を例示している。記入欄７１０は、セル７１１，７１２を含む。セル７１１は、見出し部分のセルである。セル７１１には、キーワード“振込金額”と支援情報“△”（三角印）が印字されている。支援情報“△”は、セル７１１の右下側のセル７１２が、該キーワードに対するデータ部のセルであることを示す。よって、キーワード認識処理部１３２は、記入欄７１０につき、支援情報“△”に基づいて、セル７１２を文字認識し、キーワード“振込金額”に対する文字列データ“￥１１０００”を取得する。 FIG. 23B illustrates an entry field 710. The entry field 710 includes cells 711 and 712. A cell 711 is a heading cell. In the cell 711, the keyword “transfer amount” and support information “Δ” (triangle mark) are printed. The support information “Δ” indicates that the cell 712 on the lower right side of the cell 711 is a cell of the data portion for the keyword. Therefore, the keyword recognition processing unit 132 performs character recognition on the cell 712 for the entry field 710 based on the support information “Δ”, and acquires character string data “¥ 11000” for the keyword “transfer amount”.

図２３（Ｃ）は、記入欄７２０を例示している。記入欄７２０は、セル７２１，７２２を含む。セル７２１は、見出し部分のセルである。セル７２１には、キーワード“振込金額”と支援情報“□”（四角印）が印字されている。支援情報“□”は、セル７２１の下側に隣接するセル７２２が、該キーワードに対するデータ部のセルであることを示す。よって、キーワード認識処理部１３２は、記入欄７２０につき、支援情報“□”に基づいて、セル７２２を文字認識し、キーワード“振込金額”対する文字列データ“￥１１０００”を取得する。 FIG. 23C illustrates an entry field 720. The entry field 720 includes cells 721 and 722. A cell 721 is a heading cell. In the cell 721, a keyword “transfer amount” and support information “□” (square mark) are printed. The support information “□” indicates that the cell 722 adjacent to the lower side of the cell 721 is a cell of the data portion for the keyword. Therefore, the keyword recognition processing unit 132 performs character recognition on the cell 722 for the entry field 720 based on the support information “□”, and acquires character string data “¥ 11000” for the keyword “transfer amount”.

以上は一例であり、支援情報には、種々の数値、文字（または文字列）、記号などを利用できる。また、キーワード認識処理部１３２は、枠線や枠内の領域、キーワードのフォーマットなどとして支援情報を検出し、データ部の位置を特定してもよい。次に、そのような支援情報の例を説明する。 The above is an example, and various numerical values, characters (or character strings), symbols, and the like can be used as the support information. Further, the keyword recognition processing unit 132 may detect support information as a frame line, a region within the frame, a keyword format, and the like, and specify the position of the data unit. Next, an example of such support information will be described.

図２４は、第５の実施の形態のデータ部の特定方法の他の例を示す図である。図２４（Ａ）は、記入欄８００を例示している。記入欄８００は、セル８０１，８０２を含む。セル８０１は、見出し部分のセルである。そして、セル８０１の枠内には、該枠の中央からデータ部の方向に対応する一辺の側に、該一辺と平行な罫線が印字される。具体的には、セル８０１に対するデータ部のセルが該セル８０１の右側に隣接するセル８０２であるとき、セル８０１，８０２の境界に位置する枠線よりもセル８０１側（ただし、セル８０１の中央よりもセル８０２側）に、該枠線と平行な罫線８０１ａが印字される。キーワード認識処理部１３２は、罫線８０１ａを支援情報として検出する。そして、キーワード認識処理部１３２は、セル８０１内の罫線８０１ａの位置に基づいて、セル８０２をデータ部として特定する。これにより、キーワード認識処理部１３２は、セル８０１内のキーワード“振込金額”に対して、セル８０２内の文字列データ“￥１１０００”を取得する。 FIG. 24 is a diagram illustrating another example of the data portion specifying method according to the fifth embodiment. FIG. 24A illustrates an entry field 800. The entry field 800 includes cells 801 and 802. A cell 801 is a heading cell. In the frame of the cell 801, a ruled line parallel to the one side is printed on one side corresponding to the direction of the data portion from the center of the frame. Specifically, when the cell of the data part for the cell 801 is a cell 802 adjacent to the right side of the cell 801, the cell 801 side (however, the center of the cell 801 is located at the border between the cells 801 and 802). A ruled line 801a parallel to the frame line is printed on the cell 802 side. The keyword recognition processing unit 132 detects the ruled line 801a as support information. Then, the keyword recognition processing unit 132 identifies the cell 802 as a data portion based on the position of the ruled line 801a in the cell 801. Thus, the keyword recognition processing unit 132 acquires the character string data “¥ 11000” in the cell 802 for the keyword “transfer amount” in the cell 801.

例えば、セル８０１の下側に隣接するセルをデータ部とする場合には、セル８０１の下側の枠線と平行な罫線がセル８０１内の下側に印字される。キーワード認識処理部１３２は、該罫線に基づいて、セル８０１の下側に隣接するセルをデータ部として特定できる。 For example, when a cell adjacent to the lower side of the cell 801 is used as the data portion, a ruled line parallel to the lower frame line of the cell 801 is printed on the lower side in the cell 801. The keyword recognition processing unit 132 can specify a cell adjacent to the lower side of the cell 801 as a data portion based on the ruled line.

図２４（Ｂ）は、記入欄８１０を例示している。記入欄８１０は、セル８１１，８１２を含む。セル８１１は、見出し部分のセルである。セル８１２は、セル８１１内のキーワードに対応するデータ部のセルである。そして、セル８１１とセル８１２との境界線８１１ａが他の枠線とは異なる太さで印字されている。キーワード認識処理部１３２は、太さの異なる境界線８１１ａに基づいて、データ部の位置を特定する。具体的には、セル８１１の境界線８１１ａの方向に隣接するセル８１２をデータ部として特定する。これにより、キーワード認識処理部１３２は、セル８１１内のキーワード“振込金額”に対して、セル８１２内の文字列データ“￥１１０００”を取得する。 FIG. 24B illustrates an entry field 810. The entry field 810 includes cells 811 and 812. A cell 811 is a heading cell. A cell 812 is a data portion cell corresponding to the keyword in the cell 811. A boundary line 811a between the cell 811 and the cell 812 is printed with a thickness different from that of the other frame lines. The keyword recognition processing unit 132 specifies the position of the data part based on the boundary line 811a having a different thickness. Specifically, the cell 812 adjacent in the direction of the boundary line 811a of the cell 811 is specified as the data portion. Accordingly, the keyword recognition processing unit 132 acquires the character string data “¥ 11000” in the cell 812 for the keyword “transfer amount” in the cell 811.

図２４（Ｃ）は、記入欄８２０を例示している。記入欄８２０は、セル８２１，８２２を含む。セル８２１は、見出し部分のセルである。セル８２１内にはキーワードが横一列に並んだ文字列として“振込金額”と印字される。横一列に並んだ文字列は、セル８１１の右側に隣接するセルが該キーワードに対応するデータ部であることを示している。キーワード認識処理部１３２は、セル８２１内のキーワード“振込金額”を検出し、更に、該キーワードが横一列に並んでいることを検知する。すると、キーワード認識処理部１３２は、セル８２１の右側に隣接するセル８２２をデータ部として特定する。これにより、キーワード認識処理部１３２は、セル８２１内のキーワード“振込金額”に対して、セル８２２内の文字列データ“￥１１０００”を取得する。 FIG. 24C illustrates an entry field 820. The entry field 820 includes cells 821 and 822. A cell 821 is a heading cell. In the cell 821, “transfer amount” is printed as a character string in which keywords are arranged in a horizontal row. The character strings arranged in a horizontal row indicate that the cell adjacent to the right side of the cell 811 is a data portion corresponding to the keyword. The keyword recognition processing unit 132 detects the keyword “transfer amount” in the cell 821 and further detects that the keywords are arranged in a horizontal row. Then, the keyword recognition processing unit 132 specifies the cell 822 adjacent to the right side of the cell 821 as the data portion. Accordingly, the keyword recognition processing unit 132 acquires the character string data “¥ 11000” in the cell 822 for the keyword “transfer amount” in the cell 821.

例えば、セル８２１の下側に隣接するセルをデータ部とする場合には、セル８２１内のキーワードが縦一列に印字される。また、例えば、セル８２１の右下側のセルをデータ部とする場合には、セル８２１の左上から右下へ向かう斜めの一列に並んでキーワードが印字される。キーワード認識処理部１３２は、このようにキーワードに含まれる文字の並び方によって、該キーワードに対するデータ部のセルを特定する。 For example, when a cell adjacent to the lower side of the cell 821 is used as a data portion, the keywords in the cell 821 are printed in a vertical line. Further, for example, when the cell on the lower right side of the cell 821 is used as the data portion, the keywords are printed in a diagonal line from the upper left of the cell 821 to the lower right. The keyword recognition processing unit 132 specifies the cell of the data part for the keyword according to the arrangement of the characters included in the keyword.

図２４（Ｄ）は、記入欄８３０を例示している。記入欄８３０は、セル８３１，８３２を含む。セル８３１は、見出し部分のセルである。セル８３２は、セル８３１内のキーワードに対応するデータ部のセルである。セル８３１の枠内には、所定の色が付されている。記入欄８３０では、この色によって、セル８３１に対するセル８３２の位置を示している。例えば、画像情報中、色は複数のパラメータで表現される。例えば、ＨＳＶによる色表現では、色相（hue）、彩度（saturation value）、明度（value）という３つのパラメータで色が表現される。その場合、各パラメータの何れか、あるいは複数のパラメータにより、セル８３１の上下左右の何れの側に隣接するセルがデータ部であるかを予め定義する。例えば、色相の範囲を４等分して、等分した各範囲を上下左右の何れかに割り当てることが考えられる。彩度や明度を用いる場合も同様である。 FIG. 24D illustrates the entry field 830. The entry field 830 includes cells 831 and 832. A cell 831 is a heading cell. Cell 832 is a cell of the data part corresponding to the keyword in cell 831. A predetermined color is given in the frame of the cell 831. In the entry field 830, the position of the cell 832 with respect to the cell 831 is indicated by this color. For example, in the image information, the color is expressed by a plurality of parameters. For example, in color representation by HSV, a color is represented by three parameters, hue (hue), saturation (saturation value), and lightness (value). In that case, it is defined in advance whether the cell adjacent to the upper, lower, left, or right side of the cell 831 is the data portion by any one of the parameters or a plurality of parameters. For example, it is conceivable to divide the hue range into four equal parts and assign each of the equally divided ranges to the top, bottom, left or right. The same applies when using saturation and lightness.

セル８３１内の色は、セル８３１の右側に隣接するセル８３２がセル８３１に対応するデータ部である旨、キーワード認識処理部１３２に予め設定される。キーワード認識処理部１３２は、セル８３１内の色を解析することで、セル８３１に対応するデータ部のセル８３２を特定する。これにより、キーワード認識処理部１３２は、セル８３１内のキーワード“振込金額”に対して、セル８３２内の文字列データ“￥１１０００”を取得する。 The color in the cell 831 is preset in the keyword recognition processing unit 132 to the effect that the cell 832 adjacent to the right side of the cell 831 is a data part corresponding to the cell 831. The keyword recognition processing unit 132 identifies the cell 832 of the data part corresponding to the cell 831 by analyzing the color in the cell 831. Accordingly, the keyword recognition processing unit 132 acquires the character string data “¥ 11000” in the cell 832 for the keyword “transfer amount” in the cell 831.

なお、画像が２値化されている場合には、ハッチングの種類や、ドットの密度などに応じてデータ部を特定してもよい。
以上のように、帳票読取装置１００は、キーワードと共に印字された支援情報を検出して、データ部のセルを特定する。これにより、帳票に対して自由度の高いレイアウト変更が可能になる。加えて、キーワード認識処理において、キーワードに対するデータ部を容易に特定可能となる。 If the image is binarized, the data portion may be specified according to the type of hatching, the dot density, and the like.
As described above, the form reading device 100 detects the support information printed together with the keyword, and specifies the cell of the data part. Thereby, it is possible to change the layout with a high degree of freedom for the form. In addition, in the keyword recognition process, the data portion for the keyword can be easily specified.

１情報処理装置
１ａ検出部
１ｂ領域特定部
１ｃ処理部
２画像情報
２ａ識別情報
２ｂ領域識別情報
３処理対象領域 DESCRIPTION OF SYMBOLS 1 Information processing apparatus 1a Detection part 1b Area | region specific part 1c Processing part 2 Image information 2a Identification information 2b Area identification information 3 Processing object area | region

Claims

Detecting identification information indicating that the layout of the form has been changed from image information obtained by imaging the form;
When the identification information is detected, area identification information included in the image information for identifying a processing target area for performing a predetermined character recognition process is detected, and the processing target area is specified based on the area identification information. ,
Performing the predetermined character recognition processing on the identified processing target area;
An information processing program that causes a computer to execute processing.

The information processing program according to claim 1, wherein the use of the data acquired by the predetermined character recognition process is distinguished according to the area identification information.

When the identification information is not detected, a character recognition process other than the predetermined character recognition process is performed on the entire area of the image information. When the identification information is detected, the other area is detected in an area other than the process target area. The information processing program according to claim 1, wherein character recognition processing is performed.

The other character recognition processing is performed based on the data item name and the character based on the layout definition information stored in a storage unit that stores layout definition information in which a data item name and a character recognition target position are defined in association with each other. The information processing program according to claim 3, which is a character recognition process by layout recognition that is acquired in association with data extracted from a recognition target position.

The predetermined character recognition process detects a predetermined keyword and support information for specifying a position of a data portion corresponding to the keyword in the image information from the image information, and based on the support information, the data portion The information processing program according to any one of claims 1 to 4, which is a process of identifying the data and acquiring data corresponding to the keyword from the data portion.

A detection unit that detects identification information indicating that the layout of the form has been changed from image information obtained by imaging the form;
When the identification information is detected, area identification information included in the image information for identifying a processing target area for performing a predetermined character recognition process is detected, and the processing target area is specified based on the area identification information. An area identification unit;
A processing unit that performs the predetermined character recognition processing on the identified processing target area;
An information processing apparatus.

A character recognition method executed by an information processing apparatus,
When identification information indicating that the layout of the form has been changed is detected from the image information obtained by imaging the form, the process includes a process for identifying a processing target area to be subjected to a predetermined character recognition process included in the image information. Detecting region identification information, specifying the processing target region based on the region identification information,
Performing the predetermined character recognition processing on the identified processing target area;
Character recognition method.