JP2011248609A

JP2011248609A - Form recognition device and form recognition method

Info

Publication number: JP2011248609A
Application number: JP2010120751A
Authority: JP
Inventors: Junichi Hirayama; 淳一平山; Hiroshi Shinjo; 広新庄
Original assignee: Hitachi Omron Terminal Solutions Corp
Current assignee: Hitachi Omron Terminal Solutions Corp
Priority date: 2010-05-26
Filing date: 2010-05-26
Publication date: 2011-12-08
Anticipated expiration: 2030-05-26
Also published as: JP5621169B2

Abstract

PROBLEM TO BE SOLVED: To read a character string robustly against a character recognition error and with fewer errors for forms which have obscure character string arrangements, without needing to pre-define a described position and attributes of a reading-target character string for a form group in which various layouts coexist.SOLUTION: A form recognition device performs the steps of: detecting a character string area from a form image (S120); calculating, for the detected character strings, an item name likelihood representing likeness of an item name and an item value likelihood representing likelihood of an item value (S140, S150); calculating, for a character string pair constituted by a combination of the detected character strings, an arrangement likelihood representing validity of the arrangement relationship of the character string pair as an item name-item value relationship (S160); calculating an evaluation value of the item name-item value relationship based on the values of the item name likelihood, item value likelihood and arrangement likelihood (S170); and determining the item name-item value relationship in the form image (S180).

Description

本発明は、帳票認識装置および帳票認識方法に係り、特に帳票画像上に記載される文字列の属性の理解と文字列認識の技術に関する。 The present invention relates to a form recognition device and a form recognition method, and more particularly, to a technique for understanding attributes of a character string described on a form image and character string recognition.

従来の帳票認識装置は、あらかじめ読取対象文字列の帳票画像上での記載位置とその属性をユーザが事前に装置に登録しておく「帳票定義」により、読取対象文字列の読取および当該文字列の属性の理解を行っていた。 A conventional form recognition apparatus reads a character string to be read and the character string by “form definition” in which the user registers in advance the position and attribute of the character string to be read on the form image in advance. Had to understand the attributes.

帳票処理業務において、処理する帳票のレイアウト、すなわち文字列の記載位置や枠の記載位置、枠の並びが統一されており、帳票画像における読取対象文字列の記載位置が固定である場合には、前記の帳票定義を事前に装置に登録することで読取対象文字列の位置検出および該文字列の属性の理解を行っていた。一方で、多種レイアウトが混在する帳票処理業務や、処理する帳票のレイアウトが未知である帳票処理業務が存在する。多種レイアウトとは、図２に示す帳票例２００，２０１のように、読取対象文字列の記載位置が帳票ごとに異なることである。図２の例は、帳票画像内から「振込先口座番号」「納入金額」「納入期限日付」を読み取る例であるが、それぞれ記載位置が異なるため、帳票ごとに帳票定義を作成する必要がある。業務によっては帳票レイアウトの種類が数万種類に及ぶ場合もあり、帳票定義による認識は、帳票定義の作成コストが膨大になり利用できなかった。 In the form processing business, when the layout of the form to be processed, that is, the description position of the character string, the description position of the frame, and the arrangement of the frames are unified, and the description position of the character string to be read in the form image is fixed, The position of the character string to be read is detected and the attributes of the character string are understood by registering the form definition in the apparatus in advance. On the other hand, there is a form processing operation in which various layouts are mixed and a form processing operation in which the layout of the form to be processed is unknown. The various layouts mean that the description position of the character string to be read differs for each form as in the form examples 200 and 201 shown in FIG. The example of FIG. 2 is an example of reading the “transfer account number”, “delivery amount”, and “delivery date” from the form image. However, since the description positions are different, it is necessary to create a form definition for each form. . Depending on the business, there may be several tens of thousands of forms layout types, and the recognition by the form definition cannot be used due to the enormous cost of creating the form definition.

多種レイアウト帳票を帳票定義を用いずに認識する技術として、例えば特許文献１に開示の技術のように、帳票画像内の文字列と、項目名辞書に登録された項目名単語とを照合し、項目名単語照合に成功した文字列を項目名、項目名単語照合に失敗した文字列を項目値（特許文献１では「データ」と表現している）候補と判定し、項目名と項目値候補の配置関係から、項目名と項目値の対応関係を決定し、項目名辞書に登録された項目名の属性から、対応する項目値の属性を判定する方式がある。 As a technique for recognizing various layout forms without using a form definition, for example, as in the technique disclosed in Patent Document 1, a character string in a form image is collated with an item name word registered in an item name dictionary. A character string that succeeds in item name word matching is determined as an item name, and a character string that fails in item name word matching is determined as an item value (represented as “data” in Patent Document 1), and an item name and item value candidate. There is a method in which the correspondence between the item name and the item value is determined from the arrangement relationship and the attribute of the corresponding item value is determined from the attribute of the item name registered in the item name dictionary.

また、特許文献２の方式では、帳票を論理的に構成する論理要素（項目名や項目値）からなる論理構造を、論理要素となる文字列と当該文字列の出現頻度および論理要素間の相対位置に関する頻度によって定義した辞書を帳票種ごとに作成し、帳票画像内の文字列と辞書内の論理構造を照合することにより、帳票画像内の文字列が論理構造内の論理要素である確率により、帳票画像内から読取対象の文字列を読み取る。 Further, in the method of Patent Document 2, a logical structure composed of logical elements (item names and item values) that logically constitute a form is obtained by using a character string that is a logical element, an appearance frequency of the character string, and a relative relationship between the logical elements. By creating a dictionary defined by the frequency of position for each form type and comparing the character string in the form image with the logical structure in the dictionary, the probability that the character string in the form image is a logical element in the logical structure The character string to be read is read from the form image.

特開２００８−２０４２２６号公報JP 2008-204226 A 特開２００８−３３８３０号公報JP 2008-33830 A

特許文献１では、帳票画像のノイズや低画質などの悪影響により、文字列認識誤りが発生した場合に、項目名単語照合において照合誤りが発生し、帳票画像内の文字列が正しい項目名であるにもかかわらず、項目名でないと判定されることがある。さらに、特許文献１の方式では、項目名と判定された文字列と項目値候補と判定された文字列の配置関係から、項目名−項目値対応関係を決定するため、項目名となる文字列の判定を誤ると、その誤りが項目名−項目値対応関係の判定誤りに直結してしまう。また、１つの項目名に対し、配置関係上対応付けられる項目値候補が複数存在する場合に、項目名−項目値関係の対応付け誤りが発生する恐れがある。 In Patent Document 1, when a character string recognition error occurs due to adverse effects such as noise or low image quality of a form image, a matching error occurs in item name word matching, and the character string in the form image is the correct item name. Nevertheless, it may be determined that it is not an item name. Furthermore, in the method of Patent Document 1, a character string that becomes an item name is used to determine an item name-item value correspondence from the arrangement relationship between a character string determined as an item name and a character string determined as an item value candidate. If the determination is incorrect, the error is directly connected to the determination error of the item name-item value correspondence. Further, when there are a plurality of item value candidates associated with one item name due to the arrangement relationship, there is a possibility that an association error between the item name and the item value relationship may occur.

また、特許文献２の方式では、帳票種ごとに、論理構造辞書内の論理要素の出現頻度や論理要素間の相対位置の頻度を定義するため、辞書の作成コストが膨大になってしまう。また、論理構造辞書と整合性のとれないレイアウトの帳票の場合、正しい認識結果が得られず、汎用性が低下する。 Further, in the method of Patent Document 2, the appearance frequency of the logical elements in the logical structure dictionary and the frequency of the relative position between the logical elements are defined for each form type, resulting in a huge dictionary creation cost. Also, in the case of a form with a layout that is inconsistent with the logical structure dictionary, a correct recognition result cannot be obtained, and versatility decreases.

本発明は、このような問題に鑑みてなされたものである。
すなわち、本発明は、項目名照合の前段階の処理である文字列認識処理において、文字列認識誤りが発生した場合にも、帳票画像内から正しく項目名−項目値関係を抽出する帳票認識方式を提供することを第１の課題とする。
また、本発明は、項目名−項目値関係の配置関係に曖昧性がある場合、つまり１つの項目名に対し、配置関係上対応付けられる項目値候補が複数存在する場合にも、対応付け誤りを少なく、項目名−項目値関係を抽出する帳票認識方式を提供することを第２の課題とする。
また、本発明は、辞書の作成コストを極力少なくかつ様々なレイアウトの帳票に対しても、汎用性高く認識できる帳票認識方式を提供することを第３の課題とする。 The present invention has been made in view of such problems.
That is, the present invention provides a form recognition method for correctly extracting an item name-item value relationship from a form image even when a character string recognition error occurs in a character string recognition process, which is a process prior to item name matching. It is a first problem to provide the above.
In addition, the present invention provides a correspondence error even when the arrangement relationship of the item name-item value relationship is ambiguous, that is, when there are a plurality of item value candidates associated with the arrangement relationship for one item name. A second problem is to provide a form recognition method for extracting the item name-item value relationship.
It is a third object of the present invention to provide a form recognition method that can recognize a document with high versatility even for forms having various layouts with a minimal dictionary creation cost.

上記課題を解決するために、本発明の帳票認識装置は、帳票画像を入力し、当該帳票画像内の文字列の認識処理を行う帳票認識装置であって、前記帳票画像から文字列領域を検出する文字列検出部と、前記文字列領域の個々の文字を認識する文字列認識部と、帳票画像内の文字列に対し、当該文字列が項目名である確率を表す項目名尤度を計算する項目名尤度計算部と、帳票画像内の文字列に対し、当該文字列が項目値である確率を表す項目値尤度を計算する項目値尤度計算部と、帳票画像内の文字列ペアに対し、当該文字列ペアの配置関係が項目名−項目値関係として妥当であるかを表す配置尤度を計算する配置尤度計算部と、前記項目名尤度、項目値尤度、配置尤度を基に、当該文字列ペアの項目名−項目値としての尤もらしさを表す評価値を計算する項目名−項目値関係評価値計算部と、前記項目名−項目値関係評価値計算部の出力する前記評価値により、帳票画像内での項目名−項目値関係の対応付けを決定する項目名−項目値関係決定部を有することを特徴とするものである。 In order to solve the above-described problem, the form recognition apparatus of the present invention is a form recognition apparatus for inputting a form image and performing recognition processing of a character string in the form image, and detecting a character string region from the form image. A character string detection unit that recognizes individual characters in the character string region, and a character string recognition unit that calculates the probability that the character string is an item name for the character string in the form image The item name likelihood calculating unit, the item value likelihood calculating unit for calculating the item value likelihood representing the probability that the character string is the item value for the character string in the form image, and the character string in the form image For a pair, an arrangement likelihood calculating unit that calculates an arrangement likelihood indicating whether the arrangement relation of the character string pair is valid as an item name-item value relationship, and the item name likelihood, item value likelihood, arrangement Based on the likelihood, the character string pair's item name-an evaluation representing the likelihood as the item value. The item name-item value relationship evaluation value calculation unit for calculating the value and the evaluation value output from the item name-item value relationship evaluation value calculation unit associates the item name-item value relationship in the form image. It has an item name-item value relationship determination unit to be determined.

また、本発明の帳票認識装置において、前記配置尤度計算部は、前記文字列ペアの項目名文字列と項目値文字列の枠の配置関係やサイズ、または文字列矩形の配置関係やサイズの項目名−項目値関係の非妥当さを表すルールであるペナルティルールに基づき、前記配置尤度を計算するものである。 Further, in the form recognition apparatus of the present invention, the placement likelihood calculation unit is configured such that the placement relationship or size of the frame of the item name character string and the item value character string of the character string pair, or the placement relationship or size of the character string rectangle. The placement likelihood is calculated based on a penalty rule that is a rule representing the invalidity of the item name-item value relationship.

また、本発明の帳票認識装置において、前記項目名尤度計算部は、項目名単語を記載した項目名辞書との照合により、前記文字列に対し前記項目名尤度を計算し、前記項目値尤度計算部は、項目値単語や文字列の文法表記ルールを記載した表記辞書との照合により、前記文字列に対し前記項目値尤度を計算するものである。 In the form recognition device of the present invention, the item name likelihood calculating unit calculates the item name likelihood for the character string by collating with an item name dictionary describing item name words, and the item value The likelihood calculating unit calculates the item value likelihood for the character string by collating it with a notation dictionary describing grammar notation rules for item value words and character strings.

本発明により、多種レイアウトの帳票が混在する帳票処理業務において、厳密な帳票定義なしに帳票を認識することができる。また、文字認識誤りに頑健に、ならびに項目名−項目値関係に曖昧性のある帳票を誤りが少なく認識することができる。 According to the present invention, a form can be recognized without a strict form definition in a form processing operation in which forms of various layouts are mixed. Further, it is possible to recognize a form that is robust against a character recognition error and has an ambiguous item name-item value relationship with few errors.

本発明の実施例における、帳票認識処理のフロー図である。It is a flowchart of the form recognition process in the Example of this invention. 多種レイアウト帳票の例を示す図である。It is a figure which shows the example of various layout forms. 本発明の実施例における、帳票認識装置の構成図である。It is a block diagram of the form recognition apparatus in the Example of this invention. 本発明の実施例における、帳票認識部のブロック構成図である。It is a block block diagram of the form recognition part in the Example of this invention. 項目名辞書の記載構造の例を示す図である。It is a figure which shows the example of the description structure of an item name dictionary. 本発明の実施例における、項目名尤度計算のフローチャートである。It is a flowchart of item name likelihood calculation in the Example of this invention. 本発明の実施例における、項目名尤度テーブルのデータ構造例を示す図である。It is a figure which shows the example of a data structure of the item name likelihood table in the Example of this invention. 表記辞書の記載構造の例を示す図である。It is a figure which shows the example of the description structure of a description dictionary. 本発明の実施例における、項目値尤度計算のフローチャートである。It is a flowchart of item value likelihood calculation in the Example of this invention. 本発明の実施例における、項目名尤度テーブルのデータ構造例を示す図である。It is a figure which shows the example of a data structure of the item name likelihood table in the Example of this invention. 本発明の実施例における、配置尤度計算のフローチャートである。It is a flowchart of arrangement | positioning likelihood calculation in the Example of this invention. 配置尤度計算における、ペナルティルールの例である。It is an example of a penalty rule in arrangement likelihood calculation. 本発明の実施例における、配置尤度テーブルのデータ構造例を示す図である。It is a figure which shows the example of a data structure of the arrangement | positioning likelihood table in the Example of this invention. 本発明の実施例における、項目名−項目値関係評価値テーブルのデータ構造例を示す図である。It is a figure which shows the example of a data structure of the item name-item value relationship evaluation value table in the Example of this invention. 帳票画像の例を示す図であるIt is a figure which shows the example of a form image

以下、本発明の実施の形態を説明する。なお、これにより本発明が限定されるものではない。具体的な処理の内容を説明する前に、本発明の概略について説明する。 Embodiments of the present invention will be described below. Note that the present invention is not limited thereby. The outline of the present invention will be described before describing the details of specific processing.

本発明は、多種レイアウトが混在する帳票群を、読取対象文字列の記載位置および当該文字列の属性を事前に登録する帳票定義なしに、読取対象文字列の読取および当該文字列の属性の判定を行うものである。このためには、項目値とそれに対応する項目名のペアを帳票画像内から抽出することが必要である。本実施例では、帳票画像内の全文字列に対し、当該文字列が項目名である確率を表す項目名尤度、項目値である確率を表す項目値尤度を計算し、帳票画像内の全文字列ペアに対し、当該文字列ペアをなす２つの文字列の配置関係が項目名−項目値関係として妥当であるかを表した配置尤度を計算する。さらに、項目名尤度と項目値尤度と配置尤度を基に計算した項目名−項目値関係評価値の値を基に、帳票画像内の項目名−項目値関係の対応付けを決定する。 The present invention reads a character string to be read and determines the attribute of the character string without a form definition for registering in advance the description position of the character string to be read and the attribute of the character string. Is to do. For this purpose, it is necessary to extract pairs of item values and corresponding item names from the form image. In this embodiment, for all the character strings in the form image, the item name likelihood indicating the probability that the character string is the item name and the item value likelihood indicating the probability that the character string is the item value are calculated. For all character string pairs, an arrangement likelihood representing whether the arrangement relationship between the two character strings forming the character string pair is valid as the item name-item value relationship is calculated. Further, the association between the item name-item value relationship in the form image is determined based on the value of the item name-item value relationship evaluation value calculated based on the item name likelihood, the item value likelihood, and the placement likelihood. .

具体的には、以下の順序により、帳票画像内の項目名−項目値関係の対応付けを決定する。
（１）ユーザが事前に登録した項目名のリストである項目名辞書内の項目名単語と、帳票画像内の文字列とを照合し、項目名辞書内の全ての項目名単語と、帳票画像内の全ての文字列の組み合わせに対して、項目名尤度を計算する。
（２）例えば、日付、金額、口座番号などの汎用的に利用できる文法表記ルールによって定義される表記辞書と、帳票画像内の文字列とを照合し、全ての表記辞書と、帳票画像内の全ての文字列の組み合わせに対して、項目値尤度を計算する。
（３）２つの文字列の配置関係が項目名−項目値関係として非妥当な配置関係となるルールを記載したペナルティルールと、帳票画像内の２つの文字列の組み合わせからなる全ての文字列ペアの配置関係を参照し、帳票画像内の全ての文字列ペアに対して、配置尤度を計算する。
（４）項目名尤度、項目値尤度、配置尤度を基に、帳票画像内の全ての文字列ペアに対して、当該文字列ペアが項目名−項目値関係にあるかを表す評価値を計算し、前記評価値を基に帳票画像内から項目名−項目値関係を抽出する。
なお、（１）（２）（３）はそれぞれ独立に処理されるため、順序は上記の順に依らない。 Specifically, the association of the item name-item value relationship in the form image is determined in the following order.
(1) The item name word in the item name dictionary, which is a list of item names registered in advance by the user, is matched with the character string in the form image, and all the item name words in the item name dictionary and the form image The item name likelihood is calculated for all combinations of character strings.
(2) For example, a notation dictionary defined by general-purpose grammar notation rules such as date, amount, and account number is collated with a character string in a form image. Item value likelihood is calculated for all combinations of character strings.
(3) A penalty rule describing a rule in which the arrangement relationship between two character strings is an invalid arrangement relationship as an item name-item value relationship, and all character string pairs composed of two character strings in the form image The placement likelihood is calculated for all the character string pairs in the form image.
(4) Based on the item name likelihood, the item value likelihood, and the placement likelihood, an evaluation indicating whether the character string pair has an item name-item value relationship with respect to all the character string pairs in the form image. A value is calculated, and an item name-item value relationship is extracted from the form image based on the evaluation value.
Since (1), (2), and (3) are processed independently, the order does not depend on the above order.

以下、本発明の一実施例になる帳票認識装置および帳票認識方法について、図面を用いて詳細に説明する。 Hereinafter, a form recognition apparatus and a form recognition method according to an embodiment of the present invention will be described in detail with reference to the drawings.

図３は、本発明の帳票認識装置のハードウェア構成例である。本実施例の帳票認識装置は、命令コマンドやデータなどを入力するための入力装置３０１、認識対象の帳票を入力する画像入力装置３０２、文字列の検出や文字列認識、項目名−項目値関係の解析を行う帳票認識部３００、文字認識辞書や項目名単語辞書を格納する認識辞書３０３、帳票画像の認識結果を表示する表示装置３０４を備える。帳票認識部３００と、入力装置３０１、画像入力装置３０２、認識辞書３０３、表示装置３０４は、物理的な接続手段に依らず、ネットワークなどを介して接続されてもよい。 FIG. 3 is a hardware configuration example of the form recognition apparatus of the present invention. The form recognition apparatus according to the present embodiment includes an input apparatus 301 for inputting command commands and data, an image input apparatus 302 for inputting a form to be recognized, character string detection and character string recognition, and item name-item value relationships. A form recognition unit 300 for analyzing the above, a recognition dictionary 303 for storing a character recognition dictionary and an item name word dictionary, and a display device 304 for displaying the recognition result of the form image. The form recognition unit 300, the input device 301, the image input device 302, the recognition dictionary 303, and the display device 304 may be connected via a network or the like without depending on physical connection means.

図４に、帳票認識部３００の詳細なブロック構成図を示す。帳票認識部３００は、文字列検出部４２０、文字列認識部４３０、項目名尤度計算部４４０、項目値尤度計算部４５０、配置尤度計算部４６０、項目名−項目値関係評価値計算部４７０、項目名−項目値関係決定部４８０などから構成されている。 FIG. 4 shows a detailed block diagram of the form recognition unit 300. The form recognition unit 300 includes a character string detection unit 420, a character string recognition unit 430, an item name likelihood calculation unit 440, an item value likelihood calculation unit 450, an arrangement likelihood calculation unit 460, and an item name-item value relationship evaluation value calculation. 470, an item name-item value relationship determining unit 480, and the like.

ここで、文字列検出部４２０は、帳票画像から文字列領域を検出するものである。文字列認識部４３０は、文字列領域の個々の文字を認識するものである。項目名尤度計算部４４０は、帳票画像内の文字列に対し、当該文字列が項目名辞書に登録された単語である確率を表す項目名尤度を計算するものである。項目値尤度計算部４５０は、帳票画像内の文字列に対し、当該文字列が表記辞書に登録された単語や、金額、日付、口座番号などの文法ルールに一致する確率である項目値尤度を計算するものである。配置尤度計算部４６０は、帳票画像内の文字列ペアに対し、当該文字列ペアの配置関係の項目名−項目値関係としての妥当さである配置尤度を計算するものである。項目名−項目値関係評価値計算部４７０は、前記項目名尤度、項目値尤度、配置尤度を基に、当該文字列ペアの項目名−項目値としての尤もらしさを表す評価値を計算するものである。項目名−項目値関係決定部４８０は、前記項目名−項目値関係評価値計算部の出力する評価値により、帳票画像内での項目名−項目値関係を決定するものである。 Here, the character string detection unit 420 detects a character string region from the form image. The character string recognition unit 430 recognizes individual characters in the character string area. The item name likelihood calculating unit 440 calculates an item name likelihood representing the probability that the character string is a word registered in the item name dictionary for the character string in the form image. The item value likelihood calculation unit 450 is an item value likelihood that is the probability that the character string in the form image matches a grammar rule such as a word, amount, date, or account number registered in the notation dictionary. The degree is calculated. The placement likelihood calculation unit 460 calculates the placement likelihood, which is the validity of the item name-item value relationship of the placement relationship of the character string pair, for the character string pair in the form image. The item name-item value relationship evaluation value calculation unit 470 calculates an evaluation value representing the likelihood as the item name-item value of the character string pair based on the item name likelihood, the item value likelihood, and the placement likelihood. It is to calculate. The item name-item value relationship determination unit 480 determines the item name-item value relationship in the form image based on the evaluation value output from the item name-item value relationship evaluation value calculation unit.

図１に、帳票認識部３００における帳票認識の処理フロー図を示す。まず、ステップＳ１１０において、入力された帳票画像から枠を検出する。次に、ステップＳ１２０において、文字列検出部４２０で、帳票画像から文字列領域を検出する。文字列領域とは、ある１つの文字列を含む矩形領域である。ステップＳ１１０およびＳ１２０の具体例として、例えば、特開平１１−５３４６６号公報に開示の技術のように、帳票画像から罫線を抽出し、２本の罫線の交点と端点を抽出し、矩形枠の四隅に相当する右上角、左上角、右下角、左下角を検出することにより、帳票画像内から枠を検出する方式を利用することができる。ステップＳ１３０では、文字列認識部４３０において、ステップＳ１２０において検出された文字列領域の個々の文字を文字認識辞書１３１を用いて認識する。ステップＳ１３０の具体例として、例えば、非特許文献：F.Kimura et. al. “Modified quadratic discriminant functions and the application to chinese character recognition” IEEE Transaction on Pattern Analysis and Machine Intelligence、 vol.9、 pp.149-153 に開示の技術のように、ベイズの定理から導かれる距離尺度である識別関数を文字カテゴリごとに定め、識別関数の出力する値に基づいて、未知の文字を文字カテゴリに分類することで、個々の文字を認識する方式などがある。 FIG. 1 shows a process flow diagram of form recognition in the form recognition unit 300. First, in step S110, a frame is detected from the input form image. Next, in step S120, the character string detection unit 420 detects a character string region from the form image. The character string area is a rectangular area including a certain character string. As specific examples of steps S110 and S120, for example, as in the technique disclosed in Japanese Patent Application Laid-Open No. 11-53466, a ruled line is extracted from a form image, and an intersection and an end point of two ruled lines are extracted. By detecting the upper right corner, the upper left corner, the lower right corner, and the lower left corner corresponding to, a method of detecting a frame from the form image can be used. In step S130, the character string recognition unit 430 recognizes each character in the character string area detected in step S120 using the character recognition dictionary 131. As a specific example of step S130, for example, non-patent document: F. Kimura et. Al. “Modified quadratic discriminant functions and the application to chinese character recognition” IEEE Transaction on Pattern Analysis and Machine Intelligence, vol.9, pp.149- As in the technique disclosed in 153, by determining the identification function, which is a distance measure derived from Bayes' theorem, for each character category, and classifying unknown characters into the character category based on the value output by the identification function, There are methods for recognizing individual characters.

ステップＳ１４０では、項目名尤度計算部４４０で、ステップＳ１３０において認識された各文字列の文字列認識結果ごとに、項目名辞書１４１と照合し、当該文字列が項目名辞書１４１に登録された単語である確率を表す項目名尤度を計算する。また、ステップＳ１５０では、項目値尤度計算部４５０で、ステップＳ１３０において認識された各文字列の文字列認識結果ごとに、表記辞書１５１と照合し、当該文字列が表記辞書１５１に記載された単語や文字列の文法表記ルールに一致する確率を表す項目値尤度を計算する。ステップＳ１４０、ステップＳ１５０の処理は、後に詳細に説明する。 In step S140, the item name likelihood calculation unit 440 collates the item name dictionary 141 for each character string recognition result of each character string recognized in step S130, and the character string is registered in the item name dictionary 141. The item name likelihood representing the probability of being a word is calculated. In step S150, the item value likelihood calculation unit 450 collates with the notation dictionary 151 for each character string recognition result of each character string recognized in step S130, and the character string is described in the notation dictionary 151. The item value likelihood representing the probability of matching the grammar notation rule of the word or character string is calculated. The processing of step S140 and step S150 will be described in detail later.

ステップＳ１６０では、配置尤度計算部４６０で、ステップＳ１１０において検出された枠の座標およびステップＳ１２０において検出された文字列領域の座標を基に、帳票画像内の文字列ペアをなす２つの文字列の配置関係の項目名−項目値関係としての妥当さを表す配置尤度を計算する。配置尤度は、２つの文字列が属する枠のサイズおよび配置関係や、２つの文字列矩形のサイズおよび配置関係を基に計算する。ステップＳ１６０の処理は、後に詳細に説明する。 In step S160, two character strings forming a character string pair in the form image based on the coordinates of the frame detected in step S110 and the coordinates of the character string region detected in step S120 by the placement likelihood calculation unit 460. The placement likelihood representing the validity as the item name-item value relationship of the placement relationship is calculated. The placement likelihood is calculated based on the size and placement relationship of the frame to which the two character strings belong, and the size and placement relationship of the two character string rectangles. The process of step S160 will be described in detail later.

ステップＳ１７０では、項目名−項目値関係評価値計算部４７０で、ステップＳ１４０において計算された項目名尤度、ステップＳ１５０において計算された項目値尤度、ステップＳ１６０において計算された配置尤度を基に、各尤度の数値から２つの文字列の項目名−項目値関係の評価値を計算する。評価値の計算には、例えば、項目名尤度、項目値尤度、配置尤度を代入することで評価値を出力する評価関数を用いる方法などがある。また、項目名尤度、項目値尤度、配置尤度が全て事前に定義したある閾値を超える場合に評価値を１（フラグを立てる）にする方法などがある。ステップＳ１７０の処理は、後に詳細に説明する。 In step S170, the item name-item value relation evaluation value calculation unit 470 uses the item name likelihood calculated in step S140, the item value likelihood calculated in step S150, and the placement likelihood calculated in step S160. Then, the evaluation value of the item name-item value relationship of the two character strings is calculated from the numerical value of each likelihood. For example, the evaluation value is calculated by using an evaluation function that outputs an evaluation value by substituting item name likelihood, item value likelihood, and placement likelihood. Further, there is a method of setting the evaluation value to 1 (setting a flag) when the item name likelihood, the item value likelihood, and the placement likelihood all exceed a predetermined threshold value. The process of step S170 will be described in detail later.

ステップＳ１８０では、項目名−項目値関係決定部４８０において、ステップＳ１７０で計算された評価値を基に、帳票画像内での項目名−項目値関係を決定する。例えば、評価値がある閾値以上となる文字列ペアを項目名−項目値関係と決定するなどがある。また、同一属性の項目名−項目値関係候補のうち、最大の評価値を持つ文字列ペアを項目名−項目値関係と決定するなどがある。項目名−項目値関係を決定することにより、文字列の認識結果および該文字列の属性が決定される。 In step S180, the item name-item value relationship determination unit 480 determines the item name-item value relationship in the form image based on the evaluation value calculated in step S170. For example, a character string pair whose evaluation value is greater than or equal to a certain threshold value is determined as an item name-item value relationship. In addition, among the item name-item value relationship candidates having the same attribute, a character string pair having the largest evaluation value is determined as the item name-item value relationship. By determining the item name-item value relationship, the recognition result of the character string and the attribute of the character string are determined.

図１において、ステップＳ１４０、ステップＳ１５０、ステップＳ１６０はそれぞれ独立に並列に計算する。なお、ステップＳ１１０の後段にステップＳ１２０、ステップＳ１２０の後段にステップＳ１３０、ステップＳ１３０の後段にステップＳ１４０とＳ１５０、ステップＳ１２０の後段にステップＳ１６０、ステップＳ１４０とＳ１５０とＳ１６０の後段にステップＳ１７０、ステップＳ１７０の後段にステップＳ１８０が処理される構成であれば、処理フローは図１の形式に依らない。 In FIG. 1, step S140, step S150, and step S160 are independently calculated in parallel. Note that step S120 is subsequent to step S110, step S130 is subsequent to step S120, steps S140 and S150 are subsequent to step S130, steps S160 are subsequent to step S120, and steps S170 and S170 are subsequent to steps S140, S150, and S160. If step S180 is processed in the subsequent stage, the processing flow does not depend on the format of FIG.

以下、図１のステップＳ１４０の項目名尤度計算の処理フロー、ステップＳ１５０の項目値尤度計算の処理フロー、ステップＳ１６０の配置尤度計算の処理フロー、およびステップＳ１７０の項目名−項目値関係評価値計算の計算例について詳細に説明する。 Hereinafter, the processing flow of the item name likelihood calculation in step S140 in FIG. 1, the processing flow of the item value likelihood calculation in step S150, the processing flow of the placement likelihood calculation in step S160, and the item name-item value relationship in step S170. A calculation example of evaluation value calculation will be described in detail.

まず、ステップＳ１４０の項目名尤度計算の処理フローについて、図５と図６と図７を用いて説明する。
図５は項目名辞書の例、図６は項目名尤度計算のフローチャート、図７は項目名尤度テーブルの例である。図６のステップＳ１４０１では、帳票画像内の項目名尤度を計算していない文字列の有無を判定する。残り文字列がない場合は、項目名尤度計算処理を終了する。ステップＳ１４０２では、当該文字列に対して、照合を行っていない項目名単語の有無を判定する。ステップＳ１４０３では、帳票画像内の文字列と項目名辞書内の項目名の照合を行い、項目名尤度を計算する。ステップＳ１４０３における単語照合の具体例としては、例えば、特開２００４−１７１３１６号公報に開示の技術を利用することができる。また、項目名尤度の計算方法として、例えば、図１のステップＳ１３０における文字列認識結果が各個別文字の文字識別尤度を有し、単語照合により求まった文字列パスの各個別文字の文字識別尤度の平均値を項目名尤度とする方法が利用できる。また、これに依らず、個別文字の識別尤度、個別文字への切出し尤度、個別文字矩形のサイズおよびアスペクト比等を基に項目名尤度を計算する方式であってもよい。ステップＳ１４０４では、ステップＳ１４０３で得られた項目名尤度が最大のものを図７の項目名尤度テーブルに記録する。図５の項目名辞書を用いて、図１５の帳票画像内の文字列に対して項目名尤度を計算する場合、項目名尤度テーブルの例は図６のようになる。図７の項目名尤度テーブルの場合、帳票画像内のＮ個（本実施例の場合Ｎ＝３）の文字列に対し、全ての項目名単語との照合を行い、項目名尤度を計算する。 First, the processing flow of the item name likelihood calculation in step S140 will be described using FIG. 5, FIG. 6, and FIG.
FIG. 5 is an example of an item name dictionary, FIG. 6 is a flowchart of item name likelihood calculation, and FIG. 7 is an example of an item name likelihood table. In step S1401 of FIG. 6, it is determined whether or not there is a character string for which the item name likelihood is not calculated in the form image. If there is no remaining character string, the item name likelihood calculation process is terminated. In step S1402, it is determined whether or not there is an item name word that has not been collated with respect to the character string. In step S1403, the character string in the form image is compared with the item name in the item name dictionary, and the item name likelihood is calculated. As a specific example of word collation in step S1403, for example, the technique disclosed in Japanese Patent Application Laid-Open No. 2004-171316 can be used. In addition, as a method for calculating the item name likelihood, for example, the character string recognition result in step S130 of FIG. 1 has the character identification likelihood of each individual character, and the character of each individual character in the character string path obtained by word matching A method of using the average value of the identification likelihood as the item name likelihood can be used. In addition, the item name likelihood may be calculated based on the identification likelihood of individual characters, the likelihood of extraction into individual characters, the size and aspect ratio of the individual character rectangle, and the like. In step S1404, the item with the maximum item name likelihood obtained in step S1403 is recorded in the item name likelihood table of FIG. When the item name likelihood is calculated for the character string in the form image of FIG. 15 using the item name dictionary of FIG. 5, an example of the item name likelihood table is as shown in FIG. In the case of the item name likelihood table of FIG. 7, N item strings in the form image (N = 3 in this embodiment) are collated with all item name words, and the item name likelihood is calculated. To do.

次に、図１のステップＳ１５０の項目値尤度計算の処理フローについて、図８と図９と図１０を用いて説明する。基本的な概念は前記の項目名尤度計算と同じである。
図９のステップＳ１５０１では、帳票画像内に項目値尤度の計算を行っていない文字列があるか判定する。残り文字列がなければ、項目値尤度計算処理を終了する。ステップＳ１５０２では、当該文字列に対し、照合を行っていない表記辞書があるか否かを判定する。残り辞書がなければ、帳票画像内の次の文字列の項目値尤度計算に移行する。ステップＳ１５０３では、帳票画像内の文字列と表記辞書との照合を行い、項目値尤度を計算する。ステップＳ１５０３における単語照合の具体的な実施例としては、例えば、非特許文献：高橋他、「回帰的遷移ネットワークを用いた文字経路探索方式の開発」、電子情報通信学会技術研究報告 Vol.109 No.418 pp.141-146、に開示の技術のように、個別文字の識別候補をノードと見立てた識別候補文字ネットワークと状態遷移ネットワークで表現した表記辞書のマッチングにより、状態遷移ネットワークから最適な文字列パスを選択し、文字列認識結果を得る方法がある。項目値尤度の計算例として、前述の項目名尤度と同様に、個別文字の識別尤度、個別文字への切出し尤度、個別文字矩形のサイズおよびアスペクト比等を基に項目名尤度を計算する方式がある。ステップＳ１５０４では、ステップＳ１５０３で得られた項目値尤度が最大のものを、項目値尤度テーブルに登録する。以上のステップＳ１５０３、Ｓ１５０４の処理を全ての帳票画像内の文字列と表記辞書の組み合わせに対して行う（ステップＳ１５０５、Ｓ１５０６）。表記辞書の例として、例えば、図８の８０１に示すような文字列の表記ルールの正規表現、８０２に示す単語ネットワーク、８０３に示す項目値単語リストなどがある。図８の表記辞書８０１を用いて、図１５の帳票画像内の文字列に対して項目値尤度を計算する場合、項目値尤度テーブルは図１０のようになる。 Next, the processing flow of the item value likelihood calculation in step S150 of FIG. 1 will be described using FIG. 8, FIG. 9, and FIG. The basic concept is the same as the item name likelihood calculation described above.
In step S1501 of FIG. 9, it is determined whether there is a character string for which the item value likelihood is not calculated in the form image. If there is no remaining character string, the item value likelihood calculation process is terminated. In step S1502, it is determined whether or not there is a notation dictionary that is not collated for the character string. If there is no remaining dictionary, the process proceeds to the item value likelihood calculation of the next character string in the form image. In step S1503, the character string in the form image is compared with the notation dictionary, and the item value likelihood is calculated. Specific examples of word matching in step S1503 include, for example, non-patent literature: Takahashi et al., “Development of a character path search method using a recursive transition network”, IEICE Technical Report Vol.109 No. As in the technology disclosed in .418 pp.141-146, the optimal character from the state transition network is matched by matching the identification candidate character network in which individual character identification candidates are regarded as nodes and the notation dictionary expressed by the state transition network. There is a method of obtaining a character string recognition result by selecting a column path. As an example of item value likelihood calculation, the item name likelihood is based on the identification likelihood of individual characters, the likelihood of extraction into individual characters, the size and aspect ratio of individual character rectangles, etc. There is a method to calculate In step S1504, the item value likelihood obtained in step S1503 is registered in the item value likelihood table. The processes in steps S1503 and S1504 described above are performed on combinations of character strings and notation dictionaries in all form images (steps S1505 and S1506). Examples of the notation dictionary include a regular expression of a character string notation rule as indicated by 801 in FIG. 8, a word network indicated by 802, an item value word list indicated by 803, and the like. When the item value likelihood is calculated for the character string in the form image of FIG. 15 using the notation dictionary 801 of FIG. 8, the item value likelihood table is as shown in FIG.

次に、図１のステップＳ１６０の配置尤度計算の処理フローについて、図１１と図１２を用いて説明する。図１１のステップＳ１７０１では、配置尤度を計算していない帳票画像内の文字列ペアの有無を判定する。残り文字列ペアがなければ配置尤度計算処理を終了する。ステップＳ１７０２では、配置尤度の初期化を行う。本方式では、例えば配置尤度は、配置尤度が取り得る値の最大値で初期化し、文字列ペアをなす２つの文字列の配置関係が項目名−項目値関係として非妥当であると判断できる配置パターンをペナルティルールとして定義し、初期値からペナルティルールによって得られたペナルティ値を減算した値として計算する方法がある。そのため、ステップＳ１７０２では、配置尤度の初期値を配置尤度が取り得る値の最大値と設定する。ステップＳ１７０４では、２つの文字列の配置関係が参照しているペナルティルールに該当するか否かを判断する。該当した場合、ステップＳ１７０５で、ペナルティ値を計算し、現在の配置尤度から減算する。以上、ステップＳ１７０３、ステップＳ１７０４、ステップＳ１７０５をペナルティルールの数だけ繰り返し、当該文字列ペアの配置尤度を計算する。以上の処理を帳票画像内の全文字列ペアに対して実行する。 Next, the processing flow of the placement likelihood calculation in step S160 of FIG. 1 will be described using FIG. 11 and FIG. In step S1701 of FIG. 11, the presence / absence of a character string pair in the form image for which the placement likelihood is not calculated is determined. If there are no remaining character string pairs, the placement likelihood calculation process is terminated. In step S1702, the placement likelihood is initialized. In this method, for example, the placement likelihood is initialized with the maximum value that the placement likelihood can take, and the placement relationship between the two character strings forming the character string pair is determined to be invalid as the item name-item value relationship. There is a method of defining a possible arrangement pattern as a penalty rule, and calculating as a value obtained by subtracting a penalty value obtained by the penalty rule from an initial value. Therefore, in step S1702, the initial value of the placement likelihood is set as the maximum value that the placement likelihood can take. In step S1704, it is determined whether the arrangement relationship between the two character strings corresponds to the penalty rule referred to. If applicable, in step S1705, a penalty value is calculated and subtracted from the current placement likelihood. As described above, step S1703, step S1704, and step S1705 are repeated by the number of penalty rules, and the arrangement likelihood of the character string pair is calculated. The above processing is executed for all character string pairs in the form image.

ペナルティルールの例を図１２に示す。ペナルティルールは２つの文字列の属する枠の配置関係やサイズ、２つの文字列矩形の配置関係やサイズを基に計算する。例えば、ルール１１０１の場合、２つの文字列が相互に隣接する枠内に存在する場合に、項目名となる文字列の中心座標が、枠の中心座標からずれがある場合に、ずれの距離に応じてペナルティを付加する。これは、本来２つの文字列が項目名−項目値関係にある場合には、項目名となる文字列は枠の中心付近に存在するといった仮定によるものである。ルール１１０２の場合、項目名となる文字列の属する枠の高さが、項目値となる文字列の属する枠の高さよりも大きい場合に、枠高さの比率に応じてペナルティを付加する。これは、本来は、項目名の属する枠の高さより、項目値の属する枠高さが大きいといった仮定に基づくものである。ルール１１０３の場合、２つの文字列が同一枠に存在する場合に、項目名となる文字列よりも、項目値となる文字列が左もしくは上に存在する場合に、左もしくは上方向へのはみ出し距離に応じてペナルティを付加する。これは、本来、２つの文字列が項目名−項目値関係にある場合には、項目名となる文字列の右下方向に項目値が存在するといった仮定に基づくものである。ルール１１０４の場合、２つの文字列が帳票画像内のいずれの枠にも属さず、枠の外かつ相互に近くに存在する場合に、項目名となる文字列の高さと、項目値となる文字列の高さが異なる場合に、文字列高さの比率に応じて、ペナルティを付加する。これは、本来は、項目名と項目値の文字列の高さは互いにほぼ等しいという仮定に基づくものである。ルール１１０５は、項目名となる文字列と、項目値となる文字列の距離が離れている場合に、２つの文字列の距離に応じてペナルティを付加する。これは、２つの文字列が項目名−項目値関係にある場合には、２つの文字列は相互に近くにあるといった仮定に基づくものである。ルール１１０６は、項目名となる文字列より、項目値となる文字列が左上方向に存在する場合に、そのずれの距離に応じてペナルティを付加するものである。これは、本来は、項目名の右下方向に項目値が存在するといった仮定に基づくものである。 An example of the penalty rule is shown in FIG. The penalty rule is calculated based on the arrangement relation and size of the frame to which the two character strings belong, and the arrangement relation and size of the two character string rectangles. For example, in the case of the rule 1101, when two character strings exist in frames adjacent to each other, and the center coordinate of the character string serving as the item name is shifted from the center coordinate of the frame, the shift distance is set. A penalty is added accordingly. This is based on the assumption that when two character strings originally have an item name-item value relationship, the character string serving as the item name exists near the center of the frame. In the case of the rule 1102, if the height of the frame to which the character string that is the item name belongs is larger than the height of the frame to which the character string that is the item value belongs, a penalty is added according to the ratio of the frame height. This is based on the assumption that the frame height to which the item value belongs is larger than the frame height to which the item name belongs. In the case of rule 1103, when two character strings exist in the same frame, if the character string that is the item value exists to the left or above the character string that is the item name, the character protrudes to the left or upward. Penalties are added according to the distance. This is based on the assumption that when two character strings are in the item name-item value relationship, the item value exists in the lower right direction of the character string serving as the item name. In the case of rule 1104, when two character strings do not belong to any frame in the form image and exist outside the frame and close to each other, the height of the character string as the item name and the character as the item value If the column height is different, a penalty is added according to the ratio of the character string height. This is based on the assumption that the height of the character string of the item name and the item value is substantially equal to each other. The rule 1105 adds a penalty according to the distance between two character strings when the distance between the character string as the item name and the character string as the item value is long. This is based on the assumption that when two character strings are in the item name-item value relationship, the two character strings are close to each other. The rule 1106 adds a penalty according to the deviation distance when a character string as an item value exists in the upper left direction from a character string as an item name. This is based on the assumption that the item value exists in the lower right direction of the item name.

以上のように、ペナルティルールは、２つの文字列が項目名−項目値関係にある場合の尤もらしい配置関係を仮定し、その仮定から外れる配置関係に２つの文字列が配置される場合に、ペナルティが付加されるように生成される。なお、ペナルティルールは図１２に示した１１０１から１１０６に限定されるものではなく、２つの文字列の属する枠の配置関係やサイズ、２つの文字列矩形の配置関係やサイズなどから計算されるものであり、２つの文字列が項目名−項目値関係にある場合に尤もらしい配置関係を仮定し、その仮定から外れる場合に付加される計算方法であれば、これに依らない。 As described above, the penalty rule assumes a plausible arrangement relationship when two character strings are in the item name-item value relationship, and when two character strings are arranged in an arrangement relationship that deviates from the assumption, Generated to add a penalty. The penalty rule is not limited to 1101 to 1106 shown in FIG. 12, but is calculated from the layout relationship and size of the frames to which the two character strings belong, the layout relationship and size of the two character string rectangles, etc. Assuming a plausible arrangement relationship when the two character strings are in the item name-item value relationship, the calculation method added when the two character strings deviate from the assumption does not depend on this.

図１２のペナルティルールを用いて、図１５の帳票画像内の文字列ペアに対して配置尤度を計算する場合、配置尤度テーブルは図１３のようになる。例えば、項目名文字列番号「１」−項目値文字列番号「２」の文字列ペアの場合、図１２の１１０１から１１０６のペナルティルールのいずれにも該当しないため、配置尤度は初期値として定めた「1.00」となる。また、項目名文字列番号「１」−項目値文字列番号「３」の文字列ペアの場合、図１２のペナルティルール１１０１に該当するため、ペナルティルール１１０１にて計算されたペナルティが初期値から減算され、配置尤度は「0.90」となる。 When the placement likelihood is calculated for the character string pair in the form image of FIG. 15 using the penalty rule of FIG. 12, the placement likelihood table is as shown in FIG. For example, in the case of the character string pair of item name character string number “1” −item value character string number “2”, it does not correspond to any of the penalty rules 1101 to 1106 in FIG. It is set to “1.00”. Further, in the case of the character string pair of item name character string number “1” −item value character string number “3”, it corresponds to the penalty rule 1101 of FIG. 12, so the penalty calculated by the penalty rule 1101 is determined from the initial value. Subtraction is performed, and the placement likelihood becomes “0.90”.

次に、図１のステップＳ１７０の項目名−項目値関係評価値計算の処理フローについて、図１４を用いて説明する。項目名−項目値関係評価値は、例えば、項目名尤度、項目値尤度、配置尤度を入力とする評価関数によって計算する。LLを項目名尤度、VLを項目値尤度、ALを配置尤度としたときに、評価関数E(LL、 VL、 AL)の例として、例えば式（１）、式（２）、式（３）に示すものがある。 Next, the processing flow of the item name-item value relationship evaluation value calculation in step S170 of FIG. 1 will be described with reference to FIG. The item name-item value relationship evaluation value is calculated by, for example, an evaluation function that receives item name likelihood, item value likelihood, and placement likelihood. As an example of the evaluation function E (LL, VL, AL) where LL is the item name likelihood, VL is the item value likelihood, and AL is the placement likelihood, for example, Equation (1), Equation (2), Equation There is what is shown in (3).

E(LL、 VL、 AL) = (LL + VL) × AL ・・・（１）
E(LL、 VL、 AL) = LL + VL + AL ・・・（２）
E(LL、 VL、 AL) = √(LL × VL) × AL ・・・（３）
なお、評価関数は上記（１）（２）（３）に限るものではなく、項目名尤度、項目値尤度、配置尤度の値から、２つの文字列が項目名−項目値関係にある確からしさを算出できる形式であれば、これに限らない。 E (LL, VL, AL) = (LL + VL) x AL (1)
E (LL, VL, AL) = LL + VL + AL (2)
E (LL, VL, AL) = √ (LL × VL) × AL (3)
The evaluation function is not limited to the above (1), (2), and (3), and the two character strings have an item name-item value relationship based on the item name likelihood, the item value likelihood, and the placement likelihood value. Any format that can calculate certain certainty is not limited to this.

図７の項目名尤度テーブル、図１０の項目値尤度テーブル、図１３の配置尤度テーブルが得られた場合、上記式（１）により求めた図１５の帳票画像に対する項目名−項目値評価値テーブルは、図１４のようになる。項目名−項目値関係評価値は、図５の項目名辞書、図８の表記辞書において、事前に定義した属性ごとに計算し、項目名−項目値評価値テーブルに登録する。本実施例では、項目名尤度、項目値尤度、配置尤度のいずれかが「0.00」となる場合は、項目名−項目値評価値の計算は行わない。例えば、項目名文字列番号「１」、項目値文字列番号「２」の場合は、属性ＩＤ「００２」に対し、E(LL、 VL、 AL) = (0.25+0.85)×1.00=1.10、属性ＩＤ「００３」に対し、E(LL、 VL、 AL) = (0.28+0.20)×1.00=0.48となる。項目名文字列番号「１」、項目値文字列番号「３」の場合は、属性ＩＤ「００３」に対して、E(LL、 VL、 AL) = (0.28+0.92)×0.90=1.08となる。上記のような評価関数による項目名−項目値評価値計算を帳票画像内の全文字列ペアに対して行い、項目名−項目値評価値テーブルに登録する。図１５の帳票画像に対する項目名−項目値関係の候補は、項目名文字列番号「１」と項目値文字列番号「２」の「納付額」−「17,420」および、項目名文字列番号「１」と項目値文字列番号「３」の「納期限」−「21.11.13」となる。この場合、両候補において項目名文字列番号「１」が重複するため、評価値の高い文字列ペアが選択され（図１のステップ１８０）、最終的な項目名−項目値関係抽出結果は、項目名文字列番号「１」と項目値文字列番号「２」の「納付額」−「17,420」（属性ＩＤ：００２）となる。 When the item name likelihood table of FIG. 7, the item value likelihood table of FIG. 10, and the placement likelihood table of FIG. 13 are obtained, the item name-item value for the form image of FIG. 15 obtained by the above equation (1). The evaluation value table is as shown in FIG. The item name-item value relationship evaluation value is calculated for each predefined attribute in the item name dictionary of FIG. 5 and the notation dictionary of FIG. 8, and is registered in the item name-item value evaluation value table. In this embodiment, when any of the item name likelihood, the item value likelihood, and the placement likelihood is “0.00”, the item name-item value evaluation value is not calculated. For example, in the case of the item name character string number “1” and the item value character string number “2”, E (LL, VL, AL) = (0.25 + 0.85) × 1.00 = 1.10 for the attribute ID “002”. For the attribute ID “003”, E (LL, VL, AL) = (0.28 + 0.20) × 1.00 = 0.48. In the case of the item name character string number “1” and the item value character string number “3”, E (LL, VL, AL) = (0.28 + 0.92) × 0.90 = 1.08 for the attribute ID “003”. . The item name-item value evaluation value calculation by the evaluation function as described above is performed for all character string pairs in the form image and registered in the item name-item value evaluation value table. The item name-item value relationship candidates for the form image in FIG. 15 are “payment amount”-“17,420” of the item name character string number “1” and the item value character string number “2”, and the item name character string number “ “1” and item value character string number “3” are “delivery date” − “21.11.13”. In this case, since the item name character string number “1” is duplicated in both candidates, a character string pair having a high evaluation value is selected (step 180 in FIG. 1), and the final item name-item value relationship extraction result is: The item name character string number “1” and the item value character string number “2” are “payment amount” − “17,420” (attribute ID: 002).

以上述べた通り、本発明によれば、帳票画像内の文字列の項目名らしさの項目名尤度、項目値らしさの項目値尤度を全文字列に対して計算し、帳票画像内の文字列ペアの配置関係の項目名−項目値らしさの配置尤度を全文字列ペアに対して計算し、項目名尤度、項目値尤度、配置尤度に基づいて計算される項目名−項目値関係評価値によって、帳票画像内の項目名−項目値関係を抽出する帳票認識方式により、文字認識誤りに頑健に、項目名−項目値関係の配置関係の曖昧性のある非表形式レイアウト帳票を誤り少なく認識することができる。また、項目名尤度、項目値尤度、配置尤度をそれぞれ独立に計算するモジュール構成により、少ない定義で汎用性の高い帳票認識方式を提供することができる。 As described above, according to the present invention, the item name likelihood of the item name likelihood of the character string in the form image and the item value likelihood of the item value likelihood are calculated for all character strings, and the character in the form image is calculated. Item name-item that is calculated based on the item name likelihood, item value likelihood, and arrangement likelihood. A non-tabular layout form that is robust against character recognition errors and has an ambiguous layout relation of the item name-item value relation by the form recognition method that extracts the item name-item value relation in the form image by the value relation evaluation value Can be recognized with few errors. In addition, the module configuration for independently calculating the item name likelihood, the item value likelihood, and the placement likelihood can provide a highly versatile form recognition method with a small number of definitions.

１３１・・・文字認識辞書、１４１・・・項目名辞書、１５１・・・表記辞書、２００，２０１・・・帳票例、３００・・・帳票認識部、３０１・・・入力装置、３０２・・・画像入力装置、３０３・・・認識辞書、３０４・・・表示装置、４２０・・・文字列検出部、４３０・・・文字列認識部、４４０・・・項目名尤度計算部、４５０・・・項目値尤度計算部、４６０・・・配置尤度計算部、４７０・・・項目名−項目値関係評価値計算部、４８０・・・項目名−項目値関係決定部、８０１・・・文字列の表記ルールの正規表現、８０２・・・単語ネットワーク、８０３・・・項目値単語リスト、１１０１〜１１０６・・・ペナルティルール。 131 ... Character recognition dictionary, 141 ... Item name dictionary, 151 ... Notation dictionary, 200, 201 ... Form example, 300 ... Form recognition unit, 301 ... Input device, 302 ... Image input device 303 ... Recognition dictionary 304 ... Display device 420 ... Character string detection unit 430 ... Character string recognition unit 440 ... Item name likelihood calculation unit 450 Item value likelihood calculation unit, 460 ... placement likelihood calculation unit, 470 ... item name-item value relationship evaluation value calculation unit, 480 ... item name-item value relationship determination unit, 801 ... Regular expression of character string notation rule, 802... Word network, 803... Item value word list, 1101 to 1106.

Claims

A form recognition device that inputs a form image and performs recognition processing of a character string in the form image,
A character string detection unit for detecting a character string region from the form image;
A character string recognition unit for recognizing individual characters in the character string region;
An item name likelihood calculating unit that calculates an item name likelihood representing the probability that the character string is an item name for the character string in the form image;
An item value likelihood calculating unit that calculates an item value likelihood representing the probability that the character string is an item value for the character string in the form image;
An arrangement likelihood calculating unit for calculating an arrangement likelihood indicating whether the arrangement relation of the character string pair is valid as the item name-item value relation for the character string pair in the form image;
An item name-item value relation evaluation value calculation unit for calculating an evaluation value representing the likelihood as the item name-item value of the character string pair based on the item name likelihood, the item value likelihood, and the placement likelihood; ,
An item name-item value relationship determining unit that determines an association of an item name-item value relationship in a form image based on the evaluation value output from the item name-item value relationship evaluation value calculation unit, Form recognition device to do.

The form recognition device according to claim 1,
The placement likelihood calculation unit determines whether the item name character string and the item value character string of the character string pair are arranged in relation to the size or size of the character string rectangle, or the relationship between the character string rectangle and the item name-item value relationship of the size. A form recognizing device, wherein the placement likelihood is calculated based on a penalty rule which represents a rule.

The form recognition device according to claim 1,
The item name likelihood calculating unit calculates the item name likelihood for the character string by collating with an item name dictionary describing item name words,
The item value likelihood calculating unit calculates the item value likelihood for the character string by collating with a notation dictionary describing grammar notation rules for item value words and character strings. .

A form recognition method for inputting a form image and recognizing a character string in the form image,
A character string detection step for detecting a character string region from the form image;
A character string recognition step for recognizing individual characters in the character string region;
An item name likelihood calculating step for calculating an item name likelihood representing the probability that the character string is an item name for the character string in the form image;
An item value likelihood calculating step for calculating an item value likelihood representing a probability that the character string is an item value for the character string in the form image;
An arrangement likelihood calculating step for calculating an arrangement likelihood indicating whether the arrangement relation of the character string pair is valid as an item name-item value relation for the character string pair in the form image;
An item name-item value relationship evaluation value calculating step for calculating an evaluation value representing the likelihood as the item name-item value of the character string pair based on the item name likelihood, the item value likelihood, and the placement likelihood; and ,
The item name-item value relationship evaluation value calculating step includes an item name-item value relationship determining step for determining an association between the item name-item value relationship in the form image based on the evaluation value output from the item name-item value relationship evaluation value calculating step. Form recognition method.

The form recognition method according to claim 4,
In the placement likelihood calculation step, the invalidity of the layout relation and size of the frame of the item name character string and the data character string of the character string pair, or the relation of the character string rectangle and the item name-item value relation of the size. A form recognition method, wherein the placement likelihood is calculated based on a penalty rule which is a rule to be expressed.

The form recognition method according to claim 4,
The item name likelihood calculating step calculates the item name likelihood for the character string by matching with an item name dictionary describing item name words.
In the item value likelihood calculating step, the item value likelihood is calculated for the character string by collation with a notation dictionary describing rule of grammar of the item value word and character string.