JP2013105344A

JP2013105344A - Business form processing system and business form processing method

Info

Publication number: JP2013105344A
Application number: JP2011249165A
Authority: JP
Inventors: Juichi Takahashi; 寿一高橋; Hiroshi Shinjo; 広新庄; Junichi Hirayama; 淳一平山; Eisuke Asano; 英輔浅野
Original assignee: Hitachi Omron Terminal Solutions Corp
Current assignee: Hitachi Omron Terminal Solutions Corp
Priority date: 2011-11-14
Filing date: 2011-11-14
Publication date: 2013-05-30
Anticipated expiration: 2031-11-14
Also published as: JP5712415B2

Abstract

PROBLEM TO BE SOLVED: To improve the accuracy with which data in a specific item is extracted from a business form in which the specific item is entered.SOLUTION: A business form processing system stores therein frame structure dictionary data associating character strings corresponding to a plurality of extraction object frames with arrangement relations among the plurality of extraction object frames and includes: a frame extraction unit; a frame collation unit which refers to the frame structure dictionary data to calculate a collation result from association with an extraction object frame of an extracted frame extracted by the frame extraction unit; a character line extraction unit which extracts a character line existing in the extracted frame associated with the extraction object frame; and a score calculation unit which calculates a frame score indicative of similarity of the character line to a character string corresponding to the extraction object frame with respect to each of collation results calculated by the frame collation unit and calculates collation result scores being scores of respective collation results. On the basis of collation result scores calculated by the score calculation unit, one collation result is defined as the extraction and collation result.

Description

本発明は、帳票画像を認識するための帳票処理システムに関し、特に、帳票画像に含まれる枠に存在する文字行を抽出して、抽出した文字行を文字認識する帳票処理システムに関する。 The present invention relates to a form processing system for recognizing a form image, and more particularly to a form processing system that extracts a character line existing in a frame included in a form image and recognizes the extracted character line.

まず、以下の説明で用いる語句を定義する。「枠」は、文書及び帳票において縦罫線と横罫線とによって囲まれた矩形領域として定義する。「表」は、連結した枠の集合であるとして定義する。「枠構造」は、表に含まれる枠の配置構造として定義する。 First, terms used in the following description are defined. A “frame” is defined as a rectangular area surrounded by vertical ruled lines and horizontal ruled lines in a document and a form. A “table” is defined as a set of linked frames. “Frame structure” is defined as an arrangement structure of frames included in a table.

以下、ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）が帳票から特定の項目を抽出する技術について説明する。 Hereinafter, a technique in which an OCR (Optical Character Reader) extracts a specific item from a form will be described.

第１の従来技術として、帳票の表の構造が帳票認識装置に予め定義され、帳票認識装置が入力された帳票画像と予め定義された表の構造とを照合することによって、入力された帳票画像の枠の位置を自動的に検出する技術が知られている（例えば、特許文献１参照）。第１の従来技術では、表の構造が定型的な帳票において、帳票を入力する場合のひずみ、及び帳票の裁断誤差等に起因した帳票画像同士における枠の位置の相違を検出できる。また、第１の従来技術では、かすれ及びノイズが帳票画像に存在しても、かすれ及びノイズの影響を受けにくくできる。 As a first conventional technique, the structure of a form table is defined in advance in the form recognition apparatus, and the form image inputted by the form recognition apparatus is collated with the predefined structure of the table. A technique for automatically detecting the position of the frame is known (see, for example, Patent Document 1). In the first prior art, in a form with a regular table structure, it is possible to detect a difference in frame positions between form images due to distortion when a form is input, and a cutting error of the form. In the first prior art, even if blur and noise are present in the form image, it can be hardly affected by the blur and noise.

第２の従来技術として、帳票の枠同士の配置関係を帳票書式情報として装置に予め記憶する技術が知られている（例えば、非特許文献１参照）。具体的には、当該装置は、帳票全体に対する枠同士の配置関係を予め記憶しておき、入力された帳票画像と帳票書式情報とを照合することによって、枠の位置及び枠の大きさが異なる帳票からも枠の位置を検出できる。 As a second conventional technique, a technique is known in which an arrangement relationship between form frames is stored in advance in the apparatus as form format information (see, for example, Non-Patent Document 1). Specifically, the apparatus stores in advance the layout relationship between frames with respect to the entire form, and the position of the frame and the size of the frame differ by collating the input form image with the form format information. The position of the frame can also be detected from the form.

第３の従来技術として、帳票の部分領域の書式情報を予め記憶し、帳票上の表を部分領域に分割し、部分領域ごとに書式情報と照合し、書式情報のうち最も照合率が高いものを照合結果とする技術が知られている（例えば、特許文献２参照）。第３の従来技術では、枠の位置及び大きさ等が異なる帳票、並びに枠の配置が異なる帳票においても、最も照合率が高い書式情報から枠を特定できる。 As a third conventional technique, the format information of a partial area of a form is stored in advance, the table on the form is divided into partial areas, and the partial information is collated with the format information. Among the format information, the highest collation rate Is known as a verification result (see, for example, Patent Document 2). In the third conventional technique, a frame can be specified from format information having the highest collation rate even in a form having a different frame position and size and a form having a different frame arrangement.

第４の従来技術として、互いに隣接する枠候補及び離れている枠候補の組み合わせによって対象となる表を構成し出力する技術が知られている（例えば、特許文献３参照）。第４の従来技術では、枠の隣接関係及び位置関係に基づいて表を構成する最適な組み合わせを評価し、表を作成できる。 As a fourth conventional technique, a technique is known in which a target table is configured and output by a combination of adjacent frame candidates and distant frame candidates (see, for example, Patent Document 3). In the fourth prior art, it is possible to create a table by evaluating an optimal combination constituting the table based on the adjacent relationship and the positional relationship of the frames.

また、文字列照合を用いて入力された帳票画像から所望の項目を抽出する第５の技術として、項目名及びデータを予め文字列照合して、項目名候補の枠及びデータ候補の枠の配置関係並びに隣接関係等を評価することによって、各枠を抽出する方式がある（例えば、非特許文献２参照）。第５の技術では、帳票画像から抽出された文字行に対して項目名及びデータをそれぞれ予め設定された文字列辞書と照合し、項目名及びデータの仮説を生成し、すべての文字行に対する項目名とデータの仮説から枠の配置関係及び隣接関係等を評価し、項目名とデータとのセットを検出し、最も評価の高い組み合わせを項目として特定できる。 Further, as a fifth technique for extracting a desired item from a form image input using character string collation, item name candidates and character candidate frames are arranged by previously collating item names and data with character strings. There is a method of extracting each frame by evaluating the relationship and the adjacent relationship (for example, see Non-Patent Document 2). In the fifth technique, item names and data are collated with a character string dictionary set in advance for character lines extracted from a form image to generate item names and data hypotheses, and items for all character lines. From the name and data hypotheses, the frame layout and adjacency relationships are evaluated, the set of item names and data is detected, and the combination with the highest evaluation can be identified as the item.

特開平１１−５３４６６号公報Japanese Patent Laid-Open No. 11-53466 特開２００４−１３９４８４号公報JP 2004-139484 A 特開２００７−２１３２５５号公報JP 2007-213255 A

駱琴、渡辺豊英、杉江昇、「多種帳票文書の構造認識」電子情報通信学会論文誌、１９９３年、Ｖｏｌ．Ｊ７６−Ｄ−ＩＩ、Ｎｏ．１０、ｐｐ．２１６５−２１７６Lyre, Toyohide Watanabe, Noboru Sugie, “Structural Recognition of Various Document Documents”, IEICE Transactions, 1993, Vol. J76-D-II, no. 10, pp. 2165-2176 平山淳一、新庄広、高橋寿一、永崎健、「仮説検証型アプローチを用いた定義レス帳票認識技術」、ＦＩＴ２０１０、Ｈ−０１２Junichi Hirayama, Hiroshi Shinjo, Juichi Takahashi, Ken Nagasaki, “Definition-less form recognition technology using hypothesis verification approach”, FIT2010, H-012

一般に、帳票の一部に特定の項目が順番に記入されている。このような帳票では、項目名の枠とデータの枠とが横並びで項目順に右に並んでいる場合、及び、項目名の枠とデータの枠とが縦並びで項目順に右に並んでいる場合等がある。 In general, specific items are sequentially entered in a part of a form. In such a form, when the item name frame and the data frame are arranged side by side in the item order, and when the item name frame and the data frame are arranged in the vertical direction and arranged in the item order to the right Etc.

これらの枠の並びは単純な形であるため、第１〜第４の従来技術では、ノイズによる罫線が混入した場合、及び文字が枠線と接触した場合には、誤った罫線が抽出され、結果として項目が誤って抽出される可能性が高かった。 Since the arrangement of these frames is a simple shape, in the first to fourth prior arts, when ruled lines due to noise are mixed, and when characters come into contact with the frame lines, incorrect ruled lines are extracted, As a result, it was highly likely that items were extracted by mistake.

一方で、第５の従来技術では、項目名及びデータのそれぞれの文字列照合において、特に項目名の照合結果が不読である場合、特定の項目を抽出できないか、誤った項目を抽出してしまうという問題があった。また、項目名及びデータのそれぞれに仮説を生成し、組み合わせを評価することは、処理速度が遅いという問題もあった。 On the other hand, in the fifth prior art, in the item name and data character string matching, especially when the matching result of the item name is unread, a specific item cannot be extracted or an incorrect item is extracted. There was a problem that. In addition, generating a hypothesis for each item name and data and evaluating the combination has a problem in that the processing speed is slow.

本発明の目的は、特定の項目が記入される帳票に対して、特定の項目のデータを抽出する精度を高めることである。 An object of the present invention is to improve the accuracy of extracting data of a specific item for a form in which the specific item is filled.

本発明の代表的な一例を示せば、帳票画像に含まれ、線分に囲まれた領域である枠に存在する文字行を抽出して、抽出した文字行を文字認識する帳票処理システムにおいて、前記帳票処理システムは、前記文字行を抽出する対象となる複数の抽出対象枠に対応する文字列と、前記複数の抽出対象枠の配置関係と、を対応付けた枠構造辞書データを記憶し、前記帳票画像から枠を抽出枠として抽出する枠抽出部と、前記枠構造辞書データを参照し、前記枠抽出部によって抽出された前記抽出枠を前記抽出対象枠に対応付けた照合結果を算出する枠照合部と、前記枠照合部によって算出された照合結果に対して、前記抽出対象枠に対応付けられた抽出枠に存在する文字行を抽出する文字行抽出部と、前記枠照合部によって算出された各照合結果に対して、前記文字行抽出部によって抽出された文字行が抽出された抽出枠に対応する前記抽出対象枠の文字列に対する当該文字行の類似度を示す枠スコアを算出し、各照合結果に含まれる前記抽出対象枠に対応付けられた枠の枠スコアに基づいて、各照合結果のスコアである照合結果スコアを算出するスコア算出部と、前記スコア算出部によって算出された照合結果スコアに基づいて、前記枠照合部によって算出された照合結果のうち、一の照合結果を抽出照合結果とすることを特徴とする。 If a typical example of the present invention is shown, in a form processing system that extracts a character line that is included in a form image and exists in a frame that is an area surrounded by line segments, and that recognizes the extracted character line, The form processing system stores frame structure dictionary data in which a character string corresponding to a plurality of extraction target frames from which the character line is to be extracted and an arrangement relationship of the plurality of extraction target frames are associated with each other. A frame extraction unit that extracts a frame from the form image as an extraction frame, and the frame structure dictionary data are referred to, and a collation result in which the extraction frame extracted by the frame extraction unit is associated with the extraction target frame is calculated. A frame matching unit, a character line extracting unit that extracts a character line existing in an extraction frame associated with the extraction target frame, and a frame matching unit that are calculated with respect to the matching result calculated by the frame matching unit. For each matching result Then, a frame score indicating the similarity of the character line to the character string of the extraction target frame corresponding to the extraction frame from which the character line extracted by the character line extraction unit is extracted is included in each matching result A score calculation unit that calculates a matching result score that is a score of each matching result based on a frame score of the frame that is associated with the extraction target frame, and a matching result score that is calculated by the score calculating unit One of the matching results calculated by the frame matching unit is used as an extracted matching result.

本発明によれば、特定の項目が記入される帳票に対して、特定の項目のデータを抽出する精度を高めることができる。 ADVANTAGE OF THE INVENTION According to this invention, the precision which extracts the data of a specific item with respect to the form in which a specific item is filled can be improved.

本発明の実施形態の帳票処理システムの構成図である。It is a block diagram of the form processing system of embodiment of this invention. 本発明の実施形態の帳票処理のＰＡＤ図である。It is a PAD figure of the form process of embodiment of this invention. 本発明の実施形態の処理対象となる帳票の説明図である。It is explanatory drawing of the form used as the process target of embodiment of this invention. 本発明の実施形態の各格子点における罫線の交差形状に応じて付加する符号（交点符号）の説明図である。It is explanatory drawing of the code | symbol (intersection point code | symbol) added according to the intersection shape of the ruled line in each lattice point of embodiment of this invention. 本発明の実施形態の入力帳票を格子点情報に変換した場合の説明図である。It is explanatory drawing at the time of converting the input form of embodiment of this invention into lattice point information. 本発明の実施形態の枠構造辞書の説明図である。It is explanatory drawing of the frame structure dictionary of embodiment of this invention. 本発明の実施形態の第１座標の説明図である。It is explanatory drawing of the 1st coordinate of embodiment of this invention. 本発明の実施形態の第２座標の説明図である。It is explanatory drawing of the 2nd coordinate of embodiment of this invention. 本発明の実施形態の枠構造照合処理のＰＡＤ図である。It is a PAD figure of the frame structure collation process of embodiment of this invention. 本発明の実施形態の照合処理のＰＡＤ図である。It is a PAD figure of the collation process of embodiment of this invention. 本発明の実施形態の格子点ＤＰマッチングの組み合わせ結果の例の説明図である。It is explanatory drawing of the example of the combination result of the lattice point DP matching of embodiment of this invention. 本発明の実施形態の文字行抽出処理のＰＡＤ図である。It is a PAD figure of the character line extraction process of embodiment of this invention. 本発明の実施形態の文字行抽出処理によって抽出された文字行の一例の説明図である。It is explanatory drawing of an example of the character line extracted by the character line extraction process of embodiment of this invention. 本発明の実施形態の取得対象となる領域の説明図である。It is explanatory drawing of the area | region used as the acquisition object of embodiment of this invention. 本発明の実施形態の接触文字を含む入力帳票を格子点情報に変換した場合の説明図である。It is explanatory drawing at the time of converting the input form containing the contact character of embodiment of this invention into lattice point information. 本発明の実施形態の格子点ＤＰマッチングの組み合わせ結果の例の説明図である。It is explanatory drawing of the example of the combination result of the lattice point DP matching of embodiment of this invention. 本発明の実施形態の格子点ＤＰマッチングの組み合わせ結果の例の説明図である。It is explanatory drawing of the example of the combination result of the lattice point DP matching of embodiment of this invention. 本発明の実施形態の格子点ＤＰマッチングの組み合わせ結果の例の説明図である。It is explanatory drawing of the example of the combination result of the lattice point DP matching of embodiment of this invention. 本発明の実施形態の文字行抽出処理によって抽出された文字行の一例の説明図である。It is explanatory drawing of an example of the character line extracted by the character line extraction process of embodiment of this invention. 本発明の実施形態の候補文字ネットワークの説明図である。It is explanatory drawing of the candidate character network of embodiment of this invention. 本発明の実施形態の特定項目文字列照合処理のＰＡＤ図である。It is a PAD figure of the specific item character string collation process of embodiment of this invention. 本発明の実施形態の複数行の文字行の候補文字ネットワークを連結する処理の説明図である。It is explanatory drawing of the process which connects the candidate character network of the character line of the multiple lines of embodiment of this invention. 本発明の実施形態の候補文字ネットワークと枠構造辞書の文字列情報とを照合する処理の説明図である。It is explanatory drawing of the process which collates the candidate character network of embodiment of this invention, and the character string information of a frame structure dictionary. 本発明の実施形態の特定項目文字列照合結果の説明図である。It is explanatory drawing of the specific item character string collation result of embodiment of this invention. 本発明の実施形態の特定項目文字列照合の説明図である。It is explanatory drawing of the specific item character string collation of embodiment of this invention.

本発明の実施形態を図１〜図２１を用いて説明する。なお、本発明は、以下の記述に限定されるものではなく、本発明の要旨を逸脱しない範囲において適宜変更可能である。 An embodiment of the present invention will be described with reference to FIGS. In addition, this invention is not limited to the following description, In the range which does not deviate from the summary of this invention, it can change suitably.

図１は、本発明の実施形態の帳票処理システムの構成図である。 FIG. 1 is a configuration diagram of a form processing system according to an embodiment of the present invention.

帳票処理システムは、帳票処理用計算機１００によって構成される。 The form processing system includes a form processing computer 100.

帳票処理用計算機１００は、画像入力部１０３、入力部１０５、出力部１０６、通信部１０７、制御部１０８、補助記憶部１０９、記憶部１１０、及び外部補助記憶部１１１を備える。これらは、内部バス１１２を介して互いに接続される。 The form processing computer 100 includes an image input unit 103, an input unit 105, an output unit 106, a communication unit 107, a control unit 108, an auxiliary storage unit 109, a storage unit 110, and an external auxiliary storage unit 111. These are connected to each other via an internal bus 112.

画像入力部１０３には、スキャナ１０２によって読み取られた帳票画像が入力される。画像入力部１０３には、スキャナ１０２を介さずネットワーク等を介して帳票画像が電子データ１０４として入力されてもよい。 A form image read by the scanner 102 is input to the image input unit 103. A form image may be input to the image input unit 103 as the electronic data 104 not via the scanner 102 but via a network or the like.

入力部１０５はユーザによる入力を受け付ける。例えば、入力部１０５はキーボード及びマウス等である。出力部１０６は帳票処理の結果を出力する。例えば、出力部１０６はディスプレイ及びプリンタ等である。通信部１０７は、外部ネットワーク１１３に接続されるインタフェースである。帳票処理の結果は、外部ネットワーク１１３に接続された外部サーバ１１４に出力されてもよい。 The input unit 105 receives input from the user. For example, the input unit 105 is a keyboard and a mouse. The output unit 106 outputs the result of the form processing. For example, the output unit 106 is a display, a printer, or the like. The communication unit 107 is an interface connected to the external network 113. The result of the form processing may be output to the external server 114 connected to the external network 113.

制御部１０８は帳票処理用計算機１００の制御に関する各種処理を実行し、例えば、ＣＰＵ等である。 The control unit 108 executes various processes related to the control of the form processing computer 100, and is, for example, a CPU.

補助記憶部１０９は、帳票処理用計算機１００の内部に備わる記憶部１１０以外の記憶部であり、例えばＨＤＤである。記憶部１１０は、制御部１０８が直接アクセスできる記憶部であり、例えばメモリである。外部補助記憶部１１１は、補助記憶部１０９の一種であり、帳票処理用計算機１００の外部に備わる記憶部である。例えば、外部補助記憶部１１１は、ＣＤ−Ｒ、及びＤＶＤ−Ｒ等である。 The auxiliary storage unit 109 is a storage unit other than the storage unit 110 provided in the form processing computer 100, and is an HDD, for example. The storage unit 110 is a storage unit that can be directly accessed by the control unit 108, and is, for example, a memory. The external auxiliary storage unit 111 is a type of the auxiliary storage unit 109 and is a storage unit provided outside the form processing computer 100. For example, the external auxiliary storage unit 111 is a CD-R, a DVD-R, or the like.

帳票処理に関するプログラム（帳票処理プログラム）を含む各種プログラムは、補助記憶部１０９又は外部補助記憶部１１１に記憶され、制御部１０８が各種プログラムに実行する場合に記憶部１１０にロードされる。制御部１０８は、記憶部１１０にロードされたプログラムを実行する。 Various programs including programs related to form processing (form processing programs) are stored in the auxiliary storage unit 109 or the external auxiliary storage unit 111, and are loaded into the storage unit 110 when the control unit 108 executes various programs. The control unit 108 executes a program loaded in the storage unit 110.

また、制御部１０８は、画像入力部１０３に入力された帳票画像を内部バス１１２を介して記憶部１１０、補助記憶部１０９、及び外部補助記憶部１１１等に記憶する。 In addition, the control unit 108 stores the form image input to the image input unit 103 in the storage unit 110, the auxiliary storage unit 109, the external auxiliary storage unit 111, and the like via the internal bus 112.

なお、帳票処理用計算機１００は、画像入力部１０３、制御部１０８、及び記憶部１１０を少なくとも備えていればよく、他の部は備えなくてもよい。 The form processing computer 100 only needs to include at least the image input unit 103, the control unit 108, and the storage unit 110, and may not include other units.

次に、帳票処理の概要について図２を用いて説明する。 Next, an outline of the form processing will be described with reference to FIG.

図２は、本発明の実施形態の帳票処理のＰＡＤ図である。 FIG. 2 is a PAD diagram of the form processing according to the embodiment of the present invention.

制御部１０８が帳票処理プログラムを実行することによって、図２に示す帳票処理が実行される。 When the control unit 108 executes the form processing program, the form process shown in FIG. 2 is executed.

帳票処理とは、帳票処理用計算機１００に入力された帳票画像から取得対象となる項目名と、当該項目名に対応するデータとを取得する処理である。 The form process is a process for acquiring the item name to be acquired and the data corresponding to the item name from the form image input to the form processing computer 100.

制御部１０８は、画像入力部１０３に帳票画像が入力されたことを検出する（２０１）。 The control unit 108 detects that a form image is input to the image input unit 103 (201).

次に、制御部１０８は、ステップ２０１の処理で入力が検出された帳票画像から枠を抽出する（２０２）。ステップ２０２の処理は、図４及び図５で詳細を説明する。なお、ステップ２０２の処理は、帳票処理用計算機１００における枠抽出部として機能する。 Next, the control unit 108 extracts a frame from the form image whose input is detected in the process of step 201 (202). Details of the processing in step 202 will be described with reference to FIGS. Note that the processing in step 202 functions as a frame extraction unit in the form processing computer 100.

次に、制御部１０８は、ステップ２０１の処理で入力が検出された帳票画像の種類が取得対象となる帳票の種類であるか否かを判定する（２０３）。ステップ２０３の処理の具体的な一例を示せば、制御部１０８は、取得対象となる帳票の種類を示すマークが入力された帳票画像に存在するか否かを判定する。 Next, the control unit 108 determines whether or not the type of form image whose input has been detected in the process of step 201 is the type of form to be acquired (203). If a specific example of the process of step 203 is shown, the control part 108 will determine whether the mark which shows the kind of form used as acquisition object exists in the input form image.

なお、図２では、ステップ２０３の処理がステップ２０２の処理の後に実行されることを示したが、ステップ２０３の処理は、ステップ２０２の処理の前、すなわち、ステップ２０１の処理とステップ２０２の処理との間に実行されてもよい。 2 shows that the process of step 203 is executed after the process of step 202, the process of step 203 is performed before the process of step 202, that is, the process of step 201 and the process of step 202. May be executed between.

ステップ２０３の処理で、入力された帳票画像が取得対象の帳票の種類であると判定された場合、制御部１０８は、ステップ２０２の処理で抽出された枠と、枠構造辞書６００（図６Ａ〜図６Ｃ参照）とを照合する枠構造照合処理を実行する（２０４）。枠構造処理は、ステップ２０２の処理で抽出された枠を、枠構造辞書６００に登録された枠に対応付けた照合結果を算出する処理である。なお、ステップ２０４の処理は、帳票処理用計算機１００における枠照合部として機能する。 When it is determined in the process of step 203 that the input form image is the type of form to be acquired, the control unit 108 uses the frame extracted in the process of step 202 and the frame structure dictionary 600 (FIG. 6A to FIG. 6). (See FIG. 6C) A frame structure matching process is executed (204). The frame structure process is a process of calculating a collation result in which the frame extracted in the process of step 202 is associated with the frame registered in the frame structure dictionary 600. Note that the processing in step 204 functions as a frame matching unit in the form processing computer 100.

ステップ２０２の処理で抽出された枠の数が枠構造辞書に登録された枠の数より少ない場合、照合結果が算出されない。なお、枠構造処理の詳細は図７及び図８で説明する。 When the number of frames extracted in the process of step 202 is smaller than the number of frames registered in the frame structure dictionary, the collation result is not calculated. Details of the frame structure processing will be described with reference to FIGS.

次に、制御部１０８は、ステップ２０４の処理で照合結果が算出されたか否かを判定する（２０５）。 Next, the control unit 108 determines whether or not a collation result has been calculated in the process of step 204 (205).

ステップ２０４の処理で照合結果が算出されたとステップ２０５の処理で判定された場合、制御部１０８は、照合結果において、枠構造辞書６００に登録された枠に対応付けられた枠に存在する文字行を抽出する（２０６）。文字行とは、枠内に存在する文字の集合体である。なお、ステップ２０６の処理は、図１０で詳細を説明する。ステップ２０６の処理は、帳票処理用計算機１００における文字行抽出部として機能する。 If it is determined in step 205 that the collation result has been calculated in step 204, the control unit 108 in the collation result indicates a character line existing in the frame associated with the frame registered in the frame structure dictionary 600. Is extracted (206). A character line is a collection of characters existing in a frame. Details of the processing in step 206 will be described with reference to FIG. The processing in step 206 functions as a character line extraction unit in the form processing computer 100.

次に、制御部１０８は、ステップ２０６の処理で抽出された文字行から文字らしい大きさの図形を文字パタン候補として切り出す（２０７）。ステップ２０７の処理では、例えば、漢字の偏を構成する図形と旁を構成する図形とを分離してもよいし、つぶれによる線同士の接触及びかすれによる線の分離を考慮して図形を切り出してもよい。ステップ２０７の処理における図形の切り出し方法は一意に定めず、複数の切り出し方法によって文字パタン候補を切り出す。なお、制御部１０８は、ステップ２０７の処理で切り出された文字パタン候補を候補文字ネットワーク形式（図１６参照）で記憶部１１０に記憶する。 Next, the control unit 108 cuts out a character-like figure from the character line extracted in the process of step 206 as a character pattern candidate (207). In the process of step 207, for example, the figure constituting the kanji bias and the figure constituting the heel may be separated, or the figure is cut out in consideration of contact between lines due to crushing and line separation due to blurring. Also good. The figure cutout method in the processing of step 207 is not uniquely determined, and character pattern candidates are cut out by a plurality of cutout methods. Note that the control unit 108 stores the character pattern candidates extracted in the process of step 207 in the storage unit 110 in a candidate character network format (see FIG. 16).

次に、制御部１０８は、記憶部１１０に記憶され、複数の文字が登録された図示しない文字データを参照し、ステップ２０７の処理で切り出された文字パタン候補と類似する文字と、文字パタン候補の各文字に対する類似度とを算出する文字識別処理を実行する（２０８）。文字識別処理は図１９で詳細を説明する。 Next, the control unit 108 refers to character data (not shown) stored in the storage unit 110 and registered with a plurality of characters, and the character similar to the character pattern candidate extracted in the process of step 207 and the character pattern candidate Character identification processing for calculating the similarity to each character is executed (208). Details of the character identification process will be described with reference to FIG.

次に、制御部１０８は、ステップ２０８の処理で文字識別処理を実行した枠に対応付けられた枠構造辞書６００に登録された枠の文字列情報と、ステップ２０８の処理で文字識別処理が実行された結果とを照合する特定項目文字列照合処理を実行する（２０９）。特定項目文字列照合処理では、制御部１０８は、ステップ２０８の処理で文字識別処理が実行された結果の識別候補から、当該文字識別処理が実行された枠に対応する枠構造辞書６００に登録された枠の文字列情報を構成する文字と一致する識別候補を選択する。そして、制御部１０８は、抽出された文字行の選択した識別候補の類似度に基づいて、当該文字行を抽出した枠の枠スコアを算出する。そして、制御部１０８は、算出した枠スコアに基づいて照合結果の枠構造辞書６００に対する類似度を示す照合結果スコアを算出し、算出した照合結果スコアが最大の照合結果を選択する。 Next, the control unit 108 executes character identification processing in step 208 and the character string information of the frame registered in the frame structure dictionary 600 associated with the frame for which character identification processing has been executed in step 208. A specific item character string matching process for matching the result is performed (209). In the specific item character string matching process, the control unit 108 is registered in the frame structure dictionary 600 corresponding to the frame in which the character identification process is executed, from the identification candidates obtained as a result of the character identification process executed in the process of step 208. A candidate for identification that matches the characters constituting the character string information of the frame is selected. Then, the control unit 108 calculates the frame score of the frame from which the character line is extracted based on the similarity of the selected identification candidate of the extracted character line. Then, the control unit 108 calculates a matching result score indicating the similarity of the matching result to the frame structure dictionary 600 based on the calculated frame score, and selects a matching result having the maximum calculated matching result score.

なお、特定項目文字列照合処理は、図１７で詳細を説明する。 The specific item character string matching process will be described in detail with reference to FIG.

照合結果スコアを算出するために実行されるステップ２０７〜２０９の処理は、帳票処理用計算機１００におけるスコア算出部として機能する。 The processing in steps 207 to 209 executed to calculate the matching result score functions as a score calculation unit in the form processing computer 100.

次に、制御部１０８は、ステップ２０９の処理で選択された照合結果の照合結果スコアが閾値以上であるか否かを判定する（２１０）。 Next, the control unit 108 determines whether or not the collation result score of the collation result selected in step 209 is equal to or greater than a threshold (210).

ステップ２０９の処理で選択された照合結果の照合結果スコアが閾値以上であると、ステップ２１０の処理で判定された場合、制御部１０８は、当該照合結果のステップ２０９の処理で選択された文字から構成される文字列を出力し、帳票処理を終了する。 When it is determined in the process of step 210 that the collation result score of the collation result selected in the process of step 209 is greater than or equal to the threshold value, the control unit 108 starts from the character selected in the process of step 209 of the collation result. The composed character string is output, and the form processing ends.

入力された帳票画像が取得対象の帳票の種類でないとステップ２０３の処理で判定された場合、ステップ２０４の処理で照合結果が算出されていないとステップ２０５の処理で判定された場合、ステップ２０９の処理で選択された照合結果の照合結果スコアが閾値より小さいとステップ２１０の処理で判定された場合、制御部１０８は、入力された帳票画像の全領域又は指定された領域から枠構造辞書６００に登録された項目名と一致する文字列を抽出し、当該抽出した項目名に対応するデータを抽出する（２１３）。 If it is determined in step 203 that the input form image is not the type of form to be acquired, or if it is determined in step 205 that the collation result has not been calculated in step 204, When it is determined in step 210 that the collation result score of the collation result selected in the process is smaller than the threshold, the control unit 108 transfers the entire form image or the designated area of the input form image to the frame structure dictionary 600. A character string that matches the registered item name is extracted, and data corresponding to the extracted item name is extracted (213).

そして、制御部１０８は、ステップ２１３の処理で抽出した項目名及びデータを出力し（２１３）、帳票処理を終了する。 Then, the control unit 108 outputs the item name and data extracted in step 213 (213), and ends the form processing.

本実施形態によれば、ステップ２０４の処理で抽出されたすべての照合結果に対して、枠構造辞書６００に登録された文字列情報を用いて照合結果スコアを算出し、照合結果スコアが最大の照合結果を選択する。 According to the present embodiment, for all the matching results extracted in the process of step 204, the matching result score is calculated using the character string information registered in the frame structure dictionary 600, and the matching result score is the largest. Select the verification result.

これによって、罫線が少ない帳票から、取得対象の項目名及びデータを正確に取得できるようになる。 As a result, the item name and data to be acquired can be accurately acquired from a form with few ruled lines.

図３は、本発明の実施形態の処理対象となる帳票３０１の説明図である。 FIG. 3 is an explanatory diagram of a form 301 to be processed according to the embodiment of this invention.

図３に示す帳票３０１の点線で囲まれた領域３０２に存在する表に、取得対象となる枠３０３が存在する。 A frame 303 to be acquired exists in a table existing in a region 302 surrounded by a dotted line of the form 301 shown in FIG.

取得対象となる枠３０３とは、図６に示す枠構造辞書で項目名６００と当該項目名に対応するデータとが登録された枠である。 The frame 303 to be acquired is a frame in which the item name 600 and data corresponding to the item name are registered in the frame structure dictionary shown in FIG.

本実施形態では、項目名が「収納機関番号」、「お客様番号」、及び「確認番号」である枠、並びこれらの項目名に対応するデータの枠が取得対象となる枠である。 In the present embodiment, the frames whose item names are “storage organization number”, “customer number”, and “confirmation number”, and the data frames corresponding to these item names are the frames to be acquired.

図４は、本発明の実施形態の各格子点における罫線の交差形状に応じて付加する符号（交点符号）の説明図である。 FIG. 4 is an explanatory diagram of codes (intersection point codes) that are added according to the intersection shape of ruled lines at each lattice point according to the embodiment of the present invention.

交点符号０は、罫線がないことを表す。交点符号１から４は、罫線の端点を表す。交点符号５と６は、罫線の一部分であることを表す。交点符号７から１０は、２本の罫線がＬ字型に交差した交点を表す。交点符号１１から１４は、２本の罫線がＴ字型に交差した交点を表す。交点符号１５は、２本の罫線が十字型に交差した交点を表す。 The intersection point code 0 indicates that there is no ruled line. The intersection codes 1 to 4 represent the end points of the ruled lines. Intersection points 5 and 6 indicate a part of a ruled line. Intersection codes 7 to 10 represent intersections where two ruled lines intersect in an L shape. Intersection codes 11 to 14 represent intersections where two ruled lines intersect in a T shape. The intersection code 15 represents an intersection where two ruled lines intersect in a cross shape.

図５は、本発明の実施形態の入力帳票を格子点情報に変換した場合の説明図である。 FIG. 5 is an explanatory diagram when the input form according to the embodiment of the present invention is converted into grid point information.

直交する罫線の交点座標は、該当する格子点の座標値から獲得することができる。平行する２本の縦罫線間の距離は、罫線が存在する格子点の列間の距離から算出できる。 The intersection coordinates of the orthogonal ruled lines can be obtained from the coordinate values of the corresponding grid points. The distance between two parallel vertical ruled lines can be calculated from the distance between columns of grid points where ruled lines exist.

帳票上の枠は、枠の四隅に相当する格子点（格子点符号７〜１５）の組合せにより表現することができる。このため、ステップ２０２の処理では、帳票画像が変換された格子点情報から格子点符号７〜１５のいずれかによって囲まれる矩形を枠として抽出する。 The frame on the form can be expressed by a combination of lattice points (grid point codes 7 to 15) corresponding to the four corners of the frame. For this reason, in the process of step 202, a rectangle surrounded by any of the grid point codes 7 to 15 is extracted as a frame from the grid point information obtained by converting the form image.

なお，格子点情報を作成するための実線の抽出方式の例としては特開平１１−２３２３８２号公報に，点線の抽出方式の例としては特開平０９−３１９８２４号公報に開示されている。 An example of a solid line extraction method for creating lattice point information is disclosed in Japanese Patent Laid-Open No. 11-232382, and an example of a dotted line extraction method is disclosed in Japanese Patent Laid-Open No. 09-319824.

次に、枠構造辞書について、図６Ａ〜図６Ｃを用いて説明する。 Next, the frame structure dictionary will be described with reference to FIGS. 6A to 6C.

図６Ａは、本発明の実施形態の枠構造辞書６００の説明図である。 FIG. 6A is an explanatory diagram of the frame structure dictionary 600 according to the embodiment of this invention.

枠構造辞書６００には、取得対象となる枠に関するデータが登録される。 In the frame structure dictionary 600, data related to a frame to be acquired is registered.

具体的には、枠構造辞書６００は、項目ＩＤ６０１、項目６０２、属性６０３、文字列定義６０４、第１座標６０５、及び第２座標６０６を含む。 Specifically, the frame structure dictionary 600 includes an item ID 601, an item 602, an attribute 603, a character string definition 604, a first coordinate 605, and a second coordinate 606.

項目ＩＤ６０１には、取得対象となる各枠の一意な識別子が登録される。項目６０２には、取得対象となる枠の名称が登録される。属性６０３には、取得対象となる枠が項目であるかデータ枠であるかを特定するためのデータが登録される。 In the item ID 601, a unique identifier of each frame to be acquired is registered. In the item 602, the name of the frame to be acquired is registered. In the attribute 603, data for specifying whether the frame to be acquired is an item or a data frame is registered.

文字列定義６０４には、取得対象となる枠に存在する文字列に関する情報が登録される。第１座標６０５には、第１座標（図６Ｂ参照）に存在する枠の四隅の座標が登録される。第２座標６０６には、第２座標（図６Ｃ参照）に存在する枠の四隅の座標が登録される。第１座標６０５及び第２座標６０６に登録された座標を、取得対象となる枠の配置関係という。 In the character string definition 604, information on a character string existing in a frame to be acquired is registered. In the first coordinate 605, the coordinates of the four corners of the frame existing at the first coordinate (see FIG. 6B) are registered. In the second coordinates 606, coordinates of the four corners of the frame existing at the second coordinates (see FIG. 6C) are registered. The coordinates registered in the first coordinate 605 and the second coordinate 606 are referred to as an arrangement relationship of frames to be acquired.

換言すれば、第１座標６０５及び第２座標６０６は、それぞれ、取得対象の項目の枠構造を示す。本実施形態では、二つの枠構造が枠構造辞書６００に登録されているが、枠構造辞書６００に登録される枠構造の数はこれに限られない。 In other words, the first coordinate 605 and the second coordinate 606 each indicate the frame structure of the item to be acquired. In the present embodiment, two frame structures are registered in the frame structure dictionary 600, but the number of frame structures registered in the frame structure dictionary 600 is not limited to this.

さらに、第１座標６０５及び第２座標６０６には、図６Ａでは図示しないが、図６Ｂに示す格子点情報６１２及び図６Ｃに示す格子点情報６２２が登録される。 Further, although not shown in FIG. 6A, the grid point information 612 shown in FIG. 6B and the grid point information 622 shown in FIG. 6C are registered in the first coordinate 605 and the second coordinate 606.

図６Ｂは、本発明の実施形態の第１座標６０５の説明図である。 FIG. 6B is an explanatory diagram of the first coordinates 605 according to the embodiment of this invention.

第１座標６０５は、項目名の枠と当該項目名に対応するデータの枠とが左右方向に配列される場合の座標である。 The first coordinates 605 are coordinates when an item name frame and a data frame corresponding to the item name are arranged in the horizontal direction.

図６Ｂの帳票６１１では、項目６０２に登録された「収納機関番号」、「お客様番号」、及び「確認番号」の項目名枠及びデータ枠が左から順に並んでいる。 In the form 611 in FIG. 6B, the item name frames and data frames of “storage organization number”, “customer number”, and “confirmation number” registered in the item 602 are arranged in order from the left.

このため、図６Ａに示す「収納機関番号」の項目名の第１座標６０５には、「収納機関番号」の項目名枠の四隅に対応する図６Ｂの格子点情報６１２上の座標(０，０)、（０，１）、（１，１）、（１，０）が登録される。 Therefore, the first coordinates 605 of the item name “storage engine number” shown in FIG. 6A have coordinates (0, 0) on the grid point information 612 in FIG. 6B corresponding to the four corners of the item name frame of “storage engine number”. 0), (0, 1), (1, 1), and (1, 0) are registered.

また、図６Ａに示す「収納機関番号」のデータの第１座標６０５には、「収納機関番号」のデータ枠の四隅に対応する図６Ｂの格子点情報６１２上の座標(１，０)、（１，１）、（２，１）、（２，０）が登録される。 Further, the first coordinates 605 of the “storage engine number” data shown in FIG. 6A include coordinates (1, 0) on the lattice point information 612 in FIG. 6B corresponding to the four corners of the “storage engine number” data frame, (1,1), (2,1), (2,0) are registered.

また、図６Ａに示す「お客様番号」の項目名の第１座標６０５には、「お客様番号」の項目名枠の四隅に対応する図６Ｂの格子点情報６１２上の座標(２，０)、（２，１）、（３，１）、（３，０）が登録される。 Further, the first coordinates 605 of the item name “customer number” shown in FIG. 6A include coordinates (2, 0) on the lattice point information 612 in FIG. 6B corresponding to the four corners of the item name frame of “customer number”. (2,1), (3,1), (3,0) are registered.

また、図６Ａに示す「お客様番号」のデータの第１座標６０５には、「お客様番号」のデータ枠の四隅に対応する図６Ｂの格子点情報６１２上の座標(３，０)、（３，１）、（４，１）、（４，０）が登録される。 Further, the first coordinates 605 of the “customer number” data shown in FIG. 6A include coordinates (3, 0), (3) on the grid point information 612 of FIG. 6B corresponding to the four corners of the “customer number” data frame. , 1), (4, 1), (4, 0) are registered.

また、図６Ａに示す「確認番号」の項目名の第１座標６０５には、「確認番号」の項目名枠の四隅に対応する図６Ｂの格子点情報６１２上の座標(４，０)、（４，１）、（５，１）、（５，０）が登録される。 Further, the first coordinates 605 of the item name “confirmation number” shown in FIG. 6A include coordinates (4, 0) on the lattice point information 612 in FIG. 6B corresponding to the four corners of the item name frame of “confirmation number”, (4,1), (5,1), (5,0) are registered.

また、図６Ａに示す「確認番号」のデータの第１座標６０５には、「確認番号」のデータ枠の四隅に対応する図６Ｂの格子点情報６１２上の座標(５，０)、（５，１）、（６，１）、（６，０）が登録される。 Further, the first coordinates 605 of the “confirmation number” data shown in FIG. 6A include coordinates (5, 0), (5) on the lattice point information 612 of FIG. 6B corresponding to the four corners of the “confirmation number” data frame. , 1), (6, 1), (6, 0) are registered.

図６Ｃは、本発明の実施形態の第２座標６０６の説明図である。 FIG. 6C is an explanatory diagram of the second coordinates 606 according to the embodiment of this invention.

第２座標６０６は、項目名の枠と当該項目名に対応するデータの枠とが上下方向に配列される場合の座標である。 The second coordinates 606 are coordinates when an item name frame and a data frame corresponding to the item name are arranged vertically.

図６Ｃの帳票６２１では、項目６０２に登録された「収納機関番号」、「お客様番号」、及び「確認番号」の項目名枠が左から順に並び、項目名枠に対応するデータ枠が、項目名枠の下方に並ぶ。 In the form 621 of FIG. 6C, the item name frames of “storage organization number”, “customer number”, and “confirmation number” registered in the item 602 are arranged in order from the left, and the data frame corresponding to the item name frame is an item. Line up below the name frame.

このため、図６Ａに示す「収納機関番号」の項目名の第２座標６０６には、「収納機関番号」の項目名枠の四隅に対応する図６Ｃの格子点情報６２２上の座標(０，０)、（０，１）、（１，１）、（１，０）が登録される。 Therefore, the second coordinates 606 of the item name “storage engine number” shown in FIG. 6A have coordinates (0, 0) on the grid point information 622 in FIG. 6C corresponding to the four corners of the item name frame of “storage engine number”. 0), (0, 1), (1, 1), and (1, 0) are registered.

また、図６Ａに示す「収納機関番号」のデータの第２座標６０６には、「収納機関番号」のデータ枠の四隅に対応する図６Ｃの格子点情報６２２上の座標(０，１)、（０，２）、（１，２）、（１，１）が登録される。 Further, the second coordinates 606 of the “storage engine number” data shown in FIG. 6A include coordinates (0, 1) on the grid point information 622 in FIG. 6C corresponding to the four corners of the “storage engine number” data frame, (0, 2), (1, 2), (1, 1) are registered.

また、図６Ａに示す「お客様番号」の項目名の第２座標６０６には、「お客様番号」の項目名枠の四隅に対応する図６Ｃの格子点情報６２２上の座標(１，０)、（１，１）、（２，１）、（２，０）が登録される。 Further, the second coordinates 606 of the item name “customer number” shown in FIG. 6A include coordinates (1, 0) on the grid point information 622 in FIG. 6C corresponding to the four corners of the item name frame of “customer number”, (1,1), (2,1), (2,0) are registered.

また、図６Ａに示す「お客様番号」のデータの第２座標６０６には、「お客様番号」のデータ枠の四隅に対応する図６Ｃの格子点情報６２２上の座標(１，１)、（１，２）、（２，２）、（２，１）が登録される。 Further, the second coordinates 606 of the “customer number” data shown in FIG. 6A include coordinates (1, 1), (1) on the grid point information 622 in FIG. 6C corresponding to the four corners of the “customer number” data frame. , 2), (2, 2), (2, 1) are registered.

また、図６Ａに示す「確認番号」の項目名の第２座標６０６には、「確認番号」の項目名枠の四隅に対応する図６Ｃの格子点情報６２２上の座標(２，０)、（２，１）、（３，１）、（３，０）が登録される。 Further, the second coordinates 606 of the item name “confirmation number” shown in FIG. 6A include coordinates (2, 0) on the lattice point information 622 in FIG. 6C corresponding to the four corners of the item name frame of “confirmation number”, (2,1), (3,1), (3,0) are registered.

また、図６Ａに示す「確認番号」のデータの第２座標６０６には、「確認番号」のデータ枠の四隅に対応する図６Ｃの格子点情報６２２上の座標(２，１)、（２，２）、（３，２）、（３，１）が登録される。 Further, the second coordinates 606 of the “confirmation number” data shown in FIG. 6A include coordinates (2, 1), (2) on the lattice point information 622 in FIG. 6C corresponding to the four corners of the “confirmation number” data frame. , 2), (3, 2), (3, 1) are registered.

以上によって、帳票処理用計算機１００は、枠構造辞書６００を参照することによって取得対象となる枠の配置関係を把握できる。 As described above, the form processing computer 100 can grasp the arrangement relationship of the frames to be acquired by referring to the frame structure dictionary 600.

図７は、本発明の実施形態の枠構造照合処理のＰＡＤ図である。 FIG. 7 is a PAD diagram of the frame structure matching process according to the embodiment of the present invention.

枠構造照合処理は、図２に示すステップ２０４の処理で実行される。 The frame structure matching process is executed in the process of step 204 shown in FIG.

まず、制御部１０８は、枠構造辞書６００に登録された枠構造の数だけ、ステップ７０２の処理以降の処理を繰り返す（７０１）。例えば、図６Ａに示す枠構造辞書６００では、第１座標６０５及び第２座標６０６に対してステップ７０２以降の処理が繰り返し実行される。なお、ステップ７０１の処理では、ステップ７０２以降の処理で処理の対象となる一つの枠構造が選択される。 First, the control unit 108 repeats the processing after step 702 as many times as the number of frame structures registered in the frame structure dictionary 600 (701). For example, in the frame structure dictionary 600 shown in FIG. 6A, the processing after step 702 is repeatedly executed for the first coordinate 605 and the second coordinate 606. In the process of step 701, one frame structure to be processed in the processes after step 702 is selected.

次に、制御部１０８は、ステップ７０１の処理で選択された枠構造に対応して、入力された帳票画像から取得対象となる領域を選択し、選択した取得対象となる領域を格子点情報に変換する（７０２）。取得対象となる領域とは、図３に示す点線で囲まれた領域３０２である。また、変換後の格子点情報は例えば図５に示す格子点情報である。 Next, the control unit 108 selects an area to be acquired from the input form image corresponding to the frame structure selected in step 701, and sets the selected area to be acquired as grid point information. Conversion is performed (702). The region to be acquired is a region 302 surrounded by a dotted line shown in FIG. Also, the converted grid point information is, for example, the grid point information shown in FIG.

ステップ７０２の処理を具体的に説明する。 The processing in step 702 will be specifically described.

図６Ａに示す枠構造辞書６００では図示していないが、枠構造辞書６００では、枠構造（第１座標６０５及び第２座標６０６）に対応して、ステッ７０２の処理で選択されるべき領域の帳票画像内における座標（選択座標）が登録されている。 Although not shown in the frame structure dictionary 600 shown in FIG. 6A, in the frame structure dictionary 600, the region to be selected in the processing of step 702 corresponding to the frame structure (first coordinate 605 and second coordinate 606). Coordinates (selected coordinates) in the form image are registered.

制御部１０８は、ステップ７０１で選択した枠構造に対応する選択座標を参照し、当該選択座標によって指定される領域を取得対象となる領域として選択する。 The control unit 108 refers to the selected coordinates corresponding to the frame structure selected in step 701, and selects the area specified by the selected coordinates as the area to be acquired.

次に、制御部１０８は、ステップ７０２の処理で変換した格子点情報の各行の格子点情報と、枠構造辞書６００の処理対象となる枠構造の格子点情報とを対応付ける照合処理を実行する（７０３）。照合処理の詳細は、図８で詳細を説明する。 Next, the control unit 108 executes a matching process for associating the grid point information of each row of the grid point information converted in the process of Step 702 with the grid point information of the frame structure to be processed in the frame structure dictionary 600 ( 703). Details of the collation processing will be described with reference to FIG.

次に、制御部１０８は、ステップ７０３の処理で実行した照合処理の照合結果があるか否かを判定する（７０４）。 Next, the control unit 108 determines whether there is a collation result of the collation process executed in the process of Step 703 (704).

ステップ７０４の処理で、照合処理の照合結果があると判定された場合、照合結果を記憶部１１０に記憶し（７０５）、図２に示すステップ２０５の処理に処理を移行する。 If it is determined in step 704 that there is a collation result of the collation process, the collation result is stored in the storage unit 110 (705), and the process proceeds to the process in step 205 shown in FIG.

一方、ステップ７０４の処理で、照合処理の照合結果がないと判定された場合、図２に示すステップ２０５の処理に処理を移行する。 On the other hand, if it is determined in step 704 that there is no collation result of the collation processing, the processing proceeds to step 205 shown in FIG.

図８は、本発明の実施形態の照合処理のＰＡＤ図である。 FIG. 8 is a PAD diagram of the matching process according to the embodiment of the present invention.

照合処理は、図７に示すステップ７０３の処理で実行される。 The collation process is executed in the process of step 703 shown in FIG.

まず、制御部１０８は、枠構造辞書６００の処理対象の枠構造の行数分、ステップ８０２及び８０３の処理を繰り返し実行する（８０１）。ステップ８０１の処理で、制御部１０８は、枠構造辞書６００の処理対象の枠構造の行から、ステップ８０２及び８０３の処理の処理対象となる行を選択する。 First, the control unit 108 repeatedly executes the processes of steps 802 and 803 for the number of lines of the frame structure to be processed in the frame structure dictionary 600 (801). In step 801, the control unit 108 selects a row to be processed in steps 802 and 803 from the row of the frame structure to be processed in the frame structure dictionary 600.

そして、制御部１０８は、図７に示すステップ７０２の処理で変換された格子点情報の行数分、ステップ８０３の処理を繰り返し実行する（８０２）。ステップ８０２の処理で、制御部１０８は、ステップ７０２の処理で変換された格子点情報の行から、ステップ８０３の処理の処理対象となる行を選択する。 Then, the control unit 108 repeatedly executes the process of step 803 for the number of rows of the grid point information converted by the process of step 702 shown in FIG. 7 (802). In step 802, the control unit 108 selects a row to be processed in step 803 from the grid point information row converted in step 702.

次に、制御部１０８は、ステップ８０１の処理で選択した枠構造辞書６００の処理対象となる行、及び、ステップ８０２の処理で選択した格子点情報の処理対象となる行に対して、格子点ＤＰマッチングを実行する（８０３）。 Next, the control unit 108 applies grid points to the row to be processed in the frame structure dictionary 600 selected in step 801 and the row to be processed in the lattice point information selected in step 802. DP matching is executed (803).

格子点ＤＰマッチングは、ステップ８０２の処理で選択した格子点情報の処理対象となる行を構成する格子点（帳票格子点）のうち、ステップ８０１の処理で選択した枠構造辞書６００の処理対象となる行を構成する格子点（辞書格子点）に対応する格子点を、音声認識等に利用されている動的計画法（ＤｙｎａｍｉｃＰｒｏｇｒａｍｉｎｇ）を用いて探索する処理である。この格子点ＤＰマッチングの詳細は、特開２００４−１３９４８４号公報に記載されている。また、動的計画法の原理については、Ｔ．コルメン、Ｃ．ライザーソン，Ｒ．リベスト共著、「アルゴリズムイントロダクション」第２巻、Ｐ５〜２９、近代科学社、１９９５年出版をはじめ，さまざまな文献において解説されている。なお、格子点ＤＰマッチングの詳細は図８の説明の直後に説明する。 Lattice point DP matching is performed with the processing target of the frame structure dictionary 600 selected in step 801 among the lattice points (form lattice points) constituting the processing target line of the grid point information selected in step 802. This is a process of searching for a lattice point corresponding to a lattice point (dictionary lattice point) constituting a line using dynamic programming used for speech recognition or the like. Details of the lattice point DP matching are described in Japanese Patent Application Laid-Open No. 2004-139484. For the principle of dynamic programming, see T.W. Colmen, C.I. Riserson, R.A. Co-authored by Rivest, “Algorithm Introduction”, Volume 2, P5-29, Modern Science, published in 1995, and various other literature. Details of the lattice point DP matching will be described immediately after the description of FIG.

次に、制御部１０８は、格子点ＤＰマッチングの実行結果（枠構造照合結果）が存在するか否かを判定する（８０４）。 Next, the control unit 108 determines whether or not there is an execution result (frame structure matching result) of the lattice point DP matching (804).

ステップ８０４の処理で、格子点ＤＰマッチングの実行結果が存在すると判定された場合、制御部１０８は、当該格子点ＤＰマッチングの実行結果のスコアが閾値以上であるか否かを判定する（８０５）。 When it is determined in step 804 that the execution result of the lattice point DP matching exists, the control unit 108 determines whether or not the score of the execution result of the lattice point DP matching is equal to or greater than the threshold (805). .

ステップ８０５の処理で、格子点ＤＰマッチングの実行結果のスコアが閾値以上であると判定された場合、制御部１０８は、当該格子点ＤＰマッチングの実行結果を照合結果として返す（８０６）。 If it is determined in step 805 that the score of the execution result of the lattice point DP matching is equal to or greater than the threshold, the control unit 108 returns the execution result of the lattice point DP matching as a collation result (806).

一方、格子点ＤＰマッチングの実行結果が存在しないと判定された場合、又は、格子点ＤＰマッチングの実行結果のスコアが閾値未満であると判定された場合、制御部１０８は、照合結果がないという結果を返す（８０７）。 On the other hand, when it is determined that there is no execution result of the lattice point DP matching, or when it is determined that the score of the execution result of the lattice point DP matching is less than the threshold value, the control unit 108 has no verification result. The result is returned (807).

格子点ＤＰマッチングは、上述したように、帳票格子点から辞書格子点に対応する格子点を探索する処理である。ここで、帳票格子点が辞書格子点に対応するとは、帳票格子点と辞書格子点とが同じ格子点である場合、及び、辞書格子点を構成するすべての線分が帳票格子点に含まれる場合をいう。 As described above, the lattice point DP matching is a process of searching for a lattice point corresponding to the dictionary lattice point from the form lattice point. Here, a form lattice point corresponds to a dictionary lattice point when the form lattice point and the dictionary lattice point are the same lattice point, and all line segments constituting the dictionary lattice point are included in the form lattice point. Refers to cases.

辞書格子点のすべての線分が帳票格子点に含まれる場合とは、例えば、辞書格子点が図４に示す交点符号７であれば、帳票格子点が図４に示す交点符号１１、１３、及び１５である場合をいう。 The case where all the line segments of the dictionary lattice points are included in the form lattice point is, for example, if the dictionary lattice point is the intersection point code 7 shown in FIG. And 15.

以下、格子点ＤＰマッチングの処理手順について説明する。 Hereinafter, a processing procedure for lattice point DP matching will be described.

制御部１０８は、帳票格子点のＸ座標が０の格子点を処理対象の格子点（処理対象帳票格子点）として選択し、辞書格子点のＸ座標が０の格子点を処理対象の格子点（処理対象辞書格子点）として選択する。 The control unit 108 selects a grid point whose X coordinate of the form grid point is 0 as a grid point to be processed (processing target form grid point), and selects a grid point whose X coordinate of the dictionary grid point is 0 as a grid point to be processed. Select as (processing target dictionary grid point).

次に、制御部１０８は、処理対象帳票格子点が処理対象辞書格子点に対応するか否かを判定する。 Next, the control unit 108 determines whether or not the processing target form grid point corresponds to the processing target dictionary grid point.

処理対象帳票格子点が処理対象辞書格子点に対応すると判定された場合、制御部１０８は、以下の第１〜第４条件によって処理を分岐させる。 When it is determined that the processing target grid point corresponds to the processing target dictionary grid point, the control unit 108 branches the process according to the following first to fourth conditions.

第１条件は、処理対象帳票格子点及び処理対象辞書格子点が最後の格子点でないという条件である。ここで、処理対象帳票格子点及び処理対象辞書格子点の最後の格子点とは、処理対象帳票格子点及び処理対象辞書格子点のＸ座標をインクリメントした座標に格子点が存在しない処理対象帳票格子点及び処理対象辞書格子点をいう。 The first condition is that the processing target form grid point and the processing target dictionary grid point are not the last grid point. Here, the processing target form grid point and the last grid point of the processing target dictionary grid point are processing target form grids that do not have a grid point at coordinates obtained by incrementing the X coordinate of the processing target form grid point and the processing target dictionary grid point. Points and processing target dictionary grid points.

第１条件が成立する場合、制御部１０８は、処理対象帳票格子点及び処理対象辞書格子点を次の格子点に移動させる。具体的には、制御部１０８は、処理対象帳票格子点となっている格子点のＸ座標をインクリメントした座標にある格子点を新たな処理対象帳票格子点として選択し、処理対象辞書格子点となっている格子点のＸ座標をインクリメントした座標にある格子点を新たな処理対象辞書格子点として選択する。 When the first condition is satisfied, the control unit 108 moves the processing target form grid point and the processing target dictionary grid point to the next grid point. Specifically, the control unit 108 selects a grid point at a coordinate obtained by incrementing the X coordinate of a grid point that is a processing target form grid point as a new processing target form grid point, The grid point at the coordinate obtained by incrementing the X coordinate of the current grid point is selected as a new processing target dictionary grid point.

第２条件は、処理対象帳票格子点及び処理対象辞書格子点が最後の格子点であるという条件である。第２条件が成立した場合、帳票格子点と辞書格子点とが一対一で対応する関係である。 The second condition is that the processing target form grid point and the processing target dictionary grid point are the last grid point. When the second condition is satisfied, the form grid point and the dictionary grid point have a one-to-one correspondence.

そして、現在処理対象となっているステップ７０２の処理で変換された格子点情報の行の次の行を構成する帳票格子点、及び、現在処理対象となっている枠構造辞書６００の処理対象の枠構造の行の次の行を構成する辞書格子点でも、第２条件が成立すれば、２行の帳票格子点で区画される枠及び２行の辞書格子点で区画される枠が、一対一の関係で対応しており、図９（Ａ）及び（Ｂ）のような対応関係となる。この場合、制御部１０８は、この対応関係を組み合わせ結果として保持する。 Then, the form grid point that forms the next line of the grid point information line converted in the process of step 702 that is the current processing target, and the processing target of the frame structure dictionary 600 that is the current processing target. Even in a dictionary lattice point that forms the next row of the frame structure row, if the second condition is satisfied, a pair of frames defined by two form lattice points and a frame defined by two dictionary lattice points are paired. It corresponds by one relationship and becomes a correspondence relationship as shown in FIGS. In this case, the control unit 108 holds this correspondence as a combination result.

なお、本発明では、現在処理対象となっている行を構成する帳票格子点及び辞書格子点である条件が成立すれば、現在処理対象となっている行の次の行を構成する帳票格子点及び辞書格子点でも同じ条件が成立ものとする。現在処理対象となっている行を構成する帳票格子点及び辞書格子点で成立する条件と、現在処理対象となっている行の次の行を構成する帳票格子点及び辞書格子点で成立する条件とが異なる場合、制御部１０８は、これらの対応関係を組み合わせ結果として保持しない。 In the present invention, if a condition that is a grid grid point and a dictionary grid point constituting the currently processed line is satisfied, a form grid point constituting the next line of the currently processed line It is assumed that the same condition holds for dictionary grid points. Conditions established by the form grid points and dictionary grid points that make up the currently processed line, and conditions established by the form grid points and dictionary grid points that form the next line of the currently processed line Are different from each other, the control unit 108 does not hold these correspondences as a combination result.

第３条件は、処理対象帳票格子点が最後の格子点でなく、処理対象辞書格子点が最後の格子点であるという条件である。第３条件が成立した場合、図１３及び図１４に示すように、帳票格子点の数が辞書格子点の数よりも多く、つまり、入力された帳票の枠が枠構造辞書６００の枠構造に登録された枠の数よりも多く、帳票画像に含まれる文字等を罫線として誤って認識した可能性があるので、制御部１０８は、帳票格子点と辞書格子点との対応付けのすべての組み合わせを算出する強制対応処理を実行し、強制対応処理の実行結果を組み合わせ結果として保持する。強制対応処理の詳細は、図１４Ａ〜図１４Ｃで詳細を説明する。 The third condition is a condition that the processing-target form grid point is not the last grid point and the processing-target dictionary grid point is the last grid point. When the third condition is satisfied, as shown in FIGS. 13 and 14, the number of form grid points is larger than the number of dictionary grid points, that is, the frame of the input form becomes the frame structure of the frame structure dictionary 600. Since there is a possibility that the number of registered frames is more than the number of registered frames and characters etc. included in the form image may be mistakenly recognized as ruled lines, the control unit 108 has all combinations of correspondences between form lattice points and dictionary lattice points. Is executed, and the execution result of the forcible processing is held as a combination result. Details of the forcible response processing will be described in detail with reference to FIGS. 14A to 14C.

第４条件は、処理対象帳票格子点が最後の格子点であり、処理対象辞書格子点が最後の格子点でないという条件である。第４条件が成立した場合、帳票格子点の数が辞書格子点の数より少なく、帳票格子点によって構成される帳票画像の行が、辞書格子点によって構成される枠構造辞書６００の枠構造の行と対応しないものである。このため、制御部１０８は、組み合わせ結果を保持しない。 The fourth condition is a condition that the processing target lattice point is the last lattice point and the processing target dictionary lattice point is not the last lattice point. When the fourth condition is satisfied, the number of form lattice points is less than the number of dictionary lattice points, and the line of the form image composed of form lattice points is the frame structure of the frame structure dictionary 600 composed of dictionary lattice points. Does not correspond to a line. For this reason, the control unit 108 does not hold the combination result.

処理対象帳票格子点が処理対象格子点に対応しないと判定された場合、制御部１０８は、以下の第５〜第７条件によって処理を分岐させる。 When it is determined that the processing target grid point does not correspond to the processing target grid point, the control unit 108 branches the process according to the following fifth to seventh conditions.

第５条件は、処理対象帳票格子点が最後の格子点でないという条件である。第５条件が成立する場合、制御部１０８は、処理対象帳票格子点を次の格子点に移動させる。 The fifth condition is a condition that the processing-target form grid point is not the last grid point. When the fifth condition is satisfied, the control unit 108 moves the processing-target form grid point to the next grid point.

第６条件は、処理対象帳票格子点が最後の格子点であり、処理対象辞書格子点が最後の格子点であるという条件である。第６条件が成立した場合、制御部１０８は、第２条件が成立した場合と同様に、この対応関係を組み合わせ結果として保持する。 The sixth condition is a condition that the processing target lattice point is the last lattice point and the processing target dictionary lattice point is the last lattice point. When the sixth condition is satisfied, the control unit 108 holds this correspondence as a combination result, similarly to the case where the second condition is satisfied.

第７条件は、処理対象帳票格子点が最後の格子点であり、処理対象辞書格子点が最後の格子点でないという条件である。第７条件が成立した場合、第４条件と同様に、帳票格子点によって構成される帳票画像の行が、辞書格子点によって構成される枠構造辞書６００の枠構造の行と対応しないものであるため、制御部１０８は、組み合わせ結果を保持しない。 The seventh condition is a condition that the processing-target form grid point is the last grid point and the processing-target dictionary grid point is not the last grid point. When the seventh condition is satisfied, similarly to the fourth condition, the line of the form image constituted by the form lattice points does not correspond to the line of the frame structure of the frame structure dictionary 600 constituted by the dictionary lattice points. Therefore, the control unit 108 does not hold the combination result.

以上のように、格子点ＤＰマッチングでは、帳票格子点と辞書格子点とを対応付け、帳票画像の枠と枠構造辞書の枠構造の枠とを対応付ける処理が実行される。 As described above, in lattice point DP matching, processing is performed in which form lattice points and dictionary lattice points are associated with each other and form image frames are associated with frame structure frames in the frame structure dictionary.

図９は、本発明の実施形態の格子点ＤＰマッチングの組み合わせ結果の例の説明図である。 FIG. 9 is an explanatory diagram of an example of a combination result of lattice point DP matching according to the embodiment of this invention.

図９（Ａ）を説明する前に、ステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の０行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の０行目である場合を例に、格子点ＤＰマッチングについて説明する。 Before describing FIG. 9A, the row to be processed selected in the process of step 801 is the 0th row of the grid point information 612, and the row to be processed selected in the process of step 802 is a grid point. The lattice point DP matching will be described by taking the case of the 0th line of the information 501 as an example.

まず、制御部１０８は、格子点情報６１２のＸ座標が０の辞書格子点（図４に示す交点符号７の格子点）を処理対象辞書格子点として選択し、格子点情報５０１のＸ座標が０の帳票格子点（図４に示す交点符号７の格子点）を処理対象帳票格子点として選択する。そして、制御部１０８は、処理対象帳票格子点と処理対象辞書格子点とが同じであるので、処理対象帳票格子点は処理対象辞書格子点に対応する。この場合、処理対象帳票格子点及び処理対象辞書格子点は最後の格子点でないので、第１条件が成立する。そして、制御部１０８は、格子点情報６１２のＸ座標が１の辞書格子点（図４に示す交点符号１３の格子点）を新たな処理対象辞書格子点として選択し、格子点情報５０２のＸ座標が１の帳票格子点（図４に示す交点符号５）を新たな処理対象帳票格子点として選択する。 First, the control unit 108 selects a dictionary grid point having an X coordinate of 0 in the grid point information 612 (a grid point with an intersection code 7 shown in FIG. 4) as a processing target dictionary grid point, and the X coordinate of the grid point information 501 is A form grid point of 0 (lattice point of intersection code 7 shown in FIG. 4) is selected as a processing form grid point. Then, since the processing target form grid point and the processing target dictionary grid point are the same, the control unit 108 corresponds to the processing target dictionary grid point. In this case, since the processing target form grid point and the processing target dictionary grid point are not the last grid point, the first condition is satisfied. Then, the control unit 108 selects a dictionary lattice point having the X coordinate of 1 in the lattice point information 612 (the lattice point with the intersection code 13 shown in FIG. 4) as a new processing target dictionary lattice point, and the X of the lattice point information 502 is selected. A form grid point with coordinates 1 (intersection code 5 shown in FIG. 4) is selected as a new process target form grid point.

次に、制御部１０８は、処理対象帳票格子点は処理対象辞書格子点に対応しないと判定する。これは、図４を参照するに、交点符号５の格子点（処理対象帳票格子点）は、交点符号１３の格子点（処理対象辞書格子点）のすべての線分を含まないためである。また、処理対象帳票格子点は最後の格子点でないので、第５条件が成立する。このため、格子点情報５０２のＸ座標が２の帳票格子点（図４に示す交点符号５）を新たな処理対象帳票格子点として選択する。 Next, the control unit 108 determines that the processing target form grid point does not correspond to the processing target dictionary grid point. This is because, with reference to FIG. 4, the grid point of intersection code 5 (processing target form grid point) does not include all the line segments of the grid point of intersection code 13 (processing target dictionary grid point). Further, since the processing-target form grid point is not the last grid point, the fifth condition is satisfied. For this reason, the form grid point (intersection code 5 shown in FIG. 4) whose X coordinate of the grid point information 502 is 2 is selected as a new process target form grid point.

このように、制御部１０８が格子点ＤＰマッチングを実行していくことによって、Ｘ座標１の辞書格子点とＸ座標３の帳票格子点とが対応付けられ、Ｘ座標２の辞書格子点とＸ座標７の帳票格子点とが対応付けられ、Ｘ座標３の辞書格子点とＸ座標９の帳票格子点とが対応付けられ、Ｘ座標４の辞書格子点とＸ座標１４の帳票格子点とが対応付けられる。 As described above, the control unit 108 executes the lattice point DP matching, so that the dictionary lattice point of the X coordinate 1 and the form lattice point of the X coordinate 3 are associated with each other. The grid grid point of coordinate 7 is associated, the dictionary grid point of X coordinate 3 and the grid grid point of X coordinate 9 are associated, and the dictionary grid point of X coordinate 4 and the grid grid point of X coordinate 14 are It is associated.

ここで、Ｘ座標４の辞書格子点とＸ座標１４の帳票格子点とが対応すると判定された場合、処理対象帳票格子点が最後の格子点であり、処理対象辞書格子点が最後の格子点でないので、第４条件が成立し、帳票格子点によって構成される帳票画像の０行が、辞書格子点によって構成される枠構造辞書６００の枠構造の０行と対応しないものと判定される。 Here, when it is determined that the dictionary grid point of the X coordinate 4 corresponds to the form grid point of the X coordinate 14, the processing target grid point is the last grid point, and the processing target dictionary grid point is the last grid point. Therefore, the fourth condition is satisfied, and it is determined that the 0th row of the form image constituted by the form lattice points does not correspond to the 0th row of the frame structure of the frame structure dictionary 600 constituted by the dictionary lattice points.

なお、ステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の１行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の１行目である場合も、上述した通りに判定される。 Note that the processing target row selected in step 801 is the first row of grid point information 612, and the processing target row selected in step 802 is the first row of grid point information 501. Is also determined as described above.

このため、枠構造辞書６００の格子点情報６１２の０行目と１行目との間の枠、及び帳票画像の格子点情報５０１の０行目と１行目との間の枠は対応しない。 For this reason, the frame between the 0th line and the 1st line of the lattice point information 612 of the frame structure dictionary 600 and the frame between the 0th line and the 1st line of the lattice point information 501 of the form image do not correspond. .

次に、図９（Ａ）について説明する。 Next, FIG. 9A will be described.

図９（Ａ）では、ステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の０行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の１行目である場合を例に、格子点ＤＰマッチングについて説明する。 In FIG. 9A, the processing target row selected in step 801 is the 0th row of the grid point information 612, and the processing target row selected in step 802 is 1 in the grid point information 501. The lattice point DP matching will be described by taking the case of the line as an example.

格子点ＤＰマッチングが実行されると、Ｘ座標０の辞書格子点とＸ座標０の帳票格子点とが対応付けられ、Ｘ座標１の辞書格子点とＸ座標１の帳票格子点とが対応付けられ、Ｘ座標２の辞書格子点とＸ座標４の帳票格子点とが対応付けられ、Ｘ座標３の辞書格子点とＸ座標５の帳票格子点とが対応付けられ、Ｘ座標４の辞書格子点とＸ座標１０の帳票格子点とが対応付けられ、Ｘ座標５の辞書格子点とＸ座標１２の帳票格子点とが対応付けられ、Ｘ座標６の辞書格子点とＸ座標１４の帳票格子点とが対応付けられる。 When the lattice point DP matching is executed, the dictionary lattice point of X coordinate 0 and the form lattice point of X coordinate 0 are associated with each other, and the dictionary lattice point of X coordinate 1 and the form lattice point of X coordinate 1 are associated with each other. The X-coordinate 2 dictionary lattice point is associated with the X-coordinate 4 form lattice point, the X-coordinate 3 dictionary lattice point is associated with the X-coordinate 5 form lattice point, and the X-coordinate 4 dictionary lattice point The point is associated with the form grid point of the X coordinate 10, the dictionary grid point of the X coordinate 5 is associated with the form grid point of the X coordinate 12, the dictionary grid point of the X coordinate 6 and the form grid point of the X coordinate 14 Points are associated with each other.

ここで、Ｘ座標６の辞書格子点とＸ座標１４の帳票格子点とが対応すると判定された場合、処理対象帳票格子点及び処理対象辞書格子点が最後の格子点であるので、第２条件が成立する。したがって、制御部１０８は、帳票格子点と辞書格子点とが一対一で対応する関係であると判断する。 Here, if it is determined that the dictionary grid point of the X coordinate 6 corresponds to the form grid point of the X coordinate 14, the processing target form grid point and the processing target dictionary grid point are the last grid point. Is established. Therefore, the control unit 108 determines that the form grid points and the dictionary grid points have a one-to-one correspondence.

また、ステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の１行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の２行目である場合も、上述した通りに判定される。 In addition, when the processing target row selected in step 801 is the first row of the grid point information 612, and the processing target row selected in step 802 is the second row of the grid point information 501. Is also determined as described above.

このため、図９（Ａ）に示す領域９０１に示すように、枠構造辞書６００の格子点情報６１２の０行目と１行目との間の枠、及び帳票画像の格子点情報５０１の１行目と２行目との間の枠はそれぞれ一対一の関係で対応付けられる。 For this reason, as shown in an area 901 shown in FIG. 9A, the frame between the 0th and 1st lines of the grid point information 612 of the frame structure dictionary 600, and 1 of the grid point information 501 of the form image. The frames between the second line and the second line are associated with each other in a one-to-one relationship.

次に、図９（Ｂ）について説明する。 Next, FIG. 9B will be described.

図９（Ｂ）では、ステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の０行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の２行目である場合を例に、格子点ＤＰマッチングについて説明する。 In FIG. 9B, the row to be processed selected in the processing of step 801 is the 0th row of the grid point information 612, and the row to be processed selected in the processing of step 802 is 2 of the grid point information 501. The lattice point DP matching will be described by taking the case of the line as an example.

格子点ＤＰマッチングが実行されると、Ｘ座標０の辞書格子点とＸ座標０の帳票格子点とが対応付けられ、Ｘ座標１の辞書格子点とＸ座標２の帳票格子点とが対応付けられ、Ｘ座標２の辞書格子点とＸ座標６の帳票格子点とが対応付けられ、Ｘ座標３の辞書格子点とＸ座標８の帳票格子点とが対応付けられ、Ｘ座標４の辞書格子点とＸ座標１１の帳票格子点とが対応付けられ、Ｘ座標５の辞書格子点とＸ座標１３の帳票格子点とが対応付けられ、Ｘ座標６の辞書格子点とＸ座標１４の帳票格子点とが対応付けられる。 When lattice point DP matching is executed, the dictionary lattice point of X coordinate 0 and the form lattice point of X coordinate 0 are associated, and the dictionary lattice point of X coordinate 1 and the form lattice point of X coordinate 2 are associated. The X-coordinate 2 dictionary lattice point is associated with the X-coordinate 6 form lattice point, the X-coordinate 3 dictionary lattice point is associated with the X-coordinate 8 form lattice point, and the X-coordinate 4 dictionary lattice point. The point is associated with the form grid point of the X coordinate 11, the dictionary grid point of the X coordinate 5 is associated with the form grid point of the X coordinate 13, the dictionary grid point of the X coordinate 6 and the form grid point of the X coordinate 14 Points are associated with each other.

また、ステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の１行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の３行目である場合も、上述した通りに判定される。 In addition, when the row to be processed selected in the processing of step 801 is the first row of the grid point information 612, and the row to be processed selected in the processing of step 802 is the third row of the grid point information 501. Is also determined as described above.

このため、図９（Ｂ）に示す領域９０２に示すように、枠構造辞書６００の格子点情報６１２の０行目と１行目との間の枠、及び帳票画像の格子点情報５０１の２行目と３行目との間の枠はそれぞれ一対一の関係で対応付けられる。 For this reason, as shown in an area 902 shown in FIG. 9B, the frame between the 0th and 1st lines of the grid point information 612 of the frame structure dictionary 600, and 2 of the grid point information 501 of the form image. The frames between the row and the third row are associated with each other in a one-to-one relationship.

以上のように、図９（Ａ）及び（Ｂ）では、第３条件が成立しない場合の格子点ＤＰマッチングの例について説明した。 As described above, in FIGS. 9A and 9B, the example of the lattice point DP matching when the third condition is not satisfied has been described.

なお、図９（Ａ）及び（Ｂ）に示す領域９０１及び９０２を、枠構造照合結果という。 Note that regions 901 and 902 shown in FIGS. 9A and 9B are referred to as frame structure matching results.

次に、文字行抽出処理について図１０を用いて説明する。 Next, the character line extraction process will be described with reference to FIG.

図１０は、本発明の実施形態の文字行抽出処理のＰＡＤ図である。 FIG. 10 is a PAD diagram of character line extraction processing according to the embodiment of this invention.

文字行抽出処理は、ステップ２０６の処理で実行される。 The character line extraction process is executed in step 206.

まず、制御部１０８は、図７に示すステップ７０５の処理で記憶部１１０に記憶された照合結果の数だけステップ１００２の処理以降の処理を実行する（１００１）。ステップ１００１の処理では、制御部１０８は、記憶部１１０に記憶された照合結果から、処理対象となる照合結果を一つ選択する。 First, the control unit 108 executes processing subsequent to the processing of step 1002 by the number of collation results stored in the storage unit 110 in the processing of step 705 shown in FIG. 7 (1001). In the processing of step 1001, the control unit 108 selects one matching result to be processed from the matching results stored in the storage unit 110.

次に、制御部１０８は、ステップ１００１の処理で選択された照合結果に存在する枠の数だけ、ステップ１００３の処理以降の処理を実行する（１００２）。ステップ１００２の処理では、制御部１０８は、ステップ１００１の処理で選択された照合結果に存在する枠から、処理対象となる枠を一つ選択する。 Next, the control unit 108 executes processing subsequent to the processing of step 1003 by the number of frames present in the collation result selected in the processing of step 1001 (1002). In the process of step 1002, the control unit 108 selects one frame to be processed from the frames existing in the collation result selected in the process of step 1001.

次に、制御部１０８は、ステップ１００５の処理で記憶される文字行情報（抽出済み文字行）を参照して、ステップ１００２の処理で選択された枠が未だ文字行を抽出していない枠であるか否かを判定する（１００３）。 Next, the control unit 108 refers to the character line information (extracted character line) stored in the process of step 1005, and the frame selected in the process of step 1002 is a frame that has not yet extracted the character line. It is determined whether or not there is (1003).

ステップ１００２の処理で選択された枠が未だ文字行を抽出していない枠であると、ステップ１００３の処理で判定された場合、制御部１０８は、当該枠から文字行を抽出する（１００４）。 If it is determined in step 1003 that the frame selected in step 1002 is a frame from which a character line has not yet been extracted, the control unit 108 extracts a character line from the frame (1004).

そして、制御部１０８は、ステップ１００４の処理で抽出した文字行と当該文字行を抽出した枠の座標とを対応付けた文字行情報を記憶部１１０に記憶する（１００５）。 Then, the control unit 108 stores, in the storage unit 110, character line information in which the character line extracted in step 1004 is associated with the coordinates of the frame from which the character line is extracted (1005).

一方、ステップ１００２の処理で選択された枠が既に文字行を抽出した枠である場合、制御部１０８は、ステップ１００２の処理に処理を移行し、処理対象となる枠を新たに選択する。 On the other hand, when the frame selected in the process of step 1002 is a frame from which a character line has already been extracted, the control unit 108 shifts the process to the process of step 1002 and newly selects a frame to be processed.

文字行抽出処理の例について、図１１を用いて説明する。図１１は、本発明の実施形態の文字行抽出処理によって抽出された文字行の一例の説明図である。 An example of character line extraction processing will be described with reference to FIG. FIG. 11 is an explanatory diagram illustrating an example of a character line extracted by the character line extraction process according to the embodiment of this invention.

図１１に示す文字行１１０１〜１１０６は、図９（Ａ）に示す領域９０１に対して文字行抽出処理が実行されて抽出された文字行を示し、図１１に示す１１１１〜１１１６は、図９（Ｂ）に示す領域９０２に対して文字行抽出処理が実行されて抽出された文字行を示す。 Character lines 1101 to 1106 shown in FIG. 11 indicate character lines extracted by executing character line extraction processing on the area 901 shown in FIG. 9A, and 1111 to 1116 shown in FIG. A character line extracted by executing a character line extraction process on the area 902 shown in FIG.

まず、文字行１１０１〜１１０６について説明する。 First, the character lines 1101 to 1106 will be described.

図１１に示す「収納機関」「番号」１１０１は、図９（Ａ）に示す座標(０，１)、（０，２）、（１，２）、（１，１）の枠から抽出された文字行であり、「収納機関」「番号」１１０１は、座標(０，１)、（０，２）、（１，２）、（１，１）と対応付けて記憶される。 11 is extracted from the frame of coordinates (0, 1), (0, 2), (1, 2), (1, 1) shown in FIG. 9 (A). The “storage organization” and “number” 1101 are stored in association with the coordinates (0, 1), (0, 2), (1, 2), (1, 1).

図１１に示す「１２３４５」１１０２は、図９（Ａ）に示す座標(１，１)、（１，２）、（４，２）、（４，１）の枠から抽出された文字行であり、「１２３４５」１１０２は、座標(１，１)、（１，２）、（４，２）、（４，１）と対応付けて記憶される。 “12345” 1102 shown in FIG. 11 is a character line extracted from the frame of coordinates (1,1), (1,2), (4,2), (4,1) shown in FIG. Yes, “12345” 1102 is stored in association with coordinates (1,1), (1,2), (4,2), and (4,1).

図１１に示す「お客様」「番号」１１０３は、図９（Ａ）に示す座標(４，１)、（４，２）、（５，２）、（５，１）の枠から抽出された文字行であり、「お客様」「番号」１１０３は、座標(４，１)、（４，２）、（５，２）、（５，１）と対応付けて記憶される。 “Customer” “number” 1103 shown in FIG. 11 is extracted from the frame of coordinates (4, 1), (4, 2), (5, 2), (5, 1) shown in FIG. It is a character line, and “customer” and “number” 1103 are stored in association with coordinates (4, 1), (4, 2), (5, 2), and (5, 1).

図１１に示す「６７８９０１２３４５６」１１０４は、図９（Ａ）に示す座標(５，１)、（５，２）、（１０，２）、（１０，１）の枠から抽出された文字行であり、「６７８９０１２３４５６」１１０４は、座標(５，１)、（５，２）、（１０，２）、（１０，１）と対応付けて記憶される。 “67890123456” 1104 shown in FIG. 11 is a character line extracted from the frame of coordinates (5, 1), (5, 2), (10, 2), (10, 1) shown in FIG. Yes, “67890123456” 1104 is stored in association with coordinates (5,1), (5,2), (10,2), (10,1).

図１１に示す「確認」「番号」１１０５は、図９（Ａ）に示す座標(１０，１)、（１０，２）、（１２，２）、（１２，１）の枠から抽出された文字行であり、「確認」「番号」１１０５は、座標(１０，１)、（１０，２）、（１２，２）、（１２，１）と対応付けて記憶される。 The “confirmation” “number” 1105 shown in FIG. 11 is extracted from the frame of coordinates (10, 1), (10, 2), (12, 2), (12, 1) shown in FIG. It is a character line, and “confirmation” “number” 1105 is stored in association with coordinates (10, 1), (10, 2), (12, 2), and (12, 1).

図１１に示す「７８９０１２」１１０６は、図９（Ａ）に示す座標(１２，１)、（１２，２）、（１４，２）、（１４，１）の枠から抽出された文字行であり、「７８９０１２」１１０６は、座標(１２，１)、（１２，２）、（１４，２）、（１４，１）と対応付けて記憶される。 “789012” 1106 shown in FIG. 11 is a character line extracted from the frame of coordinates (12, 1), (12, 2), (14, 2), (14, 1) shown in FIG. Yes, “7899012” 1106 is stored in association with coordinates (12, 1), (12, 2), (14, 2), (14, 1).

次に、文字行１１１１〜１１１６について説明する。 Next, the character lines 1111 to 1116 will be described.

図１１に示す「会員番号」１１１１は、図９（Ｂ）に示す座標（０，２）、（０，３）、（２，３）、（２，２）の枠から抽出された文字行であり、「会員番号」１１１１は、座標（０，２）、（０，３）、（２，３）、（２，２）と対応付けて記憶される。 “Member number” 1111 shown in FIG. 11 is a character line extracted from the frame of coordinates (0, 2), (0, 3), (2, 3), (2, 2) shown in FIG. The “membership number” 1111 is stored in association with the coordinates (0, 2), (0, 3), (2, 3), (2, 2).

図１１に示す「０００−０００−０００」１１１２は、図９（Ｂ）に示す座標（２，２）、（２，３）、（６，３）、（６，２）の枠から抽出された文字行であり、「０００−０００−０００」１１１２は、座標（２，２）、（２，３）、（６，３）、（６，２）と対応付けて記憶される。 “000-000-000” 1112 shown in FIG. 11 is extracted from the frame of coordinates (2, 2), (2, 3), (6, 3), (6, 2) shown in FIG. 9B. "000-000-000" 1112 is stored in association with coordinates (2, 2), (2, 3), (6, 3), (6, 2).

図１１に示す「請求番号」１１１３は、図９（Ｂ）に示す座標（６，２）、（６，３）、（８，３）、（８，２）の枠から抽出された文字行であり、「請求番号」１１１３は、座標（６，２）、（６，３）、（８，３）、（８，２）と対応付けて記憶される。 “Billing number” 1113 shown in FIG. 11 is a character line extracted from the frame of coordinates (6, 2), (6, 3), (8, 3), (8, 2) shown in FIG. “Billing number” 1113 is stored in association with coordinates (6, 2), (6, 3), (8, 3), (8, 2).

図１１に示す「１２３４」１１１４は、図９（Ｂ）に示す座標（８，２）、（８，３）（１１，３）、（１１，２）の枠から抽出された文字行であり、「請求番号」１１１４は、座標（８，２）、（８，３）、（１１，３）、（１１，２）と対応付けて記憶される。 “1234” 1114 shown in FIG. 11 is a character line extracted from the frame of coordinates (8, 2), (8, 3) (11, 3), (11, 2) shown in FIG. 9B. “Billing number” 1114 is stored in association with coordinates (8, 2), (8, 3), (11, 3), (11, 2).

図１１に示す「お支払い期日」１１１５は、図９（Ｂ）に示す座標（１１，２）、（１１，３）（１３，３）、（１３，２）の枠から抽出された文字行であり、「請求番号」１１１３は、座標（１１，２）、（１１，３）（１３，３）、（１３，２）と対応付けて記憶される。 “Payment due date” 1115 shown in FIG. 11 is a character line extracted from the frame of coordinates (11, 2), (11, 3) (13, 3), (13, 2) shown in FIG. 9B. “Billing number” 1113 is stored in association with coordinates (11, 2), (11, 3) (13, 3), (13, 2).

図１１に示す「５月２０日」１１１６は、図９（Ｂ）に示す座標（１３，２）、（１３，３）（１４，３）、（１４，２）の枠から抽出された文字行であり、「５月２０日」１１１６は、座標（１３，２）、（１３，３）（１４，３）、（１４，２）と対応付けて記憶される。 “May 20” 1116 shown in FIG. 11 is a character extracted from the frame of coordinates (13, 2), (13, 3) (14, 3), (14, 2) shown in FIG. 9B. “May 20” 1116 is stored in association with coordinates (13, 2), (13, 3) (14, 3), (14, 2).

以上のように、文字行抽出処理によって抽出された文字行は、当該文字行が抽出された枠と対応付けて記憶される。これによって、既に文字行が抽出された枠に対して再度文字行を抽出しなくてもよく、処理コストを削減できる。 As described above, the character line extracted by the character line extraction process is stored in association with the frame from which the character line is extracted. As a result, it is not necessary to extract a character line again from a frame in which a character line has already been extracted, and the processing cost can be reduced.

次に、図１２〜図１５を用いて、枠が誤って抽出された場合について説明する。 Next, a case where a frame is erroneously extracted will be described with reference to FIGS.

図１２は、本発明の実施形態の取得対象となる領域３０２の説明図である。 FIG. 12 is an explanatory diagram of the area 302 to be acquired according to the embodiment of this invention.

制御部１０８は、矩形１２０１で囲まれた「１」のように、枠の上下の横罫線に接触している文字又は横罫線に接触しそうな文字（以下、このような文字を総称して接触文字という）を縦罫線として認識する場合がある。接触文字を含む取得対象となる領域が図７に示すステップ７０２の処理で格子点情報に変換されると、図１３に示すような格子点情報１３０１となる。 The control unit 108, for example, “1” surrounded by a rectangle 1201, is a character that is in contact with a horizontal ruled line at the top and bottom of the frame or a character that is likely to be in contact with the horizontal ruled line (hereinafter, such characters are collectively referred to as a contact). May be recognized as vertical ruled lines. When a region to be acquired including a contact character is converted into grid point information by the processing in step 702 shown in FIG. 7, grid point information 1301 as shown in FIG. 13 is obtained.

図１３は、本発明の実施形態の接触文字を含む入力帳票を格子点情報に変換した場合の説明図である。 FIG. 13 is an explanatory diagram when an input form including contact characters according to the embodiment of the present invention is converted into grid point information.

図１３に示す格子点情報１３０１の矩形１３０２で示すように、図１２に示す矩形１２０１で囲まれた「１」が座標（９，１）で図４に示す交点符号「１３」及び座標（９，２）で図４に示す交点符号「１４」として変換されている。 As shown by the rectangle 1302 of the grid point information 1301 shown in FIG. 13, “1” surrounded by the rectangle 1201 shown in FIG. 12 is the coordinate (9, 1) and the intersection point code “13” and the coordinate (9 , 2) is converted as the intersection code “14” shown in FIG.

したがって、格子点情報１３０１に基づいて抽出された図１３に示す格子点情報１３０１の１行目及び２行目の枠の数は、７個となり、枠構造辞書６００の第１座標６０５の枠構造に登録された枠の数（６個）より多くなる。したがって、図１３に示す格子点情報１３０１の１行目及び２行目の枠と枠構造辞書６００の第１座標６０５に登録された枠との対応付けは、図９（Ａ）に示すように一通りでなく、複数通り考えられる。 Accordingly, the number of frames in the first and second lines of the grid point information 1301 shown in FIG. 13 extracted based on the grid point information 1301 is 7, and the frame structure of the first coordinate 605 in the frame structure dictionary 600 is displayed. More than the number of frames registered in (6). Accordingly, the correspondence between the first and second frame of the grid point information 1301 shown in FIG. 13 and the frame registered in the first coordinate 605 of the frame structure dictionary 600 is as shown in FIG. 9A. There are multiple ways to consider.

格子点情報１３０１の枠と枠構造辞書６００の第１座標６０５の枠構造６０５に登録された枠との対応付けについて図１４Ａ〜図１４Ｃを用いて説明する。 The association between the frame of the grid point information 1301 and the frame registered in the frame structure 605 of the first coordinate 605 of the frame structure dictionary 600 will be described with reference to FIGS. 14A to 14C.

図１４Ａ〜図１４Ｃは、本発明の実施形態の格子点ＤＰマッチングの組み合わせ結果の例の説明図である。 FIG. 14A to FIG. 14C are explanatory diagrams illustrating examples of combination results of lattice point DP matching according to the embodiment of this invention.

図１４Ａ（Ａ）では、図８に示すステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の０行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の１行目である場合を例に、格子点ＤＰマッチングについて説明する。 In FIG. 14A (A), the processing target row selected in step 801 shown in FIG. 8 is the 0th row of the grid point information 612, and the processing target row selected in step 802 is the grid point. The lattice point DP matching will be described by taking the case of the first line of information 501 as an example.

格子点ＤＰマッチングが実行されると、Ｘ座標０の辞書格子点とＸ座標０の帳票格子点とが対応付けられ、Ｘ座標１の辞書格子点とＸ座標１の帳票格子点とが対応付けられ、Ｘ座標２の辞書格子点とＸ座標４の帳票格子点とが対応付けられ、Ｘ座標３の辞書格子点とＸ座標５の帳票格子点とが対応付けられ、Ｘ座標４の辞書格子点とＸ座標９の帳票格子点とが対応付けられ、Ｘ座標５の辞書格子点とＸ座標１１の帳票格子点とが対応付けられ、Ｘ座標６の辞書格子点とＸ座標１３の帳票格子点とが対応付けられる。 When the lattice point DP matching is executed, the dictionary lattice point of X coordinate 0 and the form lattice point of X coordinate 0 are associated with each other, and the dictionary lattice point of X coordinate 1 and the form lattice point of X coordinate 1 are associated with each other. The X-coordinate 2 dictionary lattice point is associated with the X-coordinate 4 form lattice point, the X-coordinate 3 dictionary lattice point is associated with the X-coordinate 5 form lattice point, and the X-coordinate 4 dictionary lattice point The point is associated with the form grid point of X coordinate 9, the dictionary grid point of X coordinate 5 is associated with the form grid point of X coordinate 11, the dictionary grid point of X coordinate 6 and the form grid point of X coordinate 13 Points are associated with each other.

ここで、Ｘ座標６の辞書格子点とＸ座標１３の帳票格子点とが対応すると判定された場合、処理対象帳票格子点が最後の格子点でなく、処理対象辞書格子点が最後の格子点でないので、第３条件が成立する。したがって、制御部１０８は、強制対応処理を実行する。 Here, when it is determined that the dictionary grid point of the X coordinate 6 corresponds to the form grid point of the X coordinate 13, the processing target grid grid point is not the last grid point but the processing target grid grid point is the last grid point. Therefore, the third condition is satisfied. Therefore, the control unit 108 executes a forcible response process.

強制対応処理は、上述したように、帳票格子点と辞書格子点との対応付けのすべての組み合わせを算出する処理であり、図１４Ａ（Ｂ）〜図１４Ｃ（Ｈ）を用いて説明する。 As described above, the forcible handling process is a process for calculating all combinations of correspondences between form grid points and dictionary grid points, and will be described with reference to FIGS. 14A (B) to 14C (H).

まず、制御部１０８は、図１４Ａ（Ａ）では、最も右に位置する帳票格子点であるＸ座標１５の帳票格子点がいずれの辞書格子点にも対応付けられていないので、当該帳票格子点を最も右に位置する辞書格子点に対応付ける。すなわち、Ｘ座標６の辞書格子点とＸ座標１５の帳票格子点とが対応付けられる。 First, in FIG. 14A (A), the control unit 108 does not associate the form grid point of the X coordinate 15, which is the rightmost form grid point, with any dictionary grid point. To the rightmost dictionary grid point. That is, the dictionary lattice point of the X coordinate 6 and the form lattice point of the X coordinate 15 are associated with each other.

この場合、新たに辞書格子点と対応付けたＸ座標１５の帳票格子点よりも左に位置するＸ座標０、１、４、５、９、１１、及び１３のいずれか一つの帳票格子点と辞書格子点との対応付けを解除しなければならない。これによって、帳票画像から抽出された二つの枠を枠構造辞書６００に登録された一つの枠に対応付ける。 In this case, any one of the form grid points X coordinates 0, 1, 4, 5, 9, 11, and 13 positioned to the left of the form grid point of the X coordinate 15 newly associated with the dictionary grid point; The correspondence with dictionary grid points must be canceled. Thereby, the two frames extracted from the form image are associated with one frame registered in the frame structure dictionary 600.

図１４Ａ（Ｂ）は、１行目及び２行目のＸ座標１５の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標１３の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点との対応付けを解除した組み合わせ結果である。 14A (B) associates the form grid point of the X coordinate 15 of the first line and the second line with the dictionary grid point of the X coordinate 6 of the 0th line and the first line, and the first line and the second line. This is a combination result of canceling the association between the form grid point of the X coordinate 13 and the dictionary grid point of the X coordinate 6 in the 0th and 1st rows.

図１４Ａ（Ｃ）は、１行目及び２行目のＸ座標１５の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標１１の帳票格子点と０行目及び１行目のＸ座標５の辞書格子点との対応付けを解除した組み合わせ結果である。 FIG. 14A (C) associates the form grid point of the X coordinate 15 of the first line and the second line with the dictionary grid point of the X coordinate 6 of the 0th line and the first line, and the first line and the second line. This is a combination result of canceling the correspondence between the form grid point of the X coordinate 11 and the dictionary grid point of the X coordinate 5 in the 0th and 1st rows.

図１４Ｂ（Ｄ）は、１行目及び２行目のＸ座標１５の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標９の帳票格子点と０行目及び１行目のＸ座標４の辞書格子点との対応付けを解除した組み合わせ結果である。 FIG. 14B (D) associates the form grid point of the X coordinate 15 of the first line and the second line with the dictionary grid point of the X coordinate 6 of the 0th line and the first line, and the first line and the second line. This is a combination result in which the correspondence between the form grid point of the X coordinate 9 and the dictionary grid point of the X coordinate 4 in the 0th row and the 1st row is released.

図１４Ｂ（Ｅ）は、１行目及び２行目のＸ座標１５の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標５の帳票格子点と０行目及び１行目のＸ座標３の辞書格子点との対応付けを解除した組み合わせ結果である。 FIG. 14B (E) associates the form grid points of the X coordinate 15 of the first line and the second line with the dictionary grid points of the X coordinate 6 of the 0th line and the first line, and the first and second lines. This is a combination result of canceling the association between the form grid point of the X coordinate 5 and the dictionary grid point of the X coordinate 3 in the 0th row and the 1st row.

図１４Ｂ（Ｆ）は、１行目及び２行目のＸ座標１５の帳票格子点とＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標４の帳票格子点と０行目及び１行目のＸ座標２の辞書格子点との対応付けを解除した組み合わせ結果である。 FIG. 14B (F) associates the form grid point of the X coordinate 15 of the first line and the second line with the dictionary grid point of the X coordinate 6 and forms the grid point of the X coordinate 4 of the first line and the second line. Is a combination result in which the correspondence between the X-coordinate 2 dictionary lattice points in the 0th and 1st rows is canceled.

図１４Ｃ（Ｇ）は、１行目及び２行目のＸ座標１５の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標１の帳票格子点と０行目及び１行目のＸ座標１の辞書格子点との対応付けを解除した組み合わせ結果である。 FIG. 14C (G) associates the form grid point of the X coordinate 15 of the first line and the second line with the dictionary grid point of the X coordinate 6 of the 0th line and the first line, and the first line and the second line. This is a combination result of canceling the association between the form grid point of the X coordinate 1 and the dictionary grid point of the X coordinate 1 in the 0th and 1st lines.

図１４Ｃ（Ｈ）は、１行目及び２行目のＸ座標１５の帳票格子点と０行目及び１行目のＸ座標６の辞書格子点とを対応付け、１行目及び２行目のＸ座標０の帳票格子点と０行目及び１行目のＸ座標０の辞書格子点との対応付けを解除した組み合わせ結果である。 FIG. 14C (H) associates the form grid point of the X coordinate 15 of the first line and the second line with the dictionary grid point of the X coordinate 6 of the 0th line and the first line, and the first line and the second line. This is a combination result in which the correspondence between the form grid point of the X coordinate 0 and the dictionary grid point of the X coordinate 0 in the 0th row and the 1st row is canceled.

図１４Ｃ（Ｉ）は、図８に示すステップ８０１の処理で選択した処理対象となる行が格子点情報６１２の０行目であり、ステップ８０２の処理で選択した処理対象となる行が格子点情報５０１の２行目である場合の格子点ＤＰマッチングの結果であり、図９（Ｂ）と同じであるので説明を省略する。 In FIG. 14C (I), the row to be processed selected in the processing of step 801 shown in FIG. 8 is the 0th row of the grid point information 612, and the row to be processed selected in the processing of step 802 is the grid point. The result of the lattice point DP matching in the case of the second line of the information 501 is the same as that in FIG.

以上のように、格子点ＤＰマッチングによって、図１４Ａ（Ａ）〜図１４Ｃ（Ｉ）に示す９通りの組み合わせ結果が算出される。なお、以下では、これらのすべての組み合わせ結果が照合結果となったことを前提に説明する。 As described above, nine combinations of results shown in FIGS. 14A (A) to 14C (I) are calculated by lattice point DP matching. In the following description, it is assumed that all of these combination results are matching results.

また、図１４Ｂ（Ｄ）に示す組み合わせ結果が、図１２に示す接触文字「１」を罫線として抽出していない正しい組み合わせ結果である。 Further, the combination result shown in FIG. 14B (D) is a correct combination result in which the contact character “1” shown in FIG. 12 is not extracted as a ruled line.

次に、図１４Ａ（Ａ）〜図１４Ｃ（Ｉ）に示す照合結果に対して文字行抽出処理が実行されることによって抽出された文字行について、図１５を用いて説明する。 Next, a character line extracted by executing a character line extraction process on the matching results shown in FIGS. 14A (A) to 14C (I) will be described with reference to FIG.

図１５は、本発明の実施形態の文字行抽出処理によって抽出された文字行の一例の説明図である。 FIG. 15 is an explanatory diagram illustrating an example of a character line extracted by the character line extraction process according to the embodiment of this invention.

図１０に示す文字行抽出処理のステップ１００１の処理で、制御部１０８は、図１４Ａ（Ａ）〜図１４Ｃ（Ｉ）に示す照合結果から、処理対象の照合結果として図１４Ａ（Ａ）に示す照合結果を選択する。 In the process of step 1001 of the character line extraction process shown in FIG. 10, the control unit 108 shows the collation result shown in FIG. 14A (A) from the collation results shown in FIGS. 14A (A) to 14C (I). Select the verification result.

そして、制御部１０８は、図１４Ａ（Ａ）に示す照合結果にステップ１００２〜１００４の処理を実行する。なお、図１４Ａ（Ａ）に示す照合結果に対してステップ１００２〜１００４の処理が実行される前に抽出された文字行はないものとする。 And the control part 108 performs the process of step 1002-1004 to the collation result shown to FIG. 14A (A). Note that it is assumed that there is no character line extracted before the processing of steps 1002 to 1004 is performed on the collation result shown in FIG. 14A (A).

図１４Ａ（Ａ）に示す照合結果に対してステップ１００２〜１００４の処理が実行されることによって、文字行１５０１〜１５０６が抽出される。 Character lines 1501 to 1506 are extracted by executing the processing of steps 1002 to 1004 on the collation result shown in FIG. 14A (A).

具体的には、図１５に示す「収納機関」「番号」１５０１は、図１４Ａ（Ａ）に示す座標（０，１）、（０，２）、（１，２）、（１，１）の枠から抽出された文字行であり、「収納機関」「番号」１５０１は、座標(０，１)、（０，２）、（１，２）、（１，１）と対応付けて記憶される。 Specifically, the “storage organization” and “number” 1501 shown in FIG. 15 are the coordinates (0, 1), (0, 2), (1, 2), (1, 1) shown in FIG. 14A (A). The character line extracted from the frame is stored in association with the coordinates (0, 1), (0, 2), (1, 2), (1, 1). Is done.

また、図１５に示す「１２３４５」１５０２は、図１４Ａ（Ａ）に示す座標（１，１）、（１，２）、（４，２）、（４，１）の枠から抽出された文字行であり、「１２３４５」１５０２は、座標（１，１）、（１，２）、（４，２）、（４，１）と対応付けて記憶される。 “12345” 1502 shown in FIG. 15 is a character extracted from the frame of coordinates (1,1), (1,2), (4,2), (4,1) shown in FIG. 14A (A). “12345” 1502 is stored in association with coordinates (1,1), (1,2), (4,2), and (4,1).

また、図１５に示す「お客様」「番号」１５０３は、図１４（Ａ）に示す座標（４，１）、（４，２）、（５，２）、（５，１）の枠から抽出された文字行であり、「お客様」「番号」１５０３は、座標（４，１）、（４，２）、（５，２）、（５，１）と対応付けて記憶される。 Further, “customer” “number” 1503 shown in FIG. 15 is extracted from the frame of coordinates (4, 1), (4, 2), (5, 2), (5, 1) shown in FIG. The “customer” and “number” 1503 are stored in association with the coordinates (4, 1), (4, 2), (5, 2), (5, 1).

また、図１５に示す「０００００」１５０４は、図１４（Ａ）に示す座標（５，１）、（５，２）、（９，２）、（９，１）の枠から抽出された文字行であり、「０００００」１５０４は、座標（５，１）、（５，２）、（９，２）、（９，１）と対応付けて記憶される。 Also, “00000” 1504 shown in FIG. 15 is a character extracted from the frame of coordinates (5, 1), (5, 2), (9, 2), (9, 1) shown in FIG. “00000” 1504 is stored in association with coordinates (5, 1), (5, 2), (9, 2), and (9, 1).

また、図１５に示す「０」１５０５は、図１４（Ａ）に示す座標（９，１）、（９，２）、（１１，２）、（１１，１）の枠から抽出された文字行であり、「０」１５０５は、座標（９，１）、（９，２）、（１１，２）、（１１，１）と対応付けて記憶される。 Further, “0” 1505 shown in FIG. 15 is a character extracted from the frame of coordinates (9, 1), (9, 2), (11, 2), (11, 1) shown in FIG. “0” 1505 is stored in association with coordinates (9, 1), (9, 2), (11, 2), and (11, 1).

また、図１５に示す「確認」「番号」１５０６は、図１４（Ａ）に示す座標（１１，１）、（１１，２）、（１３，２）、（１３，１）の枠から抽出された文字行であり、「確認」「番号」１５０６は、座標（１１，１）、（１１，２）、（１３，２）、（１３，１）と対応付けて記憶される。 15 is extracted from the frame of coordinates (11, 1), (11, 2), (13, 2), (13, 1) shown in FIG. The “confirmation” and “number” 1506 are stored in association with the coordinates (11, 1), (11, 2), (13, 2), and (13, 1).

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ａ（Ｂ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14A (B) as the collation result to be processed.

この場合、図１４Ａ（Ｂ）に示す照合結果の枠のうち、座標（１１，１）、（１１，２）、（１５，２）、（１５，１）の枠よりも左に位置する枠から文字行は、すでに抽出されているので、座標（１１，１）、（１１，２）、（１５，２）、（１５，１）の枠から文字行を抽出する。 In this case, among the frames of the matching results shown in FIG. 14A (B), the frame is positioned to the left of the frame of coordinates (11, 1), (11, 2), (15, 2), (15, 1). Since the character line has already been extracted, the character line is extracted from the frame of the coordinates (11, 1), (11, 2), (15, 2), (15, 1).

図１５に示す「確認番号」「７８９０１２」１５０７は、図１４Ａ（Ｂ）に示す座標（１１，１）、（１１，２）、（１５，２）、（１５，１）の枠から抽出された文字行であり、「確認番号」「７８９０１２」１５０７は、座標（１１，１）、（１１，２）、（１５，２）、（１５，１）と対応付けて記憶される。 15 is extracted from the frame of coordinates (11, 1), (11, 2), (15, 2), (15, 1) shown in FIG. 14A (B). The “confirmation number” “789012” 1507 is stored in association with the coordinates (11, 1), (11, 2), (15, 2), (15, 1).

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ａ（Ｃ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14A (C) as the collation result to be processed.

この場合、図１４Ａ（Ｃ）に示す照合結果の枠のうち、座標（９，１）、（９，２）、（１３，２）、（１３，１）の枠よりも左に位置する枠から文字行は、すでに抽出されているので、座標（９，１）、（９，２）、（１３，２）、（１３，１）の枠及び座標（１３，１）、（１３，２）、（１５，２）、（１５，１）の枠から文字行を抽出する。 In this case, among the frames of the matching results shown in FIG. 14A (C), the frame is located to the left of the frame of coordinates (9, 1), (9, 2), (13, 2), (13, 1). Since the character line has already been extracted from the frame of coordinates (9,1), (9,2), (13,2), (13,1) and coordinates (13,1), (13,2) ), (15, 2), and a character line is extracted from the frame of (15, 1).

図１５に示す「０」「確認番号」１５０８は、図１４Ａ（Ｃ）に示す座標（９，１）、（９，２）、（１３，２）、（１３，１）の枠から抽出された文字行であり、「０」「確認番号」１５０８は、座標（９，１）、（９，２）、（１３，２）、（１３，１）と対応付けて記憶される。 “0” “confirmation number” 1508 shown in FIG. 15 is extracted from the frame of coordinates (9, 1), (9, 2), (13, 2), (13, 1) shown in FIG. 14A (C). “0” and “confirmation number” 1508 are stored in association with coordinates (9, 1), (9, 2), (13, 2), and (13, 1).

図１５に示す「７８９０１２」１５０９は、図１４Ａ（Ｃ）に示す座標（１３，１）、（１３，２）、（１５，２）、（１５，１）の枠から抽出された文字行であり、「７８９０１２」１５０９は、座標（１３，１）、（１３，２）、（１５，２）、（１５，１）と対応付けて記憶される。 “789012” 1509 shown in FIG. 15 is a character line extracted from the frame of coordinates (13, 1), (13, 2), (15, 2), (15, 1) shown in FIG. 14A (C). Yes, “7899012” 1509 is stored in association with coordinates (13, 1), (13, 2), (15, 2), and (15, 1).

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ｂ（Ｄ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14B (D) as the collation result to be processed.

この場合、図１４Ｂ（Ｄ）に示す照合結果の枠のうち、座標（５，１）、（５，２）、（１１，２）、（１１，１）以外の枠から文字行は、すでに抽出されているので、座標（５，１）、（５，２）、（１１，２）、（１１，１）の枠から文字行を抽出する。 In this case, the character lines from the frames other than the coordinates (5, 1), (5, 2), (11, 2), (11, 1) in the matching result frame shown in FIG. Since it has been extracted, a character line is extracted from the frame of coordinates (5, 1), (5, 2), (11, 2), (11, 1).

図１５に示す「０００００１０」１５１１は、図１４Ｂ（Ｄ）に示す座標（５，１）、（５，２）、（１１，２）、（１１，１）の枠から抽出された文字行であり、「０００００１０」１５１１は、座標（５，１）、（５，２）、（１１，２）、（１１，１）と対応付けて記憶される。 “0000010” 1511 shown in FIG. 15 is a character line extracted from the frame of coordinates (5, 1), (5, 2), (11, 2), (11, 1) shown in FIG. 14B (D). Yes, “0000010” 1511 is stored in association with coordinates (5, 1), (5, 2), (11, 2), (11, 1).

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ｂ（Ｅ）に示す照合結果を選択する。 Next, the control part 108 selects the collation result shown to FIG. 14B (E) as a collation result of a process target by the process of step 1001. FIG.

この場合、図１４Ｂ（Ｅ）に示す照合結果の枠のうち、座標（４，１）、（４，２）、（９，２）、（９，１）以外の枠から文字行は、すでに抽出されているので、座標（４，１）、（４，２）、（９，２）、（９，１）の枠から文字行を抽出する。 In this case, character lines from frames other than the coordinates (4, 1), (4, 2), (9, 2), and (9, 1) in the matching result frame shown in FIG. Since it has been extracted, a character line is extracted from the frame of coordinates (4, 1), (4, 2), (9, 2), (9, 1).

図１５に示す「お客様番号」「０００００」１５１０は、図１４Ｂ（Ｅ）に示す座標（４，１）、（４，２）、（９，２）、（９，１）の枠から抽出された文字行であり、「お客様番号」「０００００」１５１０は、座標（４，１）、（４，２）、（９，２）、（９，１）と対応付けて記憶される。 “Customer number” “00000” 1510 shown in FIG. 15 is extracted from the frame of coordinates (4, 1), (4, 2), (9, 2), (9, 1) shown in FIG. 14B (E). “Customer number” “00000” 1510 is stored in association with coordinates (4, 1), (4, 2), (9, 2), (9, 1).

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ｂ（Ｆ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14B (F) as the collation result to be processed.

この場合、図１４Ｂ（Ｆ）に示す照合結果の枠のうち、座標（１，１）、（１，２）、（５，２）、（５，１）以外の枠から文字行は、すでに抽出されているので、座標（１，１）、（１，２）、（５，２）、（５，１）の枠から文字行を抽出する。 In this case, the character lines from the frames other than the coordinates (1, 1), (1, 2), (5, 2), (5, 1) among the frames of the matching result shown in FIG. Since it has been extracted, a character line is extracted from the frame of coordinates (1,1), (1,2), (5,2), (5,1).

図１５に示す「１２３４５」「お客様番号」１５１２は、図１４Ｂ（Ｆ）に示す座標（１，１）、（１，２）、（５，２）、（５，１）の枠から抽出された文字行であり、「１２３４５」「お客様番号」１５１２は、座標（１，１）、（１，２）、（５，２）、（５，１）と対応付けて記憶される。 “12345” and “customer number” 1512 shown in FIG. 15 are extracted from the frame of coordinates (1, 1), (1, 2), (5, 2), (5, 1) shown in FIG. 14B (F). “12345” and “customer number” 1512 are stored in association with coordinates (1,1), (1,2), (5,2), and (5,1).

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ｃ（Ｇ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14C (G) as the collation result to be processed.

この場合、図１４Ｃ（Ｇ）に示す照合結果の枠のうち、座標（０，１）、（０，２）、（４，２）、（４，１）以外の枠から文字行は、すでに抽出されているので、座標（０，１）、（０，２）、（４，２）、（４，１）の枠から文字行を抽出する。 In this case, character lines from frames other than the coordinates (0, 1), (0, 2), (4, 2), and (4, 1) in the matching result frame shown in FIG. Since it has been extracted, a character line is extracted from the frame of coordinates (0, 1), (0, 2), (4, 2), (4, 1).

図１５に示す「収納機関番号」「１２３４５」１５１３は、図１４Ｃ（Ｇ）に示す座標座標（０，１）、（０，２）、（４，２）、（４，１）の枠から抽出された文字行であり、「収納機関番号」「１２３４５」１５１３は、座標（０，１）、（０，２）、（４，２）、（４，１）と対応付けて記憶される。 “Storage engine number” “12345” 1513 shown in FIG. 15 is obtained from the frame of coordinate coordinates (0, 1), (0, 2), (4, 2), (4, 1) shown in FIG. 14C (G). The extracted character line and “storage organization number” “12345” 1513 are stored in association with the coordinates (0, 1), (0, 2), (4, 2), (4, 1). .

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ｃ（Ｈ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14C (H) as the collation result to be processed.

この場合、図１４Ｃ（Ｇ）に示す照合結果のすべての枠の文字行はすでに抽出されているので、制御部１０８は文字行を抽出しない。 In this case, since the character lines in all the frames of the matching result shown in FIG. 14C (G) have already been extracted, the control unit 108 does not extract the character lines.

次に、制御部１０８は、ステップ１００１の処理で、処理対象の照合結果として図１４Ｃ（Ｉ）に示す照合結果を選択する。 Next, in step 1001, the control unit 108 selects the collation result shown in FIG. 14C (I) as the collation result to be processed.

図１４Ｃ（Ｉ）に示す照合結果の枠から抽出される文字行は図１５に示す文字行１５２１〜１５２６である。文字行１５２１〜１５２６については図１１に示す文字行１１１１〜１１１６と同じであるので説明を省略する。 Character lines extracted from the collation result frame shown in FIG. 14C (I) are character lines 1521 to 1526 shown in FIG. The character lines 1521 to 1526 are the same as the character lines 1111 to 1116 shown in FIG.

図１６は、本発明の実施形態の候補文字ネットワークの説明図である。 FIG. 16 is an explanatory diagram of a candidate character network according to the embodiment of this invention.

図２に示すステップ２０７の処理では、抽出された文字行から文字パタン候補が切り出され、ステップ２０８の処理では、切り出された文字パタン候補に対して、文字パタン候補に類似する文字（識別候補）と文字パタン候補の各識別候補に対する類似度とを算出し、候補文字ネットワークを生成する文字識別処理が実行される。 In the process of step 207 shown in FIG. 2, character pattern candidates are cut out from the extracted character line. In the process of step 208, characters similar to the character pattern candidates (identification candidates) are compared with the cut out character pattern candidates. And character recognition processing for calculating the similarity of each character pattern candidate to each identification candidate and generating a candidate character network.

文字パタン候補切出し及び文字識別処理は、特開２００４−１７１３１６号公報に開示された手法を用いるが、他の手法を用いてもよい。 For the character pattern candidate extraction and character identification processing, the method disclosed in Japanese Patent Application Laid-Open No. 2004-171316 is used, but other methods may be used.

図１６の（ａ）は、文字行抽出処理で抽出された文字行１６０１を示す。ここで、制御部１０８は、ステップ２０７の処理で、文字行１６０１から文字らしいパタンの候補を切出す。切り出された文字パタン候補は、図１６の（ｂ）に示すようにネットワークの形式で保存する。 (A) of FIG. 16 shows the character line 1601 extracted by the character line extraction process. Here, the control unit 108 cuts out a pattern candidate that seems to be a character from the character line 1601 in the process of step 207. The extracted character pattern candidates are stored in a network format as shown in FIG.

次に、ステップ２０８の処理で、制御部１０８は、切り出された文字パタン候補に対して文字識別処理を実行する。文字識別処理では、識別誤りを考慮し、複数の識別候補及び文字パタン候補の各識別候補に対する類似度が算出され、文字識別処理の識別結果として、図１６の（ｂ）に示す候補文字ネットワークが生成される。文字パタン候補の切り出し位置をノード１６０２とし、切り出された文字パタン候補はアーク１６０３とする。 Next, in the process of step 208, the control unit 108 executes a character identification process on the extracted character pattern candidate. In the character identification process, a recognition error is taken into consideration, the similarity to each identification candidate of a plurality of identification candidates and character pattern candidates is calculated, and the candidate character network shown in FIG. Generated. The cutout position of the character pattern candidate is a node 1602, and the cutout character pattern candidate is an arc 1603.

図１６の（ｃ）は、候補文字ネットワークのデータの詳細を示す。図１６の（ｃ）では、識別候補を１６０５に示し、文字パタン候補の各識別候補に対する類似度を１６０６に示す。 FIG. 16C shows details of candidate character network data. In FIG. 16C, identification candidates are indicated by 1605, and the similarity of each character pattern candidate to each identification candidate is indicated by 1606.

図１７は、本発明の実施形態の特定項目文字列照合処理のＰＡＤ図である。 FIG. 17 is a PAD diagram of specific item character string matching processing according to the embodiment of this invention.

特定項目文字列照合処理は、文字識別処理の結果と、当該文字識別処理を実行した枠に対応付けられた枠構造辞書に登録された枠の文字列情報と、を照合する。 The specific item character string collation process collates the result of the character identification process with the character string information of the frame registered in the frame structure dictionary associated with the frame on which the character identification process is executed.

まず、制御部１０８は、図７に示すステップ７０５の処理で記憶部１１０に記憶された照合結果の数だけステップ１７０２の処理以降の処理を実行する（１７０１）。ステップ１７０１の処理では、制御部１０８は、記憶部１１０に記憶された照合結果から、処理対象となる照合結果を一つ選択する。 First, the control unit 108 executes processing subsequent to step 1702 by the number of collation results stored in the storage unit 110 in step 705 shown in FIG. 7 (1701). In the processing of step 1701, the control unit 108 selects one matching result to be processed from the matching results stored in the storage unit 110.

次に、制御部１０８は、ステップ１７０１の処理で選択された照合結果に存在する枠の数だけ、ステップ１７０３の処理以降の処理を実行する（１７０２）。ステップ１７０２の処理では、制御部１０８は、ステップ１７０１の処理で選択された照合結果に存在する枠から、処理対象となる枠を一つ選択する。 Next, the control unit 108 executes processing subsequent to the processing in step 1703 by the number of frames present in the collation result selected in the processing in step 1701 (1702). In the processing in step 1702, the control unit 108 selects one frame to be processed from the frames existing in the collation result selected in the processing in step 1701.

次に、制御部１０８は、処理対象の枠から抽出された文字行が複数行存在するか否かを判定する（１７０３）。ここで、文字行が複数行存在するとは、例えば、図１１に示す文字行１１０１、１１０３、及び１１０５である。 Next, the control unit 108 determines whether or not there are a plurality of character lines extracted from the frame to be processed (1703). Here, there are a plurality of character lines, for example, the character lines 1101, 1103, and 1105 shown in FIG.

ステップ１７０３の処理で、処理対象の枠から抽出された文字行が複数行存在すると判定された場合、制御部１０８は、複数行の文字行に対してステップ２０８の処理の文字識別処理で算出された候補文字ネットワークを連結し（１７０４）、ステップ１７０５の処理に進む。具体的には、制御部１０８は、Ｙ座標で上から下の順番で複数行の文字行の候補文字ネットワークを連結する。ステップ１７０４の処理の詳細は図１８で詳細を説明する。 If it is determined in step 1703 that there are a plurality of character lines extracted from the frame to be processed, the control unit 108 calculates the character identification process of step 208 for the plurality of character lines. The candidate character networks are connected (1704), and the process proceeds to step 1705. Specifically, the control unit 108 connects the candidate character networks of a plurality of character lines in the order from top to bottom in the Y coordinate. Details of the processing in step 1704 will be described with reference to FIG.

ステップ１７０３の処理で、処理対象の枠から抽出された文字行が複数行存在しないと判定された場合、又は、ステップ１７０４の処理が実行された場合、制御部１０８は、処理対象の枠に対応する枠構造辞書６００の枠の属性情報と、処理対象の枠の候補文字ネットワークとを照合することによって、処理対象の枠の属性情報に対する枠スコアを算出する（１７０５）。 When it is determined in the process of step 1703 that there are not a plurality of character lines extracted from the frame to be processed, or when the process of step 1704 is executed, the control unit 108 corresponds to the frame to be processed. By comparing the frame attribute information of the frame structure dictionary 600 to be processed with the candidate character network of the frame to be processed, a frame score for the attribute information of the frame to be processed is calculated (1705).

ステップ１７０２の処理では、処理対象の照合結果のすべての枠にステップ１７０３〜１７０５の処理が実行された場合、つまり、処理対象の照合結果のすべての枠の枠スコアが算出された場合、制御部１０８は、枠スコアに基づいて当該処理対象の照合結果のスコアを算出する（１７０６）。照合結果のスコアは、例えば、枠スコアの平均値であってもよいし、枠スコアの和であってもよい。 In the processing of Step 1702, when the processing of Steps 1703 to 1705 is executed for all the frames of the processing target collation results, that is, when the frame scores of all the frames of the processing target collation results are calculated, the control unit 108 calculates the score of the verification result of the processing target based on the frame score (1706). The score of the matching result may be, for example, an average value of frame scores or a sum of frame scores.

ステップ１７０１の処理では、記憶部１１０に記憶されたすべての照合結果に対してステップ１７０６の処理でスコアが算出された場合、制御部１０８は、スコアが最大の照合結果を項目照合結果として返し（１７０７）、特定項目文字列照合処理を終了する。 In the processing of step 1701, when scores are calculated in the processing of step 1706 for all the matching results stored in the storage unit 110, the control unit 108 returns the matching result having the maximum score as an item matching result ( 1707), the specific item character string matching process is terminated.

図１８は、本発明の実施形態の複数行の文字行の候補文字ネットワークを連結する処理の説明図である。 FIG. 18 is an explanatory diagram of processing for connecting candidate character networks of a plurality of character lines according to the embodiment of this invention.

文字行１８０１は、「収納機関」の文字行１８０２と「番号」の文字行１８０４との２行の文字行によって構成される。 The character line 1801 is composed of two character lines, a character line 1802 of “storage organization” and a character line 1804 of “number”.

文字行１８０２に対して文字識別処理が実行されて、候補文字ネットワーク１８０３が生成されている。また、文字行１８０４に対して文字識別処理が実行されて、候補文字ネットワーク１８０５が生成されている。 A character identification process is executed on the character line 1802 to generate a candidate character network 1803. In addition, a character identification process is executed on the character line 1804 to generate a candidate character network 1805.

ステップ１７０４の処理で、文字行１８０１の候補文字ネットワーク１８０３と候補文字ネットワーク１８０５とが連結され、連結候補文字ネットワーク１８０６が生成される。 In the process of step 1704, the candidate character network 1803 and the candidate character network 1805 of the character line 1801 are connected, and a connection candidate character network 1806 is generated.

図１９は、本発明の実施形態の候補文字ネットワークと枠構造辞書の文字列情報とを照合する処理の説明図である。 FIG. 19 is an explanatory diagram of a process of collating the candidate character network with the character string information of the frame structure dictionary according to the embodiment of this invention.

図１９に示す１９０１は、図１８に示す連結候補文字ネットワーク１８０６の各識別候補を枠構造辞書６００の属性情報と照合した結果のパスである。また、図１９に示す１９０２は、連結候補文字ネットワーク１８０６のデータの詳細である。 1901 shown in FIG. 19 is a path as a result of collating each identification candidate of the connection candidate character network 1806 shown in FIG. 18 with the attribute information of the frame structure dictionary 600. Further, reference numeral 1902 shown in FIG. 19 indicates the details of the data of the link candidate character network 1806.

図１９に示す１９０３は、連結候補文字ネットワーク１８０６の切出候補の識別候補のうち、対応する枠構造辞書６００の属性情報と一致する識別候補（識別結果）を示す。 19 indicates an identification candidate (identification result) that matches the attribute information of the corresponding frame structure dictionary 600 among the identification candidates of the extraction candidates of the connection candidate character network 1806.

連結候補文字ネットワーク１８０６の基になった文字行が、図９（Ａ）に示す照合結果の座標（０，１）、（０、２）、（１，２）、（１，１）の枠から抽出されているとした場合、当該枠は、枠構造辞書６００の第１座標６０５の座標（０，０）、（０，１）、（１，１）、（１，０）に対応する。このため、連結候補文字ネットワーク１８０６の切出候補の識別候補を、枠構造辞書６００の文字列定義６０４に登録された「収納機関番号」と照合する。 The character line that is the basis of the concatenation candidate character network 1806 is a frame of coordinates (0, 1), (0, 2), (1, 2), (1, 1) of the collation result shown in FIG. Is extracted from the frame, the frame corresponds to the coordinates (0, 0), (0, 1), (1, 1), (1, 0) of the first coordinate 605 of the frame structure dictionary 600. . For this reason, the candidate for identification of the extraction candidate in the connection candidate character network 1806 is collated with the “storage organization number” registered in the character string definition 604 of the frame structure dictionary 600.

図１８に示す連結候補文字ネットワーク１８０６の同じノードの間に位置するアークが示す切出候補と照合される枠構造辞書６００の文字列定義６０４の同じ文字である。例えば、図１９に示す番号１、２、３の切出候補は、枠構造辞書６００の文字列定義６０４の「収」と照合される。 This is the same character in the character string definition 604 of the frame structure dictionary 600 that is collated with the extraction candidate indicated by the arc located between the same nodes of the linked candidate character network 1806 shown in FIG. For example, the extraction candidates of numbers 1, 2, and 3 shown in FIG. 19 are collated with “Yen” in the character string definition 604 of the frame structure dictionary 600.

図１９に示す番号１の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「収」と一致する識別候補が存在するので、当該識別候補を識別結果とする。 In the identification candidate of the extraction candidate No. 1 shown in FIG. 19, there exists an identification candidate that matches the “accommodation” of the “storage organization number” registered in the character string definition 604. To do.

図１９に示す番号２及び番号３の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「収」と一致する識別候補が存在しない。 In the identification candidates of the extraction candidates No. 2 and No. 3 shown in FIG. 19, there is no identification candidate that matches the “accommodation” of the “storage organization number” registered in the character string definition 604.

図１９に示す番号４の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「納」と一致する識別候補が存在するので、当該識別候補を識別結果とする。 In the identification candidate of the extraction candidate number 4 shown in FIG. 19, there exists an identification candidate that matches the “payment” of the “storage organization number” registered in the character string definition 604. To do.

図１９に示す番号５及び番号６の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「納」と一致する識別候補が存在しない。 The identification candidates corresponding to the extraction candidates No. 5 and No. 6 shown in FIG. 19 do not have an identification candidate that matches the “payment” of the “storage organization number” registered in the character string definition 604.

図１９に示す番号７の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「機」と一致する識別候補が存在するので、当該識別候補を識別結果とする。 In the identification candidate of the extraction candidate No. 7 shown in FIG. 19, there is an identification candidate that matches the “machine” of the “storage organization number” registered in the character string definition 604. To do.

図１９に示す番号８の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「関」と一致する識別候補が存在するので、当該識別候補を識別結果とする。 In the identification candidate of the extraction candidate number 8 shown in FIG. 19, there exists an identification candidate that matches the “relationship” of the “storage organization number” registered in the character string definition 604. To do.

図１９に示す番号９及び番号１０の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「関」と一致する識別候補が存在しない。 In the extraction candidate identification candidates No. 9 and No. 10 shown in FIG. 19, there is no identification candidate that coincides with “Seki” of “Storage Organization Number” registered in the character string definition 604.

図１９に示す番号１１の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「番」と一致する識別候補が存在するので、当該識別候補を識別結果とする。 In the identification candidate of the extraction candidate number 11 shown in FIG. 19, there is an identification candidate that matches the “number” of the “storage organization number” registered in the character string definition 604. To do.

図１９に示す番号１２の切出候補の識別候補には、文字列定義６０４に登録された「収納機関番号」の「号」と一致する識別候補が存在するので、当該識別候補を識別結果とする。 The identification candidate of the extraction candidate number 12 shown in FIG. 19 includes an identification candidate that matches the “number” of the “storage organization number” registered in the character string definition 604. Therefore, the identification candidate is identified as the identification result. To do.

なお、各切出候補の識別結果に対応する類似度に基づいて枠スコアが算出される。枠スコアは、各切出候補の識別結果に対応する類似度の平均であってもよいし、各切出候補の識別結果に対応する類似度の和であってもよい。 A frame score is calculated based on the similarity corresponding to the identification result of each extraction candidate. The frame score may be the average of the similarities corresponding to the identification results of each extraction candidate, or may be the sum of the similarities corresponding to the identification results of each extraction candidate.

また、図１８に示す連結候補字ネットワーク１８０６の同じノードの間に位置するアークが示すすべての切出候補に識別結果が存在しない場合がある。これは、帳票に記載された文字が小さく、スキャナが帳票を取り込む際の解像度が低い場合等に、文字がつぶれて取り込まれることに起因する。このように、枠構造辞書６００の文字列定義６０４に登録された文字列の文字のうち、識別結果が存在しない文字を不読文字という。 Further, there may be a case where there is no identification result for all extraction candidates indicated by arcs located between the same nodes of the linked candidate character network 1806 shown in FIG. This is because the characters written on the form are small and the characters are crushed and captured when the resolution when the scanner captures the form is low. As described above, among the characters in the character string registered in the character string definition 604 of the frame structure dictionary 600, a character for which no identification result exists is referred to as an unread character.

本実施形態では、文字列定義６０４に登録された文字列に不読文字が存在しても、不読文字が所定数以下であれば、枠の文字行を不読とせずに、次の切出候補に対して照合処理を継続して実行する。所定数は、例えば、文字列定義６０４に登録された文字列の文字数の１／３とする。 In the present embodiment, even if there is an unread character in the character string registered in the character string definition 604, if the number of unread characters is equal to or less than a predetermined number, the character line in the frame is not unread and The matching process is continuously executed for the candidate. The predetermined number is, for example, 1/3 of the number of characters in the character string registered in the character string definition 604.

これによって、帳票画像の枠の文字が認識でなくても、枠構造辞書６００の当該枠に対応する文字列定義６０４によって当該文字を補完することができる。 Thus, even if the characters in the frame of the form image are not recognized, the characters can be complemented by the character string definition 604 corresponding to the frame in the frame structure dictionary 600.

さらに、本実施形態では、制御部１０８は、枠スコアが所定値以上であれば、当該枠スコアが算出された枠を、当該枠と照合した枠構造辞書６００の文字列定義６０４として認識する。これによって、不読文字が存在しても、枠が枠構造辞書６００のいずれの枠かを特定することができる。 Furthermore, in this embodiment, if the frame score is equal to or greater than a predetermined value, the control unit 108 recognizes the frame for which the frame score is calculated as the character string definition 604 of the frame structure dictionary 600 collated with the frame. As a result, even if an unread character exists, it is possible to specify which frame of the frame structure dictionary 600 is the frame.

図２０は、本発明の実施形態の特定項目文字列照合結果の説明図である。 FIG. 20 is an explanatory diagram of a specific item character string matching result according to the embodiment of this invention.

説明のために、図９（Ａ）に示す枠構造照合結果９０１の帳票画像内における領域を照合領域２００１として図示する。 For the sake of explanation, an area in the form image of the frame structure matching result 901 shown in FIG.

枠構造照合結果９０１の各枠に対して特定項目文字列照合処理を実行した結果である特定項目文字列照合結果は、読取結果２００２、枠スコア２００３、枠構造照合結果のスコア２０１７を含む。読取結果２００２、及び枠スコア２００３は、枠構造照合結果９０１の各枠に対応する枠構造辞書６００の項目６０２、属性６０３、及び文字列定義６０４と対応付けて、記憶部１１０に記憶される。 The specific item character string matching result that is a result of executing the specific item character string matching process for each frame of the frame structure matching result 901 includes a reading result 2002, a frame score 2003, and a score 2017 of the frame structure matching result. The reading result 2002 and the frame score 2003 are stored in the storage unit 110 in association with the item 602, the attribute 603, and the character string definition 604 of the frame structure dictionary 600 corresponding to each frame of the frame structure matching result 901.

図２０に示す２０１１は、項目６０２が「収納機関番号」で、属性６０３が「項目名」で、文字列定義６０４が「収納機関番号」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行の読取結果２００２が「機」が不読文字であることを示し、枠スコア２００３が「２０６」であることを示す。 In FIG. 20, reference numeral 2011 denotes a frame structure matching result 901 corresponding to a frame in which the item 602 is “storage organization number”, the attribute 603 is “item name”, and the character string definition 604 is “storage organization number”. The read result 2002 of the extracted character line indicates that “machine” is an unread character, and the frame score 2003 is “206”.

図２０に示す２０１２は、項目６０２が「収納機関番号」で、属性６０３が「データ」で、文字列定義６０４が「特定桁数数字列」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行の読取結果２００２が「１２３４５」で、枠スコア２００３が「２４６」であることを示す。 In FIG. 20, a frame structure check result 901 corresponding to a frame in which the item 602 is “storage organization number”, the attribute 603 is “data”, and the character string definition 604 is “specific digit number string”. This indicates that the read result 2002 of the character line extracted from “12345” is “12345” and the frame score 2003 is “246”.

図２０に示す２０１３は、項目６０２が「お客様番号」で、属性６０３が「項目名」で、文字列定義６０４が「お客様番号」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行の読取結果２００２が「お客様番号」で、枠スコア２００３が「２３５」であることを示す。 20 shown in FIG. 20 is extracted from the frame of the frame structure matching result 901 corresponding to the frame in which the item 602 is “customer number”, the attribute 603 is “item name”, and the character string definition 604 is “customer number”. It indicates that the read result 2002 of the character line is “customer number” and the frame score 2003 is “235”.

図２０に示す２０１４は、項目６０２が「お客様番号」で、属性６０３が「データ」で、文字列定義６０４が「数字列」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行の読取結果２００２が「６７８９０１２３４５６」で、枠スコア２００３が「２４２」であることを示す。 In FIG. 20, a character extracted from the frame of the frame structure matching result 901 corresponding to the frame in which the item 602 is “customer number”, the attribute 603 is “data”, and the character string definition 604 is “numeric string”. The row reading result 2002 is “67890123456”, and the frame score 2003 is “242”.

図２０に示す２０１５は、項目６０２が「確認番号」で、属性６０３が「項目名」で、文字列定義６０４が「確認番号」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行が文字列定義６０４と照合できなかったことを示す。このため、読取結果２００２が「？」で枠スコア２００３が「０」である。 20 shown in FIG. 20 is extracted from the frame of the frame structure matching result 901 corresponding to the frame in which the item 602 is “confirmation number”, the attribute 603 is “item name”, and the character string definition 604 is “confirmation number”. This indicates that the character line could not be matched with the character string definition 604. For this reason, the read result 2002 is “?” And the frame score 2003 is “0”.

図２０に示す２０１６は、項目６０２が「確認番号」で、属性６０３が「データ」で、文字列定義６０４が「特定桁数数字列」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行の読取結果２００２が「７８９０１２」で、枠スコア２００３が「２３４」であることを示す。 In FIG. 20, reference numeral 2016 denotes a frame structure matching result 901 corresponding to a frame in which the item 602 is “confirmation number”, the attribute 603 is “data”, and the character string definition 604 is “specific number digit string”. This indicates that the read result 2002 of the extracted character line is “789012” and the frame score 2003 is “234”.

枠構造照合結果９０１のスコア２０１７は各枠スコアの総和であり、「１１６３」である。なお、枠構造照合結果のスコア２０１７は各枠スコアの平均であってもよい。 The score 2017 of the frame structure matching result 901 is the sum of the frame scores and is “1163”. The frame structure matching result score 2017 may be an average of the frame scores.

図２１は、本発明の実施形態の特定項目文字列照合の説明図である。なお、図２１のうち図２０の構成と同じ構成は、同じ符号を付与し、説明を省略する。 FIG. 21 is an explanatory diagram of specific item character string matching according to the embodiment of this invention. 21 that are the same as those in FIG. 20 are given the same reference numerals, and descriptions thereof are omitted.

説明のために、図９（Ｂ）に示す枠構造照合結果９０２の帳票画像内における領域を照合領域２１０１として図示する。 For the sake of explanation, an area in the form image of the frame structure matching result 902 shown in FIG. 9B is shown as a matching area 2101.

枠構造照合結果９０２の特定項目文字列照合結果は、読取結果２００２、枠スコア２００３、枠構造照合結果のスコア２０１７を含む。 The specific item character string matching result of the frame structure matching result 902 includes a reading result 2002, a frame score 2003, and a score 2017 of the frame structure matching result.

図２１に示す２１１１は、項目６０２が「収納機関番号」で、属性６０３が「項目名」で、文字列定義６０４が「収納機関番号」である枠に対応する枠構造照合結果９０２の枠から抽出した文字行の読取結果２００２が「収納機関」が不読文字であり、「番号」が照合されたことを示し、枠スコア２００３が「５４」であることを示す。 2111 shown in FIG. 21 is from the frame of the frame structure matching result 902 corresponding to the frame in which the item 602 is “storage organization number”, the attribute 603 is “item name”, and the character string definition 604 is “storage organization number”. The read result 2002 of the extracted character line indicates that “storage organization” is an unread character, “number” is collated, and the frame score 2003 is “54”.

図２１に示す２１１２は、項目６０２が「収納機関番号」で、属性６０３が「データ」で、文字列定義６０４が「特定桁数数字列」である枠に対応する枠構造照合結果９０２の枠から抽出した文字行の読取結果２００２が「０００−０００−０００」で、枠スコア２００３が「２３３」であることを示す。 2112 shown in FIG. 21 is a frame of a frame structure matching result 902 corresponding to a frame in which the item 602 is “storage organization number”, the attribute 603 is “data”, and the character string definition 604 is “specific digit number string”. It indicates that the read result 2002 of the character line extracted from “000-000-000” and the frame score 2003 is “233”.

図２１に示す２１１３は、項目６０２が「お客様番号」で、属性６０３が「項目名」で、文字列定義６０４が「お客様番号」である枠に対応する枠構造照合結果９０２の枠から抽出した文字行の読取結果２００２が「お客様」が不読文字であり、「番号」が照合されたことを示し、枠スコア２００３が「１０２」であることを示す。 21 is extracted from the frame of the frame structure matching result 902 corresponding to the frame in which the item 602 is “customer number”, the attribute 603 is “item name”, and the character string definition 604 is “customer number”. The read result 2002 of the character line indicates that “customer” is an unread character, “number” is collated, and the frame score 2003 is “102”.

図２１に示す２１１４は、項目６０２が「お客様番号」で、属性６０３が「データ」で、文字列定義６０４が「数字列」である枠に対応する枠構造照合結果９０２の枠から抽出した文字行の読取結果２００２が「１２３４」で、枠スコア２００３が「２４２」であることを示す。 21 is a character extracted from the frame of the frame structure matching result 902 corresponding to the frame in which the item 602 is “customer number”, the attribute 603 is “data”, and the character string definition 604 is “numeric string”. The row reading result 2002 is “1234”, and the frame score 2003 is “242”.

図２１に示す２１１５は、項目６０２が「確認番号」で、属性６０３が「項目名」で、文字列定義６０４が「確認番号」である枠に対応する枠構造照合結果９０２の枠から抽出した文字行が文字列定義６０４と照合できなかったことを示す。このため、読取結果２００２が「？」で枠スコア２００３が「０」である。 21 is extracted from the frame of the frame structure matching result 902 corresponding to the frame in which the item 602 is “confirmation number”, the attribute 603 is “item name”, and the character string definition 604 is “confirmation number”. This indicates that the character line could not be matched with the character string definition 604. For this reason, the read result 2002 is “?” And the frame score 2003 is “0”.

図２１に示す２１１６は、項目６０２が「確認番号」で、属性６０３が「データ」で、文字列定義６０４が「特定桁数数字列」である枠に対応する枠構造照合結果９０１の枠から抽出した文字行が文字列定義６０４と照合できなかったことを示す。このため、読取結果２００２が「？」で枠スコア２００３が「０」である。 21 indicates a frame structure matching result 901 corresponding to a frame in which the item 602 is “confirmation number”, the attribute 603 is “data”, and the character string definition 604 is “specific digit number string”. This indicates that the extracted character line could not be matched with the character string definition 604. For this reason, the read result 2002 is “?” And the frame score 2003 is “0”.

枠構造照合結果９０２のスコア２０１７は各枠スコアの総和であり、「６３１」である。 The score 2017 of the frame structure matching result 902 is the sum of the frame scores, which is “631”.

図１７に示すステップ１７０７の処理では、図２０に示す枠構造照合結果９０１のスコア２０１７は、図２１に示す枠構造照合結果９０２のスコア２０１７より大きいので、枠構造照合結果９０１の特定項目文字列照合結果を返す。 In the processing of step 1707 shown in FIG. 17, the score 2017 of the frame structure matching result 901 shown in FIG. 20 is larger than the score 2017 of the frame structure matching result 902 shown in FIG. Returns the verification result.

以上、本発明者によりなされた発明を実施形態に基づき具体的に説明したが、本発明は上述した実施形態に限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能であることは言うまでもない。 Although the invention made by the present inventor has been specifically described based on the embodiments, the present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the scope of the invention. Needless to say.

本願発明は、帳票を認識する帳票処理システムに適用可能である。 The present invention is applicable to a form processing system that recognizes forms.

１００帳票処理用計算機
１０１帳票
１０２スキャナ
１０３画像入力部
１０４電子データ
１０５入力部
１０６出力部
１０７通信部
１０８制御部
１０９補助記憶部
１１０記憶部
１１１外部補助記憶部
１１２内部バス
１１３ネットワーク
１１４外部サーバ 100 Computer for Form Processing 101 Form 102 Scanner 103 Image Input Unit 104 Electronic Data 105 Input Unit 106 Output Unit 107 Communication Unit 108 Control Unit 109 Auxiliary Storage Unit 110 Storage Unit 111 External Auxiliary Storage Unit 112 Internal Bus 113 Network 114 External Server

Claims

In a form processing system that extracts character lines that are included in a form image and exists in a frame that is an area surrounded by line segments and recognizes the extracted character lines,
The form processing system stores frame structure dictionary data in which a character string corresponding to a plurality of extraction target frames from which the character line is to be extracted and an arrangement relationship of the plurality of extraction target frames are associated with each other.
A frame extraction unit for extracting a frame from the form image as an extraction frame;
A frame matching unit that refers to the frame structure dictionary data and calculates a matching result in which the extracted frame extracted by the frame extracting unit is associated with the extraction target frame;
A character line extraction unit that extracts a character line existing in an extraction frame associated with the extraction target frame with respect to the collation result calculated by the frame collation unit;
For each matching result calculated by the frame matching unit, the similarity of the character line with respect to the character string of the extraction target frame corresponding to the extraction frame from which the character line extracted by the character line extracting unit is extracted. A score calculation unit that calculates a frame score to indicate a matching result score that is a score of each matching result based on a frame score of the frame associated with the extraction target frame included in each matching result;
A form processing system characterized in that, based on the collation result score calculated by the score calculation unit, one collation result among the collation results calculated by the frame collation unit is used as an extraction collation result.

The frame extraction unit extracts grid points from the form image, extracts an area surrounded by predetermined grid points as the extraction frame,
When the number of extraction frames extracted by the frame extraction unit is greater than the number of extraction target frames, the frame verification unit calculates a verification result in which a plurality of the extraction frames are associated with one extraction target frame. The form processing system according to claim 1.

The character line extraction unit
Storing the extracted character line and the extraction frame from which the character line is extracted as an extracted character line;
The form processing according to claim 1 or 2, wherein when an extraction frame that is a target for extracting the character line is stored as the extracted character line, a character line is not extracted from the extraction frame. system.

Stores character data with multiple characters registered,
The score calculation unit
Extracting a candidate for a character-like size from the character line extracted by the character line extraction unit,
Referencing the character data, extracting a character similar to the extraction candidate as an identification candidate, calculating the similarity of the identification candidate to the extraction candidate,
Extracting an identification candidate that matches a character constituting a character string of the extraction target frame corresponding to the extraction frame from which the character line has been extracted from the identification candidate;
The form processing system according to claim 1, wherein the frame score is calculated based on a similarity of the extracted identification candidates.

The score calculation unit
When there is no identification candidate that matches the character constituting the character string of the extraction target frame corresponding to the extraction frame from which the character line has been extracted, identification of the extraction candidate next to the extraction candidate for which the identification candidate does not exist 5. The form processing system according to claim 4, wherein it is determined whether or not the candidate matches a character next to a character constituting the character string of the extraction target frame.

The form processing system according to claim 5, wherein the score calculation unit recognizes the extraction frame as an extraction target frame corresponding to the extraction frame if the frame score is equal to or greater than a predetermined value.

In a form processing method in which a form processing system extracts character lines existing in a frame that is included in a form image and is surrounded by a line segment, and recognizes the extracted character lines,
The form processing system stores frame structure dictionary data in which a character string corresponding to a plurality of extraction target frames from which the character line is to be extracted and an arrangement relationship of the plurality of extraction target frames are associated with each other.
The method
A frame extraction step of extracting a frame from the form image as an extraction frame;
A frame matching step of referring to the frame structure dictionary data and calculating a matching result in which the extraction frame extracted in the frame extraction step is associated with the extraction target frame;
A character line extraction step for extracting a character line existing in an extraction frame associated with the extraction target frame with respect to the collation result calculated in the frame collation step;
For each matching result calculated in the frame matching step, the similarity of the character line to the character string in the extraction target frame corresponding to the extraction frame from which the character line extracted in the character line extracting step is extracted. A score calculation step of calculating a matching score that is a score of each matching result based on a frame score of a frame associated with the extraction target frame included in each matching result;
A form processing comprising: a step of setting one collation result as an extraction collation result among the collation results calculated in the frame collation step based on the collation result score calculated in the score calculation step. Method.

In the frame extraction step, a grid point is extracted from the form image, an area surrounded by predetermined grid points is extracted as the extraction frame,
In the frame collation step, when the number of extraction frames extracted in the frame extraction step is larger than the number of extraction target frames, a collation result in which a plurality of the extraction frames are associated with one extraction target frame is calculated. The form processing method according to claim 7.

In the character line extraction step,
Storing the extracted character line and the extraction frame from which the character line is extracted as an extracted character line;
The form processing according to claim 7 or 8, wherein when an extraction frame that is a target for extracting the character line is stored as the extracted character line, a character line is not extracted from the extraction frame. Method.

The form processing system stores character data in which multiple characters are registered,
In the score calculation step,
Extracting a candidate for a character-like size from the character line extracted in the character line extraction step,
Referencing the character data, extracting a character similar to the extraction candidate as an identification candidate, calculating the similarity of the identification candidate to the extraction candidate,
Extracting an identification candidate that matches a character constituting a character string of the extraction target frame corresponding to the extraction frame from which the character line has been extracted from the identification candidate;
The form processing method according to any one of claims 7 to 9, wherein the frame score is calculated based on a similarity of the extracted identification candidates.

In the score calculation step,
When there is no identification candidate that matches the character constituting the character string of the extraction target frame corresponding to the extraction frame from which the character line has been extracted, identification of the extraction candidate next to the extraction candidate for which the identification candidate does not exist The form processing method according to claim 10, wherein it is determined whether or not the candidate matches a character next to a character constituting the character string of the extraction target frame.

12. The form processing method according to claim 11, wherein, in the score calculation step, if the frame score is equal to or greater than a predetermined value, the extraction frame is recognized as an extraction target frame corresponding to the extraction frame.