JP2002288675A

JP2002288675A - Method, program, and system for document identification

Info

Publication number: JP2002288675A
Application number: JP2001093160A
Authority: JP
Inventors: Naoya Tanaka; 直哉田中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2001-03-28
Filing date: 2001-03-28
Publication date: 2002-10-04

Abstract

PROBLEM TO BE SOLVED: To provide a system capable of realizing a highly accurate identification, in the identification operation of a document by reducing the operation amount of an operator in the registration operation for the document of a dictionary. SOLUTION: This document identification system comprises an image input part 52 for storing the digital image of the document read through a scanner in an image memory 12, after correcting a skew of the document and removing blanks around the document; a characteristic extraction part 53 for detecting the size of the document from the digital image in the image memory 12, detecting the character patterns in unit of character string, converting the character patterns into a character amount in units of character string, and holding the character amount in a character memory 13; a common character extracting part 55 for extracting the character amount common to a plurality sets of character strings from the character amount of the character strings in the character memory 13 and registering the extracted character amount in a dictionary memory 15, when the format of the document to be inputted is registered; and a collation part 54 for collating the character amount of the character string of the document to be processed in the character memory 13 with the character amount of the character string for each document in the dictionary memory 15, when the format of the document to be inputted is identified.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、帳票を識別するた
めのシステムに関し、特に、帳票が入力された場合に、
記入内容の異なる等の帳票をフォーマットして登録す
る、または登録された帳票のフォーマットの種別を識別
するための帳票識別方法、帳票識別プログラム、および
帳票識別方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a system for identifying a form, and more particularly to a system for identifying a form when a form is input.
The present invention relates to a form identification method, a form identification program, and a form identification method for formatting and registering forms having different contents, or for identifying the type of format of a registered form.

【０００２】[0002]

【従来の技術】従来の帳票識別方法の一例が、特開平１
１−２８２９５９号公報に記載されている。この従来の
帳票識別システムは、画像入力手段と、画像記憶部と、
画像中の帳票位置を検出する帳票位置検出手段と、辞書
記憶部に登録された文字列の位置を前記画像記憶部の座
標系へと変換する座標変換手段と、座標変換手段で得ら
れた文字列の位置と前記画像中の実際の文字列の位置と
のずれを文字単位に補正する文字座標補正手段と、文字
単位に辞書と処理対象の画像中の文字パターンとを照合
する図形照合手段とから構成されている。2. Description of the Related Art An example of a conventional form identification method is disclosed in
No. 1-282959. This conventional form identification system includes an image input unit, an image storage unit,
Form position detecting means for detecting a form position in an image, coordinate converting means for converting the position of a character string registered in the dictionary storage unit into the coordinate system of the image storing unit, and characters obtained by the coordinate converting unit Character coordinate correcting means for correcting the difference between the position of the column and the position of the actual character string in the image in character units; and a figure matching means for matching the dictionary and the character pattern in the image to be processed in character units. It is composed of

【０００３】このような構成を有する従来の帳票識別シ
ステムは、次のように動作する。すなわち、帳票を光学
的に走査して光電変換されディジタル化されて得られた
ディジタル画像は、画像記憶部に保持される。該帳票の
ディジタル画像に対し、まず、帳票位置検出手段におい
て、帳票の４辺の位置が検出され、帳票の画像中での位
置が求められる。次に、座標変換手段において、辞書に
保持された帳票のプレ印刷文字列の文字単位の位置情報
が帳票位置検出手段で得られた該帳票の位置情報を利用
して該ディジタル画像中での位置情報へと座標変換され
る。次に、文字座標補正手段において、座標変換手段に
よって変換された辞書登録された文字パターンの座標と
実際の画像中の文字パターンの座標との位置ずれが補正
される。さらに、図形照合手段において、辞書登録され
た文字パターンの特徴量と画像中の文字パターンの特徴
量が照合される。辞書登録された帳票種毎に全ての文字
パターンが照合された後、類似度が尤も高い帳票種が処
理対象の帳票の帳票種として判定される。A conventional form identification system having such a configuration operates as follows. That is, a digital image obtained by optically scanning a form, photoelectrically converting the form, and digitizing the form is stored in the image storage unit. For the digital image of the form, first, the position of the four sides of the form is detected by the form position detecting means, and the position of the form in the image is obtained. Next, in the coordinate conversion means, the position information in the digital image is obtained by using the position information of the form obtained by the form position detecting means in the form unit of the pre-print character string of the form held in the dictionary. The coordinates are transformed into information. Next, the character coordinate correcting means corrects the positional deviation between the coordinates of the character pattern registered in the dictionary converted by the coordinate converting means and the coordinates of the character pattern in the actual image. Further, in the graphic matching means, the feature amount of the character pattern registered in the dictionary is compared with the feature amount of the character pattern in the image. After all character patterns are collated for each form type registered in the dictionary, the form type having the highest similarity is determined as the form type of the form to be processed.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、この従
来技術には、次のような問題点があった。第１の問題点
は、文字間隔が狭い文字列の場合は文字座標補正手段の
中で行われる文字パターンの切り出しに失敗してしま
い、帳票識別を実行できない場合があるということであ
る。文字サイズが小さくてかつ、文字間隔の狭い文字列
の場合にこの問題は顕著である。However, this prior art has the following problems. A first problem is that in the case of a character string with a narrow character interval, the character pattern cutout performed in the character coordinate correcting means fails, and the form identification may not be performed in some cases. This problem is remarkable in the case of a character string having a small character size and a narrow character interval.

【０００５】第２の問題点は、辞書所登録する文字列を
オペレータが指定する必要があるため、オペレータの負
担が大きいということである。[0005] The second problem is that the operator has to specify a character string to be registered in a dictionary place, which imposes a heavy burden on the operator.

【０００６】本発明の目的は、上記問題点を鑑み、帳票
の辞書登録操作においてオペレータの作業量を低減し、
帳票の識別操作において高精度な識別を実現するシステ
ムを提供することにある。SUMMARY OF THE INVENTION In view of the above problems, an object of the present invention is to reduce the amount of work of an operator in registering a form in a dictionary.
An object of the present invention is to provide a system that realizes highly accurate identification in a form identification operation.

【０００７】[0007]

【課題を解決するための手段】上記の目的を達成するた
めに、本発明の帳票識別方法は、情報処理装置に帳票を
識別させる帳票識別方法であって、入力された未記入の
帳票のディジタル画像から文字列を検出し、その検出し
た文字列毎の座標情報を抽出し、さらにその文字列と同
一の高さを持つ矩形状の特徴抽出範囲を制限する窓領域
を文字列上で文字列に沿って移動させながら特徴量を抽
出し、その抽出した文字列の座標情報と特徴量とをメモ
リに登録することで帳票識別フォーマットを作成するこ
とを特徴としている。In order to achieve the above object, a form identification method according to the present invention is a form identification method for identifying a form by an information processing apparatus. A character string is detected from the image, coordinate information for each detected character string is extracted, and a window area that limits a rectangular feature extraction range having the same height as the character string is displayed on the character string. The feature amount is extracted while moving along the line, and the coordinate information of the extracted character string and the feature amount are registered in a memory to create a form identification format.

【０００８】また、本発明の帳票識別方法は、情報処理
装置に帳票を識別させる帳票識別方法であって、入力さ
れた同一フォーマットで記入内容の異なる複数の記入済
み帳票のディジタル画像から各々文字列を検出し、その
検出した文字列毎の座標情報を抽出し、さらにその文字
列と同一の高さを持つ矩形状の特徴抽出範囲を制限する
窓領域を文字列上で文字列に沿って移動させながら特徴
量を抽出し、その抽出した文字列毎の座標情報および特
徴量が前記の入力された同一フォーマットで記入内容の
異なる複数の記入済み帳票の全てで一致するか否かを比
較照合し、その座標情報および特徴量が一致する文字列
の座標情報と特徴量とをメモリに登録することで帳票識
別フォーマットを作成することを特徴としている。A form identification method according to the present invention is a form identification method for identifying a form by an information processing apparatus, wherein a character string is respectively converted from a digital image of a plurality of completed forms in the same format and having different contents. Is detected, coordinate information for each detected character string is extracted, and a window area for limiting a rectangular feature extraction range having the same height as the character string is moved along the character string along the character string. While extracting the characteristic amount, and comparing and checking whether the coordinate information and the characteristic amount for each of the extracted character strings match in all of the plurality of filled-out forms having the same input format and different entry contents. It is characterized in that a form identification format is created by registering, in a memory, coordinate information and a feature amount of a character string having the same coordinate information and feature amount.

【０００９】また、本発明の帳票識別方法は、情報処理
装置に帳票を識別させる帳票識別方法であって、入力さ
れた帳票のディジタル画像から文字列を検出し、その検
出した文字列毎の座標情報と特徴量とを抽出し、その抽
出した文字列の座標情報および特徴量とあらかじめ帳票
の種別毎に登録された文字列の座標情報および特徴量と
を比較することで帳票の種別を識別することを特徴とし
ている。A form identification method according to the present invention is a form identification method for identifying a form by an information processing apparatus, wherein a character string is detected from a digital image of an input form, and coordinates of each detected character string are detected. The information and the characteristic amount are extracted, and the type of the form is identified by comparing the coordinate information and the characteristic amount of the extracted character string with the coordinate information and the characteristic amount of the character string registered in advance for each type of the form. It is characterized by:

【００１０】また、本発明の帳票識別方法は、情報処理
装置に帳票を識別させる帳票識別方法であって、新しい
フォーマットで未記入の帳票を入力した場合に、入力さ
れた帳票のディジタル画像を第１のメモリに保持する第
１の保持ステップと、前記第１のメモリに保持されたデ
ィジタル画像から帳票サイズを測定するサイズ測定ステ
ップと、前記第１のメモリに保持されたディジタル画像
から文字列を検出してその検出した文字列の座標情報を
算出し、さらにその文字列と同一の高さを持つ矩形状の
特徴抽出範囲を制限する窓領域を文字列上で文字列に沿
って移動させながら特徴量を算出する特徴量算出ステッ
プと、その算出した座標情報と特徴量とを第２のメモリ
に保持する第２の保持ステップと、文字列の存在がなく
なるまで前記特徴量算出ステップと前記第２の保持ステ
ップとを繰り返す第１の繰返しステップと、前記第１の
繰返しステップが終了すると前記第１のメモリに保持さ
れた帳票サイズと前記第２のメモリに保持された全ての
文字列の座標情報毎の特徴量とを第３のメモリに登録す
る登録ステップとを含むことを特徴としている。The form identification method according to the present invention is a form identification method for identifying a form by an information processing apparatus. When a blank form is entered in a new format, a digital image of the entered form is converted to a second form. A first holding step of holding the data in the first memory, a size measuring step of measuring a form size from the digital image held in the first memory, and a character string from the digital image held in the first memory. Detecting and calculating coordinate information of the detected character string, and further moving a window area that limits a rectangular feature extraction range having the same height as the character string along the character string along the character string A feature value calculating step of calculating a feature value; a second holding step of storing the calculated coordinate information and the feature value in a second memory; A first repetition step of repeating the calculation step and the second holding step; and, when the first repetition step is completed, the form size held in the first memory and the total size held in the second memory. And registering a feature amount of each character string for each coordinate information in the third memory.

【００１１】また、本発明の帳票識別方法は、情報処理
装置に帳票を識別させる票識別方法であって、フォーマ
ットに記入された帳票を入力した場合に、入力された帳
票のディジタル画像を第１のメモリに保持する第１の保
持ステップと、前記入力された帳票の識別を行うための
識別コードを発生する発生ステップと、前記第１のメモ
リに保持されたディジタル画像から帳票サイズを測定す
るサイズ測定ステップと、前記第１のメモリに保持され
たディジタル画像から文字列を検出してその検出した文
字列の座標情報を算出し、さらにその文字列と同一の高
さを持つ矩形状の特徴抽出範囲を制限する窓領域を文字
列上で文字列に沿って移動させながら特徴量を算出する
特徴量算出ステップと、その算出した座標情報と特徴量
とを第２のメモリに前記識別コードを基に保持する第２
の保持ステップと、文字列の存在がなくなるまで前記特
徴量算出ステップと前記第２の保持ステップとを繰り返
す第１の繰返しステップと、前記第１の繰返しステップ
終了後に前記入力された帳票と同一のフォーマットで記
入された帳票が入力されると、前記発生ステップと前記
サイズ測定ステップと前記特徴量算出ステップと前記第
２の保持ステップと前記第１の繰返しステップとを繰り
返す第２の繰返しステップと、前記第２の繰返しステッ
プ終了後に前記第２のメモリに保持されたある識別コー
ドにある文字列の座標情報が前記第２のメモリに保持さ
れた別の識別コードにある文字列の座標情報と一致する
かを全て比較する比較ステップと、前記比較ステップで
一致するものがあれば、一致した座標情報に該当する特
徴量との類似度を算出してあらかじめ決められた値以上
であれば一致したものとして文字列の座標情報と特徴量
を全て抽出する抽出ステップと、抽出が終わると抽出し
た全ての文字列の座標情報毎の特徴量と前記第２のメモ
リに書き込まれた帳票サイズとを第３のメモリに登録す
る登録ステップとを含むことを特徴としている。The form identification method according to the present invention is a form identification method for identifying a form to an information processing apparatus. When a form written in a format is input, a digital image of the input form is converted to a first image. A first storing step of storing the form in the memory, a generating step of generating an identification code for identifying the input form, and a size for measuring a form size from the digital image stored in the first memory. A measuring step, detecting a character string from the digital image held in the first memory, calculating coordinate information of the detected character string, and extracting a rectangular feature having the same height as the character string A feature amount calculating step of calculating a feature amount while moving the window region for limiting the range along the character string on the character string; and storing the calculated coordinate information and the feature amount in a second memory. Second holding on the basis of the identification code
And a first repetition step of repeating the feature amount calculation step and the second retention step until the presence of a character string disappears, and the same as the input form after the end of the first repetition step. When a form filled in in a format is input, a second repetition step of repeating the generation step, the size measurement step, the feature amount calculation step, the second holding step, and the first repetition step, After the end of the second repetition step, the coordinate information of the character string in the certain identification code stored in the second memory matches the coordinate information of the character string in another identification code stored in the second memory. And if there is a match in the comparison step, the similarity between the feature quantity corresponding to the matched coordinate information is determined. An extraction step of extracting all the coordinate information and the feature amount of the character string as a match if the value is equal to or more than a predetermined value, and extracting the feature amount for each coordinate information of all the extracted character strings after the extraction. Registering the form size written in the second memory and the third memory.

【００１２】また、本発明の帳票識別方法は、情報処理
装置に帳票を識別させる帳票識別方法であって、フォー
マットに記入された帳票を入力した場合に、入力された
帳票のディジタル画像を第１のメモリに保持する第１の
保持ステップと、前記第１のメモリに保持されたディジ
タル画像から帳票サイズを測定するサイズ測定ステップ
と、前記第１のメモリに保持されたディジタル画像から
文字列を検出してその検出した文字列の座標情報を算出
し、さらにその文字列と同一の高さを持つ矩形状の特徴
抽出範囲を制限する窓領域を文字列上で文字列に沿って
移動させながら特徴量を算出する特徴量算出ステップ
と、その算出した座標情報と特徴量とを第２のメモリに
保持する第２の保持ステップと、文字列の存在がなくな
るまで前記特徴量算出ステップと前記第２の保持ステッ
プとを繰り返す第１の繰返しステップと、前記第１の繰
返しステップが終了後に前記第２のメモリに保持された
文字列の座標と，前記第２のメモリに保存されている帳
票サイズと同じ帳票サイズで第３のメモリに登録されて
いる帳票の文字列の座標情報とを比較する比較ステップ
と、前記比較ステップで一致した場合に文字列の特徴量
同士の類似度を算出してあらかじめ決められた値以上で
あれば文字列が一致したと判断して帳票の種別を識別す
る識別ステップとを含むことを特徴としている。A form identification method according to the present invention is a form identification method for identifying a form by an information processing apparatus. When a form written in a format is input, a digital image of the input form is converted into a first digital image. A first storing step of storing the digital image stored in the first memory, a size measuring step of measuring a form size from the digital image stored in the first memory, and detecting a character string from the digital image stored in the first memory Then, the coordinate information of the detected character string is calculated, and further, the window area that limits the rectangular feature extraction range having the same height as the character string is moved along the character string along the character string. A feature amount calculating step of calculating the amount, a second holding step of holding the calculated coordinate information and the feature amount in a second memory, and the feature amount calculating step until the character string is no longer present. A first repetition step of repeating the step and the second holding step; a coordinate of a character string held in the second memory after the first repetition step is completed; and a character string stored in the second memory. A comparison step of comparing coordinate information of a character string of a form registered in the third memory with the same form size as that of the form, and a similarity between the characteristic amounts of the character strings if the character strings match in the comparison step Is calculated, and if the value is equal to or greater than a predetermined value, it is determined that the character strings match, and an identification step for identifying the type of the form is included.

【００１３】また、本発明の帳票識別プログラムは、コ
ンピュータに処理させるための帳票を識別する帳票識別
プログラムであって、新しいフォーマットで未記入の帳
票を入力した場合に、入力された帳票のディジタル画像
を第１のメモリに保持する第１の保持ステップと、前記
第１のメモリに保持されたディジタル画像から帳票サイ
ズを測定するサイズ測定ステップと、前記第１のメモリ
に保持されたディジタル画像から文字列を検出してその
検出した文字列の座標情報を算出し、さらにその文字列
と同一の高さを持つ矩形状の特徴抽出範囲を制限する窓
領域を文字列上で文字列に沿って移動させながら特徴量
を算出する特徴量算出ステップと、その算出した座標情
報と特徴量とを第２のメモリに保持する第２の保持ステ
ップと、文字列の存在がなくなるまで前記特徴量算出ス
テップと前記第２の保持ステップとを繰り返す第１の繰
返しステップと、前記第１の繰返しステップが終了する
と前記第１のメモリに保持された帳票サイズと前記第２
のメモリに保持された全ての文字列の座標情報毎の特徴
量とを第３のメモリに登録する登録ステップとを含む処
理を前記コンピュータに実行させることを特徴としてい
る。A form identification program according to the present invention is a form identification program for identifying a form to be processed by a computer. When a blank form is entered in a new format, a digital image of the entered form is created. In a first memory, a size measuring step of measuring a form size from the digital image held in the first memory, and a character from the digital image held in the first memory. Detects a column, calculates the coordinate information of the detected character string, and moves a window area that limits the rectangular feature extraction range with the same height as the character string along the character string along the character string A feature amount calculating step of calculating a feature amount while performing the calculation, a second holding step of holding the calculated coordinate information and the feature amount in a second memory, A first repeating step of repeating said second holding step and the feature amount calculating step until the resident is eliminated, the first of said repeating step is completed and form size held in said first memory a second
And a registration step of registering, in a third memory, the feature amounts of all the character strings stored in the memory for each piece of coordinate information in the third memory.

【００１４】また、本発明の帳票識別プログラムは、コ
ンピュータに処理させるための帳票を識別する帳票識別
プログラムであって、フォーマットに記入された帳票を
入力した場合に、入力された帳票のディジタル画像を第
１のメモリに保持する第１の保持ステップと、前記入力
された帳票の識別を行うための識別コードを発生する発
生ステップと、前記第１のメモリに保持されたディジタ
ル画像から帳票サイズを測定するサイズ測定ステップ
と、前記第１のメモリに保持されたディジタル画像から
文字列を検出してその検出した文字列の座標情報を算出
し、さらにその文字列と同一の高さを持つ矩形状の特徴
抽出範囲を制限する窓領域を文字列上で文字列に沿って
移動させながら特徴量を算出する特徴量算出ステップ
と、その算出した座標情報と特徴量とを第２のメモリに
前記識別コードを基に保持する第２の保持ステップと、
文字列の存在がなくなるまで前記特徴量算出ステップと
前記第２の保持ステップとを繰り返す第１の繰返しステ
ップと、前記第１の繰返しステップ終了後に前記入力さ
れた帳票と同一のフォーマットで記入された帳票が入力
されると、前記発生ステップと前記サイズ測定ステップ
と前記特徴量算出ステップと前記第２の保持ステップと
前記第１の繰返しステップとを繰り返す第２の繰返しス
テップと、前記第２の繰返しステップ終了後に前記第２
のメモリに保持されたある識別コードにある文字列の座
標情報が前記第２のメモリに保持された別の識別コード
にある文字列の座標情報と一致するかを全て比較する比
較ステップと、前記比較ステップで一致するものがあれ
ば、一致した座標情報に該当する特徴量との類似度を算
出してあらかじめ決められた値以上であれば一致したも
のとして文字列の座標情報と特徴量を全て抽出する抽出
ステップと、抽出が終わると抽出した全ての文字列の座
標情報毎の特徴量と前記第２のメモリに書き込まれた帳
票サイズとを第３のメモリに登録する登録ステップとを
含む処理を前記コンピュータに実行させることを特徴と
している。A form identification program according to the present invention is a form identification program for identifying a form to be processed by a computer. When a form written in a format is input, a digital image of the input form is converted. A first holding step of holding in a first memory, a generating step of generating an identification code for identifying the input form, and measuring a form size from a digital image held in the first memory A size measuring step of detecting a character string from the digital image held in the first memory, calculating coordinate information of the detected character string, and further calculating a rectangular shape having the same height as the character string. A feature amount calculating step of calculating a feature amount while moving a window region for limiting the feature extraction range on the character string along the character string, and the calculated coordinates A second holding step of holding the distribution and the feature quantity on the basis of the identification code in a second memory,
A first repetition step of repeating the feature amount calculation step and the second holding step until there is no longer a character string; and, after completion of the first repetition step, data is entered in the same format as the input form. When a form is input, a second repetition step of repeating the generation step, the size measurement step, the feature amount calculation step, the second holding step, and the first repetition step, and the second repetition step After the step, the second
A comparison step of comparing all coordinate information of a character string in a certain identification code held in the memory with the coordinate information of a character string in another identification code held in the second memory; If there is a match in the comparison step, the similarity to the feature amount corresponding to the matched coordinate information is calculated, and if the value is equal to or greater than a predetermined value, the coordinate information of the character string and all the feature amounts are determined to be matched. A process including an extraction step of extracting, and a registration step of registering, in a third memory, a feature amount for each coordinate information of all extracted character strings and a form size written in the second memory after the extraction is completed. Is executed by the computer.

【００１５】また、本発明の帳票識別プログラムは、コ
ンピュータに処理させるための帳票を識別する帳票識別
プログラムであって、フォーマットに記入された帳票を
入力した場合に、入力された帳票のディジタル画像を第
１のメモリに保持する第１の保持ステップと、前記第１
のメモリに保持されたディジタル画像から帳票サイズを
測定するサイズ測定ステップと、前記第１のメモリに保
持されたディジタル画像から文字列を検出してその検出
した文字列の座標情報を算出し、さらにその文字列と同
一の高さを持つ矩形状の特徴抽出範囲を制限する窓領域
を文字列上で文字列に沿って移動させながら特徴量を算
出する特徴量算出ステップと、その算出した座標情報と
特徴量とを第２のメモリに保持する第２の保持ステップ
と、文字列の存在がなくなるまで前記特徴量算出ステッ
プと前記第２の保持ステップとを繰り返す第１の繰返し
ステップと、前記第１の繰返しステップが終了後に前記
第２のメモリに保持された文字列の座標と，前記第２の
メモリに保存されている帳票サイズと同じ帳票サイズで
第３のメモリに登録されている帳票の文字列の座標情報
とを比較する比較ステップと、前記比較ステップで一致
した場合に文字列の特徴量同士の類似度を算出してあら
かじめ決められた値以上であれば文字列が一致したと判
断して帳票の種別を識別する識別ステップとを含む処理
を前記コンピュータに実行させることを特徴としてい
る。The form identification program according to the present invention is a form identification program for identifying a form to be processed by a computer, and when a form written in a format is input, a digital image of the input form is converted. A first holding step of holding in a first memory;
A size measuring step of measuring a form size from the digital image held in the memory; detecting a character string from the digital image held in the first memory; calculating coordinate information of the detected character string; A feature amount calculating step of calculating a feature amount while moving a window region for limiting a rectangular feature extraction range having the same height as the character string along the character string on the character string; and the calculated coordinate information. A second storing step of storing the characteristic amount and the characteristic amount in a second memory; a first repeating step of repeating the characteristic amount calculating step and the second storing step until there is no longer a character string; After the first repetition step, the coordinates of the character string held in the second memory and the form size stored in the second memory are registered in the third memory with the same form size as the form size stored in the second memory. A comparing step of comparing the coordinate information of the character string of the form with the document, and calculating the similarity between the characteristic amounts of the character strings when the character strings match in the comparing step, and if the character string is equal to or more than a predetermined value, the character string Is determined to match, and an identification step for identifying the type of the form is executed by the computer.

【００１６】また、本発明の帳票識別方式は、帳票を入
力するとディジタル画像に変換する入力装置と、帳票の
識別処理を行う制御装置とを備えた帳票を識別する帳票
識別方式において、前記制御装置は、前記入力装置から
読み取った帳票のディジタル画像を格納する画像入力手
段と、画像入力手段により保持された前記帳票のディジ
タル画像から帳票のサイズを検出すると共に文字パター
ンを文字列単位で検出して文字列単位で特徴量へ変換し
て保持する特徴抽出手段と、前記入力する帳票のフォー
マットを登録する場合には前記特徴抽出手段から出力さ
れる複数組みの文字列の特徴量から各組共通の特徴量を
抽出して辞書メモリに登録する共通特徴抽出手段と、前
記入力する帳票のフォーマットの識別を行う場合には前
記特徴抽出手段から出力される処理対象の帳票の文字列
の特徴量とあらかじめ前記辞書メモリに帳票毎に保持さ
れた文字列の特徴量とを照合する照合手段とを有するこ
とを特徴としている。According to another aspect of the present invention, there is provided a form identification method for identifying a form, comprising: an input device for converting the form into a digital image when the form is input; and a control device for performing a form identification process. An image input means for storing a digital image of the form read from the input device, and detecting the size of the form from the digital image of the form held by the image input means and detecting the character pattern in units of character strings. A feature extraction unit that converts the character string into a feature amount and holds the same, and a common format for each set from a plurality of sets of character strings output from the feature extraction unit when the format of the input form is registered. A common feature extraction unit for extracting a feature amount and registering it in the dictionary memory; and a feature extraction unit for identifying the format of the input form. It is characterized by having a matching means for matching the feature quantity of the feature with a pre-character string held in each document in said dictionary memory string of the form to be processed to be output.

【００１７】[0017]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面を参照して説明する。図２を参照すると、本発明
の第１の実施の形態は、パソコン等の情報処理装置（コ
ンピュータ）である制御装置１と、帳票表面を光学的に
走査してディジタル画像を生成するイメージスキャナ２
と、制御装置１に文字等を入力するキーボード３と、制
御装置１の制御の下で表示を行う表示装置４（例えば、
ＣＲＴ，ＬＣＤ等）とから構成される。なお、イメージ
スキャナ２は、入力した画像情報をディジタル画像に変
換するディジタルカメラ等の入力装置を使用しても良
い。Next, embodiments of the present invention will be described with reference to the drawings. Referring to FIG. 2, according to a first embodiment of the present invention, a control device 1, which is an information processing device (computer) such as a personal computer, and an image scanner 2 that optically scans a form surface to generate a digital image
A keyboard 3 for inputting characters and the like to the control device 1; and a display device 4 (for example,
CRT, LCD, etc.). The image scanner 2 may use an input device such as a digital camera that converts input image information into a digital image.

【００１８】制御装置１は、図示していないプロセッサ
（ＣＰＵ）によりプログラムで動作する制御部１１と、
帳票のディジタル画像を保持する画像メモリ１２と、特
徴抽出部５３（後述）で記入内容の異なる同一フォーマ
ットの帳票から検出された文字列の特徴量の組を複数個
保持する特徴メモリ１３と、帳票の読取り動作時のモー
ドを含む制御部１１の動作時の制御の状態等を記憶する
制御メモリ１４と、帳票種をインデックスとして文字列
の特徴量を該文字列の座標と共に格納する辞書メモリ１
５と、帳票識別プログラムが格納されているプログラム
メモリ１６と、イメージスキャナ２とキーボード３と表
示装置４とのデータのやり取り（例えば、ＵＳＢインタ
フェースで制御）を行うインタフェース部１７とから構
成される。なお、画像メモリ１２、特徴量メモリ１３、
および制御メモリは、データの読み出しと書き込みので
きるメモリ［この場合のメモリは、内部メモリ（例え
ば、ＲＡＭ）、外部メモリ（例えば磁気ディスク）の種
別は問わない］であって、同一のメモリ内のエリアにそ
れぞれ設けても良い。また、プログラムメモリ１６は、
データの読み出しと書き込みのできるメモリであって、
電源を落としても消えないように不揮発性のメモリ（例
えば、ＥＰＲＯＭ、磁気ディスク等）を使用することが
望ましい。The control device 1 comprises a control unit 11 operated by a program (not shown) by a processor (CPU),
An image memory 12 for holding a digital image of a form, a feature memory 13 for holding a plurality of sets of character string feature amounts detected from forms of the same format having different entry contents by a feature extracting unit 53 (described later), And a dictionary memory 1 for storing the state of the control during operation of the control unit 11 including the mode at the time of the reading operation, and the feature amount of the character string together with the coordinates of the character string using the form type as an index.
5, a program memory 16 storing a form identification program, and an interface unit 17 for exchanging data (for example, controlling with a USB interface) between the image scanner 2, the keyboard 3, and the display device 4. The image memory 12, the feature memory 13,
And the control memory is a memory from which data can be read and written [the memory in this case is any type of internal memory (for example, RAM) and an external memory (for example, magnetic disk)]. It may be provided in each area. Also, the program memory 16
A memory that can read and write data,
It is desirable to use a non-volatile memory (for example, an EPROM, a magnetic disk, or the like) so as not to be lost even when the power is turned off.

【００１９】制御部１１の内部の構成は、制御部１１が
プログラムメモリ１６内の帳票識別プログラムを実行す
る場合の機能構成を示しており、帳票識別プログラムを
実行する場合の全体の動作の制御を行う共通部５１と、
イメージスキャナ２から読取った帳票のディジタル画像
に対して帳票の傾き補正と余白除去を行った後にその画
像を画像メモリ１２に保持（格納）する画像入力部５２
と、画像メモリ１２に保持された帳票のディジタル画像
から帳票のサイズを検出すると共に文字パターンを文字
列単位で検出して文字列単位で特徴量へ変換して特徴メ
モリ１３に保持する特徴抽出部５３と、特徴量メモリ１
３に保持される同一フォーマットで記入内容の異なる複
数の帳票から検出された文字列の特徴量から各帳票共通
の文字列の特徴量のみを選択して辞書メモリ１５へ書き
込む共通特徴抽出部５５と、特徴抽出部５３から出力さ
れる処理対象の帳票の文字列特徴量と辞書メモリ１５に
帳票種毎に保持された文字列特徴量とを照合し、特徴抽
出部５３から出力される処理対象の帳票の文字列特徴と
最も類似度の高い帳票種を決定して出力する照合部５４
とを含む。The internal configuration of the control unit 11 shows a functional configuration when the control unit 11 executes the form identification program in the program memory 16, and controls the overall operation when the form identification program is executed. A common unit 51 for performing;
An image input unit 52 that performs form inclination correction and margin removal on a form digital image read from the image scanner 2 and then holds (stores) the image in the image memory 12.
A feature extraction unit that detects the size of a form from a digital image of the form stored in the image memory 12, detects a character pattern in units of character strings, converts the character pattern into character amounts in units of character strings, and stores the characteristic amount in the characteristic memory 13. 53 and feature memory 1
A common feature extracting unit 55 that selects only the feature amount of a character string common to each form from the feature amounts of the character strings detected from a plurality of forms having different entry contents in the same format and written in the dictionary memory 15, The character string feature amount of the form to be processed output from the feature extracting unit 53 is compared with the character string feature amount held for each form type in the dictionary memory 15, and the processing target output from the feature extracting unit 53 is compared. A collating unit 54 for determining and outputting a form type having the highest similarity to the character string characteristics of the form
And

【００２０】特徴抽出部５３は、画像メモリ１２に保持
されている帳票のディジタル画像から帳票のサイズを検
出する帳票サイズ検出部５３１と、画像メモリ１２に保
持されている帳票のディジタル画像から文字パターンを
文字列単位で検出して文字列単位で特徴量へ変換し特徴
量メモリ１３へ書き込む文字列特徴検出部５３２とを含
む。A feature extraction unit 53 detects a form size from a digital image of the form stored in the image memory 12, and a character pattern from a digital image of the form stored in the image memory 12. And a character string feature detection unit 532 for converting the character string into a feature amount in a character string unit and writing the feature amount in the feature amount memory 13.

【００２１】図１を参照すると、図１の制御装置１のプ
ログラムメモリ１６に保持されている帳票プログラムを
制御部１１が実行した場合における制御部１１内の各機
能ブロックおよび各メモリ１２〜１５間の関係を示した
概略図である。Referring to FIG. 1, when the control unit 11 executes the form program stored in the program memory 16 of the control device 1 of FIG. 1, the function blocks in the control unit 11 and the memories 12 to 15 It is the schematic which showed the relationship of.

【００２２】図２を参照して、本発明の第１の実施の形
態を説明する。共通部５１の下で、画像入力部５２は、
イメージスキャナ２で得られる帳票表面を格子状にサン
プリングしてその濃度値を量子化して得られたディジタ
ル画像を読取った後、このディジタル画像中の帳票の傾
き補正と余白除去を行って得られた画像を画像メモリ１
２に格納する。この場合の画像入力部５２における帳票
の傾き補正と帳票周囲の余白除去の処理について、図９
を用いて説明する。Referring to FIG. 2, a first embodiment of the present invention will be described. Under the common unit 51, the image input unit 52
After reading the digital image obtained by sampling the surface of the form obtained by the image scanner 2 in a grid pattern and quantizing the density value, the inclination of the form in the digital image is corrected and margins are removed. Image to image memory 1
2 is stored. In this case, the process of correcting the inclination of the form and removing the margin around the form in the image input unit 52 is described with reference to FIG.
This will be described with reference to FIG.

【００２３】図９は、イメージスキャナ２で得られるデ
ィジタル画像ｆの例である。帳票の周囲部分の画素値を
１、帳票上の何も書かれていない部分の画素値を０、帳
票上の文字が書かれている部分の画素値を１で表す２値
画像であるとする。ただし、値１の画素を黒画素、値０
の画素を白画素とも呼ぶとする。画像ｆの座標系は、画
像左上隅を原点Oとし、原点から右方向へ＋X軸，原点か
ら下方向へ＋Y軸を取るｘｙ座標系とする。また、この
画像ｆ中における帳票の傾きは最大でも±４５度の範囲
内（＋４５度と−４５度は含まず）とする。まず、画像
入力部５２は、画像ｆ中の帳票の４隅の座標値を検出す
る。具体的には、例えば図９に示されるように画像ｆの
４隅から夫々４５度または１３５度の斜めの方向に画像
ｆを走査して背景を表す黒画素から帳票上（の空白）を
表す白画素へ変化する初めての点の座標を検出する。こ
のようにして画像入力部５２により帳票の４隅が検出さ
れる。ここでは以上のようにして求められた帳票の左上
隅をＰ１（x1, y1）、同右上隅をＰ２（x2, y2）、同左
下隅をＰ３（x3, y3）、同右下隅をＰ４（x4, y4）とす
る。画像ｆ中の帳票の傾きθは、例えばＰ１とＰ２の座
標値から arctan((x2−x1)／(y2−y1)) として求まる。
また、帳票上の領域は、帳票の上辺、下辺、左辺、右辺
を表す方程式をもとにして、不等式(1)(2)(3)(4)： y ≧((y2−y1)／(x2−x1))・x＋(x1・y2−x2・y1)／(x1−x2) ・・・・(1) y ≦((y4−y3)／(x4−x3))・x＋(x3・y4−x4・y3)／(x3−x4) ・・・・(2) x ≧((x3−x1)／(y3−y1))・y＋(y1・x3−y3・x1)／(y1−y3) ・・・・(3) x ≦((x4−x2)／(y4−y2))・y＋(y2・x4−y4・x2)／(y2−y4) ・・・・(4) を全て満たす領域として表現できる。また、帳票の幅W
と高さHはそれぞれ、 W=SQRT((x2−x1)・(x2−x1)+(y2−y1)・(y2−y1)) ・・・・(5) H=SQRT((x3−x1)・(x3−x1)+(y3−y1)・(y3−y1)) ・・・・(6) として求めることができる。ただし、SQRT（ｘ）はｘの
平方根を求める演算を表す。傾き補正および余白除去の
変換は、変換後の画像ｇの座標値(X, Y)から変換前の画
像ｆの座標値(x, y)を求める式： cosθ・X−sinθ・Y＋x1＝ｘ・・・・(7) sinθ・X＋cosθ・Y＋y1＝ｙ・・・・(8) を用いて行うことができる。つまり、ＸとＹの値を夫
々、X={X|0≦X≦W}、 Y＝{Y｜0≦Y≦H} の範囲で変化さ
せ、変換後の画像ｇ(X, Y) の画素値を式（７）（８）
によって変換前の画像ｆ(x, y)から求めることによって
生成することができる。ここで、ｘおよびｙの値が整数
にならない場合は、それぞれ四捨五入した値を用いると
する。変換後の画像上の位置は、帳票の左上隅を原点に
取り原点から右方向へ＋X、原点から下方向へ＋Yを取る
座標系で表すとする。以上の説明はスキャナで得られる
画像が２値画像でなく濃淡画像の場合でも、傾き補正お
よび余白除去の変換において一旦適切なしきい値によっ
て２値画像へ変換して帳票の４隅の座標値を検出した
後、元の濃淡画像を式（７）（８）で変換することによ
り適用できる。FIG. 9 is an example of a digital image f obtained by the image scanner 2. It is assumed that the image is a binary image in which the pixel value of a peripheral portion of the form is 1, the pixel value of a portion where nothing is written on the form is 0, and the pixel value of a portion where a character is written on the form is 1. . However, a pixel having a value of 1 is a black pixel,
Pixel is also called a white pixel. The coordinate system of the image f is an xy coordinate system with the origin O at the upper left corner of the image, the + X axis in the right direction from the origin, and the + Y axis in the downward direction from the origin. Also, the inclination of the form in the image f is within a range of ± 45 degrees at the maximum (excluding +45 degrees and -45 degrees). First, the image input unit 52 detects the coordinate values of the four corners of the form in the image f. More specifically, for example, as shown in FIG. 9, the image f is scanned in an oblique direction of 45 degrees or 135 degrees from each of the four corners of the image f, and a black pixel representing the background represents a blank space on the document. The coordinates of the first point that changes to a white pixel are detected. Thus, the four corners of the form are detected by the image input unit 52. Here, the upper left corner of the form obtained as described above is P1 (x1, y1), the upper right corner is P2 (x2, y2), the lower left corner is P3 (x3, y3), and the lower right corner is P4 (x4, y4). y4). The inclination θ of the form in the image f is obtained as arctan ((x2-x1) / (y2-y1)) from the coordinate values of P1 and P2, for example.
The area on the form is based on equations representing the top, bottom, left, and right sides of the form, and the inequalities (1) (2) (3) (4): y ≧ ((y2-y1) / ( x2-x1)) x + (x1-y2-x2-y1) / (x1-x2) ... (1) y ≤ ((y4-y3) / (x4-x3)) x + (x3-y4 −x4 · y3) / (x3−x4) (2) x ≧ ((x3−x1) / (y3−y1)) · y + (y1 · x3−y3 · x1) / (y1−y3)・・・・ (3) x ≦ ((x4-x2) / (y4-y2)) ・ y + (y2 ・ x4-y4 ・ x2) / (y2-y4) ・・・・ Area that satisfies all of (4) Can be expressed as Also, form width W
And height H are respectively W = SQRT ((x2-x1) ・ (x2-x1) + (y2-y1) ・ (y2-y1)) ・・・・ (5) H = SQRT ((x3-x1 ) · (X3−x1) + (y3−y1) · (y3−y1)) (6) Here, SQRT (x) represents an operation for obtaining the square root of x. In the conversion of the skew correction and the margin removal, an equation for calculating the coordinate value (x, y) of the image f before the conversion from the coordinate value (X, Y) of the image g after the conversion is: cos θ · X−sin θ · Y + x1 = x · (7) It can be performed by using sin θ · X + cos θ · Y + y1 = y (8) That is, the values of X and Y are changed in the range of X = {X | 0 ≦ X ≦ W} and Y = {Y | 0 ≦ Y ≦ H}, respectively, and the converted image g (X, Y) is The pixel value is calculated by the equations (7) and (8).
By converting the image f (x, y) before conversion. Here, when the values of x and y do not become integers, values rounded off are used. The position on the image after the conversion is represented by a coordinate system with the origin at the upper left corner of the form, + X from the origin to the right and + Y from the origin to + Y. In the above description, even when the image obtained by the scanner is not a binary image but a grayscale image, in the conversion of the inclination correction and the margin removal, the image is once converted into a binary image by an appropriate threshold value and the coordinate values of the four corners of the form are calculated. After the detection, it can be applied by converting the original gray-scale image according to equations (7) and (8).

【００２４】画像メモリ１２は、入力された帳票のディ
ジタル画像に対して傾き補正と余白除去を行った画像を
保持することになる。この後、画像入力部５２は、制御
を特徴抽出部５３に渡すため共通部５１に一旦戻す。The image memory 12 holds an image obtained by subjecting the input digital image of the form to inclination correction and margin removal. After that, the image input unit 52 temporarily returns the control to the common unit 51 in order to transfer the control to the feature extraction unit 53.

【００２５】共通部５１から制御を引き継いだ特徴抽出
部５３は、帳票サイズ検出部５３１で帳票のサイズを検
出し、文字列特徴検出部５３２で画像メモリ１２に保持
された帳票のディジタル画像から活字文字パターンを文
字列単位で検出し、文字列単位で特徴量へ変換する。な
お、この場合の活字文字パターンの文字列単位の検出に
ついては、従来、多くの報告がある。例えば、まず、画
像メモリに保持される前記２値画像に対して、黒画素の
連結成分単位にラベリング処理（黒画素の連結成分単位
にラベル値を付与して各黒画素の連結成分を区別して扱
えるようにする処理）を行い、得られたラベル図形成分
の中で上下左右方向で近接（数画素程度の距離に存在）
するもの同士を結合して新たなラベル図形成分とする。
次に、この結合処理の完了したラベル図形成分に対し
て、各々幅および高さを測定して文字らしいサイズのラ
ベル図形成分を選出する。この時、幅については文字間
隔の狭い文字同士が前記結合処理において連結している
ことを考慮して文字数個分の幅に相当する値を持つラベ
ル図形成分も文字として選出する。こうして選出された
ラベル図形成分に対して、ｘ方向の距離が近く、上端と
下端のｙ座標値の近いものを次々にグループ化する。グ
ループ化できるラベル図形成分がなくなった時点でグル
ープ化処理を終了し、グループ化されたラベル図形成分
の組を各々文字列パターンとする。The feature extracting unit 53 that has taken over the control from the common unit 51 detects the form size with the form size detecting unit 531, and the character string feature detecting unit 532 prints the form from the digital image of the form held in the image memory 12. A character pattern is detected for each character string, and is converted into a feature amount for each character string. It should be noted that there have been many reports on the detection of the character string unit of the print character pattern in this case. For example, first, for the binary image held in the image memory, a labeling process is performed on a connected component unit of a black pixel (a label value is assigned to a connected component unit of a black pixel to distinguish a connected component of each black pixel. Processing to make it possible to handle, and close in the vertical and horizontal directions in the obtained label graphic component (existing at a distance of about several pixels)
Are combined into a new label graphic component.
Next, the width and height of each of the label graphic components that have undergone the combining process are measured to select a label graphic component having a character-like size. At this time, regarding the width, a label graphic component having a value corresponding to the width of several characters is also selected as a character in consideration of the fact that characters with a narrow character interval are connected in the combining process. With respect to the label graphic components selected in this way, those having a shorter distance in the x direction and a smaller y coordinate value at the upper end and the lower end are sequentially grouped. When there are no more label graphic components that can be grouped, the grouping process ends, and each set of grouped label graphic components is used as a character string pattern.

【００２６】以上のようにして検出された文字列パター
ンに対して、特徴抽出部５３の文字列特徴検出部５３２
は、文字列単位の特徴量を抽出する。文字列単位の特徴
量は、図７に示されるように、対象とする文字列パター
ン上に２画素〜２０画素程度の幅（２００ＤＰＩ程度の
解像度の画像の場合）を持ち文字列と同じ高さを持つ特
徴量抽出用の窓領域を、文字列方向（横書き文字列では
横方向）に沿って１画素〜数十画素単位で移動させなが
ら、特徴抽出用の窓領域に含まれる文字パターンの特徴
を逐次抽出したものを用いる。このとき特徴抽出用の窓
領域は、図１０に示されるようにさらに上下にＮ（１〜
９程度）個（図１０では５個）の部分領域に分割する。
文字パターンの特徴はこの部分領域毎に集計する。The character string pattern detecting unit 532 of the characteristic extracting unit 53 applies the character string pattern detected as described above.
Extracts a feature amount in character string units. As shown in FIG. 7, the character string unit has a width of about 2 pixels to about 20 pixels (in the case of an image having a resolution of about 200 DPI) on the target character string pattern and has the same height as the character string. While moving the feature extraction window area having the following in the character string direction (horizontal direction in the horizontal writing character string) in units of one pixel to several tens of pixels, the feature of the character pattern included in the feature extraction window area is moved. Is used. At this time, as shown in FIG.
It is divided into (about 9) (5 in FIG. 10) partial areas.
The features of the character pattern are totaled for each of the partial areas.

【００２７】この文字パターンの特徴は文字パターンと
その背景の境界にある画素のこの境界に対する接線方向
（またはその接線方向に直交する方向）とする。接線方
向は、Ｎ（例えば８）方向に量子化して、図１０に示さ
れるように窓領域内に設定した部分領域毎に量子化した
方向別に画素数をカウントする。さらに１つの窓領域内
で検出された境界画素の総数がＫ（例えば１００）とな
るように各部分領域内で方向別に正規化する。つまり、
ある窓領域内で実際に検出された境界画素の総数がＫ
１、そのうち、その窓領域のある部分領域の１つの方向
にカウントされた画素数がＭだった場合、Ｍの値は、Ｍ
×Ｋ／Ｋ１のように正規化される。The feature of the character pattern is a tangent direction (or a direction orthogonal to the tangent direction) to a pixel at a boundary between the character pattern and its background. The tangent direction is quantized in the N (for example, 8) direction, and the number of pixels is counted in each of the quantized directions for each partial region set in the window region as shown in FIG. Furthermore, normalization is performed for each direction in each partial region so that the total number of boundary pixels detected in one window region is K (for example, 100). That is,
The total number of boundary pixels actually detected in a certain window area is K
1. If the number of pixels counted in one direction in a certain partial area of the window area is M, the value of M is M
It is normalized as × K / K1.

【００２８】前記文字パターンの特徴は他にも、部分領
域内の黒画素の密度を用いることもできる。例えば、”
１１１”のようにアラビア数字の１が並ぶ文字列の黒画
素密度の変動を窓領域内の分割領域が１として数量化
（窓領域内の黒画素の占める密度の割合で最大１００と
する）したパターンは、「０，１０，３０，９０，１０
０，９０，３０，１０，０，０，０，１０，３０，９
０，１００，９０，３０，０，０，０，１０，３０，９
０，１００，９０，３０，１０」のようになる。As another characteristic of the character pattern, the density of black pixels in the partial area can be used. For example, "
Variations in the black pixel density of a character string in which Arabic numerals 1 are lined up, such as 111 ″, are quantified assuming that the divided area in the window area is 1 (the maximum ratio of the density of black pixels in the window area is 100). The pattern is “0, 10, 30, 90, 10
0,90,30,10,0,0,0,10,30,9
0,100,90,30,0,0,0,10,30,9
0, 100, 90, 30, 10 ".

【００２９】特徴抽出部５３は、制御メモリ１４からモ
ードを読み出したときに帳票の読取り動作時のモードが
帳票登録モードでかつ未記入モードであれば、検出した
帳票サイズと検出した文字列毎の特徴量とを特徴量メモ
リ１３に保持し、共通特徴抽出部５５へ制御を引き渡す
ために共通部５１に一旦制御を戻す。この場合の特徴量
メモリ１３は、特徴抽出部５３で得られた帳票のサイズ
と文字列単位の特徴量とを保持することになる。特徴抽
出部５３は、帳票の読取り動作時のモードが帳票登録モ
ードでかつ記入済みモードであれば、オペレータの指示
があるまで、繰り返し入力された同一フォーマットで記
入内容の異なる帳票から文字列単位の特徴量を抽出して
帳票サイズと帳票毎に検出した文字列単位の特徴量の組
とを特徴メモリ１３に保持する。この場合の特徴量メモ
リ１３は、特徴抽出部５３で得られた帳票のサイズと同
一フォーマットで記入内容の異なる帳票から抽出された
複数組の文字列単位の特徴量とを保持することになる。
特徴メモリ１３は、同一フォーマットで記入内容の異な
る複数個の帳票から検出された文字列の特徴量を帳票毎
（この場合、辞書メモリ１５で未使用の識別コードを自
動的に付与する）に複数保持する容量を備える。なお、
文字列単位（または文字列ごと）の特徴量を特徴量メモ
リ１３または辞書メモリに保持（または登録）するとい
うことは、本発明では、文字列の座標情報「例えば左上
隅座標（左上隅からのＸ，Ｙ座標）と横方向の長さと縦
方向の高さとを含む」が文字列に相当し、その文字列の
座標情報毎に特徴量（パターン）を保持（または登録）
することを意味する。なお、この場合の座標毎に特徴量
を保持（または登録）するとは、座標情報が必要とされ
るメモリに書き込まれ、その書き込まれた座標情報をに
対比して特徴量が書き込まれることをいう。If the mode at the time of the form reading operation when the mode is read from the control memory 14 is the form registration mode and the blank mode when the mode is read from the control memory 14, the feature extracting unit 53 determines the detected form size and the detected character string for each character string. The characteristic amount is stored in the characteristic amount memory 13, and the control is temporarily returned to the common unit 51 in order to transfer the control to the common characteristic extracting unit 55. In this case, the feature amount memory 13 holds the size of the form obtained by the feature extraction unit 53 and the feature amount in character string units. If the mode at the time of the form reading operation is the form registration mode and the filled-in mode, the feature extracting unit 53 repeats the form entered in the same format and having different contents in the same format until the instruction of the operator. A feature amount is extracted, and a form size and a set of feature amounts per character string detected for each form are stored in the feature memory 13. In this case, the feature amount memory 13 holds the size of the form obtained by the feature extraction unit 53 and a plurality of sets of character string unit feature amounts extracted from forms having different entries in the same format.
The feature memory 13 stores a plurality of feature amounts of a character string detected from a plurality of forms having the same format and different entry contents for each form (in this case, an unused identification code is automatically added to the dictionary memory 15). It has a capacity to hold. In addition,
Holding (or registering) a feature amount for each character string (or for each character string) in the feature amount memory 13 or the dictionary memory means that in the present invention, the coordinate information of the character string “for example, the coordinates of the upper left corner (for example, (Including X, Y coordinates), horizontal length, and vertical height ”corresponds to a character string, and holds (or registers) a feature amount (pattern) for each piece of coordinate information of the character string.
Means to do. In this case, storing (or registering) a feature amount for each coordinate means that the coordinate information is written in a memory that requires it, and the feature amount is written in comparison with the written coordinate information. .

【００３０】制御メモリ１４からモードを読み出したと
きに帳票の読取り動作時のモードが帳票登録モードであ
れば、共通部５１は共通特徴抽出部５５へ制御を引き継
ぐ。共通部５１から制御を引き継いだ共通特徴抽出部５
５は、特徴量メモリ１３に保持された同一フォーマット
で記入内容の異なる帳票間で共通な文字列の座標とその
文字列特徴を、文字列の座標および文字列特徴量の同一
性から抽出する。文字列同士の座標値の差が一定値（数
画素程度）以内なら座標値は同一とみなし、文字列特徴
同士の類似度、つまり、互いの特徴量をそれぞれ１列に
並べた特徴ベクトルをｕ，ｖとするとき、ｕ・ｖ／‖ｕ
‖‖ｖ‖ （ただし、・は内積演算を表す）で計算され
る値が、一定値（例えば０．９５）以上であれば同一で
あるとみなす。ただし、文字列特徴の同一性の判定は従
来の様々な文字認識手法を応用して用いることも可能で
ある。When the mode at the time of the form reading operation when the mode is read from the control memory 14 is the form registration mode, the common unit 51 takes over the control to the common feature extracting unit 55. Common feature extraction unit 5 taking over control from common unit 51
Reference numeral 5 extracts the coordinates of the character string and the character string characteristics common to the forms having different entries in the same format stored in the characteristic amount memory 13 based on the character string coordinates and the character string characteristic amounts. If the difference between the coordinate values of the character strings is within a certain value (about several pixels), the coordinate values are considered to be the same, and the similarity between the character string features, that is, the feature vector in which the respective feature amounts are arranged in one column is u. , V, u · v / ‖u
If the value calculated by {v} (where * represents an inner product operation) is equal to or greater than a certain value (for example, 0.95), it is regarded as the same. However, it is also possible to determine the identity of the character string features by applying various conventional character recognition methods.

【００３１】抽出された共通な文字列の特徴量は、辞書
メモリ１５へ自動付与される帳票識別コードをインデッ
クスとして登録される。この場合、特徴量メモリ１３に
保持された帳票サイズも辞書メモリ１５に登録される。
すなわち、辞書メモリ１５には、帳票サイズと検出され
た文字列の座標値毎（文字列単位毎）の特徴量とが、帳
票識別コードをインデックスとして保持されることにな
る。The extracted characteristic amount of the common character string is registered with the form identification code automatically added to the dictionary memory 15 as an index. In this case, the form size held in the feature memory 13 is also registered in the dictionary memory 15.
That is, the dictionary memory 15 holds the form size and the feature amount of each coordinate value of the detected character string (each character string unit) using the form identification code as an index.

【００３２】特徴抽出部５３は、制御メモリ１４からモ
ードを読み出したときに帳票識別モードであれば、照合
部５４に制御を渡すために共通部５１に制御を一旦戻
す。If the mode is read out from the control memory 14 and the mode is the form identification mode, the feature extracting unit 53 returns the control to the common unit 51 once to pass the control to the collating unit 54.

【００３３】共通部５１から制御を引き継いだ照合部５
４は、帳票認識モードにおいて、特徴抽出部５３から出
力される処理対象の帳票のディジタル画像から検出され
た帳票サイズと文字列の座標値およびその特徴量を、辞
書メモリ１５に登録された帳票種と逐次照合する。処理
対象の帳票と同一サイズを持ち、該帳票種に登録された
全ての文字列について座標情報が処理対象の帳票上のい
ずれかの文字列と座標値が同一であり、かつ、その文字
列の特徴量が同一である場合、処理対象の帳票は該帳票
種であると判定してその判定結果を出力する。文字列の
座標値の同一性および文字列の特徴量の同一性の判定に
ついては、共通特徴抽出部の説明で述べた判定方法を用
いる。Collation unit 5 taking over control from common unit 51
Reference numeral 4 denotes a form type and a coordinate value of a character string detected from a digital image of a form to be processed output from the feature extracting unit 53 in the form recognition mode, and a feature amount of the form, registered in the dictionary memory 15. And collate sequentially. It has the same size as the form to be processed, the coordinate information of all the character strings registered in the form type has the same coordinate value as any character string on the form to be processed, and If the feature amounts are the same, the form to be processed is determined to be of the form type, and the determination result is output. For the determination of the identity of the coordinate values of the character strings and the identity of the feature amounts of the character strings, the determination method described in the description of the common feature extraction unit is used.

【００３４】次に、図１および図２を、図３および図４
のフローチャートを参照して本発明の実施の形態の全体
の動作について説明する。Next, FIGS. 1 and 2 are replaced with FIGS.
The overall operation of the embodiment of the present invention will be described with reference to the flowchart of FIG.

【００３５】本発明の帳票読取り時の動作は、帳票登録
モードと、帳票識別モードに分かれる。帳票登録モード
は、辞書メモリに帳票識別用の特徴量を登録するモード
であり、さらに未記入モードと記入済みモードとに分か
れる。未記入モードは未使用の帳票を用いて帳票の特徴
量を辞書メモリに登録するための処理であり、記入済み
モードは複数枚の記入済み帳票を用いて帳票の特徴量を
辞書メモリに登録するための処理である。帳票識別モー
ドは、辞書メモリに登録された特徴量と処理対象の帳票
から検出された特徴量とを照合して、処理対象の帳票の
種別を判定する処理である。The operation for reading a form according to the present invention is divided into a form registration mode and a form identification mode. The form registration mode is a mode for registering a feature amount for form identification in the dictionary memory, and is further divided into a blank mode and a filled mode. The blank mode is a process for registering the feature amount of the form in the dictionary memory using an unused form, and the completed mode registers the feature amount of the form in the dictionary memory using a plurality of completed forms. It is processing for. The form identification mode is a process in which a feature amount registered in the dictionary memory is compared with a feature amount detected from the form to be processed, and the type of the form to be processed is determined.

【００３６】先ず、オペレータが制御装置１を立上げ、
キーボード３から帳票識別プログラムを実行させると、
制御装置１の制御部１１は、プログラムメモリ１６に保
持されている帳票識別プログラムを読み出して実行す
る。すると、制御部１１の共通部５１は、インタフェー
ス部１７を介して、表示装置４にメニュー画面を表示す
る。このメニュー画面には、帳票登録モードにして処理
するか帳票識別モードに処理するかの選択を促す項目が
含まれている。更に、帳票登録モードには、未記入モー
ド（新規の帳票：帳票のフォーマットだけが記入）また
は記入済みモード（記入された帳票：帳票のフォーマッ
トに従って記入）の２種類があり、帳票識別モード選択
時にメニュー画面から選択できるようになっている。First, the operator starts up the control device 1,
When the form identification program is executed from the keyboard 3,
The control unit 11 of the control device 1 reads out and executes the form identification program stored in the program memory 16. Then, the common unit 51 of the control unit 11 displays a menu screen on the display device 4 via the interface unit 17. This menu screen includes items for prompting the user to select whether to perform processing in the form registration mode or in the form identification mode. Furthermore, there are two types of form registration modes: an unfilled mode (new form: only the form format is filled) and a filled mode (filled form: filled in according to the form format). It can be selected from the menu screen.

【００３７】オペレータがキーボード３を使用して、表
示部４のメニュー画面から「帳票登録モードで未記入モ
ード」を選択して実行させると、制御部１１の共通部５
１はあ、制御メモリ１４のモードエリアに「帳票登録モ
ードで未記入モード（例えば、帳票登録モードで未記入
モードを１６進表示で「００」、帳票登録モードで記入
済みモードを「０Ｆ」、帳票識別モードを１６進表示で
「ＦＦ」とする）」を登録する。登録した後、共通部５
１は、帳票の入力待ちとなる。When the operator uses the keyboard 3 to select and execute “non-form mode in form registration mode” from the menu screen of the display unit 4, the common unit 5 of the control unit 11
In the mode area of the control memory 14, "No entry mode in the form registration mode (for example," 00 "in hexadecimal notation mode in the form registration mode," 0F "in fill mode in the form registration mode, The form identification mode is set to "FF" in hexadecimal notation). " After registration, common part 5
1 is waiting for a form to be input.

【００３８】以下、帳票登録モードで未記入モードの動
作について説明する。オペレータがイメージスキャナ２
から未記入の帳票を入力すると、イメージスキャナ２
は、帳票表面を光学的に走査して光電変換を行い、帳票
のディジタル画像を生成する。その生成した帳票のディ
ジタル画像を共通部５１を介して画像入力部５２が読み
取って、帳票の傾き補正と帳票周囲の余白除去を行った
後に画像メモリ１２に保持する（図３のステップＡ０
１）。画像メモリ１２に保持した後、画像入力部５２
は、特徴抽出部５３に制御を引き継ぐため、共通部５１
に一旦制御を戻す。The operation of the form registration mode in the non-entry mode will be described below. The operator operates the image scanner 2
If you enter a blank form from the
Generates a digital image of a form by optically scanning the form surface and performing photoelectric conversion. The generated digital image of the form is read by the image input unit 52 via the common unit 51, and after correcting the form inclination and removing margins around the form, the digital image is stored in the image memory 12 (step A0 in FIG. 3).
1). After being stored in the image memory 12, the image input unit 52
Since the control is passed to the feature extraction unit 53, the common unit 51
Return control once.

【００３９】制御部５１から制御を引き継いだ特徴抽出
部５３は、読取った帳票の識別を行うために辞書メモリ
に登録されていない識別コードをランダムに発生させ、
帳票サイズ検出部５３１に対して画像メモリ１２に保持
された帳票のサイズを測定させ、発生させた識別コード
と測定した帳票のサイズを特徴量メモリ１３に書き込む
（ステップＡ０２）。更に、特徴抽出部５３は、文字列
特徴検出部５３２に対して画像メモリ１２に保持された
帳票の文字パターンを文字列単位で検出させ、検出され
た文字列単位の文字パターンから特徴量を算出させる
（ステップＡ０３，Ａ０４）。この場合、特徴抽出部５
３の文字列特徴検出部５３２は、特徴量を算出した文字
列毎に座標情報［例えば、左上隅座標（左上隅からの
Ｘ，Ｙ座標）と横方向の長さと高さを示す：文字列のエ
リアの位置を示す］を算出する。この座標情報は文字列
を識別に利用されることになる。特徴抽出部５３は、こ
の算出した座標情報と算出した特徴量とを先程発生させ
た識別コードをインデックスとして特徴量メモリ１３に
書き込む（ステップＡ０５）。このようにして特徴抽出
部５３は、文字列の存在がなくなるまで、ステップＡ０
３〜Ａ０５を繰り返す（ステップＡ０６）。すなわち、
特徴量メモリ１３には、識別コードをインデックスとし
たその帳票のサイズと、各文字列毎（座標情報毎）の特
徴量とが書き込まれることになる。この後、特徴抽出部
５３は、共通部５１に一旦制御を戻す。The feature extraction unit 53, which has taken over the control from the control unit 51, randomly generates an identification code not registered in the dictionary memory in order to identify the read form.
The form size detection unit 531 measures the size of the form held in the image memory 12, and writes the generated identification code and the measured form size to the feature memory 13 (step A02). Further, the feature extraction unit 53 causes the character string feature detection unit 532 to detect the character pattern of the form held in the image memory 12 in units of character strings, and calculates a feature amount from the detected character pattern in units of character strings. (Steps A03 and A04). In this case, the feature extraction unit 5
The character string feature detection unit 532 for each character string for which the feature amount is calculated indicates coordinate information [eg, upper left corner coordinates (X, Y coordinates from the upper left corner) and horizontal length and height: character string Indicating the position of the area of the area] is calculated. This coordinate information is used for identifying a character string. The feature extracting unit 53 writes the calculated coordinate information and the calculated feature amount into the feature amount memory 13 as an index using the identification code generated earlier (step A05). In this way, the feature extracting unit 53 determines in step A0 until there is no more character string.
The steps 3 to A05 are repeated (step A06). That is,
In the feature amount memory 13, the size of the form using the identification code as an index and the feature amount for each character string (for each coordinate information) are written. Thereafter, the feature extraction unit 53 returns control to the common unit 51 once.

【００４０】共通部５１は、制御メモリ１４のモードエ
リアからモードを読み出すと、帳票登録モードで未記入
モードに設定されているため、制御を共通特徴抽出部５
５に制御を渡す（ステップＡ０７）。When the mode is read out from the mode area of the control memory 14, the common unit 51 sets the form registration mode to the blank mode.
Then, control is passed to step 5 (step A07).

【００４１】帳票登録モードで未記入モードの制御を引
き継いだ共通特徴抽出部５５は、未記入モードになって
いるため、特徴メモリ１３に書き込まれた帳票サイズと
特徴メモリ１３に書き込まれた全ての文字列の座標情報
毎の特徴量とを、帳票識別コードをインデックスとして
辞書メモリに登録する（ステップＡ０６，Ａ０７，Ａ０
９）。この場合、共通特徴抽出部５５は、辞書メモリ１
５への登録時における帳票識別コードとして、オペレー
タに帳票識別コードを入力させるかそのまま（特徴量メ
モリ１３に書き込まれた帳票サイズ＋識別コードを帳票
識別コード）にするのかを促す画面をインタフェース部
１７を介して表示装置４に表示させる。すなわち、辞書
メモリには、帳票識別コードをインデックスとして、帳
票のサイズと文字列の座標情報毎の特徴量とが登録され
ることになる。In the form registration mode, the common feature extracting unit 55 which has taken over the control of the blank mode is in the blank mode, and therefore, the form size written in the feature memory 13 and all of the forms written in the feature memory 13 The feature amount of each character string coordinate information is registered in the dictionary memory using the form identification code as an index (steps A06, A07, A0).
9). In this case, the common feature extraction unit 55
A screen prompting the operator to input the form identification code or leave the form as it is (the form size written in the feature amount memory 13 + the form identification code as the form identification code) as the form identification code at the time of registration into the interface unit 17. Is displayed on the display device 4 via the. That is, in the dictionary memory, the form size and the characteristic amount for each coordinate information of the character string are registered using the form identification code as an index.

【００４２】次に、帳票登録モードで記入済みモードの
場合について説明するオペレータがキーボード３を使用して、表示部４のメニ
ュー画面から「帳票登録モードで記入済みモード」を選
択して実行させると、制御部１１の共通部５１は、制御
メモリ１４のモードエリアに「帳票登録モードで記入済
みモード」を登録する。登録した後、共通部５１は、帳
票の入力待ちとなる。Next, a description will be given of the case of the filled-out mode in the form registration mode. When the operator uses the keyboard 3 to select and execute "the filled-out mode in the form registration mode" from the menu screen of the display unit 4. The common unit 51 of the control unit 11 registers the “completed mode in the form registration mode” in the mode area of the control memory 14. After the registration, the common unit 51 waits for input of a form.

【００４３】オペレータがイメージスキャナ２から未記
入の帳票を入力すると、イメージスキャナ２は、帳票表
面を光学的に走査して光電変換を行い、帳票のディジタ
ル画像を生成する。その生成した帳票のディジタル画像
を共通部５１の画像入力部５２が読取って帳票の傾き補
正と帳票周囲の余白除去を行った上で画像メモリ１２に
保持する（図３のステップＡ０１）。画像メモリ１２に
保持した後、画像入力部５２は、特徴抽出部５３に制御
を渡すため、共通部５１に一旦制御を戻す。When the operator inputs a blank form from the image scanner 2, the image scanner 2 optically scans the surface of the form and performs photoelectric conversion to generate a digital image of the form. The generated digital image of the form is read by the image input unit 52 of the common unit 51, and after correcting the form inclination and removing margins around the form, the digital image is stored in the image memory 12 (step A01 in FIG. 3). After being stored in the image memory 12, the image input unit 52 temporarily returns control to the common unit 51 in order to pass control to the feature extraction unit 53.

【００４４】制御部５１から制御を引き継いだ特徴抽出
部５３は、読取った帳票の識別を行うために辞書メモリ
に登録されていない識別コードをランダムに発生させ、
帳票サイズ検出部５３１に対して画像メモリ１２に保持
された帳票のサイズを測定させ、算出した識別コードと
測定した帳票のサイズを特徴量メモリ１３に書き込む
（ステップＡ０２）。更に、特徴抽出部５３は、文字列
特徴検出部５３２に対して画像メモリ１２に保持された
帳票の文字パターンを文字列単位で検出させ、検出され
た文字列単位の文字パターンから特徴量を算出させる
（ステップＡ０３，Ａ０４）。この場合、特徴抽出部５
３の文字列特徴検出部５３２は、特徴量を算出した文字
列毎に座標情報［例えば、左上隅座標（左上隅からの
Ｘ，Ｙ座標）と横方向の長さと高さを示す：文字列のエ
リアの位置を示す］を算出する。特徴抽出部５３は、こ
の算出した座標情報と算出した特徴量とを先程算出した
識別コードをインデックスとして特徴量メモリ１３に書
き込む（ステップＡ０５）。このようにして特徴抽出部
５３は、文字列の存在がなくなるまで、ステップＡ０３
〜Ａ０５を繰り返す（ステップＡ０６）。すなわち、特
徴量メモリ１３には、識別コードをインデックスとした
その帳票のサイズと、各文字列毎（座標情報毎）の特徴
量が書き込まれることになる。この後、特徴抽出部５３
は、共通部５１に一旦制御を戻す。The feature extraction unit 53, which has taken over the control from the control unit 51, randomly generates an identification code not registered in the dictionary memory in order to identify the read form.
The form size detection unit 531 measures the size of the form held in the image memory 12, and writes the calculated identification code and the measured form size to the feature memory 13 (step A02). Further, the feature extraction unit 53 causes the character string feature detection unit 532 to detect the character pattern of the form held in the image memory 12 in units of character strings, and calculates a feature amount from the detected character pattern in units of character strings. (Steps A03 and A04). In this case, the feature extraction unit 5
The character string feature detection unit 532 for each character string for which the feature amount is calculated indicates coordinate information [eg, upper left corner coordinates (X, Y coordinates from the upper left corner) and horizontal length and height: character string Indicating the position of the area of the area] is calculated. The feature extraction unit 53 writes the calculated coordinate information and the calculated feature amount into the feature amount memory 13 using the previously calculated identification code as an index (step A05). In this way, the feature extracting unit 53 proceeds to step A03 until there is no more character string.
To A05 are repeated (step A06). That is, in the feature memory 13, the size of the form using the identification code as an index and the feature for each character string (each coordinate information) are written. Thereafter, the feature extraction unit 53
Returns control to the common unit 51 once.

【００４５】共通部５１は、制御メモリ１４のモードエ
リアからモードを読み出すと、「帳票登録モードで記入
済みモード」に設定されているため、表示部４に前回読
み込ませた帳票の同一のフォーマットで記入済みの帳票
を読み込ませるもの（追加するもの）があるかの選択を
オペレータに促す画面を表示する。オペレータが「追加
するもの」を選択した場合には、オペレータは先に処理
した帳票と同一フォーマットで記入内容の異なるものを
用意し、イメージスキャナ２から読み込ませることにな
る。追加するものがなくなるまで、制御部１１は、ステ
ップＡ０１〜Ａ０７の動作を繰り返す（ステップＡ０
８）。この場合、ステップＡ０５において、特徴量メモ
リ１３に書き込まれる帳票のサイズ、各文字列の座標情
報毎の特徴量は、未使用の識別コードに割り当てられた
特徴メモリ１３のエリアに書き込まれる。When the common section 51 reads the mode from the mode area of the control memory 14, the common mode is set to the "completed mode in the form registration mode". A screen is displayed that prompts the operator to select whether or not there is a form to be filled in (a form to be added). If the operator selects “additional item”, the operator prepares a form having the same format as that of the previously processed form but with different contents, and reads the form from the image scanner 2. The control unit 11 repeats the operations of steps A01 to A07 until there is no more to be added (step A0
8). In this case, in step A05, the size of the form written to the feature memory 13 and the feature of each character string for each coordinate information are written to the area of the feature memory 13 assigned to the unused identification code.

【００４６】ステップＡ０８において、追加するものが
なければ、共通部５１は、共通特徴抽出部５５に制御を
渡す。記入済みモードでの制御を引き継いだ共通特徴抽
出部５５は、記入済みモードになっているため、特徴量
メモリ１３に書き込まれたある識別コードにある文字列
の座標情報が別の識別コードにある文字列の座標情報と
が一致するかを全て比較する。一致するものがあれば、
共通特徴抽出部５５は、一致した座標情報に該当する特
徴量との類似度を算出し、あらかじめ決められた値以上
であれば、一致したものとして、そのときの文字列の座
標情報と特徴量を全て抽出する（ステップＡ０９）。こ
の場合、類似度を算出する共通特徴抽出部５５は、窓領
域毎のパターン値を比較し、一致した数を全体の数（合
計の窓領域の数）で割って得た比率を類似度として求め
る。すなわち、共通抽出部５５は、算出した類似度があ
らかじめ決められた値（例えば０．９５）以上であれ
ば、一致したものと判定する。従って、文字単位のパタ
ーン分解が困難なサイズの小さな文字でも文字列として
求めたパターンで比較するため、文字単位に分解する必
要はなく、処理も簡単となり、帳票識別精度を向上する
ことができる。In step A 08, if there is nothing to be added, common unit 51 passes control to common feature extraction unit 55. Since the common feature extraction unit 55 that has taken over the control in the filled mode is in the filled mode, the coordinate information of the character string in one identification code written in the feature amount memory 13 is in another identification code. All the character strings are compared to see if they match. If there is a match,
The common feature extraction unit 55 calculates the degree of similarity with the feature amount corresponding to the matched coordinate information, and if the similarity is equal to or greater than a predetermined value, it is determined that they match, and the coordinate information of the character string at that time and the feature amount Are all extracted (step A09). In this case, the common feature extraction unit 55 that calculates the similarity compares the pattern values for each window area, and divides the number of matches by the total number (the total number of window areas) to obtain a ratio obtained as the similarity. Ask. That is, if the calculated similarity is equal to or greater than a predetermined value (for example, 0.95), the common extraction unit 55 determines that they match. Therefore, even small-sized characters, which are difficult to decompose in character units, are compared using the pattern obtained as a character string, so that it is not necessary to decompose them in character units, processing is simplified, and form identification accuracy can be improved.

【００４７】抽出が終わると、共通特徴抽出部５５は、
帳票識別コードをインデックスとして、抽出した全ての
文字列の座標情報毎の特徴量と特徴メモリ１３に書き込
まれた帳票のサイズとを辞書メモリ１５に登録する（ス
テップＡ０６，Ａ０７，Ａ０９）。この場合、共通特徴
抽出部５５は、辞書メモリ１５への登録時における帳票
識別コードとして、オペレータに帳票識別コードを入力
させるかそのまま（特徴量メモリ１３に書き込まれた識
別コードを帳票識別コード）にするのかを促す画面をイ
ンタフェース部１７を介して表示装置４に表示させる。
すなわち、辞書メモリには、帳票識別コードをインデッ
クスとして、帳票のサイズと文字列の座標情報毎の特徴
量とが登録されることになる。When the extraction is completed, the common feature extraction unit 55
Using the form identification code as an index, the feature amount of each extracted character string for each piece of coordinate information and the size of the form written in the feature memory 13 are registered in the dictionary memory 15 (steps A06, A07, A09). In this case, the common feature extraction unit 55 allows the operator to input the form identification code as the form identification code at the time of registration in the dictionary memory 15 or leave the identification code written in the feature memory 13 as the form identification code. A screen prompting the user to do so is displayed on the display device 4 via the interface unit 17.
That is, in the dictionary memory, the form size and the characteristic amount for each coordinate information of the character string are registered using the form identification code as an index.

【００４８】次に、帳票識別モードの場合について説明
するオペレータがキーボード３を使用して、表示部４のメニ
ュー画面から「帳票識別モード」を選択して実行させる
と、制御部１１の共通部５１は、制御メモリ１４のモー
ドエリアに「帳票識別モード」を登録する。登録した
後、共通部５１は、帳票の入力待ちとなる。Next, the case of the form identification mode will be described. When the operator selects and executes the "form identification mode" from the menu screen of the display unit 4 using the keyboard 3, the common unit 51 of the control unit 11 Registers "form identification mode" in the mode area of the control memory 14. After the registration, the common unit 51 waits for input of a form.

【００４９】オペレータがイメージスキャナ２から帳票
を入力すると、イメージスキャナ２は、帳票表面を光学
的に走査して光電変換を行い、帳票のディジタル画像を
生成する。その生成した帳票のディジタル画像を共通部
５１の画像入力部５２が読取って帳票の傾き補正と帳票
周囲の余白除去を行った上で画像メモリ１２に保持する
（図４のステップＢ０１）。画像メモリ１２に保持した
後、画像入力部５２は、特徴抽出部５３に制御を渡すた
め、共通部５１に一旦制御を戻す。When the operator inputs a form from the image scanner 2, the image scanner 2 optically scans the surface of the form and performs photoelectric conversion to generate a digital image of the form. The generated digital image of the form is read by the image input unit 52 of the common unit 51, the inclination of the form is corrected, the margin around the form is removed, and the form is stored in the image memory 12 (step B01 in FIG. 4). After being stored in the image memory 12, the image input unit 52 temporarily returns control to the common unit 51 in order to pass control to the feature extraction unit 53.

【００５０】制御部５１から制御を引き継いだ特徴抽出
部５３は、読取った帳票の識別を行うために辞書メモリ
で未使用の識別コードを発生させ、帳票サイズ検出部５
３１に対して画像メモリ１２に保持された帳票のサイズ
を測定させ、算出した識別コードと測定した帳票のサイ
ズを特徴量メモリ１３に書き込む（ステップＢ０２）。
更に、特徴抽出部５３は、文字列特徴検出部５３２に対
して画像メモリ１２に保持された帳票の文字パターンを
文字列単位で検出させ、検出された文字列単位の文字パ
ターンから特徴量を算出させる（ステップＢ０３，Ｂ０
４）。この場合、特徴抽出部５３の文字列特徴検出部５
３２は、特徴量を算出した文字列毎に座標情報［例え
ば、左上隅座標（左上隅からのＸ，Ｙ座標）と横方向の
長さと高さを示す：文字列のエリアの位置を示す］を算
出する。特徴抽出部５３は、この算出した座標情報と算
出した特徴量とを先程発生させた識別コードをインデッ
クスとして特徴量メモリ１３に書き込む（ステップＢ０
５）。このようにして特徴抽出部５３は、文字列の存在
がなくなるまで、ステップＢ０３〜Ｂ０５を繰り返す
（ステップＢ０６）。すなわち、特徴量メモリ１３に
は、識別コードをインデックスとしたその帳票のサイズ
と、各文字列毎（座標情報毎）の特徴量を書き込まれる
ことになる。この後、特徴抽出部５３は、共通部５１に
一旦制御を戻す。以上は、帳票登録モードの動作におけ
るステップＡ０１〜Ａ０６までの処理と同じである。The feature extracting unit 53, which has taken over the control from the control unit 51, generates an unused identification code in the dictionary memory to identify the read form.
31 is made to measure the size of the form held in the image memory 12, and the calculated identification code and the measured form size are written in the feature amount memory 13 (step B02).
Further, the feature extraction unit 53 causes the character string feature detection unit 532 to detect the character pattern of the form held in the image memory 12 in units of character strings, and calculates a feature amount from the detected character pattern in units of character strings. (Steps B03 and B0)
4). In this case, the character string feature detection unit 5 of the feature extraction unit 53
Reference numeral 32 denotes coordinate information for each character string for which the feature amount has been calculated [for example, the upper left corner coordinates (X, Y coordinates from the upper left corner) and the length and height in the horizontal direction: indicating the position of the area of the character string] Is calculated. The feature extracting unit 53 writes the calculated coordinate information and the calculated feature amount into the feature amount memory 13 as an index using the identification code generated earlier (step B0).
5). In this way, the feature extracting unit 53 repeats steps B03 to B05 until there is no more character string (step B06). That is, the feature amount memory 13 is written with the size of the form using the identification code as an index and the feature amount for each character string (each coordinate information). Thereafter, the feature extraction unit 53 returns control to the common unit 51 once. The above is the same as the processing in steps A01 to A06 in the operation in the form registration mode.

【００５１】共通部５１は、制御メモリ１４のモードエ
リアからモードを読み出すと、「帳票識別モード」に設
定されているため、制御を照合部５４に渡す。照合部５
４は、特徴量メモリ１３に書き込まれた特徴量と辞書メ
モリ１５に登録されている同じ帳票サイズの特徴量を逐
次読み出して照合を行う。例えば、辞書メモリ１５に登
録されている帳票Ｃでは、まず、サイズの一致を確認し
た後、帳票Ｃの文字列の座標値を逐次読み出して処理対
象の帳票から検出された全ての文字列の座標値と比較す
る。座標値が一致した場合には、照合部５４は、文字列
の特徴量同士の類似度を算出し、類似度があらかじめ定
めたしきい値以上であれば文字列が一致したと見なす。
更に、照合部５４は、辞書メモリ１５に登録された帳票
Ｃの全ての文字列が処理対象の帳票上にすべて存在した
時、処理対象の帳票の種別を前記帳票Ｃと判定する（ス
テップＢ０７）。この場合、処理対象の帳票の種別が判
定された時には、照合部５４は、辞書メモリに登録され
た未照合の帳票との照合は行わない。よって、出現頻度
の高い帳票から順に辞書メモリへ登録しておくものとす
る。なお、照合部５４が類似度を算出して文字列を一致
とみなす手順は、共通特徴抽出部５５が類似度を算出し
て文字列を一致とみなす手順と同じであるので説明を省
略する。When the common section 51 reads the mode from the mode area of the control memory 14, it passes the control to the collation section 54 because the mode is set to the “form identification mode”. Collation unit 5
Reference numeral 4 sequentially reads out the feature amount written in the feature amount memory 13 and the feature amount of the same form size registered in the dictionary memory 15 and performs collation. For example, in the form C registered in the dictionary memory 15, first, after confirming the matching of the sizes, the coordinate values of the character strings of the form C are sequentially read out, and the coordinates of all the character strings detected from the processing target form are read. Compare with value. When the coordinate values match, the collation unit 54 calculates the similarity between the characteristic amounts of the character strings, and determines that the character strings match if the similarity is equal to or greater than a predetermined threshold value.
Further, when all the character strings of the form C registered in the dictionary memory 15 are all present on the form to be processed, the matching unit 54 determines the type of the form to be processed as the form C (step B07). . In this case, when the type of the form to be processed is determined, the matching unit 54 does not perform the matching with the unmatched form registered in the dictionary memory. Therefore, it is assumed that the forms with the highest appearance frequency are registered in the dictionary memory in order. The procedure in which the matching unit 54 calculates the similarity and regards the character string as a match is the same as the procedure in which the common feature extraction unit 55 calculates the similarity and regards the character string as a match, and a description thereof will be omitted.

【００５２】次に、本発明の第１の実施の形態における
具体例を用いて説明する。図５に示されるような具体例
における未記入の帳票Ａを辞書登録する動作について説
明する。この場合、図５に示される未記入の帳票Ａにお
いて、文字列２と文字列３は文字サイズが小さく（例え
ば高さ２ｍｍ程度）、文字間隔も狭い（例えば０．５ｍ
ｍ）ため、ＯＣＲで通常利用される２００ＤＰＩの解像
度の画像からは文字単位の切出しが困難な文字列の例で
ある。Next, a description will be given using a specific example of the first embodiment of the present invention. An operation of registering a blank form A in a dictionary in a specific example as shown in FIG. 5 will be described. In this case, in the blank form A shown in FIG. 5, the character strings 2 and 3 have a small character size (for example, about 2 mm in height) and a small character spacing (for example, 0.5 m).
m) Therefore, this is an example of a character string that is difficult to cut out in character units from an image with a resolution of 200 DPI normally used in OCR.

【００５３】先ず、イメージスキャナ２の帳票の読取り
によってディジタル画像が生成され、画像入力部５２に
より傾きを補正され余白を除去された帳票画像に変換さ
れ画像メモリ１２に保持される。特徴抽出部５３では、
画像メモリ１２のディジタル画像から帳票サイズと文字
列１、文字列２、文字列３の３つの文字列が検出され
る。３つの文字列からはそれぞれ特徴量が抽出され、特
徴メモリ１３へ書き込まれる。更に、特徴メモリ１３か
らオペレータの与える帳票識別コードをインデックスと
して文字列（座標情報）に対応した特徴量が辞書メモリ
０６へ書き込まれる。First, a digital image is generated by reading a form by the image scanner 2, converted into a form image whose inclination has been corrected by the image input unit 52 and whose margin has been removed, and is stored in the image memory 12. In the feature extraction unit 53,
A form size and three character strings of character strings 1, 2, and 3 are detected from the digital image in the image memory 12. A feature amount is extracted from each of the three character strings, and is written to the feature memory 13. Further, the feature amount corresponding to the character string (coordinate information) is written from the feature memory 13 to the dictionary memory 06 using the form identification code given by the operator as an index.

【００５４】次に、図６に示されるような具体例におけ
る記入済みの帳票Ａを辞書登録する動作について説明す
る。先ず、図６の帳票Ａ１がイメージスキャナ２から画
像入力部５２を介して読込まれ、画像メモリ１２に保持
される。特徴抽出部５３では、画像メモリ１２のディジ
タル画像から帳票サイズと文字列１、文字列２、文字列
３、文字列４、文字列５の５つの文字列が検出される。
５つの文字列からはそれぞれ特徴量が抽出され特徴量メ
モリ１３へ書き込まれる。次に、図６の帳票Ａ２が同様
にイメージスキャナ２から画像入力部５２を介して読込
まれ、画像メモリ１２に保持され、特徴抽出部５３によ
って、帳票サイズと文字列１’、文字列２’、文字列
３’、文字列４’、文字列５’が検出され、特徴量メモ
リ１３へ書き込まれる。次に、共通特徴抽出部５５にお
いて、帳票Ａ１と帳票Ａ２に含まれる文字列が比較され
る。その結果、文字列１と文字列１’は位置情報と特徴
量が共に一致する。文字列２と文字列２’、文字列３と
文字列３’も同様である。しかしながら、文字列４と文
字列４’は位置情報は一致するものの文字パターンが異
なるため、特徴量の照合結果は一致しない。文字列５と
文字列５’も同様に位置情報は一致するが、特徴量の照
合結果は一致しない。よって、帳票Ａの識別用の特徴量
は文字列１（文字列１’）、文字列２（文字列２’）、
文字列３（文字列３’）に限定されて辞書メモリ１５へ
書き込まれる。Next, the operation of registering the filled-out form A in the dictionary in a specific example as shown in FIG. 6 will be described. First, the form A1 in FIG. 6 is read from the image scanner 2 via the image input unit 52, and is held in the image memory 12. The feature extraction unit 53 detects a form size and five character strings of character strings 1, character strings 2, character strings 3, character strings 4, and character strings 5 from the digital image in the image memory 12.
A feature amount is extracted from each of the five character strings and written into the feature amount memory 13. Next, the form A2 in FIG. 6 is similarly read from the image scanner 2 via the image input unit 52, is held in the image memory 12, and is recorded by the feature extraction unit 53 on the form size and the character strings 1 'and 2'. , Character string 3 ′, character string 4 ′, and character string 5 ′ are detected and written to the feature memory 13. Next, in the common feature extraction unit 55, the character strings included in the form A1 and the form A2 are compared. As a result, both the character string 1 and the character string 1 ′ have the same position information and feature amount. The same applies to character strings 2 and 2 'and character strings 3 and 3'. However, although the character string 4 and the character string 4 ′ have the same position information but different character patterns, the matching results of the feature amounts do not match. Similarly, the position information of the character string 5 matches the position information of the character string 5 ′, but the matching result of the feature amount does not match. Therefore, the feature amount for identification of the form A is character string 1 (character string 1 ′), character string 2 (character string 2 ′),
Only the character string 3 (character string 3 ′) is written to the dictionary memory 15.

【００５５】つまり、同一フォーマットの記入済み帳票
を複数枚用意することによって、辞書登録すべき文字列
をオペレータが指定する等の作業を必要とせず、未記入
帳票を用意するのと同様にオペレータの負担を増大させ
ることなく帳票識別用の特徴量の辞書登録ができる。In other words, by preparing a plurality of filled-in forms in the same format, the operator does not need to specify a character string to be registered in the dictionary or the like. It is possible to register a feature amount for form identification in a dictionary without increasing the burden.

【００５６】図５または図６の帳票を用いて登録された
辞書メモリ１５を使って帳票識別を行う場合には、先
ず、処理対象の帳票がイメージスキャナ２から画像入力
部５２を介して読込まれ、帳票の傾き補正と帳票周囲の
余白除去が行われた後に画像メモリ１２に保持される。
特徴抽出部５３では、画像メモリ１２のディジタル画像
から帳票サイズと含まれている文字列が全て検出され
る。その後、辞書登録された帳票Ａと帳票サイズが照合
された後、帳票サイズが一致値したなら、辞書登録され
ている帳票Ａの文字列１、文字列２、文字列３の位置情
報および文字列パターンの特徴量がそれぞれ順に、処理
対象の帳票から検出された文字列の特徴量と照合され
る。その結果、文字列１，文字列２、文字列３の全てが
処理対象の帳票から検出された文字列と一致すれば、処
理対象の帳票は帳票Ａであると判定される。When performing form identification using the dictionary memory 15 registered using the form shown in FIG. 5 or 6, the form to be processed is first read from the image scanner 2 via the image input unit 52. After the inclination of the form is corrected and the margin around the form is removed, the image is held in the image memory 12.
The feature extraction unit 53 detects the form size and all the included character strings from the digital image in the image memory 12. Then, after the form A registered in the dictionary is compared with the form size, if the form size matches, the position information and the character string of the character strings 1, 2, and 3 of the form A registered in the dictionary The feature amounts of the patterns are sequentially collated with the feature amounts of the character strings detected from the form to be processed. As a result, if all of the character strings 1, 2, and 3 match the character string detected from the form to be processed, the form to be processed is determined to be form A.

【００５７】次に、本発明の第２の実施の形態について
図面を参照して説明する。図８を参照すると、本発明の
第２の実施の形態は、パソコン等の情報処理装置である
制御装置１と、帳票を読み出すイメージスキャナ２と、
制御装置１に文字等を入力するキーボード３と、制御装
置１の制御の下で表示を行う表示装置４（例えば、ＣＲ
Ｔ，ＬＣＤ等）と、フロッピディスク装置であるＦＤＤ
５と、帳票識別プログラムを記録した記録媒体６とから
構成される。この記録媒体６は磁気ディスク、半導体メ
モリその他の記録媒体であってよい。この場合のＦＤＤ
５は、その記録媒体に対応した記憶制御装置となる。Next, a second embodiment of the present invention will be described with reference to the drawings. Referring to FIG. 8, a second embodiment of the present invention includes a control device 1 which is an information processing device such as a personal computer, an image scanner 2 for reading out a form,
A keyboard 3 for inputting characters and the like to the control device 1 and a display device 4 (for example, CR
T, LCD, etc.) and FDD which is a floppy disk drive
5 and a recording medium 6 on which a form identification program is recorded. The recording medium 6 may be a magnetic disk, a semiconductor memory, or another recording medium. FDD in this case
Reference numeral 5 denotes a storage control device corresponding to the recording medium.

【００５８】すなわち、第２の実施の形態は、ＦＤＤ５
と記録媒体６とが追加した点で第１の実施の形態で示す
図２と相違する。従って、制御装置１の内部の構成は、
図２の制御装置１の内部と同じなので説明を省略する。That is, in the second embodiment, the FDD5
FIG. 2 is different from FIG. 2 in the first embodiment in that a recording medium 6 and a recording medium 6 are added. Therefore, the internal configuration of the control device 1 is as follows.
The description is omitted because it is the same as the inside of the control device 1 of FIG.

【００５９】次に、本発明の第２の実施の形態の動作に
ついて説明する。帳票識別プログラムを記録した記録媒
体６をＦＤＤ５に挿入し、制御装置１に内蔵されている
ＦＤＤ読み出し用のプログラム（例えば、パソコンであ
れば、エクスプローラ）を実行すると、記録媒体６にあ
る帳票識別プログラムがプログラムメモリ１６に読み込
まれる。すなわち、プログラムメモリ１６に帳票識別プ
ログラムが格納されたことになり、これ以降の動作につ
いては、第１の実施の形態における動作と同じなので説
明を省略する。Next, the operation of the second embodiment of the present invention will be described. When the recording medium 6 on which the form identification program is recorded is inserted into the FDD 5 and the FDD readout program (for example, Explorer in the case of a personal computer) built in the control device 1 is executed, the form identification program on the recording medium 6 Is read into the program memory 16. That is, the form identification program is stored in the program memory 16, and the subsequent operation is the same as the operation in the first embodiment, and the description is omitted.

【００６０】以上説明したように、本発明は、帳票上の
文字を検出して特徴量に変換する際に、１つ１つの文字
に分解することなく文字列単位に検出するために、文字
切り出し誤りを修正するオペレータの作業負担を軽減す
ることができる。As described above, according to the present invention, when a character on a form is detected and converted into a feature value, the character is cut out in order to detect the character string unit without decomposing the character one by one. The work load of the operator who corrects the error can be reduced.

【００６１】また、本発明は、辞書登録に利用する帳票
が記入済みの帳票であった場合や、郵便振込帳票のよう
に帳票レイアウトが一部しか規定されていない場合で
も、特徴メモリの保持される複数組の文字列の特徴量か
ら各組共通の特徴量を抽出して辞書メモリに書き込むよ
うにしているため、オペレータの確認作業を軽減するこ
とができる。Further, according to the present invention, even when a form used for dictionary registration is a completed form or when only a part of the form layout is defined as in a postal transfer form, the feature memory is retained. Since the feature amount common to each set is extracted from the feature amounts of a plurality of sets of character strings and written into the dictionary memory, the operator's confirmation work can be reduced.

【００６２】また、本発明は、文字特徴を文字単位に分
解することなく文字列単位で文字列の特徴量を照合する
ため、文字単位のパターン分解が困難なサイズの小さな
文字も識別特徴として利用ができ、帳票識別精度が向上
する。In the present invention, since character string features are collated on a character string basis without decomposing the character features on a character basis, small-sized characters for which pattern decomposition on a character basis is difficult are also used as identification features. And the form identification accuracy is improved.

【００６３】[0063]

【発明の効果】以上説明したようの、本発明の第１の効
果は、文字パターンの間隔が狭かったり、サイズの小さ
な文字でも帳票識別の特徴として辞書登録できるため、
認識精度が向上することにある。その理由は、文字パタ
ーンの照合を文字単位ではなく文字列単位に行うため、
文字列内で文字パターンを切り分けて検出する必要がな
いためである。また、本発明の第２の効果は、オペレー
タの作業負担を軽減できることにある。その理由は、第
１の効果で述べたように従来不可能であった間隔の狭い
文字パターンや小さい文字パターンの登録が可能になる
他、登録できない文字列が無くなるためと、記入済み帳
票を複数画像入力して共通な文字列での文字パターンを
抽出する手段を備えるため、プレ印字のみを自動的に選
択して辞書登録できるためである。オペレータが帳票識
別の特徴として登録する文字列を選択して指定する必要
がなくなる。As described above, the first effect of the present invention is that a character pattern can be registered in a dictionary as a form identification feature even if the character pattern interval is narrow or a small character is used.
This is to improve recognition accuracy. The reason is that character pattern matching is performed not on a character basis but on a character string basis,
This is because there is no need to separate and detect character patterns in a character string. A second effect of the present invention is that the work load on the operator can be reduced. The reason for this is that, as described in the first effect, it is possible to register a character pattern with a narrow interval or a small character pattern that was impossible in the past, and there is no character string that cannot be registered. This is because a means for inputting an image and extracting a character pattern in a common character string is provided, so that only preprinting can be automatically selected and registered in a dictionary. This eliminates the need for the operator to select and specify a character string to be registered as a form identification feature.

[Brief description of the drawings]

【図１】図２の制御部がプログラムメモリに格納されて
いる帳票識別プログラムを実行した場合の機能ブロック
レベルでの動作の流れを示す構成概略図である。FIG. 1 is a schematic configuration diagram showing a flow of an operation at a functional block level when a control unit of FIG. 2 executes a form identification program stored in a program memory.

【図２】本発明の第１の実施の形態の構成を示すブロッ
ク図である。FIG. 2 is a block diagram illustrating a configuration of a first exemplary embodiment of the present invention.

【図３】図２の制御部が帳票登録モード時における動作
を示すフローチャートである。FIG. 3 is a flowchart showing an operation of the control unit in FIG. 2 in a form registration mode.

【図４】図２の制御部が帳票識別モード時における動作
を示すフローチャートである。FIG. 4 is a flowchart showing an operation of the control unit in FIG. 2 in a form identification mode.

【図５】未記入の帳票例を示した概略図である。FIG. 5 is a schematic diagram showing an example of a blank form;

【図６】記入済みの帳票例を示した概略図である。FIG. 6 is a schematic view showing an example of a completed form.

【図７】図２の特徴抽出部が帳票の文字列から特徴を抽
出する概要を示した概略図である。FIG. 7 is a schematic diagram showing an outline of a feature extraction unit of FIG. 2 extracting a feature from a character string of a form.

【図８】本発明の第２の実施の形態の構成を示すブロッ
ク図である。FIG. 8 is a block diagram showing a configuration of a second exemplary embodiment of the present invention.

【図９】図２の画像入力部が帳票の傾きを補正して帳票
周囲の余白を除去する概要を示した概略図である。9 is a schematic diagram showing an outline in which the image input unit in FIG. 2 corrects the inclination of the form and removes a margin around the form.

【図１０】図７の窓領域を「住」の位置で拡大した図で
ある。FIG. 10 is an enlarged view of the window area of FIG. 7 at the position of “house”.

[Explanation of symbols]

１制御装置２イメージスキャナ３キーボード４表示装置５ＦＤＤ６記録媒体１１制御部１２画像メモリ１３特徴量メモリ１４制御メモリ１５辞書メモリ１６プログラムメモリ１７インタフェース部５１共通部５２画像入力部５３特徴抽出部５４照合部５５共通特徴抽出部５３１帳票サイズ検出部５３２文字列特徴検出部 REFERENCE SIGNS LIST 1 control device 2 image scanner 3 keyboard 4 display device 5 FDD 6 recording medium 11 control unit 12 image memory 13 feature memory 14 control memory 15 dictionary memory 16 program memory 17 interface unit 51 common unit 52 image input unit 53 feature extraction unit 54 Matching unit 55 Common feature extraction unit 531 Form size detection unit 532 Character string feature detection unit

Claims

[Claims]

1. A form identification method for causing an information processing apparatus to identify a form, comprising detecting a character string from a digital image of an input blank form, extracting coordinate information for each detected character string, Furthermore, a feature amount is extracted while moving a window area for limiting a rectangular feature extraction range having the same height as the character string along the character string, and the coordinate information of the extracted character string and A form identification method characterized in that a form identification format is created by registering characteristic amounts in a memory.

2. A form identification method for causing an information processing apparatus to identify a form, wherein a character string is detected from each of digital images of a plurality of completed forms in the same format and having different contents, and the detected character is detected. Extract coordinate information for each column, and further extract a feature amount while moving a window area that limits a rectangular feature extraction range having the same height as the character string along the character string, The coordinate information and the feature amount of each of the extracted character strings are compared and collated for all of a plurality of filled-out forms having different entry contents in the same input format, and the coordinate information and the feature amount are compared. A form identification method characterized in that a form identification format is created by registering coordinate information and a feature amount of a matching character string in a memory.

3. A form identification method for causing an information processing apparatus to identify a form, wherein a character string is detected from a digital image of an input form, and coordinate information and a feature amount of each detected character string are extracted. A form identification method characterized by comparing the coordinate information and the characteristic amount of the extracted character string with the coordinate information and the characteristic amount of the character string registered in advance for each type of the form to identify the form type. .

4. A form identification method for causing an information processing apparatus to identify a form, wherein when an unfilled form is entered in a new format, a digital image of the entered form is stored in a first memory. Holding step;
A size measuring step of measuring a form size from the digital image held in the memory; detecting a character string from the digital image held in the first memory; calculating coordinate information of the detected character string; A feature amount calculating step of calculating a feature amount while moving a window region for limiting a rectangular feature extraction range having the same height as the character string along the character string on the character string; and the calculated coordinate information. A second storing step of storing the characteristic amount and the characteristic amount in a second memory; a first repeating step of repeating the characteristic amount calculating step and the second storing step until there is no longer a character string; When the first repetition step is completed, the form size held in the first memory and the feature amounts for each coordinate information of all the character strings held in the second memory are stored in the third memory. Form identification method characterized by comprising a registration step of recording.

5. A form identification method for causing an information processing apparatus to identify a form, wherein when a form written in a format is input, a first image storing a digital image of the input form in a first memory. A holding step; a generating step of generating an identification code for identifying the input form; a size measuring step of measuring a form size from a digital image held in the first memory; A character string is detected from the digital image held in the memory, the coordinate information of the detected character string is calculated, and a window area for limiting a rectangular feature extraction range having the same height as the character string is defined as a character. A feature value calculating step of calculating a feature value while moving along a character string on a column, and storing the calculated coordinate information and the feature value in a second memory based on the identification code. A second holding step, a first repeating step of repeating the feature amount calculating step and the second holding step until there is no longer a character string, and a form input after the first repeating step. When a form filled in in the same format is entered,
A second repeating step of repeating the generating step, the size measuring step, the feature amount calculating step, the second holding step, and the first repeating step;
After the end of the second repetition step, the coordinate information of the character string in the certain identification code stored in the second memory matches the coordinate information of the character string in another identification code stored in the second memory. And if there is a match in the comparison step, calculate the similarity with the feature amount corresponding to the coordinate information and match if it is equal to or greater than a predetermined value. An extraction step of extracting all the coordinate information and the characteristic amount of the character string, and extracting the characteristic amount for each coordinate information of all the extracted character strings and the form size written in the second memory after the extraction. And a registration step of registering the form in a memory.

6. A form identification method for causing an information processing apparatus to identify a form, wherein, when a form written in a format is input, a first image storing a digital image of the input form in a first memory. A holding step; a size measuring step of measuring a form size from the digital image held in the first memory; and a character string detected from the digital image held in the first memory, and A feature amount calculating step of calculating coordinate information and further calculating a feature amount while moving a window region for limiting a rectangular feature extraction range having the same height as the character string along the character string on the character string; And a second holding step of holding the calculated coordinate information and the feature amount in a second memory;
A first repetition step of repeating the feature amount calculation step and the second holding step until there is no longer a character string, and the second repetition step after the first repetition step is completed.
Of the character string held in the memory of the third form and the third form having the same form size as the form size stored in the second memory.
A comparing step of comparing the coordinate information of the character string of the form registered in the memory of the form, and calculating a similarity between the characteristic amounts of the character strings when the character strings match in the comparing step, and calculating the similarity between the character strings with a predetermined value or more. And determining the type of the form by determining that the character strings match, if any.

7. A form identification program for identifying a form to be processed by a computer, wherein when an unfilled form is input in a new format, a digital image of the input form is held in a first memory. A first holding step; a size measuring step of measuring a form size from the digital image held in the first memory;
Detecting a character string from the digital image held in the first memory, calculating coordinate information of the detected character string,
A feature amount calculating step of calculating a feature amount while moving a window region for limiting a rectangular feature extraction range having the same height as the character string along the character string, and the calculated coordinates; A second holding step of holding information and a feature amount in a second memory, a first repetition step of repeating the feature amount calculating step and the second holding step until there is no longer a character string; Upon completion of the first repetition step, registration for registering the form size held in the first memory and the feature amount for each coordinate information of all the character strings held in the second memory in a third memory A form identification program characterized by causing the computer to execute a process including the steps of:

8. A form identification program for identifying a form to be processed by a computer, wherein when a form written in a format is input, a digital image of the input form is stored in a first memory. 1, a generating step of generating an identification code for identifying the input form, a size measuring step of measuring a form size from a digital image stored in the first memory, A window area for detecting a character string from the digital image held in the first memory, calculating coordinate information of the detected character string, and further restricting a rectangular feature extraction range having the same height as the character string. Calculating a feature amount while moving the character along the character string on the character string; and storing the calculated coordinate information and the feature amount in the second memory. A second holding step of holding based on another code, and a first step of repeating the feature amount calculating step and the second holding step until there is no longer a character string
And when a form filled in in the same format as the entered form is input after the end of the first repeating step, the generating step, the size measuring step, the feature amount calculating step, and the 2, a second repetition step of repeating the holding step and the first repetition step, and after the end of the second repetition step, the coordinate information of the character string in the identification code stored in the second memory is stored in the second memory. A comparing step of comparing whether or not the character string matches the coordinate information of a character string in another identification code held in the second memory; and if there is a match in the comparing step, a feature corresponding to the matched coordinate information If the degree of similarity with the amount is equal to or greater than a predetermined value, it is determined that they match, and the extraction step for extracting all the coordinate information and the characteristic amount of the character string. And a registration step of registering, in the third memory, a feature amount for each coordinate information of all extracted character strings and a form size written in the second memory after the extraction is completed. A form identification program characterized by being executed by a user.

9. A form identification program for identifying a form to be processed by a computer, wherein when a form entered in a format is inputted, a digital image of the entered form is stored in a first memory. 1; a size measuring step of measuring a form size from the digital image held in the first memory; and a character string detected from the digital image held in the first memory by detecting a character string. A feature amount that calculates coordinate information of a column and further calculates a feature amount by moving a window region that limits a rectangular feature extraction range having the same height as the character string along the character string. A calculating step, a second holding step of holding the calculated coordinate information and the feature amount in a second memory, and a feature amount calculating step until there is no longer a character string. A first repetition step of repeating the first and second holding steps, a coordinate of a character string held in the second memory after the first repetition step is completed, and a character string stored in the second memory. Comparing the coordinate information of the character string of the form registered in the third memory with the same form size as the form size of the form, and determining the similarity between the characteristic amounts of the character strings when the character strings match in the comparison step. And determining the character string matches if the calculated value is equal to or greater than a predetermined value and causing the computer to execute processing including an identification step of identifying the type of the form.

10. A form identification method for identifying a form comprising an input device for converting a form into a digital image when the form is input, and a control device for performing a form identification process, wherein the control device reads the form from the input device. Image input means for storing a digital image of the form, and detecting the size of the form from the digital image of the form held by the image input means, and detecting the character pattern in units of character strings and converting them into feature amounts in units of character strings Feature extraction means for holding
When registering the format of the form to be input, a common feature extracting unit that extracts a feature amount common to each set from the feature amounts of a plurality of sets of character strings output from the feature extracting unit and registers the feature amount in a dictionary memory; When the format of the input form is identified, the characteristic amount of the character string of the processing target form output from the characteristic extracting unit and the characteristic amount of the character string held for each form in the dictionary memory in advance are used. A form identification method, comprising: a collation means for collating.