JP2001312696A

JP2001312696A - Document sorting device, character reader, genuineness judging device and method for these

Info

Publication number: JP2001312696A
Application number: JP2001101803A
Authority: JP
Inventors: Naoya Tanaka; 直哉田中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2001-03-30
Filing date: 2001-03-30
Publication date: 2001-11-09
Anticipated expiration: 2018-03-27
Also published as: JP3606218B2

Abstract

PROBLEM TO BE SOLVED: To speedily and precisely sort a document by using the fast and precise collation of a character string. SOLUTION: Picture data of a slip including a character string is fetched from a picture input device 1 and stored in a picture storing part 21. A slip position detecting means 31 detects the position of the slip in the stored picture data. In accordance with this detecting result, a coordinate transformation means 32 transforms the coordinate system of coordinates where the character string has to be positioned in the slip stored in a coordinate storing area 22a to the picture coordinate system being the coordinate system of the picture data. The deviation of the coordinates of the character in the picture data stored in the part 21 is corrected by the unit of the character string by a character string coordinate value correcting means 33 and by the unit of the character by a character coordinate value correcting means 34. A graphic collation means 35 extracts the feature quantity of each character in picture data whose coordinates are corrected and collates it with a collating featured value stored in a collating feature storing area 22b. In accordance with this collating result, an output device 9 outputs the code information on each character stored in a code information storing area 81a.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字列照合を利用
した文書分類装置、文字読み取り装置及び真贋判定装
置、並びにこれらの方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document classifying apparatus utilizing character string collation, a character reading apparatus, an authenticity judging apparatus, and a method thereof.

【０００２】[0002]

【従来の技術】近年、情報処理のコンピュータ化が進ん
でおり、例えば、帳票の画像をイメージスキャナによっ
て読み取り、読み取った画像中の文字列をコード情報に
変換したデータをデータベースなどに投入することが行
われている。この場合、データの投入を正確に行うため
には、コンピュータとしては帳票の構造を知る必要があ
り、所定の書式の帳票であるかどうかを判別し、書式毎
に分類する必要がある。2. Description of the Related Art In recent years, computerization of information processing has been advanced. For example, an image of a form is read by an image scanner, and data obtained by converting a character string in the read image into code information is input to a database or the like. Is being done. In this case, in order to correctly input data, the computer needs to know the structure of the form, and it is necessary to determine whether or not the form is in a predetermined format and to classify the form for each form.

【０００３】このような帳票の分類の方法として、読み
取った帳票上の特定の位置に記載されている帳票名など
の文字列のパターンを予め記憶されているパターン（テ
ンプレートなど）と照合するものがある。しかしなが
ら、帳票上の文字列は、一般に印刷ずれや読み取りの際
に生じる歪みによって位置ずれや、読み取りの際に生じ
る画像濃度の局所的な変動が生じるため、正確に切り出
すことができない。このように不正確に切り出された文
字列のパターンを予め記憶されているパターンと照合し
ても文字列の有無を判定することは困難であり、帳票の
分類を行うことができない。As a method of classifying such forms, a method of collating a character string pattern such as a form name described at a specific position on a read form with a previously stored pattern (such as a template) is known. is there. However, a character string on a form cannot be cut out accurately because, in general, misalignment occurs due to printing misalignment or distortion during reading, and local fluctuation in image density occurs during reading. It is difficult to determine the presence or absence of a character string even if the pattern of the character string cut out incorrectly as described above is compared with a pattern stored in advance, and the form cannot be classified.

【０００４】そこで、従来の文字列照合方法（以下、従
来例１という）として、次のようなものがあった。この
方法では、画像の濃度情報に基づいて文字部分と背景部
分とを分離する閾値を求め、この閾値によって画像を２
値化して階調が同じ画素が連結した連結成分を検出す
る。そして、物理的な距離が近い連結成分を矩形で囲ん
で得られる文字単位の図形成分を検出し、検出した文字
単位の図形成分を予め登録された文字テンプレートと照
合していた。Therefore, there is the following conventional character string collation method (hereinafter referred to as Conventional Example 1). In this method, a threshold value for separating a character portion from a background portion is obtained based on density information of an image, and the image is divided into two by this threshold value.
A connected component in which pixels having the same gradation are connected by a value is detected. Then, a graphic component in a character unit obtained by surrounding a connected component having a short physical distance with a rectangle is detected, and the detected graphic component in a character unit is collated with a character template registered in advance.

【０００５】また、特開平７−２３９９１２号公報に示
されている技術（以下、従来例２という）では、サンプ
ル画像であるテンプレートの位置を、読み取った画像中
の各文字の近傍の領域内で変更しながら、各位置で文字
とテンプレートとの相互相関値を計算し、この相互相関
値に従って文字照合を行っていた。In the technique disclosed in Japanese Patent Application Laid-Open No. 7-239912 (hereinafter referred to as Conventional Example 2), the position of a template, which is a sample image, is determined within a region near each character in a read image. While changing, a cross-correlation value between the character and the template is calculated at each position, and character matching is performed according to the cross-correlation value.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、上記従
来例１では、１つの文字領域内で背景部分の濃度が変化
する場合には、文字部分と背景部分とを分離するための
適切に閾値を設定できないという問題点があった。すな
わち、いずれの閾値に設定しても、文字の一部を背景部
分と誤ってしまうか、背景の一部を文字部分と誤ってし
まうという場合が生じ、文字単位の図形成文を予め登録
されたテンプレートと正しく照合することができなかっ
た。However, in the above-mentioned prior art example 1, when the density of the background portion changes in one character region, an appropriate threshold value is set for separating the character portion from the background portion. There was a problem that it was not possible. That is, no matter which threshold is set, a part of a character may be mistaken for a background part or a part of a background may be mistaken for a character part. Could not match template correctly.

【０００７】また、上記従来例２では、テンプレートの
位置を変更する度に、その前の位置での相互相関値と比
較しなければならない。そのため、必要となる演算コス
トが非常に大きく、文字列の照合を高速に実行すること
が困難であるという問題点があった。Further, in the above-mentioned conventional example 2, every time the position of the template is changed, it must be compared with the cross-correlation value at the previous position. For this reason, there has been a problem that the required operation cost is extremely large, and it is difficult to perform character string collation at high speed.

【０００８】本発明は、上記従来例の問題点を解消する
ためになされたものであり、印刷ずれや画像入力時に生
じた歪みによる位置ずれなどを高速、高精度に補正する
文字列照合を利用することによって、高速かつ正確に帳
票等の文書を分類する文書分類装置及び方法を提供する
ことを目的とする。The present invention has been made in order to solve the above-described problems of the conventional example, and utilizes a character string collation for correcting a print shift or a position shift due to a distortion generated at the time of image input with high speed and high accuracy. Accordingly, an object of the present invention is to provide a document classification device and method for quickly and accurately classifying documents such as forms.

【０００９】本発明は、また、上記の文字列照合を利用
することによって、高速かつ正確に文書上の文字列をコ
ード情報に変換する文字読み取り装置及び方法を提供す
ることを目的とする。Another object of the present invention is to provide a character reading apparatus and method for converting a character string in a document into code information at high speed and accurately by utilizing the above character string collation.

【００１０】本発明は、さらに、上記の文字列照合を利
用することによって、高速かつ正確に有価証券等の真贋
を判定する真贋判定装置及び方法を提供することを目的
とする。A further object of the present invention is to provide an authentication apparatus and method for quickly and accurately determining the authenticity of securities or the like by utilizing the above-described character string collation.

【００１１】[0011]

【課題を解決するための手段】上記目的を達成するた
め、本発明の第１の観点にかかる文書分類装置は、文字
列を含む所定の書式の文書から取り込んだ多階調の画像
データを記憶する画像記憶手段と、前記文書中で文字列
の各文字が位置すべき座標と、前記文字列が有するべき
特徴量と、前記文字列を含む文書の書式を識別するため
の識別情報とを対応付けて記憶する辞書記憶手段と、前
記画像記憶手段に記憶されている画像データ中での前記
文書の位置を検出する位置検出手段と、前記位置検出手
段の検出結果に従って、前記文書中で文字列中の各文字
が位置すべき座標の座標系を前記文書を元にした文書座
標系から前記画像データの取り込み状態を元にした画像
座標系に変換する座標変換手段と、前記画像記憶手段に
記憶された画像データに含まれる文字列全体の座標と前
記座標変換手段によって座標系が変換された前記文書中
で文字列が位置すべき座標とのずれを文字列単位で補正
する文字列座標補正手段と、前記文字列座標補正手段で
文字列を単位として座標が補正された文字列中に含まれ
る各文字の座標と前記座標変換手段によって座標系が変
換された前記文字列の各文字が位置すべき座標とのずれ
を文字単位で補正する文字座標補正手段と、前記文字座
標補正手段で各文字の座標が補正された前記文書から取
り込まれた画像データ中の文字列の特徴量を抽出し、抽
出した特徴量を前記辞書記憶手段に記憶されている前記
文字列が有するべき特徴量と照合する図形照合手段と、
前記図形照合手段による照合結果に従って、前記辞書記
憶手段に記憶されている識別情報を出力する出力手段
と、を備えることを特徴とする。In order to achieve the above object, a document classifying apparatus according to a first aspect of the present invention stores multi-gradation image data taken from a document in a predetermined format including a character string. Image storage means, coordinates of the position of each character of the character string in the document, feature values that the character string should have, and identification information for identifying the format of the document containing the character string. Dictionary storage means for adding and storing; a position detection means for detecting the position of the document in the image data stored in the image storage means; and a character string in the document according to a detection result of the position detection means. Coordinate conversion means for converting a coordinate system of coordinates where each character in the document should be located from a document coordinate system based on the document to an image coordinate system based on a state of capturing the image data, and stored in the image storage means. Image data Character string coordinate correction means for correcting a deviation between the coordinates of the entire character string included in the document and the coordinates at which the character string should be located in the document whose coordinate system has been converted by the coordinate conversion means in units of character strings; and The coordinates of each character included in the character string whose coordinates have been corrected in units of the character string by the column coordinate correction means and the coordinates where each character of the character string whose coordinate system has been converted by the coordinate conversion means should be located. A character coordinate correcting means for correcting the displacement in units of characters, and a characteristic amount of the character string in the image data taken from the document in which the coordinates of each character are corrected by the character coordinate correcting means, and the extracted characteristic amount Figure matching means for comparing the character string stored in the dictionary storage means with a feature amount to be possessed,
Output means for outputting the identification information stored in the dictionary storage means in accordance with the result of matching by the graphic matching means.

【００１２】上記目的を達成するため、本発明の第２の
観点にかかる文字読み取り装置は、文字列を含む所定の
書式の文書から取り込んだ多階調の画像データを記憶す
る画像記憶手段と、前記文書中で文字列の各文字が位置
すべき座標と、前記文字列が有するべき特徴量と、前記
文字列に対応する文字コードとを対応付けて記憶する辞
書記憶手段と、前記画像記憶手段に記憶されている画像
データ中での前記文書の位置を検出する位置検出手段
と、前記位置検出手段の検出結果に従って、前記文書中
で文字列中の各文字が位置すべき座標の座標系を前記文
書を元にした文書座標系から前記画像データの取り込み
状態を元にした画像座標系に変換する座標変換手段と、
前記画像記憶手段に記憶された画像データに含まれる文
字列全体の座標と前記座標変換手段によって座標系が変
換された前記文書中で文字列が位置すべき座標とのずれ
を文字列単位で補正する文字列座標補正手段と、前記文
字列座標補正手段で文字列を単位として座標が補正され
た文字列中に含まれる各文字の座標と前記座標変換手段
によって座標系が変換された前記文字列の各文字が位置
すべき座標とのずれを文字単位で補正する文字座標補正
手段と、前記文字座標補正手段で各文字の座標が補正さ
れた前記文書から取り込まれた画像データ中の文字列の
特徴量を抽出し、抽出した特徴量を前記辞書記憶手段に
記憶されている前記文字列が有するべき特徴量と照合す
る図形照合手段と、前記図形照合手段による照合結果に
従って、前記辞書記憶手段に記憶されている文字コード
を出力する出力手段と、を備えることを特徴とする。In order to achieve the above object, a character reading apparatus according to a second aspect of the present invention comprises: an image storage means for storing multi-gradation image data taken from a document in a predetermined format including a character string; A dictionary storage means for storing coordinates in which each character of the character string should be located in the document, a feature quantity to be included in the character string, and a character code corresponding to the character string, and the image storage means Position detecting means for detecting the position of the document in the image data stored in the image data, and a coordinate system of coordinates where each character in a character string should be located in the document according to the detection result of the position detecting means. Coordinate conversion means for converting from a document coordinate system based on the document to an image coordinate system based on the state of capture of the image data,
A deviation between the coordinates of the entire character string included in the image data stored in the image storage means and the coordinates where the character string should be located in the document whose coordinate system has been converted by the coordinate conversion means is corrected in character string units. Character string coordinate correcting means, and the coordinates of each character included in the character string whose coordinates have been corrected in units of the character string by the character string coordinate correcting means, and the character string whose coordinate system has been converted by the coordinate converting means. Character coordinate correcting means for correcting a deviation from the coordinates where each character should be located in character units, and a character string in image data taken from the document in which the coordinates of each character have been corrected by the character coordinate correcting means. A graphic matching means for extracting a feature quantity and matching the extracted feature quantity with a feature quantity to be possessed by the character string stored in the dictionary storage means; and And output means for outputting the character code stored in 憶 means, characterized in that it comprises a.

【００１３】上記目的を達成するため、本発明の第３の
観点にかかる真贋判定装置は、文字列を含む所定の書式
の文書から取り込んだ多階調の画像データを記憶する画
像記憶手段と、前記文書中で文字列の各文字が位置すべ
き座標と、前記文字列が有するべき特徴量と、前記特徴
量の許容域とを対応付けて記憶する辞書記憶手段と、前
記画像記憶手段に記憶されている画像データ中での前記
文書の位置を検出する位置検出手段と、前記位置検出手
段の検出結果に従って、前記文書中で文字列中の各文字
が位置すべき座標の座標系を前記文書を元にした文書座
標系から前記画像データの取り込み状態を元にした画像
座標系に変換する座標変換手段と、前記画像記憶手段に
記憶された画像データに含まれる文字列全体の座標と前
記座標変換手段によって座標系が変換された前記文書中
で文字列が位置すべき座標とのずれを文字列単位で補正
する文字列座標補正手段と、前記文字列座標補正手段で
文字列を単位として座標が補正された文字列中に含まれ
る各文字の座標と前記座標変換手段によって座標系が変
換された前記文字列の各文字が位置すべき座標とのずれ
を文字単位で補正する文字座標補正手段と、前記文字座
標補正手段で各文字の座標が補正された前記文書から取
り込まれた画像データ中の文字列の特徴量を抽出し、抽
出した特徴量を前記辞書記憶手段に記憶されている前記
文字列が有するべき特徴量と照合する図形照合手段と、
前記図形照合手段の照合した特徴量の違いが前記辞書記
憶手段に記憶されている許容域の範囲にあるかどうかを
判定する真贋判定手段と、を備えることを特徴とする。[0013] To achieve the above object, an authenticity determination device according to a third aspect of the present invention comprises image storage means for storing multi-tone image data taken from a document in a predetermined format including a character string, A dictionary storage unit that stores coordinates in which the characters of the character string should be located in the document, a feature amount that the character string should have, and an allowable range of the feature amount in association with each other; Position detecting means for detecting the position of the document in the image data being processed, and a coordinate system of coordinates where each character in a character string should be located in the document according to the detection result of the position detecting means. Coordinate conversion means for converting from a document coordinate system based on the image data to an image coordinate system based on the state of capture of the image data; and coordinates of the entire character string included in the image data stored in the image storage means and the coordinates Conversion means A character string coordinate correcting unit for correcting a deviation from a coordinate at which a character string should be located in the document whose coordinate system has been converted, in units of character strings; and Character coordinate correcting means for correcting a deviation between the coordinates of each character included in the corrected character string and the coordinates at which each character of the character string whose coordinate system has been converted by the coordinate converting means should be located in character units; Extracting a characteristic amount of a character string in image data taken from the document in which the coordinates of each character are corrected by the character coordinate correcting unit, and extracting the extracted characteristic amount from the character stored in the dictionary storage unit. Graphic matching means for matching with a feature quantity that the column should have;
Authenticity judging means for judging whether or not the difference in the feature amount collated by the graphic collating means is within a range of an allowable range stored in the dictionary storage means.

【００１４】上記の第１〜第３の観点にかかる文書分類
装置、文字読み取り装置及び真贋判定装置では、文字列
照合を利用して、それぞれ文書の分類、文字読み取り、
或いは有価証券などの真贋の判定を高速、かつ正確に行
うことができる。In the document classification device, character reading device and authenticity judging device according to the first to third aspects, document classification, character reading, and character reading are performed using character string collation.
Alternatively, the authenticity of a security or the like can be determined quickly and accurately.

【００１５】上記目的を達成するため、本発明の第４の
観点にかかる文書分類方法は、文書中で文字列の各文字
が位置すべき座標と、前記文字列が有するべき特徴量
と、前記文字列を含む文書の書式を識別するための識別
情報とを対応付けて記憶した辞書を有するコンピュータ
システムにおいて文書を分類する方法であって、文字列
を含む所定の書式の文書から取り込んだ多階調の画像デ
ータをメモリに記憶させるステップと、前記メモリに記
憶された画像データ中での前記文書の位置を検出するス
テップと、検出した前記文書の位置に従って、前記文書
中で文字列中の各文字が位置すべき座標の座標系を前記
文書を元にした文書座標系から前記画像データの取り込
み状態を元にした画像座標系に変換するステップと、前
記メモリに記憶された画像データに含まれる文字列全体
の座標と画像座標系に変換された前記文書中で文字列が
位置すべき座標とのずれを文字列単位で補正するステッ
プと、文字列単位で座標が補正された文字列中に含まれ
る各文字の座標と画像座標系に変換された前記文書中で
文字列の各文字が位置すべき座標とのずれを文字単位で
補正するステップと、各文字の座標が補正された前記文
書から取り込まれた画像データ中の文字列の特徴量を抽
出し、抽出した特徴量を前記辞書に記憶されている前記
文字列が有するべき特徴量と照合するステップと、前記
照合結果に従って、前記辞書に記憶されている識別情報
を出力するステップと、を含むことを特徴とする。In order to achieve the above object, a document classification method according to a fourth aspect of the present invention is a document classification method, comprising: a coordinate at which each character of a character string should be located in a document; A method for classifying a document in a computer system having a dictionary in which identification information for identifying a format of a document including a character string is stored in association with the identification information, wherein a multi-level document read from a document in a predetermined format including a character string is provided. Storing the key image data in a memory; detecting the position of the document in the image data stored in the memory; and detecting each position of the character string in the document according to the detected position of the document. Converting the coordinate system of the coordinates where the characters should be located from the document coordinate system based on the document to the image coordinate system based on the state of capture of the image data; and Correcting the deviation between the coordinates of the entire character string included in the image data and the coordinates where the character string should be located in the document converted into the image coordinate system in units of character strings, and correcting the coordinates in units of character strings. Correcting in each character a deviation between the coordinates of each character included in the character string and the coordinates where each character of the character string should be located in the document converted into the image coordinate system; and Extracting a characteristic amount of a character string in the image data taken in from the corrected document, and comparing the extracted characteristic amount with a characteristic amount to be included in the character string stored in the dictionary; Outputting the identification information stored in the dictionary according to the result.

【００１６】上記目的を達成するため、本発明の第５の
観点にかかる文書読み取り方法は、文書中で文字列の各
文字が位置すべき座標と、前記文字列が有するべき特徴
量と、前記文字列に対応する文字コードとを対応付けて
記憶した辞書を有するコンピュータシステムにおいて文
字を読み取る方法であって、文字列を含む所定の書式の
文書から取り込んだ多階調の画像データをメモリに記憶
させるステップと、前記メモリに記憶された画像データ
中での前記文書の位置を検出するステップと、検出した
前記文書の位置に従って、前記文書中で文字列中の各文
字が位置すべき座標の座標系を前記文書を元にした文書
座標系から前記画像データの取り込み状態を元にした画
像座標系に変換するステップと、前記メモリに記憶され
た画像データに含まれる文字列全体の座標と画像座標系
に変換された前記文書中で文字列が位置すべき座標との
ずれを文字列単位で補正するステップと、文字列単位で
座標が補正された文字列中に含まれる各文字の座標と画
像座標系に変換された前記文書中で文字列の各文字が位
置すべき座標とのずれを文字単位で補正するステップ
と、各文字の座標が補正された前記文書から取り込まれ
た画像データ中の文字列の特徴量を抽出し、抽出した特
徴量を前記辞書に記憶されている前記文字列が有するべ
き特徴量と照合するステップと、前記照合結果に従っ
て、前記辞書に記憶されている文字コードを出力するス
テップと、を含むことを特徴とする。In order to achieve the above object, a document reading method according to a fifth aspect of the present invention provides a document reading method, comprising the steps of: A method for reading characters in a computer system having a dictionary storing character codes corresponding to character strings in association with each other, wherein multi-tone image data taken from a document in a predetermined format including the character strings is stored in a memory. Causing the document to be located in the image data stored in the memory; and, according to the detected location of the document, the coordinates of the coordinates where each character in the character string should be located in the document. Converting the system from a document coordinate system based on the document to an image coordinate system based on the state of capturing the image data; Correcting the deviation between the coordinates of the entire character string to be converted and the coordinates where the character string should be located in the document converted into the image coordinate system in units of character strings; and Correcting the deviation between the coordinates of each character included in the document and the coordinates where each character of the character string should be located in the document converted into the image coordinate system on a character-by-character basis; and Extracting a feature amount of a character string in the image data captured from the document, comparing the extracted feature amount with a feature amount to be possessed by the character string stored in the dictionary, and according to the matching result, Outputting the character code stored in the dictionary.

【００１７】上記目的を達成するため、本発明の第６の
観点にかかる真贋判定方法は、前記文書中で文字列の各
文字が位置すべき座標と、前記文字列が有するべき特徴
量と、前記特徴量の許容域とを対応付けて記憶した辞書
を有するコンピュータシステムにおいて文書の真贋を判
定する方法であって、文字列を含む所定の書式の文書か
ら取り込んだ多階調の画像データをメモリに記憶させる
ステップと、前記メモリに記憶された画像データ中での
前記文書の位置を検出するステップと、検出した前記文
書の位置に従って、メモリに予め記憶されている前記文
書中で文字列中の各文字が位置すべき座標の座標系を前
記文書を元にした文書座標系から前記画像データの取り
込み状態を元にした画像座標系に変換するステップと、
前記メモリに記憶された画像データに含まれる文字列全
体の座標と画像座標系に変換された前記文書中で文字列
が位置すべき座標とのずれを文字列単位で補正するステ
ップと、文字列単位で座標が補正された文字列中に含ま
れる各文字の座標と画像座標系に変換された前記文書中
で文字列の各文字が位置すべき座標とのずれを文字単位
で補正するステップと、各文字の座標が補正された前記
文書から取り込まれた画像データ中の文字列の特徴量を
抽出し、抽出した特徴量を前記辞書に記憶されている前
記文字列が有するべき特徴量と照合するステップと、照
合した特徴量の違いが前記辞書に記憶されている許容域
の範囲にあるかどうかを判定するステップと、を含むこ
とを特徴とする。In order to achieve the above object, an authenticity judgment method according to a sixth aspect of the present invention comprises: a coordinate where each character of a character string should be located in the document; A method for judging the authenticity of a document in a computer system having a dictionary in which the allowable range of the feature amount is stored in association with the dictionary, wherein multi-tone image data taken from a document in a predetermined format including a character string is stored in a memory. And detecting the position of the document in the image data stored in the memory, and according to the detected position of the document, a character string in the document stored in the memory in advance. Converting a coordinate system of coordinates where each character should be located from a document coordinate system based on the document to an image coordinate system based on a state of capturing the image data;
Correcting the deviation between the coordinates of the entire character string included in the image data stored in the memory and the coordinates where the character string should be located in the document converted into the image coordinate system in units of character strings; Correcting the difference between the coordinates of each character included in the character string whose coordinates have been corrected in units and the coordinates where each character of the character string should be located in the document converted into the image coordinate system in units of characters; Extracting a feature amount of a character string in image data taken from the document in which the coordinates of each character are corrected, and comparing the extracted feature amount with a feature amount that the character string should have in the dictionary stored in the dictionary And a step of determining whether or not the difference between the matched feature amounts falls within the allowable range stored in the dictionary.

【００１８】[0018]

【発明の実施の形態】以下、添付図面を参照して、本発
明の実施の形態について説明する。この実施の形態で
は、帳票上の文字列を読み取り、読み取った文字列をも
とに帳票を分類する帳票分類装置に適用した場合を例と
して説明する。Embodiments of the present invention will be described below with reference to the accompanying drawings. In this embodiment, an example will be described in which a character string on a form is read and the form is applied to a form classification apparatus that classifies the form based on the read character string.

【００１９】図１は、この実施の形態にかかる帳票分類
装置の機能構成を示す機能ブロック図である。図示する
ように、この帳票分類装置は、画像入力装置１と、記憶
装置２と、データ処理装置３と、出力装置４とを備え
る。FIG. 1 is a functional block diagram showing a functional configuration of a form classification device according to this embodiment. As shown in the figure, the form classification device includes an image input device 1, a storage device 2, a data processing device 3, and an output device 4.

【００２０】画像入力装置１は、イメージスキャナ等に
よって構成され、帳票の画像を読み取って、多階調を有
する２次元のディジタル画像として取り込む。画像入力
装置１による帳票の画像の取り込みに際して、±５°程
度の傾きは許容されるものとする。また、この実施の形
態において、画像入力装置１は、少なくとも帳票中の帳
票名が記載されている領域の画像を読み取ることができ
ればよいものとする。The image input device 1 is constituted by an image scanner or the like, reads an image of a form, and takes it in as a two-dimensional digital image having multiple gradations. When the image of the form is captured by the image input device 1, a tilt of about ± 5 ° is allowed. Further, in this embodiment, it is assumed that the image input device 1 only needs to be able to read at least the image of the area in the form where the form name is described.

【００２１】記憶装置２は、半導体メモリ或いは磁気デ
ィスク等によって構成され、画像記憶部２１と、辞書記
憶部２２とを含む。画像記憶部２１は、画像入力装置１
によって帳票から読み取られたディジタル画像を記憶す
る。画像記憶部２１に記憶されるディジタル画像は、画
像の左上隅を原点とし、原点からそれぞれ水平方向及び
垂直方向にとられたｘ軸とｙ軸とからなる座標系（以
下、画像座標系という）によって各画素の位置が参照さ
れる。The storage device 2 is constituted by a semiconductor memory or a magnetic disk or the like, and includes an image storage unit 21 and a dictionary storage unit 22. The image storage unit 21 stores the image input device 1
The digital image read from the form is stored. The digital image stored in the image storage unit 21 has an origin at the upper left corner of the image, and a coordinate system (hereinafter, referred to as an image coordinate system) including an x-axis and a y-axis taken in the horizontal and vertical directions from the origin, respectively. Refers to the position of each pixel.

【００２２】辞書記憶部２２は、文字列の照合用のデー
タを記憶するもので、座標記憶域２２ａと、照合用特徴
記憶域２２ｂと、分類記憶域２２ｃとがある。座標記憶
域２２ａには、照合用文字列の帳票上での座標が記憶さ
れている。ここで、照合用文字列の座標は、帳票の左上
隅を原点とし、上辺をｕ軸、左辺をｖ軸とする座標系
（以下、帳票座標系という）によって表されている。The dictionary storage section 22 stores data for character string collation, and includes a coordinate storage area 22a, a collation characteristic storage area 22b, and a classification storage area 22c. The coordinate storage area 22a stores the coordinates of the collation character string on the form. Here, the coordinates of the collation character string are represented by a coordinate system in which the upper left corner of the form is the origin, the upper side is the u axis, and the left side is the v axis (hereinafter referred to as a form coordinate system).

【００２３】照合用特徴記憶域２２ｂには、照合用とな
る文字列の各文字図形領域を格子状の部分画像に分割
し、各部分画像の階調値を平均化した特徴量が文字毎に
記憶されている。分類記憶域２２ｃには、分類対象とな
る帳票を識別するための識別情報が記憶されている。な
お、座標記憶域２２ａに記憶されている照合用文字列の
座標、照合用特徴記憶域２２ｂに記憶されている特徴量
及び分類記憶域２２ｃに記憶されている識別情報は、例
えば、ポインタによって関連付けられて記憶されてい
る。In the collation feature storage area 22b, each character / graphic region of the collation character string is divided into lattice-like partial images, and the feature amount obtained by averaging the gradation values of each partial image is stored for each character. It is remembered. The classification storage area 22c stores identification information for identifying a form to be classified. The coordinates of the collation character string stored in the coordinate storage area 22a, the feature amounts stored in the collation characteristic storage area 22b, and the identification information stored in the classification storage area 22c are associated, for example, by a pointer. It is stored and remembered.

【００２４】データ処理装置３は、プログラムを記憶す
るメモリと、プログラム制御されるＣＰＵ（Central Pr
ocessing Unit）とから構成され、ＣＰＵがプログラム
を実行することによって実現される機能として帳票位置
検出手段３１、座標変換手段３２、文字列座標値補正手
段３３、文字座標値補正手段３４及び図形照合手段３５
を含む。The data processing device 3 includes a memory for storing a program and a CPU (Central Pr
and a function realized by the CPU executing the program, such as a form position detecting means 31, a coordinate converting means 32, a character string coordinate value correcting means 33, a character coordinate value correcting means 34, and a figure collating means. 35
including.

【００２５】帳票位置検出手段３１は、画像記憶部２１
からディジタル画像を読み出し、読み出した画像中の帳
票の位置を検出する。座標変換手段３２は、帳票位置検
出手段３１が検出した帳票の位置に関する情報を受け取
り、受け取った帳票の位置に関する情報に従って、座標
記憶域２２ａに記憶されている照合用文字列の帳票上で
の座標系を帳票座標系から画像座標系に変換する。The form position detecting means 31 is provided in the image storage unit 21.
, And the position of the form in the read image is detected. The coordinate conversion means 32 receives the information on the position of the form detected by the form position detection means 31, and according to the received information on the position of the form, the coordinates on the form of the collation character string stored in the coordinate storage area 22a. The system is converted from the form coordinate system to the image coordinate system.

【００２６】文字列座標値補正手段３３は、主として印
刷ずれなどの文字列の位置のずれを補正するもので、座
標変換手段３２で座標を変換した照合用文字列の座標値
と画像入力手段で読み取った帳票上の文字列の座標値の
ずれを、画素投影プロファイルを用いて文字列単位で補
正する。The character string coordinate value correcting means 33 mainly corrects the positional deviation of the character string such as printing deviation, and the coordinate value of the collating character string whose coordinates have been converted by the coordinate converting means 32 and the image input means. The deviation of the coordinate value of the character string on the read form is corrected for each character string using the pixel projection profile.

【００２７】文字座標値補正手段３４は、主として帳票
の反り等によって生じる個々の文字毎の位置のずれを補
正するもので、画素投影プロファイルを用いて座標値の
ずれをさらに文字毎に詳細に補正する。The character coordinate value correcting means 34 corrects the positional deviation of each character mainly caused by the warpage of the form or the like, and further corrects the positional deviation of the coordinate value for each character by using a pixel projection profile. I do.

【００２８】図形照合手段３５は、文字列座標値補正手
段３３及び文字座標値補正手段３４で座標値が補正され
た文字列を部分画像に切り出して特徴量を抽出し、抽出
した特徴量を照合用特徴記憶域２２ｂに記憶されている
濃淡特徴量と照合する。図形照合手段３５は、照合結果
を出力装置４に出力する。なお、データ処理装置３に含
まれる各手段３１〜３５の機能については、さらに詳し
く後述する。The figure collating means 35 cuts out the character string whose coordinate values have been corrected by the character string coordinate value correcting means 33 and the character coordinate value correcting means 34 into a partial image, extracts feature amounts, and compares the extracted feature amounts. Is compared with the light and dark feature stored in the feature storage area 22b. The graphic matching unit 35 outputs the matching result to the output device 4. The functions of the units 31 to 35 included in the data processing device 3 will be described later in more detail.

【００２９】出力装置４は、例えば、表示装置などによ
って構成され、図形照合手段３５の照合結果に基づいて
分類記憶域２２ｃに記憶されている帳票の分類を示す識
別コードを出力する。出力装置４は、また、画像入力装
置１で読み取られた帳票が図形照合手段３５の照合結果
により予め記憶されている分類に属する帳票でないと判
断されるときは、その旨を出力する。The output device 4 is composed of, for example, a display device, and outputs an identification code indicating the classification of the form stored in the classification storage area 22c based on the result of the comparison by the graphic matching means 35. When the form read by the image input apparatus 1 is determined to be not a form belonging to a class stored in advance based on the collation result of the graphic collating unit 35, the output device 4 outputs a statement to that effect.

【００３０】以下、データ処理装置３において実現され
ている各機能について、詳しく説明する。Hereinafter, each function realized in the data processing device 3 will be described in detail.

【００３１】帳票位置検出手段３１は、画像記憶部２１
に記憶されている帳票の画像を読み出し、まず、ｘ軸方
向に上端から下端に向かって画像を順に走査し、走査線
上で階調が大きく変化する点を検出する。このような走
査で階調が大きく変化する点を２点以上検出し、検出し
た点群を回帰分析して、帳票の上辺の直線Ｌ１を求め
る。The form position detecting means 31 is provided in the image storage unit 21.
Is read out, and the image is first scanned in order from the upper end to the lower end in the x-axis direction to detect a point on the scanning line at which the gradation greatly changes. Two or more points at which the gradation greatly changes by such scanning are detected, and the detected point group is subjected to regression analysis to obtain a straight line L1 on the upper side of the form.

【００３２】帳票位置検出手段３１は、次に、ｙ軸方向
に左端から右端に向かって画像を順に走査し、走査線上
で階調が大きく変化する点を検出する。このような走査
で階調が大きく変化する点を２点以上検出し、検出した
点群を回帰分析して、帳票の左辺の直線Ｌ２を求める。
帳票位置検出手段３１は、求めた上辺と左辺との直線Ｌ
１、Ｌ２の交点（ｘ０，ｙ０）を求め、帳票の左上隅の
点の位置として検出する。Next, the form position detecting means 31 sequentially scans the image in the y-axis direction from the left end to the right end, and detects a point on the scanning line at which the gradation greatly changes. Two or more points where the gradation changes greatly by such scanning are detected, and the detected point group is subjected to regression analysis to obtain a straight line L2 on the left side of the form.
The form position detecting means 31 calculates a straight line L between the obtained upper side and the left side.
1. The intersection (x0, y0) of L2 is determined and detected as the position of the point at the upper left corner of the form.

【００３３】帳票位置検出手段３１は、さらに、直線Ｌ
１とｘ軸がなす角の大きさ、または直線Ｌ２のｙ軸がな
す角の大きさ、もしくはこれらの平均値を、帳票の回転
角θ０として求める。The form position detecting means 31 further includes a line L
The size of the angle formed by 1 and the x-axis, the size of the angle formed by the y-axis of the straight line L2, or the average thereof is determined as the rotation angle θ0 of the form.

【００３４】座標変換手段３２は、辞書記憶部２２の座
標記憶域２２ａに記憶されている照合用文字列の各文字
の帳票座標系による座標値を読み出し、読み出した帳票
座標系による座標値を、数式１の計算を行って画像座標
系の座標値に変換する。The coordinate conversion means 32 reads the coordinate value of each character of the collation character string stored in the coordinate storage area 22a of the dictionary storage unit 22 in the form coordinate system, and converts the read coordinate value in the form coordinate system into Equation 1 is calculated and converted into coordinate values in the image coordinate system.

【００３５】[0035]

【数１】ｘ＝ｕｃｏｓθ０＋ｖｓｉｎθ０＋ｘ０ｙ＝−ｕｓｉｎθ０＋ｖｃｏｓθ０＋ｙ０但し、ｕ、ｖは、帳票座標系での座標値、ｘ、ｙは、画
像座標系での座標値である。X = ucosθ0 + vsinθ0 + x0 y = −usinθ0 + vcosθ0 + y0 where u and v are coordinate values in the form coordinate system, and x and y are coordinate values in the image coordinate system.

【００３６】文字列座標値補正手段３３は、座標記憶部
２２ａに記憶されている文字列がＩこの文字からなる場
合、文字列中のｉ番目の文字の外接矩形の左上隅の座標
変換後の座標値を（ｘｓ（ｉ），ｙｓ（ｉ））、高さと
幅をそれぞれＨ（ｉ）、Ｗ（ｉ）とする。また、画像中
に含まれる文字の左上隅の座標を（ｘｓ２（ｉ），ｙｓ
２（ｉ））、高さと幅をそれぞれＨ２（ｉ）、Ｗ２
（ｉ）とする。When the character string stored in the coordinate storage unit 22a is composed of I characters, the character string coordinate value correcting means 33 converts the coordinate of the upper left corner of the circumscribed rectangle of the i-th character in the character string after the coordinate conversion. The coordinate values are (xs (i), ys (i)), and the height and width are H (i) and W (i), respectively. Also, the coordinates of the upper left corner of the character included in the image are represented by (xs2 (i), ys
2 (i)), the height and width are H2 (i) and W2, respectively.
(I).

【００３７】ここで、文字列全体に一様に生じているず
れ成文をΔｘ、Δｙ、ΔＨとし、残りのずれ成文をｄｘ
（ｉ）、ｄｙ（ｉ）、ｄＨ（ｉ）、ｄＷ（ｉ）とする
と、ｘｓ２（ｉ）、ｙｓ２（ｉ）、Ｈ２（ｉ）、Ｗ２
（ｉ）は、それぞれ数式２によって表すことができる。Here, the offset sentences uniformly occurring in the entire character string are represented by Δx, Δy, ΔH, and the remaining offset sentences are dx
Assuming that (i), dy (i), dH (i), dW (i), xs2 (i), ys2 (i), H2 (i), W2
(I) can be represented by Equation 2, respectively.

【００３８】[0038]

【数２】ｘｓ２（ｉ）＝ｘｓ（ｉ）＋Δｘ＋ｄｘ（ｉ）ｙｓ２（ｉ）＝ｙｓ（ｉ）＋Δｙ＋ｄｙ（ｉ）Ｈ２（ｉ）＝Ｈ（ｉ）＋ΔＨ＋ｄＨ（ｉ）Ｗ２（ｉ）＝Ｗ（ｉ）＋ｄＷ（ｉ）Xs2 (i) = xs (i) + Δx + dx (i) ys2 (i) = ys (i) + Δy + dy (i) H2 (i) = H (i) + ΔH + dH (i) W2 (i) = W ( i) + dW (i)

【００３９】文字列座標値補正手段３３は、画像上でｘ
ｓ（ｉ）、ｙｓ（ｉ）、Ｈ（ｉ）、Ｗ（ｉ）の４つのデ
ータで表される文字の外接矩形を周囲にＬだけ拡大した
矩形領域Ｂ（ｉ）を求める。文字列座標値補正手段３３
は、各文字の上端のｙ座標値であるｙｓ（ｉ）が一致す
るようにｙ座標を揃え、この領域Ｂ（ｉ）内においてｘ
軸方向に画素投影（画素毎の階調値を加算することをい
う。以下、同じ）を行って、Ｉ個の文字それぞれの水平
方向画素投影プロファイルＰＹ（ｉ，ｙ）を生成する。The character string coordinate value correcting means 33 outputs x
A rectangular area B (i) is obtained by enlarging a circumscribed rectangle of a character represented by four data of s (i), ys (i), H (i) and W (i) by L around the character. Character string coordinate value correction means 33
Aligns the y-coordinates so that ys (i), which is the y-coordinate value at the upper end of each character, matches, and x in this area B (i)
By performing pixel projection in the axial direction (to add a gradation value for each pixel; the same applies hereinafter), a horizontal pixel projection profile PY (i, y) of each of the I characters is generated.

【００４０】文字列座標値補正手段３３は、次に、文字
列中のＩ個の文字すべての水平方向画素投影プロファイ
ルＰＹ（ｉ，ｙ）について、ｙ座標毎に加算し、文字列
を単位とする水平方向画素投影プロファイルＰＹＡ
（ｙ）を求める。Next, the character string coordinate value correcting means 33 adds the horizontal pixel projection profiles PY (i, y) of all I characters in the character string for each y coordinate, and sets the character string as a unit. Horizontal pixel projection profile PYA
(Y) is obtained.

【００４１】文字列座標値補正手段３３は、こうして求
めた水平方向画素投影プロファイルＰＹＡ（ｙ）から、
下記の数式３を用いて、Δｙ＝［−ＴＹ，＋ＴＹ］、Δ
Ｈ＝［−ＴＨ，＋ＴＨ］の範囲で評価値ＶＹを最小化す
るΔｙとΔＨの値の組を見つける。The character string coordinate value correcting means 33 calculates the horizontal pixel projection profile PYA (y)
Using Equation 3 below, Δy = [− TY, + TY], Δ
Find a set of values of Δy and ΔH that minimizes the evaluation value VY in the range of H = [− TH, + TH].

【数３】ＶＹ＝ＰＹＡ（Δｙ）＋ＰＹＡ（Δｙ＋Ｈ＋ΔＨ）但し、ＴＹは、文字列中の文字の高さの平均値の２分の
１程度の値、ＴＨは微少な値であり、予め設定されてい
る。VY = PYA (Δy) + PYA (Δy + H + ΔH) where TY is a value that is about half the average value of the heights of the characters in the character string, and TH is a minute value and is set in advance. ing.

【００４２】文字列座標値補正手段３３は、次に、ｙ軸
方向にも領域Ｂ（ｉ）内の画素を投影し、各文字の左端
のｘ座標値であるｘｓ（ｉ）が一致するようにｘ座標を
揃え、Ｉ個の文字それぞれの垂直方向画素投影プロファ
イルＰＸ（ｉ，ｘ）を生成する。文字列座標値補正手段
３３は、次に、文字列中のＩ個の文字すべての垂直方向
画素投影プロファイルＰＸ（ｉ，ｘ）について、ｘ座標
毎に加算し、文字列を単位とする垂直方向画素投影プロ
ファイルＰＸＡ（ｘ）を求める。Next, the character string coordinate value correcting means 33 projects pixels in the area B (i) also in the y-axis direction so that xs (i), which is the x coordinate value at the left end of each character, matches. , And generate a vertical pixel projection profile PX (i, x) for each of the I characters. Next, the character string coordinate value correcting unit 33 adds the vertical pixel projection profiles PX (i, x) of all the I characters in the character string for each x coordinate, and calculates the vertical direction using the character string as a unit. A pixel projection profile PXA (x) is obtained.

【００４３】文字列座標値補正手段３３は、こうして求
めた垂直方向画素投影プロファイルＰＸＡ（ｘ）から、
下記の数式４を用いて、Δｘ＝［−ＴＸ，＋ＴＸ］の範
囲で評価値ＶＸを最大とするΔｘを見つける。The character string coordinate value correcting means 33 calculates the vertical pixel projection profile PXA (x)
Using Expression 4 below, find Δx that maximizes the evaluation value VX in the range of Δx = [− TX, + TX].

【数４】ＶＸ＝ＰＸＡ（Δｘ）−ＰＸＡ（Δｘ−１）但し、ＴＸは、文字列中の文字の幅の２分の１程度の値
である。VX = PXA ([Delta] x) -PXA ([Delta] x-1) where TX is a value that is about half the width of a character in a character string.

【００４４】文字列座標値補正手段３３は、以上のよう
にして求めたΔｘ、Δｙ、ΔＨの値を用いて、例えば、
印刷ずれなどによる帳票上の文字列のずれを補正する。The character string coordinate value correcting means 33 uses the values of Δx, Δy, and ΔH obtained as described above, for example,
A character string shift on a form due to a print shift or the like is corrected.

【００４５】文字座標値補正手段３４は、文字列中のＩ
個の文字のそれぞれについて、順次次のような処理を行
う。なお、以下の説明では、処理対象となる文字が文字
列中のｉ番目のものであるとする。The character coordinate value correcting means 34 calculates the I
The following processing is sequentially performed for each of the characters. In the following description, it is assumed that the character to be processed is the i-th character in the character string.

【００４６】文字座標値補正手段３４は、文字単位の水
平方向画素投影プロファイルＰＹ（ｉ，ｙ）から、下記
の数式５を用いて、ｄｙ（ｉ）＝［−ＴＹ２，＋ＴＹ
２］、ｄＨ（ｉ）＝［−ＴＨ２，＋ＴＨ２］の範囲でＶ
Ｙ２を最小化するｄｙ（ｉ）とｄＨ（ｉ）の組を見つけ
る。The character coordinate value correcting means 34 calculates dy (i) = [− TY2, + TY from the horizontal pixel projection profile PY (i, y) for each character using the following equation (5).
2], dH (i) = V in the range of [−TH2, + TH2].
Find a set of dy (i) and dH (i) that minimizes Y2.

【００４７】[0047]

【数５】ＶＹ２＝ＰＹ（ｉ，Δｙ＋ｄｙ（ｉ））＋ＰＹ
（ｉ，Δｙ＋Ｈ（ｉ）＋ΔＨ＋ｄｙ（ｉ）＋ｄＨ
（ｉ））但し、ＴＹ２及びＴＨ２はそれぞれ、予め与えれた値で
あり、例えば、ＴＹ＝５、ＴＨ＝５に設定されている。VY2 = PY (i, Δy + dy (i)) + PY
(I, Δy + H (i) + ΔH + dy (i) + dH
(I)) However, TY2 and TH2 are values given in advance, for example, TY = 5 and TH = 5.

【００４８】文字座標値補正手段３４は、文字単位の垂
直方向画素投影プロファイルＰＸ（ｉ，ｙ）から、下記
の数式６を用いて、ｄｘ（ｉ）＝［−ＴＸ２，＋ＴＸ
２］、ｄＷ（ｉ）＝［−ＴＷ２，＋ＴＷ２］の範囲でＶ
Ｘ２を最小化するｄｘ（ｉ）とｄＷ（ｉ）の組を見つけ
る。The character coordinate value correcting means 34 calculates dx (i) = [− TX2, + TX from the vertical pixel projection profile PX (i, y) of each character using the following equation (6).
2], dW (i) = V in the range of [−TW2, + TW2].
Find the set of dx (i) and dW (i) that minimizes X2.

【００４９】[0049]

【数６】ＶＸ２＝ＰＸ（ｉ，Δｘ＋ｄｘ（ｉ））＋ＰＸ
（ｉ，Δｘ＋ｄｘ（ｉ）＋Ｗ（ｉ）＋ｄＷ（ｉ））但し、ＴＸ２及びＴＷ２はそれぞれ、予め与えれた値で
あり、例えば、ＴＸ＝５、ＴＷ＝５に設定されている。VX2 = PX (i, Δx + dx (i)) + PX
(I, Δx + dx (i) + W (i) + dW (i)) where TX2 and TW2 are predetermined values, for example, TX = 5 and TW = 5.

【００５０】図形照合手段３５は、文字列座標値補正手
段３３及び文字座標値補正手段３４によって座標値が補
正された画像中の文字列の各文字の外接矩形領域に従っ
て、各文字を切り出す。図形照合手段３５は、切り出し
た各文字を部分画像に分割し、各文字の特徴量を抽出す
る。The figure collating means 35 cuts out each character according to the circumscribed rectangular area of each character of the character string in the image whose coordinate value has been corrected by the character string coordinate value correcting means 33 and the character coordinate value correcting means 34. The graphic matching unit 35 divides each cut-out character into partial images and extracts a feature amount of each character.

【００５１】図形照合手段３５は、さらに、抽出した各
文字の特徴量を照合用特徴量記憶域２２ｂに記憶されて
いる各文字の照合用特徴量と、数式７に従って照合し、
それぞれの類似度ｓ（ｉ）を求める。The graphic collating means 35 further collates the extracted characteristic amount of each character with the collating characteristic amount of each character stored in the collating characteristic amount storage area 22b according to the following equation (7).
Each similarity s (i) is obtained.

【数７】ｓ（ｉ）＝（ＶｅｃＸ（ｉ）・ＶｅｃＹ
（ｉ））／｜ＶｅｃＸ（ｉ）｜｜ＶｅｃＹ（ｉ）｜S (i) = (VecX (i) · VecY
(I)) / | VecX (i) || VecY (i) |

【００５２】図形照合手段３５は、さらに、求めた類似
度ｓ（ｉ）の平均値ｓｍを求め、これを照合結果として
出力装置４に出力する。The figure matching means 35 further calculates an average value sm of the obtained similarities s (i), and outputs this to the output device 4 as a result of the comparison.

【００５３】以下、この実施の形態にかかる帳票分類装
置の動作について、図２〜図４のフローチャートを参照
して説明する。Hereinafter, the operation of the form classification device according to this embodiment will be described with reference to the flowcharts of FIGS.

【００５４】なお、動作についての理解を容易にするた
めに、具体例として図５〜図７を挙げて説明する。ここ
では、分類対象となる帳票には、「出」、「金」、
「伝」、「票」の４文字からなる文字列が記載されてい
るものとし、標準的な帳票（照合用帳票）での帳票座標
系の座標が座標記憶域２２ａに記憶されているものとす
る。各文字の座標は、外接矩形の左上隅の座標値が（ｕ
ｓ（ｉ），ｖｓ（ｉ））、高さがＨ（ｉ）、幅がＷ
（ｉ）である（但し、ｉ＝１，２，３，４）。また、こ
れらの文字の特徴量が照合用特徴記憶域２２ｂに記憶さ
れているものとする。In order to facilitate understanding of the operation, a specific example will be described with reference to FIGS. Here, the forms to be classified include “out”, “gold”,
It is assumed that a character string composed of four characters of “den” and “vote” is described, and that coordinates of a form coordinate system in a standard form (a form for collation) are stored in the coordinate storage area 22a. I do. For the coordinates of each character, the coordinate value of the upper left corner of the circumscribed rectangle is (u
s (i), vs (i)), height H (i), width W
(I) (where i = 1, 2, 3, 4). In addition, it is assumed that the feature amounts of these characters are stored in the matching feature storage area 22b.

【００５５】図２のフローチャートの処理では、最初
に、画像入力装置１は、階調と有する２次元のディジタ
ル画像として帳票の画像を取り込む。画像入力装置１
は、取り込んだディジタル画像を記憶装置２の画像記憶
部２１に記憶させる（ステップＳ１）。この画像記憶部
２１に記憶されている画像を、図５に示す。In the processing of the flowchart of FIG. 2, first, the image input device 1 takes in a form image as a two-dimensional digital image having a gradation. Image input device 1
Causes the captured digital image to be stored in the image storage unit 21 of the storage device 2 (step S1). The image stored in the image storage unit 21 is shown in FIG.

【００５６】画像の入力が完了すると、次に、帳票位置
検出手段３１は、画像記憶部２１に記憶されている画像
を読み出し、読み出した画像中の帳票の上辺と左辺とを
検出する。帳票位置検出手段３１は、検出した帳票の上
辺と左辺とから、帳票の左上隅の座標値（ｘ０，ｙ０）
及び傾き角θ０を求める（ステップＳ２）。求められた
座標値（ｘ０，ｙ０）及び傾き角θ０は、座標変換手段
３２に供給される。When the input of the image is completed, the form position detecting means 31 reads the image stored in the image storage section 21 and detects the upper side and the left side of the form in the read image. The form position detection means 31 calculates the coordinate value (x0, y0) of the upper left corner of the form from the detected top and left sides of the form.
And the inclination angle θ0 (step S2). The obtained coordinate value (x0, y0) and the inclination angle θ0 are supplied to the coordinate conversion means 32.

【００５７】座標変換手段３２は、辞書記憶部２２の座
標記憶域２２ａに記憶されている文字列の各文字の外接
矩形の座標に関する情報を読み出す（ステップＳ３）。
座標変換手段３２は、読み出した情報の内、各文字の外
接矩形の左上隅の座標値（ｕｓ（ｉ），ｖｓ（ｉ））
を、上記した数式１に従って画像座標系の座標値（ｘｓ
（Ｉ），ｙｓ（ｉ））に変換する（ステップＳ４）。図
５に、これらの外接矩形を、一点鎖線で示す。この画像
座標系に座標変換された外接矩形の座標は、文字列補正
手段３３に供給される。The coordinate conversion means 32 reads information on the coordinates of the circumscribed rectangle of each character of the character string stored in the coordinate storage area 22a of the dictionary storage unit 22 (step S3).
The coordinate conversion means 32 calculates the coordinate value (us (i), vs (i)) of the upper left corner of the circumscribed rectangle of each character in the read information.
Is calculated according to the above equation (1) using the coordinate values (xs
(I), ys (i)) (step S4). FIG. 5 shows these circumscribed rectangles by alternate long and short dash lines. The coordinates of the circumscribed rectangle converted to the image coordinate system are supplied to the character string correction unit 33.

【００５８】次に、文字列座標値補正手段３３は、文字
列単位の位置合わせ（文字列単位での座標のずれの補
正）の処理を行う（ステップＳ５）。図３は、ステップ
Ｓ５の文字列単位の位置合わせの処理を詳細に示すフロ
ーチャートである。Next, the character string coordinate value correcting means 33 performs a process of position adjustment in character string units (correction of coordinate shift in character string units) (step S5). FIG. 3 is a flowchart showing the details of the alignment process in units of character strings in step S5.

【００５９】文字列座標値補正手段３３は、まず、座標
変換手段３２によって座標変換した照合用文字列の外接
矩形を、上下左右にそれぞれＬだけ拡大した部分領域Ｂ
（ｉ）を求める。これらの部分領域Ｂ（ｉ）を、図５に
破線で示す。First, the character string coordinate value correcting means 33 expands the circumscribed rectangle of the collation character string coordinate-converted by the coordinate converting means 32 by L in the upper, lower, left and right directions, respectively.
Find (i). These partial areas B (i) are shown by broken lines in FIG.

【００６０】文字列座標値補正手段３３は、これらの部
分領域Ｂ（ｉ）のそれぞれについて、ｘ軸方向に画素投
影し、文字毎の水平方向画素投影プロファイルＰＹ
（ｉ）を生成する（ステップＳ５０１）。ここで、
「出」、「金」、「伝」、「票」の各文字の部分領域Ｂ
（ｉ）を図６（ａ）〜（ｄ）に、それぞれに対応する水
平方向画素投影プロファイルを図６（ｅ）〜（ｈ）に示
す。The character string coordinate value correcting means 33 projects pixels in the x-axis direction for each of these partial areas B (i), and outputs a horizontal pixel projection profile PY for each character.
(I) is generated (step S501). here,
Partial area B of each character of "de", "gold", "den", "vote"
FIGS. 6A to 6D show (i), and FIGS. 6E to 6H show the corresponding horizontal pixel projection profiles.

【００６１】文字列座標値補正手段３３は、次に、ステ
ップＳ５０１で求めた文字毎の水平方向画素投影プロフ
ァイルＰＹ（ｉ、ｙ）を、各文字の上端のｙ座標値が一
致するように重ね合わせ、文字列を単位とする水平方向
画素投影プロファイルＰＹＡ（ｙ）を生成する（ステッ
プＳ５０２）。この文字列を単位とする水平方向画素投
影プロファイルＰＹＡ（ｙ）の例を、図６（ｉ）に示
す。Next, the character string coordinate value correcting means 33 superimposes the horizontal pixel projection profile PY (i, y) for each character obtained in step S501 so that the y coordinate value at the upper end of each character matches. In addition, a horizontal pixel projection profile PYA (y) using a character string as a unit is generated (step S502). FIG. 6I shows an example of the horizontal pixel projection profile PYA (y) using this character string as a unit.

【００６２】次に、文字列座標値補正手段３３は、上記
の数式３で求められる評価値ＶＹが取り得ない程大きい
値であるＭＡＸＩＮＴで、変数ＶＹｍｉｎを初期化する
（ステップＳ５０３）。文字列座標値補正手段３３は、
さらに、Δｙ２を定数−ＴＹで初期化し（ステップＳ５
０４）、ΔＨを定数−ＴＨで初期化する（ステップＳ５
０５）。Next, the character string coordinate value correcting means 33 initializes the variable VYmin with MAXINT, which is a value that is too large to take the evaluation value VY obtained by the above equation (step S503). The character string coordinate value correcting means 33
Further, Δy2 is initialized by a constant −TY (step S5).
04), ΔH is initialized by a constant −TH (step S5)
05).

【００６３】文字列座標値補正手段３３は、上記の数式
３の演算を行って、評価値ＶＹを算出する（ステップＳ
５０６）。文字列座標値補正手段３３は、次に、ステッ
プＳ５０６で算出した評価値ＶＹが変数ＶＹｍｉｎより
も小さいかどうかを判別する（ステップＳ５０７）。The character string coordinate value correcting means 33 calculates the evaluation value VY by performing the operation of the above equation (step S).
506). Next, the character string coordinate value correcting unit 33 determines whether the evaluation value VY calculated in step S506 is smaller than the variable VYmin (step S507).

【００６４】ステップＳ５０７で評価値ＶＹが変数ＶＹ
ｍｉｎよりも小さくないと判別されたときは、文字列座
標値補正手段３３は、そのままステップＳ５０９の処理
に進む。ステップＳ５０７で評価値ＶＹが変数ＶＹｍｉ
ｎよりも小さいと判別されたときは、文字列座標値補正
手段３３は、評価値ＶＹの値を変数ＶＹｍｉｎに、Δｙ
２の値をΔｙに、ΔＨ２の値をΔＨにそれぞれ代入する
（ステップＳ５０８）。そして、文字列座標値補正手段
３３は、ステップＳ５０９の処理に進む。In step S507, the evaluation value VY is set to the variable VY.
When it is determined that it is not smaller than min, the character string coordinate value correcting means 33 proceeds to the processing of step S509 as it is. In step S507, the evaluation value VY is set to the variable VYmi.
When it is determined that the value is smaller than n, the character string coordinate value correcting unit 33 sets the value of the evaluation value VY to the variable VYmin,
The value of ΔH is substituted for ΔH2, and the value of ΔH2 is substituted for ΔH (step S508). Then, the character string coordinate value correcting unit 33 proceeds to the process of step S509.

【００６５】ステップＳ５０９では、文字列座標値補正
手段３３は、ΔＨ２の値を“１”だけインクリメントす
る。そして、文字列座標値補正手段３３は、ΔＨ２の値
が予め設定されている定数ＴＨよりも大きいかどうかを
判別する（ステップＳ５０９）。In step S509, the character string coordinate value correcting means 33 increments the value of ΔH2 by “1”. Then, the character string coordinate value correcting means 33 determines whether or not the value of ΔH2 is larger than a preset constant TH (step S509).

【００６６】ステップＳ５０９でΔＨ２の値が定数ＴＨ
よりも大きくないと判別されたときは、文字列座標値補
正手段３３は、再びステップＳ５０６の処理を実行す
る。In step S509, the value of ΔH2 is a constant TH
When it is determined that it is not larger than the above, the character string coordinate value correcting means 33 executes the processing of step S506 again.

【００６７】ステップＳ５０９でΔＨ２の値が定数ＴＨ
よりも大きいと判別されたときは、文字列座標値補正手
段３３は、Δｙ２の値を“１”だけインクリメントする
（ステップＳ５１１）。そして、文字列座標値補正手段
３３は、Δｙ２の値が定数ＴＹよりも大きいかどうかを
判別する（ステップＳ５１２）。In step S509, the value of ΔH2 is a constant TH
If it is determined that the value is larger than the value, the character string coordinate value correcting unit 33 increments the value of Δy2 by “1” (step S511). Then, the character string coordinate value correcting means 33 determines whether the value of Δy2 is larger than the constant TY (step S512).

【００６８】ステップＳ５１２でΔｙ２の値が定数ＴＹ
よりも大きくないと判別されたときは、文字列座標値補
正手段３３は、再びステップＳ５０５の処理を実行す
る。ステップＳ５１２でΔｙ２の値が定数ＴＹよりも大
きいと判別されたときは、文字列座標値補正手段３３
は、各部分領域Ｂ（ｉ）においてｙ座標の値をΔｙから
（Δｙ＋Ｈ＋ΔＨ）の範囲に限定して、ｙ軸方向に画素
投影し、文字毎の垂直方向画素投影プロファイルＰＸ
（ｉ，ｘ）を生成する（ステップＳ５１３）。ここで、
「出」、「金」、「伝」、「票」の各文字の部分領域Ｂ
（ｉ）を図７（ａ）〜（ｄ）に、それぞれに対応する垂
直方向画素投影プロファイルを図７（ｅ）〜（ｈ）に示
す。In step S512, the value of Δy2 is a constant TY
When it is determined that it is not larger than the above, the character string coordinate value correcting means 33 executes the processing of step S505 again. If it is determined in step S512 that the value of Δy2 is larger than the constant TY, the character string coordinate value correcting unit 33
Restricts the value of the y coordinate in each partial area B (i) to a range from Δy to (Δy + H + ΔH), and performs pixel projection in the y-axis direction, and a vertical pixel projection profile PX for each character.
(I, x) is generated (step S513). here,
Partial area B of each character of "de", "gold", "den", "vote"
FIGS. 7A to 7D show (i), and FIGS. 7E to 7H show the corresponding vertical pixel projection profiles.

【００６９】文字列座標値補正手段３３は、次に、ステ
ップＳ５１３で求めた文字毎の水平方向画素投影プロフ
ァイルＰＹ（ｉ、ｙ）を、各文字の左端のｘ座標値が一
致するように重ね合わせ、文字列を単位とする垂直方向
画素投影プロファイルＰＸＡ（ｙ）を生成する（ステッ
プＳ５１４）。この文字列を単位とする垂直方向画素投
影プロファイルＰＸＡ（ｘ）の例を、図７（ｉ）に示
す。Next, the character string coordinate value correcting means 33 superimposes the horizontal pixel projection profile PY (i, y) for each character obtained in step S513 so that the x coordinate value at the left end of each character matches. In addition, a vertical pixel projection profile PXA (y) using a character string as a unit is generated (step S514). FIG. 7I shows an example of the vertical pixel projection profile PXA (x) using the character string as a unit.

【００７０】次に、文字列座標値補正手段３３は、変数
ＶＸｍａｘを値“０”で初期化し（ステップＳ５１
５）、さらにΔｘ２を定数−ＴＸで初期化する（ステッ
プＳ５１６）。Next, the character string coordinate value correcting means 33 initializes the variable VXmax with the value "0" (step S51).
5) Then, Δx2 is initialized with a constant −TX (step S516).

【００７１】文字列座標値補正手段３３は、上記の数式
４の演算を行って、評価値ＶＸを算出する（ステップＳ
５１７）。文字列座標値補正手段３３は、次に、ステッ
プＳ５１７で算出した評価値ＶＸが変数ＶＸｍａｘより
も大きいかどうかを判別する（ステップＳ５１８）。The character string coordinate value correcting means 33 calculates the evaluation value VX by performing the operation of the above equation (4) (step S).
517). Next, the character string coordinate value correcting unit 33 determines whether the evaluation value VX calculated in step S517 is larger than the variable VXmax (step S518).

【００７２】ステップＳ５１８で評価値ＶＸが変数ＶＸ
ｍａｘよりも大きくないと判別されたときは、文字列座
標値補正手段３３は、そのままステップＳ５２０の処理
に進む。ステップＳ５１８で評価値ＶＸが変数ＶＸｍａ
ｘの値よりも大きいと判別されたときは、文字列座標値
補正手段３３は、評価値ＶＸの値を変数ＶＸｍａｘに、
Δｘ２の値をΔｘにそれぞれ代入する（ステップＳ５１
９）。そして、文字列座標値補正手段３３は、ステップ
Ｓ５２０の処理に進む。At step S518, the evaluation value VX is
If it is determined that the value is not larger than max, the character string coordinate value correcting unit 33 proceeds to the process of step S520. In step S518, the evaluation value VX is set to the variable VXma.
When it is determined that the value is larger than the value of x, the character string coordinate value correcting unit 33 sets the value of the evaluation value VX to a variable VXmax,
Substitute the value of Δx2 into Δx (Step S51)
9). Then, the character string coordinate value correcting unit 33 proceeds to the process of step S520.

【００７３】ステップＳ５２０では、文字列座標値補正
手段３３は、Δｘ２の値を“１”だけインクリメントす
る。そして、文字列座標値補正手段３３は、Δｘ２の値
が定数ＴＸよりも大きいかどうかを判別する（ステップ
Ｓ５２１）。In step S520, the character string coordinate value correcting means 33 increments the value of Δx2 by “1”. Then, the character string coordinate value correcting means 33 determines whether the value of Δx2 is larger than the constant TX (step S521).

【００７４】ステップＳ５２１でΔｘ２の値が定数ＴＸ
よりも大きくないと判別されたときは、文字列座標値補
正手段３３は、再びステップＳ５１７の処理を実行す
る。ステップＳ５２１でΔｘ２の値が定数ＴＸよりも大
きいと判別されたときは、文字列座標値補正手段３３
は、このフローチャートの処理、すなわち図２のステッ
プＳ５の処理を終了する。In step S521, the value of Δx2 is a constant TX
When it is determined that it is not larger than the above, the character string coordinate value correcting means 33 executes the processing of step S517 again. If it is determined in step S521 that the value of Δx2 is larger than the constant TX, the character string coordinate value correcting unit 33
Ends the processing of this flowchart, that is, the processing of step S5 in FIG.

【００７５】文字列座標値補正手段３３が、ステップＳ
５の処理（図３のフローチャートの処理）を終了する
と、次に、文字座標値補正手段３４が、文字単位位置あ
わせ（文字単位での座標のずれの補正）の処理を行う
（ステップＳ６）。The character string coordinate value correcting means 33 determines in step S
When the process of Step 5 (the process of the flowchart of FIG. 3) is completed, the character coordinate value correcting unit 34 performs a process of character unit alignment (correction of coordinate shift in character units) (Step S6).

【００７６】図４は、ステップＳ６の文字単位の位置合
わせの処理を詳細に示すフローチャートである。このフ
ローチャートの処理は、各文字について実行されるが、
すべて同様の処理によって実行されるため、ここでは、
１文字についてのみの処理内容を説明する。FIG. 4 is a flow chart showing the details of the alignment process for each character in step S6. The process of this flowchart is executed for each character,
Since everything is performed by the same process,
The processing content for only one character will be described.

【００７７】文字座標値補正手段３４は、まず、上記の
数式５で求められる評価値ＶＹ２が取り得ない程大きい
値であるＭＡＸＩＮＴで、変数ＶＹ２ｍｉｎを初期化す
る（ステップＳ６０１）。文字座標値補正手段３４は、
さらに、ｄｙ２を定数−ＴＹ２で初期化し（ステップＳ
６０２）、ｄＨ２を定数−ＴＨ２で初期化する（ステッ
プＳ６０３）。First, the character coordinate value correcting means 34 initializes the variable VY2min with MAXINT, which is a value that is too large for the evaluation value VY2 obtained by the above equation (5) (step S601). The character coordinate value correction means 34
Further, dy2 is initialized by a constant -TY2 (step S
602), dH2 is initialized with a constant -TH2 (step S603).

【００７８】文字座標値補正手段３４は、上記の数式５
の演算を行って、評価値ＶＹ２を算出する（ステップＳ
６０４）。文字座標値補正手段３４は、次に、ステップ
Ｓ６０４で算出した評価値ＶＹ２が変数ＶＹ２ｍｉｎよ
りも小さいかどうかを判別する（ステップＳ６０５）。The character coordinate value correcting means 34 calculates the above equation (5).
To calculate the evaluation value VY2 (step S
604). Next, the character coordinate value correcting unit 34 determines whether the evaluation value VY2 calculated in step S604 is smaller than the variable VY2min (step S605).

【００７９】ステップＳ６０５で評価値ＶＹ２が変数Ｖ
Ｙ２ｍｉｎよりも小さくないと判別されたときは、文字
座標値補正手段３４は、そのままステップＳ６０７の処
理に進む。ステップＳ６０５で評価値ＶＹ２が変数ＶＹ
２ｍｉｎよりも小さいと判別されたときは、文字座標値
補正手段３４は、評価値ＶＹ２の値を変数ＶＹ２ｍｉｎ
に、ｄｙ２の値をｄｙ（ｉ）に、ｄＨ２の値をｄＨ
（ｉ）にそれぞれ代入する（ステップＳ６０６）。そし
て、文字座標値補正手段３４は、ステップＳ６０７の処
理に進む。In step S605, the evaluation value VY2 is
If it is determined that it is not smaller than Y2min, the character coordinate value correcting means 34 proceeds to the processing of step S607 as it is. In step S605, the evaluation value VY2 is set to the variable VY.
When it is determined that the value is smaller than 2 min, the character coordinate value correcting unit 34 sets the value of the evaluation value VY2 to the variable VY2min.
, The value of dy2 is dy (i), and the value of dH2 is dH
(I) are substituted (step S606). Then, the character coordinate value correcting unit 34 proceeds to the process of step S607.

【００８０】ステップＳ６０７では、文字座標値補正手
段３４は、ｄＨ２の値を“１”だけインクリメントす
る。そして、文字座標値補正手段３４は、ｄＨ２の値が
予め設定されている定数ＴＨ２よりも大きいかどうかを
判別する（ステップＳ６０８）。In step S607, the character coordinate value correcting means 34 increments the value of dH2 by "1". Then, the character coordinate value correction unit 34 determines whether the value of dH2 is larger than a preset constant TH2 (step S608).

【００８１】ステップＳ６０８でｄＨ２の値が定数ＴＨ
２よりも大きくないと判別されたときは、文字座標値補
正手段３４は、再びステップＳ６０４の処理を実行す
る。In step S608, the value of dH2 is a constant TH
If it is determined that it is not larger than 2, the character coordinate value correcting means 34 executes the processing of step S604 again.

【００８２】ステップＳ６０８でｄＨ２の値が定数ＴＨ
２よりも大きいと判別されたときは、文字座標値補正手
段３４は、ｄｙ２の値を“１”だけインクリメントする
（ステップＳ６０９）。そして、文字座標値補正手段３
４は、ｄｙ２の値が定数ＴＹ２よりも大きいかどうかを
判別する（ステップＳ６１０）。In step S608, the value of dH2 is a constant TH
When it is determined that the value is larger than 2, the character coordinate value correcting unit 34 increments the value of dy2 by “1” (step S609). Then, the character coordinate value correcting means 3
No. 4 determines whether the value of dy2 is larger than the constant TY2 (step S610).

【００８３】ステップＳ６１０でｄｙ２の値が定数ＴＹ
２よりも大きくないと判別されたときは、文字座標値補
正手段３４は、再びステップＳ６０２の処理を実行す
る。In step S610, the value of dy2 is a constant TY
When it is determined that it is not larger than 2, the character coordinate value correcting means 34 executes the processing of step S602 again.

【００８４】ステップＳ６１０でｄｙ２の値が定数ＴＹ
２よりも大きいと判別されたときは、文字座標値補正手
段３４は、上記の数式６で求められる評価値ＶＸ２が取
り得ない程大きい値であるＭＡＸＩＮＴで、変数ＶＸ２
ｍｉｎを初期化する（ステップＳ６１１）。文字座標値
補正手段３４は、さらに、ｄｘ２を定数−ＴＸ２で初期
化し（ステップＳ６１２）、ｄＷ２を定数−ＴＷ２で初
期化する（ステップＳ６１３）。In step S610, the value of dy2 is a constant TY
If it is determined that the evaluation value VX2 is larger than 2, the character coordinate value correction unit 34 sets the variable VX2
min is initialized (step S611). The character coordinate value correcting unit 34 further initializes dx2 with a constant -TX2 (step S612), and initializes dW2 with a constant -TW2 (step S613).

【００８５】文字座標値補正手段３４は、上記の数式６
の演算を行って、評価値ＶＸ２を算出する（ステップＳ
６１４）。文字座標値補正手段３４は、次に、ステップ
Ｓ６１４で算出した評価値ＶＸ２が変数ＶＸ２ｍｉｎよ
りも小さいかどうかを判別する（ステップＳ６１５）。The character coordinate value correcting means 34 calculates the above equation (6).
To calculate the evaluation value VX2 (step S
614). Next, the character coordinate value correcting unit 34 determines whether or not the evaluation value VX2 calculated in step S614 is smaller than the variable VX2min (step S615).

【００８６】ステップＳ６１５で評価値ＶＸ２が変数Ｖ
Ｘ２ｍｉｎよりも小さくないと判別されたときは、文字
座標値補正手段３４は、そのままステップＳ６１７の処
理に進む。ステップＳ６１５で評価値ＶＸ２が変数ＹＸ
２ｍｉｎよりも小さいと判別されたときは、文字座標値
補正手段３４は、評価値ＶＸ２の値を変数ＶＸ２ｍｉｎ
に、ｄｘ２の値をｄｘ（ｉ）に、ｄＷ２の値をｄＷ
（ｉ）にそれぞれ代入する（ステップＳ６１６）。そし
て、文字座標値補正手段３４は、ステップＳ６１７の処
理に進む。In step S615, the evaluation value VX2 is
If it is determined that it is not smaller than X2min, the character coordinate value correcting means 34 proceeds to the processing of step S617 as it is. In step S615, the evaluation value VX2 is set to the variable YX.
When it is determined that the value is smaller than 2 min, the character coordinate value correcting unit 34 sets the value of the evaluation value VX2 to the variable VX2min.
, The value of dx2 to dx (i) and the value of dW2 to dW
(I) is assigned to each (step S616). Then, the character coordinate value correcting unit 34 proceeds to the process of step S617.

【００８７】ステップＳ６１７では、文字座標値補正手
段３４は、ｄＷ２の値を“１”だけインクリメントす
る。そして、文字座標値補正手段３４は、ｄＷ２の値が
予め設定されている定数ＴＷ２よりも大きいかどうかを
判別する（ステップＳ６１８）。In step S617, the character coordinate value correcting means 34 increments the value of dW2 by "1". Then, the character coordinate value correcting means 34 determines whether or not the value of dW2 is larger than a preset constant TW2 (step S618).

【００８８】ステップＳ６１８でｄＷ２の値が定数ＴＷ
２よりも大きくないと判別されたときは、文字座標値補
正手段３４は、再びステップＳ６０４の処理を実行す
る。In step S618, the value of dW2 is a constant TW
If it is determined that it is not larger than 2, the character coordinate value correcting means 34 executes the processing of step S604 again.

【００８９】ステップＳ６１８でｄＷ２の値が定数ＴＷ
２よりも大きいと判別されたときは、文字座標値補正手
段３４は、ｄｘ２の値を“１”だけインクリメントする
（ステップＳ６１９）。そして、文字座標値補正手段３
４は、ｄｘ２の値が定数ＴＸ２よりも大きいかどうかを
判別する（ステップＳ６２０）。In step S618, the value of dW2 is a constant TW
If it is determined that the value is greater than 2, the character coordinate value correcting means 34 increments the value of dx2 by "1" (step S619). Then, the character coordinate value correcting means 3
No. 4 determines whether the value of dx2 is larger than the constant TX2 (step S620).

【００９０】ステップＳ６２０でｄｘ２の値が定数ＴＸ
２よりも大きくないと判別されたときは、文字座標値補
正手段３４は、再びステップＳ６１３の処理を実行す
る。In step S620, the value of dx2 is a constant TX
When it is determined that it is not larger than 2, the character coordinate value correcting means 34 executes the processing of step S613 again.

【００９１】ステップＳ６２０でｄｘ２の値が定数ＴＸ
２よりも大きいと判別されたときは、画像記憶部２１に
記憶された画像中の文字列の各文字の外接矩形に補正を
加えた座標（Δｘ＋ｄｘ（ｉ），Δｙ＋ｄｙ（ｉ））、
Ｈ（ｉ）＋ΔＨ＋ｄＨ（ｉ）、Ｗ（ｉ）＋ｄＷ（ｉ）を
算出し、図形照合手段３５に供給する。そして、文字座
標値補正手段３４は、このフローチャートの処理、すな
わち図２のステップＳ６の処理を終了する。In step S620, the value of dx2 is a constant TX
When it is determined that it is larger than 2, coordinates (Δx + dx (i), Δy + dy (i)) obtained by correcting the circumscribed rectangle of each character of the character string in the image stored in the image storage unit 21;
H (i) + ΔH + dH (i) and W (i) + dW (i) are calculated and supplied to the graphic matching means 35. Then, the character coordinate value correcting means 34 ends the processing of this flowchart, that is, the processing of step S6 in FIG.

【００９２】文字座標値補正手段３４が、ステップＳ６
の処理（図４のフローチャートの処理）を終了すると、
次に、図形照合手段３５は、ステップＳ５、Ｓ６で補正
した座標に従って「出」、「金」、「伝」、「票」の各
文字から部分画像を切り出す（ステップＳ７）。The character coordinate value correcting means 34 determines in step S6
Is completed (the processing of the flowchart in FIG. 4),
Next, the graphic matching means 35 cuts out a partial image from each of the characters "out", "gold", "den", and "vote" according to the coordinates corrected in steps S5 and S6 (step S7).

【００９３】図形照合手段３５は、切り出した部分画像
から各文字の特徴量を抽出し、辞書記憶部２２の照合用
特徴量記憶域２２ｂに格納されている照合用特徴量と照
合する。そして、照合の結果得られた各文字についての
類似度の平均値ｓｍを算出する（ステップＳ８）。算出
された類似度の平均値ｓｍは、出力装置４に供給され
る。The graphic matching means 35 extracts the characteristic amount of each character from the cut-out partial image and compares it with the matching characteristic amount stored in the matching characteristic amount storage area 22b of the dictionary storage unit 22. Then, the average value sm of the similarity of each character obtained as a result of the comparison is calculated (step S8). The calculated average value sm of the similarities is supplied to the output device 4.

【００９４】そして、出力装置４は、図形照合手段３５
から供給された類似度の平均値ｓｍに従って、辞書記憶
部２２の分類記憶域２２ｃに記憶されている「出金伝
票」の帳票を識別するための識別情報を出力する（ステ
ップＳ９）。以上で、帳票の分類が終了し、このフロー
チャートの処理を終了する。The output device 4 is provided with the graphic matching means 35.
According to the average value sm of the similarities supplied from, the identification information for identifying the form of the “payment slip” stored in the classification storage area 22c of the dictionary storage unit 22 is output (step S9). Thus, the classification of the form is completed, and the processing of this flowchart ends.

【００９５】以上説明したように、この実施の形態の帳
票分類装置では、画像入力装置１から取り込まれ、画像
記憶部２１に記憶されている画像に含まれる文字列の位
置ずれが、まず、文字列座標値補正手段３３によって文
字列全体で大まかに補正され、さらに文字座標値補正手
段３４によって文字単位で細かく補正される。このた
め、照合用文字列の座標との位置ずれを高速に補正する
ことが可能となる。As described above, in the form classifying apparatus of this embodiment, the position shift of the character string included in the image fetched from the image input device 1 and stored in the image storage unit 21 is first determined by the character The entire character string is roughly corrected by the column coordinate value correcting means 33, and further finely corrected in character units by the character coordinate value correcting means. For this reason, it is possible to quickly correct the positional deviation from the coordinates of the collation character string.

【００９６】また、画像入力装置１から多階調の画像が
取り込まれ、各文字を多階調で表現した特徴量に従って
文字の照合が行われる。このため、画像中の各画素を２
値化する場合のように、文字部分を背景部分として誤っ
たり、背景部分を文字部分として誤ったりすることがな
い。このため、精度の高い文字照合が可能となる。Further, a multi-tone image is fetched from the image input device 1, and characters are collated in accordance with a feature amount expressing each character in multi-tone. Therefore, each pixel in the image is 2
As in the case of value conversion, a character portion is not mistaken as a background portion, and a background portion is not mistaken as a character portion. For this reason, highly accurate character matching becomes possible.

【００９７】そして、この実施の形態の帳票分類装置
は、上記のような文字照合を利用することによって、高
速かつ高精度な帳票の分類が可能となる。The form classification apparatus of this embodiment can perform high-speed and high-accuracy classification of a form by utilizing the above-described character matching.

【００９８】本発明は、上記の実施の形態で説明したも
のに限られず、様々な変形、応用が可能である。以下、
本発明において適用可能な上記の実施の形態の変形態様
について、説明する。The present invention is not limited to those described in the above embodiments, and various modifications and applications are possible. Less than,
Modifications of the above embodiment applicable to the present invention will be described.

【００９９】上記の実施の形態では、１文字以上からな
る文字列の照合に本発明を適用した場合について説明し
た。この場合の「文字」の具体例としては、「出」、
「金」、「伝」、「票」という４つの漢字を挙げた。し
かしながら、本発明において照合対象とする「文字」と
しては、漢字の他、平仮名、片仮名、アルファベット等
の自然言語を記述するためのあらゆる種類の文字や、さ
らには、数学記号や星形等の図形などを含めて適用する
ことができる。In the above embodiment, the case where the present invention is applied to the collation of a character string including one or more characters has been described. Specific examples of "character" in this case include "out",
He listed four kanji: “gold”, “den”, and “vote”. However, the "characters" to be collated in the present invention include, in addition to kanji, all kinds of characters for describing natural languages such as hiragana, katakana, and the alphabet, and further, figures such as mathematical symbols and star shapes. It can be applied including the above.

【０１００】上記の実施の形態では、横書きの文字列の
場合についての処理を説明したが、本発明は、縦書きの
文字列の場合であっても同様に適用することができる。
この場合は、垂直方向画素投影プロファイルと水平方向
画素投影プロファイルとの適用が、上記の実施の形態の
場合と逆になる。In the above embodiment, the processing for a horizontally written character string has been described. However, the present invention can be similarly applied to a vertically written character string.
In this case, the application of the vertical pixel projection profile and the horizontal pixel projection profile is opposite to that in the above embodiment.

【０１０１】上記の実施の形態では、画像入力装置１に
よって帳票から階調を有する２次元ディジタル画像を取
り込み、画像記憶部２１に記憶させていた。しかしなが
ら、処理対象となる画像は、例えば、ディジタルスチル
カメラで撮影し、ディジタルスチルカメラ内のメモリか
ら画像記憶部２１に転送してもよい。In the above embodiment, a two-dimensional digital image having gradations is fetched from a form by the image input device 1 and stored in the image storage unit 21. However, the image to be processed may be captured by a digital still camera, for example, and transferred from the memory in the digital still camera to the image storage unit 21.

【０１０２】上記の実施の形態では、帳票分類装置は棄
却判定の機能を有していないが、この機能を有するもの
としてもよい。図８は、棄却判定の機能を有する帳票分
類装置の機能構成を示す機能ブロック図である。この帳
票分類装置は、図１の帳票分類装置とほぼ同じである
が、データ処理装置５において文字列座標値補正手段５
１が第１の実施の形態のものと異なる。In the above embodiment, the form classifying apparatus does not have the function of rejection determination, but may have this function. FIG. 8 is a functional block diagram illustrating a functional configuration of a form classification device having a rejection determination function. This form classification device is substantially the same as the form classification device of FIG.
1 is different from that of the first embodiment.

【０１０３】文字列座標値補正手段５１は、棄却判定部
５１ａを含む。棄却判定部５１ａは、生成された画素投
影プロファイルの値の変動によって画像中に文字列が含
まれるかどうかを判定する。文字列座標値補正手段５１
は、文字列が含まれないと判定したときは、文字列単位
での座標値の補正、文字単位での座標値の補正及び特徴
量の照合といった以後の処理を中止させる。The character string coordinate value correcting means 51 includes a rejection determination section 51a. The rejection determination unit 51a determines whether a character string is included in the image due to a change in the value of the generated pixel projection profile. Character string coordinate value correcting means 51
When it is determined that the character string is not included, the subsequent processes such as the correction of the coordinate value in the unit of the character string, the correction of the coordinate value in the unit of the character, and the comparison of the feature amount are stopped.

【０１０４】上記の実施の形態では、データ処理装置３
は、メモリとＣＰＵとによって構成され、ＣＰＵがメモ
リに記憶されたプログラムを実行することによって各手
段３１〜３５の機能が実現されていた。これに対し、図
９に示すように、データ処理装置７が実行するプログラ
ムをＣＤ−ＲＯＭ６ａ等のコンピュータ読み取り可能な
記録媒体に格納して配布し、ディスクドライブ６を駆動
してＣＤ−ＲＯＭ６ａに格納されたプログラムをデータ
処理装置７のメモリに順次読み込ませ、ＣＰＵが実行す
るようにしてもよい。In the above embodiment, the data processing device 3
Is configured by a memory and a CPU, and the functions of the units 31 to 35 are realized by the CPU executing a program stored in the memory. On the other hand, as shown in FIG. 9, the program executed by the data processing device 7 is stored in a computer-readable recording medium such as a CD-ROM 6a and distributed, and the disk drive 6 is driven to store the program in the CD-ROM 6a. The read programs may be sequentially read into the memory of the data processing device 7 and executed by the CPU.

【０１０５】上記の実施の形態では、本発明を帳票分類
装置に適用した場合について説明した。しかしながら、
本発明は、帳票以外の文書の分類にも適用することがで
きる。また、本発明は、文書の分類以外にも、例えば、
光学式文字読み取り装置（ＯＣＲ：Optical Character
Reader）や有価証券等の印刷文書の真贋を判定する真贋
判定装置にも適用することができる。In the above embodiment, the case where the present invention is applied to the form classification device has been described. However,
The present invention can be applied to classification of documents other than forms. Also, the present invention, besides the classification of documents, for example,
Optical character reader (OCR: Optical Character)
Reader) or an authentication device for determining the authenticity of a printed document such as securities.

【０１０６】図１０は、本発明の実施の形態の変形にか
かるＯＣＲの機能構成を示す機能ブロック図である。こ
のＯＣＲでは、記憶装置８が図１の帳票分類装置の記憶
装置２と異なり、辞書記憶部８１が分類記憶域２２ｃの
代わりに、コード情報記憶域８１ａを有する。また、出
力装置９が図１の帳票分類装置のものと異なる。FIG. 10 is a functional block diagram showing a functional configuration of an OCR according to a modification of the embodiment of the present invention. In this OCR, the storage device 8 differs from the storage device 2 of the form classification device in FIG. 1, and the dictionary storage unit 81 has a code information storage area 81a instead of the classification storage area 22c. The output device 9 is different from that of the form classification device of FIG.

【０１０７】コード情報記憶域８１ａには、文字に対応
するコード情報が記憶されている。また、図形照合手段
３５は、類似度を、平均値ではなく、それぞれの文字に
ついてのものを出力する。そして、出力装置９は、コー
ド情報記憶域８１ａに記憶されている各文字のコード情
報を出力する。The code information storage area 81a stores code information corresponding to characters. Further, the graphic matching means 35 outputs the similarity not for the average value but for each character. Then, the output device 9 outputs the code information of each character stored in the code information storage area 81a.

【０１０８】このような構成により、文字列の書かれた
文書の画像を読み取って、これらの文字列中の各文字を
文字コードに変換することを、高速かつ正確に行うこと
ができる。With such a configuration, it is possible to read an image of a document on which a character string is written and convert each character in the character string into a character code at high speed and accurately.

【０１０９】図１１は、本発明の実施の形態の変形にか
かる真贋判定装置の機能構成を示す機能ブロック図であ
る。この真贋判定装置は、記憶装置１０が図１の帳票分
類装置の記憶装置と異なり、辞書記憶部１０１が分類記
憶域２２ｃの代わりに許容値記憶域１０１ａを有する。
また、出力装置１１が警報手段１１ａを有する点で、図
１の帳票分類装置の出力装置４と異なる。FIG. 11 is a functional block diagram showing a functional configuration of an authentication apparatus according to a modification of the embodiment of the present invention. In this authentication apparatus, the storage device 10 is different from the storage device of the form classification device in FIG. 1, and the dictionary storage unit 101 has an allowable value storage area 101a instead of the classification storage area 22c.
The output device 11 is different from the output device 4 of the form classification device in FIG. 1 in that the output device 11 has an alarm unit 11a.

【０１１０】許容値記憶域１０１ａは、画像中の文字列
の特徴量と照合用文字列の特徴量との差の許容値を記憶
する。また、出力装置１１は、図形照合手段３５から供
給された類似度が許容値記憶域１０１ａに記憶された許
容値以上である場合に、警報手段１１ａから警報を発す
る。The allowable value storage area 101a stores an allowable value of the difference between the characteristic amount of the character string in the image and the characteristic amount of the collation character string. Further, the output device 11 issues an alarm from the alarm unit 11a when the similarity supplied from the graphic matching unit 35 is equal to or larger than the allowable value stored in the allowable value storage area 101a.

【０１１１】このような構成により、所定の文字列が記
載されている有価証券などの文書の真贋の判定を、高速
かつ正確に行うことができる。With this configuration, the authenticity of a document such as a security document in which a predetermined character string is described can be quickly and accurately determined.

【０１１２】[0112]

【発明の効果】以上説明したように、本発明によれば、
文字列の位置ずれを文字列単位で大まかに補正し、さら
に、文字単位で細かく補正するので、文字列の位置ずれ
の補正を高速に行うことができる。また、多階調の画像
データから抽出した特徴量で文字列の照合が可能となる
ので、高精度の照合が可能となる。このような文字列照
合を利用することにより、文書の分類、文字読み取り、
或いは有価証券などの真贋の判定を高速、かつ正確に行
うことができる。As described above, according to the present invention,
Since the positional deviation of the character string is roughly corrected on a character-string basis and further finely corrected on a character-by-character basis, the positional deviation of the character string can be corrected at a high speed. In addition, since character strings can be collated with feature amounts extracted from multi-gradation image data, highly accurate collation can be performed. By utilizing such character string matching, document classification, character reading,
Alternatively, the authenticity of a security or the like can be determined quickly and accurately.

[Brief description of the drawings]

【図１】本発明の実施の形態にかかる帳票分類装置の機
能構成を示す機能ブロック図である。FIG. 1 is a functional block diagram illustrating a functional configuration of a form classification device according to an embodiment of the present invention.

【図２】本発明の実施の形態にかかる動作を示すフロー
チャートである。FIG. 2 is a flowchart showing an operation according to the embodiment of the present invention.

【図３】本発明の実施の形態にかかる動作を示すフロー
チャートである。FIG. 3 is a flowchart showing an operation according to the embodiment of the present invention.

【図４】本発明の実施の形態にかかる動作を示すフロー
チャートである。FIG. 4 is a flowchart showing an operation according to the embodiment of the present invention.

【図５】帳票の具体例を示す図である。FIG. 5 is a diagram showing a specific example of a form.

【図６】（ａ）〜（ｉ）は、本発明の実施の形態にかか
る動作の具体例を示す図である。FIGS. 6A to 6I are diagrams showing specific examples of the operation according to the embodiment of the present invention.

【図７】（ａ）〜（ｉ）は、本発明の実施の形態にかか
る動作の具体例を示す図である。FIGS. 7A to 7I are diagrams illustrating a specific example of an operation according to the embodiment of the present invention;

【図８】本発明の実施の形態の変形にかかる帳票分類装
置の機能構成を示す機能ブロック図である。FIG. 8 is a functional block diagram showing a functional configuration of a form classification device according to a modification of the embodiment of the present invention.

【図９】本発明の実施の形態の変形にかかる帳票分類装
置の機能構成を示す機能ブロック図である。FIG. 9 is a functional block diagram showing a functional configuration of a form classification device according to a modification of the embodiment of the present invention.

【図１０】本発明の実施の形態の変形にかかる光学式文
字読み取り装置の機能構成を示す機能ブロック図であ
る。FIG. 10 is a functional block diagram showing a functional configuration of an optical character reader according to a modification of the embodiment of the present invention.

【図１１】本発明の実施の形態の変形にかかる真贋判定
装置の機能構成を示す機能ブロック図である。FIG. 11 is a functional block diagram showing a functional configuration of an authentication apparatus according to a modification of the embodiment of the present invention.

[Explanation of symbols]

１画像入力装置２記憶装置３データ処理装置４出力装置５データ処理装置６ディスクドライブ６ａＣＤ−ＲＯＭ７データ処理装置８記憶装置９出力装置１０記憶装置１１出力装置１１ａ警報手段２１画像記憶部２２辞書記憶部２２ａ座標記憶域２２ｂ照合用特徴記憶域２２ｃ分類記憶域３１帳票位置検出手段３２座標変換手段３３文字列座標値補正手段３４文字座標値補正手段３５図形照合手段５１文字列座標値補正手段５１ａ棄却判定部８１辞書記憶部８１ａコード情報記憶域１０１辞書記憶部１０１ａ許容値記憶域 Reference Signs List 1 image input device 2 storage device 3 data processing device 4 output device 5 data processing device 6 disk drive 6a CD-ROM 7 data processing device 8 storage device 9 output device 10 storage device 11 output device 11a alarm means 21 image storage unit 22 dictionary Storage unit 22a Coordinate storage area 22b Matching feature storage area 22c Classification storage area 31 Form position detection means 32 Coordinate conversion means 33 Text string coordinate value correction means 34 Text coordinate value correction means 35 Figure matching means 51 Text string coordinate value correction means 51a Rejection determination unit 81 Dictionary storage unit 81a Code information storage area 101 Dictionary storage unit 101a Allowable value storage area

Claims

[Claims]

An image storage means for storing multi-tone image data taken from a document of a predetermined format including a character string, coordinates at which each character of the character string should be located in the document, and the character string A dictionary storage unit that stores a feature amount to be included in the document and identification information for identifying the format of the document including the character string in association with each other; and the document in the image data stored in the image storage unit. A position detecting means for detecting a position of the character, and a coordinate system of coordinates where each character in a character string should be located in the document according to a detection result of the position detecting means, from the document coordinate system based on the document to the image. A coordinate conversion unit for converting the image data into an image coordinate system based on the data capture state; and the coordinates of the entire character string included in the image data stored in the image storage unit and the coordinate system converted by the coordinate conversion unit. documents Character string coordinate correcting means for correcting a deviation from coordinates at which a character string should be located in a character string unit; and each of the characters included in the character string whose coordinates have been corrected in character string units by the character string coordinate correcting means. Character coordinate correcting means for correcting, in character units, a deviation between the coordinates of a character and the coordinates where each character of the character string whose coordinate system has been converted by the coordinate converting means, and A graphic for extracting a characteristic amount of a character string in image data taken from the document whose coordinates have been corrected, and comparing the extracted characteristic amount with a characteristic amount to be included in the character string stored in the dictionary storage unit. A document classification apparatus comprising: a matching unit; and an output unit that outputs identification information stored in the dictionary storage unit according to a result of the matching by the graphic matching unit.

2. An image storage means for storing multi-tone image data taken from a document of a predetermined format including a character string, coordinates at which each character of the character string should be located in the document, and the character string A dictionary storage unit for storing a feature amount to be provided and a character code corresponding to the character string in association with each other; and a position detection for detecting a position of the document in image data stored in the image storage unit. Means, according to the detection result of the position detection means, based on the state of capturing the image data from the document coordinate system based on the document, the coordinate system of the coordinates where each character in the character string in the document should be located A coordinate conversion means for converting the image data into a converted image coordinate system, coordinates of the entire character string included in the image data stored in the image storage means, and a position of the character string in the document whose coordinate system has been converted by the coordinate conversion means. Everything A character string coordinate correcting means for correcting a deviation from a coordinate in a character string unit; and a coordinate of each character included in the character string whose coordinates are corrected in a character string by the character string coordinate correcting means, and the coordinate conversion. Character coordinate correcting means for correcting, in character units, a deviation from coordinates at which each character of the character string whose coordinate system has been converted by the means, and the document in which the coordinates of each character have been corrected by the character coordinate correcting means A graphic matching unit that extracts a feature amount of a character string in image data taken in from the storage device and compares the extracted feature amount with a feature amount that the character string should have in the dictionary storage unit; Output means for outputting a character code stored in the dictionary storage means in accordance with the result of the comparison by the means.

3. An image storage means for storing multi-tone image data taken from a document in a predetermined format including a character string; coordinates at which each character of the character string should be located in the document; A dictionary storage unit that stores a feature amount to be provided and an allowable range of the feature amount in association with each other; a position detection unit that detects a position of the document in image data stored in the image storage unit; According to the detection result of the position detecting means, the coordinate system of the coordinates where each character in the character string should be located in the document is an image based on the state of capturing the image data from the document coordinate system based on the document. Coordinate conversion means for converting to a coordinate system; coordinates of the entire character string included in the image data stored in the image storage means; and a character string to be located in the document whose coordinate system has been converted by the coordinate conversion means. Without coordinates A character string coordinate correcting means for correcting this in a character string unit, and a coordinate system in which the coordinates of each character included in the character string whose coordinates are corrected in units of the character string by the character string coordinate correcting means and the coordinate conversion means are used. Character coordinate correcting means for correcting, in character units, a deviation from coordinates at which each character of the converted character string is to be located; and the character coordinates corrected by the character coordinate correcting means are fetched from the document. A graphic matching unit that extracts a feature amount of a character string in image data, and compares the extracted feature amount with a feature amount that the character string should have, stored in the dictionary storage unit; An authenticity judging device, comprising: authenticity judging means for judging whether or not the difference in the feature amount falls within a range of an allowable range stored in the dictionary storage means.

4. A method of associating coordinates of each character of a character string in a document, a feature value of the character string, and identification information for identifying a format of a document including the character string. A method of classifying a document in a computer system having a stored dictionary, the method comprising: storing in a memory multi-gradation image data captured from a document in a predetermined format including a character string; Detecting a position of the document in data; and a document coordinate system based on the document, the coordinate system of coordinates where each character in a character string should be located in the document according to the detected position of the document. Converting the image data into an image coordinate system based on the state of capturing the image data, and the coordinates of the entire character string included in the image data stored in the memory and the image data before being converted to the image coordinate system. Correcting the deviation from the coordinates where the character string should be located in the written document in units of character strings; and converting the coordinates of each character included in the character string whose coordinates have been corrected in units of character strings into the image coordinate system. Correcting the deviation from the coordinates where each character of the character string should be located in the document on a character-by-character basis; and the characteristic amount of the character string in the image data imported from the document in which the coordinates of each character have been corrected Extracting the extracted feature value with the feature value to be possessed by the character string stored in the dictionary, and outputting the identification information stored in the dictionary according to the matching result. A document classification method comprising:

5. A computer system having a dictionary in which coordinates at which each character of a character string is to be located in a document, feature values to be included in the character string, and a character code corresponding to the character string are stored in association with each other. Storing a multi-gradation image data captured from a document in a predetermined format including a character string in a memory; and a position of the document in the image data stored in the memory. Detecting the position of the image data from the document coordinate system based on the document based on the coordinate system of the coordinates where each character in the character string should be located in the document according to the detected position of the document. Converting to the original image coordinate system; and coordinates of the entire character string included in the image data stored in the memory and character strings in the document converted to the image coordinate system. Correcting the deviation from the coordinates to be placed in units of character strings; and the coordinates of each character included in the character strings corrected in coordinates in units of character strings and the character strings in the document converted into the image coordinate system. Correcting the deviation from the coordinates where each character should be located in units of characters, and extracting the feature amount of the character string in the image data taken from the document in which the coordinates of each character have been corrected, and extracting the extracted features. Collating an amount with a characteristic amount to be possessed by the character string stored in the dictionary; and outputting a character code stored in the dictionary according to the collation result. Character reading method.

6. A computer system having a dictionary in which coordinates at which each character of a character string is to be located in the document, a feature value to be included in the character string, and an allowable range of the feature value are stored in association with each other. A method for determining the authenticity of a document, comprising the steps of: storing in a memory multi-tone image data captured from a document in a predetermined format including a character string; and storing the document in the image data stored in the memory. Detecting the position of the document, and, according to the detected position of the document, a document coordinate based on the document based on a coordinate system of coordinates where each character in the character string should be located in the document stored in the memory in advance. From the system to an image coordinate system based on the state of capture of the image data, and the coordinates of the entire character string included in the image data stored in the memory and the image coordinate system. Correcting the deviation from the coordinates where the character string should be located in the document in units of a character string; and converting the coordinates of each character included in the character string whose coordinates have been corrected in the unit of character string into an image coordinate system. Correcting the deviation from the coordinates where each character of the character string should be located in the written document, on a character-by-character basis; and characterizing the character string in the image data captured from the document in which the coordinates of each character have been corrected Extracting a quantity, and comparing the extracted feature quantity with a feature quantity that should be included in the character string stored in the dictionary; and comparing the difference in the matched feature quantity to a range of an allowable range stored in the dictionary. Determining whether or not there is an authenticity determination method.