JP6024118B2

JP6024118B2 - Character reader and character recognition method

Info

Publication number: JP6024118B2
Application number: JP2012034000A
Authority: JP
Inventors: 泉小坂
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2012-02-20
Filing date: 2012-02-20
Publication date: 2016-11-09
Anticipated expiration: 2032-02-20
Also published as: JP2013171384A

Description

本発明は、文字読取装置及び文字認識方法に関する。 The present invention relates to a character reader and a character recognition method.

金融機関における振込依頼等に対する業務においては、顧客が予め指定された帳票に記入した情報を文字読取装置により文字認識することで、業務の効率化が図られている。 In a business for a transfer request or the like in a financial institution, the efficiency of the business is achieved by recognizing information written in a form designated by a customer in advance by a character reader.

このとき認識精度（読取精度）の向上等の目的で、特許文献１には、帳票毎に顧客の取引履歴を保持しておき、帳票単位で該取引履歴を検索して該文字認識の補正を行う項目群補正部と、帳票内の関連する項目どうしを対にして該項目の辞書を検索して該文字認識の補正を行う二項目補正部と、帳票の個々の項目毎に該項目と対応する辞書を検索して該文字認識の補正を行う項目補正部と、項目毎にその項目属性に基づく関連辞書を検索して該文字認識の補正を行う関連項目補正部とを備えた文字認識処理装置が開示されている。 At this time, for the purpose of improving recognition accuracy (reading accuracy), Patent Document 1 stores customer transaction history for each form, searches the transaction history for each form, and corrects the character recognition. Corresponding to the item group correction unit to be performed, the two item correction unit for correcting the character recognition by searching the dictionary of the item by pairing the related items in the form, and corresponding to each item of the form Character recognition processing comprising: an item correction unit that searches for a dictionary to perform correction of the character recognition; and a related item correction unit that searches for a related dictionary based on the item attribute for each item and corrects the character recognition An apparatus is disclosed.

また、特許文献２には、認識対象と同類のテキストに出現する単語や表現の文字列の並び方の性質を予め数値化し、候補文字の確からしさの順位付けを行うことにより、誤切出しや誤認識によって欠落した正解文字を補正する文字認識装置が開示されている。 Further, Patent Document 2 discloses that the characteristics of the arrangement of words and expressions that appear in the same type of text as the recognition target are digitized in advance, and the probability of candidate characters is ranked. Discloses a character recognition device that corrects missing correct characters.

さらに、特許文献３には、認識結果を予め登録されている単語と比較して、これらが一致している場合には認識結果が正しいとし、一致していない場合には認識結果が正しくないとして削除することにより精度の高い文字認識を行う文字読取装置が開示されている。 Furthermore, in Patent Document 3, the recognition result is compared with a pre-registered word. If these match, the recognition result is correct, and if they do not match, the recognition result is incorrect. A character reading device that performs character recognition with high accuracy by deleting the characters is disclosed.

特開平５−２５８０９９号公報Japanese Patent Laid-Open No. 5-258099 特開平９−１３８８４０号公報JP-A-9-138840 特開２００１−１０１３４０号公報JP 2001-101340 A

しかしながら、上述した各特許文献においては、予め備えた一般的な辞書から該当する情報を検索し、検索された情報に基づき認識結果を補正するため、短時間で高精度な認識補正を行えないことができない場合があった。 However, in each of the above-mentioned patent documents, the corresponding information is searched from a general dictionary prepared in advance, and the recognition result is corrected based on the searched information, so that high-precision recognition correction cannot be performed in a short time. There was a case that could not be.

即ち、一般的な文字情報を文字認識する場合には、認識対象の文字範囲には制限が設けられていない。そして、認識結果である認識文字の補正に用いる辞書も、制限が設けられていない一般的な辞書が用いられている。 That is, when general character information is recognized, there is no restriction on the character range to be recognized. And the dictionary used for correction | amendment of the recognition character which is a recognition result is also a general dictionary without the restriction | limiting.

一方、金融機関で振込依頼書に記入された文字を認識する場合、振込依頼人（銀行利用者）は、特定の銀行（利用者の近所の銀行や取引のある銀行等）を利用して振込依頼をすることが多い。このため、認識文字の範囲は銀行利用者が記載する範囲に限定される。そして、振込依頼書は、フォーマットが規定されているので、記入されている文字も、氏名や住所等のように非常に限定された範囲となる。従って、認識対象が帳票の文字のような場合には、認識結果の補正に用いる辞書は一般的な辞書である必要がない。 On the other hand, when recognizing the characters entered in the transfer request form at a financial institution, the transfer requester (bank user) transfers money using a specific bank (such as a bank near the user or a bank with a transaction). I often request. For this reason, the range of the recognition character is limited to the range described by the bank user. Since the format of the transfer request form is prescribed, the characters entered are in a very limited range such as name and address. Therefore, when the recognition target is a form character, the dictionary used for correcting the recognition result need not be a general dictionary.

しかるに、上述した各特許文献においては、一般的な補正用の辞書を用いているため、該当する情報を検索する際の検索範囲が広くなって検索負荷が大きくなると共に、検索された情報が不適切な情報である確率も高くなり高精度に認識文字を補正できない場合があった。 However, in each of the above-mentioned patent documents, since a general correction dictionary is used, the search range when searching for the corresponding information is widened, the search load is increased, and the searched information is not valid. In some cases, the probability that the information is appropriate increases, and the recognized character cannot be corrected with high accuracy.

そこで、本発明の主目的は、認識結果の補正処理が高速、かつ、高精度に行える文字読取装置及び文字認識方法を提供することである。 Accordingly, a main object of the present invention is to provide a character reading apparatus and a character recognition method capable of correcting a recognition result at high speed and with high accuracy.

上記課題を解決するため、手書文字の画像データから当該手書文字を認識する文字認識装置は、手書文字の文字認識に用いる文字認識辞書と、該手書文字を記入した顧客に関する情報を格納する顧客辞書とを備えた辞書ユニットと、画像データを、文字認識辞書を用いて文字認識し、かつ、当該文字認識された中に含まれる認識文字の情報を用いて顧客辞書を検索し、検索された顧客情報に基づき認識文字を補正する認識ユニットと、を備えることを特徴とする。 In order to solve the above problem, a character recognition device for recognizing handwritten characters from image data of handwritten characters includes a character recognition dictionary used for character recognition of handwritten characters, and information about a customer who has entered the handwritten characters. A dictionary unit including a customer dictionary to store, image data is character-recognized using a character recognition dictionary, and the customer dictionary is searched using information of recognized characters included in the character recognition, A recognition unit that corrects the recognized character based on the retrieved customer information.

また、手書文字の画像データから当該手書文字を認識する文字認識方法は、画像データを、文字認識辞書を用いて文字認識する文字認識手順と、手書文字を記入した顧客に関する顧客基本情報と該顧客基本情報に関連づけられた顧客付随情報とにより構成される顧客辞書から認識文字を用いて顧客基本情報を検索する検索手順と、認識文字に含まれる顧客に関する認識文字を検索キーワードとして顧客辞書を検索して顧客基本情報を取得し、取得された顧客基本情報に基づき認識文字を補正する補正手順と、を含むことを特徴とする。 A character recognition method for recognizing handwritten characters from image data of handwritten characters includes a character recognition procedure for recognizing image data using a character recognition dictionary, and basic customer information about a customer who has entered handwritten characters. A search procedure for searching for customer basic information using a recognition character from a customer dictionary composed of the customer basic information associated with the customer basic information, and a customer dictionary using the recognition character for the customer included in the recognition character as a search keyword And a correction procedure for acquiring the basic customer information and correcting the recognized character based on the acquired basic customer information.

本発明によれば、画像データを、文字認識辞書を用いて文字認識し、かつ、顧客に関する認識結果を用いて顧客辞書を検索して、検索された顧客情報に基づき認識結果を補正するので、認識結果の補正処理が高速、かつ、高精度に行えるようになる。 According to the present invention, the image data is character-recognized using the character recognition dictionary, and the customer dictionary is searched using the recognition result related to the customer, and the recognition result is corrected based on the searched customer information. Recognition result correction processing can be performed at high speed and with high accuracy.

本発明の実施形態にかかる文字認識システムのブロック図である。It is a block diagram of the character recognition system concerning the embodiment of the present invention. 振込依頼書の概略的な書式を示す図である。It is a figure which shows the rough format of a transfer request form. 顧客情報テーブルを例示した図である。It is the figure which illustrated the customer information table. 前処理部及び認識部における処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in a pre-processing part and a recognition part. 検索単語照合処理における処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in a search word collation process.

本発明の実施形態を説明する。図１は、本発明にかかる文字認識装置６を用いた文字認識システム２のブロック図である。文字認識システム２は、文字認識装置６、サーバ４、端末５等を備え、これらがネットワークＬ等により接続されている。また、文字認識装置６には、スキャナ３が直接ケーブル接続されている。 An embodiment of the present invention will be described. FIG. 1 is a block diagram of a character recognition system 2 using a character recognition device 6 according to the present invention. The character recognition system 2 includes a character recognition device 6, a server 4, a terminal 5, and the like, which are connected by a network L or the like. The scanner 3 is directly connected to the character recognition device 6 by a cable.

なお、スキャナ３を、ネットワークＬを介して文字認識装置６と接続することは可能である。また、端末５と文字認識装置６とは別体に設けられている場合を示しているが、これらは一体であってもよい。 The scanner 3 can be connected to the character recognition device 6 via the network L. Moreover, although the case where the terminal 5 and the character recognition device 6 are provided separately is shown, these may be integrated.

スキャナ３は、振込依頼書等の帳票の画像データを取得する。取得された画像データは、識別番号が付与されて文字認識装置６を介してサーバ４に保存される。 The scanner 3 acquires image data of a form such as a transfer request form. The acquired image data is assigned an identification number and stored in the server 4 via the character recognition device 6.

図２は帳票の例として振込依頼書を例示した図である。帳票２０には、ユーザが手書で記入する複数のフィールド（記入欄）が設けられている。代表的なフィールドとしては、振込先の銀行名フィールド２０ａ、支店名フィールド２０ｂ、受取人姓フィールド２０ｃ、受取人名フィールド２０ｄ、口座番号フィールド２０ｅ、振込金額フィールド２０ｆ、依頼人電話番号フィールド２０ｇ、依頼人姓フィールド２０ｈ、依頼人名フィールド２０ｊ、依頼人姓名フィールド２０ｋ、依頼人住所フィールド２０ｍが例示できる。 FIG. 2 is a diagram illustrating a transfer request form as an example of a form. The form 20 is provided with a plurality of fields (entry fields) for the user to fill in by hand. Typical fields include bank name field 20a, bank name field 20b, payee surname field 20c, payee name field 20d, account number field 20e, transfer amount field 20f, client telephone number field 20g, client The surname field 20h, the client name field 20j, the client name field 20k, and the client address field 20m can be exemplified.

文字認識装置６は、図１に示すように、文字認識辞書１１ａ及び顧客辞書１１ｂを含む辞書ユニット１１と、前処理部１５、認識部１６、知識処理部１７を含む認識ユニット１２とを備える。 As shown in FIG. 1, the character recognition device 6 includes a dictionary unit 11 including a character recognition dictionary 11a and a customer dictionary 11b, and a recognition unit 12 including a preprocessing unit 15, a recognition unit 16, and a knowledge processing unit 17.

辞書ユニット１１における文字認識辞書１１ａは、文字の特徴を示す特徴ベクトル（登録特徴ベクトル）により構成されたデータベースである。この登録特徴ベクトルは、文字画像の輪郭を追跡して文字の特徴を抽出して得られた数値データ列である。 The character recognition dictionary 11a in the dictionary unit 11 is a database composed of feature vectors (registered feature vectors) indicating character features. This registered feature vector is a numerical data string obtained by tracing the outline of a character image and extracting character features.

顧客辞書１１ｂは、依頼人情報（顧客基本情報）と、この依頼人情報に関連づけられた受取人情報（顧客付随情報）とで構成された顧客情報データベースである。 The customer dictionary 11b is a customer information database composed of client information (customer basic information) and recipient information (customer incidental information) associated with the client information.

図３は、このような顧客情報データベースを顧客情報テーブル３０として表示した図である。この顧客情報テーブル３０は、顧客情報３１、取引履歴３２、姓情報３３、名情報３４、金融機関名情報３５、住所情報３６等の情報を含んでいる。そして、姓情報３３、名情報３４及び住所情報３６は依頼人情報に相当し、金融機関名情報３５及び取引履歴３２は依頼人付随情報に相当している。なお、姓情報３３、名情報３４、金融機関名情報３５及び住所情報３６は、顧客情報３１の各項目と相互に関連づけられ、また顧客情報３１と取引情報３２とは相互に関連づけられている。 FIG. 3 is a diagram in which such a customer information database is displayed as a customer information table 30. The customer information table 30 includes information such as customer information 31, transaction history 32, last name information 33, name information 34, financial institution name information 35, address information 36, and the like. The surname information 33, the first name information 34, and the address information 36 correspond to client information, and the financial institution name information 35 and the transaction history 32 correspond to client information. The surname information 33, the name information 34, the financial institution name information 35, and the address information 36 are associated with each item of the customer information 31, and the customer information 31 and the transaction information 32 are associated with each other.

認識ユニット１２における前処理部１５は、位置補正処理１５ａ、文字画像切出処理１５ｂ、２値化処理１５ｃ、文字画像補正処理１５ｄを行う。認識部１６は、特徴抽出処理１６ａ、認識処理１６ｂ、棄却判定処理１６ｃを行う。そして、知識処理部１７は、検索単語照合処理１７ａを行う。 The preprocessing unit 15 in the recognition unit 12 performs a position correction process 15a, a character image cutout process 15b, a binarization process 15c, and a character image correction process 15d. The recognition unit 16 performs a feature extraction process 16a, a recognition process 16b, and a rejection determination process 16c. Then, the knowledge processing unit 17 performs a search word matching process 17a.

図４は、前処理部１５及び認識部１６における処理手順を示すフローチャートである。 FIG. 4 is a flowchart showing a processing procedure in the preprocessing unit 15 and the recognition unit 16.

ステップＳ１：認識ユニット１２は、位置補正処理１５ａを行う。この位置補正処理１５ａは、画像データ内の基準マークや罫線位置等の既成情報を利用して、印刷位置ズレ・帳票カット誤差などを検出し、この検出結果に基づき認識対象となるフィールドの位置補正を行う処理である。 Step S1: The recognition unit 12 performs a position correction process 15a. This position correction process 15a detects information such as a print position shift / form cut error using the established information such as the reference mark and ruled line position in the image data, and corrects the position of the field to be recognized based on the detection result. It is a process to perform.

ステップＳ２：次に、文字画像切出処理１５ｂが行われる。文字画像切出処理１５ｂは、位置補正処理１５ａで位置補正された画像データから、認識対象となるフィールドに対応した画像データを切出して文字画像を取得する処理である。なお、画像データの切出しは、フィールドの長さ方向（水平方向）にセグメント単位（例えば、漢字の場合には「へん」や「つくり」に相当する単位）で行われる。そして、隣接する最大３セグメントに対応した画像を連結して、文字画像が生成される。なお、結合する最大セグメント数は、３セグメントに限定するものではない。 Step S2: Next, a character image cutting process 15b is performed. The character image cutout process 15b is a process of cutting out image data corresponding to a field to be recognized from the image data position-corrected by the position correction process 15a and acquiring a character image. The image data is cut out in segment units in the length direction (horizontal direction) of the field (for example, in the case of Chinese characters, a unit corresponding to “hen” or “making”). Then, a character image is generated by concatenating images corresponding to a maximum of three adjacent segments. Note that the maximum number of segments to be combined is not limited to three segments.

ステップＳ３：次に、文字画像に対して２値化処理１５ｃが行われる。２値化処理１５ｃは、文字画像切出処理１５ｂで切出された文字画像から動的に２値化閾値を決めて、この文字画像を２値画像に変換する処理である。動的に閾値を決める際には、切出した文字画像の平均濃度を算出し、この平均濃度の例えば６０％に対応する濃度を２値化閾値とする等の方法が可能である。 Step S3: Next, a binarization process 15c is performed on the character image. The binarization process 15c is a process of dynamically determining a binarization threshold value from the character image cut out by the character image cutout process 15b and converting the character image into a binary image. When the threshold value is dynamically determined, an average density of the extracted character image is calculated, and a density corresponding to, for example, 60% of the average density is set as a binarization threshold value.

ステップＳ４：次に、２値画像に対して、手書文字の「くせ字」等の補正を目的とした文字画像補正処理１５ｄが行われる。文字画像補正処理１５ｄは、２値画像の垂直・水平方向ストローク傾きを検出する。そして、傾きに応じた量の幾何変換を行うことにより、文字画像を水平方向・垂直方向に投影された文字画像に変換する。その後、文字画像の黒色画素の分布が一定になるように、この文字画像を非線形に拡大・縮小して正規化する。この正規化された文字画像は、正規化画像として出力される。 Step S4: Next, a character image correction process 15d is performed on the binary image for the purpose of correcting handwritten characters such as “characters”. The character image correction process 15d detects vertical and horizontal stroke inclinations of the binary image. Then, the character image is converted into a character image projected in the horizontal direction / vertical direction by performing geometric conversion in an amount corresponding to the inclination. Thereafter, the character image is nonlinearly enlarged and reduced and normalized so that the distribution of black pixels in the character image is constant. This normalized character image is output as a normalized image.

ステップＳ５：次に、正規化画像に含まれる文字の特徴を数値化する特徴抽出処理１６ａが行われる。特徴抽出処理１６ａは、帳票の背景色が白色で、文字が黒色の場合、その正規化画像の黒色画素からなる領域の輪郭を追跡することによって、文字画像から多次元の特徴ベクトル（抽出特徴ベクトル）を抽出する処理である。これにより文字画像は数値化される。 Step S5: Next, a feature extraction process 16a for digitizing the character features included in the normalized image is performed. When the background color of the form is white and the character is black, the feature extraction processing 16a tracks a contour of a region composed of black pixels of the normalized image to extract a multidimensional feature vector (extracted feature vector) from the character image. ). As a result, the character image is digitized.

ステップＳ６：認識処理１６ｂは、特徴抽出処理１６ａで得られた抽出特徴ベクトルと、文字認識辞書１１ａに格納されている登録特徴ベクトルとの距離を示す確信度パラメータを算出する。これにより、正規化画像に含まれる文字画像が文字コードと確信度パラメータを介して対応付される。なお、認識処理１６ｂは、文字単位で行われる。即ち、正規化画像が３文字の画像データを含んでいる場合には、文字毎に認識処理１６ｂが行われる。取得された文字コードに対応する文字が認識結果である。 Step S6: The recognition process 16b calculates a certainty parameter indicating the distance between the extracted feature vector obtained in the feature extraction process 16a and the registered feature vector stored in the character recognition dictionary 11a. Thus, the character image included in the normalized image is associated with the character code via the certainty factor parameter. The recognition process 16b is performed on a character basis. That is, when the normalized image includes three-character image data, the recognition process 16b is performed for each character. The character corresponding to the acquired character code is the recognition result.

確信度パラメータは、抽出特徴ベクトルと登録特徴ベクトルとの類似度を示している。従って、値の大きな確信度パラメータの認識結果は、文字認識辞書１１ａに登録されている文字と高い類似姓を持つ。 The certainty factor parameter indicates the similarity between the extracted feature vector and the registered feature vector. Accordingly, the recognition result of the certainty factor parameter having a large value has a similar surname to the characters registered in the character recognition dictionary 11a.

そして、認識処理１６ｂは、確信度パラメータの値が大きい認識結果の順に、最大１０の認識結果を選択し、これを候補文字とする。例えば、２値画像が３文字を含む場合、各文字に対して確信度パラメータの値の異なる複数の認識結果が得られる。そこで、認識処理部１６ｂは、確信度パラメータの値が大きい順に１０個の認識結果を候補文字とする。また、最も大きい値の確信度パラメータに対応した認識結果は認識文字と記載し、その文字列を認識文字列と記載する。
ステップＳ７：棄却判定処理１６ｃは、認識処理１６ｂで得られた候補文字の確信度パラメータが、予め設定された閾値以上である否かを判断し、閾値以上の候補文字は残し、閾値以下の候補文字は棄却する処理である。そして、残った候補文字に関して確信度パラメータが大きい順に第Ｎ候補文字とする。例えば、最も確信度パラメータの大きい候補文字は第１候補文字となり、３番目に確信度パラメータの大きい候補文字は第３候補文字となる。 And the recognition process 16b selects a maximum of 10 recognition results in order of the recognition result with the largest value of a certainty factor parameter, and makes this a candidate character. For example, when the binary image includes three characters, a plurality of recognition results having different confidence parameter values are obtained for each character. Therefore, the recognition processing unit 16b sets 10 recognition results as candidate characters in descending order of the certainty parameter value. The recognition result corresponding to the confidence parameter with the largest value is described as a recognized character, and the character string is described as a recognized character string.
Step S7: The rejection determination process 16c determines whether or not the certainty factor parameter of the candidate character obtained in the recognition process 16b is greater than or equal to a preset threshold, leaves candidate characters that are greater than or equal to the threshold, and candidates that are less than or equal to the threshold Characters are a process to reject. Then, the remaining candidate characters are set as the Nth candidate character in descending order of the certainty factor parameter. For example, the candidate character having the largest certainty parameter is the first candidate character, and the candidate character having the third certainty parameter is the third candidate character.

この棄却判定処理１６ｃにより、例えば正規画像が３文字を含み、その第１の文字には「２個」の候補文字、第２の文字には「５個」の候補文字、第３の文字には「０個」の候補文字のように、文字毎に異なる個数の候補文字が残ることになる。 By this rejection determination process 16c, for example, the regular image includes three characters, the first character is “2” candidate characters, the second character is “5” candidate characters, and the third character is As in “0” candidate characters, different numbers of candidate characters remain for each character.

候補文字が複数存在するため、これらを組み合わせて形成される候補文字列も複数になる。なお、上記例では第３の文字の候補文字の数が「０個」であるが、これは第３の文字が、誤認識（記入時の誤字・脱字含む）・認識不能のために欠落しているような場合に生じる。 Since there are a plurality of candidate characters, there are a plurality of candidate character strings formed by combining them. In the above example, the number of candidate characters for the third character is “0”, but this is missing because the third character is misrecognized (including typographical errors and omissions). This happens when

検索単語照合処理１７ａは、候補文字列を検索キーワードとして顧客辞書１１ｂから検索した顧客情報や取引履歴を検索し、この情報に基づき認識文字の修正を行う。 The search word matching process 17a searches customer information and transaction history searched from the customer dictionary 11b using the candidate character string as a search keyword, and corrects the recognized character based on this information.

図５は、検索単語照合処理１７ａにおける処理手順を示すフローチャートである。１つの候補文字列は、１つのフィールド内の文字列に対応する。そして、依頼人に関する顧客基本情報、振込金融機関や口座番号に関する顧客付随情報等は、それぞれ異なるフィールドに記入されている。 FIG. 5 is a flowchart showing a processing procedure in the search word matching process 17a. One candidate character string corresponds to a character string in one field. The customer basic information related to the client, the customer incidental information related to the transfer financial institution and the account number, etc. are entered in different fields.

ステップＳ８：まず、依頼人に関する候補文字の中に数字候補文字列が存在するか否かを判断する。依頼人に関する候補文字は、例えば、図２において、数字からなる依頼人電話番号フィールドの数字情報、仮名からなる依頼人姓フィールド及び依頼人名フィールドの文字情報、漢字を含む依頼人姓名フィールドの文字情報、漢字や数字を含む依頼人住所フィールドに対応した文字情報である。 Step S8: First, it is determined whether or not a number candidate character string exists among candidate characters related to the client. Candidate characters related to the client are, for example, the number information of the client telephone number field consisting of numbers, the character information of the client surname field and client name field consisting of kana, and the character information of the client surname field including kanji in FIG. This is character information corresponding to the client address field including kanji and numbers.

一般に、数字情報は、文字情報に比べて認識精度は高い。即ち、確信度パラメータの値が大きい。そこで、このような数字情報からなる数字候補文字列を優先度の高い検索キーワードとする。しかし、帳票の種類によっては、数字候補文字列が、必ず存在するとは限らない。そこで、数字候補文字列が存在するか否かを判断する。 In general, numeric information has higher recognition accuracy than character information. That is, the certainty parameter value is large. Therefore, a number candidate character string made up of such number information is set as a search keyword having a high priority. However, the number candidate character string does not always exist depending on the type of form. Therefore, it is determined whether or not a number candidate character string exists.

ステップＳ９〜Ｓ１１：数字候補文字列が存在する場合には、数字候補文字列を検索キーワードに設定し、数字候補文字列が存在しない場合には、非数字候補文字列を検索キーワードに設定する。ここで、非数字候補文字列とは数字候補文字列以外の候補文字列を言う。そして、検索キーワードを用いて顧客辞書１１ｂから顧客基本情報を検索する。 Steps S9 to S11: If a number candidate character string exists, the number candidate character string is set as a search keyword. If no number candidate character string exists, a non-number candidate character string is set as a search keyword. Here, the non-numeric candidate character string refers to a candidate character string other than the numeric candidate character string. Then, the customer basic information is searched from the customer dictionary 11b using the search keyword.

ステップＳ１２，Ｓ１３：このとき、検索キーワードに該当する顧客基本情報が登録されていない場合もある。そこで、顧客基本情報が取得できたか否かを判断する。取得できなかった場合には、依頼人に関する候補文字列は複数存在するので、全ての候補文字列を検索キーワードとした検索を行ったか否かを判断する。全ての候補文字列を検索キーワードとして検索していない場合には、ステップＳ８に戻り、その他の場合は処理を終了する。この場合は、認識処理で得た最も確信度パラメータの値の大きい認識結果から認識文字が最終的な認識結果となる。 Steps S12 and S13: At this time, customer basic information corresponding to the search keyword may not be registered. Therefore, it is determined whether or not the customer basic information has been acquired. If it cannot be obtained, there are a plurality of candidate character strings related to the client, so it is determined whether or not a search has been performed using all candidate character strings as search keywords. If all candidate character strings have not been searched as search keywords, the process returns to step S8, and otherwise the process ends. In this case, the recognized character becomes the final recognition result from the recognition result having the largest certainty parameter value obtained by the recognition process.

ステップＳ１４：取得された顧客基本情報が適正であるか否かを判断する。即ち、検索キーワードが不適切であったり、類似の顧客基本情報が存在していたりして、取得した顧客基本情報が、振込み依頼人でない場合がある。そこで、取得された顧客基本情報に検索キーワードが完全に含まれる場合のみ、取得された顧客基本情報が適正であると判断し、それ以外の場合には不適正と判断する。 Step S14: It is determined whether or not the acquired customer basic information is appropriate. That is, there are cases where the search keyword is inappropriate or similar basic customer information exists, and the acquired basic customer information is not a transfer requester. Therefore, it is determined that the acquired customer basic information is appropriate only when the search keyword is completely included in the acquired customer basic information, and is determined to be inappropriate in other cases.

例えば、依頼人姓名が「特許一郎」で、候補文字に数値候補文字列である郵便番号が含まれていたので、この郵便番号の候補文字を検索キーワードとして検索した。このとき得られた顧客基本情報には、「特許一郎」はなく「特許二郎」があったとする。この場合は、取得された「特許二郎」の顧客基本情報を「特許一郎」の顧客基本情報とすることはできないので不適正と判断する。 For example, since the client's full name is “Patent Ichiro” and the candidate character includes a zip code which is a numerical candidate character string, the candidate character of this zip code was searched as a search keyword. It is assumed that the customer basic information obtained at this time is “Patent Jiro” instead of “Patent Ichiro”. In this case, since the acquired customer basic information of “Patent Jiro” cannot be used as the customer basic information of “Patent Ichiro”, it is determined to be inappropriate.

取得された顧客基本情報が不適正であると判断された場合には、ステップＳ１３に進み、検索キーワードを変えて再検索が行なわれる。一方、検索キーワードと完全に一致する情報がヒットした場合には、ステップＳ１５に進む。 If it is determined that the acquired customer basic information is inappropriate, the process proceeds to step S13, and the search keyword is changed and the search is performed again. On the other hand, if information that completely matches the search keyword is hit, the process proceeds to step S15.

ステップＳ１５：以上の処理により依頼人に関する顧客基本情報が取得されたので、この依頼人が過去に行った振込履歴をソートする指令が辞書ユニット１１に出力される。辞書ユニット１１は、振込履歴のソート指令を受信すると、振込先の頻度順や振込日の順等の予め設定された方法で振込履歴をソートする。 Step S15: Since the customer basic information related to the client is obtained by the above processing, a command for sorting the transfer history made by the client in the past is output to the dictionary unit 11. When the dictionary unit 11 receives the transfer history sorting command, the dictionary unit 11 sorts the transfer history by a preset method such as the frequency order of the transfer destinations or the order of the transfer dates.

ステップＳ１６，Ｓ１７：取得した顧客基本情報に基づき認識結果の補正が必要か否かを判断する。振込み人に関する候補文字には、依頼人の姓名、住所、電話番号等の複数の情報が含まれている。従って、取得した顧客基本情報の依頼人の姓名と電話番号が一致していても、住所が一致しているとは限らない。この場合は、住所に関する補正が必要となる。無論、住所が全て異なる場合は、顧客基本情報に基づき補正してしまうと、実際に依頼人が転居しているような場合には、逆に間違った情報で補正することになる。そこで、補正する際には、一致度が予め設定された値以上（例えば８０％以上）の場合にのみ補正するようにする。 Steps S16 and S17: It is determined whether or not the recognition result needs to be corrected based on the acquired customer basic information. The candidate character regarding the transfer person includes a plurality of pieces of information such as the client's first and last name, address, and telephone number. Therefore, even if the first and last name of the client of the acquired customer basic information matches the telephone number, the address does not always match. In this case, correction relating to the address is required. Of course, if all the addresses are different, if correction is made based on the basic customer information, if the client actually moves, the correction is made with wrong information. Therefore, when the correction is made, the correction is made only when the degree of coincidence is a predetermined value or more (for example, 80% or more).

例えば、取得された顧客基本情報に含まれる依頼人の住所が「東京都港区芝５丁目７−１」（１２文字）であったが、候補文字の住所は「東京都港区芝５丁目１−１」（１１文字）で一致しなかった場合、一致度は９１．７％（＝１１／１２＊１００）なので取得した住所で、認識文字を補正する。 For example, the client's address included in the acquired customer basic information is “5-7-1 Shiba, Minato-ku, Tokyo” (12 characters), but the candidate address is “5-chome Shiba, Minato-ku, Tokyo”. When “1-1” (11 characters) does not match, the matching degree is 91.7% (= 11/12 * 100), so the recognized character is corrected with the acquired address.

ステップＳ１８，Ｓ１９：以上の処理により、依頼人に関する認識結果は適正に補正されたので、この補正された認識文字を検索キーワードとして受取人の情報に相当する顧客付随情報を検索する。そして、取得した顧客付随情報の適否を判断する。顧客付随情報の適否は、顧客基本情報の適否判断と同様の手順で行う。 Steps S18 and S19: As a result of the above processing, the recognition result regarding the client has been properly corrected, and the customer-accompanying information corresponding to the recipient information is searched using the corrected recognition character as a search keyword. And the suitability of the acquired customer accompanying information is determined. Appropriateness of the customer-accompanying information is performed in the same procedure as that for determining whether the customer basic information is appropriate.

ステップＳ２０，Ｓ２１：顧客付随情報により認識文字を補正するか否かを判断し、補正が必要な場合には受取人に関する認識文字を補正する。 Steps S20 and S21: It is determined whether or not the recognized character is to be corrected based on the customer-accompanying information. If correction is necessary, the recognized character regarding the recipient is corrected.

以上説明したように、画像データを、文字認識辞書を用いて文字認識し、かつ、顧客に関する認識結果を用いて顧客辞書を検索して、検索された顧客情報に基づき認識結果を補正するので、認識結果の補正処理が高速、かつ、高精度に行えるようになる。 As described above, the image data is character-recognized using the character recognition dictionary, and the customer dictionary is searched using the recognition result regarding the customer, and the recognition result is corrected based on the searched customer information. Recognition result correction processing can be performed at high speed and with high accuracy.

２文字認識システム
３スキャナ
４サーバ
５端末
６文字認識装置
１１辞書ユニット
１１ａ文字認識辞書
１１ｂ顧客辞書
１２認識ユニット
１５前処理部
１５ａ位置補正処理
１５ｂ文字画像切出処理
１５ｃ２値化処理
１５ｄ文字画像補正処理
１６認識部
１６ａ特徴抽出処理
１６ｃ棄却判定処理
１６ｂ認識処理
１７知識処理部
１７ａ検索単語照合処理
３０顧客情報テーブル 2 character recognition system 3 scanner 4 server 5 terminal 6 character recognition device 11 dictionary unit 11a character recognition dictionary 11b customer dictionary 12 recognition unit 15 preprocessing unit 15a position correction process 15b character image cutout process 15c binarization process 15d character image correction Process 16 Recognition unit 16a Feature extraction process 16c Rejection determination process 16b Recognition process 17 Knowledge processing unit 17a Search word matching process 30 Customer information table

Claims

A character recognition device for recognizing handwritten characters from image data of handwritten characters,
A dictionary unit comprising: a character recognition dictionary used for character recognition of the handwritten characters; and a customer dictionary for storing information about a customer who entered the handwritten characters;
The image data is character-recognized using the character recognition dictionary, and the customer dictionary is searched using information of recognized characters included in the character recognized, and the recognition is performed based on the searched customer information. A recognition unit for correcting characters,
The customer dictionary is composed of customer basic information and customer-accompanying information associated with the customer basic information ,
The recognition unit is
A preprocessing unit for obtaining a character image from the image data;
A recognition unit for obtaining a recognized character based on the acquired character image and the character recognition dictionary;
If the customer basic information is obtained by searching the customer dictionary using information about the customer included in the recognition character as a search keyword, and the acquired customer basic information is appropriate, the acquired customer basic information A knowledge processing unit that corrects the recognized character based on:
Character recognition apparatus according to claim Rukoto equipped with.

The character recognition device according to claim 1,
The recognition unit includes a preprocessing unit that cuts a recognition target from the image data and acquires a character image;
A recognition unit that obtains a feature of a character included in the character image as an extracted feature vector, compares the extracted feature vector with a registered feature vector registered in the character recognition dictionary, and obtains a recognized character;
A knowledge processing unit that searches the customer dictionary using the information about the customer included in the recognized character as a search keyword to acquire the customer basic information, and corrects the recognized character based on the acquired customer basic information. A character recognition device.

The character recognition device according to claim 2,
The knowledge processing unit corrects the recognized character based on the acquired customer basic information, and then corrects the recognized character based on the customer-accompanying information associated with the customer basic information. apparatus.

The character recognition device according to claim 3,
When the recognition character regarding the customer included in the recognition character includes numeric information and character information, the character information is set as the search keyword having a higher search priority than the character information. apparatus.

The character recognition device according to claim 3 or 4,
The knowledge processing unit instructs the dictionary unit to sort the customer-accompanying information associated with the customer basic information when the correction of the recognized character is started based on the acquired customer basic information. A character recognition device.

A character recognition method for recognizing handwritten characters from image data of handwritten characters,
A character recognition procedure for recognizing the image data using a character recognition dictionary;
A search procedure for searching the customer basic information using the recognition character from a customer dictionary composed of customer basic information related to the customer who entered the handwritten characters and customer incidental information associated with the customer basic information;
A first correction procedure for acquiring the customer basic information by searching the customer dictionary using a recognition character related to a customer included in the recognition character as a search keyword, and correcting the recognition character based on the acquired customer basic information; ,
After correcting the recognized character based on the acquired customer basic information, see contains a second correction procedure that corrects the recognized character based on said customer associated information associated with the customer basic information,
The first correction procedure includes:
A preprocessing procedure for obtaining a character image from the image data;
A recognition procedure for obtaining a recognized character based on the acquired character image and the character recognition dictionary;
If the customer basic information is obtained by searching the customer dictionary using information about the customer included in the recognition character as a search keyword, and the acquired customer basic information is appropriate, the acquired customer basic information A knowledge processing procedure for correcting the recognized character based on:
Character recognition method, which comprises a.

The character recognition method according to claim 6,
When the recognition character related to the customer included in the recognition character includes numerical information and character information, it includes a procedure for setting the numerical information as the search keyword having a higher search priority than the character information. Character recognition method.

The character recognition method according to claim 7,
A character including a procedure for instructing the dictionary unit to sort the customer-accompanying information associated with the customer basic information when starting correction of the recognized character based on the acquired customer basic information. Recognition method.