JP5669041B2

JP5669041B2 - Document processing apparatus and document processing method

Info

Publication number: JP5669041B2
Application number: JP2011016490A
Authority: JP
Inventors: 正和藤尾; 英宣谷口; 邦彦高瀬; 羽根　慎吾; 慎吾羽根; 繁幸根本; 崇文臼井; 池田　尚司; 尚司池田; 永崎　健; 健永崎
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2011-01-28
Filing date: 2011-01-28
Publication date: 2015-02-12
Anticipated expiration: 2031-01-28
Also published as: JP2012155662A

Description

本発明は、文字認識技術に関し、特に、活字文字を認識した結果を利用して手書き文字を認識する技術に関する。 The present invention relates to a character recognition technique, and more particularly to a technique for recognizing a handwritten character using a result of recognizing a printed character.

手書き文字及び活字文字を光学的に読み取り、これを文字データに変換して利用する文字認識技術が開示されている。 There has been disclosed a character recognition technique for optically reading handwritten characters and printed characters and converting them into character data for use.

例えば特許文献１には、難解な文字又は判読不能な文字を含む文字列を正確かつ効率的に読み取る技術が開示されている。 For example, Patent Document 1 discloses a technique for accurately and efficiently reading a character string including difficult characters or unreadable characters.

特許文献２には、タイプミス又は語の転置等に起因する表記ゆれを含むテキストから、辞書に登録された文字列を抽出する技術が開示されている。 Patent Document 2 discloses a technique for extracting a character string registered in a dictionary from a text including a notation fluctuation caused by a typo or transposition of a word.

特開２００９−２６５７５８号公報JP 2009-265758 A 特開２００７−２５８３４号公報JP 2007-25834 A

文字認識技術には種々の用途があるが、その一例として、官庁又は企業等における窓口業務が挙げられる。例えば、申請者が手書きした種々の申請書類を自動的に文字データに変換することによって、窓口業務を効率化することができる。しかし、活字文字の認識精度と比較すると、手書き文字の認識精度が著しく低いことが従来の文字認識技術の問題であった。また、一般に窓口では種々の書類が処理されるが、書類の種類ごとにフォーマットが異なる場合、読み取るべき手書き文字が書かれている位置も種類ごとに異なり、このことが原因で手書き文字の認識に失敗する場合もある。 The character recognition technology has various uses, and an example thereof is a window service in a government office or a company. For example, by automatically converting various application documents handwritten by the applicant into character data, the window service can be made more efficient. However, it has been a problem of the conventional character recognition technology that the recognition accuracy of handwritten characters is significantly lower than the recognition accuracy of type characters. In general, various kinds of documents are processed at the counter, but when the format is different for each type of document, the position where the handwritten character to be read is also different for each type. Sometimes it fails.

さらに、このような申請書類を受け付ける際に、申請者の本人確認が要求される場合がある。例えば、申請者が手書きした申請書類と、申請者が持参した本人確認書類（例えば自動車運転免許証等）とが照合され、両者の記載内容が一致すると判定された場合に申請書類が受理される。このような照合作業も文字認識技術を用いて自動化することができれば、窓口業務が大幅に効率化されるとともに、窓口のオペレータによる不正な処理も防止することができる。 Furthermore, when accepting such application documents, the identity of the applicant may be required. For example, the application documents handwritten by the applicant and the identity verification document (for example, a car driver's license) brought by the applicant are collated, and the application documents are accepted when it is determined that the contents of both match. . If such collation work can be automated using character recognition technology, the window service can be greatly improved, and unauthorized processing by the window operator can be prevented.

しかし、このような本人確認を自動化する場合、上記のような手書き文字の認識精度の問題に加えて、文字の種類の相違が原因で本人確認に失敗するという問題がある。本人確認書類に記載された活字文字は、その書類を作成したシステムにおいて使用できるものに限定される。このため、本人確認書類に記載された活字文字の字体が、申請者が手書きした文字の字体と異なる場合がある。このような相違は、典型的には漢字の異体字が使用される場合に発生するが、その他の場合にも発生し得る。例えば、本人確認書類がパスポートであり、申請書類には漢字の記入が要求される場合、漢字の手書き文字とローマ字の活字文字とが照合される。これらの場合には、文字認識そのものが成功したとしても、本人確認は失敗することになる。 However, when automating such identity verification, in addition to the above-mentioned problem of recognition accuracy of handwritten characters, there is a problem that identity verification fails due to the difference in character type. The printed characters described in the identity verification document are limited to those that can be used in the system that created the document. For this reason, the typeface of the printed character described in the identification document may be different from the type of the character handwritten by the applicant. Such a difference typically occurs when Kanji variants are used, but may also occur in other cases. For example, when the identity verification document is a passport and the application document requires entry of kanji, the kanji handwritten character and the roman character are collated. In these cases, even if the character recognition itself is successful, the identity verification will fail.

本発明は上記のような問題点に鑑みてなされたものであり、文字認識の精度を改善するとともに、文字の種類の相違に起因する本人確認の失敗を防止することを目的とする。 The present invention has been made in view of the above-described problems, and it is an object of the present invention to improve the accuracy of character recognition and prevent failure of identity verification due to a difference in character type.

本発明の代表的な一例を示せば次の通りである。すなわち、入力装置と、前記入力装置に接続されたプロセッサと、前記プロセッサに接続された記憶装置と、前記プロセッサに接続された出力装置と、を備える文書処理装置であって、前記記憶装置は、活字文字の特徴量を示す情報を含む活字文字認識用辞書、手書き文字の特徴量を示す情報を含む手書き文字認識用辞書、並びに、複数の文字、前記複数の文字間の接続、及び前記接続の起こりやすさを示すスコアを含む文字列パターン辞書を保持し、前記プロセッサは、前記入力装置を介して入力された手書き文字列及び活字文字列を含む画像データを取得し、前記活字文字認識用辞書を用いて前記活字文字列を認識し、前記活字文字列の認識結果として取得した文字列及びその文字列に含まれる文字間の接続が既に前記文字列パターン辞書に登録されている場合、前記登録されている文字間の接続に関する前記スコアの値を増加させ、前記活字文字列の認識結果として取得した文字列及びその文字列に含まれる文字間の接続が前記文字列パターン辞書に登録されていない場合、前記活字文字列の認識結果として取得した文字列に含まれる文字、それらの文字間の接続及びその接続の起こりやすさを示すスコアを新たに前記文字列パターン辞書に登録し、前記手書き文字認識用辞書及び前記文字列パターン辞書を用いて前記手書き文字列を認識し、前記出力装置を介して前記手書き文字列の認識結果を出力することを特徴とする。 A typical example of the present invention is as follows. That is, a document processing apparatus comprising an input device, a processor connected to the input device, a storage device connected to the processor, and an output device connected to the processor, wherein the storage device is print character recognition dictionary including information indicating the characteristic quantity of printed characters, handwritten character recognition dictionary including information indicating a characteristic of the handwritten characters, as well as a plurality of characters, the connection between the plurality of characters, and of the connection A character string pattern dictionary including a score indicating the likelihood of occurrence is held, and the processor acquires image data including a handwritten character string and a character string input via the input device, and the character character recognition dictionary the print string recognizes, connection already said string pattern prefix between characters included in the obtained character string and the character string as a recognition result of the printed character string using The character string obtained as a recognition result of the printed character string and the connection between the characters included in the character string are increased by increasing the value of the score relating to the connection between the registered characters. If it is not registered in the character string pattern dictionary, the character string newly included in the character string acquired as a recognition result of the printed character string, a connection between those characters and a score indicating the likelihood of the connection are newly added to the character string Registering in a pattern dictionary, recognizing the handwritten character string using the handwritten character recognition dictionary and the character string pattern dictionary, and outputting the recognition result of the handwritten character string via the output device .

本発明の一実施形態によれば、手書き文字の認識精度を改善することができる。 According to one embodiment of the present invention, the recognition accuracy of handwritten characters can be improved.

本発明の実施形態の窓口受付システムの構成を示すブロック図である。It is a block diagram which shows the structure of the window reception system of embodiment of this invention. 本発明の実施形態の窓口受付システムによって読み取られた画像の説明図である。It is explanatory drawing of the image read by the window reception system of embodiment of this invention. 従来のＯＣＲ装置による窓口業務効率化のための文字認識処理のフローチャートである。It is a flowchart of the character recognition process for the window work efficiency improvement by the conventional OCR apparatus. 本発明の実施形態の窓口受付システムが実行する文字認識処理のフローチャートである。It is a flowchart of the character recognition process which the window reception system of embodiment of this invention performs. 本発明の実施形態の項目名辞書の説明図である。It is explanatory drawing of the item name dictionary of embodiment of this invention. 本発明の実施形態のデータ文字列パターン辞書の説明図である。It is explanatory drawing of the data character string pattern dictionary of embodiment of this invention. 本発明の実施形態の窓口受付システムが実行するデータ文字列パターン辞書の動的追加処理のフローチャートである。It is a flowchart of the dynamic addition process of the data character string pattern dictionary which the window reception system of embodiment of this invention performs. 本発明の実施形態の窓口受付システムが実行する、データ文字列パターン辞書を予め保持しない場合の手書き文字認識処理のフローチャートである。It is a flowchart of the handwritten character recognition process in the case of not holding the data character string pattern dictionary previously performed by the window reception system of the embodiment of the present invention. 本発明の実施形態の窓口受付システムが実行する、データ文字列パターン辞書を予め保持しない場合の手書き文字認識処理の別の例を示すフローチャートである。It is a flowchart which shows another example of the handwritten character recognition process in the case of not hold | maintaining the data character string pattern dictionary previously performed by the window reception system of embodiment of this invention.

以下、図面を用いて本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図１は、本発明の実施形態の窓口受付システムの構成を示すブロック図である。 FIG. 1 is a block diagram illustrating a configuration of a window reception system according to an embodiment of this invention.

本実施形態の窓口受付システム１０は、窓口業務を支援するために、文字認識を利用して文書処理を行う計算機であり、例えば、入力装置１１、表示装置１２、ＣＰＵ（Central Processing Unit）１３、印刷装置１４、ワークエリア１５及び情報保持手段１６を備える。 The window reception system 10 of the present embodiment is a computer that performs document processing using character recognition in order to support a window service, and includes, for example, an input device 11, a display device 12, a CPU (Central Processing Unit) 13, A printing apparatus 14, a work area 15, and information holding means 16 are provided.

入力装置１１は、入力装置１１０及び画像入力装置１１１を含む。入力装置１１０は、ユーザによる指示等の入力を受け付ける装置であり、例えばキーボード、マウス又はタッチパネル等であってもよい。画像入力装置１１１は、手書き文字及び活字文字等が記載された書面を光学的に読み取ってそれらを画像データに変換する、いわゆるイメージスキャナである。 The input device 11 includes an input device 110 and an image input device 111. The input device 110 is a device that accepts an input such as an instruction from a user, and may be a keyboard, a mouse, a touch panel, or the like, for example. The image input device 111 is a so-called image scanner that optically reads a document on which handwritten characters and printed characters are written and converts them into image data.

表示装置１２は、ユーザに種々の文字及び画像等の情報を出力する装置であり、例えば液晶ディスプレイのような画像表示装置であってもよい。 The display device 12 is a device that outputs information such as various characters and images to the user, and may be an image display device such as a liquid crystal display.

ＣＰＵ１３は、ワークエリア１５に格納されたプログラムを実行することによって種々の機能を実現する処理装置である。以下に説明される各プログラムの処理は、実際にはＣＰＵ１３が実行する。 The CPU 13 is a processing device that implements various functions by executing a program stored in the work area 15. The processing of each program described below is actually executed by the CPU 13.

印刷装置１４は、ユーザに提供する種々の情報を必要に応じて印刷する。 The printing device 14 prints various information provided to the user as necessary.

ワークエリア１５は、ＣＰＵ１３によって実行されるプログラム等が格納される記憶領域である。本実施形態のワークエリア１５には、ＯＳ（Operating System）１５１、通信プログラム１５２、文書処理プログラム１５３及び文字認識プログラム１５４が格納される。ワークエリア１５にはさらに他のプログラムが格納されてもよいし、それらのプログラムをＣＰＵ１３が実行するときに参照されるデータが格納されてもよいし、ＣＰＵ１３が実行した処理の結果が格納されてもよい。これらのプログラムに基づく処理については後述する。 The work area 15 is a storage area for storing programs executed by the CPU 13. The work area 15 of this embodiment stores an OS (Operating System) 151, a communication program 152, a document processing program 153, and a character recognition program 154. Still other programs may be stored in the work area 15, data referred to when the CPU 13 executes these programs may be stored, and results of processing executed by the CPU 13 may be stored. Also good. Processing based on these programs will be described later.

情報保持手段１６は、ＣＰＵ１３が各プログラムに基づいて種々の処理を実行するために参照する情報を格納する。本実施形態の情報保持手段１６には、活字文字認識用辞書１６１、手書き文字認識用辞書１６２、外字・ローマ字辞書１６３、配置知識辞書１６４、項目名辞書１６５及びデータ文字列パターン辞書１６６が格納される。 The information holding means 16 stores information that the CPU 13 refers to in order to execute various processes based on each program. The information holding means 16 of the present embodiment stores a printed character recognition dictionary 161, a handwritten character recognition dictionary 162, an external character / Roman character dictionary 163, an arrangement knowledge dictionary 164, an item name dictionary 165, and a data character string pattern dictionary 166. The

活字文字認識用辞書１６１は、活字文字の特徴量を示す情報を含み、窓口受付システム１０が活字文字認識を実行する際に参照される。手書き文字認識用辞書１６２は、手書き文字の特徴量を示す情報を含み、窓口受付システム手書き文字認識を実行する際に参照される。これらの辞書は、従来の文字認識において使用されるものと同様のものであってよい。 The printed character recognition dictionary 161 includes information indicating the characteristic amount of printed characters, and is referred to when the window reception system 10 performs printed character recognition. The handwritten character recognition dictionary 162 includes information indicating the characteristic amount of the handwritten character, and is referred to when the counter reception system handwritten character recognition is executed. These dictionaries may be similar to those used in conventional character recognition.

外字・ローマ字辞書１６３には、窓口受付システム１０が同一と判定する文字の範囲を示す情報が格納される。例えば、特定の漢字の異体字がその漢字の代わりに慣用される場合であって、窓口受付システム１０がそれらを同一の文字と判定することが許容される場合、それらを対応付ける情報が外字・ローマ字辞書１６３に格納される。さらに、漢字とその漢字に対応するローマ字表記とを対応付ける情報が外字・ローマ字辞書１６３に格納されてもよい。外字・ローマ字辞書１６３及びそれを用いた処理の例については後述する（図９参照）。 The external character / Romaji dictionary 163 stores information indicating a range of characters determined to be the same by the window reception system 10. For example, when a variant of a specific kanji is used in place of that kanji and the window reception system 10 is allowed to determine that they are the same character, the information for associating them is an external character / roman character It is stored in the dictionary 163. Furthermore, information that associates a Chinese character with a Roman character notation corresponding to the Chinese character may be stored in the external character / Roman character dictionary 163. An example of the external character / Romaji dictionary 163 and processing using the dictionary will be described later (see FIG. 9).

配置知識辞書１６４、項目名辞書１６５及びデータ文字列パターン辞書１６６については後述する（図４参照）。なお、後述するように、窓口受付システム１０がデータ文字列パターン辞書１６６を予め保持しない場合もある。 The arrangement knowledge dictionary 164, the item name dictionary 165, and the data character string pattern dictionary 166 will be described later (see FIG. 4). As will be described later, the window reception system 10 may not hold the data character string pattern dictionary 166 in advance.

情報保持手段１６にはさらに他の情報が格納されてもよいが、本実施形態では説明を省略する。 Other information may be stored in the information holding unit 16, but the description is omitted in this embodiment.

ワークエリア１５及び情報保持手段１６は、窓口受付システム１０に実装された記憶装置に相当する。典型的には、ワークエリア１５がＤＲＡＭ（Dynamic Random Access Memory）のような高速かつ揮発性の記憶装置であり、情報保持手段１６がＨＤＤ（Hard Disk Drive）又はフラッシュメモリのような大容量かつ不揮発性の記憶装置であるが、その他の種類の記憶装置であってもよい。各プログラムは、情報保持手段１６に格納され、ＣＰＵ１３によって実行されるときにワークエリア１５にコピーされてもよい。また、情報保持手段１６に格納された辞書の少なくとも一部が、必要に応じて一時的にワークエリア１５にコピーされてもよい。 The work area 15 and the information holding unit 16 correspond to a storage device installed in the window reception system 10. Typically, the work area 15 is a high-speed and volatile storage device such as a DRAM (Dynamic Random Access Memory), and the information holding means 16 is a large-capacity and nonvolatile memory such as an HDD (Hard Disk Drive) or a flash memory. However, other types of storage devices may be used. Each program may be stored in the information holding unit 16 and copied to the work area 15 when executed by the CPU 13. Further, at least a part of the dictionary stored in the information holding unit 16 may be temporarily copied to the work area 15 as necessary.

窓口受付システム１０は、さらに、通信ネットワーク１９を介して一つ以上のファイルサーバ２１と接続されてもよい。各ファイルサーバ２１は、通信ネットワーク１９に接続された計算機であり、そのハードウエア構成は窓口受付システム１０と同様であってもよい。 The window reception system 10 may be further connected to one or more file servers 21 via the communication network 19. Each file server 21 is a computer connected to the communication network 19, and its hardware configuration may be the same as that of the window reception system 10.

図１には、窓口受付システム１０が一つの計算機によって実現される例を示したが、窓口受付システム１０と同様の機能が複数の計算機によって実現されてもよい。例えば、活字文字認識用辞書１６１等の各種の辞書が情報保持手段１６に格納される代わりにいずれかのファイルサーバ２１に格納されてもよい。あるいは、例えば、画像入力装置１１１が一つ又は複数のファイルサーバ２１に設けられ、その画像入力装置１１１が後述する申請書及び本人確認書類を読み取ってもよい。その場合、ファイルサーバ２１によって読み取られた画像データが通信ネットワーク１９を介して窓口受付システム１０に送信される。上記の場合、入力装置１１は、通信ネットワーク１９に接続された通信装置を含んでもよい。通信ネットワーク１９を介した通信は、通信プログラム１５２によって制御される。 Although FIG. 1 shows an example in which the window reception system 10 is realized by one computer, the same function as the window reception system 10 may be realized by a plurality of computers. For example, various dictionaries such as the printed character recognition dictionary 161 may be stored in any one of the file servers 21 instead of being stored in the information holding unit 16. Alternatively, for example, the image input device 111 may be provided in one or a plurality of file servers 21, and the image input device 111 may read an application form and an identification document described later. In that case, the image data read by the file server 21 is transmitted to the window reception system 10 via the communication network 19. In the above case, the input device 11 may include a communication device connected to the communication network 19. Communication via the communication network 19 is controlled by the communication program 152.

図２は、本発明の実施形態の窓口受付システム１０によって読み取られた画像の説明図である。 FIG. 2 is an explanatory diagram of an image read by the window reception system 10 according to the embodiment of this invention.

窓口受付システム１０は、画像入力装置１１１を用いて申請書及びその申請書を受け付けるために必要な本人確認書類を光学的に読み取り、それによって得られた画像データをワークエリア１５又は情報保持手段１６に格納する。このような読み取りはどのような手順で実行されてもよいが、典型的には、それらの書類が１枚の画像に含まれるように読み取られる。図２には、一例として、住民票を取得するための申請書２１１及び運転免許証２１２（すなわち本人確認書類）が読み取られた場合の画像２００を示す。 The window reception system 10 optically reads an application form and an identification document necessary for receiving the application form using the image input device 111, and uses the image data obtained thereby as a work area 15 or information holding means 16. To store. Such reading may be performed by any procedure, but typically, the documents are read so as to be included in one image. FIG. 2 shows, as an example, an image 200 when an application form 211 for obtaining a resident card and a driver's license 212 (that is, an identity confirmation document) are read.

申請書２１１には、申請者の住所、氏名及び生年月日等が手書きされる。以下、申請書２１１の氏名欄２０１（図２の「氏名欄１」）に手書きされた氏名の文字の認識を例として説明するが、本実施形態は他の欄に記載された手書き文字についても適用することができ、また、手書き文字を含む書類であれば、申請書以外の書類にも適用することができる。図２の例では、氏名として手書き文字列「日崎太郎」が記載されている。 The application form 211 is handwritten with the address, name, date of birth, etc. of the applicant. Hereinafter, although the recognition of the character of the name handwritten in the name field 201 (“Name field 1” in FIG. 2) of the application 211 will be described as an example, the present embodiment also applies to the handwritten characters described in other fields. The present invention can be applied to any document other than the application as long as it includes a handwritten character. In the example of FIG. 2, the handwritten character string “Taro Hizaki” is described as the name.

運転免許証２１２には、免許を受けた者の住所、氏名及び生年月日等が活字で記載されている。以下の説明では、氏名欄２０２（図２の「氏名欄２」）に記載された氏名の活字文字が使用される。なお、運転免許証は本人確認書類の一例であり、それ以外の書類（例えばパスポート又は健康保険証等）が本人確認書類として使用されてもよい。図２の例では、氏名として活字文字列「日埼太郎」が記載されている。 In the driver's license 212, the address, name, date of birth, etc. of the person who has received the license are written in print. In the following description, the type characters of the name described in the name field 202 (“name field 2” in FIG. 2) are used. The driver's license is an example of an identity verification document, and other documents (for example, a passport or a health insurance card) may be used as the identity verification document. In the example of FIG. 2, the type character string “Taro Hisaki” is described as the name.

図３は、従来のＯＣＲ（光学式文字読取）装置による窓口業務効率化のための文字認識処理のフローチャートである。 FIG. 3 is a flowchart of a character recognition process for improving the efficiency of window work by a conventional OCR (optical character reading) apparatus.

例えば図２の申請書２１１の画像が入力されると（ステップ３０１）、従来のＯＣＲ装置（図示省略）は、入力された画像における氏名欄２０１の位置を判定し（ステップ３０２）、氏名欄２０１に記載された手書き文字を認識する（ステップ３０３）。 For example, when the image of the application form 211 of FIG. 2 is input (step 301), the conventional OCR device (not shown) determines the position of the name field 201 in the input image (step 302), and the name field 201 Are recognized (step 303).

一方、例えば図２の運転免許証２１２の画像が入力されると（ステップ３１１）、従来のＯＣＲ装置は、入力された画像における氏名欄２０２の位置を判定し（ステップ３１２）、氏名欄２０２に記載された活字文字を認識する（ステップ３１３）。活字文字認識処理（ステップ３０１〜３０３）及び手書き文字認識処理（ステップ３１１〜３１２）は、一つのＯＣＲ装置によって実行されてもよいし、それぞれ別のＯＣＲ装置によって実行されてもよい。 On the other hand, for example, when the image of the driver's license 212 of FIG. 2 is input (step 311), the conventional OCR device determines the position of the name field 202 in the input image (step 312), and the name field 202 is displayed. The printed character is recognized (step 313). The type character recognition processing (steps 301 to 303) and the handwritten character recognition processing (steps 311 to 312) may be executed by one OCR device, or may be executed by different OCR devices.

その後、従来のＯＣＲ装置は、ステップ３０３における認識結果と、ステップ３１３における認識結果とを照合し（ステップ３０４）、両者が一致するか否かを判定する（ステップ３０５）。両者が一致する場合、本人確認に成功したため、申請書は受理され、申請書に基づく申請処理が実行される。この申請処理は従来と同様であるため説明を省略する。一方、両者が一致しない場合、文字認識による本人確認に失敗したため、別の方法による本人確認（例えば窓口オペレータの人手による本人確認等）が実行される。 Thereafter, the conventional OCR device collates the recognition result in step 303 with the recognition result in step 313 (step 304), and determines whether or not they match (step 305). If the two match, the identity verification has succeeded and the application is accepted and application processing based on the application is executed. Since this application process is the same as the conventional process, a description thereof will be omitted. On the other hand, if the two do not match, identity verification by character recognition has failed, and identity verification by another method (for example, identity verification by the hand of the window operator, etc.) is executed.

上記のステップ３１２における氏名欄の判定は、例えば次のように行われる。ＯＣＲ装置は、予め、配置知識辞書、項目名辞書及びデータ文字列パターン辞書を保持する。これらは、本発明の実施形態の配置知識辞書１６４、項目名辞書１６５及びデータ文字列パターン辞書１６６と同様のものであってよい。ＯＣＲ装置は、項目名辞書に基づいて本人確認書類上の文字列の項目名（例えば「氏名」のような項目名を示す文字列）らしさを判定し、データ文字列パターン辞書１６６に基づいて本人確認書類上の文字列の項目値（例えば「田中」のような氏名を示す文字列）らしさを判定し、項目名らしいと判定された文字列と、項目値らしいと判定された文字列との位置関係が配置知識辞書に登録された位置関係と一致する場合に、その領域を氏名欄２０２として抽出する。 The determination of the name column in step 312 is performed as follows, for example. The OCR device holds an arrangement knowledge dictionary, an item name dictionary, and a data character string pattern dictionary in advance. These may be the same as the arrangement knowledge dictionary 164, the item name dictionary 165, and the data character string pattern dictionary 166 of the embodiment of the present invention. The OCR device determines the likelihood of the item name of the character string on the identity verification document based on the item name dictionary (for example, a character string indicating an item name such as “name”), and based on the data character string pattern dictionary 166 Determine the likelihood of the item value of the character string on the confirmation document (for example, a character string indicating a name such as “Tanaka”), and the character string determined to be the item name and the character string determined to be the item value When the positional relationship matches the positional relationship registered in the arrangement knowledge dictionary, the area is extracted as the name field 202.

しかし、この方法によれば、予めデータ文字列パターン辞書に登録されていない氏名が入力されたことによって氏名欄２０２の抽出に失敗する場合がある。氏名欄２０２及び２０１の抽出にいずれも成功したとしても、記載された氏名がデータ文字列パターン辞書に登録されていなければ、その後の文字認識に失敗する可能性が高い。 However, according to this method, extraction of the name field 202 may fail due to the input of a name that is not registered in advance in the data character string pattern dictionary. Even if both of the name columns 202 and 201 are successfully extracted, if the written name is not registered in the data character string pattern dictionary, there is a high possibility that subsequent character recognition will fail.

また、現在の文字認識技術による活字文字の認識精度は実用上十分な程度に高いが、それと比較して手書き文字認識の精度は著しく低い。このため、本来は正しいはずの申請書及び本人確認書類が入力され（すなわち氏名欄２０１と氏名欄２０２に同じ氏名が記載され）、かつ、記載された氏名が辞書に登録されていた場合であっても、ステップ３０５において両者が一致しないと判定される頻度が高くなる。 In addition, although the accuracy of recognition of type characters by the current character recognition technology is high enough for practical use, the accuracy of recognition of handwritten characters is significantly lower than that. For this reason, the application and identity verification document that should have been correct are input (that is, the same name is written in the name field 201 and the name field 202), and the written name is registered in the dictionary. However, the frequency at which it is determined in step 305 that the two do not match increases.

上記のようにＯＣＲ装置による本人確認の失敗が多発すれば、オペレータの作業負担が十分に軽減されず、したがって、窓口業務を十分に効率化することができない。 As described above, if the OCR device frequently fails to verify the identity, the operator's workload is not sufficiently reduced, and therefore the window service cannot be made sufficiently efficient.

図４は、本発明の実施形態の窓口受付システム１０が実行する文字認識処理のフローチャートである。 FIG. 4 is a flowchart of character recognition processing executed by the window reception system 10 according to the embodiment of this invention.

最初に、窓口受付システム１０の画像入力装置１１１を用いて申請書及び本人確認書類が入力される（ステップ４０１）。例えば、申請書２１１及び運転免許証２１２（すなわち本人確認書類）が画像入力装置１１１によって読み取られ、図２に示す画像２００が取得される。 First, an application form and an identification document are input using the image input device 111 of the window reception system 10 (step 401). For example, the application form 211 and the driver's license 212 (that is, the identity confirmation document) are read by the image input device 111, and the image 200 shown in FIG.

次に、窓口受付システム１０は、本人確認書類の項目領域を抽出する（ステップ４０２）。項目領域とは、住所、氏名又は生年月日等の項目の値が記載された領域である。ここでは氏名の認識を例として説明するため、少なくとも氏名欄２０２が項目領域として抽出される。この抽出は、情報保持手段１６に保持された配置知識辞書１６４及び項目名辞書１６５に基づいて行われる。 Next, the window reception system 10 extracts the item area of the personal identification document (step 402). The item area is an area in which the value of an item such as an address, name, or date of birth is described. Here, in order to explain the recognition of the name as an example, at least the name column 202 is extracted as an item area. This extraction is performed based on the arrangement knowledge dictionary 164 and the item name dictionary 165 held in the information holding unit 16.

項目名辞書１６５には、各項目領域に対応して表示される項目名が格納される。例えば、運転免許証２１２の氏名欄２０２の近傍には、項目名として「氏名」の文字が表示されている。この表示は、氏名欄２０２の位置を特定するための標識（ラベル）として使用される。この場合、項目名辞書１６５には、「氏名」の文字列が格納される。 The item name dictionary 165 stores item names displayed corresponding to the respective item areas. For example, in the vicinity of the name field 202 of the driver's license 212, the characters “name” are displayed as the item name. This display is used as a sign (label) for specifying the position of the name field 202. In this case, the character string “name” is stored in the item name dictionary 165.

本人確認書類として運転免許証以外の書類が使用されてもよく、それらの書類では、氏名欄の項目名として「氏名」以外の文字列、例えば「Ｎａｍｅ」等が表示される場合もある。また、後述するように、項目名辞書１６５は、申請書における項目領域（例えば申請書２１１の氏名欄２０１）の位置を特定するためにも使用される。申請書に項目名として表示される文字列は、本人確認書類のそれと異なる場合がある。例えば、申請書に「お名前」の文字列が印刷され、その近傍に申請者の氏名を手書きする領域が確保されている場合がある。また、窓口受付システム１０が複数の種類の申請書を処理する場合があり、申請書の種類によって項目名として表示される文字列が異なる場合もある。このため、項目名辞書１６５には、氏名欄の項目名として、「氏名」以外の文字列、例えば「お名前」及び「Ｎａｍｅ」等がさらに格納されてもよい。 Documents other than the driver's license may be used as the identity verification document, and in those documents, a character string other than “name”, for example, “Name” may be displayed as the item name in the name column. As will be described later, the item name dictionary 165 is also used to specify the position of an item area (for example, the name field 201 of the application 211) in the application. The character string displayed as the item name on the application form may differ from that of the identity verification document. For example, a character string of “name” may be printed on the application form, and an area for handwriting the name of the applicant may be secured in the vicinity thereof. Further, the window reception system 10 may process a plurality of types of application forms, and the character string displayed as the item name may differ depending on the type of application form. For this reason, the item name dictionary 165 may further store character strings other than “name” such as “name” and “Name” as item names in the name field.

氏名欄２０２以外の項目領域についても、項目名の文字列が、各項目領域について一つ以上格納される。項目名辞書１６５の一例については図５を参照して後述する。 For item areas other than the name field 202, one or more character strings of the item names are stored for each item area. An example of the item name dictionary 165 will be described later with reference to FIG.

配置知識辞書１６４には、各項目領域の配置を示す情報、より具体的には、各項目名の表示と、各項目名に対応する項目領域との位置関係を重み付けする情報が格納される。例えば、運転免許証２１２に表示された項目名「氏名」の右側の所定の範囲が氏名欄２０２である。この場合、配置知識辞書１６４には、文字列「氏名」の表示と氏名欄２０２との位置関係を示す情報が格納される。上記の項目名辞書１６５の場合と同様、配置知識辞書１６４にも、種々の本人確認書類及び種々の申請書における項目名の表示と項目領域との位置関係を示す情報が格納される。 The arrangement knowledge dictionary 164 stores information indicating the arrangement of each item area, more specifically, information for weighting the positional relationship between the display of each item name and the item area corresponding to each item name. For example, a predetermined range on the right side of the item name “name” displayed on the driver's license 212 is the name field 202. In this case, the arrangement knowledge dictionary 164 stores information indicating the positional relationship between the display of the character string “name” and the name column 202. As in the case of the item name dictionary 165 described above, the arrangement knowledge dictionary 164 stores information indicating the positional relationship between the item name display and the item area in various identity verification documents and various application forms.

窓口受付システム１０は、これらの辞書に基づいて、氏名欄２０２等の項目領域を抽出する。例えば、窓口受付システム１０は、項目名辞書１６５に基づいて本人確認書類上の文字列の項目名らしさを判定し、項目名らしいと判定された文字列との位置関係が配置知識辞書１６４に登録された位置関係と一致する領域を項目領域として抽出する。 The window reception system 10 extracts item areas such as the name field 202 based on these dictionaries. For example, the window reception system 10 determines the likelihood of the item name of the character string on the identification document based on the item name dictionary 165 and registers the positional relationship with the character string determined to be the item name in the arrangement knowledge dictionary 164. An area matching the determined positional relationship is extracted as an item area.

次に、窓口受付システム１０は、ステップ４０２において抽出された項目領域に記載された住所、氏名及び生年月日等の活字文字を認識する（ステップ４０３）。この手順は従来の文字認識と同様であるため、詳細な説明は省略する。 Next, the window reception system 10 recognizes the printed characters such as the address, name and date of birth described in the item area extracted in Step 402 (Step 403). Since this procedure is the same as the conventional character recognition, a detailed description is omitted.

次に、窓口受付システム１０は、ステップ４０３における活字文字の認識結果を用いて、手書き文字認識用の候補文字列パターン辞書を作成する（ステップ４０４）。窓口受付システム１０は、予めデータ文字列パターン辞書１６６を保持していない場合には、作成された候補文字列パターン辞書をデータ文字列パターン辞書１６６として使用し、予めデータ文字列パターン辞書１６６を保持していた場合には、作成された候補文字列パターン辞書を用いてデータ文字列パターン辞書１６６を更新する。これらの処理については後述する（図７及び図８等参照）。 Next, the window reception system 10 creates a candidate character string pattern dictionary for handwritten character recognition using the recognition result of the printed character in step 403 (step 404). When the window reception system 10 does not hold the data character string pattern dictionary 166 in advance, it uses the created candidate character string pattern dictionary as the data character string pattern dictionary 166 and holds the data character string pattern dictionary 166 in advance. If so, the data character string pattern dictionary 166 is updated using the created candidate character string pattern dictionary. These processes will be described later (see FIGS. 7 and 8).

次に、窓口受付システム１０は、申請書において手書き文字が記載された項目領域を抽出する（ステップ４０５）。ここで抽出される項目領域は、申請書における住所、氏名又は生年月日等が記載された領域であり、申請者による手書き文字が記載されている。この抽出は、ステップ４０２における抽出と同様に、配置知識辞書１６４及び項目名辞書１６５を参照して行われる。 Next, the window reception system 10 extracts item areas in which handwritten characters are described in the application form (step 405). The item area extracted here is an area in which an address, name, date of birth, or the like in the application is described, and handwritten characters by the applicant are described. This extraction is performed with reference to the arrangement knowledge dictionary 164 and the item name dictionary 165, similarly to the extraction in step 402.

次に、窓口受付システム１０は、ステップ４０５において抽出された項目領域に記載された住所、氏名及び生年月日等の手書き文字を認識する（ステップ４０６）。この手順は、ステップ４０４において更新又は作成されたデータ文字列パターン辞書１６６を参照する点を除いて従来の文字認識と同様であるため、その詳細な説明は省略する。すなわち、辞書の参照方法及びそれに基づく文字認識の方法は従来と同様である。 Next, the window reception system 10 recognizes handwritten characters such as an address, name and date of birth described in the item area extracted in step 405 (step 406). Since this procedure is the same as the conventional character recognition except that the data character string pattern dictionary 166 updated or created in step 404 is referred to, detailed description thereof will be omitted. That is, the dictionary reference method and the character recognition method based thereon are the same as in the prior art.

次に、窓口受付システム１０は、認識した結果を出力して（ステップ４０７）、処理を終了する。例えば、窓口受付システム１０は、氏名欄２０１の文字認識（ステップ４０３）によって得られた文字列そのものを表示装置１２に表示してもよいし、その文字列と、氏名欄２０２の文字認識（ステップ４０６）によって得られた文字列とが一致するか否かを判定し、その結果を表示装置１２に表示してもよい。具体的には、例えば、両者が一致しないと判定された場合、本人確認に失敗したことを示すアラートを表示装置１２に表示してもよい。 Next, the window reception system 10 outputs the recognized result (step 407) and ends the process. For example, the window reception system 10 may display the character string itself obtained by the character recognition (step 403) in the name field 201 on the display device 12, or the character recognition (step 403) in the name field 202. It may be determined whether or not the character string obtained in step 406) matches, and the result may be displayed on the display device 12. Specifically, for example, when it is determined that the two do not match, an alert indicating that the identity verification has failed may be displayed on the display device 12.

なお、以上の処理は、詳細には、窓口受付システム１０のＣＰＵ１３がワークエリア１５内のプログラムに従って実行する。例えば、ステップ４０２、４０４及び４０５が文書処理プログラム１５３に従って実行され、ステップ４０３及び４０６が文字認識プログラムに従って実行される。 More specifically, the above processing is executed by the CPU 13 of the window reception system 10 according to a program in the work area 15. For example, steps 402, 404 and 405 are executed according to the document processing program 153, and steps 403 and 406 are executed according to the character recognition program.

図５は、本発明の実施形態の項目名辞書１６５の説明図である。 FIG. 5 is an explanatory diagram of the item name dictionary 165 according to the embodiment of this invention.

項目名辞書１６５は、本人確認書類及び申請書の項目名の文字列を定義する情報を含む。例えば、図５に示す項目名辞書１６５は、「Ｓｕｒｎａｍｅ」５０１、「氏名」５０２、「住所」５０３、「お名前」５０４及び「ご住所」５０５を含む。「Ｓｕｒｎａｍｅ」５０１は、本人確認書類がパスポートである場合の、氏名（姓）が記載された項目領域のラベルの例である。「氏名」５０２は、本人確認書類が運転免許証である場合の、氏名が記載された項目領域のラベルの例である。「住所」５０３は、本人確認書類が運転免許証である場合の、住所が記載された項目領域のラベルの例である。「お名前」５０４は、申請書における、氏名が手書きされる項目領域のラベルの例である。「ご住所」５０５は、申請書における、住所が手書きされる項目領域のラベルの例である。 The item name dictionary 165 includes information that defines character strings of item names of the personal identification document and the application form. For example, the item name dictionary 165 illustrated in FIG. 5 includes “Surname” 501, “Name” 502, “Address” 503, “Name” 504, and “Address” 505. “Surname” 501 is an example of a label of an item area in which a name (last name) is described when the identification document is a passport. “Name” 502 is an example of an item area label in which the name is written when the identification document is a driver's license. “Address” 503 is an example of a label of an item area in which an address is described when the identification document is a driver's license. “Name” 504 is an example of the label of the item area in which the name is handwritten in the application form. “Address” 505 is an example of an item area label in which an address is handwritten in the application form.

図６は、本発明の実施形態のデータ文字列パターン辞書１６６の説明図である。 FIG. 6 is an explanatory diagram of the data character string pattern dictionary 166 according to the embodiment of this invention.

データ文字列パターン辞書１６６は、情報保持手段１６に格納され、必要に応じてその一部又は全部がワークエリア１５にロードされてもよい。 The data character string pattern dictionary 166 may be stored in the information holding unit 16 and part or all of the data character string pattern dictionary 166 may be loaded into the work area 15 as necessary.

データ文字列パターン辞書１６６には、手書き文字列に含まれる可能性がある文字列が登録される。例えば、氏名の文字列の認識に使用されるデータ文字列パターン辞書１６６には、氏名として使用される可能性がある文字列、例えば、姓として使用される文字列＜ＮＡＭＥ１＞として「田中」、「鈴木」、「佐藤」及び「山田」等が、名として使用される文字列＜ＮＡＭＥ２＞として「一郎」、「太郎」、「花子」及び「よし子」等が登録される。さらに、上記の姓の文字列＜ＮＡＭＥ１＞と、その後に続く名の文字列＜ＮＡＭＥ２＞とからなる文字列＜Ｎ４００＞が登録される。 In the data character string pattern dictionary 166, character strings that may be included in the handwritten character string are registered. For example, in the data character string pattern dictionary 166 used for recognition of the character string of the full name, a character string that may be used as the full name, for example, “Tanaka” as a character string <NAME1> used as the surname, “Ichiro”, “Taro”, “Hanako”, “Yoshiko” and the like are registered as character strings <NAME2> used as names such as “Suzuki”, “Sato”, and “Yamada”. Furthermore, a character string <N400> consisting of the above-mentioned surname character string <NAME1> and the subsequent name character string <NAME2> is registered.

同様に、例えば住所の文字列の認識に使用されるデータ文字列パターン辞書１６６には、全国の地名の文字列等が登録される。 Similarly, for example, character strings of place names nationwide are registered in the data character string pattern dictionary 166 used for recognition of address character strings.

後述するように、データ文字列パターン辞書１６６は、必要に応じてワークエリア１５にロードされる。本実施形態では、同一の文字を重複して保持することを避けるため、ロードされたデータ文字列パターン辞書１６６はグラフ形式で表現される。すなわち、データ文字列パターン辞書１６６に登録された各文字がノードとして、文字と文字との接続がエッジとして表現される。例えば、姓の文字列「田中」がデータ文字列パターン辞書１６６に登録されている場合、この文字列は、文字「田」に対応するノード、文字「中」に対応するノード、及びそれらを接続するエッジによって表現される。 As will be described later, the data character string pattern dictionary 166 is loaded into the work area 15 as necessary. In the present embodiment, the loaded data character string pattern dictionary 166 is expressed in a graph format in order to avoid holding the same character repeatedly. That is, each character registered in the data character string pattern dictionary 166 is represented as a node, and the connection between the characters is represented as an edge. For example, when the character string “Tanaka” of the surname is registered in the data character string pattern dictionary 166, the character string includes a node corresponding to the character “Ta”, a node corresponding to the character “Middle”, and connecting them. Expressed by the edge to be.

図６では省略されているが、データ文字列パターン辞書１６６は、文字と文字とが接続される可能性を示す情報を含む。この情報は、例えば、エッジに与えられるスコアであり、例えば新聞の書面等、実際に流通しているテキスト等から予め抽出される。 Although omitted in FIG. 6, the data character string pattern dictionary 166 includes information indicating the possibility of connection between characters. This information is, for example, a score given to the edge, and is extracted in advance from text that is actually distributed, such as a newspaper document.

スコアは、各エッジに対応する文字の接続の起こりやすさ（言い換えると尤もらしさ）を示す重みである。本実施形態において、スコアの値が大きいほど、そのスコアが与えられたエッジに対応する文字の接続が起こりやすいことを示し、スコアの値が「０」であることは、そのエッジに対応する文字の接続が起こり得ないことを示す。 The score is a weight indicating the likelihood of connection of characters corresponding to each edge (in other words, likelihood). In the present embodiment, the larger the score value, the easier connection of the character corresponding to the edge to which the score is given, and the score value “0” means that the character corresponding to the edge Indicates that no connection can occur.

例えば、データ文字列パターン辞書１６６に登録されているように、文字列「田中」及び「鈴木」のような文字列は出現し得るが、文字列「田鈴」は出現し得ない場合、文字「田」と「中」とを接続するエッジ及び「鈴」と「木」とを接続するエッジには、スコアとして「０」より大きい値が与えられ、文字「田」と「鈴」とを接続するエッジには、スコアとして「０」が与えられる。後述するように、これらのスコアに基づいて、最も尤もらしい文字列が手書き文字認識結果として取得される。 For example, as registered in the data character string pattern dictionary 166, character strings such as the character strings “Tanaka” and “Suzuki” can appear, but the character string “Tauzu” cannot appear. A value larger than “0” is given as a score to the edge connecting “field” and “middle” and the edge connecting “bell” and “tree”, and the characters “field” and “bell” are assigned. The edge to be connected is given “0” as a score. As will be described later, based on these scores, the most likely character string is acquired as a handwritten character recognition result.

なお、本実施形態において、ある二つの文字を接続するエッジが辞書に登録されていないことは、そのエッジを介して接続された文字列が登録されていないことを意味する。また、そのエッジが登録されていないことは、そのエッジが登録され、かつ、それのスコアが０であることと等価である。 In the present embodiment, the fact that an edge connecting two characters is not registered in the dictionary means that a character string connected via the edge is not registered. The fact that the edge is not registered is equivalent to the fact that the edge is registered and its score is zero.

上記のような辞書の構造及びそれを用いた文字認識方法は従来と同様であってよい。また、上記のようなスコアの決定方法は一例であり、スコアと接続の起こりやすさとが対応するように決定される限り、上記とは異なる規則に従ってスコアが決定されてもよい。 The structure of the dictionary as described above and the character recognition method using the same may be the same as the conventional one. The score determination method as described above is an example, and the score may be determined according to a rule different from the above as long as the score is determined so that the connection is likely to occur.

ただし、例えば氏名の文字列が登録されたデータ文字列パターン辞書１６６に、入力される可能性のある全ての氏名の文字列（すなわち文字及び文字間の接続）が登録されているとは限らない。従来のように、予め用意されたデータ文字列パターン辞書１６６のみを使用して文字認識を行う場合、その辞書に登録されていない氏名が入力されたときの認識精度は著しく低くなる。このようなことを防ぐためには新たな文字列の追加登録などを行うことも考えられるが、その場合、メンテナンスのコストが上昇する。 However, for example, not all character strings (namely, connections between characters) that may be inputted are registered in the data character string pattern dictionary 166 in which character strings of names are registered. . When performing character recognition using only the data character string pattern dictionary 166 prepared in advance as in the prior art, the recognition accuracy when a name that is not registered in the dictionary is input is significantly reduced. In order to prevent such a situation, it may be possible to perform additional registration of a new character string, but in this case, the cost of maintenance increases.

本実施形態では、メンテナンスのコストを上昇させずに手書き文字の認識精度を改善するために、本人確認書類に記載された活字文字の認識結果がデータ文字列パターン辞書１６６に追加される。以下、その具体的な手順を説明する。 In this embodiment, in order to improve the recognition accuracy of handwritten characters without increasing the maintenance cost, the recognition result of the printed characters described in the personal identification document is added to the data character string pattern dictionary 166. The specific procedure will be described below.

図７は、本発明の実施形態の窓口受付システム１０が実行するデータ文字列パターン辞書の動的追加処理のフローチャートである。 FIG. 7 is a flowchart of the dynamic addition process of the data character string pattern dictionary executed by the window reception system 10 according to the embodiment of this invention.

図７に示す処理は、図４のステップ４０４において実行される。ここでは、例として、申請書２１１の氏名欄２０１に氏名の手書き文字列「渡邊太朗」が記載され、運転免許証２１２の氏名欄２０２に氏名の活字文字列「渡邊太朗」が記載されている場合について説明する。 The process shown in FIG. 7 is executed in step 404 of FIG. Here, as an example, the handwritten character string “Taro Watanabe” is written in the name field 201 of the application form 211, and the type character string “Taro Watanabe” is written in the name field 202 of the driver's license 212. The case will be described.

最初に、窓口受付システム１０は、予め定義されたデータ文字列パターン辞書７０１をワークエリア１５にロードする。これは、図４のステップ４０３が実行される前のデータ文字列パターン辞書１６６に相当する。図６を参照して説明したように、ロードされたデータ文字列パターン辞書７０１は、グラフ形式で表現される。その具体例については後述する。 First, the window reception system 10 loads a predefined data character string pattern dictionary 701 into the work area 15. This corresponds to the data character string pattern dictionary 166 before step 403 of FIG. 4 is executed. As described with reference to FIG. 6, the loaded data character string pattern dictionary 701 is expressed in a graph format. Specific examples thereof will be described later.

次に、窓口受付システム１０は、図４のステップ４０３における活字文字の認識結果を取得し、それをデータ文字列パターン辞書７０１に追加登録する。 Next, the window reception system 10 acquires the recognition result of the printed character in step 403 in FIG. 4 and additionally registers it in the data character string pattern dictionary 701.

ここで、図７に示すデータ文字列パターン辞書７０１の例を説明する。ロードされた時点のデータ文字列パターン辞書７０１には「渡」（文字７２１Ａ）、「辺」（文字７２１Ｂ）、「太」（文字７２１Ｃ）、「朗」（文字７２１Ｄ）、「井」（文字７２１Ｅ）及び「次」（文字７２１Ｆ）がノードとして登録され、さらに、「渡」と「辺」、「辺」と「太」、「太」と「朗」、「渡」と「井」、「井」と「太」、「辺」と「次」、及び「次」と「朗」を接続する、スコアが０より大きいエッジが含まれる。これは、「渡辺太朗」、「渡辺次朗」及び「渡井太朗」の文字列が登録されていることを意味する。 Here, an example of the data character string pattern dictionary 701 shown in FIG. 7 will be described. The data character string pattern dictionary 701 at the time of loading includes “pass” (character 721A), “side” (character 721B), “thick” (character 721C), “ro” (character 721D), “well” (character 721E) and “next” (character 721F) are registered as nodes, and “Wat” and “Side”, “Side” and “Thick”, “Tat” and “Akira”, “Wat” and “I”, Edges with a score greater than 0 are included that connect “well” and “thick”, “side” and “next”, and “next” and “ro”. This means that the character strings “Taro Watanabe”, “Jiro Watanabe”, and “Taro Watai” are registered.

一方、図７の例では、ステップ４０３の認識結果として、氏名の文字列「渡邊太朗」が取得される。このような文字列はデータ文字列パターン辞書７０１に登録されていないため、手書き文字として「渡邊太朗」が入力された場合、その認識は（正しい手書き文字が入力されたとしても）失敗する可能性が高い。このため、窓口受付システム１０は、ステップ４０３において取得された文字列「渡邊太朗」をデータ文字列パターン辞書７０１に追加する。具体的には、窓口受付システム１０は、「邊」（文字７１２）をノードとして追加し、さらに、「渡」と「邊」とを接続するエッジ７１３Ａ、及び、「邊」と「太」とを接続するエッジ７１３Ｂを追加し、それらのエッジのスコアとして０より大きい値を登録する。 On the other hand, in the example of FIG. 7, the character string “Taro Watanabe” is acquired as the recognition result in step 403. Since such a character string is not registered in the data character string pattern dictionary 701, when “Taro Watanabe” is input as a handwritten character, the recognition may fail (even if a correct handwritten character is input). Is expensive. Therefore, the window reception system 10 adds the character string “Taro Watanabe” acquired in step 403 to the data character string pattern dictionary 701. Specifically, the window reception system 10 adds “邊” (character 712) as a node, and further, an edge 713A connecting “pass” and “邊”, and “邊” and “thick”. Are added, and a value greater than 0 is registered as the score of those edges.

一方、ステップ４０３で取得された文字列が既にデータ文字列パターン辞書１６６に登録されている場合もある。例えば、「渡邊太朗」が取得された場合、これに含まれる文字列「太朗」に対応するエッジ、すなわち「太」と「朗」とを接続するエッジ７１３Ｃは、既に登録されている。この場合、窓口受付システム１０は、エッジ７１３Ｃのスコアを増加させる。 On the other hand, the character string acquired in step 403 may already be registered in the data character string pattern dictionary 166. For example, when “Taro Watanabe” is acquired, the edge corresponding to the character string “Taro” included therein, that is, the edge 713C connecting “Tai” and “Roh” has already been registered. In this case, the window reception system 10 increases the score of the edge 713C.

なお、このようにして新たに追加されたエッジのスコアの値及び既存のスコアの増分は、０より大きい限り、任意に決定することができるが、スコアの値が大きいほど、そのエッジに対応する文字列が認識結果として取得され易くなる。例えば、「渡邊太朗」に対応するエッジのスコアが大きいほど、手書き文字列「渡邊太朗」が「渡邊太朗」として認識されないという誤りが発生する可能性は低くなる。したがって、このような誤りを減らすために、ステップ４０３において取得された文字列に対応するエッジのスコアがそれ以外のエッジのスコアより大きくなるように、スコアの値を決定してもよい。ただし、「渡邊太朗」に対応するエッジのスコアが大きいほど、「渡邊太朗」以外の手書き文字列が「渡邊太朗」として認識されるという誤りが発生する可能性は高くなる。 In addition, the score value of the edge newly added in this way and the increment of the existing score can be arbitrarily determined as long as it is larger than 0. However, the larger the score value, the more the edge corresponds. A character string is easily acquired as a recognition result. For example, the greater the score of the edge corresponding to “Taro Watanabe”, the lower the possibility that an error that the handwritten character string “Taro Watanabe” is not recognized as “Taro Watanabe” will occur. Therefore, in order to reduce such an error, the score value may be determined so that the score of the edge corresponding to the character string acquired in step 403 is larger than the score of the other edges. However, the greater the score of the edge corresponding to “Taro Watanabe”, the higher the possibility of an error that a handwritten character string other than “Taro Watanabe” is recognized as “Taro Watanabe”.

窓口受付システム１０は、上記のようにして情報が追加されたデータ文字列パターン辞書７０２を用いて、手書き文字の認識を行う（ステップ７３２）。データ文字列パターン辞書７０２は、図４のステップ４０３が実行された後のデータ文字列パターン辞書１６６に相当し、ステップ７３２は図４のステップ４０６に相当する。この文字認識は従来と同様の方法によって実行できるため、その詳細な説明は省略するが、概要を説明すれば次の通りである。 The window reception system 10 recognizes handwritten characters using the data character string pattern dictionary 702 to which information has been added as described above (step 732). The data character string pattern dictionary 702 corresponds to the data character string pattern dictionary 166 after step 403 in FIG. 4 is executed, and step 732 corresponds to step 406 in FIG. Since this character recognition can be executed by the same method as in the prior art, a detailed description thereof will be omitted, but the outline will be described as follows.

窓口受付システム１０は、まず、手書き文字認識用辞書１６２に基づいて、入力された手書き文字列を含む画像から、それを構成する各文字を切り出す。通常は、切り出された文字として複数の候補が挙げられる。例えば、手書き文字列「渡邊」を含む画像が入力された場合、そこから「渡」及び「邊」を切り出すこともできるが、「三」、「度」及び「邊」を切り出すこともできる。このような「渡邊」及び「三度邊」のような複数の候補が、文字切り出し・識別仮説ネット７４１として作成される。文字切り出し・識別仮説ネット７４１のエッジ（例えば、「渡」と「邊」とを接続するエッジ、「三」と「度」とを接続するエッジ及び「度」と「邊」とを接続するエッジ）についても、切り出す位置の尤もらしさ及び切り出された画像パターンの文字らしさ等に応じたスコアが算出される。 First, the window reception system 10 cuts out each character constituting the image from the input handwritten character string based on the handwritten character recognition dictionary 162. Usually, a plurality of candidates are listed as the extracted characters. For example, when an image including the handwritten character string “Watana” is input, “Wa” and “H” can be cut out therefrom, but “three”, “degree”, and “H” can also be cut out. A plurality of candidates such as “Watanabe” and “Third degree” are created as a character segmentation / identification hypothesis net 741. Edges of the character segmentation / identification hypothesis net 741 (for example, an edge connecting “pass” and “邊”, an edge connecting “three” and “degree”, and an edge connecting “degree” and “邊” ), A score corresponding to the likelihood of the position to be cut out and the character likeness of the cut out image pattern is calculated.

そして、窓口受付システム１０は、情報が追加されたデータ文字列パターン辞書７０２のグラフによって手書き文字の文字切り出し・識別仮説ネット７４１を検索する（ステップ７３２）。この検索は、例えばＤＰ（Dynamic Programming）マッチングのような、公知の手法によって行うことができる。 Then, the window reception system 10 searches the character segmentation / identification hypothesis net 741 of the handwritten character by using the graph of the data character string pattern dictionary 702 to which the information has been added (step 732). This search can be performed by a known method such as DP (Dynamic Programming) matching.

例えば、文字切り出し・識別仮説ネット７４１には文字列「三度邊」が含まれるが、この文字列はデータ文字列パターン辞書７０２に登録されていないため、検索結果としては取得されない。一方、文字列「渡邊」はデータ文字列パターン辞書７０２に登録されているため、検索結果として取得される可能性がある。窓口受付システム１０は、文字切り出し・識別仮説ネット７４１のエッジに与えられたスコアと、データ文字列パターン辞書７０２のエッジに与えられたスコアとに基づいて、文字列「渡邊太朗」が最も尤もらしいと判定した場合、この文字列を手書き文字の認識結果として出力する（ステップ７３３）。このとき、手書き文字列の認識結果と活字文字列の認識結果とが一致するか否かを示す情報を出力してもよい。 For example, the character segmentation / identification hypothesis net 741 includes the character string “Third-throw”, but since this character string is not registered in the data character string pattern dictionary 702, it is not acquired as a search result. On the other hand, since the character string “Watanabe” is registered in the data character string pattern dictionary 702, it may be acquired as a search result. In the window reception system 10, the character string “Taro Watanabe” is most likely based on the score given to the edge of the character segmentation / identification hypothesis net 741 and the score given to the edge of the data character string pattern dictionary 702. If it is determined, the character string is output as a handwritten character recognition result (step 733). At this time, information indicating whether or not the recognition result of the handwritten character string matches the recognition result of the printed character string may be output.

このように、図７に示した方法によれば、予め用意されたデータ文字列パターン辞書に、活字文字の認識結果から作成された辞書情報が追加され、その辞書が文字認識に使用される。活字文字の認識結果として得られた文字列がデータ文字列パターン辞書に登録されていなかった場合には、その文字列を新たに登録することによって、入力された手書き文字列が活字文字列と同一の文字列として認識されやすくなる。一方、活字文字の認識結果として得られた文字列がデータ文字列パターン辞書に既に登録されていた場合には、その文字列に対応するスコアを増加させることによって、入力された手書き文字列が活字文字列と同一の文字列として認識されやすくなる。このように、手書き文字の認識精度が向上するため、本人確認に失敗する頻度も減少し、これによって窓口業務の効率が改善するとともに、窓口オペレータによる不正な処理も防止される。 As described above, according to the method shown in FIG. 7, dictionary information created from the recognition result of type characters is added to a data character string pattern dictionary prepared in advance, and the dictionary is used for character recognition. If the character string obtained as a result of type character recognition is not registered in the data character string pattern dictionary, the input handwritten character string is the same as the type character string by registering the character string anew. It becomes easy to be recognized as a character string. On the other hand, if the character string obtained as a result of recognition of type characters has already been registered in the data character string pattern dictionary, the input handwritten character string is typed by increasing the score corresponding to the character string. It becomes easy to be recognized as the same character string as the character string. Thus, since the recognition accuracy of handwritten characters is improved, the frequency of failure of identity verification is also reduced, thereby improving the efficiency of the counter work and preventing unauthorized processing by the counter operator.

上記の例では、予め用意されたデータ文字列パターン辞書１６６に、活字文字の認識結果に基づく情報が追加されたが、データ文字列パターン辞書１６６が予め用意されていない場合にも、手書き文字の認識を活字文字の認識と連携して行うことによって、手書き文字の認識精度を改善することができる。以下、これについて説明する。 In the above example, information based on the recognition result of type characters is added to the data character string pattern dictionary 166 prepared in advance. However, even if the data character string pattern dictionary 166 is not prepared in advance, The recognition accuracy of handwritten characters can be improved by performing recognition in conjunction with recognition of type characters. This will be described below.

図８は、本発明の実施形態の窓口受付システム１０が実行する、データ文字列パターン辞書を予め保持しない場合の手書き文字認識処理のフローチャートである。 FIG. 8 is a flowchart of handwritten character recognition processing executed when the window reception system 10 according to the embodiment of the present invention does not hold a data character string pattern dictionary in advance.

ここでは、例として、手書き文字列及び活字文字列として「桜井太郎」が入力された場合について説明する。 Here, as an example, a case where “Taro Sakurai” is input as a handwritten character string and a printed character string will be described.

窓口受付システム１０に入力される入力画像８０１は、申請書８０２及び本人確認書類８０３の画像データを含む。申請書８０２は手書きされた氏名の文字列「桜井太郎」を含み、本人確認書類８０３は活字の文字列「桜井太郎」を含む。これらはそれぞれ図２の申請書２１１及び運転免許証２１２に相当する。 An input image 801 input to the window reception system 10 includes image data of an application form 802 and a personal identification document 803. The application form 802 includes a handwritten character string “Taro Sakurai”, and the identification document 803 includes a printed character string “Taro Sakurai”. These correspond to the application form 211 and the driver's license 212 in FIG.

図８では省略されているが、窓口受付システム１０は、図４のステップ４０５と同様の方法によって、氏名の手書き文字列が記載された項目領域を抽出する。そして、窓口受付システム１０は、抽出された項目領域に記載された手書き文字列「桜井太郎」について、図７と同様の方法によって文字切り出し・識別仮説ネット８１１を作成する。図８では、例えば、文字列「桜井」のほかに文字列「木女井」等が、手書き文字の認識結果の候補として抽出される。 Although omitted in FIG. 8, the window reception system 10 extracts an item area in which the handwritten character string of the name is described by the same method as in step 405 in FIG. 4. Then, the window reception system 10 creates a character segmentation / identification hypothesis net 811 for the handwritten character string “Taro Sakurai” described in the extracted item area by the same method as in FIG. In FIG. 8, for example, in addition to the character string “Sakurai”, a character string “Kimai” is extracted as a candidate for a recognition result of handwritten characters.

一方、窓口受付システム１０は、図４のステップ４０２と同様の方法によって、氏名の活字文字列が記載された項目領域を抽出する。そして、窓口受付システム１０は、図４のステップ４０３と同様の方法によって、抽出された項目領域に記載された活字文字列の認識を実行し、その結果として、活字文字列から得られた認識候補の文字列「桜井太郎」８１２を取得する。図８の例では窓口受付システム１０がデータ文字列パターン辞書１６６を予め保持していないため、ここで取得された文字列「桜井太郎」８１２が、次のステップにおいてデータ文字列パターン辞書１６６として参照される。 On the other hand, the window reception system 10 extracts an item area in which a printed character string of a name is described by the same method as in step 402 in FIG. Then, the window reception system 10 executes recognition of the type character string described in the extracted item area by the same method as in Step 403 in FIG. 4, and as a result, the recognition candidate obtained from the type character string The character string “Taro Sakurai” 812 is acquired. In the example of FIG. 8, since the window reception system 10 does not hold the data character string pattern dictionary 166 in advance, the character string “Taro Sakurai” 812 acquired here is referred to as the data character string pattern dictionary 166 in the next step. Is done.

なお、上記の二つのステップが実行される順序は任意であり、両者が並行して実行されてもよい。 The order in which the above two steps are executed is arbitrary, and both may be executed in parallel.

次に、窓口受付システム１０は、活字文字列の認識結果を用いて、文字切り出し・識別仮説ネットを検索する（ステップ８１３）。図８の例では、活字文字列の認識結果「桜井太郎」に対応するパスが文字切り出し・識別仮説ネット８１１に存在し、文字切り出し・識別仮説ネットから抽出された文字列のスコアの合計値が所定の値より大きい場合に、窓口受付システム１０は、「桜井太郎」を手書き文字の認識結果として出力する（ステップ８１４）。このとき、手書き文字列の認識結果と活字文字列の認識結果とが一致するか否かを示す情報を出力してもよい。一方、活字文字列の認識結果に相当するパスが文字切り出し・識別仮説ネットに存在しない場合、窓口受付システム１０は、手書き文字の認識に失敗したことを示す情報を出力してもよい。 Next, the window reception system 10 searches the character segmentation / identification hypothesis net using the recognition result of the printed character string (step 813). In the example of FIG. 8, a path corresponding to the recognition result “Taro Sakurai” of the printed character string exists in the character cutout / identification hypothesis net 811, and the total score of the character strings extracted from the character cutout / identification hypothesis net is When it is larger than the predetermined value, the window reception system 10 outputs “Taro Sakurai” as a recognition result of handwritten characters (step 814). At this time, information indicating whether or not the recognition result of the handwritten character string matches the recognition result of the printed character string may be output. On the other hand, when the path corresponding to the recognition result of the printed character string does not exist in the character cutout / identification hypothesis net, the window reception system 10 may output information indicating that the recognition of the handwritten character has failed.

上記の検索は、図７におけるステップ７３２と同様に実行される。例えば、活字文字列から抽出された「桜井太郎」に対応するパスに含まれる各エッジに０より大きいスコアが与えられ、文字切り出し・識別仮説ネットから抽出された文字列のスコアの合計値が所定の値より大きい場合にその文字列（例えば「桜井太郎」）を認識結果として出力してもよい。 The above search is executed in the same manner as Step 732 in FIG. For example, each edge included in the path corresponding to “Taro Sakurai” extracted from the printed character string is given a score greater than 0, and the total score of the character strings extracted from the character segmentation / identification hypothesis net is predetermined. The character string (for example, “Taro Sakurai”) may be output as the recognition result.

なお、上記の文字切り出し・識別仮説ネット８１１の作成及びステップ８１３は、図４のステップ４０６に相当し、認識候補の文字列８１２の取得及びそれに対応するエッジのコストの決定は、図４のステップ４０４に相当する。 The creation of the character segmentation / identification hypothesis net 811 and the step 813 correspond to the step 406 in FIG. 4. The acquisition of the recognition candidate character string 812 and the determination of the cost of the edge corresponding thereto are performed in the steps of This corresponds to 404.

このように、活字文字の認識結果を用いることによって、予め用意された辞書を用いずに手書き文字の認識精度を改善することができる。 Thus, the recognition accuracy of handwritten characters can be improved without using a dictionary prepared in advance by using the recognition result of printed characters.

なお、図８を参照して説明した文字認識は、図７の処理において、活字文字認識の結果として得られた文字列（図７の例では「渡邊太朗」）に対応するパスに含まれるエッジ以外の全てのエッジのスコアを０にすることと等価である。言い換えると、図８の例では、活字文字の認識結果のみをデータ文字列パターン辞書として用いて手書き文字認識が実行されるのに対して、図７の例では、活字文字の認識結果と、予め用意された辞書とを組み合わせた情報をデータ文字列パターン辞書として用いて手書き文字認識が実行される。 Note that the character recognition described with reference to FIG. 8 is the edge included in the path corresponding to the character string (“Taro Watanabe” in the example of FIG. 7) obtained as a result of the type character recognition in the processing of FIG. This is equivalent to setting the scores of all edges other than 0 to 0. In other words, in the example of FIG. 8, handwritten character recognition is executed using only the recognition result of the printed character as the data character string pattern dictionary, whereas in the example of FIG. Handwritten character recognition is executed using information combined with the prepared dictionary as a data character string pattern dictionary.

ここまでに示した例では、本人確認の際に、入力された手書き文字列と活字文字列とが同一であることが求められる。しかし、実際には、完全な同一性が求められない場合もある。その典型的な例は、漢字の異体字による表記ゆれを許容する場合である。 In the examples shown so far, the input handwritten character string and the typed character string are required to be the same at the time of identity verification. In practice, however, perfect identity may not be required. A typical example is a case where a variation in the notation of kanji variants is allowed.

本人確認書類に記載される漢字は、その書類を作成したシステムにおいて使用可能なものに限られる。このため、氏名の漢字が当該システムにおいて使用できないもの（いわゆる外字）である場合には、氏名の漢字がそれに対応する異体字で置き換えられて本人確認書類に表示され、その結果、実際の氏名の漢字と本人確認書類に記載された漢字とが異なることになる。 Kanji written in the identification document is limited to those that can be used in the system that created the document. For this reason, if the kanji of the name is something that cannot be used in the system (so-called external characters), the kanji of the name will be replaced with the corresponding variant and displayed in the identity verification document. As a result, the actual name The kanji and the kanji described in the identification document will be different.

このような表記ゆれは、異体字が略字として慣用される場合にも発生する。例えば、本来の表記が「渡邊」であるのに、それを筆記するときには略字として「渡辺」を使用する場合などがこれに該当する。システムのポリシーによっては、このような表記ゆれを許容し、本人確認の際に文字列の完全同一を求めない場合もあり得る。 Such notation fluctuation also occurs when a variant character is commonly used as an abbreviation. For example, this may be the case when the original notation is “Watanabe” but “Watanabe” is used as an abbreviation when writing it. Depending on the policy of the system, there may be a case where such a variation in the notation is allowed and the character string is not required to be completely identical at the time of identity verification.

完全な同一性が求められない場合の別の例は、手書き文字列と活字文字列の文字種類が異なる場合である。例えば、本人確認書類としてパスポートが使用され、申請書には漢字を手書きする場合、手書きされた漢字の文字列と、パスポートに印刷されたローマ字の文字列とが照合される。この場合、表記された文字そのものが一致することは要求されないが、漢字の読みとローマ字の読みが一致することは要求される。 Another example of the case where perfect identity is not required is when the character types of the handwritten character string and the printed character string are different. For example, when a passport is used as the identity verification document and handwritten kanji is used in the application form, the handwritten kanji character string is compared with the roman character string printed on the passport. In this case, it is not required that the written characters match, but it is required that the reading of kanji and the reading of romaji match.

上記のような場合、外字・ローマ字辞書１６３が文字認識に使用される。以下、このような場合の文字認識について説明する。 In such a case, the external character / Romaji dictionary 163 is used for character recognition. Hereinafter, character recognition in such a case will be described.

図９は、本発明の実施形態の窓口受付システム１０が実行する、データ文字列パターン辞書を予め保持しない場合の手書き文字認識処理の別の例を示すフローチャートである。 FIG. 9 is a flowchart showing another example of handwritten character recognition processing executed by the window reception system 10 according to the embodiment of the present invention when the data character string pattern dictionary is not held in advance.

ここでは、図８と同様にデータ文字列パターン辞書１６６が予め用意されていない場合において、本人確認書類には図８と同様に活字文字列「桜井太郎」が記載されているが、申請書には手書き文字列「櫻井太郎」が記載されている例を示す。「桜」と「櫻」は、相互に置き換えて慣用される異体字である。以下、図９に示す処理のうち、図８と同様の部分については詳細な説明を省略する。 Here, in the case where the data character string pattern dictionary 166 is not prepared in advance as in FIG. 8, the character string “Taro Sakurai” is described in the identity verification document as in FIG. Indicates an example in which the handwritten character string “Taro Sakurai” is described. "Cherry blossom" and "Samurai" are variant characters that are commonly used interchangeably. Hereinafter, in the processing shown in FIG. 9, detailed description of the same parts as those in FIG. 8 is omitted.

図９に示す入力画像９０１は、申請書９０２及び本人確認書類９０３の画像データを含む。申請書９０２は手書きされた氏名の文字列「櫻井太郎」を含み、本人確認書類９０３は図８の例と同様に活字の文字列「桜井太郎」を含む。これらはそれぞれ図２の申請書２１１及び運転免許証２１２に相当する。 An input image 901 illustrated in FIG. 9 includes image data of the application form 902 and the identity verification document 903. The application form 902 includes a handwritten character string “Taro Sakurai”, and the identity verification document 903 includes a printed character string “Taro Sakurai” as in the example of FIG. These correspond to the application form 211 and the driver's license 212 in FIG.

窓口受付システム１０は、手書き文字列「櫻井太郎」について、図８と同様の方法によって文字切り出し・識別仮説ネット９１１を作成する。図９では、例えば、文字列「櫻井」のほかに文字列「木女井」等が抽出される。 The window reception system 10 creates a character segmentation / identification hypothesis net 911 for the handwritten character string “Taro Sakurai” by the same method as in FIG. In FIG. 9, for example, in addition to the character string “Sakurai”, a character string “Kimoi” and the like are extracted.

さらに、窓口受付システム１０は、認識候補の文字列「桜井太郎」９１２を取得する。これは、図８の例における認識候補の文字列８１２の取得と同様である。 Further, the window reception system 10 acquires the recognition candidate character string “Taro Sakurai” 912. This is the same as the acquisition of the recognition candidate character string 812 in the example of FIG.

次に、窓口受付システム１０は、取得された認識候補の文字列９１２を対象として、外字・ローマ字辞書１６３に基づいて、外字による展開を実行する（ステップ９２１）。外字・ローマ字辞書１６３には、同一であると判定することが許容される（言い換えると、置換することが許容される）複数の文字又は複数の文字列を相互に対応付ける情報が含まれる。例えば、「桜」と「櫻」とが同一であると判定することが許容される場合、外字・ローマ字辞書１６３は、「桜」と「櫻」と対応付ける情報を含む。その結果、認識候補の文字列として、「桜井太郎」の「桜」を、それに対応付けられた文字「櫻」によって置き換えた文字列「櫻井太郎」が追加される。 Next, the window reception system 10 executes expansion using external characters based on the external character / Romaji dictionary 163 for the acquired recognition candidate character string 912 (step 921). The external character / Romaji dictionary 163 includes information that associates a plurality of characters or a plurality of character strings that are allowed to be determined to be the same (in other words, can be replaced). For example, when it is allowed to determine that “sakura” and “櫻” are the same, the external character / Roman dictionary 163 includes information that associates “sakura” with “櫻”. As a result, the character string “Taro Sakurai” in which “Sakura” of “Taro Sakurai” is replaced by the character “櫻” associated therewith is added as the character string of the recognition candidate.

そして、窓口受付システム１０は、展開された活字文字列の認識結果の文字列９２２を用いて、文字切り出し・識別仮説ネット９１１を検索する（ステップ９１３）。図９の例では、認識結果の文字列９２２に文字列「桜井太郎」及び「櫻井太郎」が含まれるが、これらのうち「櫻井太郎」に対応するパスが文字切り出し・識別仮説ネット９１１に存在するため、窓口受付システム１０は、「櫻井太郎」を手書き文字の認識結果として出力する（ステップ９１４）。このとき、手書き文字列の認識結果と活字文字列の認識結果とが一致するか否かを示す情報を出力してもよい。 Then, the window reception system 10 searches the character segmentation / identification hypothesis net 911 using the character string 922 of the recognition result of the developed type character string (step 913). In the example of FIG. 9, the character string 922 of the recognition result includes the character strings “Taro Sakurai” and “Taro Sakurai”. Among these, a path corresponding to “Taro Sakurai” exists in the character segmentation / identification hypothesis net 911. Therefore, the window reception system 10 outputs “Taro Sakurai” as a handwritten character recognition result (step 914). At this time, information indicating whether or not the recognition result of the handwritten character string matches the recognition result of the printed character string may be output.

一方、活字文字列の認識結果の文字列９２２に含まれるいずれの文字列に相当するパスも文字切り出し・識別仮説ネットに存在しない場合、窓口受付システム１０は、手書き文字の認識に失敗したことを示す情報を出力してもよい。 On the other hand, if the path corresponding to any character string included in the character string 922 of the recognition result of the printed character string does not exist in the character cutout / identification hypothesis net, the window reception system 10 indicates that the handwritten character recognition has failed. The information shown may be output.

申請書に手書き文字が記載され、本人確認書類にローマ字等が記載された場合も、上記と同様の処理が行われる。これについて、本人確認書類９０３がパスポートである場合を例として説明する。 When handwritten characters are written on the application form and Roman characters are written on the identification document, the same processing as above is performed. This will be described as an example in which the personal identification document 903 is a passport.

この例において、本人確認書類９０３には、氏名として活字文字列「ＳａｋｕｒａｉＴａｒｏ」（図示省略）が記載され、外字・ローマ字辞書１６３に文字列「ＳａｋｕｒａｉＴａｒｏ」と「櫻井太郎」とを対応付ける情報が含まれる。この場合、ステップ９２１の結果、文字列「ＳａｋｕｒａｉＴａｒｏ」（図示省略）と「櫻井太郎」とを含む活字文字列の認識結果の文字列９２２が取得される。窓口受付システム１０は、上記と同様に、文字列９２２に含まれる「櫻井太郎」を用いて文字切り出し・識別仮説ネット９１１を検索し、「櫻井太郎」を手書き文字の認識結果として出力する（ステップ９１４）。 In this example, the personal identification document 903 includes a printed character string “Sakurai Taro” (not shown) as a name, and information that associates the character string “Sakurai Taro” with “Taro Sakurai” in the external character / Roman character dictionary 163. included. In this case, as a result of step 921, a character string 922 of a recognition result of a type character string including the character string “Sakurai Taro” (not shown) and “Taro Sakurai” is acquired. Similarly to the above, the window reception system 10 searches the character segmentation / identification hypothesis net 911 using “Taro Sakurai” included in the character string 922, and outputs “Taro Sakurai” as a handwritten character recognition result (step) 914).

このような外字・ローマ字辞書１６３を用いた展開は、図７に示す処理に適用することもできる。 Such development using the external character / Romaji dictionary 163 can also be applied to the processing shown in FIG.

例えば、図７の例において、本人確認書類に氏名の活字文字列として「渡辺太朗」が記載され、外字・ローマ字辞書１６３に文字「辺」と「邊」とを対応付ける情報が含まれる場合、窓口受付システム１０は、文字列「渡辺太朗」及びその「辺」を「邊」によって置き換えた文字列「渡邊太朗」とをデータ文字列パターン辞書７０１に追加する。「渡邊太朗」の追加は図７を参照して既に説明した通りである。一方、「渡辺太朗」は既にデータ文字列パターン辞書７０１に登録されているため、窓口受付システム１０は、「渡辺太朗」に対応するパスに含まれるエッジのスコアを増加させる。「渡辺太朗」がデータ文字列パターン辞書７０１に登録されていない場合には、「渡邊太朗」と同様にデータ文字列パターン辞書７０１に追加される。 For example, in the example of FIG. 7, when “Taro Watanabe” is written as the type name character string in the identity verification document, and the external character / Romaji dictionary 163 includes information associating the characters “side” and “邊”, The reception system 10 adds the character string “Taro Watanabe” and the character string “Taro Watanabe” in which the “side” is replaced with “邊” to the data character string pattern dictionary 701. The addition of “Taro Watanabe” has already been described with reference to FIG. On the other hand, since “Taro Watanabe” has already been registered in the data character string pattern dictionary 701, the window reception system 10 increases the score of the edge included in the path corresponding to “Taro Watanabe”. If “Taro Watanabe” is not registered in the data character string pattern dictionary 701, it is added to the data character string pattern dictionary 701 in the same manner as “Taro Watanabe”.

上記のように外字・ローマ字辞書１６３を使用することによって、文字の表記ゆれ又は文字種類の相違に起因する本人確認の失敗を防止することができる。なお、上記のような漢字の異体字及びローマ字による置換は一例であり、それ以外の文字による置換が許容される場合にも、外字・ローマ字辞書１６３と同様の辞書（すなわち相互に置換可能な文字又は文字列を対応付ける情報）を用いて、上記と同様の処理を実行することができる。 By using the external character / Romaji dictionary 163 as described above, it is possible to prevent a failure in identity verification due to a variation in character notation or a difference in character type. It should be noted that the above-described replacement of Kanji characters with variants and Roman characters is an example, and even when replacement with other characters is allowed, the dictionary similar to the external character / Roman character dictionary 163 (that is, mutually replaceable characters) Alternatively, it is possible to execute the same processing as described above using information that associates character strings.

なお、本実施形態は官庁又は企業における窓口業務を例として説明したが、本発明は、手書き文字とそれに対応する活字文字とが入力される業務であれば、窓口業務以外の業務に適用することもできる。 In addition, although this embodiment demonstrated the window service in the government office or a company as an example, if this invention is a service in which a handwritten character and a type letter corresponding to it are input, it is applied to work other than a window service. You can also.

１０窓口受付システム
１１、１１０入力装置
１１１画像入力装置
１２表示装置
１３ＣＰＵ
１４印刷装置
１５ワークエリア
１５１ＯＳ
１５２通信プログラム
１５３文書処理プログラム
１５４文字認識プログラム
１６情報保持手段
１６１活字文字認識用辞書
１６２手書き文字認識用辞書
１６３外字・ローマ字辞書
１６４配置知識辞書
１６５項目名辞書
１６６データ文字列パターン辞書
１９通信ネットワーク
２１ファイルサーバ 10 Window reception system 11, 110 Input device 111 Image input device 12 Display device 13 CPU
14 Printing device 15 Work area 151 OS
152 Communication program 153 Document processing program 154 Character recognition program 16 Information holding means 161 Typed character recognition dictionary 162 Handwritten character recognition dictionary 163 External character / Romaji dictionary 164 Location knowledge dictionary 165 Item name dictionary 166 Data character string pattern dictionary 19 Communication network 21 file server

Claims

A document processing apparatus comprising: an input device; a processor connected to the input device; a storage device connected to the processor; and an output device connected to the processor;
Said storage device, printed characters recognition dictionary including information indicating the characteristic quantity of printed characters, handwritten character recognition dictionary including information indicating a characteristic of the handwritten characters, as well as a plurality of characters, the connection between the plurality of character And a string pattern dictionary including a score indicating the likelihood of the connection ,
The processor is
Obtaining image data including a handwritten character string and a print character string input via the input device;
Recognizing the printed character string using the printed character recognition dictionary;
When the character string acquired as a recognition result of the type character string and the connection between the characters included in the character string are already registered in the character string pattern dictionary, the score of the score related to the connection between the registered characters When the value is incremented and the character string acquired as a recognition result of the printed character string and the connection between the characters included in the character string are not registered in the character string pattern dictionary, acquired as the recognition result of the printed character string Newly registered in the character string pattern dictionary, a character indicating the character included in the character string, the connection between those characters and the likelihood of the connection ,
Recognizing the handwritten character string using the handwritten character recognition dictionary and the character string pattern dictionary,
A document processing apparatus that outputs a recognition result of the handwritten character string via the output apparatus.

The processor is
Based on the handwritten character recognition dictionary, generate a plurality of character strings as candidates for the recognition result of the handwritten character string,
Search the character string included in the character string pattern dictionary from the plurality of generated character strings,
The document processing apparatus according to claim 1, wherein the character string acquired by the search is acquired as a recognition result of the handwritten character string.

The storage device further holds a replacement dictionary that associates a plurality of characters or character strings determined to be the same,
The processor generates a character string in which a character or character string included in a character string acquired as a recognition result of the type character string is replaced with a character or character string determined to be the same based on the replacement dictionary. The document processing apparatus according to claim 1, wherein the generated character string is stored in the storage device as the character string pattern dictionary.

A document processing method executed by a device including an input device, a processor connected to the input device, a storage device connected to the processor, and an output device connected to the processor,
  The storage device includes a printed character recognition dictionary including information indicating a characteristic amount of a printed character, a handwritten character recognition dictionary including information indicating a characteristic amount of a handwritten character, a plurality of characters, and a connection between the plurality of characters. And a string pattern dictionary including a score indicating the likelihood of the connection,
  The document processing method includes:
  A first procedure in which the processor acquires image data including a handwritten character string and a printed character string input via the input device;
  A second procedure in which the processor recognizes the type character string using the type character recognition dictionary;
  When the processor acquires a character string acquired as a recognition result of the printed character string and a connection between characters included in the character string is already registered in the character string pattern dictionary, the connection between the registered characters When the connection between the character string acquired as a recognition result of the type character string and the characters included in the character string is not registered in the character string pattern dictionary, the score value of the type character string is increased. A third procedure for newly registering a character included in a character string acquired as a recognition result, a connection between those characters, and a score indicating the likelihood of the connection in the character string pattern dictionary;
  A fourth procedure for the processor to recognize the handwritten character string using the handwritten character recognition dictionary and the character string pattern dictionary;
  A document processing method comprising: a fifth procedure in which the processor outputs a recognition result of the handwritten character string via the output device.

The fourth procedure includes
  The processor generates a plurality of character strings as candidates for the recognition result of the handwritten character string based on the dictionary for handwritten character recognition;
  A step of searching for a character string included in the character string pattern dictionary from the plurality of generated character strings;
  The document processing method according to claim 4, further comprising: a procedure in which the processor acquires a character string acquired by the search as a recognition result of the handwritten character string.

The storage device further holds a replacement dictionary that associates a plurality of characters or character strings determined to be the same,
In the document processing method, a character or character string that is determined to be the same as a character or character string included in a character string acquired as a recognition result of the type character string by the processor based on the replacement dictionary. Including a procedure to generate a string replaced with
The document processing method according to claim 4, wherein the third procedure includes a procedure of storing the generated character string in the storage device as the character string pattern dictionary.