JP2011048499A

JP2011048499A - Recognition result correction device, image processor, and program

Info

Publication number: JP2011048499A
Application number: JP2009194802A
Authority: JP
Inventors: Takashi Isozaki; 隆司磯崎
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2009-08-25
Filing date: 2009-08-25
Publication date: 2011-03-10

Abstract

<P>PROBLEM TO BE SOLVED: To improve accuracy of either character recognition or voice recognition without preparing a document set for recognition. <P>SOLUTION: A character recognition device 10 includes: a character recognition part 31 that performs character recognition on an image read by a character image reading part 21, and acquires candidate characters and character scores showing its certainty; a morphological analysis part 32 that performs morphological analysis on a character string where the candidate characters are arranged; a retrieval key creation part 33 that creates a retrieval key by connecting the candidate characters based on the result of the morphological analysis; a word evaluation part 34 that calculates retrieval scores by retrieving a document data storage part 41 whose selection has been accepted by a DB selection acceptance part 22 by using the retrieval key, and calculates word scores based on the retrieval scores; and a total evaluation part 35 that calculates total scores from the character stores and the word scores, and displays the total scores on a recognition result display part 23. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、認識結果修正装置、画像処理装置、プログラムに関する。 The present invention relates to a recognition result correction device, an image processing device, and a program.

文字認識装置において、ある文字から他の文字へ遷移する確率とその遷移関係にある文字の組合せに対応する単語識別情報を記憶する文字遷移確率テーブルを用いて候補文字列を最適化し、各単語を識別するための単語識別情報との単語間の階層関係を示す階層情報を記憶する単語辞書を最適化された候補文字列に対応する単語識別情報に基づいて検索し、階層情報に対応する未入力の単語を抽出する技術が知られている（例えば、特許文献１参照）。 In the character recognition device, a candidate character string is optimized using a character transition probability table that stores word identification information corresponding to a combination of a character having a transition relationship from one character to another character and a character having the transition relationship, A word dictionary storing hierarchical information indicating a hierarchical relationship between words with the word identification information for identification is searched based on the word identification information corresponding to the optimized candidate character string, and no input corresponding to the hierarchical information Is known (see, for example, Patent Document 1).

筆記データに似通っている度合いが大きい１つ以上の文字を表す文字コードのうちの何れか１つを認識結果として出力する際に、同じ字種の文字が連続して出現することが多いという日本語の一般的な傾向を利用して、前回認識結果として出力された文字コードが表す文字の字種と同じ字種となる文字を表す文字コードを、優先的に認識結果として出力する技術も知られている（例えば、特許文献２参照）。 Japan that characters of the same character type often appear continuously when outputting one of the character codes representing one or more characters that are highly similar to written data as a recognition result Also known is a technology that uses the general tendency of words to preferentially output character codes that represent characters that have the same character type as the character code that was output as the previous recognition result. (For example, refer to Patent Document 2).

確率が高い順番に任意の個数の形態素解析候補を求める形態素解析手段、単語モデルに基づく単語仮説生成手段、類似語モデルに基づく類似語検索手段を用いて、辞書に登録されていない入力文中の単語の表記と品詞を正しく同定し、正解文字が候補文字に含まれていない場合でも正確単語を提示し、最も尤もらしい順に、単語列と品詞列の組を提示する技術も知られている（例えば、特許文献３参照）。 Words in the input sentence that are not registered in the dictionary using morpheme analysis means for obtaining an arbitrary number of morpheme analysis candidates in descending order of probability, word hypothesis generation means based on word models, and similar word search means based on similar word models There is also known a technique for correctly identifying the notation and part of speech, presenting the correct word even when the correct character is not included in the candidate character, and presenting the pair of the word sequence and the part of speech sequence in the most likely order (for example, And Patent Document 3).

筆点の時系列パターンを認識するオンライン識別器として構造化字体表現及び線形処理時間伸縮マッチングを用い、非時系列の文字画像パターンを認識するオフライン識別器として修正二次識別関数を用いて、両者を統合し、また、オフラインパターンの特徴次元数及び修正二次識別関数の固有地を削減し、更に、文脈処理を後処理として用いる技術も知られている（例えば、非特許文献１参照）。 Both using a structured font representation and linear processing time expansion / contraction matching as an online discriminator that recognizes the time series pattern of writing points, and a modified secondary discriminant function as an offline discriminator that recognizes non-time series character image patterns In addition, a technique is also known in which the number of feature dimensions of the off-line pattern and the specific place of the modified secondary discriminant function are reduced, and further, context processing is used as post-processing (see Non-Patent Document 1, for example).

特開平９−３０５７１６号公報JP-A-9-305716 特開平８−１８０１３７号公報JP-A-8-180137 特開平８−３１５０７８号公報JP-A-8-315078

織田英人、朱碧蘭、小沼元輝、徳野淳子、耒代誠仁、中川正樹、「オフライン識別器を統合したオンライン手書き文字識別器の小型化」、電子情報通信学会、２００７年９月１日、電子情報通信学会論文誌Ｄ、Ｖｏｌ．Ｊ９０−Ｄ、Ｎｏ．９、ｐ．２５８３−２５９４Hideto Oda, Ran Toki, Mototeru Onuma, Atsuko Tokuno, Seito Toshiro, Masaki Nakagawa, “Miniaturization of an online handwritten character classifier integrated with an offline classifier”, IEICE, September 1, 2007, Electronic Information IEICE Transactions D, Vol. J90-D, no. 9, p. 2583-2594

本発明の目的は、文字認識処理及び音声認識処理の何れか一方の認識処理の精度を、認識処理のために文書集合を用意することなく向上することにある。 An object of the present invention is to improve the accuracy of one of character recognition processing and voice recognition processing without preparing a document set for recognition processing.

請求項１に記載の発明は、文字認識処理及び音声認識処理の何れか一方の認識処理の結果として得られた文字列に含まれる特定の文字の当該認識処理における確信度を取得する第１の取得手段と、前記文字列に含まれる前記特定の文字と、前記文字列に含まれる当該特定の文字の直前又は直後の文字とを含む検索語を生成する生成手段と、前記認識処理以外の用途のために用意された文書集合を、前記生成手段により生成された前記検索語を用いて検索することにより、当該検索語の使用に関する指標を取得する第２の取得手段と、前記第１の取得手段により取得された前記特定の文字の前記確信度を、前記第２の取得手段により取得された前記指標に基づいて修正する修正手段とを備えたことを特徴とする認識結果修正装置である。
請求項２に記載の発明は、前記認識処理に関して個別の精度を要求する単位を識別する識別情報を受け付ける識別情報受付手段を更に備え、前記第２の取得手段は、前記識別情報受付手段が受け付けた前記識別情報に予め関連付けられた前記文書集合を検索することを特徴とする請求項１に記載の認識結果修正装置である。
請求項３に記載の発明は、前記第２の取得手段は、前記識別情報受付手段が受け付けた前記識別情報に予め関連付けられた複数の前記文書集合を前記検索語を用いて検索することで得られた各文書集合における当該検索語の使用頻度と、前記識別情報受付手段が受け付けた前記識別情報に予め関連付けられた当該各文書集合の重みとに基づいて、前記指標を取得することを特徴とする請求項２に記載の認識結果修正装置である。
請求項４に記載の発明は、前記文字列に含まれる前記特定の文字を前記認識処理の結果として確定させる指示を受け付ける確定指示受付手段と、前記確定指示受付手段が前記指示を受け付けると、特定の文書集合を前記検索語を用いて検索することで得られた当該検索語の使用頻度に基づいて、当該特定の文書集合の重みを更新する更新手段とを更に備えたことを特徴とする請求項３に記載の認識結果修正装置である。
請求項５に記載の発明は、前記文書集合を選択する利用者の指示を受け付ける選択指示受付手段を更に備え、前記第２の取得手段は、前記選択指示受付手段が受け付けた前記指示により選択された前記文書集合を検索することを特徴とする請求項１に記載の認識結果修正装置である。
請求項６に記載の発明は、前記文字列に対して形態素解析を行う形態素解析手段を更に備え、前記生成手段は、前記形態素解析手段による形態素解析の結果に基づいて、前記検索語に含める前記特定の文字の直前又は直後の文字を特定することを特徴とする請求項１乃至５の何れかに記載の認識結果修正装置である。
請求項７に記載の発明は、画像が記録された記録媒体から当該画像を読み取る読取手段と、前記読取手段により読み取られた前記画像に対して文字認識を行った結果として得られた文字列に含まれる特定の文字の当該文字認識における確信度を取得する第１の取得手段と、前記文字列に含まれる前記特定の文字と、前記文字列に含まれる当該特定の文字の直前又は直後の文字とを含む検索語を生成する生成手段と、前記文字認識以外の用途のために用意された文書集合を、前記生成手段により生成された前記検索語を用いて検索することにより、当該検索語の使用に関する指標を取得する第２の取得手段と、前記第１の取得手段により取得された前記特定の文字の前記確信度を、前記第２の取得手段により取得された前記指標に基づいて修正する修正手段と、前記文字列に含まれる前記特定の文字を、前記修正手段による修正後の当該特定の文字の前記確信度に基づいて表示する表示手段とを備えたことを特徴とする画像処理装置である。
請求項８に記載の発明は、コンピュータに、文字認識処理及び音声認識処理の何れか一方の認識処理の結果として得られた文字列に含まれる特定の文字の当該認識処理における確信度を取得する機能と、前記文字列に含まれる前記特定の文字と、前記文字列に含まれる当該特定の文字の直前又は直後の文字とを含む検索語を生成する機能と、前記認識処理以外の用途のために用意された文書集合を、前記検索語を用いて検索することにより、当該検索語の使用に関する指標を取得する機能と、前記特定の文字の前記確信度を、前記指標に基づいて修正する機能とを実現させるためのプログラムである。 The invention according to claim 1 is a first method for acquiring a certainty factor in the recognition process of a specific character included in a character string obtained as a result of the recognition process of any one of the character recognition process and the voice recognition process. An acquisition unit; a generation unit that generates a search word including the specific character included in the character string; and a character immediately before or immediately after the specific character included in the character string; and uses other than the recognition process A second acquisition unit that acquires an index relating to use of the search word by searching the document set prepared for the search using the search word generated by the generation unit; and the first acquisition A recognition result correcting apparatus comprising: correcting means for correcting the certainty factor of the specific character acquired by the means based on the index acquired by the second acquiring means.
The invention according to claim 2 further includes identification information receiving means for receiving identification information for identifying a unit that requires individual accuracy with respect to the recognition processing, and the second acquisition means is received by the identification information receiving means. 2. The recognition result correcting apparatus according to claim 1, wherein the document set associated in advance with the identification information is searched.
According to a third aspect of the present invention, the second acquisition unit obtains a plurality of document sets previously associated with the identification information received by the identification information reception unit using the search word. The index is acquired based on the frequency of use of the search term in each document set and the weight of each document set associated in advance with the identification information received by the identification information receiving means. The recognition result correcting apparatus according to claim 2.
According to a fourth aspect of the present invention, a confirmation instruction accepting unit that accepts an instruction for confirming the specific character included in the character string as a result of the recognition process; and a specification when the confirmation instruction accepting unit accepts the instruction And updating means for updating the weight of the specific document set based on the frequency of use of the search word obtained by searching the document set using the search word. Item 4. The recognition result correction device according to Item 3.
The invention according to claim 5 further includes selection instruction receiving means for receiving an instruction of a user who selects the document set, and the second acquisition means is selected by the instruction received by the selection instruction receiving means. 2. The recognition result correcting apparatus according to claim 1, wherein the document set is searched.
The invention according to claim 6 further comprises morpheme analysis means for performing morpheme analysis on the character string, and the generation means includes the morpheme analysis included in the search word based on a result of morpheme analysis by the morpheme analysis means. 6. The recognition result correcting apparatus according to claim 1, wherein a character immediately before or after a specific character is specified.
According to a seventh aspect of the present invention, there is provided a reading unit that reads an image from a recording medium on which the image is recorded, and a character string obtained as a result of character recognition performed on the image read by the reading unit. A first acquisition unit configured to acquire a certainty factor in the character recognition of the specific character included; the specific character included in the character string; and a character immediately before or immediately after the specific character included in the character string. Generating a search term including: and a search for a set of documents prepared for use other than the character recognition by using the search term generated by the generation unit. A second acquisition unit that acquires an index relating to use, and the certainty factor of the specific character acquired by the first acquisition unit is corrected based on the index acquired by the second acquisition unit Image processing, and a display unit that displays the specific character included in the character string based on the certainty factor of the specific character after correction by the correction unit. Device.
The invention according to claim 8 acquires a certainty factor in the recognition process of a specific character included in the character string obtained as a result of one of the character recognition process and the voice recognition process. A function, a function for generating a search term including the specific character included in the character string, and a character immediately before or immediately after the specific character included in the character string, and for uses other than the recognition process A function for obtaining an index relating to use of the search word by searching the document set prepared in the search word, and a function for correcting the certainty factor of the specific character based on the index It is a program for realizing.

請求項１の発明によれば、文字認識処理及び音声認識処理の何れか一方の認識処理の精度を、認識処理のために文書集合を用意することなく向上することができる。
請求項２の発明によれば、認識処理に関する個別の精度の要求に合致するように、認識処理の精度を向上することができる。
請求項３の発明によれば、複数の文書集合を用いて、認識処理に関する個別の精度の要求に合致するように、認識処理の精度を向上することができる。
請求項４の発明によれば、認識処理を繰り返すに従って、認識処理の精度が向上する。
請求項５の発明によれば、利用者が自由に選択した文書集合を用いて、認識処理の精度を向上することができる。
請求項６の発明によれば、認識処理の精度を向上するために用いる検索語を、手間をかけることなく生成することができる。
請求項７の発明によれば、文字認識の精度を、文字認識のために文書集合を用意することなく向上することができる。
請求項８の発明によれば、文字認識処理及び音声認識処理の何れか一方の認識処理の精度を、認識処理のために文書集合を用意することなく向上することができる。 According to the first aspect of the present invention, the accuracy of any one of the character recognition process and the voice recognition process can be improved without preparing a document set for the recognition process.
According to the second aspect of the present invention, the accuracy of the recognition process can be improved so as to meet the individual accuracy requirements regarding the recognition process.
According to the invention of claim 3, the accuracy of the recognition processing can be improved by using a plurality of document sets so as to meet individual accuracy requirements regarding the recognition processing.
According to the invention of claim 4, as the recognition process is repeated, the accuracy of the recognition process is improved.
According to the invention of claim 5, the accuracy of the recognition processing can be improved by using the document set freely selected by the user.
According to the sixth aspect of the present invention, it is possible to generate a search term used for improving the accuracy of the recognition processing without taking time and effort.
According to the invention of claim 7, the accuracy of character recognition can be improved without preparing a document set for character recognition.
According to the invention of claim 8, the accuracy of either the character recognition process or the voice recognition process can be improved without preparing a document set for the recognition process.

本発明の第１の実施の形態における文字認識装置の機能構成例を示したブロック図である。It is the block diagram which showed the function structural example of the character recognition apparatus in the 1st Embodiment of this invention. 本発明の第１の実施の形態における文字認識装置の動作例を示したフローチャートである。It is the flowchart which showed the operation example of the character recognition apparatus in the 1st Embodiment of this invention. 本発明の第１の実施の形態及び第２の実施の形態で算出される文字スコアの例を示した図である。It is the figure which showed the example of the character score calculated in the 1st Embodiment and 2nd Embodiment of this invention. 本発明の第１の実施の形態で算出される検索スコアの例を示した図である。It is the figure which showed the example of the search score calculated in the 1st Embodiment of this invention. 本発明の第１の実施の形態及び第２の実施の形態で算出される総合スコアの例を示した図である。It is the figure which showed the example of the total score calculated in the 1st Embodiment and 2nd Embodiment of this invention. 本発明の第２の実施の形態における文字認識装置の機能構成例を示したブロック図である。It is the block diagram which showed the function structural example of the character recognition apparatus in the 2nd Embodiment of this invention. 本発明の第２の実施の形態におけるＤＢ重み保持部の記憶内容の例を示した図である。It is the figure which showed the example of the memory content of the DB weight holding | maintenance part in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における文字認識装置の動作例を示したフローチャートである。It is the flowchart which showed the operation example of the character recognition apparatus in the 2nd Embodiment of this invention. 本発明の第２の実施の形態で算出される検索スコアの例を示した図である。It is the figure which showed the example of the search score calculated in the 2nd Embodiment of this invention. 本発明の実施の形態を実現可能なコンピュータのハードウェア構成図である。It is a hardware block diagram of the computer which can implement | achieve embodiment of this invention.

以下、添付図面を参照して、本発明の実施の形態について詳細に説明する。
本実施の形態は、文字認識や音声認識を行う場合に、その認識の精度を簡便に向上させるものである。かかる認識の精度の向上のためには、一般に、Ｎグラム確率等の膨大なデータを用意する手間が必要となるが、本実施の形態では、文字認識や音声認識の結果が不自然なものとなった場合に、文字認識や音声認識以外の用途のために用意されたデータベースを検索することで、手間をかけずにその結果を修正する（以下、この検索に用いられるデータベースを「検索対象ＤＢ」と称する）。尚、このように、本実施の形態は、文字認識に対しても音声認識に対しても適用可能であるが、以下では、文字認識に適用した場合を例にとって説明する。 Embodiments of the present invention will be described below in detail with reference to the accompanying drawings.
In the present embodiment, when character recognition or voice recognition is performed, the accuracy of the recognition is simply improved. In order to improve the accuracy of such recognition, it is generally necessary to prepare an enormous amount of data such as N-gram probabilities, but in this embodiment, the results of character recognition and speech recognition are unnatural. If this happens, search the database prepared for uses other than character recognition and voice recognition, and correct the result without taking time (hereinafter, the database used for this search is referred to as “search target DB”). "). As described above, the present embodiment can be applied to both character recognition and voice recognition. However, in the following, a case where the present embodiment is applied to character recognition will be described as an example.

［第１の実施の形態］
この第１の実施の形態では、ユーザが検索対象ＤＢを明示的に選択する。
図１は、第１の実施の形態における文字認識装置の機能構成例を示したブロック図である。
図示するように、文字認識装置１０は、ＵＩ（User Interface）部２０と、処理部３０と、ＤＢ（Database）部４０とを備える。
ＵＩ部２０は、ユーザが入力する情報を受け付けたり、ユーザに対して情報を出力したりする部分であり、文字画像読取部２１と、ＤＢ選択受付部２２と、認識結果表示部２３とを備える。
処理部３０は、ＵＩ部２０で受け付けた情報に基づいて文字認識を行った結果を、ＤＢ部４０を用いて修正し、その結果をＵＩ部２０に表示する部分であり、文字認識部３１と、形態素解析部３２と、検索キー作成部３３と、単語評価部３４と、総合評価部３５とを備える。 [First Embodiment]
In the first embodiment, the user explicitly selects the search target DB.
FIG. 1 is a block diagram illustrating a functional configuration example of the character recognition device according to the first embodiment.
As illustrated, the character recognition device 10 includes a UI (User Interface) unit 20, a processing unit 30, and a DB (Database) unit 40.
The UI unit 20 is a part that receives information input by the user and outputs information to the user, and includes a character image reading unit 21, a DB selection receiving unit 22, and a recognition result display unit 23. .
The processing unit 30 is a part that corrects the result of character recognition based on the information received by the UI unit 20 using the DB unit 40 and displays the result on the UI unit 20. The morphological analysis unit 32, the search key creation unit 33, the word evaluation unit 34, and the comprehensive evaluation unit 35 are provided.

ＤＢ部４０は、処理部３０によって用いられるデータベースを含む部分であり、文書データ記憶部４１ａ，４１ｂ，４１ｃを備える。尚、図では、文書データ記憶部４１ａ，４１ｂ，４１ｃを示したが、これらを区別する必要がない場合は、文書データ記憶部４１と称することもある。また、図には、３つの文書データ記憶部４１しか示していないが、４つ以上の文書データ記憶部４１を設けてもよい。更に、図では、ＤＢ部４０を文字認識装置１０の一部として示しているが、ＤＢ部４０は文字認識装置１０の外部に存在していてもよい。本実施の形態では、文書集合の一例として、ＤＢ部４０を設けている。 The DB unit 40 includes a database used by the processing unit 30, and includes document data storage units 41a, 41b, and 41c. Although the document data storage units 41a, 41b, and 41c are shown in the figure, they may be referred to as the document data storage unit 41 when it is not necessary to distinguish them. Although only three document data storage units 41 are shown in the figure, four or more document data storage units 41 may be provided. Furthermore, although the DB unit 40 is shown as a part of the character recognition device 10 in the figure, the DB unit 40 may exist outside the character recognition device 10. In the present embodiment, a DB unit 40 is provided as an example of a document set.

まず、ＵＩ部２０の構成要素について説明する。
文字画像読取部２１は、紙等の記録媒体に記録された文字画像を読み取る。ここで、文字画像は、筆記具で手書きされた手書き文字の画像であってもよいし、プリンタで印刷された印刷文字の画像であってもよい。また、文字画像読取部２１は、例えばスキャナであり、光源から原稿に照射した光に対する反射光をレンズで縮小してＣＣＤ（Charge Coupled Devices）で受光するＣＣＤ方式や、ＬＥＤ光源から原稿に順に照射した光に対する反射光をＣＩＳ（Contact Image Sensor）で受光するＣＩＳ方式のものを用いるとよい。本実施の形態では、画像を読み取る読取手段の一例として、文字画像読取部２１を設けている。 First, components of the UI unit 20 will be described.
The character image reading unit 21 reads a character image recorded on a recording medium such as paper. Here, the character image may be an image of a handwritten character handwritten with a writing instrument, or may be an image of a printed character printed with a printer. The character image reading unit 21 is, for example, a scanner. The character image reading unit 21 is a CCD system in which reflected light with respect to light emitted from a light source to a document is reduced by a lens and received by a CCD (Charge Coupled Devices), or an LED light source is irradiated on a document in order. It is preferable to use a CIS system in which reflected light with respect to the received light is received by a CIS (Contact Image Sensor). In the present embodiment, a character image reading unit 21 is provided as an example of a reading unit that reads an image.

ＤＢ選択受付部２２は、ＤＢ部４０の検索に先立ち、ユーザが検索対象ＤＢとして選択した文書データ記憶部４１を示す選択情報を受け付ける。本実施の形態では、文書集合を選択する利用者の指示を受け付ける選択指示受付手段の一例として、ＤＢ選択受付部２２を設けている。
認識結果表示部２３は、処理部３０によって得られた認識結果を表示する。その際、認識結果は、処理部３０にて算出されたスコアに基づいて表示するとよい。ここで、認識結果表示部２３としては、例えばＬＣＤ（Liquid Crystal Display）を用いるとよい。本実施の形態では、特定の文字を修正後の確信度に基づいて表示する表示手段の一例として、認識結果表示部２３を設けている。 The DB selection receiving unit 22 receives selection information indicating the document data storage unit 41 selected as a search target DB by the user prior to the search of the DB unit 40. In the present embodiment, a DB selection receiving unit 22 is provided as an example of a selection instruction receiving unit that receives an instruction of a user who selects a document set.
The recognition result display unit 23 displays the recognition result obtained by the processing unit 30. At this time, the recognition result may be displayed based on the score calculated by the processing unit 30. Here, as the recognition result display unit 23, for example, an LCD (Liquid Crystal Display) may be used. In the present embodiment, the recognition result display unit 23 is provided as an example of a display unit that displays specific characters based on the certainty after correction.

次に、処理部３０の構成要素について説明する。
文字認識部３１は、文字画像読取部２１により読み取られた文字画像から１つ１つの文字に相当する部分を切り出して文字認識を行う。また、文字認識結果として、予め定められた個数の候補文字について、その候補文字を表す文字コードとその候補文字の文字スコアとを保持する。ここで、文字スコアとは、候補文字の文字認識結果としての確からしさを示す値である。尚、文字認識部３１における文字認識の方法としては、文字を図形的なパターンとして認識するオフライン認識技術を利用する方法や、時系列の情報を利用できる場合にはその情報を利用したオンライン認識技術を利用する方法が用いられることが多く、また、両者の組み合わせによって統合的に判断する方法が用いられることも多い。本実施の形態では、特定の文字の確信度の一例として、文字スコアを用いており、また、確信度を取得する第１の取得手段の一例として、文字認識部３１を設けている。 Next, components of the processing unit 30 will be described.
The character recognition unit 31 performs character recognition by cutting out a portion corresponding to each character from the character image read by the character image reading unit 21. In addition, as a character recognition result, for a predetermined number of candidate characters, a character code representing the candidate characters and a character score of the candidate characters are held. Here, the character score is a value indicating the certainty as a character recognition result of the candidate character. In addition, as a character recognition method in the character recognition unit 31, a method using an offline recognition technology for recognizing characters as a graphic pattern, or an online recognition technology using the information when time-series information is available. In many cases, a method of using the method is used, and a method of making an integrated judgment based on a combination of both is often used. In the present embodiment, a character score is used as an example of the certainty factor of a specific character, and the character recognition unit 31 is provided as an example of a first acquisition unit that acquires the certainty factor.

形態素解析部３２は、文字認識部３１による文字認識で抽出された候補文字からなる文字列を、品詞ごとに複数の語に分解し、各語に品詞情報を付与する。本実施の形態では、文字列に対して形態素解析を行う形態素解析手段の一例として、形態素解析部３２を設けている。 The morpheme analysis unit 32 decomposes a character string made up of candidate characters extracted by character recognition by the character recognition unit 31 into a plurality of words for each part of speech, and gives part of speech information to each word. In the present embodiment, a morpheme analysis unit 32 is provided as an example of a morpheme analysis unit that performs morpheme analysis on a character string.

検索キー作成部３３は、形態素解析部３２による形態素解析の結果に基づいて分割単位を決定し、この分割単位ごとに、候補文字を組み合わせて検索キーを生成する。ここで、分割単位の決定は、形態素解析部３２により得られた語が１つの文字のみを含むものであれば、その文字を前又は後の語と結合することによって行う。このとき、前又は後の何れの語と結合してもよいが、文字スコアが高い方の語と結合するのが好ましい。或いは、特定の品詞どうしの結びつきを優先して決定したり、語の長短によって決定したりしてもよい。また、検索キーの生成は、候補文字を組み合わせることで検索キーの数が膨大になるのを防ぐため、文字スコアが高い候補文字の組み合わせから予め設定された個数だけ選択することによって行ったり、文字スコアが予め設定された閾値より高い候補文字を組み合わせることによって行ったりしてもよい。本実施の形態では、検索語の一例として、検索キーを用いており、また、検索語を生成する生成手段の一例として、検索キー作成部３３を設けている。 The search key creation unit 33 determines a division unit based on the result of the morpheme analysis by the morpheme analysis unit 32, and generates a search key by combining candidate characters for each division unit. Here, if the word obtained by the morphological analysis unit 32 includes only one character, the division unit is determined by combining the character with the previous or subsequent word. At this time, it may be combined with either the previous or subsequent word, but is preferably combined with the word having the higher character score. Alternatively, the connection between specific parts of speech may be determined with priority, or may be determined based on the length of the word. In addition, search key generation is performed by selecting a predetermined number of combinations of candidate characters having a high character score, in order to prevent the number of search keys from becoming enormous by combining candidate characters, It may be performed by combining candidate characters whose score is higher than a preset threshold value. In the present embodiment, a search key is used as an example of a search term, and a search key creation unit 33 is provided as an example of a generation unit that generates a search term.

単語評価部３４は、検索キー作成部３３により生成された検索キーを用いて、ＤＢ選択受付部２２で受け付けた選択情報によって示される文書データ記憶部４１を検索し、その検索結果に基づくスコアである検索スコアを算出する。この場合、検索でヒットした文書の件数を単純に検索スコアとしてもよいし、検索でヒットした文書中の検索キーの出現頻度の総計を検索スコアとしてもよい。そして、この検索スコアに基づいて、単語としてのスコアである単語スコアを算出する。本実施の形態では、検索語の使用に関する指標の一例として、単語スコアを用いており、また、指標を取得する第２の取得手段の一例として、単語評価部３４を設けている。尚、本明細書では、便宜上「単語スコア」という文言を用いるが、このスコアが付される候補文字の組み合わせは、必ずしも文法上の単語を構成するとは限らない。 The word evaluation unit 34 searches the document data storage unit 41 indicated by the selection information received by the DB selection reception unit 22 using the search key generated by the search key creation unit 33, and uses a score based on the search result. A certain search score is calculated. In this case, the number of documents hit by the search may be simply used as the search score, or the total appearance frequency of the search keys in the documents hit by the search may be used as the search score. And based on this search score, the word score which is a score as a word is calculated. In the present embodiment, a word score is used as an example of an index related to use of a search word, and a word evaluation unit 34 is provided as an example of a second acquisition unit that acquires the index. In this specification, the term “word score” is used for convenience, but the combination of candidate characters to which this score is attached does not necessarily constitute a grammatical word.

総合評価部３５は、文字認識部３１による文字認識で得られた文字スコアと、単語評価部３４による検索で得られた単語スコアとを統合して、最終的に各文字画像の文字認識結果を決定する。本実施の形態では、特定の文字の確信度を指標に基づいて修正する修正手段の一例として、総合評価部３５を設けている。 The comprehensive evaluation unit 35 integrates the character score obtained by the character recognition by the character recognition unit 31 and the word score obtained by the search by the word evaluation unit 34, and finally obtains the character recognition result of each character image. decide. In the present embodiment, the comprehensive evaluation unit 35 is provided as an example of a correction unit that corrects the certainty factor of a specific character based on an index.

更に、ＤＢ部４０の構成要素について説明する。
文書データ記憶部４１は、検索が可能なように予めインデックスが付された一般の文書データベースであり、この中から選択されたデータベースが、処理部３０にて用いられる検索対象ＤＢとなる。この文書データベースは、例えば、個人や組織が所有しているものであってもよいし、インターネット検索エンジンで用いられるウェブページの集合であってもよい。 Furthermore, the components of the DB unit 40 will be described.
The document data storage unit 41 is a general document database that is pre-indexed so as to be searchable, and a database selected from these is a search target DB used by the processing unit 30. For example, the document database may be owned by an individual or an organization, or may be a set of web pages used in an Internet search engine.

次に、第１の実施の形態における文字認識装置１０の動作について説明する。
図２は、このときの文字認識装置１０の動作例を示したフローチャートである。
まず、ユーザが手書きされた文字画像や印刷された文字画像を文字認識装置１０に入力すると、文字画像読取部２１が、この文字画像を読み取る（ステップ１０１）。
また、ユーザが１つの検索対象ＤＢを選択する操作を行うと、ＤＢ選択受付部２２が、その選択内容を示す選択情報を受け付ける（ステップ１０２）。尚、この場合、検索対象ＤＢとしては、上述したように、文字認識用の辞書データベースではなく、別の用途のために用意された一般の文書データベースが用いられる。 Next, the operation of the character recognition device 10 in the first embodiment will be described.
FIG. 2 is a flowchart showing an operation example of the character recognition device 10 at this time.
First, when a user inputs a handwritten character image or a printed character image to the character recognition device 10, the character image reading unit 21 reads the character image (step 101).
When the user performs an operation of selecting one search target DB, the DB selection receiving unit 22 receives selection information indicating the selection content (step 102). In this case, as the search target DB, as described above, a general document database prepared for another use is used instead of the dictionary database for character recognition.

ＵＩ部２０はこのようにして情報を取得すると、これらの情報を処理部３０に出力する。
すると、まず、文字認識部３１が、文字画像読取部２１が出力した文字画像に対して文字認識を行う（ステップ１０３）。これにより、文字認識部３１は、予め定められた個数の候補文字について、その文字コードと、その確からしさ（確信度、尤度）を示す文字スコアとを得る。尚、この文字コードと文字スコアは、処理部３０の各機能から参照可能なメモリに記憶される。
次に、形態素解析部３２が、文字認識部３１が得た候補文字からなる文字列を形態素解析によって複数の語に分解する（ステップ１０４）。このとき、形態素解析によって得られた各語には、名詞、動詞、助詞、助動詞等の品詞情報が付されている。 When the UI unit 20 acquires the information in this way, the UI unit 20 outputs the information to the processing unit 30.
Then, first, the character recognition unit 31 performs character recognition on the character image output from the character image reading unit 21 (step 103). Thereby, the character recognizing unit 31 obtains the character code and the character score indicating the certainty (confidence, likelihood) of the predetermined number of candidate characters. The character code and the character score are stored in a memory that can be referred to from each function of the processing unit 30.
Next, the morphological analysis unit 32 decomposes the character string made up of candidate characters obtained by the character recognition unit 31 into a plurality of words by morphological analysis (step 104). At this time, parts of speech information such as nouns, verbs, particles, and auxiliary verbs are attached to each word obtained by morphological analysis.

次いで、検索キー作成部３３は、形態素解析部３２による形態素解析の結果に基づいて、分割単位を決定する（ステップ１０５）。例えば、形態素解析で得られた語のうち、助詞や助動詞の品詞情報が付された語は、前後の語が名詞、形容詞、動詞であればそれと連結して、分割単位とする。その他の場合は、形態素解析で得られた語をそのまま分割単位とする。 Next, the search key creation unit 33 determines a division unit based on the result of the morpheme analysis by the morpheme analysis unit 32 (step 105). For example, among words obtained by morphological analysis, words attached with participle or part of speech information of auxiliary verbs are connected as a division unit if the preceding and following words are nouns, adjectives, and verbs. In other cases, words obtained by morphological analysis are used as division units as they are.

その後、検索キー作成部３３は、分割単位ごとに、検索対象ＤＢの検索に用いる検索キーを作成する。
即ち、検索キー作成部３３は、まず、１つの分割単位に着目する（ステップ１０６）。
次に、着目している分割単位に含まれる各文字に対応する候補文字の組み合わせを取得する（ステップ１０７）。通常は、この取得した候補文字の組み合わせの全てを検索キーとすればよい。
ところが、このような検索キーの生成方法では、膨大な数の検索キーが生成される可能性がある。具体的には、候補文字の数をｍとし、分割単位内の文字の数をｎとすると、ｍのｎ乗個の検索キーが生成されることになる。そこで、このような事態を回避するため、候補文字の文字スコアの平均値が予め与えられた閾値より大きくなるような候補文字の組み合わせのみを検索キーとする。従って、検索キー作成部３３は、ステップ１０３で記憶した文字スコアを参照し、候補文字の文字スコアの平均値が予め定めた閾値よりも大きいかどうかを判定する（ステップ１０８）。 After that, the search key creation unit 33 creates a search key used for searching the search target DB for each division unit.
That is, the search key creation unit 33 first focuses on one division unit (step 106).
Next, a combination of candidate characters corresponding to each character included in the division unit of interest is acquired (step 107). Normally, all of the acquired combinations of candidate characters may be used as search keys.
However, in such a search key generation method, a huge number of search keys may be generated. Specifically, assuming that the number of candidate characters is m and the number of characters in the division unit is n, m-th n search keys are generated. Therefore, in order to avoid such a situation, only combinations of candidate characters whose average character score of candidate characters is larger than a predetermined threshold are used as search keys. Accordingly, the search key creation unit 33 refers to the character score stored in step 103 and determines whether the average value of the character scores of the candidate characters is greater than a predetermined threshold (step 108).

その結果、文字スコアの平均値が閾値以下である場合は、他の候補文字の組み合わせについてステップ１０８の判定を行う。一方、文字スコアの平均値が閾値よりも大きい場合は、その候補文字の組み合わせを検索キーとし（ステップ１０９）、候補文字の組み合わせが他にあるかどうかを判定する（ステップ１１０）。そして、候補文字の組み合わせが他にあれば、ステップ１０７〜１０９の処理を繰り返し、候補文字の組み合わせが他になければ、単語評価部３４の処理に移る。 As a result, if the average value of the character scores is equal to or less than the threshold value, the determination in step 108 is performed for other candidate character combinations. On the other hand, if the average value of the character scores is greater than the threshold value, the candidate character combination is used as a search key (step 109), and it is determined whether there are other candidate character combinations (step 110). Then, if there are other candidate character combinations, the processes of steps 107 to 109 are repeated. If there are no other candidate character combinations, the processing of the word evaluation unit 34 is performed.

そして、単語評価部３４は、検索キー作成部３３が作成した１つ以上の検索キーを受け取り、この検索キーを用いて、ＤＢ選択受付部２２で受け付けた選択情報で示される文書データ記憶部４１を検索する（ステップ１１１）。そして、検索結果に基づいて、検索スコアを算出し、この検索スコアに基づいて、候補文字を連結した単語としての確からしさを示す単語スコアを算出する（ステップ１１２）。ここで、検索スコアは、例えば、検索キーで検索された文書の件数（検索ヒット件数）や、検索された文書における検索キーの出現頻度の総和等で算出すればよい。 Then, the word evaluation unit 34 receives one or more search keys created by the search key creation unit 33, and uses this search key to store the document data storage unit 41 indicated by the selection information received by the DB selection reception unit 22. (Step 111). Then, based on the search result, a search score is calculated, and based on this search score, a word score indicating the probability as a word in which candidate characters are connected is calculated (step 112). Here, the search score may be calculated by, for example, the number of documents searched with the search key (number of search hits), the sum of the appearance frequencies of the search keys in the searched document, or the like.

次に、総合評価部３５は、ステップ１０３で文字認識部３１が得た文字スコアと、ステップ１１２で単語評価部３４が算出した単語スコアとを統合して、候補文字ごとの最終的な確からしさを示す総合スコアを算出する（ステップ１１５）。そして、総合スコアは認識結果表示部２３に送られ、認識結果表示部２３が、候補文字に最終的な順位を付加して表示する（ステップ１１６）。この順位としては、例えば、ステップ１１５で算出された総合スコアの高いものほど上位になるような順位を採用することが考えられる。
更に、総合評価部３５は、分割単位が他にあるかどうかを判定する（ステップ１１７）。そして、分割単位が他にあれば、制御を検索キー作成部３３に戻してステップ１０６〜１１６の処理を繰り返し、分割単位が他になければ、第１の実施の形態の動作は終了する。 Next, the comprehensive evaluation unit 35 integrates the character score obtained by the character recognition unit 31 in step 103 and the word score calculated by the word evaluation unit 34 in step 112 to obtain the final certainty for each candidate character. Is calculated (step 115). Then, the total score is sent to the recognition result display unit 23, and the recognition result display unit 23 adds the final rank to the candidate characters and displays them (step 116). As this rank, for example, it is conceivable to adopt a rank such that the higher the total score calculated in step 115, the higher the rank.
Furthermore, the comprehensive evaluation unit 35 determines whether there are other division units (step 117). If there is another division unit, the control is returned to the search key creation unit 33, and the processing of steps 106 to 116 is repeated. If there is no other division unit, the operation of the first embodiment ends.

尚、以上の動作例によって算出される総合スコアは、次のような式で表される。 The total score calculated by the above operation example is expressed by the following equation.

Ｔ_ｉｊは、ある分割単位におけるｉ番目の文字のｊ番目の候補文字の総合スコアである。
Ｃ_ｉｊは、ある分割単位におけるｉ番目の文字のｊ番目の候補文字の文字スコアである。
Ｓ_ｉｊｋは、ある分割単位におけるｉ番目の文字のｊ番目の候補文字を含むｋ番目の検索キーで検索対象ＤＢを検索したときに得られた検索スコアである。尚、Ｓ_ｉｊｋとしては、ヒット件数や、ヒット全文書中の検索キーの出現頻度等を用いればよい。また、出現頻度が０である場合でも検索スコアが０にならないように事前頻度１等を全ての検索キーに加えてもよい。
そして、上記式は、文字スコアＣ_ｉｊと、Ｓ_ｉｊｋのｋを変動させた場合の最大値である単語スコアとを掛け合わせることにより、総合スコアＴ_ｉｊが得られることを示している。 _Tij is a total score of the jth candidate character of the ith character in a certain division unit.
C _ij is the character score of the j-th candidate character of the i-th character in a certain division unit.
S _ijk is a search score obtained when the search target DB is searched with the k-th search key including the j-th candidate character of the i-th character in a certain division unit. As S _ijk , the number of hits, the appearance frequency of search keys in all hit documents, etc. may be used. Further, even when the appearance frequency is 0, a prior frequency 1 or the like may be added to all search keys so that the search score does not become 0.
The above formula indicates that the overall score T _ij can be obtained by multiplying the character score C _ij by the word score that is the maximum value when k of S _ijk is varied.

ここで、図２に示した動作例を、具体例を用いて説明する。
図３は、ステップ１０３で文字認識部３１によって算出され、図示しないメモリを介して検索キー作成部３３及び総合評価部３５に渡される文字スコアＣ_ｉｊを示した図である。但し、ここでは、ステップ１０６で２文字のみからなる分割単位に着目した場合を想定している。尚、このような分割単位は、ステップ１０４での形態素解析の結果に基づくものであるが、各文字の候補文字のうちどの候補文字からなる文字列について形態素解析を行うかは自由に決めてよい。例えば、文字スコアが最も高い候補文字からなる文字列（図の例では、「・・・士田・・・」という文字列）について形態素解析を行うことが考えられる。
図には、例えば、１番目の文字の１番目の候補文字は「士」で、文字スコアＣ_１１は９３であり、１番目の文字の２番目の候補文字は「キ」で、文字スコアＣ_１２は９０であることが示されている。また、２番目の文字の１番目の候補文字は「田」で、文字スコアＣ_２１は９５であり、２番目の文字の２番目の候補文字は「旧」で、文字スコアＣ_２２は７５であることが示されている。 Here, the operation example shown in FIG. 2 will be described using a specific example.
FIG. 3 is a diagram showing the character score C _ij calculated by the character recognition unit 31 in step 103 and passed to the search key creation unit 33 and the comprehensive evaluation unit 35 via a memory (not shown). However, here, it is assumed that attention is paid to a division unit consisting of only two characters in step 106. Such division units are based on the result of morphological analysis in step 104, but it is up to you to decide which candidate character character string is to be subjected to morphological analysis among candidate characters for each character. . For example, it is conceivable to perform a morphological analysis on a character string composed of candidate characters having the highest character score (in the example of the figure, a character string “... Shita ...”).
In the figure, for example, the first candidate character of the first character is “shi”, the character score C ₁₁ is 93, the second candidate character of the first character is “ki”, and the character score C ₁₂ is shown to be 90. The first candidate character of the second character is “da”, the character score C ₂₁ is 95, the second candidate character of the second character is “old”, and the character score C ₂₂ is 75. It is shown that there is.

図４は、ステップ１１２で単語評価部３４によって算出される検索スコアＳ_ｉｊｋを示した図である。但し、ステップ１０８で用いる閾値を８０とし、文字スコアの平均値がこの閾値を超える候補文字の組み合わせについて検索スコアを示している。
例えば、１番目の文字の１番目の候補文字「士」を含む検索キーのうち、文字スコアの平均値が８０を超えるものは、「士田」と「士旧」である。そこで、図には、「士田」を検索キーとして検索対象ＤＢを検索して得られた検索スコアＳ_１１１と、「士旧」を検索キーとして検索対象ＤＢを検索して得られた検索スコアＳ_１１２とが示されている。この検索スコアＳ_１１１と検索スコアＳ_１１２の中で最大である検索スコアＳ_１１１が単語スコアとして、図示しないメモリを介して総合評価部３５に渡される。同様に、２番目の候補文字「キ」を含む検索キー、３番目の候補文字「土」を含む検索キー、４番目の候補文字「工」を含む検索キーのうち、文字スコアの平均値が８０を超える検索キーを用いた場合の検索スコアも示されている。尚、５番目の候補文字「ユ」を含む検索キーで、文字スコアの平均値が８０を超えるものはないので、候補文字「ユ」を含む検索キーを用いた場合の検索スコアを格納する欄は設けていない。 FIG. 4 is a diagram showing the search score S _ijk calculated by the word evaluation unit 34 in step 112. However, the threshold used in step 108 is 80, and the search score is shown for a combination of candidate characters whose average character score exceeds this threshold.
For example, among the search keys including the first candidate character “shi” of the first character, those having an average character score of more than 80 are “shida” and “shiji”. Therefore, in the figure, a search score S ₁₁₁ obtained by searching the search target DB using “Shida” as a search key, and a search score obtained by searching the search target DB using “Shi old” as a search key. S ₁₁₂ is shown. Maximum a is search score S ₁₁₁ in this and search score S ₁₁₁ search score S ₁₁₂ as a word score, it is passed to the comprehensive evaluation unit 35 via a memory (not shown). Similarly, among the search key including the second candidate character “K”, the search key including the third candidate character “Sat”, and the search key including the fourth candidate character “K”, the average value of the character scores is A search score when using search keys exceeding 80 is also shown. Since there is no search key including the fifth candidate character “yu” and the average character score exceeds 80, a field for storing the search score when the search key including the candidate character “yu” is used. Is not provided.

また、２番目の文字の１番目の候補文字「田」を含む検索キーのうち、文字スコアの平均値が８０を超えるものは、「士田」と「キ田」と「土田」と「工田」である。そこで、図には、「士田」を検索キーとして検索対象ＤＢを検索して得られた検索スコアＳ_２１１と、「キ田」を検索キーとして検索対象ＤＢを検索して得られた検索スコアＳ_２１２と、「土田」を検索キーとして検索対象ＤＢを検索して得られた検索スコアＳ_２１３と、「工田」を検索キーとして検索対象ＤＢを検索して得られた検索スコアＳ_２１４とが示されている。この検索スコアＳ_２１１と検索スコアＳ_２１２と検索スコアＳ_２１３と検索スコアＳ_２１４の中で最大である検索スコアＳ_２１３が単語スコアとして、図示しないメモリを介して総合評価部３５に渡される。同様に、２番目の候補文字「旧」を含む検索キーのうち、文字スコアの平均値が８０を超える検索キーを用いた場合の検索スコアも示されている。尚、３番目の候補文字「口」を含む検索キー、４番目の候補文字「十」を含む検索キー、５番目の候補文字「Ｘ」を含む検索キーで、文字スコアの平均値が８０を超えるものはないので、候補文字「口」、「十」、「Ｘ」を含む検索キーを用いた場合の検索スコアを格納する欄は設けていない。 Of the search keys that include the first candidate character “da” of the second character, those with an average character score exceeding 80 are “Shida”, “Kita”, “Tsuchida”, and “ Rice field ". Therefore, in the figure, the search target DB search search score S ₂₁₁ obtained as a search key "Sita", "Kita" the search key as the search target DB search was obtained search score S ₂₁₂ , a search score S ₂₁₃ obtained by searching the search target DB using “Tsuchida” as a search key, and a search score S ₂₁₄ obtained by searching the search target DB using “Kuda” as a search key It is shown. As search score _{S 213} the word score is the largest among the search score _{S 211} and the search score _{S 212} and the search score _{S 213} and the search score _{S 214,} is passed to the comprehensive evaluation unit 35 via a memory (not shown). Similarly, the search score when the search key including the second candidate character “Old” using the search key with an average value of the character score exceeding 80 is also shown. It should be noted that a search key including the third candidate character “mouth”, a search key including the fourth candidate character “ten”, a search key including the fifth candidate character “X”, and an average character score of 80 Since there is nothing exceeding, there is not provided a column for storing a search score when a search key including candidate characters “mouth”, “ten”, and “X” is used.

図５は、ステップ１１５で総合評価部３５によって算出される総合スコアＴ_ｉｊを示した図である。ここでは、上述したように、Ｓ_ｉｊｋのｋを変動させた場合の最大値である単語スコアをＣ_ｉｊに乗ずることにより、Ｔ_ｉｊを求めている。
例えば、１番目の文字の１番目の候補文字「士」を含む検索キーを用いた場合の検索スコアの最大値である単語スコアは、上記の通り、Ｓ_１１１＝５５である。そこで、図には、総合スコアＴ_１１が５１１５であることが示されている（Ｔ_１１＝Ｃ_１１×ｍａｘＳ_１１Ｋ＝９３×５５＝５１１５）。同様に、２番目の候補文字「キ」、３番目の候補文字「土」、４番目の候補文字「工」についても、総合スコアが示されている。尚、５番目の候補文字「ユ」については、Ｓ_１５Ｋが得られていないので、Ｃ_１５をそのままＴ_１５としている。 FIG. 5 is a diagram showing the total score T _ij calculated by the comprehensive evaluation unit 35 in step 115. Here, as described above, T _ij is obtained by multiplying C _ij by the word score which is the maximum value when k of S _ijk is varied.
For example, as described above, the word score that is the maximum value of the search score when using the search key including the first candidate character “shi” of the first character is S ₁₁₁ = 55. Therefore, the figure shows that the total score T ₁₁ is 5115 (T ₁₁ = C ₁₁ × maxS _11K = 93 × 55 = 5115). Similarly, the overall score is shown for the second candidate character “K”, the third candidate character “Sat”, and the fourth candidate character “K”. In addition, since S _15K is not obtained for the fifth candidate character “Yu”, C ₁₅ is set as T ₁₅ as it is.

また、２番目の文字の１番目の候補文字「田」を含む検索キーを用いた場合の検索スコアの最大値である単語スコアは、上記の通り、Ｓ_２１３＝２１５０である。そこで、図には、総合スコアＴ_２１が２０４２５０であることが示されている（Ｔ_２１＝Ｃ_２１×ｍａｘＳ_２１Ｋ＝９５×２１５０＝２０４２５０）。同様に、２番目の候補文字「旧」についても、総合スコアが示されている。尚、３番目の候補文字「口」、４番目の候補文字「十」、５番目の候補文字「Ｘ」については、Ｓ_２ｊｋが得られていないので、Ｃ_２ｊをそのままＴ_２ｊとしている（ｊ＝３，４，５）。 Further, as described above, the word score that is the maximum value of the search score when the search key including the first candidate character “field” of the second character is used is S ₂₁₃ = 2150. Therefore, in the figure, the total score _{T 21} is shown to be _{_{204250 (T 21 = C 21 ×}} maxS 21K = 95 × 2150 = 204250). Similarly, the overall score is also shown for the second candidate character “Old”. Since S _2jk is not obtained for the third candidate character “mouth”, the fourth candidate character “ten”, and the fifth candidate character “X”, C _2j is used as T _2j as it is (j = 3,4,5).

以上により、１番目の文字の候補文字については、「土」の総合スコアＴ_１３が最大となり、２番目の文字の候補文字については、「田」の総合スコアＴ_２１が最大となっている。従って、この分割単位における文字認識結果としては、「土田」という結果の確信度が最も高いことが分かる。 Thus, for the first character of the candidate character, become a total score T ₁₃ is the largest of the "soil", for the second character of the candidate character, the overall score T ₂₁ of the "field" is the largest. Therefore, it can be seen that the character recognition result in this division unit has the highest certainty of the result “Tsuchida”.

尚、この第１の実施の形態では、ユーザが１つの検索対象ＤＢを指定したが、複数の検索対象ＤＢを指定してもよい。
また、この第１の実施の形態では、形態素解析によって、文字列を自動的に品詞ごとのまとまりに分解するようにした。しかしながら、入力された文字列のどの部分を検索キーとするかをユーザが設定してもよい。
更に、この第１の実施の形態では、文字スコアの平均値が閾値を超えた候補文字の組み合わせからなる検索キーを生成したが、文字スコアの平均値が大きいものから予め設定された個数だけ候補文字の組み合わせを選択して検索キーを生成するようにしてもよい。 In the first embodiment, the user specifies one search target DB, but a plurality of search target DBs may be specified.
In the first embodiment, the character string is automatically decomposed into groups of parts of speech by morphological analysis. However, the user may set which part of the input character string is used as the search key.
Furthermore, in this first embodiment, a search key comprising a combination of candidate characters whose average character score exceeds the threshold value is generated. However, only a preset number of candidates are selected from those having a large average character score. A search key may be generated by selecting a combination of characters.

［第２の実施の形態］
第１の実施の形態では、ユーザが検索対象ＤＢを選択しなければならないため、手間がかかることも懸念される。特に、多数のデータベースの中から１つのデータベースを選択しなければならない場合には、ユーザを補助したり、ユーザに相応しい選択を自動的に行ったりする方が好適である。
そこで、この第２の実施の形態では、単語スコアの算出に適した検索対象ＤＢを文字認識装置１０が自動的に選択する。 [Second Embodiment]
In 1st Embodiment, since a user must select search object DB, we are anxious also about taking time. In particular, when one database must be selected from a large number of databases, it is preferable to assist the user or automatically make a selection suitable for the user.
Therefore, in the second embodiment, the character recognition device 10 automatically selects a search target DB suitable for calculating a word score.

図６は、第２の実施の形態における文字認識装置の機能構成例を示したブロック図である。
図示するように、文字認識装置１０は、ＵＩ（User Interface）部２０と、処理部３０と、ＤＢ（Database）部４０とを備える。
ＵＩ部２０は、ユーザが入力する情報を受け付けたり、ユーザに対して情報を出力したりする部分であり、文字画像読取部２１と、認識結果表示部２３と、ユーザＩＤ受付部２４と、確定指示受付部２５とを備える。
処理部３０は、ＵＩ部２０で受け付けた情報に基づいて文字認識を行った結果を、ＤＢ部４０を用いて修正し、その結果をＵＩ部２０に表示する部分であり、文字認識部３１と、形態素解析部３２と、検索キー作成部３３と、単語評価部３４と、総合評価部３５と、ＤＢ重み保持部３６と、ＤＢ重み計算部３７とを備える。 FIG. 6 is a block diagram illustrating a functional configuration example of the character recognition device according to the second embodiment.
As illustrated, the character recognition device 10 includes a UI (User Interface) unit 20, a processing unit 30, and a DB (Database) unit 40.
The UI unit 20 is a part that receives information input by the user and outputs information to the user. The UI unit 20 is a character image reading unit 21, a recognition result display unit 23, a user ID reception unit 24, and a confirmation. And an instruction receiving unit 25.
The processing unit 30 is a part that corrects the result of character recognition based on the information received by the UI unit 20 using the DB unit 40 and displays the result on the UI unit 20. A morpheme analysis unit 32, a search key creation unit 33, a word evaluation unit 34, a comprehensive evaluation unit 35, a DB weight holding unit 36, and a DB weight calculation unit 37.

まず、ＵＩ部２０の構成要素について説明する。但し、文字画像読取部２１、認識結果表示部２３については、第１の実施の形態と同様なのでここでの説明は省略する。
ユーザＩＤ受付部２４は、処理部３０が検索対象ＤＢを自動的に選択するための手がかりとなる情報として、ユーザＩＤを受け付ける。本実施の形態では、識別情報を受け付ける識別情報受付手段の一例として、ユーザＩＤ受付部２４を設けている。
確定指示受付部２５は、認識結果表示部２３が表示した認識結果を最終的に確定させる指示入力を受け付ける。本実施の形態では、特定の文字を認識処理の結果として確定させる指示を受け付ける確定指示受付手段の一例として、確定指示受付部２５を設けている。 First, components of the UI unit 20 will be described. However, since the character image reading unit 21 and the recognition result display unit 23 are the same as those in the first embodiment, description thereof is omitted here.
The user ID reception unit 24 receives a user ID as information that is a clue for the processing unit 30 to automatically select a search target DB. In the present embodiment, a user ID receiving unit 24 is provided as an example of an identification information receiving unit that receives identification information.
The confirmation instruction receiving unit 25 receives an instruction input for finally confirming the recognition result displayed by the recognition result display unit 23. In the present embodiment, a confirmation instruction receiving unit 25 is provided as an example of a confirmation instruction receiving unit that receives an instruction to fix a specific character as a result of recognition processing.

次に、処理部３０の構成要素について説明する。但し、文字認識部３１、形態素解析部３２、検索キー作成部３３、総合評価部３５については、第１の実施の形態と同様なのでここでの説明は省略する。
単語評価部３４は、ユーザＩＤ受付部２４が受け付けたユーザＩＤをキーにＤＢ重み保持部３６を参照して検索対象ＤＢを選択し、選択された検索対象ＤＢごとに、検索キーの検索スコアを算出する。そして、ＤＢ重み保持部３６を参照して検索スコアを重み付けして単語スコアを算出する。ここで、重み付けは、例えば、重み付け平均をとることによって行われる。本実施の形態では、検索語の使用頻度の一例として、検索スコアを用いている。 Next, components of the processing unit 30 will be described. However, since the character recognition unit 31, the morphological analysis unit 32, the search key creation unit 33, and the comprehensive evaluation unit 35 are the same as those in the first embodiment, description thereof is omitted here.
The word evaluation unit 34 selects a search target DB by referring to the DB weight holding unit 36 using the user ID received by the user ID reception unit 24 as a key, and sets a search key search score for each selected search target DB. calculate. Then, the DB weight holding unit 36 is referred to weight the search score to calculate the word score. Here, the weighting is performed, for example, by taking a weighted average. In the present embodiment, a search score is used as an example of the usage frequency of the search word.

ＤＢ重み保持部３６は、ＤＢ重み計算部３７によってデータベースに付与された重みを蓄積して保持する。例えば、ＤＢ重み計算部３７によって与えられた得点を加算して保持する。
ＤＢ重み計算部３７は、単語スコアをユーザに相応しいものとする検索対象ＤＢが選択されるようにデータベースの重みを計算する。この重みは、確定指示受付部２５と連携しており、ユーザが最終的な文字認識結果として確定したのと同じ文字又は文字列が多く見つかったデータベースに対して例えば得点を付与するといった重み付けの機能を持っている。本実施の形態では、特定の文書集合の重みを更新する更新手段の一例として、ＤＢ重み計算部３７を設けている。 The DB weight holding unit 36 accumulates and holds the weight given to the database by the DB weight calculation unit 37. For example, the score given by the DB weight calculation unit 37 is added and held.
The DB weight calculation unit 37 calculates the database weight so that a search target DB having a word score suitable for the user is selected. This weight is linked to the confirmation instruction accepting unit 25, and a weighting function such as assigning a score to a database in which many of the same characters or character strings that the user has confirmed as the final character recognition result is found. have. In the present embodiment, a DB weight calculation unit 37 is provided as an example of an updating unit that updates the weight of a specific document set.

ＤＢ部４０の構成要素については、第１の実施の形態と同様なのでここでの説明は省略する。 Since the components of the DB unit 40 are the same as those in the first embodiment, description thereof is omitted here.

ここで、ＤＢ重み保持部３６の具体的な内容について説明する。
図７は、ＤＢ重み保持部３６で保持する情報の例について示した図である。
図示するように、ＤＢ重み保持部３６では、ユーザごとに、検索対象ＤＢが各データベースの重みと共に設定されている。
例えば、ユーザＩＤ「Ｕ０１」のユーザが文字認識を指示した場合には、検索対象ＤＢとして、ＤＢ＃１及びＤＢ＃２を用い、ＤＢ＃１を検索して得られた検索スコアの２倍の重みを、ＤＢ＃２を検索して得られた検索スコアに与えることが設定されている。 Here, specific contents of the DB weight holding unit 36 will be described.
FIG. 7 is a diagram showing an example of information held by the DB weight holding unit 36.
As shown in the figure, in the DB weight holding unit 36, a search target DB is set together with the weight of each database for each user.
For example, when the user with the user ID “U01” instructs character recognition, DB # 1 and DB # 2 are used as search target DBs, and the search score obtained by searching DB # 1 is doubled. The weight is set to be given to the search score obtained by searching DB # 2.

次に、第２の実施の形態における文字認識装置１０の動作について説明する。
図８は、このときの文字認識装置１０の動作例を示したフローチャートである。
まず、ユーザが手書きされた文字画像や印刷された文字画像を文字認識装置１０に入力すると、文字画像読取部２１が、この文字画像を読み取る（ステップ１５１）。
また、ユーザがユーザＩＤを入力する操作を行うと、ユーザＩＤ受付部２４が、入力されたユーザＩＤを受け付ける（ステップ１５２）。 Next, the operation of the character recognition device 10 in the second embodiment will be described.
FIG. 8 is a flowchart showing an operation example of the character recognition device 10 at this time.
First, when a user inputs a handwritten character image or a printed character image into the character recognition device 10, the character image reading unit 21 reads the character image (step 151).
Further, when the user performs an operation for inputting a user ID, the user ID receiving unit 24 receives the input user ID (step 152).

ＵＩ部２０はこのようにして情報を取得すると、これらの情報を処理部３０に出力する。
すると、まず、文字認識部３１が、文字画像読取部２１が出力した文字画像に対して文字認識を行う（ステップ１５３）。これにより、文字認識部３１は、予め定められた個数の候補文字について、その文字コードと、その確からしさ（確信度、尤度）を示す文字スコアとを得る。尚、この文字コードと文字スコアは、処理部３０の各機能から参照可能なメモリに記憶される。
次に、形態素解析部３２が、文字認識部３１が得た候補文字からなる文字列を形態素解析によって複数の語に分解する（ステップ１５４）。このとき、形態素解析によって得られた各語には、名詞、動詞、助詞、助動詞等の品詞情報が付されている。 When the UI unit 20 acquires the information in this way, the UI unit 20 outputs the information to the processing unit 30.
Then, first, the character recognition unit 31 performs character recognition on the character image output from the character image reading unit 21 (step 153). Thereby, the character recognizing unit 31 obtains the character code and the character score indicating the certainty (confidence, likelihood) of the predetermined number of candidate characters. The character code and the character score are stored in a memory that can be referred to from each function of the processing unit 30.
Next, the morphological analysis unit 32 decomposes the character string made up of candidate characters obtained by the character recognition unit 31 into a plurality of words by morphological analysis (step 154). At this time, parts of speech information such as nouns, verbs, particles, and auxiliary verbs are attached to each word obtained by morphological analysis.

次いで、検索キー作成部３３は、形態素解析部３２による形態素解析の結果に基づいて、分割単位を決定する（ステップ１５５）。例えば、形態素解析で得られた語のうち、助詞や助動詞の品詞情報が付された語は、前後の語が名詞、形容詞、動詞であればそれと連結して、分割単位とする。その他の場合は、形態素解析で得られた語をそのまま分割単位とする。 Next, the search key creation unit 33 determines a division unit based on the result of the morpheme analysis by the morpheme analysis unit 32 (step 155). For example, among words obtained by morphological analysis, words attached with participle or part of speech information of auxiliary verbs are connected as a division unit if the preceding and following words are nouns, adjectives, and verbs. In other cases, words obtained by morphological analysis are used as division units as they are.

その後、検索キー作成部３３は、分割単位ごとに、検索対象ＤＢの検索に用いる検索キーを作成する。
即ち、検索キー作成部３３は、まず、１つの分割単位に着目する（ステップ１５６）。
次に、着目している分割単位に含まれる各文字に対応する候補文字の組み合わせを取得する（ステップ１５７）。通常は、この取得した候補文字の組み合わせの全てを検索キーとすればよい。
ところが、このような検索キーの生成方法では、膨大な数の検索キーが生成される可能性がある。具体的には、候補文字の数をｍとし、分割単位内の文字の数をｎとすると、ｍのｎ乗個の検索キーが生成されることになる。そこで、このような事態を回避するため、候補文字の文字スコアの平均値が予め与えられた閾値より大きくなるような候補文字の組み合わせのみを検索キーとする。従って、検索キー作成部３３は、ステップ１５３で記憶した文字スコアを参照し、候補文字の文字スコアの平均値が予め定めた閾値よりも大きいかどうかを判定する（ステップ１５８）。 After that, the search key creation unit 33 creates a search key used for searching the search target DB for each division unit.
That is, the search key creation unit 33 first focuses on one division unit (step 156).
Next, a combination of candidate characters corresponding to each character included in the division unit of interest is acquired (step 157). Normally, all of the acquired combinations of candidate characters may be used as search keys.
However, in such a search key generation method, a huge number of search keys may be generated. Specifically, assuming that the number of candidate characters is m and the number of characters in the division unit is n, m-th n search keys are generated. Therefore, in order to avoid such a situation, only combinations of candidate characters whose average character score of candidate characters is larger than a predetermined threshold are used as search keys. Accordingly, the search key creation unit 33 refers to the character score stored in step 153 and determines whether the average value of the character scores of the candidate characters is greater than a predetermined threshold (step 158).

その結果、文字スコアの平均値が閾値以下である場合は、他の候補文字の組み合わせについてステップ１５８の判定を行う。一方、文字スコアの平均値が閾値よりも大きい場合は、その候補文字の組み合わせを検索キーとし（ステップ１５９）、候補文字の組み合わせが他にあるかどうかを判定する（ステップ１６０）。そして、候補文字の組み合わせが他にあれば、ステップ１５７〜１５９の処理を繰り返し、候補文字の組み合わせが他になければ、単語評価部３４の処理に移る。 As a result, if the average value of the character scores is equal to or less than the threshold value, the determination in step 158 is performed for other candidate character combinations. On the other hand, if the average value of the character scores is larger than the threshold value, the candidate character combination is used as a search key (step 159), and it is determined whether there are other candidate character combinations (step 160). If there are other candidate character combinations, the processes of steps 157 to 159 are repeated. If there are no other candidate character combinations, the process proceeds to the word evaluation unit 34.

そして、単語評価部３４は、検索キー作成部３３が作成した１つ以上の検索キーを受け取り、この検索キーを用いて、ユーザＩＤ受付部２４で受け付けたユーザＩＤにＤＢ重み保持部３６にて対応付けられた文書データ記憶部４１を検索対象ＤＢとして検索する（ステップ１６１）。そして、検索結果に基づいて、検索スコアを算出する（ステップ１６２）。ここで、検索スコアは、例えば、検索キーで検索された文書の件数（検索ヒット件数）や、検索された文書における検索キーの出現頻度の総和等で算出すればよい。 Then, the word evaluation unit 34 receives one or more search keys created by the search key creation unit 33 and uses the search key to add the user ID received by the user ID reception unit 24 to the DB weight holding unit 36. The associated document data storage unit 41 is searched as a search target DB (step 161). Then, a search score is calculated based on the search result (step 162). Here, the search score may be calculated by, for example, the number of documents searched with the search key (number of search hits), the sum of the appearance frequencies of the search keys in the searched document, or the like.

その後、単語評価部３４は、ユーザＩＤ受付部２４で受け付けたユーザＩＤにＤＢ重み保持部３６にて対応付けられた文書データ記憶部４１が他にあるかどうかを判定する（ステップ１６３）。そして、文書データ記憶部４１が他にあれば、ステップ１６１〜１６２の処理を繰り返す。また、文書データ記憶部４１が他になければ、単語評価部３４は、ＤＢ重み保持部３６にて保持されたＤＢごとの重みで、検索スコアの加重平均をとることにより、単語スコアを算出する（ステップ１６４）。 Thereafter, the word evaluation unit 34 determines whether there is another document data storage unit 41 associated with the user ID received by the user ID reception unit 24 in the DB weight holding unit 36 (step 163). If there is another document data storage unit 41, the processing of steps 161 to 162 is repeated. If there is no other document data storage unit 41, the word evaluation unit 34 calculates the word score by taking the weighted average of the search scores with the weight for each DB held in the DB weight holding unit 36. (Step 164).

次に、総合評価部３５は、ステップ１５３で文字認識部３１が得た文字スコアと、ステップ１６４で単語評価部３４が算出した単語スコアとを統合して、候補文字ごとの最終的な確からしさを示す総合スコアを算出する（ステップ１６５）。そして、総合スコアは認識結果表示部２３に送られ、認識結果表示部２３が、候補文字に最終的な順位を付加して表示する（ステップ１６６）。この順位としては、例えば、ステップ１６５で算出された総合スコアの高いものほど上位になるような順位を採用することが考えられる。
更に、総合評価部３５は、分割単位が他にあるかどうかを判定する（ステップ１６７）。そして、分割単位が他にあれば、制御を検索キー作成部３３に戻してステップ１５６〜１６６の処理を繰り返し、分割単位が他になければ、ＤＢ重みを修正する処理に移る。 Next, the comprehensive evaluation unit 35 integrates the character score obtained by the character recognition unit 31 in step 153 and the word score calculated by the word evaluation unit 34 in step 164 to obtain the final certainty for each candidate character. Is calculated (step 165). Then, the total score is sent to the recognition result display unit 23, and the recognition result display unit 23 adds the final ranking to the candidate characters and displays them (step 166). As this rank, for example, it is conceivable to adopt a rank such that the higher the total score calculated in step 165, the higher the rank.
Furthermore, the comprehensive evaluation unit 35 determines whether there are other division units (step 167). If there is another division unit, the control is returned to the search key creating unit 33, and the processes of steps 156 to 166 are repeated. If there is no other division unit, the process proceeds to a process of correcting the DB weight.

即ち、認識結果表示部２３が表示した候補文字及びその順位を妥当であると判断すると、ユーザは確定指示を入力し、確定指示受付部２５が、この確定指示の入力を受け付ける（ステップ１６８）。
すると、確定指示の入力を受け付けた旨は確定指示受付部２５からＤＢ重み計算部３７へと伝えられ、ＤＢ重み計算部３７が、確定指示に基づいて、ＤＢ重み保持部３６にて保持されている各データベースの重みを変更し（ステップ１６９）、第２の実施の形態の動作は終了する。 That is, when it is determined that the candidate characters displayed by the recognition result display unit 23 and their ranks are valid, the user inputs a confirmation instruction, and the confirmation instruction receiving unit 25 receives the input of the confirmation instruction (step 168).
Then, the fact that the input of the confirmation instruction has been accepted is transmitted from the confirmation instruction receiving unit 25 to the DB weight calculation unit 37, and the DB weight calculation unit 37 is held by the DB weight holding unit 36 based on the confirmation instruction. The weight of each database is changed (step 169), and the operation of the second embodiment ends.

Ｔ_ｉｊは、ある分割単位におけるｉ番目の文字のｊ番目の候補文字の総合スコアである。
Ｃ_ｉｊは、ある分割単位におけるｉ番目の文字のｊ番目の候補文字の文字スコアである。
Ｗ_ｍは、ｍ番目の検索対象ＤＢに対して付与された重みである。
Ｓ_ｉｊｋｍは、ある分割単位におけるｉ番目の文字のｊ番目の候補文字を含むｋ番目の検索キーでｍ番目の検索対象ＤＢを検索したときに得られた検索スコアである。尚、Ｓ_ｉｊｋｍとしては、ヒット件数や、ヒット全文書中の検索キーの出現頻度等を用いればよい。また、出現頻度が０である場合でも単語スコアが０にならないように事前頻度１等を全ての検索キーに加えてもよい。
そして、上記式は、文字スコアＣ_ｉｊと、（Ｗ_ｍ×Ｓ_ｉｊｋｍ）のｍに関する総和のｋを変動させた場合の最大値である単語スコアとを掛け合わせることにより、総合スコアＴ_ｉｊが得られることを示している。 _Tij is a total score of the jth candidate character of the ith character in a certain division unit.
C _ij is the character score of the j-th candidate character of the i-th character in a certain division unit.
W _m is a weight assigned to the m-th search target DB.
S _ijkm is a search score obtained when the mth search target DB is searched with the kth search key including the jth candidate character of the ith character in a certain division unit. As S _ijkm , the number of hits, the appearance frequency of search keys in all hit documents, and the like may be used. Further, even when the appearance frequency is 0, the prior frequency 1 or the like may be added to all the search keys so that the word score does not become 0.
Then, the above formula is obtained by multiplying the character score C _ij by the word score which is the maximum value when k of the total sum relating to m of (W _m × S _ijkm ) is changed to obtain the total score T _ij. It is shown that.

ここで、図８に示した動作例を、具体例を用いて説明する。
まず、ステップ１５３で文字認識部３１によって算出され、図示しないメモリを介して検索キー作成部３３及び総合評価部３５に渡される文字スコアＣ_ｉｊは、図３に示したものと同様である。 Here, the operation example shown in FIG. 8 will be described using a specific example.
First, the character score C _ij calculated by the character recognition unit 31 in step 153 and passed to the search key creation unit 33 and the comprehensive evaluation unit 35 via a memory (not shown) is the same as that shown in FIG.

図９は、ステップ１６２で単語評価部３４によって算出される検索スコアＳ_ｉｊｋｍを示した図である。但し、ステップ１５８で用いる閾値を８０とし、文字スコアの平均値がこの閾値を超える候補文字の組み合わせについて検索スコアを示している。また、ここでは、図７に示したユーザＩＤ「Ｕ０１」のユーザが文字認識を指示する場合を想定し、検索対象ＤＢとしてＤＢ＃１及びＤＢ＃２を用いるものとする。従って、検索スコアとしては、Ｓ_ｉｊｋ１及びＳ_ｉｊｋ２が算出されている。 FIG. 9 is a diagram illustrating the search score S _ijkm calculated by the word evaluation unit 34 in step 162. However, the threshold used in step 158 is 80, and the search score is shown for a combination of candidate characters whose average character score exceeds this threshold. Here, assuming that the user with the user ID “U01” shown in FIG. 7 instructs character recognition, DB # 1 and DB # 2 are used as search target DBs. Accordingly, S _ijk1 and S _ijk2 are calculated as search scores.

例えば、１番目の文字の１番目の候補文字「士」を含む検索キーのうち、文字スコアの平均値が８０を超えるものは、「士田」と「士旧」である。そこで、図には、「士田」を検索キーとしてＤＢ＃１を検索して得られた検索スコアＳ_１１１１と、「士田」を検索キーとしてＤＢ＃２を検索して得られた検索スコアＳ_１１１２と、「士旧」を検索キーとしてＤＢ＃１を検索して得られた検索スコアＳ_１１２１と、「士旧」を検索キーとしてＤＢ＃２を検索して得られた検索スコアＳ_１１２２とが示されている。この検索スコアＳ_１１１１と検索スコアＳ_１１１２を重み付けして足し合わせた第１の加重平均と、検索スコアＳ_１１２１と検索スコアＳ_１１２２を重み付けして足し合わせた第２の加重平均の中で最大である第１の加重平均が単語スコアとして、図示しないメモリを介して総合評価部３５に渡される。同様に、２番目の候補文字「キ」を含む検索キー、３番目の候補文字「土」を含む検索キー、４番目の候補文字「工」を含む検索キーのうち、文字スコアの平均値が８０を超える検索キーを用いた場合の検索スコアも示されている。尚、５番目の候補文字「ユ」を含む検索キーで、文字スコアの平均値が８０を超えるものはないので、候補文字「ユ」を含む検索キーを用いた場合の検索スコアを格納する欄は設けていない。 For example, among the search keys including the first candidate character “shi” of the first character, those having an average character score of more than 80 are “shida” and “shiji”. Therefore, in the figure, a search score S ₁₁₁₁ obtained by searching DB # 1 using “Sida” as a search key, and a search score obtained by searching DB # 2 using “Sida” as a search key. S ₁₁₁₂ , a search score S ₁₁₂₁ obtained by searching DB # 1 using “shiji” as a search key, and a search score S ₁₁₂₂ obtained by searching DB # 2 using “shiji” as a search key Is shown. Up to the first weighted average and that sum by weighting the search score _{S 1111} a search score _{S 1112,} in the second weighted average sum by weighting the search score _{S 1121} a search score _{S 1122} A certain first weighted average is passed as a word score to the comprehensive evaluation unit 35 via a memory (not shown). Similarly, among the search key including the second candidate character “K”, the search key including the third candidate character “Sat”, and the search key including the fourth candidate character “K”, the average value of the character scores is A search score when using search keys exceeding 80 is also shown. Since there is no search key including the fifth candidate character “yu” and the average character score exceeds 80, a field for storing the search score when the search key including the candidate character “yu” is used. Is not provided.

また、２番目の文字の１番目の候補文字「田」を含む検索キーのうち、文字スコアの平均値が８０を超えるものは、「士田」と「キ田」と「土田」と「工田」である。そこで、図には、「士田」を検索キーとしてＤＢ＃１を検索して得られた検索スコアＳ_２１１１と、「士田」を検索キーとしてＤＢ＃２を検索して得られた検索スコアＳ_２１１２と、「キ田」を検索キーとしてＤＢ＃１を検索して得られた検索スコアＳ_２１２１と、「キ田」を検索キーとしてＤＢ＃２を検索して得られた検索スコアＳ_２１２２と、「土田」を検索キーとしてＤＢ＃１を検索して得られた検索スコアＳ_２１３１と、「土田」を検索キーとしてＤＢ＃２を検索して得られた検索スコアＳ_２１３２と、「工田」を検索キーとしてＤＢ＃１を検索して得られた検索スコアＳ_２１４１と、「工田」を検索キーとしてＤＢ＃２を検索して得られた検索スコアＳ_２１４２とが示されている。この検索スコアＳ_２１１１と検索スコアＳ_２１１２を重み付けして足し合わせた第１の加重平均と、検索スコアＳ_２１２１と検索スコアＳ_２１２２を重み付けして足し合わせた第２の加重平均と、検索スコアＳ_２１３１と検索スコアＳ_２１３２を重み付けして足し合わせた第３の加重平均と、検索スコアＳ_２１４１と検索スコアＳ_２１４２を重み付けして足し合わせた第４の加重平均の中で最大である第３の加重平均が単語スコアとして、図示しないメモリを介して総合評価部３５に渡される。同様に、２番目の候補文字「旧」を含む検索キーのうち、文字スコアの平均値が８０を超える検索キーを用いた場合の検索スコアも示されている。尚、３番目の候補文字「口」を含む検索キー、４番目の候補文字「十」を含む検索キー、５番目の候補文字「Ｘ」を含む検索キーで、文字スコアの平均値が８０を超えるものはないので、候補文字「口」、「十」、「Ｘ」を含む検索キーを用いた場合の検索スコアを格納する欄は設けていない。 Of the search keys that include the first candidate character “da” of the second character, those with an average character score exceeding 80 are “Shida”, “Kita”, “Tsuchida”, and “ Rice field ". Therefore, in the figure, the search score S ₂₁₁₁ obtained by searching the DB # 1 as a search key "Sita", "Sita" search key as DB # 2 search-obtained search score S ₂₁₁₂ , a search score S ₂₁₂₁ obtained by searching DB # 1 using “Kida” as a search key, and a search score S ₂₁₂₂ obtained by searching DB # 2 using “Kida” as a search key And a search score S ₂₁₃₁ obtained by searching DB # 1 using “Tsuchida” as a search key, a search score S ₂₁₃₂ obtained by searching DB # 2 using “Tsuchida” as a search key, A search score S ₂₁₄₁ obtained by searching DB # 1 using “Ta” as a search key and a search score S ₂₁₄₂ obtained by searching DB # 2 using “Koda” as a search key are shown. . First weighted average and the sum are weighted the search score _{S 2111} a search score _{S 2112,} a second weighted average sum by weighting the search score _{S 2121} a search score _{S 2122,} the search score S third weighted average and the sum are weighted ₂₁₃₁ with search score _{S 2132,} the third is the fourth largest of the weighted average of the sum by weighting the search score _{S 2141} a search score _{S 2142} The weighted average is passed as a word score to the comprehensive evaluation unit 35 via a memory (not shown). Similarly, the search score when the search key including the second candidate character “Old” using the search key with an average value of the character score exceeding 80 is also shown. It should be noted that a search key including the third candidate character “mouth”, a search key including the fourth candidate character “ten”, a search key including the fifth candidate character “X”, and an average character score of 80 Since there is nothing exceeding, there is not provided a column for storing a search score when a search key including candidate characters “mouth”, “ten”, and “X” is used.

尚、図９では、ＤＢ＃１を検索することによって得られた検索スコアＳ_ｉｊｋ１の値、及び、ＤＢ＃２を検索することによって得られた検索スコアＳ_ｉｊｋ２の値として、図７でユーザＩＤ「Ｕ０１」のユーザに対して設定されたＤＢごとの重み付けに基づいて加重平均をとると図４のＳ_ｉｊｋと等しくなるような値を例示している。
従って、ステップ１６５で総合評価部３５によって算出される総合スコアＴ_ｉｊは、図５に示したものと同様のものとなる。 In FIG. 9, the value of the search score _{S Ijk1} obtained by searching the DB # 1, and, as the value of the search score _{S Ijk2} obtained by searching the DB # 2, the user ID in FIG. 7 A value that is equal to S _{ijk in} FIG. 4 is illustrated by taking a weighted average based on the weight for each DB set for the user of “U01”.
Therefore, the total score T _ij calculated by the total evaluation unit 35 in step 165 is the same as that shown in FIG.

また、ステップ１６８でのＤＢ重みの修正について説明する。
例えば、ステップ１６７でユーザが「土田」を文字認識結果として確定させる指示を行ったとする。この場合、図９において、「土田」は、ＤＢ＃２において多く見つかっていることが分かる。
一方、図７を参照すると、ユーザＩＤ「Ｕ０１」のユーザに対しては、ＤＢ＃１の重みが１で、ＤＢ＃２の重みが２となっている。このような場合、ユーザが最終的に確定した単語がより多く見つかったＤＢ＃２の重みを２よりも大きな値に変更する。ここで、重みをどの程度上げるかについては、予め基準を設定しておき、その基準に基づくようにするとよい。 The correction of the DB weight in step 168 will be described.
For example, it is assumed that the user gives an instruction to confirm “Tsuchida” as a character recognition result in step 167. In this case, it can be seen in FIG. 9 that many “Tsuchida” are found in DB # 2.
On the other hand, referring to FIG. 7, the weight of DB # 1 is 1 and the weight of DB # 2 is 2 for the user with the user ID “U01”. In such a case, the weight of DB # 2 in which more words finally determined by the user are found is changed to a value larger than 2. Here, as to how much the weight is to be increased, it is preferable to set a reference in advance and make it based on the reference.

尚、この第２の実施の形態において、ステップ１６１では、図７で重みが０以外のデータベースを検索対象ＤＢとして決定したが、これには限らない。例えば、重みの閾値を設定し、重みが０以外のデータベースであってもこの閾値を超える重みのデータベースのみを検索対象ＤＢとすることも考えられる。或いは、例えば、重みの大きいものから予め定めた最大数までデータベースを選択してこれを検索対象ＤＢとしてもよい。また、文字認識装置１０が検索対象ＤＢをユーザの意思に関係なく決定するのではなく、ユーザの選択を補助するように、優先すべきデータベースを提示するようにしてもよい。 In the second embodiment, in step 161, the database having a weight other than 0 in FIG. 7 is determined as the search target DB. However, the present invention is not limited to this. For example, it is also conceivable that a threshold value for weight is set and only a database having a weight exceeding this threshold value is set as a search target DB even if the database has a weight other than zero. Alternatively, for example, databases may be selected from a large weight to a predetermined maximum number, and this may be used as a search target DB. Further, the character recognition device 10 may present a database to be prioritized so as to assist the user's selection instead of determining the search target DB regardless of the user's intention.

また、本実施の形態では、ユーザＩＤに対して検索対象ＤＢ及び重みを定義し、文字認識を指示したユーザのユーザＩＤに対応する検索対象ＤＢ及び重みを用いて単語スコアを算出するようにした。これは、ユーザ特有の語彙等をカバーできる可能性が高まり、文字認識の精度向上が期待できるからである。しかしながら、このように文字認識に関して個別の精度を要求する単位としては、ユーザ以外にも、例えば、組織、文書、文書の種類等が考えられる。即ち、組織に対して検索対象ＤＢ及び重みを定義し、文字認識を指示したユーザが所属する組織に対応する検索対象ＤＢ及び重みを用いて単語スコアを算出したり、文書や文書の種類に対して検索対象ＤＢ及び重みを定義し、文字認識の対象の文書や文書の種類に対応する検索対象ＤＢ及び重みを用いて単語スコアを算出したりする構成を採用してもよい。 In this embodiment, a search target DB and a weight are defined for the user ID, and a word score is calculated using the search target DB and the weight corresponding to the user ID of the user who has instructed character recognition. . This is because the possibility of covering vocabulary and the like unique to the user is increased, and an improvement in character recognition accuracy can be expected. However, in addition to the user, for example, an organization, a document, a document type, and the like can be considered as a unit for requesting individual accuracy regarding character recognition. That is, the search target DB and weight are defined for the organization, and the word score is calculated using the search target DB and the weight corresponding to the organization to which the user who instructed character recognition belongs, Alternatively, a configuration may be employed in which the search target DB and the weight are defined, and the word score is calculated using the search target DB and the weight corresponding to the character recognition target document or document type.

ところで、本実施の形態における文字認識結果の修正処理は、汎用のコンピュータにおいて実現してもよい。そこで、この処理をコンピュータ９０で実現するものとして、そのハードウェア構成について説明する。
図１０は、コンピュータ９０のハードウェア構成を示した図である。
図示するように、コンピュータ９０は、演算手段であるＣＰＵ（Central Processing Unit）９１と、記憶手段であるメインメモリ９２及び磁気ディスク装置（ＨＤＤ：Hard Disk Drive）９３とを備える。ここで、ＣＰＵ９１は、ＯＳ（Operating System）やアプリケーション等の各種ソフトウェアを実行し、上述した各機能を実現する。また、メインメモリ９２は、各種ソフトウェアやその実行に用いるデータ等を記憶する記憶領域であり、磁気ディスク装置９３は、各種ソフトウェアに対する入力データや各種ソフトウェアからの出力データ等を記憶する記憶領域である。
更に、コンピュータ９０は、外部との通信を行うための通信Ｉ／Ｆ９４と、ビデオメモリやディスプレイ等からなる表示機構９５と、キーボードやマウス等の入力デバイス９６とを備える。 By the way, the correction process of the character recognition result in the present embodiment may be realized by a general-purpose computer. Therefore, the hardware configuration will be described assuming that this processing is realized by the computer 90.
FIG. 10 is a diagram illustrating a hardware configuration of the computer 90.
As shown in the figure, the computer 90 includes a CPU (Central Processing Unit) 91 as a calculation means, a main memory 92 as a storage means, and a magnetic disk device (HDD: Hard Disk Drive) 93. Here, the CPU 91 executes various types of software such as an OS (Operating System) and applications to realize the above-described functions. The main memory 92 is a storage area for storing various software and data used for execution thereof, and the magnetic disk device 93 is a storage area for storing input data for various software, output data from various software, and the like. .
Further, the computer 90 includes a communication I / F 94 for performing communication with the outside, a display mechanism 95 including a video memory and a display, and an input device 96 such as a keyboard and a mouse.

尚、本実施の形態を実現するプログラムは、通信手段により提供することはもちろん、ＣＤ−ＲＯＭ等の記録媒体に格納して提供することも可能である。 The program for realizing this embodiment can be provided not only by communication means but also by storing it in a recording medium such as a CD-ROM.

１０…文字認識装置、２０…ＵＩ部、３０…処理部、４０…ＤＢ部 DESCRIPTION OF SYMBOLS 10 ... Character recognition apparatus, 20 ... UI part, 30 ... Processing part, 40 ... DB part

Claims

First acquisition means for acquiring a certainty factor in the recognition process of a specific character included in the character string obtained as a result of the recognition process of any one of the character recognition process and the voice recognition process;
Generating means for generating a search word including the specific character included in the character string and a character immediately before or after the specific character included in the character string;
Second acquisition means for acquiring an index relating to use of the search word by searching a set of documents prepared for a purpose other than the recognition processing using the search word generated by the generation means; ,
A recognition result comprising correction means for correcting the certainty factor of the specific character acquired by the first acquisition means based on the index acquired by the second acquisition means Correction device.

An identification information receiving means for receiving identification information for identifying a unit that requires individual accuracy with respect to the recognition processing;
The recognition result correcting apparatus according to claim 1, wherein the second obtaining unit searches the document set associated in advance with the identification information received by the identification information receiving unit.

The second acquisition means includes the search word in each document set obtained by searching the plurality of document sets previously associated with the identification information received by the identification information receiving means using the search word. The recognition result according to claim 2, wherein the index is acquired based on a use frequency of the document and a weight of each document set associated in advance with the identification information received by the identification information receiving unit. Correction device.

A confirmation instruction accepting unit that accepts an instruction to confirm the specific character included in the character string as a result of the recognition process;
When the confirmation instruction receiving unit receives the instruction, the weight of the specific document set is updated based on the use frequency of the search word obtained by searching the specific document set using the search word. The recognition result correcting apparatus according to claim 3, further comprising update means.

A selection instruction receiving means for receiving an instruction of a user for selecting the document set;
The recognition result correcting apparatus according to claim 1, wherein the second acquisition unit searches the document set selected by the instruction received by the selection instruction receiving unit.

It further comprises morphological analysis means for performing morphological analysis on the character string,
The said generation means specifies the character immediately before or after the said specific character included in the said search word based on the result of the morphological analysis by the said morpheme analysis means, The one in any one of Claim 1 thru | or 5 characterized by the above-mentioned. The recognition result correction device described.

Reading means for reading the image from the recording medium on which the image is recorded;
First acquisition means for acquiring a certainty factor in character recognition of a specific character included in a character string obtained as a result of performing character recognition on the image read by the reading means;
Generating means for generating a search word including the specific character included in the character string and a character immediately before or after the specific character included in the character string;
Second acquisition means for acquiring an index relating to use of the search word by searching a document set prepared for use other than the character recognition using the search word generated by the generation means; ,
Correction means for correcting the certainty factor of the specific character acquired by the first acquisition means based on the index acquired by the second acquisition means;
An image processing apparatus comprising: a display unit configured to display the specific character included in the character string based on the certainty factor of the specific character after correction by the correction unit.

On the computer,
A function of acquiring a certainty factor in the recognition process of a specific character included in the character string obtained as a result of the recognition process of any one of the character recognition process and the voice recognition process;
A function of generating a search term including the specific character included in the character string and a character immediately before or after the specific character included in the character string;
A function for obtaining an index relating to use of the search word by searching a set of documents prepared for use other than the recognition process using the search word;
A program for realizing a function of correcting the certainty factor of the specific character based on the index.