JP4633271B2

JP4633271B2 - Dictionary learning method and dictionary learning program

Info

Publication number: JP4633271B2
Application number: JP2001008981A
Authority: JP
Inventors: 寿太郎石岡
Original assignee: 株式会社日本デジタル研究所
Priority date: 2001-01-17
Filing date: 2001-01-17
Publication date: 2011-02-16
Anticipated expiration: 2021-01-17
Also published as: JP2002216075A

Description

【０００１】
【発明の属する技術分野】
本発明は、文字認識装置に関し、特に、棄却文字或いは誤認識文字の修正入力の際に認識辞書に登録されている標準パターンを更新（修正／追加）する辞書学習技術に関する。
【０００２】
【従来の技術】
文字認識の過程では、先ず、切出された文字の特徴量抽出と、抽出された特徴量と認識辞書内の各テンプレートの特徴量との比較がなされ、最も類似度の高い特徴量を有する候補文字が正解文字とされその文字コードが取得される。
しかし、読取られる文字は同じ文字でも形や大きさや線の太さが異なったり線のかすれがあったりして必ずしも正確に認識できるとは限らない。
特に、手書き文字を認識する場合には上記のような要因に加え、個人の書き癖等の要因も加わって認識率が低下する。
【０００３】
上述のような問題の解決方法として、不読文字或いは誤認識文字の修正入力の際に認識辞書に登録されている標準パターンを更新（修正／追加）する方法、いわゆる'認識辞書の学習'により認識率向上を図る方法がある。認識率の学習方法に関する従来技術として、（１）特開平２−１８６４８４号公報の「文字認識装置」や特開平９ー１８５６８２号公報の「認識辞書の学習方式」等に開示の技術のように修正入力の際に、逐一その文字を学習させるか否かをオペレータが判断して指示を与える方法、（２）特開平７−１２９７２４号公報の「文字認識装置ならびにその辞書学習方法および辞書作成方法」に開示の技術のように、修正と同時に学習を行なう方法、（３）特開平７−２７１９１７号公報の「手書き文字認識辞書作成方法及び装置」や、（４）特開平８−２８７１９１号公報の「光学的文字読取り装置」等に開示の技術のように、学習後の認識辞書の妥当性を認識シミュレーションによって調べる方法、がある。
【０００４】
しかしながら、上記（１）の技術では、オペレータの作業負担が増加する点と文字学習の要否の判断のための専門的知識をオペレータが持っていなければ精度の高い'認識辞書の学習'ができないという問題点があり、上記（２）の技術では、学習文字の妥当性を考慮していないので、学習の効率性および学習後の認識辞書の安定性に欠けるという問題点があり、上記（３）、（４）の技術では、学習後の認識辞書の妥当性のシミュレーションに時間を要し、また、認識シミュレーションに必要なデータを格納する記憶手段を追加する必要があるという不具合があった。
【０００５】
このような問題点を解決する技術として、（５）特開平１１−１８４９７６号に開示の技術（「辞書学習方式及び文字認識装置」）等がある。上記（５）の技術によれば、学習に不適な学習文字を認識距離に基づいて自動的に除外することができ、更に、学習文字を学習対象となる認識辞書のテンプレート別に分類して認識距離に基づいて選択できるので、オペレータの判断を必要とせずオペレータに文字特徴や学習に係わる専門的知識を要求する必要がないといった利点や、学習文字の最小認識距離を基準にテンプレートの追加の要否を判定して認識辞書データの更新または追加を行なうので、オペレータの判断なしにその学習文字について最適な学習方法を選択できるといった利点がある。
【０００６】
【発明が解決しようとする課題】
しかしながら、上記（５）の技術では、認識辞書の作り方によっては追加したい文字（認識辞書データ）が追加できないといった問題点や、このような文字を追加するために閾値を緩めると、追加したくない他の粗悪文字が追加されてしまうといった問題点があった。
【０００７】
本発明は、上記問題点を解決するためになされたものであり、効率的に認識辞書の学習を行なうと共に学習後の認識辞書に高い妥当性と安定性を与え得る辞書学習方法及び辞書学習プログラムの提供を目的とする。
【０００８】
上記課題を解決するために、請求項１に記載の発明では、認識辞書の学習処理を行う文字認識方法において、
学習対象の文字が認識辞書の学習に適しているか否かを判定するための辞書学習閾値を、当該学習対象の文字の認識に用いた辞書が辞書学習閾値による登録処理判定の対象となった回数が所定の値未満の場合、若しくは該登録処理判定の対象となった回数に対する該登録判定処理により辞書データが前記辞書に登録された回数の割合が所定の割合以上の場合は、辞書学習閾値を標準値より大きな値に変更する辞書学習閾値緩和工程と、前記割合が所定の値未満の場合は辞書学習閾値を標準値に戻す工程と、学習対象の文字と該学習対象の文字の認識に用いた辞書との認識距離と、辞書学習閾値とを比較し、認識距離が辞書学習閾値未満の場合は前記学習対象の文字に係る辞書データを該学習対象の文字の認識に用いた辞書に登録する登録処理判定工程と、を備えたことを特徴とする辞書学習方法を提供する。
【００１４】
また、請求項２に記載の発明では、読み取られた原稿イメージから切り出して得た文字イメージの特徴を抽出し、認識辞書を用いて各文字イメージの認識結果を出力する文字認識装置において実行されるプログラムであって、コンピュータに、棄却文字イメージおよび誤認識文字の修正時に修正された棄却文字イメージおよび誤認識文字に対応付けてメモリに記憶された認識情報から学習対象の文字の認識情報を取得するステップ、取得した認識情報から認識に使用した辞書テンプレート番号を取り出し、メモリに記憶されている当該辞書テンプレートの学習処理回数と学習回数を読み出して学習処理回数を所定値αと比較し、学習処理回数≧αの場合は学習回数に対する学習処理回数の割合ｒを計算するステップ、学習処理回数＜αの場合若しくは前記割合ｒ≧所定値βの場合は学習対象の文字が認識辞書の学習に適しているか否かを判定するための辞書学習閾値に標準値より大きな値を設定するステップ、前記割合ｒ＜所定値βの場合は辞書学習閾値に標準値を設定するステップ、学習対象の文字と該学習対象の文字の認識に用いた辞書との認識距離と、辞書学習閾値とを比較し、認識距離＜辞書学習閾値の場合は認識修正処理の対象となった文字の辞書データを該学習対象の文字の認識に用いた辞書に登録すると共に学習回数を１つ増加させるステップ、学習処理回数を１つ増加させるステップ、を実行させる辞書学習プログラムを提供する。
【００１５】
【発明の実施の形態】
１．構成
図１は本発明の認識文字の修正方法を適用可能な文字認識装置の一実施例の構成を示すブロック図であり、図２は認識処理部２の一実施例を示すブロック図である。
【００１６】
図１で、文字認識装置１０は、原稿読取り装置１、認識処理部２、ハードディスク（ＨＤ）３、辞書学習処理部４、モニタ５及びキーボード６を備えている。
原稿読取り装置１はＯＣＲ（光学的文字読取り装置）やスキャナー等のイメージリーダからなり、原稿を読み取ってイメージデータに変換し、認識処理部２に渡す。
また、認識処理部２は、図２に示すように文字認識部２１、認識文字修正部２２、制御部２３及び認識辞書３１を備えている。
【００１７】
図２で、文字認識部２１は原稿読取り装置１から受け取ったイメージデータから１文字分ずつ文字イメージを切り出して文字認識処理を行い、認識結果（文字コード或いは棄却コード）を出力する。
【００１８】
認識文字修正部２２は棄却イメージの修正或いは誤認識の修正のためにオペレータによってキーボード６から修正入力がされた場合には、それら棄却イメージ或いは誤認識された文字の修正を行い、ハードディスク３に書き込まれた認識結果を更新する。また、この際、認識結果を更新した箇所には修正情報を付加してハードディスク３に書き込む。
【００１９】
制御部２３はＣＰＵ、内部メモリ（ＲＡＭ）およびその周辺回路からなり、上述した文字認識装置１０全体の制御と、文字認識装置１０及び認識処理部２の各構成部分の動作を制御する。また、制御部２３はハードディスク３又はプログラム格納用ＲＯＭに格納された認識処理プログラム（図２の文字認識部２１及び認識文字修正部２２に相当）及び本発明の辞書学習プログラム（図２の辞書学習処理部４に相当）の実行を制御し文字認識を行う。
【００２０】
ハードディスク３には認識情報、つまり、文字イメージ、文字イメージの特徴量、認識結果である文字コード（又は、修正結果である文字コード）、使用した辞書テンプレートの番号及び修正のあった場合の修正情報を格納する領域のほか、認識辞書３１を格納する領域及び各辞書の学習処理回数及び学習回数（後述）を書き込む領域が確保されている（認識辞書３１はＲＯＭ又は物理的に別のハードディスクとしてもよい）。また、ハードディスク３には認識処理プログラム及び辞書学習プログラムのほか文字認識装置１０の実行制御に必要な各種プログラム群を格納することもできる。
【００２１】
辞書学習処理部４は、認識処理部２でオペレータによって修正された認識結果（文字コード）と共にハードディスク３に記憶される修正情報を基に、棄却文字又は誤認識文字の特徴量等の辞書データを認識辞書に追加できるかどうかを判定して追加可能な場合にその文字の特徴量をそのカテゴリの辞書テンプレートに追加（登録）する。ここで、修正情報はオペレータによって修正されたか否かを示すフラグである。なお、修正情報の代わりに修正前の認識結果（文字コード）を修正後の認識結果（文字コード）と共にハードディスク３に記憶するようにしてもよい。
【００２２】
２．動作
図３は、図１の辞書学習処理部４の動作の一実施例を示すフローチャートであり、各ステップの動作シーケンスの制御は制御部２３によって行われる。
【００２３】
ステップＳ１：（修正文字の有無判定）
図３で、辞書学習処理部４は、上述した文字認識処理によりハードディスク３に書き込まれた認識情報（ハードディスクには、文字イメージ、文字イメージの特徴量、認識結果である文字コード（又は、修正結果である文字コード）と修正された場合に修正情報が書き込まれている）を順次サーチし、上述した認識文字修正処理のあった認識情報があった場合、つまり、オペレータによって修正された認識情報があった場合には、その認識情報を読み出すと共に、文字イメージの認識情報のハードディスク３上の先頭アドレスを保持してＳ２に遷移し、修正処理の対象となった文字がハードディスク３に格納されている認識結果の中にない場合には辞書学習処理を終了する。なお、認識文字修正処理の有無は実施例ではハードディスク３に書き込まれる認識結果（認識文字修正処理があった場合は修正結果＝認識結果）に対応して修正情報が書き込まれているか否かを調べることによって判定しているがこれに限定されない。
【００２４】
ステップＳ２：（使用辞書の学習処理回数及び学習回数の読み出し）
辞書学習処理部４は、上記ステップＳ１で読み出した認識情報の中から使用した辞書テンプレート番号を取り出し、ハードディスク３から認識文字修正処理で使用された辞書（つまり、登録判定の対象となっている辞書）の学習処理回数と学習回数をハードディスク３から読み出してメモリに保持する。ここで、学習処理回数は辞書に追加（登録）されたか否かに係らず、後述のステップＳ６で辞書が閾値による登録判定処理の対象となった回数を意味し、学習回数はステップＳ６の判定処理の結果、ステップＳ７で辞書データが登録された回数を意味し、学習処理回数はステップＳ８でカウントされ、学習回数はステップＳ７でカウントされる。
【００２５】
ステップＳ３：（辞書学習閾値緩和の要否判定）
辞書学習処理部４は、上記ステップＳ２で取り出した学習処理回数を所定値αと比較し、学習処理回数がα未満の場合はこの辞書の学習処理回数が少ないため辞書が成長段階にあると判定して学習閾値を緩和するためＳ４に遷移する。また、学習処理回数がα以上の場合は辞書が成長期を過ぎていると判定して学習回数と学習処理回数の割合を算出し、算出結果が所定値β（０≦β≦１）未満の場合は学習対象の辞書が安定し、成熟期に達していると判定してＳ５に遷移し、β以上の場合は成熟期に達していないと判定してＳ４に遷移する（つまり、学習回数÷学習処理回数＜βならＳ５に、学習回数÷学習処理回数≧βならＳ４に遷移する）。
【００２６】
ステップＳ４：（辞書学習閾値の緩和）
辞書学習処理部４は、上記ステップＳ３で辞書が成長段階にあるか、成長期を過ぎていてもまだ安定していないと判定した場合には、辞書学習判定用の閾値（辞書学習閾値）の値を標準値より大きな値（緩和値）に設定し直してＳ６に遷移する。なお、緩和値は、学習処理回数又は学習回数と学習処理回数の割合によって予め定めた所定の値（実験値又は経験値）である。
【００２７】
ステップＳ５：（辞書学習閾値の不緩和）
辞書学習処理部４は、上記ステップＳ３で辞書が安定していると判定した場合には、辞書学習判定用の閾値（辞書学習閾値）の値を標準値（所定の値）に設定してＳ６に遷移する。
【００２８】
ステップＳ６：（辞書学習閾値による辞書データ登録可否の判定）
辞書学習処理部４は、認識距離、つまり、登録判定されるカテゴリの辞書の特徴量と修正対象となった文字イメージの特徴量との距離（例えば、ユークリッド距離）を算出して、上記ステップＳ４またはＳ５で設定された辞書学習閾値と比較し、認識修正処理の対象となった文字の辞書データのこの辞書への登録の可否を判定する。ここで、認識距離が辞書学習閾値未満の場合は登録可としてＳ７に遷移する。また、認識距離が辞書学習閾値以上の場合は登録不可としてＳ８に遷移する。
【００２９】
ステップＳ７：（辞書データの登録及びカウント）
辞書学習処理部４は修正対象となった文字イメージの辞書データを上記ステップＳ６で登録判定の対象となった辞書に登録（追加記憶）し、ハードディスク３の学習回数カウンタに１を加える。
【００３０】
ステップＳ８：（学習処理回数のカウント）
辞書学習処理部４はハードディスク３の学習処理回数カウンタに１を加える。
【００３１】
ステップＳ９：（次の認識結果の先頭アドレス取得）
辞書学習処理部４は上記ステップＳ１で保持した認識情報の先頭アドレスに認識情報レコード長を加え、次の認識情報の先頭アドレスを取得してＳ１に遷移する（これにより、ステップＳ１では今回取得した修正済み認識結果の次の認識結果からサーチを行う）。
【００３２】
なお、上記図３のフローチャートでは、文字認識処理でハードディスクに書き込んだ全原稿の認識結果について認識文字修正処理が終わった後で、辞書学習処理のためハードディスクに書き込まれた（全原稿の）認識結果を順次サーチするように構成したが、文字認識処理でハードディスクに書き込んだ全原稿の認識結果について１枚の原稿分の認識文字修正処理が終わった後で、その１枚の原稿分の認識結果についてサーチするように構成してもよく、或いは、文字認識処理でハードディスクに書き込んだ原稿の認識結果について、認識文字修正時にオペレータの認識文字修正入力が行われる毎に上記ステップＳ２〜Ｓ８を実行するように構成してもよい。１枚の原稿分の認識結果についてサーチするように構成した場合には、ステップＳ７、Ｓ８のカウントは原稿単位のカウントとなり、オペレータの認識文字修正入力が行われる毎に辞書学習処理を行うようにした場合はカテゴリ毎のカウントとなる。
【００３３】
上記構成により、従来登録されなかった文字イメージの辞書データについても学習処理回数、学習回数をカウントしてそれらの値を基にそのカテゴリの辞書が成長期か否かを判定し、成長期にある場合には辞書学習閾値を緩めることにより学習辞書に登録することが可能となる。また、辞書が成熟していて安定している場合には辞書学習閾値を標準値に設定するので、辞書のバランスを欠くような辞書データの登録を防止でき、安定した学習辞書とすることができる。
【００３４】
（変形例）
なお、上記図３の辞書学習処理部のステップＳ６の辞書学習閾値による辞書データの登録可否判定時に登録可と判定された場合、ステップＳ６とＳ７（登録処理）の間に、学習辞書内に登録可になった辞書データに近い特徴を持った他のカテゴリの辞書が存在するか否かを調べ、そのような辞書が存在する場合にはその辞書テンプレートに対応付けて使用禁止フラグを設定して当該辞書テンプレートを使用禁止とする第１の禁止処理ステップを設けるようにしてもよい。
【００３５】
更に、第１の禁止処理ステップの後に、登録可とされた辞書データが、以前に、上記禁止処理ステップによって使用禁止とされた辞書テンプレートを使用可とすることによって認識処理に用いることができる場合はその使用禁止を解除する禁止解除ステップを設けるようにしてもよい。
【００３６】
更に、上記禁止解除ステップの後に、使用禁止解除された辞書テンプレートを以前使用禁止にした時に登録された辞書テンプレートを探し出し、使用禁止とする第２の禁止処理ステップを設けるようにしてもよい。
【００３７】
更に、上記第２の使用禁止処理に代えて、上記禁止解除ステップの後に使用禁止解除された辞書テンプレートを以前使用禁止にした時に登録された辞書テンプレートを探し出し、その辞書テンプレートによる認識判定時に使用する閾値を、禁止解除された辞書テンプレートの認識に影響を与えないように、認識範囲を狭くするように変化させる認識範囲制限ステップを設けるようにしてもよい。
【００３８】
また、上記図３の辞書学習処理部のステップＳ６の辞書学習閾値による辞書データの登録可否判定時に登録可と判定された場合、ステップＳ６とＳ７（登録処理）の間に、当該辞書テンプレートに近い特徴を持った他のカテゴリの辞書が存在するか否かを調べ、そのような辞書が存在する場合には、当該辞書テンプレートの認識判定時に使用する閾値を、登録可になった辞書データを登録した辞書テンプレートによる認識に影響を与えないように、認識範囲を狭くするように変化させる認識範囲制限ステップを設けるように構成してもよい。
【００３９】
３．具体例
図４は文字イメージ及びその認識結果の一実施例を示す図であり、上段は文字イメージを示し、下段は認識結果を示す。以下、図３のフローチャートに沿って説明する。なお、この例では、説明上、学習処理回数の判定基準α＝１０、学習回数の判定基準β＝０．３、辞書学習閾値＝γとする。
【００４０】
オペレータはモニタ４に表示された文字イメージ及び認識結果を見て、第１フィールドの最初の文字が棄却されている（棄却記号「？」で示されている）ので、正しい文字（この例では「１」）を入力して修正すると、棄却記号「？」の文字コードが修正結果「１」の文字コードで置換されてハードディスク３に記憶されると共に修正情報がこの修正文字コードに関連付けられてハードディスク３に記憶される。
【００４１】
図４の文字イメージに対する文字認識処理が終わると、辞書学習処理部４はハードディスク３に記憶された認識情報をサーチし、修正情報の有無を判定する。この例では第１フィールドの最初の文字に係る認識情報に修正情報があるので、この認識情報のアドレスを保持してステップＳ２に遷移する（ステップＳ１）。
【００４２】
修正文字があったので、辞書学習処理部４は、第１フィールドの最初の文字（棄却文字）にかかわる認識情報を読み出し、認識情報のうちの使用辞書テンプレート番号をキーとしてハードディスク３をサーチし、第１フィールドの最初の文字の認識に使用した辞書の学習処理回数と学習回数を取り出す。この場合、第１フィールドの最初の文字について初めての学習処理となるので学習処理回数＝０、学習回数＝０である（ステップＳ２）。
【００４３】
次に、学習処理回数（＝０）と判定基準α（＝１０）を比較すると学習回数が判定基準未満のため、辞書学習閾値ｃを緩和するステップＳ４に遷移する（ステップＳ３）。辞書学習処理部４は辞書学習閾値ｃを緩和値ｃ’（ｃ＜ｃ’）に設定し直す（ステップＳ４）。
【００４４】
辞書学習処理部４は、１フィールド目の最初の文字イメージの特徴量とその文字イメージの認識に使用した辞書の特徴量との認識距離Ｒと上記ステップＳ４で設定し直された辞書学習閾値ｃ’を比較し、Ｒ＜ｃ’の場合は登録可としてステップＳ７に遷移し、Ｒ≧c'の場合は登録不可としてステップＳ８に遷移する。この例で、ステップＳ４で辞書学習閾値を緩和したので１フィールド目の最初の文字イメージの特徴量を辞書に登録することができるが、辞書学習閾値が緩和されず標準値ｃのままの場合は登録されない（＝辞書は学習しない）こととなる（ステップＳ６）。
【００４５】
辞書学習処理部４は、１フィールド目の最初の文字イメージの特徴量を辞書データとしてその文字イメージの認識に使用した辞書テンプレートのカテゴリの辞書に登録すると共に、ハードディスク３の学習回数カウンタに１を加える（ステップＳ７）。また、辞書学習処理部４はハードディスク３の学習処理回数カウンタに１を加える（ステップＳ８）。
【００４６】
辞書学習処理部４は、上記ステップＳ１で保持した認識情報のアドレスに認識結果レコード長を加え、次の認識結果の記憶アドレスを取得してＳ１に遷移する（ステップＳ９）。
【００４７】
以上、本発明の一実施例について説明したが本発明は上記実施例に限定されるものではなく、種々の変形実施が可能であることはいうまでもない。
【００４８】
【発明の効果】
上記説明したように、第１の発明の辞書学習方法及び第２の発明の辞書学習プログラムによれば、従来登録されなかった文字イメージの辞書データのように辞書の使用頻度が少ない場合には辞書学習閾値を緩めることにより学習辞書に登録して辞書を成長させることが可能となる。また、辞書の使用頻度が多く辞書が安定している場合には辞書学習閾値を標準値に設定できるので、辞書のバランスを欠くような辞書データの登録を防止できる。
【図面の簡単な説明】
【図１】本発明の辞書学習方法及び辞書学習方法を適用可能な文字認識装置の一実施例の構成を示すブロック図である。
【図２】認識処理部の一実施例を示すブロック図である。
【図３】辞書学習処理部の動作の一実施例を示すフローチャートである。
【図４】認識結果の一実施例を示す図である。
【符号の説明】
１原稿読取り部
２認識処理部
３ハードディスク
４辞書学習処理部
５モニタ
６キーボード
１０文字認識装置
２１文字認識部
２２認識文字修正部
２３制御部
３１認識辞書[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a character recognition device, and more particularly to a dictionary learning technique for updating (correcting / adding) a standard pattern registered in a recognition dictionary when a rejected character or a misrecognized character is corrected and input.
[0002]
[Prior art]
In the process of character recognition, first, the feature amount extraction of the extracted character is compared with the extracted feature amount and the feature amount of each template in the recognition dictionary, and the candidate having the highest similarity feature amount. The character is regarded as a correct character and its character code is acquired.
However, even if the character to be read is the same character, the shape, the size, the thickness of the line is different, or the line is blurred.
In particular, when recognizing handwritten characters, in addition to the above factors, factors such as personal writing habits are added to lower the recognition rate.
[0003]
As a solution to the above problem, a method of updating (correcting / adding) a standard pattern registered in the recognition dictionary when correcting uncorrected characters or misrecognized characters, so-called 'learning the recognition dictionary' There is a method to improve the recognition rate. As prior art relating to a recognition rate learning method, (1) as disclosed in “Character recognition device” of Japanese Patent Laid-Open No. 2-186484, “Learning method of recognition dictionary” of Japanese Patent Laid-Open No. 9-185682, etc. (2) “Character recognition device and its dictionary learning method and dictionary creation method” disclosed in Japanese Patent Laid-Open No. 7-129724. (3) “Handwritten character recognition dictionary creation method and device” disclosed in Japanese Patent Laid-Open No. 7-271919, and (4) Japanese Patent Laid-Open No. 8-287191. There is a method of examining the validity of a recognition dictionary after learning by a recognition simulation, as in the technique disclosed in “Optical Character Reading Device” of the above.
[0004]
However, with the technique of (1) above, a highly accurate 'learning of recognition dictionary' cannot be performed unless the operator has specialized knowledge for determining the necessity of character learning and the point that the operator's workload increases. There is a problem that, in the technique of (2) does not consider the validity of the learning text, there is a problem of lack of stability of the efficiency and recognition dictionary after learning of the learning, the (3 ) And (4) have a problem that it takes time to simulate the validity of the recognition dictionary after learning, and it is necessary to add storage means for storing data necessary for the recognition simulation.
[0005]
As a technique for solving such a problem, there is (5) a technique disclosed in Japanese Patent Laid-Open No. 11-184976 (“dictionary learning method and character recognition device”). According to the above technique (5), learning characters unsuitable for learning can be automatically excluded based on the recognition distance, and the learning characters are further classified according to the recognition dictionary template to be learned. The need to add a template based on the minimum recognition distance of learning characters and the advantage that the operator does not need to make a judgment and does not require specialized knowledge related to character characteristics and learning. Therefore, the recognition dictionary data is updated or added, so that there is an advantage that the optimum learning method can be selected for the learning character without the operator's judgment.
[0006]
[Problems to be solved by the invention]
However, in the technique (5) , there is a problem that a character (recognition dictionary data) to be added cannot be added depending on how to create a recognition dictionary, and if the threshold value is relaxed to add such a character, it is not desired to add it. There was a problem that other bad characters were added.
[0007]
The present invention has been made to solve the above problems, efficiently recognition dictionary learning with performing Ru obtained gives highly relevant and stability in the recognition dictionary after learning dictionary learning method and dictionaries The purpose is to provide a learning program.
[0008]
In order to solve the above-mentioned problem, in the invention according to claim 1, in a character recognition method for performing learning processing of a recognition dictionary,
The number of times the dictionary learning threshold for determining whether or not the learning target character is suitable for learning of the recognition dictionary is the registration processing determination target based on the dictionary learning threshold. Is less than a predetermined value, or when the ratio of the number of times that the dictionary data is registered in the dictionary by the registration determination process with respect to the number of times the registration process is determined is greater than or equal to a predetermined ratio, the dictionary learning threshold is set. It is used to reduce the dictionary learning threshold value to be changed to a value larger than the standard value, to return the dictionary learning threshold value to the standard value when the ratio is less than a predetermined value, and to recognize the learning target character and the learning target character. and recognizing the distance between had dictionary, compared with a dictionary learning threshold, if the recognition distance is less than the dictionary learning threshold is registered in the dictionary using the dictionary data according to the learning target of the character recognition of the learning target character registration process It provides a dictionary learning method characterized by comprising: a constant step.
[0014]
According to the second aspect of the present invention, the character recognition apparatus extracts the characteristics of the character image obtained by cutting out from the read document image, and outputs the recognition result of each character image using the recognition dictionary. It is a program, and the recognition information of the character to be learned is acquired from the recognition information stored in the memory in association with the rejection character image and the misrecognition character corrected when the rejection character image and the erroneous recognition character are corrected. Step : The dictionary template number used for recognition is extracted from the acquired recognition information, the number of learning processes and the number of learning of the dictionary template stored in the memory are read out, the number of learning processes is compared with a predetermined value α, and the number of learning processes If ≧ α, the step of calculating the ratio r of the number of learning processes to the number of learning times; Alternatively, if the ratio r ≧ predetermined value β, a step of setting a dictionary learning threshold value larger than a standard value for determining whether or not the character to be learned is suitable for learning of the recognition dictionary, the ratio r < In the case of the predetermined value β, a step of setting a standard value for the dictionary learning threshold value, comparing the recognition distance between the learning target character and the dictionary used for recognition of the learning target character and the dictionary learning threshold value, and the recognition distance < In the case of the dictionary learning threshold, the step of registering the dictionary data of the character to be recognized and corrected in the dictionary used for the recognition of the character to be learned and increasing the number of learning times by one, increasing the number of learning processes by one And a dictionary learning program for executing the step .
[0015]
DETAILED DESCRIPTION OF THE INVENTION
1. Configuration FIG. 1 is a block diagram showing the configuration of an embodiment of a character recognition apparatus to which the method for correcting a recognized character of the present invention can be applied, and FIG. 2 is a block diagram showing an embodiment of a recognition processing unit 2.
[0016]
In FIG. 1, a character recognition device 10 includes a document reading device 1, a recognition processing unit 2, a hard disk (HD) 3, a dictionary learning processing unit 4, a monitor 5, and a keyboard 6.
The document reading device 1 includes an image reader such as an OCR (optical character reading device) or a scanner. The document reading device 1 reads the document, converts it into image data, and passes it to the recognition processing unit 2.
Moreover, the recognition process part 2 is provided with the character recognition part 21, the recognition character correction part 22, the control part 23, and the recognition dictionary 31, as shown in FIG.
[0017]
In FIG. 2, the character recognition unit 21 cuts out character images for each character from the image data received from the document reading device 1, performs character recognition processing, and outputs a recognition result (character code or rejection code).
[0018]
The recognized character correcting unit 22 corrects the rejected image or the erroneously recognized character when the operator inputs correction from the keyboard 6 to correct the rejected image or correct the erroneous recognition, and writes it to the hard disk 3. Update the recognized result. At this time, correction information is added to the location where the recognition result is updated, and the correction information is written to the hard disk 3.
[0019]
The control unit 23 includes a CPU, an internal memory (RAM), and its peripheral circuits, and controls the entire character recognition device 10 described above and the operations of the components of the character recognition device 10 and the recognition processing unit 2. Further, the control unit 23 recognizes the recognition processing program (corresponding to the character recognition unit 21 and the recognized character correction unit 22 in FIG. 2) stored in the hard disk 3 or the program storage ROM, and the dictionary learning program of the present invention (dictionary learning in FIG. 2). Character recognition is performed by controlling the execution of the processing unit 4.
[0020]
The hard disk 3 has recognition information, that is, a character image, a characteristic amount of the character image, a character code as a recognition result (or a character code as a correction result), the number of the dictionary template used, and correction information when there is correction. In addition to an area for storing the recognition dictionary 31, an area for storing the recognition dictionary 31 and an area for writing the number of learning processes and the number of learning (described later) of each dictionary are secured (the recognition dictionary 31 may be a ROM or a physically separate hard disk). Good). In addition to the recognition processing program and dictionary learning program, the hard disk 3 can also store various program groups necessary for execution control of the character recognition device 10.
[0021]
The dictionary learning processing unit 4 stores dictionary data such as rejected character or misrecognized character features based on the correction information stored in the hard disk 3 together with the recognition result (character code) corrected by the operator in the recognition processing unit 2. If it can be added by determining whether it can be added to the recognition dictionary, the feature amount of the character is added (registered) to the dictionary template of the category. Here, the correction information is a flag indicating whether or not the correction has been made by the operator. Instead of the correction information, the recognition result (character code) before correction may be stored in the hard disk 3 together with the recognition result (character code) after correction.
[0022]
2. Operation FIG. 3 is a flowchart showing an embodiment of the operation of the dictionary learning processing unit 4 of FIG. 1, and the control of the operation sequence of each step is performed by the control unit 23.
[0023]
Step S1: (correction character presence / absence determination)
In FIG. 3, the dictionary learning processing unit 4 recognizes the recognition information written on the hard disk 3 by the character recognition process described above (the character image on the hard disk, the feature amount of the character image, and the character code (or the correction result). Character code), correction information is written in the case of correction, and if there is recognition information that has been subjected to the recognition character correction processing described above, that is, the recognition information corrected by the operator is If there is, the recognition information is read out, the head address of the character image recognition information on the hard disk 3 is held, and the process proceeds to S2, and the character subjected to the correction process is stored in the hard disk 3. If it is not in the recognition result, the dictionary learning process is terminated. In the embodiment, the presence / absence of the recognized character correction process is checked to determine whether correction information is written corresponding to the recognition result written in the hard disk 3 (if there is a recognized character correction process, the correction result = the recognition result). However, the present invention is not limited to this.
[0024]
Step S2: (Reading the number of learning processes and the number of learnings of the used dictionary)
The dictionary learning processing unit 4 extracts the used dictionary template number from the recognition information read out in step S1, and uses the dictionary used for the recognized character correction processing from the hard disk 3 (that is, the dictionary for which registration determination is to be performed). ) Is read from the hard disk 3 and stored in the memory. Here, regardless of whether or not the number of learning processes has been added (registered) to the dictionary, it means the number of times that the dictionary has been subjected to registration determination processing with a threshold value in step S6, which will be described later, and the number of learning times is determined in step S6. As a result of the processing, it means the number of times the dictionary data is registered in step S7, the number of learning processes is counted in step S8, and the number of learning is counted in step S7.
[0025]
Step S3: (Dictionary necessity determination of threshold value relaxation)
The dictionary learning processing unit 4 compares the number of learning processes extracted in step S2 with a predetermined value α, and if the number of learning processes is less than α, it is determined that the dictionary is in a growth stage because the number of learning processes for this dictionary is small. In order to relax the learning threshold, the process proceeds to S4. If the number of learning processes is greater than or equal to α, it is determined that the dictionary has passed the growth period, the ratio between the number of learning processes and the number of learning processes is calculated, and the calculation result is less than a predetermined value β (0 ≦ β ≦ 1). In this case, it is determined that the learning target dictionary is stable and has reached the maturity period, and the process proceeds to S5. If it is equal to or greater than β, it is determined that the maturity period has not been reached and the process proceeds to S4 (that is, the number of learning times ÷ If the number of learning processes <β, the process proceeds to S5, and if the number of learnings / the number of learning processes ≧ β, the process proceeds to S4).
[0026]
Step S4: (Reduction of dictionary learning threshold)
If the dictionary learning processing unit 4 determines in step S3 that the dictionary is in the growth stage or has not yet stabilized even after the growth period, a dictionary learning determination threshold (dictionary learning threshold) is set. The value is reset to a value (relaxation value) larger than the standard value, and the process proceeds to S6. The relaxation value is a predetermined value (experimental value or experience value) determined in advance by the number of learning processes or the ratio of the number of learning processes and the number of learning processes.
[0027]
Step S5: (Non-relaxation of dictionary learning threshold)
If the dictionary learning processing unit 4 determines that the dictionary is stable in step S3, the dictionary learning processing unit 4 sets a threshold value for dictionary learning determination (dictionary learning threshold value) to a standard value (predetermined value), and S6. Transition to.
[0028]
Step S6: (Dictionary of dictionary data registration availability based on dictionary learning threshold)
The dictionary learning processing unit 4 calculates the recognition distance, that is, the distance (for example, the Euclidean distance) between the feature amount of the dictionary of the category determined to be registered and the feature amount of the character image to be corrected, and the above step S4. Alternatively, it is compared with the dictionary learning threshold set in S5, and it is determined whether or not the dictionary data of the character subjected to recognition correction processing can be registered in this dictionary. Here, if the recognition distance is less than the dictionary learning threshold, registration is permitted and the process proceeds to S7. If the recognition distance is greater than or equal to the dictionary learning threshold, registration is impossible and the process proceeds to S8.
[0029]
Step S7: (Registration and counting of dictionary data)
The dictionary learning processing unit 4 registers (additionally stores) the dictionary data of the character image to be corrected in the dictionary that is the object of registration determination in step S6, and adds 1 to the learning number counter of the hard disk 3.
[0030]
Step S8: (Counting the number of learning processes)
The dictionary learning processing unit 4 adds 1 to the learning processing number counter of the hard disk 3.
[0031]
Step S9: (Obtain start address of next recognition result)
The dictionary learning processing unit 4 adds the recognition information record length to the start address of the recognition information held in step S1, acquires the start address of the next recognition information, and makes a transition to S1. Search from the recognition result next to the corrected recognition result).
[0032]
In the flowchart of FIG. 3, the recognition result (for all originals) written to the hard disk for dictionary learning processing after the recognition character correction processing has been completed for the recognition results of all originals written to the hard disk by the character recognition processing. Are sequentially searched, but after the recognition character correction processing for one original has been completed for the recognition results of all originals written to the hard disk in the character recognition processing, the recognition results for that single original are recognized. Alternatively, the above-described steps S2 to S8 may be executed each time the operator recognizes the correction of the recognized character in the recognition result of the original written on the hard disk by the character recognition process. You may comprise. In the case where the search is performed for the recognition result for one original, the counts in steps S7 and S8 are the original unit, and the dictionary learning process is performed each time the operator recognizes the input of correction of the recognized character. If it does, it will be counted for each category.
[0033]
With the above configuration, the dictionary data of character images that have not been registered in the past is also counted as the number of learning processes, the number of learning times, and based on these values, it is determined whether or not the category dictionary is in the growth period. In this case, it is possible to register in the learning dictionary by relaxing the dictionary learning threshold. In addition, when the dictionary is mature and stable, the dictionary learning threshold is set to a standard value, so that registration of dictionary data that lacks dictionary balance can be prevented, and a stable learning dictionary can be obtained. .
[0034]
(Modification)
If it is determined that registration is possible when the dictionary learning processing unit in FIG. 3 determines that registration of dictionary data is possible based on the dictionary learning threshold value in step S6, registration is performed in the learning dictionary between steps S6 and S7 (registration processing). Check whether there is a dictionary of another category with characteristics close to the enabled dictionary data, and if such a dictionary exists, set a use prohibition flag in association with that dictionary template. A first prohibition processing step for prohibiting the use of the dictionary template may be provided.
[0035]
Furthermore, after the first prohibition process step, the dictionary data that can be registered can be used for the recognition process by enabling the dictionary template that was previously prohibited by the prohibition process step. May be provided with a prohibition canceling step for canceling the prohibition of use.
[0036]
Further, after the prohibition release step, a second prohibition processing step may be provided in which a dictionary template registered when the use of the dictionary template whose use prohibition has been canceled is previously prohibited is searched and the use is prohibited.
[0037]
Further, in place of the second use prohibition process, a dictionary template registered when the use of the dictionary template whose use has been canceled after the prohibition release step has been previously used is searched for and used for recognition determination using the dictionary template. A recognition range restriction step for changing the threshold value so as to narrow the recognition range may be provided so as not to affect the recognition of the banned dictionary template.
[0038]
If it is determined that registration is possible at the time of determining whether or not the dictionary data can be registered based on the dictionary learning threshold value in step S6 of the dictionary learning processing unit in FIG. 3, the processing is close to the dictionary template between steps S6 and S7 (registration processing). Check whether there are other categories of dictionaries that have features. If such a dictionary exists, register the dictionary data that can be registered as the threshold value used when determining the recognition of the dictionary template. A recognition range limiting step for changing the recognition range to be narrow may be provided so as not to affect the recognition by the dictionary template.
[0039]
3. Specific Example FIG. 4 is a diagram showing an embodiment of a character image and its recognition result, where the upper part shows the character image and the lower part shows the recognition result. Hereinafter, it demonstrates along the flowchart of FIG. In this example, for the sake of explanation, it is assumed that the number of learning processing determination criteria α = 10, the number of learning determination criteria β = 0.3, and the dictionary learning threshold = γ.
[0040]
The operator looks at the character image displayed on the monitor 4 and the recognition result, and the first character in the first field is rejected (indicated by a reject symbol “?”). 1 ”), the character code of the rejection symbol“? ”Is replaced with the character code of the correction result“ 1 ”and stored in the hard disk 3, and the correction information is associated with the corrected character code and stored in the hard disk. 3 is stored.
[0041]
When the character recognition process for the character image in FIG. 4 is completed, the dictionary learning processing unit 4 searches the recognition information stored in the hard disk 3 to determine whether correction information exists. In this example, since there is correction information in the recognition information related to the first character of the first field, the address of this recognition information is held and the process proceeds to step S2 (step S1).
[0042]
Since there is a corrected character, the dictionary learning processing unit 4 reads the recognition information related to the first character (rejected character) in the first field, searches the hard disk 3 using the used dictionary template number of the recognition information as a key, The number of learning processes and the number of learning times of the dictionary used for the recognition of the first character in the first field are extracted. In this case, since the first learning process is performed for the first character in the first field, the number of learning processes = 0 and the number of learnings = 0 (step S2).
[0043]
Next, when the number of learning processes (= 0) and the criterion α (= 10) are compared, the number of times of learning is less than the criterion, so the process proceeds to step S4 for relaxing the dictionary learning threshold c (step S3). The dictionary learning processing unit 4 resets the dictionary learning threshold c to the relaxation value c ′ (c <c ′) (step S4).
[0044]
The dictionary learning processing unit 4 recognizes the recognition distance R between the feature amount of the first character image in the first field and the feature amount of the dictionary used for the recognition of the character image, and the dictionary learning threshold value c reset in step S4. If R <c ', the process proceeds to step S7 with registration being possible, and if R ≧ c', the process proceeds to step S8 with registration not possible. In this example, since the dictionary learning threshold value is relaxed in step S4, the feature amount of the first character image in the first field can be registered in the dictionary. However, if the dictionary learning threshold value is not relaxed and remains at the standard value c, It is not registered (= the dictionary is not learned) (step S6).
[0045]
The dictionary learning processing unit 4 registers the feature quantity of the first character image in the first field as dictionary data in the dictionary of the category of the dictionary template used for the recognition of the character image, and sets 1 in the learning frequency counter of the hard disk 3. Add (step S7). Further, the dictionary learning processing unit 4 adds 1 to the learning processing counter of the hard disk 3 (step S8).
[0046]
The dictionary learning processing unit 4 adds the recognition result record length to the address of the recognition information held in step S1, acquires the storage address of the next recognition result, and transitions to S1 (step S9).
[0047]
As mentioned above, although one Example of this invention was described, this invention is not limited to the said Example, It cannot be overemphasized that various deformation | transformation implementation is possible.
[0048]
【The invention's effect】
As described above, according to the dictionary learning method of the first invention and the dictionary learning program of the second invention, the dictionary is used when the frequency of use of the dictionary is low, such as dictionary data of character images that have not been registered in the past. By relaxing the learning threshold, it is possible to register the learning dictionary and grow the dictionary. Further, when the dictionary is frequently used and the dictionary is stable, the dictionary learning threshold can be set to a standard value, so that it is possible to prevent registration of dictionary data that lacks the balance of the dictionary.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration of an embodiment of a dictionary learning method and a character recognition apparatus to which the dictionary learning method of the present invention can be applied.
FIG. 2 is a block diagram illustrating an embodiment of a recognition processing unit.
FIG. 3 is a flowchart illustrating an example of the operation of the dictionary learning processing unit.
FIG. 4 is a diagram illustrating an example of recognition results.
[Explanation of symbols]
1 Document Reading Unit 2 Recognition Processing Unit 3 Hard Disk 4 Dictionary Learning Processing Unit 5 Monitor 6 Keyboard 10 Character Recognition Device 21 Character Recognition Unit 22 Recognition Character Correction Unit 23 Control Unit 31 Recognition Dictionary

Claims

In a character recognition method for performing recognition dictionary learning processing,
Number of times the dictionary learning threshold for determining whether or not the learning target character is suitable for learning of the recognition dictionary is the registration processing determination target based on the dictionary learning threshold. Is less than a predetermined value, or when the ratio of the number of times that the dictionary data is registered in the dictionary by the registration determination process with respect to the number of times the registration process is determined is greater than or equal to a predetermined ratio, the dictionary learning threshold is set. A dictionary learning threshold relaxation step for changing to a value larger than the standard value;
When the ratio is less than a predetermined value, returning the dictionary learning threshold value to a standard value ;
The recognition distance between the learning target character and the dictionary used for recognizing the learning target character is compared with the dictionary learning threshold, and if the recognition distance is less than the dictionary learning threshold , the dictionary related to the learning target character A registration process determination step of registering data in a dictionary used for recognition of the learning target character;
A dictionary learning method characterized by comprising:

A program that is executed in a character recognition device that extracts features of a character image obtained by cutting out from a read document image and outputs a recognition result of each character image using a recognition dictionary,
On the computer,
Acquiring recognition information of a character to be learned from recognition information stored in a memory in association with a rejection character image and a misrecognized character that are corrected when the rejection character image and the erroneous recognition character are corrected ;
The dictionary template number used for recognition is extracted from the acquired recognition information, the number of learning processes and the number of learning of the dictionary template stored in the memory are read, the number of learning processes is compared with a predetermined value α, and the number of learning processes ≧ α In the case of calculating the ratio r of the number of learning times to the number of learning processes,
When the number of learning processes <α, or when the ratio r ≧ predetermined value β, a value larger than the standard value is set as the dictionary learning threshold for determining whether or not the character to be learned is suitable for learning of the recognition dictionary. Step,
If the ratio r <predetermined value β, setting a standard value for the dictionary learning threshold;
The recognition distance between the learning target character and the dictionary used for recognizing the learning target character is compared with the dictionary learning threshold value. If the recognition distance <the dictionary learning threshold value, the character subjected to recognition correction processing Registering the dictionary data in the dictionary used for recognizing the character to be learned and increasing the number of learning by one,
Increasing the number of learning processes by one;
Dictionary learning program to execute.