JP3955467B2

JP3955467B2 - Image processing program and image processing apparatus

Info

Publication number: JP3955467B2
Application number: JP2001397674A
Authority: JP
Inventors: 好博嶋
Original assignee: Hitachi Computer Peripherals Co Ltd; Hitachi Ltd
Current assignee: Hitachi Ltd; Hitachi Information and Telecommunication Engineering Ltd
Priority date: 2001-12-27
Filing date: 2001-12-27
Publication date: 2007-08-08
Anticipated expiration: 2021-12-27
Also published as: JP2003196592A

Description

【０００１】
【発明の属する技術分野】
本発明は、画像処理プログラム及び画像処理装置に係り、特に、帳票の表面画像を光学的に読み取り、この読み取った画像の所定の領域を認識する光学的文字読取装置（以下、ＯＣＲという）に読み取らせるために使用して好適な画像処理プログラム及び画像処理装置に関する。
【０００２】
【従来の技術】
従来、帳票等に手書き等によって記入された記入文字は、ＯＣＲにより光学的に読み取られ認識されて処理されているいる。一般に、帳票には、記入する枠や金額単位の円等の読み取りに不要な文字（プレ印刷という）が印刷されている。これらの枠や印刷文字等のプレ印刷は、読み取りには不要であり、ＯＣＲで白色と判定される色、すなわち、ドロップアウトカラーと呼ばれる予め決められた色が使用されていた。このドロップアウトカラーは、光電変換部において使用される光源あるいは光学的フィルタの波長に応じて固定的にかつ厳密に決定されている。このため、帳票のプレ印刷に使用することができる色は、制限されており、ＯＣＲで従来入力対象としていないカラー帳票のように、指定された色以外で印刷された帳票に対してプレ印刷をドロップすることは困難であった。
【０００３】
指定された色以外で印刷された帳票に対してプレ印刷をドロップすることができる従来技術として、例えば、特開平６−１７６１９４号公報等に記載された技術が知られている。この従来技術は、カラー画像を出力することが可能な光電変換部を設け、基準帳票の画像を基に、処理対象とする帳票画像から除くべき部分を色識別によって予め用意しておくことにより、処理対象とする帳票のプレ印刷部分をドロップするというものである。この従来技術は、これにより、帳票のプレ印刷に用いられる色の制限をなくして、記入文字を読み取ってＯＣＲにより認識させることができるが、文字の記入されていない基準帳票を予め用意しておく必要がある。また、この従来技術は、枠線等のプレ印刷と記入文字とが接触、重なった場合について、記入文字がその重なり部分で消えないようにすることについての配慮がなされていないものであった。
【０００４】
また、ＯＣＲに読み取らせるための画像処理に関する従来技術として、特開平６−１９５５０９号公報等に記載された技術が知られている。この従来技術は、記入文字色が黒色であることを前提として、多色刷り画像から、各画素の原色成分（赤、緑、青）を与える光電変換部を設け、それら原色成分の最大値を有する信号を出力してカラーを除去して記入文字だけを抽出するというものである。しかし、この従来技術は、小切手等の文書の多色刷りの背景からカラーを除去することが目的であり、記入文字が色付きの場合、例えば、青色の記入文字の場合、カラーを除去することにより記入文字も除去されてしまうという問題点を有している。
【０００５】
また、さらに、他の従来技術として、例えば、特開平１０−２７２１３号公報等に記載された技術が知られている。この従来技術は、赤鉛筆、赤色ボールペン、青色ボールペン等の有彩色で記入された文字を無彩色の文字と同様に読み取ることができるように、光電変換部から赤、緑、青の３つの原色成分を入力し、その画素の色が無彩色、赤系色、緑系色、青系色のうちのいずれのカラーグループに属するかを判定し、その画素が予め指定している文字色の場合、３原色成分の最大値をその画素の白黒輝度信号として出力するというものである。もし、文字色でない場合、背景レベルをその画素の白黒輝度信号として出力する。そして、その画素の白黒輝度信号に変換する。すなわち、多値のイメージデータから２値のイメージデータに変換する。しかし、この従来技術は、枠線等のプレ印刷と記入文字とが接触、重なった場合について、記入文字を残し、プレ印刷をドロップし、重なり部分が消えないようにすることについての配慮がなされていない。
【０００６】
さらに、他の従来技術として、特開平６−２９０３０２号公報等に記載された技術が知られている。この従来技術は、枠線等のプレ印刷と記入文字とが接触、重なった場合、入力画像データに対して暫定的な色分離を行い、この暫定的に色分離された各部分を構成する画素の幾何学的な情報と、前記各部分及びバックグラウンドの濃度情報とを併用して最終的な色分離を決定するものである。しかし、この従来技術は、２値画像バッファと、グレー画像バッファ、カラー画像バッファとを備え、２値画像を対象とした文字を切り出し、文字認識を行うに際して、入力画像の特定部分、例えば、文字ストローク中の穴の空いている部分や罫線と文字の接触部分を再チェックする場合に、前述のグレー画像バッファ、カラー画像バッファを参照して文字を切り出して文字認識を実行するものである。
【０００７】
このため、この従来技術は、文字切り出し部や文字認識部等の処理部毎にグレー画像やカラー画像にアクセスする必要が生じ、制御が複雑になるという問題点を有している。また、この従来技術は、プレ印刷を消去し、記入文字を残した２値画像を生成して出力しておらず、読み取り結果の修正のためにプレ印刷を除去し、記入文字を残した２値画像を画面に表示することについて配慮されていない。さらに、この従来技術は、同一色とみなせる領域を成長させるというような直線幾何学的な情報を用いているが、直線部分や文字の曲線部分における領域の輪郭の乱れや、色ずれに対する領域成長の信頼性に配慮されていない。
【０００８】
【発明が解決しようとする課題】
前述で説明した従来技術は、それぞれの従来技術の説明と共に説明したように、種々の問題点を有している。
【０００９】
本発明の目的は、前述した従来技術の問題点を解決し、処理対象とするカラー帳票において、枠線等のプレ印刷の色と記入文字の色とが異なるような帳票画像に対して、プレ印刷と記入文字とが接触あるいは重なっていても、プレ印刷を消去し、記入文字を残した２値画像を生成することができ、精度の高い文字認識を行わせることができる画像処理プログラム及び画像処理装置を提供することにある。
【００１０】
【課題を解決するための手段】
本発明によれば前記目的は、入力された帳票画像の３原色成分の最小値に対して２値化を行うステップと、前記２値化した結果の２値化画像の文字認識を行うステップと、前記認識文字がアクセプトされた場合に、認識した文字を出力するステップとを実行させ、前記認識文字がリジェクトされた場合に、前記入力された帳票画像のプレ印刷色と記入文字色とを設定するステップと、前記帳票画像のプレ印刷色をドロップアウトして２値化した２値化画像を生成するステップと、前記ドロップアウトして２値化した２値画像の文字認識を行うステップと、認識した文字を出力するステップとを情報機器に実行させることにより達成される。
【００１２】
さらに、前記目的は、入力された帳票画像の３原色成分の最小値に対して２値化を行う手段と、前記２値化した結果の２値化画像の文字認識を行う手段と、前記認識文字がアクセプトされた場合に、認識した文字を出力する手段と、前記認識文字がリジェクトされた場合に、前記入力された帳票画像のプレ印刷色と記入文字色とを設定する手段と、前記帳票画像のプレ印刷色をドロップアウトして２値化した２値化画像を生成する手段と、前記ドロップアウトして２値化した２値画像の文字認識を行う手段と、認識した文字を出力する手段とを有することにより達成される。
【００１３】
【発明の実施の形態】
以下、本発明による画像処理装置及び文字読み取りシステムの一実施形態を図面により詳細に説明する。
【００１４】
図１は本発明の一実施形態による画像処理装置を含む文字読み取りシステムの構成を示すブロック図である。図１において、１００はカラー画像スキャナ、１０１は画像処理部、１０２は色識別ドロップアウト２値化部、１０３はプレ印刷・記入文字の色識別部、１０４はプレ印刷ドロップアウト２値化部、１０５は３原色成分最小値２値化部、１０６は制御部、１０７は文字読み取り部、１０８は画像蓄積部、１０９はネットワークである。
【００１５】
本発明の一実施形態による文字読み取りシステムは、帳票表面のカラー画像を採取するカラー画像スキャナ１００と、該カラー画像スキャナ１００からのカラー画像情報を白黒に２値化する画像処理部１０４と、２値化された画像データを読み取るＯＣＲ等の文字読み取り部１０７と、画像蓄積部１０８とがネットワーク１０９により接続されて構成されている。
【００１６】
画像処理部１０１は、カラー画像スキャナ１００から入力される画像データの３原色成分の最小値、すなわち、３原色成分の最も濃い部分のデータ部分を選択して白黒２値化処理を行う３原色成分最小値２値化部１０５と、プレ印刷の色を識別し、その色をドロップアウト処理する色識別ドロップアウト２値化部１０２と、これらの処理の選択及び実行を制御する制御部１０６とにより構成される。３原色成分最小値２値化部１０５は、入力されたカラー画像の３原色成分の内の最小値を選択し、その原色成分を濃淡値として白黒２値化する。濃淡値を２値化する方法としては、例えば、特開平１１−１２０３３３号公報に開示されているように、注目画素とその周辺の画素とのそれぞれの濃淡値より２値化閾値を決定する方法であってもよい。
【００１７】
色識別ドロップアウト２値化部１０２は、プレ印刷や記入文字の色識別を行う色識別部１０３と色識別結果を用いてプレ印刷色をドロップアウトする２値化部１０４とにより構成される。文字読み取り部１０７は、画像処理部１０１から送られてきた２値画像からの文字切り出しと文字認識とを行い、また、画像蓄積部１０８は、文字読み取り部１０７に送られる２値画像を保管する。
【００１８】
図２は図１に示す文字読み取りシステムにおける画像処理部１０１及び文字読み取り部１０７での処理動作を説明するフローチャートであり、次に、これについて説明する。図２に示す処理が開始される前に、カラー画像スキャナ１００は、文字読み取りを行う帳票をスキャンして、帳票のカラー画像を採取し、その画像を画像処理部１０１に転送する。この画像は、カラー画像スキャナ１００あるいは画像処理部１０１内に設けられる図示しないバッファに一時的に格納保持される。
【００１９】
（１）画像処理部１０１内の３原色成分の最小値の２値化部１０５は、カラー画像スキャナ１００からの画像について、３原色成分の最小値、すなわち、３原色成分の最も濃い部分のデータ部分を選択し白黒２値化処理した結果の２値画像を制御部１０６を介してネットワーク１０９に送信する（ステップ２００）。
【００２０】
（２）文字読み取り部１０７は、３原色成分最小値２値化部１０５において２値化された画像を入力し、読み取る領域、すなわち、フィールドを指定する。フィールドの指定は、予め帳票の書式（フォーマット）としてフィールドの座標を登録しておき、そのフォーマットを読み取るときに選択してフィールドを指定するという方法によっても、あるいは、帳票の枠線を抽出し、枠の並び、配置から読み取る領域を決定する方法によってもよい（ステップ２０１、２０２）。
【００２１】
（３）次に、文字読み取り部１０７は、フィールド内の文字を切り出して文字認識を実行することによりフィールド内の文字読み取りを行う。次いで、フィールド内の文字読み取りの結果がアクセプトかリジェクトかを判定する。この判定の結果、もし、アクセプトの場合、読み取り結果を統合処理して単純に出力する（ステップ２０３、２０４、２０９）。
【００２２】
（４）ステップ２０４の判定で、文字読み取りの結果がリジェクトであった場合、カラー画像スキャナ１００で獲得されていたカラー画像を、色識別ドロップアウト２値化部１０２のプレ印刷・記入文字の色識別部１０３に入力する（ステップ２０５）。
【００２３】
（５）色識別部１０３は、入力されたカラー画像から帳票の枠線やガイド用文字列等のプレ印刷部分の色識別と記入文字の色識別とを行う（ステップ２０６）。
【００２４】
（６）次に、プレ印刷ドロップアウト２値化部１０４は、色識別部１０３からのプレ印刷の色を利用してその色をドロップアウトして２値画像を生成する。なお、プレ印刷と記入文字とが同色であり、ドロップアウト２値化が不成功と判断された場合、プレ印刷ドロップアウト２値化部１０４は、リジェクトを出力する（ステップ２０７）。
【００２５】
（７）文字読み取り部１０７は、プレ印刷ドロップアウト２値化部１０４からネットワーク１０９に伝送されてきた２値画像に対してフィールド内の文字読み取りを行い、読み取り結果を統合処理して出力する（ステップ２０８、２０９）。
【００２６】
前述したように、本発明の実施形態による文字読み取りシステムは、ステップ２０４の処理で先のフィールド内文字読み取りの結果がアクセプトかリジェクトかを判定して、リジェクトの場合、プレ印刷をドロップアウトした２値画像に対して文字読み取りを行っているので、プレ印刷部分が文字認識の障害になることを防止することができ、文字読み取りの信頼性を向上させることができる。
【００２７】
図３は色識別ドロップアウト２値化部１０２の構成を示すブロック図、図８はフィールド内のプレ印刷と記入文字との色を識別する領域を説明する図、図１０は図３における原色成分選択部３０５の構成を示す図である。図３、図１０において、３０１はプレ印刷色識別部、３０２は記入文字色識別部、３０３は色成分選択判定部、３０５は原色成分選択部、３０６は成分値の濃化・淡化部、３０７は２値化部、３０８は画素識別部であり、他の符号は図１の場合と同一である。
【００２８】
色識別部１０３は、プレ印刷色識別部３０１と、記入文字色識別部３０２と、色成分選択判定部３０３とにより構成され、プレ印刷や記入文字の色識別を行う。そして、カラー画像スキャナ１００からのカラー画像の青色成分３１０、緑色成分３１１、赤色成分３１２がプレ印刷色識別部３０１及び記入文字色識別部３０２に入力される。プレ印刷色識別部３０１は、フィールド内の部分的なカラー画像に対して識別用の処理領域を設定し、その領域内にある画素の色の出現分布を検出してプレ印刷の色を識別する。なお、プレ印刷の色は、１つの色に限定するものではなく、複数の色であってもよく、例えば、赤色と緑色との２色をプレ印刷の色として抽出してもよい。
【００２９】
次に、図８を参照して、フィールド内のプレ印刷と記入文字との色を識別する領域について説明する。図８（ａ）、図８（ｂ）において、フィールド８０２内の枠線８００は、色付きであり、例えば、赤色や青色の線である。また、記入文字８０１は、読み取り対象であり、黒色や色付きの文字である。そして、プレ印刷の枠線８００と記入文字８０１とが重なった状態で記入されているものとして示している。枠線の色と記入文字の色とが異なる場合、枠線をドロップアウトすることができる。
【００３０】
枠線等のプレ印刷の色を識別するため、図８（ａ）に示すように、色識別の処理領域８０３、８０４が設定される。この処理領域において、画素の色識別が行われて、色画素の分布を求め、プレ印刷の色が決定される。また、記入文字の色を識別するため、図８（ｂ）に示すように、色識別の処理領域８１３が設定される。この処理領域において、画素の色識別が行われと、色画素の分布を求め、記入文字の色が決定される。
【００３１】
図３の説明に戻って、記入文字色識別部３０２は、フィールド内の部分的なカラー画像に対して識別用の処理領域、例えば、領域８１３を設定し、その領域内にある画素の色の出現分布を検出して記入文字の色を識別する。また、色成分選択判定部３０３は、識別されたプレ印刷色と記入文字色とを用いて、三原色成分の内、２値化対象の濃淡画像として選択する原色成分を判定する。例えば、もし、プレ印刷色が赤色と識別され、記入文字色が青色と識別された場合、赤色成分を選択すべき原色成分と判定する。また、もし、プレ印刷色が緑色と識別され、記入文字色が黒色と識別された場合、緑色成分を選択すべき原色成分と判定する。そして、色成分選択判定部３０３は、プレ印刷色をドロップアウト２値化部１０４の原色成分選択部３０５に対して出力し、また、プレ印刷色と記入文字色との情報３２０をドロップアウト２値化部１０４の成分値の濃化・淡化部３０６に対して出力する。
【００３２】
前述した色識別部１０３における処理は、１つのフィールドのカラー画像について行われて色の識別を行う。そして、前述の色識別が行われた後、次に説明するドロップアウト２値化部１０４により、同一のフィールドのカラー画像のデータを用いて、プレ印刷色をドロップアウトした白黒の２値画像を生成する処理が行われる。
【００３３】
図３に示すように、ドロップアウト２値化部１０４は、原色成分選択部３０５、成分値の濃化・淡化部３０６、画素色識別部３０８、濃淡２値化部３０７から構成されている。原色成分選択部３０５は、色成分選択判定部３０３の結果に従って、三原色成分、すなわち、入力される各画素毎に青色成分３１０、緑色成分３１１、赤色成分３１２の中から１つの原色成分を単純に選択してその濃淡値を出力する。
【００３４】
原色成分選択部３０５は、図１０に示すように、入力された青色成分３１０、緑色成分３１１、赤色成分３１２のうちから、色成分選択判定部３０３から出力された色の種類情報１１０４に従って１つの原色成分を選択して濃淡値１１０５として出力する。ここでは、色の種類情報１１０４が赤色であれば、赤色成分３１０を濃淡値１１０５として出力する。また、もし、色の種類情報１１０４が緑色であれば、緑色成分３１１を濃淡値１１０５として出力する。さらに、もし、色の種類情報１１０４が青色であれば、青色成分３１１を濃淡値１１０５として出力する。
【００３５】
なお、前述において、色成分選択判定部３０３から出力された色の種類情報１１０４が、プレ印刷色として三原色成分のうち、例えば、赤色と青色との２色を示している場合、原色成分選択部３０５は、入力される各画素の赤色成分と青色成分との最大値を選択し、その最大値を濃淡値として出力してもよい。
【００３６】
成分値の濃化・淡化部３０６は、入力された色の濃淡値の大きさを小さく（濃化）または大きく（淡化）する処理を行う。このとき、画素色識別部３０８は、注目画素の色を識別し、該当する色の種類３２２を濃化・淡化部３０６に送出する。濃化・淡化部３０６は、プレ印刷色、記入文字色、注目画素の色が入力され、これらの入力された色の種類に従って、注目画素の濃淡値の濃化または淡化を行う。例えば、もし、プレ印刷色が赤色で、記入文字色が青色であって、注目画素が赤色であれば、その注目画素を淡化する。すなわち、注目画素の濃淡値を大きくし、白色に近づける。濃化・淡化部３０６の出力である濃化・淡化された濃淡値は濃淡２値化部３０７に入力され、２値画像に生成される。この濃淡２値化部３０７は、例えば、特開平１１−１２０３３３号公報に開示されている方法により実現することができる。
【００３７】
図４は図２により説明したフローにおけるステップ２０６でのプレ印刷・記入文字の色識別の処理動作を説明する図であり、次に、これについて説明する。この処理は、プレ印刷の色識別、記入文字の色識別のそれぞれについて、領域を設定して別々に実行される。
【００３８】
（１）まず、フィールド内のカラー画像における探索範囲を設定する。探索範囲は、例えば、図８により説明したように、プレ印刷色の識別に用いる処理領域８０３、８０４や記入文字色の識別に用いる処理領域８１３である（ステップ４００）。
【００３９】
（２）次に、フィールド内に設定された領域内の画像を走査する。走査の手順は、注目画素を横方向、次いで、縦方向に走査し、画像を走査しながら、次に説明するステップ４０３からステップ４０７までの処理を繰り返して行われる（ステップ４０１）。
【００４０】
（３）画像を走査しながら、まず、注目画素の色を判定する。ここでは、注目画素の色の種類として、青色系、緑色系、赤色系、黒色系の４種類について色識別を行って色の種類を求める（ステップ４０３）。
【００４１】
（４）次に、ステップ４０３で判定された画素の色の個数を計数する。すなわち、ステップ４０３で注目画素が青色と判定された場合、青色画素の個数を１個増加させ、ステップ４０３で注目画素が緑色と判定された場合、緑色画素の個数を１個増加させる。同様に、ステップ４０３で注目画素が赤色、黒色と判定された場合、赤色画素の個数、黒色画素の個数を１個増加させる（ステップ４０４〜４０７）。
【００４２】
（５）次に、計数した青色画素、緑色画素、赤色画素のそれぞれの個数に基づいて、プレ印刷や記入文字の色種を決定する。ここでは、青色画素、緑色画素、赤色画素、黒色画素の個数の内、最大の個数を有する色を対象の色種と決定し、処理を終了する（ステップ４０２）。
【００４３】
図５は文字読み取り部１０７における文字読み取りの処理動作を説明する図であり、次に、これについて説明す。
【００４４】
（１）まず、設定されたフィールド内の２値画像を入力し、文字の並びである文字行を抽出する、そして、文字を切り出す（ステップ５００〜５０２）。
（２）切り出した文字の文字認識を行い、知識処理を行う。知識処理は、予め、住所や姓名の辞書を用意しておき、文字認識結果と辞書中の住所や姓名との突き合わせを行い、認識精度を向上させる処理である（ステップ５０３、５０４）。
【００４５】
図６は処理対象であるカラー帳票の例を説明する図であり、次に、これについて説明する。
【００４６】
カラー帳票６００には、例えば、図６に示すように、枠６０１、６０３、６０８が印刷されており、図示例では、枠６０１に文字６０２が記入されている。また、枠６０３にプレ印刷文字６０４、６０５が予め印刷され、また、文字６０６、６０７が記入されている。さらに、枠６０８に文字６０９が記入されている。図に示す例のように、記入文字とプレ印刷とが接触している帳票や、重なっている帳票がある。例えば、図に示す例では、記入文字６０２と枠６０１とが重なっている。また、記入文字６０６、６０７とプレ印刷６０４、６０５とが重なっている。プレ印刷と記入文字とが接触している場合や、重なっている場合、文字読み取りの障害となる。そのため、プレ印刷をドロップアウトする必要がある。
【００４７】
ここで、一例として、枠６０１が赤色で、記入文字６０２が青色であるフィールドや、枠６０３とプレ印刷６０４、６０５が緑色であるフィールドにおいて、プレ印刷である枠６０１、６０３、プレ印刷文字６０４、６０５をドロップアウトし、すなわち白色とし、記入文字６０２、６０６、６０７を黒色とする。なお、枠６０８の色と記入文字６０９の色とが同色の場合、本発明の実施形態での色情報を利用したプレ印刷のドロップアウトの適用対象外である。記入文字６０９の色とプレ印刷の枠６０８の色とが同色の例として、記入文字６０９が緑色でプレ印刷の枠６０８が緑色の場合や、記入文字６０９が黒色でプレ印刷の枠６０８が黒色の場合がある。このように、同色の場合、プレ印刷のドロップアウトを行うことは困難であり、同色の場合、図２により説明したフローのステップ２０７の処理で、プレ印刷、記入文字とも黒色として出力され。
【００４８】
図７は本発明の実施形態による画像処理を行って得られる結果としての２値画像の例を説明する図である。この図７に示す例は、図６により説明したカラー帳票６００のカラー画像からプレ印刷をドロップアウトした結果の２値画像を７００として示している。図示の２値画像７００は、図６における記入文字６０２、６０６、６０７が、記入文字７０１、７０２、７０３として黒色に処理され、プレ印刷６０１、６０３、６０４、６０５がドロップアウトされ白色となっている。一方、プレ印刷６０８、記入文字６０９は、同色のため２値画像７００ではドロップアウトされず、どちらも黒色に表現される。
【００４９】
図９は図４により説明した画素の色判定のステップ４０３での処理動作の詳細を説明するフローチャートであり、次に、これについて説明する。
【００５０】
（１）まず、注目画素の３原色成分、すなわち、赤色成分値Ｒ、緑色成分値Ｇ、青色成分値Ｂを入力する。なお、一般に、画素の色成分は、８ビットの情報でその大きさが表現され、Ｒ、Ｇ、Ｂのそれぞれのは、０〜２２５の値をとる（ステップ９００）。
【００５１】
（２）次に、赤色成分値Ｒと他の成分値との大小を比較し、Ｒ＞ａ・Ｇ、かつ、Ｒ＞ａ・Ｂの条件を満たしているか否かを判定し、条件を満たしている場合、その注目画素を赤色として登録する。なお、上式において、ａは１以上の予め定めて定数であり、以後のステップにおいても、同様である。そして、この定数ａを大きな値に設定すると純粋な色、この場合純粋な赤だけが赤として判定される（ステップ９０１、９０２）。
【００５２】
（３）ステップ９０１の判定で、条件を満たしていなかった場合、次に、緑色成分値Ｇと他の成分値との大小を比較し、Ｇ＞ａ・Ｒ、かつ、Ｇ＞ａ・Ｂの条件を満たしているか否かを判定し、条件を満たしている場合、その注目画素を緑色として登録する（ステップ９０３、９０４）。
【００５３】
（４）ステップ９０３の判定で、条件を満たしていなかった場合、前述と同様に、青色成分値Ｂと他の成分値との大小を比較し、Ｂ＞ａ・Ｒ、かつ、Ｂ＞ａ・Ｇの条件を満たしているか否かを判定し、条件を満たしている場合、その注目画素を青色として登録する（ステップ９０５、９０６）。
【００５４】
（５）さらに、ステップ９０５の判定で、条件を満たしていなかった場合、次に、赤色成分値Ｒ、緑色成分値Ｇ、青色成分値Ｂが共に、予め定めた所定値ｃより小さく（Ｒ、Ｇ、Ｂ＜ｃ）、かつ、赤色成分値Ｒ、緑色成分値Ｇ、青色成分値Ｂの最大値が最小値にある定数ｄを乗じた値より小さい（Ｍａｘ（Ｒ，Ｇ，Ｂ）＜ｄ・Ｍｉｎ（Ｒ，Ｇ，Ｂ））か否か、すなわち、３原色成分値が共に小さく、かつ、それらの値がほぼ同じであるか否かを判定し、３原色成分値が共に小さく、かつ、それらの値がほぼ同じである場合に、その注目画素を黒色として登録して処理を終了する（ステップ９０７、９０８）。
【００５５】
図１１は図３における成分値の濃化・淡化部３０６の構成を示すブロック図、図１２は濃化・淡化処理種類の決定に使用する濃化・淡化処理決定用の参照テーブルの例を説明する図であり、次に、これらについて説明する。図１１において、１２００は濃化・淡化処理種類の決定部、１２０１は濃化用乗算部、１２０２は淡化用乗算部、１２０３は選択部である。
【００５６】
成分値の濃化・淡化部３０６は、図１１に示すように、濃化・淡化処理種類の決定部１２００と、濃化用乗算部１２０１と、淡化用乗算部１２０２と、選択部１２０３とを備えて構成される。そして、原色成分選択部３０５からの出力である濃淡値３２１と、色成分選択判定部３０３からの記入文字色及びプレ印刷色の情報３２０と、画素色識別部３０８からの注目画素の色情報３２２とが図１１に示す成分値の濃化・淡化部３０６に入力される。
【００５７】
濃化・淡化処理種類の決定部１２００は、プレ印刷色及び記入色の情報３２０、注目画素の色情報３２２を用いて、入力された濃淡値３２１の濃化出力または淡化出力、あるいは、入力と同値を出力のいずれを選択するかを濃化・淡化処理決定用の参照テーブル１３００を参照して決定する。
【００５８】
濃化・淡化処理種類の決定部１２００において使用する濃化・淡化処理決定用の参照テーブルは、図１２に示すように構成されており、横の項目１３４０は、プレ印刷色の種類を示し、プレ印刷色が赤色１３０１、緑色１３０２、青色１３０３、黒色１３０４の場合を示す。また、縦の項目１３４２は、記入文字色の種類を示し、記入文字色が赤色１３０５、緑色１３０６、青色１３０７、黒色１３０８の場合を示す。さらに、縦の細分項目１３４１は、注目画素色の種類を示し、記入文字色のそれぞれに対して、注目画素の色が赤色１３１０、緑色１３１１、青色１３１２、黒色１３１３の場合がある。
【００５９】
そして、図示参照テーブル１３００には、プレ印刷色、記入文字色、注目画素色のそれぞれの組の場合に、濃化、淡化、同値の３種類を示す内容が予め、記憶格納されている。例えば、プレ印刷色１３０４が緑色１３０２で記入文字色１３４２が赤色１３０５の場合、もし注目画素色が赤色１３１０の場合、１３２０に示す濃化処理を濃化・淡化処理の種類として選択する。また、もし、注目画素色が緑色１３１１の場合、１３２１に示す淡化処理を濃化・淡化処理の種類として選択する。もし、注目画素色が青色１３１２、または、黒色１３１３の場合、濃化・淡化処理の種類として同値、即ち、濃化、淡化は行わず、そのままの値を出力するよう決定する。
【００６０】
濃化用乗算部１２０１は、入力された濃淡値３２１に所定の値を乗算し、濃淡値を小さく、すなわち、濃い濃度を出力する。また、淡化用乗算部１２０２は、入力された濃淡値３２１に所定の値を乗算し、濃淡値を大きく、すなわち、淡い濃度を出力する。選択部１２０３は、濃化・淡化処理種類の決定部１２００の結果出力に従って、入力された濃淡値３２１の濃化出力または淡化出力、あるいは、入力と同値を出力のいずれかを選択して１２１３として出力する。
【００６１】
前述までに説明した本発明の実施形態は、読み取るべき帳票のプレ印刷色、記入文字色を自動的に識別して処理を行うものとして説明したが、本発明は、プレ印刷色、記入文字色を予め読み取り帳票の種類を示す帳票識別番号に含ませておき、帳票識別番号を読み取って読み取り帳票の種類を設定することもできる。この場合、図１に示す色識別ドロップアウト２値化部１０２をプレ印刷ドロップアウト２値化部１０４だけで構成することができる。
【００６２】
図１３は帳票識別番号を読み取って読み取り帳票の種類を設定する場合の本発明の他の実施形態による文字読み取りシステムにおける画像処理部１０１及び文字読み取り部１０７での処理動作を説明するフローチャートであり、次に、これについて説明する。なお、この処理は、図２により説明したフローにおけるステップ２０４の処理でリジェクトと判定された後に行われることになる。
【００６３】
（１）まず、対象となるの帳票の種類を設定する。ここでは、帳票に記載されている帳票識別番号を読み取って帳票の種類を設定してもよく、あるいは、帳票画像から枠線の配置を抽出し、予め用意している枠線の配置と照合する帳票識別とにより、帳票の種類を設定してもよい（ステップ１０００）。
【００６４】
（２）次に、ステップ１０００で設定した帳票の種類毎に予め用意されている書式情報、すなわち、フォーマット情報を入力する。フォーマット情報としては、フィールドの位置座標、プレ印刷色や記入色等が予め、帳票毎に用意されているものを用いる（ステップ１００１）。
【００６５】
（３）次に、フォーマット情報として設定されているフィールドが尽きるまで、次に説明するステップ１００３〜ステップ１００８までの処理を繰り返して、処理を終了する（ステップ１００２）。
【００６６】
（４）フォーマット情報からそのフィールドの位置座標を設定し、また、フォーマット情報からそのフィールドのプレ印刷色を設定する。さらに、フォーマット情報からそのフィールドの記入文字色を設定する（ステップ１００３〜１００５）。
【００６７】
（５）次に、フィールド内のカラー画像を入力し、先に設定したプレ印刷色と記入文字色とを用いてプレ印刷色をドロップアウトする２値化処理を行う。そして、２値化画像に対してフィールド内の文字読み取りを行う（ステップ１００６〜１００８）。
【００６８】
前述した本発明の実施形態による処理は、処理プログラムとして構成することができ、この処理プログラムは、ＨＤ、ＤＡＴ、ＦＤ、ＭＯ、ＤＶＤ−ＲＯＭ、ＣＤ−ＲＯＭ等の記録媒体に格納して提供することができる。
【００６９】
前述したように、本発明の実施形態によれば、フィールド内のカラー画像からプレ印刷・記入文字の色を識別する手段と、識別したプレ印刷色をドロップアウトした２値化画像を生成する手段とを備え、ドロップアウトした２値化画像に対して文字読み取りを再試行しているので、プレ印刷と記入文字とが重なったフィールドでの文字読み取りの信頼性を向上させることができる。また、プレ印刷色をドロップアウトした２値化画像を生成する手段のみを備えるより処理時間の軽減を図ることができる。
【００７０】
また、本発明の実施形態によれば、帳票内のフィールドのプレ印刷色と記入文字色を設定する手段を備えているため、帳票毎、また、フィールド毎にプレ印刷色が異なっている帳票でもプレ印刷をドロップアウトすることができ、文字読み取りの信頼性を向上させることができる。さらに、プレ印刷色を基にカラー画像の３原色成分から単色成分または複数成分の最大値を選択して濃淡画像を生成する手段と、プレ印刷色と記入文字色と注目画素色とを基に当該濃淡画像を濃化または淡化する手段とを備えているため、単色のカラードロップアウトや、複数色、例えば、赤色と青色との同時ドロップアウトに濃淡値を用いることができ、カラードロップアウト画像の画質を向上させることができる。
【００７１】
また、本発明の実施形態によれば、帳票画像のフィールド毎に、プレ印刷色または記入文字色を帳票書式として予め登録する手段と、フィールド毎に、登録しているフィールドのプレ印刷色または記入文字色を読み出す手段を備えているため、書式が既知である帳票に対して、カラードロップアウト画像の画質を向上させることができる。
【００７２】
さらに、本発明の実施形態によれば、帳票画像のフィールド毎に、フィールド内に探索領域を複数個設定する手段と、所定の探索領域内でプレ印刷色を識別する手段、または、別の所定の探索領域内で記入文字色を識別する手段を備えているため、プレ印刷色に関する書式が未知である帳票に対しても、カラードロップアウトができるという効果を得ることができる。
【００７３】
なお、前述までに説明した本発明の他の実施の態様を記せば、次の通りである。
【００７４】
１．帳票画像のフィールドから文字読み取りのための白黒２値化画像を生成する画像処理装置において、フィールド内のカラー画像から３原色成分の最小値に対する２値化を行う２値化画像生成手段と、フィールド内のカラー画像からプレ印刷色をドロップアウトした２値化画像を生成する手段とを備えたことを特徴とする画像処理装置。
【００７５】
２．帳票画像のフィールドから文字読み取りのための白黒２値化画像を生成する画像処理装置において、フィールド内のカラー画像から３原色成分の最小値に対する２値化を行う２値化画像生成手段と、フィールド内のカラー画像からプレ印刷色をドロップアウトした２値化画像を生成する手段とを備え、前記フィールド内のカラー画像からプレ印刷色をドロップアウトした２値化画像を生成する手段は、帳票フィールド内のプレ印刷色または記入文字色を設定する手段と、プレ印刷色を基にカラー画像の３原色成分から単色成分または複数成分の最大値を選択して濃淡画像を生成する手段と、プレ印刷色と記入文字色と注目画素色とを基に前記濃淡画像を濃化または淡化する手段と、濃化または淡化した濃淡画像を閾値により白黒２値化する手段とにより構成されることを特徴とする画像処理装置。
【００７６】
３．前記帳票のプレ印刷色または記入文字色を設定する手段は、帳票画像のフィールド毎に、プレ印刷色または記入文字色を帳票書式として予め登録する手段と、フィールド毎に、登録している当該フィールドのプレ印刷色または記入文字色を読み出す手段とにより構成されることを特徴とする請求項２記載の画像処理装置。
【００７７】
４．前記帳票のプレ印刷色または記入文字色を設定する手段は、帳票画像のフィールド毎に、フィールド内に探索領域を複数個設定する手段と、所定の探索領域内でプレ印刷色を識別する手段、または、別の所定の探索領域内で記入文字色を識別する手段とにより構成されることを特徴とする請求項２記載の画像処理装置。
【００７８】
５．帳票画像のフィールドから生成された白黒２値化画像を読み取る文字読み取り装置と、帳票画像のフィールドから白黒２値化画像を生成する画像処理装置とを備えた文字読み取りシステムにおいて、前記画像処理装置は、フィールド内のカラー画像から３原色成分の最小値に対する２値化を行う２値化画像生成手段と、フィールド内のカラー画像からプレ印刷色をドロップアウトした２値化画像を生成する手段と、前記文字読み取り装置に、前記３原色成分の最小値に対する２値化を行う２値化画像生成手段からの２値化画像の読み取りを行わせ、その結果がリジェクトであった場合に、前記カラー画像からプレ印刷色をドロップアウトした２値化画像を生成する手段からの２値化画像の読み取りを再試行させる手段とを備えたことを特徴とする文字読み取りシステム。
【００７９】
【発明の効果】
以上説明したように本発明によれば、処理対象とするカラー帳票において、枠線等のプレ印刷の色と記入文字の色とが異なるような帳票画像に対して、プレ印刷と記入文字とが接触あるいは重なっていても、プレ印刷を消去し、記入文字を残した２値画像を生成することができ、精度の高い文字認識を行わせることができる。
【図面の簡単な説明】
【図１】本発明の一実施形態による画像処理装置を含む文字読み取りシステムの構成を示すブロック図である。
【図２】図１に示す文字読み取りシステムにおける画像処理部及び文字読み取り部での処理動作を説明するフローチャートである。
【図３】色識別ドロップアウト２値化部の構成を示すブロック図である。
【図４】図２により説明したフローにおけるステップ２０６でのプレ印刷・記入文字の色識別の処理動作を説明する図である。
【図５】文字読み取り部における文字読み取りの処理動作を説明する図である。
【図６】処理対象であるカラー帳票の例を説明する図である。
【図７】本発明の実施形態による画像処理を行って得られる結果としての２値画像の例を説明する図である。
【図８】フィールド内のプレ印刷と記入文字との色を識別する領域を説明する図である。
【図９】図４により説明した画素の色判定のステップ４０３での処理動作の詳細を説明するフローチャートである。
【図１０】図３における原色成分選択部の構成を示す図である。
【図１１】図３における成分値の濃化・淡化部の構成を示すブロック図である。
【図１２】濃化・淡化処理種類の決定に使用する濃化・淡化処理決定用の参照テーブルの例を説明する図である。
【図１３】本発明の他の実施形態による文字読み取りシステムにおける処理動作を説明するフローチャートである。
【符号の説明】
１００カラー画像スキャナ
１０１画像処理部
１０２色識別ドロップアウト２値化部
１０３プレ印刷・記入文字の色識別部
１０４プレ印刷ドロップアウト２値化部
１０５３原色成分最小値２値化部
１０６制御部
１０７文字読み取り部
１０８画像蓄積部
１０９ネットワーク
３０１プレ印刷色識別部
３０２記入文字色識別部
３０３色成分選択判定部
３０５原色成分選択部
３０６成分値の濃化・淡化部
３０７２値化部
３０８画素識別部
１２００濃化・淡化処理種類の決定部
１２０１濃化用乗算部
１２０２淡化用乗算部
１２０３選択部[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an image processing program and an image processing apparatus, and more particularly, to an optical character reader (hereinafter referred to as OCR) that optically reads a surface image of a form and recognizes a predetermined area of the read image. The present invention relates to an image processing program and an image processing apparatus suitable for use.
[0002]
[Prior art]
2. Description of the Related Art Conventionally, written characters entered on a form by handwriting or the like are optically read and recognized by an OCR and processed. In general, letters (referred to as pre-printing) that are unnecessary for reading a frame to be filled in, a yen in a monetary unit, or the like are printed on a form. Pre-printing of these frames and printed characters is not necessary for reading, and a color determined to be white by OCR, that is, a predetermined color called a dropout color is used. This dropout color is fixedly and strictly determined according to the wavelength of the light source or optical filter used in the photoelectric conversion unit. For this reason, the colors that can be used for pre-printing of forms are limited, and pre-printing is performed for forms printed in colors other than the specified color, such as color forms that have not been previously input by OCR. It was difficult to drop.
[0003]
As a conventional technique capable of dropping pre-printing on a form printed in a color other than the designated color, for example, a technique described in JP-A-6-176194 is known. This prior art is provided with a photoelectric conversion unit capable of outputting a color image, and based on the image of the reference form, by preparing in advance by color identification a part to be removed from the form image to be processed, The preprinted part of the form to be processed is dropped. With this conventional technique, it is possible to read the entered characters and recognize them by OCR without the restriction on the color used for pre-printing the form, but a reference form with no characters is prepared in advance. There is a need. In addition, in this prior art, when pre-printing such as a frame line and an entry character are in contact with each other and overlapping, no consideration is given to preventing the entry character from disappearing at the overlapping portion.
[0004]
Further, as a conventional technique related to image processing for causing an OCR to read, a technique described in JP-A-6-195509 is known. This prior art is provided with a photoelectric conversion unit that gives primary color components (red, green, blue) of each pixel from a multicolor image on the premise that the character color is black, and has the maximum value of these primary color components. A signal is output to remove the color and extract only the entered characters. However, the purpose of this prior art is to remove the color from the multicolored background of a document such as a check. If the character is colored, for example, if it is a blue character, the color is removed by removing the color. There is a problem that characters are also removed.
[0005]
Further, as another conventional technique, for example, a technique described in Japanese Patent Laid-Open No. 10-27213 is known. This conventional technology uses three primary colors of red, green, and blue from the photoelectric conversion unit so that characters written in chromatic colors, such as red pencils, red ballpoint pens, and blue ballpoint pens, can be read in the same way as achromatic characters. When a component is input, it is determined whether the color of the pixel belongs to an achromatic color, red color, green color, or blue color group, and the pixel is a character color specified in advance The maximum value of the three primary color components is output as a monochrome luminance signal of the pixel. If it is not a character color, the background level is output as a monochrome luminance signal for that pixel. And it converts into the monochrome luminance signal of the pixel. That is, multi-value image data is converted into binary image data. However, in this prior art, when preprints such as frame lines and written characters are in contact with each other and overlapped, consideration is given to leaving the written characters and dropping the preprints so that the overlapping portion does not disappear. Not.
[0006]
Further, as another conventional technique, a technique described in JP-A-6-290302 is known. This prior art performs provisional color separation on input image data when preprints such as frame lines and written characters come in contact with each other, and pixels that constitute each part of the provisional color separation The final color separation is determined by using the geometric information in combination with the density information of each portion and background. However, this prior art includes a binary image buffer, a gray image buffer, and a color image buffer. When a character targeted for a binary image is cut out and character recognition is performed, a specific portion of the input image, for example, a character When rechecking a portion where a hole is formed in a stroke or a contact portion between a ruled line and a character, the character recognition is performed by cutting out the character with reference to the above-described gray image buffer and color image buffer.
[0007]
For this reason, this conventional technique has a problem that it is necessary to access a gray image or a color image for each processing unit such as a character cutout unit or a character recognition unit, and the control becomes complicated. Further, this prior art does not generate and output a binary image in which the pre-printing is erased and the written characters are left, but the pre-printing is removed to correct the reading result, and the written characters are left. No consideration is given to displaying value images on the screen. In addition, this conventional technique uses linear geometric information such as growing an area that can be regarded as the same color. The reliability is not considered.
[0008]
[Problems to be solved by the invention]
The prior art described above has various problems as described together with the explanation of each prior art.
[0009]
An object of the present invention is to solve the above-mentioned problems of the prior art, and in a color form to be processed, a pre-print color such as a frame line is different from a pre-printed color for a form image. Image processing program and image capable of generating pre-printing and generating a binary image with the entry characters left, and performing highly accurate character recognition even if the print and the entry characters contact or overlap It is to provide a processing apparatus.
[0010]
[Means for Solving the Problems]
According to the present invention, the object is to input an input form image. Binarize the minimum value of the three primary color components Step and said A step of performing character recognition of the binarized image as a result of binarization, and a step of outputting the recognized character when the recognized character is accepted, and when the recognized character is rejected, Of the entered form image Pre-print color And The input color and Steps to set The form image No Drop out print color To generate a binarized image Step and said dropout And binarized binary image The step of character recognition and Output the recognized characters, and This is achieved by causing the information equipment to execute.
[0012]
Furthermore, the purpose is to input the form image. Binarize the minimum value of the three primary color components Means and said Means for performing character recognition of a binarized image obtained as a result of binarization; means for outputting a recognized character when the recognized character is accepted; and input when the recognized character is rejected. Of the form image Pre-print color And The input color and A means of setting The form image No Drop out print color Means for generating a binarized image by binarization And the dropout And binarized binary image Means for character recognition and , Means to output recognized characters and This is achieved by having
[0013]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, an embodiment of an image processing apparatus and a character reading system according to the present invention will be described in detail with reference to the drawings.
[0014]
FIG. 1 is a block diagram showing a configuration of a character reading system including an image processing apparatus according to an embodiment of the present invention. In FIG. 1, 100 is a color image scanner, 101 is an image processing unit, 102 is a color identification dropout binarization unit, 103 is a preprint / entry color identification unit, 104 is a preprint dropout binarization unit, Reference numeral 105 denotes a three primary color component minimum value binarization unit, 106 denotes a control unit, 107 denotes a character reading unit, 108 denotes an image storage unit, and 109 denotes a network.
[0015]
A character reading system according to an embodiment of the present invention includes a color image scanner 100 that collects a color image of a form surface, an image processing unit 104 that binarizes color image information from the color image scanner 100 into black and white, and 2 A character reading unit 107 such as an OCR for reading the digitized image data and an image storage unit 108 are connected by a network 109.
[0016]
The image processing unit 101 selects the minimum value of the three primary color components of the image data input from the color image scanner 100, that is, the data portion of the darkest part of the three primary color components, and performs the black and white binarization processing. A minimum value binarization unit 105, a color identification dropout binarization unit 102 that identifies preprinted colors and performs dropout processing on the colors, and a control unit 106 that controls selection and execution of these processes Composed. The three primary color component minimum value binarization unit 105 selects the minimum value of the three primary color components of the input color image and binarizes the primary color component as a gray value. As a method of binarizing the gray value, for example, as disclosed in Japanese Patent Application Laid-Open No. 11-120333, a method of determining a binarization threshold value from respective gray values of a pixel of interest and its surrounding pixels. It may be.
[0017]
The color identification dropout binarization unit 102 includes a color identification unit 103 that performs preprinting and color identification of written characters, and a binarization unit 104 that drops out a preprinted color using a color identification result. The character reading unit 107 performs character segmentation and character recognition from the binary image sent from the image processing unit 101, and the image storage unit 108 stores the binary image sent to the character reading unit 107. .
[0018]
FIG. 2 is a flowchart for explaining processing operations in the image processing unit 101 and the character reading unit 107 in the character reading system shown in FIG. 1, and this will be described next. Before the processing shown in FIG. 2 is started, the color image scanner 100 scans a form for character reading, collects a color image of the form, and transfers the image to the image processing unit 101. This image is temporarily stored and held in a buffer (not shown) provided in the color image scanner 100 or the image processing unit 101.
[0019]
(1) The binarization unit 105 for the minimum value of the three primary color components in the image processing unit 101 is the data of the minimum value of the three primary color components, that is, the darkest part of the three primary color components for the image from the color image scanner 100. A binary image obtained as a result of selecting a portion and performing the binarization process is transmitted to the network 109 via the control unit 106 (step 200).
[0020]
(2) The character reading unit 107 inputs the image binarized by the three primary color component minimum value binarizing unit 105 and designates an area to be read, that is, a field. The field can be specified by registering the field coordinates in advance as a form format (format) and selecting the field to select when reading the format, or by extracting the border of the form, A method of determining an area to be read from the arrangement and arrangement of frames may be used (steps 201 and 202).
[0021]
(3) Next, the character reading unit 107 reads characters in the field by cutting out characters in the field and executing character recognition. Next, it is determined whether the result of character reading in the field is accept or reject. As a result of this determination, if it is accepted, the read results are integrated and simply output (steps 203, 204, and 209).
[0022]
(4) If the result of character reading is rejected in the determination in step 204, the color image acquired by the color image scanner 100 is converted into the color of the preprinted / written character of the color identification dropout binarization unit 102. It inputs into the identification part 103 (step 205).
[0023]
(5) The color identification unit 103 performs color identification of preprinted portions such as form frame lines and guide character strings, and color identification of written characters from the input color image (step 206).
[0024]
(6) Next, the pre-print dropout binarization unit 104 uses the pre-print color from the color identification unit 103 to drop out the color and generate a binary image. If it is determined that pre-printing and written characters have the same color and the drop-out binarization is unsuccessful, the pre-print drop-out binarization unit 104 outputs a reject (step 207).
[0025]
(7) The character reading unit 107 reads characters in the field from the binary image transmitted from the pre-print dropout binarization unit 104 to the network 109, and integrates and outputs the read results ( Steps 208 and 209).
[0026]
As described above, the character reading system according to the embodiment of the present invention determines whether the result of the previous character reading in the field is accept or reject in the process of step 204, and in the case of reject, the pre-print is dropped out 2 Since character reading is performed on the value image, it is possible to prevent the pre-printed portion from becoming an obstacle to character recognition, and it is possible to improve the reliability of character reading.
[0027]
FIG. 3 is a block diagram showing the configuration of the color identification dropout binarization unit 102, FIG. 8 is a diagram for explaining a region for identifying the colors of preprints and written characters in the field, and FIG. 10 is a primary color component in FIG. 3 is a diagram illustrating a configuration of a selection unit 305. FIG. 3 and 10, reference numeral 301 denotes a pre-print color identification unit, 302 denotes an entry character color identification unit, 303 denotes a color component selection determination unit, 305 denotes a primary color component selection unit, 306 denotes a component value thickening / lightening unit, and 307. Is a binarization unit, 308 is a pixel identification unit, and the other symbols are the same as those in FIG.
[0028]
The color identification unit 103 includes a pre-print color identification unit 301, an entry character color identification unit 302, and a color component selection determination unit 303, and performs pre-printing and color identification of entry characters. Then, the blue component 310, the green component 311, and the red component 312 of the color image from the color image scanner 100 are input to the pre-print color identification unit 301 and the entry character color identification unit 302. The pre-print color identification unit 301 sets a process area for identification with respect to a partial color image in the field, and detects the color distribution of the pixels in the area to identify the pre-print color. . Note that the preprint color is not limited to one color, and may be a plurality of colors. For example, two colors of red and green may be extracted as the preprint colors.
[0029]
Next, with reference to FIG. 8, the area | region which identifies the color of the pre-print in a field and the entry character is demonstrated. In FIGS. 8A and 8B, the frame line 800 in the field 802 is colored, for example, a red or blue line. The entry character 801 is a reading target and is a black or colored character. The preprinted frame line 800 and the entry character 801 are shown as being entered in an overlapping state. When the color of the frame line is different from the color of the written characters, the frame line can be dropped out.
[0030]
In order to identify preprinted colors such as frame lines, color identification processing areas 803 and 804 are set as shown in FIG. In this processing region, pixel color identification is performed, the distribution of color pixels is obtained, and the preprint color is determined. Further, in order to identify the color of the entered character, a color identification processing area 813 is set as shown in FIG. In this processing area, when the pixel color is identified, the distribution of the color pixel is obtained, and the color of the written character is determined.
[0031]
Returning to the description of FIG. 3, the entry character color identification unit 302 sets a processing area for identification, for example, an area 813, for the partial color image in the field, and sets the color of the pixel in the area. The appearance distribution is detected and the color of the character is identified. Further, the color component selection determination unit 303 determines a primary color component to be selected as a binarized gray image among the three primary color components using the identified pre-print color and entry character color. For example, if the pre-print color is identified as red and the entry character color is identified as blue, the red component is determined as the primary color component to be selected. Further, if the pre-print color is identified as green and the entry character color is identified as black, the green component is determined as the primary color component to be selected. Then, the color component selection determination unit 303 outputs the preprint color to the primary color component selection unit 305 of the dropout binarization unit 104, and dropsout information 2 on the preprint color and entry character color. The value is output to the component value thickening / lightening unit 306 of the value converting unit 104.
[0032]
The above-described processing in the color identification unit 103 is performed on the color image of one field to identify the color. Then, after the above-described color identification is performed, the dropout binarization unit 104 described below uses the color image data in the same field to generate a monochrome binary image in which the pre-printed color is dropped out. Processing to generate is performed.
[0033]
As shown in FIG. 3, the dropout binarization unit 104 includes a primary color component selection unit 305, a component value thickening / lightening unit 306, a pixel color identification unit 308, and a grayscale binarization unit 307. In accordance with the result of the color component selection determination unit 303, the primary color component selection unit 305 simply selects one primary color component from among the three primary color components, that is, the blue component 310, the green component 311, and the red component 312 for each input pixel. Select and output the gray value.
[0034]
As shown in FIG. 10, the primary color component selection unit 305 selects one of the input blue component 310, green component 311, and red component 312 according to the color type information 1104 output from the color component selection determination unit 303. A primary color component is selected and output as a gray value 1105. Here, if the color type information 1104 is red, the red component 310 is output as the gray value 1105. If the color type information 1104 is green, the green component 311 is output as the gray value 1105. Furthermore, if the color type information 1104 is blue, the blue component 311 is output as a gray value 1105.
[0035]
In the above description, when the color type information 1104 output from the color component selection determination unit 303 indicates, for example, two colors of red and blue among the three primary color components as the pre-print color, the primary color component selection unit 305 may select the maximum value of the red component and blue component of each input pixel and output the maximum value as a gray value.
[0036]
The component value thickening / lightening unit 306 performs a process of reducing (darkening) or increasing (lightening) the magnitude of the lightness value of the input color. At this time, the pixel color identification unit 308 identifies the color of the pixel of interest, and sends the corresponding color type 322 to the darkening / lightening unit 306. The darkening / lightening unit 306 receives the pre-print color, the written character color, and the color of the pixel of interest, and darkens or lightens the gray value of the pixel of interest according to the type of the input color. For example, if the pre-print color is red, the entry character color is blue, and the pixel of interest is red, the pixel of interest is lightened. That is, the gray value of the pixel of interest is increased to approach white. The darkened / lightened gray value that is the output of the darkening / lightening unit 306 is input to the light / dark binarization unit 307 and is generated into a binary image. This density binarization unit 307 can be realized by, for example, a method disclosed in JP-A-11-120333.
[0037]
FIG. 4 is a diagram for explaining the processing operation for color identification of pre-printed / written characters in step 206 in the flow explained with reference to FIG. 2, which will be explained next. This process is executed separately by setting a region for each of the color identification of the pre-print and the color identification of the written characters.
[0038]
(1) First, the search range in the color image in the field is set. The search range is, for example, the processing areas 803 and 804 used for identifying the pre-print color and the processing area 813 used for identifying the entered character color as described with reference to FIG. 8 (step 400).
[0039]
(2) Next, the image in the area set in the field is scanned. The scanning procedure is performed by repeating the processing from step 403 to step 407 described below while scanning the target pixel in the horizontal direction and then in the vertical direction and scanning the image (step 401).
[0040]
(3) First, the color of the target pixel is determined while scanning the image. Here, as the color type of the target pixel, color identification is performed for four types of blue, green, red, and black, and the color type is obtained (step 403).
[0041]
(4) Next, the number of pixel colors determined in step 403 is counted. That is, if the target pixel is determined to be blue in step 403, the number of blue pixels is increased by one, and if the target pixel is determined to be green in step 403, the number of green pixels is increased by one. Similarly, when it is determined in step 403 that the target pixel is red or black, the number of red pixels and the number of black pixels are increased by one (steps 404 to 407).
[0042]
(5) Next, based on the counted number of each of the blue pixel, the green pixel, and the red pixel, the color type of pre-printing or written characters is determined. Here, among the number of blue pixels, green pixels, red pixels, and black pixels, the color having the maximum number is determined as the target color type, and the process ends (step 402).
[0043]
FIG. 5 is a diagram for explaining a character reading processing operation in the character reading unit 107, which will be described next.
[0044]
(1) First, a binary image in a set field is input, a character line that is a sequence of characters is extracted, and characters are cut out (steps 500 to 502).
(2) Character recognition of the cut out characters is performed, and knowledge processing is performed. The knowledge process is a process of preparing a dictionary of addresses and first and last names in advance and matching the character recognition result with the addresses and first and last names in the dictionary to improve the recognition accuracy (steps 503 and 504).
[0045]
FIG. 6 is a diagram for explaining an example of a color form to be processed. Next, this will be explained.
[0046]
For example, as illustrated in FIG. 6, frames 601, 603, and 608 are printed on the color form 600. In the illustrated example, characters 602 are entered in the frame 601. Also, pre-printed characters 604 and 605 are pre-printed on the frame 603, and characters 606 and 607 are entered. Further, a character 609 is entered in a frame 608. As in the example shown in the figure, there are forms in which the input characters and pre-printing are in contact with each other, and forms that overlap. For example, in the example shown in the figure, the entry character 602 and the frame 601 overlap. In addition, entry characters 606 and 607 and pre-prints 604 and 605 overlap. When pre-printing and written characters are in contact with each other or overlapped, it becomes an obstacle to character reading. Therefore, it is necessary to drop out pre-printing.
[0047]
Here, as an example, in a field where the frame 601 is red and the entry character 602 is blue, or in a field where the frame 603 and the preprints 604 and 605 are green, the frames 601 and 603 and the preprint characters 604 are preprinted. 605 is dropped out, that is, white, and the characters 602, 606, and 607 are black. Note that when the color of the frame 608 and the color of the entry character 609 are the same color, the pre-print dropout using the color information in the embodiment of the present invention is not applicable. As an example in which the color of the entry character 609 and the color of the pre-print frame 608 are the same color, the entry character 609 is green and the pre-print frame 608 is green, or the entry character 609 is black and the pre-print frame 608 is black. There are cases. In this way, it is difficult to perform pre-print dropout in the case of the same color, and in the case of the same color, both the pre-print and the input characters are output as black in the process of step 207 described with reference to FIG.
[0048]
FIG. 7 is a diagram illustrating an example of a binary image as a result obtained by performing image processing according to the embodiment of the present invention. The example shown in FIG. 7 shows a binary image 700 as a result of dropping out pre-printing from the color image of the color form 600 described with reference to FIG. In the illustrated binary image 700, the entry characters 602, 606, and 607 in FIG. 6 are processed as black as the entry characters 701, 702, and 703, and the preprints 601, 603, 604, and 605 are dropped out and become white. Yes. On the other hand, the pre-print 608 and the entry character 609 are not dropped out in the binary image 700 because of the same color, and both are expressed in black.
[0049]
FIG. 9 is a flowchart for explaining the details of the processing operation in step 403 of the pixel color determination described with reference to FIG. 4. Next, this will be described.
[0050]
(1) First, the three primary color components of the target pixel, that is, the red component value R, the green component value G, and the blue component value B are input. In general, the size of the color component of the pixel is expressed by 8-bit information, and each of R, G, and B takes a value of 0 to 225 (step 900).
[0051]
(2) Next, the red component value R is compared with other component values to determine whether or not the conditions of R> a · G and R> a · B are satisfied. If so, the pixel of interest is registered as red. In the above equation, a is a predetermined constant of 1 or more, and the same applies to the subsequent steps. When the constant a is set to a large value, only a pure color, in this case pure red, is determined as red (steps 901 and 902).
[0052]
(3) If the condition is not satisfied in the determination of step 901, the green component value G and the other component values are compared, and G> a · R and G> a · B are satisfied. It is determined whether or not the condition is satisfied. If the condition is satisfied, the target pixel is registered as green (steps 903 and 904).
[0053]
(4) If the condition is not satisfied in the determination in step 903, the blue component value B is compared with the other component values as described above, and B> a · R and B> a · It is determined whether or not the condition G is satisfied. If the condition is satisfied, the target pixel is registered as blue (steps 905 and 906).
[0054]
(5) Further, if the condition is not satisfied in the determination of step 905, then the red component value R, the green component value G, and the blue component value B are all smaller than a predetermined value c (R, G, B <c), and the maximum value of the red component value R, the green component value G, and the blue component value B is smaller than a value obtained by multiplying the minimum value by a constant d (Max (R, G, B) <d Whether or not Min (R, G, B)), that is, the three primary color component values are both small and the values are substantially the same, and the three primary color component values are both small and If the values are substantially the same, the pixel of interest is registered as black, and the process ends (steps 907 and 908).
[0055]
FIG. 11 is a block diagram illustrating the configuration of the component value thickening / lightening unit 306 in FIG. 3, and FIG. 12 illustrates an example of a thickening / lightening processing determination reference table used to determine the type of darkening / lightening processing. These will be described next. In FIG. 11, reference numeral 1200 denotes a thickening / lightening processing type determination unit, 1201 denotes a thickening multiplication unit, 1202 denotes a lightening multiplication unit, and 1203 denotes a selection unit.
[0056]
As shown in FIG. 11, the component value thickening / lightening unit 306 includes a thickening / lightening processing type determination unit 1200, a thickening multiplication unit 1201, a lightening multiplication unit 1202, and a selection unit 1203. It is prepared for. Then, the gray value 321 output from the primary color component selection unit 305, the entered character color and preprint color information 320 from the color component selection determination unit 303, and the color information 322 of the target pixel from the pixel color identification unit 308 are displayed. Are input to the component value thickening / lightening unit 306 shown in FIG.
[0057]
The thickening / lightening processing type determination unit 1200 uses the preprint color and entry color information 320 and the color information 322 of the target pixel to perform darkening output or lightening output of the input lightness value 321, or input It is determined with reference to the reference table 1300 for determining the darkening / lightening process which of the outputs to select the same value.
[0058]
The thickening / lightening process determination reference table used in the thickening / lightening process type determination unit 1200 is configured as shown in FIG. 12, and the horizontal item 1340 indicates the type of pre-print color, A case where the pre-print colors are red 1301, green 1302, blue 1303, and black 1304 is shown. Also, the vertical item 1342 indicates the type of entry character color, and indicates the case where the entry character color is red 1305, green 1306, blue 1307, and black 1308. Further, the vertical subdivision item 1341 indicates the type of the target pixel color, and the color of the target pixel may be red 1310, green 1311, blue 1312, or black 1313 for each entry character color.
[0059]
In the illustrated reference table 1300, contents indicating three types of darkening, lightening, and equivalence are stored and stored in advance in the case of each set of pre-print color, entry character color, and target pixel color. For example, if the pre-print color 1304 is green 1302 and the entry character color 1342 is red 1305, and if the target pixel color is red 1310, the darkening process indicated by 1320 is selected as the type of darkening / lightening process. If the target pixel color is green 1311, the lightening process indicated by 1321 is selected as the type of darkening / lightening process. If the target pixel color is blue 1312 or black 1313, the same value as the type of darkening / lightening processing, that is, no darkening or lightening is performed, and the value is output as it is.
[0060]
The thickening multiplication unit 1201 multiplies the input gray value 321 by a predetermined value to decrease the gray value, that is, outputs a dark density. Further, the lightening multiplication unit 1202 multiplies the input gray value 321 by a predetermined value, and increases the gray value, that is, outputs a light density. The selection unit 1203 selects either the density output or density output of the input density value 321 or the output of the same value as the input as 1213 according to the result output of the density / lightening processing type determination unit 1200. Output.
[0061]
In the embodiments of the present invention described above, the preprint color and entry character color of the form to be read are automatically identified and processed. However, the present invention is applied to the preprint color and entry character color. Can be included in the form identification number indicating the type of the read form, and the type of the read form can be set by reading the form identification number. In this case, the color identification dropout binarization unit 102 shown in FIG. 1 can be configured by the pre-print dropout binarization unit 104 alone.
[0062]
FIG. 13 is a flowchart for explaining processing operations in the image processing unit 101 and the character reading unit 107 in the character reading system according to another embodiment of the present invention when the form identification number is read and the type of the reading form is set. Next, this will be described. Note that this processing is performed after it is determined to be rejected in step 204 in the flow described with reference to FIG.
[0063]
(1) First, the type of the target form is set. Here, the type of form may be set by reading the form identification number described in the form, or the layout of the frame line may be extracted from the form image and collated with the layout of the prepared frame line. The form type may be set based on the form identification (step 1000).
[0064]
(2) Next, format information prepared in advance for each type of form set in step 1000, that is, format information is input. As the format information, information in which field position coordinates, pre-print colors, entry colors, etc. are prepared in advance for each form is used (step 1001).
[0065]
(3) Next, the processing from step 1003 to step 1008 described below is repeated until the field set as format information is exhausted, and the processing is terminated (step 1002).
[0066]
(4) The position coordinates of the field are set from the format information, and the pre-print color of the field is set from the format information. Further, the entry character color of the field is set from the format information (steps 1003 to 1005).
[0067]
(5) Next, a color image in the field is input, and binarization processing is performed to drop out the pre-print color using the pre-print color and the entry character color set in advance. Then, the characters in the field are read from the binarized image (steps 1006 to 1008).
[0068]
The above-described processing according to the embodiment of the present invention can be configured as a processing program, and this processing program is provided by being stored in a recording medium such as HD, DAT, FD, MO, DVD-ROM, or CD-ROM. be able to.
[0069]
As described above, according to the embodiment of the present invention, the means for identifying the color of the preprint / entry character from the color image in the field, and the means for generating the binarized image in which the identified preprint color is dropped out Since the character reading is retried with respect to the binarized image that has been dropped out, it is possible to improve the reliability of character reading in a field where pre-printing and written characters overlap. Further, it is possible to reduce the processing time by providing only a means for generating a binarized image in which the pre-printed color is dropped out.
[0070]
In addition, according to the embodiment of the present invention, since a means for setting the pre-print color and the entry character color of the field in the form is provided, even a form having a different pre-print color for each form or for each field. Pre-printing can be dropped out, and the reliability of character reading can be improved. Furthermore, based on the pre-print color, entry character color, and target pixel color, means for generating a grayscale image by selecting the maximum value of a single color component or multiple components from the three primary color components of the color image based on the pre-print color And a means for darkening or lightening the grayscale image, so that a gray value can be used for a single color dropout or a simultaneous dropout of a plurality of colors such as red and blue. Image quality can be improved.
[0071]
Further, according to the embodiment of the present invention, means for pre-registering a pre-print color or a text color as a form format for each field of the form image, and a pre-print color or entry of the registered field for each field Since a means for reading out the character color is provided, the image quality of the color dropout image can be improved with respect to a form whose format is known.
[0072]
Further, according to the embodiment of the present invention, means for setting a plurality of search areas in the field for each field of the form image, means for identifying the pre-print color in the predetermined search area, or another predetermined Since the means for identifying the character color of the entry within the search area is provided, it is possible to obtain an effect that color dropout can be performed even for a form whose format relating to the pre-print color is unknown.
[0073]
In addition, it will be as follows if the other embodiment of this invention demonstrated so far is described.
[0074]
1. In an image processing apparatus for generating a black and white binarized image for reading characters from a field of a form image, a binarized image generating means for binarizing a minimum value of three primary color components from a color image in the field, and a field An image processing apparatus comprising: means for generating a binarized image obtained by dropping out a preprint color from a color image therein.
[0075]
2. In an image processing apparatus for generating a black and white binarized image for reading characters from a field of a form image, a binarized image generating means for binarizing a minimum value of three primary color components from a color image in the field, and a field Means for generating a binarized image in which a pre-printed color is dropped out of a color image in the image, and means for generating a binarized image in which the pre-printed color is dropped out of the color image in the field is a form field. Means for setting a pre-print color or text color, means for selecting a maximum value of a single color component or a plurality of components from the three primary color components of the color image based on the pre-print color, and a pre-print Means for thickening or fading the grayscale image based on the color, the character color of interest and the pixel color of interest, and a method of binarizing the darkened or lightened grayscale image with a threshold value The image processing apparatus characterized by being composed of a.
[0076]
3. The means for setting the pre-print color or entry character color of the form includes means for pre-registering the pre-print color or entry character color as a form format for each field of the form image, and the field registered for each field. The image processing apparatus according to claim 2, comprising: a pre-print color or a character color for reading.
[0077]
4). Means for setting the preprint color or text color of the form, means for setting a plurality of search areas in the field for each field of the form image, means for identifying the preprint color in the predetermined search area, 3. The image processing apparatus according to claim 2, further comprising: means for identifying a character color for entry in another predetermined search area.
[0078]
5). In a character reading system including a character reading device that reads a black and white binarized image generated from a field of a form image and an image processing device that generates a black and white binarized image from the field of a form image, the image processing device includes: Binarized image generating means for binarizing the minimum value of the three primary color components from the color image in the field; means for generating a binarized image in which the pre-printed color is dropped out from the color image in the field; When the character reading device is caused to read a binarized image from a binarized image generating unit that binarizes the minimum value of the three primary color components, and the result is reject, the color image And means for retrying the reading of the binarized image from the means for generating the binarized image in which the pre-printed color is dropped out from Character reading system that.
[0079]
【The invention's effect】
As described above, according to the present invention, in a color form to be processed, pre-printing and entry characters are applied to a form image in which the color of pre-printing such as a frame line is different from the color of entry characters. Even if they are touched or overlapped, it is possible to erase the pre-printing and generate a binary image in which the entered characters remain, and to perform highly accurate character recognition.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration of a character reading system including an image processing apparatus according to an embodiment of the present invention.
FIG. 2 is a flowchart for explaining processing operations in an image processing unit and a character reading unit in the character reading system shown in FIG. 1;
FIG. 3 is a block diagram illustrating a configuration of a color identification dropout binarization unit.
FIG. 4 is a diagram for explaining the color printing processing operation for pre-printing / entry characters in step 206 in the flow described with reference to FIG. 2;
FIG. 5 is a diagram for explaining a character reading processing operation in a character reading unit;
FIG. 6 is a diagram illustrating an example of a color form that is a processing target.
FIG. 7 is a diagram illustrating an example of a binary image as a result obtained by performing image processing according to an embodiment of the present invention.
FIG. 8 is a diagram illustrating an area for identifying colors of pre-printing and entry characters in a field.
9 is a flowchart illustrating details of a processing operation in step 403 of pixel color determination described with reference to FIG. 4;
10 is a diagram illustrating a configuration of a primary color component selection unit in FIG. 3. FIG.
11 is a block diagram showing a configuration of a component value thickening / lightening unit in FIG. 3;
FIG. 12 is a diagram for explaining an example of a reference table for determining a thickening / lightening process used to determine a type of darkening / lightening process;
FIG. 13 is a flowchart illustrating a processing operation in a character reading system according to another embodiment of the present invention.
[Explanation of symbols]
100 color image scanner
101 Image processing unit
102 Color identification dropout binarization unit
103 Color identification part of pre-print / entry
104 Pre-print dropout binarization unit
105 3 primary color component minimum value binarization unit
106 Control unit
107 Character reader
108 Image storage unit
109 network
301 Pre-print color identification unit
302 Character color identification part
303 Color component selection determination unit
305 Primary color component selector
306 Concentration / lightening part of component value
307 binarization part
308 Pixel identification unit
1200 Thickening / lightening processing type decision unit
1201 Thickening multiplier
1202 Multiplier for lightening
1203 selection unit

Claims

Binarizing the minimum value of the three primary color components of the input form image;
Performing character recognition of the binarized image resulting from the binarization;
Outputting the recognized character when the recognized character is accepted, and
If the recognized character is rejected, and setting a pre-printed color of the inputted document image and serial and incoming text color, and binarized by dropout flop Les printing color of the form image 2 A program that causes an information device to execute a step of generating a value image, a step of performing character recognition of the binarized image binarized by dropout, and a step of outputting the recognized character .

2. The program according to claim 1 , further causing the information device to execute a step of reducing (darkening) or increasing (lightening) the gray value of the input color .

3. The program according to claim 1, wherein the pre-print color and the entry character color are set for each field of the form image .

The pre-print color or entry character color of the form is registered in advance as a form format for each field of the form image, and the pre-print color of the registered field for each field. The program according to claim 3 , wherein the program is executed by reading out the character color .

Means for binarizing the minimum value of the three primary color components of the input form image;
Means for performing character recognition of the binarized image resulting from the binarization;
Means for outputting the recognized character when the recognized character is accepted;
If the recognized character is rejected, and means for setting the pre-print color of the inputted document image and serial and incoming text color, and binarized by dropout flop Les printing color of the form image 2 An image processing apparatus comprising: means for generating a value image; means for performing character recognition of the binarized image binarized by dropout; and means for outputting the recognized character .

6. The image processing apparatus according to claim 5 , further comprising a component value thickening / lightening means for reducing (darkening) or increasing (lightening) the gray value of the input color .

7. The image processing apparatus according to claim 5, wherein the pre-print color and the text color are set for each field of the form image .