JP2004185183A

JP2004185183A - Image correction device, character recognition method, and image correction program

Info

Publication number: JP2004185183A
Application number: JP2002349551A
Authority: JP
Inventors: Tetsuji Makino; 哲司牧野
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2002-12-02
Filing date: 2002-12-02
Publication date: 2004-07-02
Anticipated expiration: 2022-12-02
Also published as: JP4222013B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image correction device and image correction program capable of realizing an improvement in recognition ratio on the basis of the distribution state of RGB data of a digital image. <P>SOLUTION: When digital image data for one frame are inputted from an image input part 17 (step P1), the inputted image data are transferred to a RAM, and developed in a bit map image (step P2). When the development to the RAM 14 is ended, values of R, G and B of a processing object are compared with the values of R, G and B of circumferential data according to an image correction processing program to determine whether the processing object is an object (background) of extinction processing or a character (foreground), and the object of extinction processing is corrected by light homogenization. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、デジタルカメラ等の撮像装置で撮影した画像の修正技術に関する。
【０００２】
【従来の技術】
デジタルカメラ等の電子撮像装置で書類など平面を撮影したデジタル画像データはフラットヘッドスキャナーで得たデジタル画像データと異なり、レンズの特性上周辺部に図７（ａ）に示すような暗い影７２、７３が発生する。これはレンズ周辺部を通過した光量が低下していくためで、撮影対象が文字の場合、この減光作用により認識困難になることが知られている。このような減光作用に対しては、レンズの特性のみで考える場合、レンズの形状に沿った同心円上に光量調整を行えば補正が可能となる。
【０００３】
例えば、撮影レンズの周辺減光に関するレンズ情報に基いて画像データの周辺減光を補正するようにした電子カメラ（特許文献１）、撮影レンズの特性を取得し、その特性を画像の特性に応じて展開し、この減光量を用いて画像の周辺減光を行うようにした画像処理装置（特許文献２）、撮影レンズの中心からの距離に基いて全画素に対して周辺画像の減光分を補正し、周辺部の画像の濃度に中心部の画像の濃度よりも重みをつけた状態の画像全体の濃度バランスに基いて前記減光分の補正の強度度合いを設定するようにした画像処理装置（特許文献３）、撮影時に写真フィルムに周辺減光の強さに対応する情報を記憶しておき、最盛時にこの情報を読み取って周辺減光補正を行う方法（特許文献４）がある。
【０００４】
【特許文献１】
特開２００２−１９０９７９号公報
【特許文献２】
特開２０００−４１１８３号公報
【特許文献３】
特開平１１−３５５５１１号公報
【特許文献４】
特開平１０−１３６６７号公報
【０００５】
【発明が解決しようとする課題】
しかしながら、上記特許文献１〜４に開示されているような従来の周辺減光対策はレンズ特性を重視して光量の増減を行うようにしているため、ムラが出る場合があり、また、外部からの光による光量分布の変化を吸収できないなどの課題があった。
【０００６】
本発明は、上記従来の周辺減光対策の課題を解消するためになされたものであり、デジタル画像のＲＧＢデ―タの分布状況を基に認識率の向上を実現し得る画像修正装置及び画像修正プログラムの提供を目的とする。特に、撮影した画像を文字認識する文字認識装置に適用して好適な画像修正装置及び画像修正プログラムの提供を目的とする。
【０００７】
【課題を解決するための手段】
上記課題を解決するために、第１の発明の画像修正装置は、デジタル画像データをメモリに取り込む画像データ取り込み手段と、メモリに取り込まれたデジタル画像データの全画素の中からＲ、Ｇ、Ｂの最大値を取得する最大値取得手段と、画像中の所定領域内の画素の色データのうち、最も高い頻度で出現する色データを取得する最多色データ取得手段と、所定領域内の処理対象画素の色データが最多色データ取得手段によって取得された色データと略同等か否かを判定する判定手段と、判定手段によって、処理対象画素の色データが最多色データ取得手段によって取得された色データと略同等とされた場合に、Ｒ、Ｇ、Ｂの最大値と処理対象画素の色データのＲ、Ｇ、Ｂの最大値との差を所定領域内の各画素のＲ、Ｇ、Ｂにそれぞれ同量ずつ加算する加算手段と、を備えたことを特徴とする。
【０００８】
また、第２の発明の画像修正装置は、デジタル画像データをメモリに取り込む画像データ取り込み手段と、画素の輝度を取得する輝度取得手段と、メモリに取り込まれたデジタル画像データの全画素の中から取得された輝度の最大値を取得する最大値取得手段と、画像中の所定領域内の画素の輝度データのうち、最も高い頻度で出現する輝度データを取得する最多輝度データ取得手段と、所定領域内の処理対象画素の輝度データが最多輝度データ取得手段によって取得された輝度データと略同等か否かを判定する判定手段と、判定手段によって、処理対象画素の輝度データが最多輝度データ取得手段によって取得された輝度データと略同等とされた場合に、輝度の最大値と処理対象画素の色データの輝度の最大値との差を所定領域内の各画素の輝度にそれぞれ同量ずつ加算する加算手段と、を備えたことを特徴とする。
【０００９】
また、第３の発明は上記第１又は第２の発明の画像修正装置において、更に、文字認識手段を備え、修正された画像データを用いて文字認識を行うことを特徴とする。
【００１０】
また、第４の発明は上記第１、第２の又は第３のいずれかの発明の画像修正装置において、所定領域は矩形であることを特徴とする。
【００１１】
また、第５の発明の文字認識方法は、デジタル画像データをメモリに取り込むステップと、メモリに取り込まれたデジタル画像データの全画素の中からＲ、Ｇ、Ｂの最大値を取得するステップと、画像中の所定領域内の画素の色データのうち、最も高い頻度で出現する色データを取得するステップと、所定領域内の処理対象画素の色データが最多色データ取得手段によって取得された色データと略同等か否かを判定するステップと、処理対象画素の色データが取得された色データと略同等とされた場合に、Ｒ、Ｇ、Ｂの最大値と処理対象画素の色データのＲ、Ｇ、Ｂの最大値との差を所定領域内の各画素のＲ、Ｇ、Ｂにそれぞれ同量ずつ加算するステップと、ステップにより得られた画像データを基に文字認識を行うステップと、を備えたことを特徴とする。
【００１２】
また、第６の発明の画像修正プログラムは、画像の周辺減光を修正する画像修正装置で実行されるプログラムであって、デジタル画像データを読み取るステップと、読み取られたデジタル画像データをメモリに取り込ませるステップと、メモリに取り込まれたデジタル画像データの全画素の中からＲ、Ｇ、Ｂのそれぞれ独立して取りうる最大値を取得させるステップと、画像中の矩形領域内の画素の色データのうち、最も高い頻度で出現する色データを取得させるステップと、矩形領域内の処理対象画素の色データが最多色データ取得手段によって取得された色データと略同等か否かを判定させるステップと、判定によって、処理対象画素の色データが最多色データ取得手段によって取得された色データと略同等であるとされた場合に、Ｒ、Ｇ、Ｂの最大値と処理対象画素の色データのＲ、Ｇ、Ｂの最大値との差を算出させるステップと、算出された差を矩形領域内の各画素のＲ、Ｇ、Ｂにそれぞれ同量ずつ加算させるステップと、を備えたことを特徴とする。
【００１３】
【発明の実施の形態】
図１は本発明に基く画像修正装置のハードウエア構成の一実施例を示すブロック図であり、画像修正装置１は装置全体の動作を制御するＣＰＵ１１と、ユーザがデータ入力や操作指示を行うキー入力部１２と、画像等を表示する表示部１３と、プログラムを一時駐在させると共に、入力されたデジタル画像データ（以下、単に画像データと記す）の一時記憶及び画像修正時の作業用領域として用いられるＲＡＭ１４と、磁気ディスク読取装置等の保存記憶部１５と保存記憶部１５に内蔵又は装着される記憶媒体１６と、画像データを入力する画像入力部１７とイメージプリンター等からなる印刷部１８とを備えている。
【００１４】
なお、画像修正装置１に図示のように画像（文字イメージ）から文字認識を行う文字認識処理部２２及び文字認識処理の際用いる辞書２３を付加することにより、画像修正機能を備えた文字認識装置を構成することができる。このようにした場合は画像修正機能付き文字認識装置は本発明に基く周辺減光対策を施した画像（文字イメージ）に対して文字認識を行うことができるのでデジタルカメラ等の撮像装置で撮影した文字イメージ（画像データ）を取り込んで減光対策等の画像修正処理により明瞭となった文字イメージを基に文字認識を行うことができるので高い認識率による文字認識が実現できる。
【００１５】
ＲＡＭ１４には入力バッファから転送される画像データを一時記憶する領域、矩形範囲中の最多色を調べるためのテーブル（図６）を一時記憶するためのテーブル領域、ＯＳや本発明に基く画像修正処理プログラム等のプログラムを駐在させる領域、定数や、計算値、取得値等を保持するための一時記憶領域等が割り付けられている。
【００１６】
保存記憶部１５には修正処理された画像データを保存記憶する画像記憶領域と、ＯＳや図３に示すような動作の実行指示を行う画像修正処理プログラム等のプログラム群を記憶するプログラム駐在領域が割り付けられている。
【００１７】
なお、画像修正処理プログラムは最大値ＭａｘＣｏｌｏｒを取得するためのサブプログラム１（図４）及び矩形範囲中に出現する最多色を取得するためのサブプログラム２（図５）を含み、これらのプログラムは画像修正装置１の初期設定時または任意の時点にインストールされプログラム記憶領域に記憶される。また、画像修正装置１が起動されると記憶媒体１６から読み出され、ＲＡＭ１４の実行プログラム駐在領域に駐在する。
【００１８】
画像入力部１７は、デジタルカメラ等の撮像装置から取り出されたメモリーカードに書き込まれている画像データを読み取るメモリーカードリーダやＣＤ−ＲＯＭリーダ等の画像記録媒体読取装置からなる。
【００１９】
なお、画像入力部１７に代えて、又は併設して、図示のようにデジタルカメラ１００等の撮像装置とケーブル接続又は赤外線等の近距離無線接続による画像データ入力を行う外部インターフェイス１９、又は有線又は無線通信網を介してデジタルカメラ２００等の撮像装置から送信される画像データを入力する通信インターフェイス２０を備えるようにしてもよい。
【００２０】
文字認識処理部２２は、例えば、画像イメージから１文字ずつ文字イメージを切り出して正規化し、特徴を抽出して辞書２３に登録されている文字の特徴と比較して文字認識を行う。
【００２１】
また、本発明の画像修正処理は実施例ではプログラム（画像修正プログラム）で構成され、ＣＰＵ１１は画像修正プログラム（図３〜図５）に従って画像修正処理を実行するが、専用の画像修正処理部２１を備えるようにしてもよい。
【００２２】
図２は本発明の画像修正装置の動作の概要を示すフローチャートであり、画像入力部１７（又は、外部インターフェイス１９、通信インターフェイス２０）から１フレーム分の画像データ（デジタルデータ）が入力されると（ステップＰ１）、ＣＰＵ１１は入力された画像データを入力バッファからＲＡＭ１４に転送しビットマップイメージで展開（一時記憶）する。ビットマップデータの一例としては、１つの画素を２４ビットで表現し、Ｒ、Ｇ、Ｂ成分をそれぞれ８ビットで表現している（ステップＰ２）。
【００２３】
ＲＡＭ１４への展開が終わると、ＣＰＵ１１は画像修正処理プログラムに従って図３のフローチャートに示すような動作で処理対象のＲ、Ｇ、Ｂの値と周囲のデータのＲ、Ｇ、Ｂの値を比較し、処理対象が減光処理の対象（背景）なのか文字（前景）なのかを判定し、減光処理の対象について光量均一化による補正を行う（ステップＰ３）。
なお、画像修正装置１が文字認識処理部２２を備えている場合には、更に、上記ステップＰ３で補正後の画像について文字認識処理を行なう（ステップＰ４）。
【００２４】
（画像修正処理動作）
図３は図２に示したステップＰ３での画像修正処理動作の一実施例を示すフローチャートである。
図３で、ＣＰＵ１１は、まず、サブプログラム１に従って図４のフローチャートに示すような動作で、ＲＡＭ１４に展開された入力画像の全データ領域｛（Ｘ−ｎ，Ｙ−ｍ）〜（Ｘ＋ｎ，Ｙ＋ｍ）｝をサーチし、それぞれ独立して取り得る最大値ＭａｘＣｏｌｏｒ（ＲｏｒＧｏｒＢ）を取得してＲＡＭ１４に保持（一時記憶）し、ステップＳ２に進む（ステップＳ１）。
【００２５】
次に、画像データの原点（Ｘ＝０、Ｙ＝０）を画像の左上隅に定め（ステップＳ２）、サブプログラム２に従って図５のフローチャートに示すような動作で、ＲＡＭ１４に展開された矩形範囲｛（Ｘ−ｎ，Ｙ−ｍ）〜（Ｘ＋ｎ，Ｙ＋ｍ）｝の中に出現する最多色ＡｖｅＣｏｌｏｒ（Ｒ、Ｇ、Ｂ）を取得してＲＡＭ１４に保持（一時記憶）し、ステップＳ４に進む（ステップＳ３）。なおここで矩形範囲を取るのがベストモードであるが楕円など矩形に限定されるものではない。
【００２６】
次に、上記ステップＳ３で取得した矩形範囲｛（Ｘ−ｎ，Ｙ−ｍ）〜（Ｘ＋ｎ，Ｙ＋ｍ）｝の中のうち最多色の色データ（Ｒ、Ｇ、Ｂ）の値と処理対象の画素（座標（Ｘ、Ｙ））の色データ（Ｒ、Ｇ、Ｂ）の値との差の絶対値ａｂｓ（ＡｖｅＣｏｌｏｒ−（Ｘ、Ｙ））と閾値を比較し、差の絶対値が閾値より小さい場合は背景色と判定してステップＳ５に進み、そうでない場合は文字（線分）等のような前景色と判定してステップＳ８に進む（ステップＳ４）。
【００２７】
次に、画素（座標（Ｘ、Ｙ））のＲ、Ｇ、Ｂのうち最大値ＴｏｐＢｙｔｅを取得し（ステップＳ５）、ＭａｘＣｏｌｏｒとの差（Ｒ＋、Ｇ＋、Ｂ＋）を算出し、ステップＳ７に進む（ステップＳ６）。
算出された最大値との差分値、Ｒ＋、Ｇ＋、Ｂ＋を背景色である現在の処理対象の各画素のＲ、Ｇ、Ｂの値にそれぞれ同量ずつ加算してＲ、Ｇ、Ｂを補正する（ステップＳ７）。
【００２８】
次に、座標Ｘが全データ領域の幅の値を超えているか否かを調べ、超えている場合はステップＳ１０に進み、超えていない場合はステップＳ９に進む（ステップＳ８）。座標Ｘが全データ領域の横幅の値を超えていない場合は座標Ｘに１ドット幅の値を加えて次の矩形範囲に出現する最多色を取得するためにステップＳ３に進む（ステップＳ９）。
【００２９】
また、座標Ｘが全データ領域の横幅の値を超えている場合は、座標Ｙが全データ領域の縦幅の値を超えているか否かを調べ、超えている場合は全画像データ領域について補正処理を終了したものとして画像補正処理動作を終了し、超えていない場合はステップＳ１１に進む（ステップＳ１０）。座標Ｙが全データ領域の縦幅の値を超えていない場合は座標Ｙに１ドット幅の値を加えて次の矩形範囲に出現する最多色を取得するためにステップＳ３に進む（ステップＳ１１）。
【００３０】
また、画像修正装置１が文字認識処理部２２を備えている場合には、更に、文字認識処理部２２は上記ステップＳ１〜Ｓ１１で修正後の画像イメージから１文字ずつ文字イメージを切り出して正規化し、正規化後の文字イメージの特徴を抽出して辞書２３に登録されている文字の特徴と比較し文字認識を行い、認識結果を出力する（ステップＳ１２）。
【００３１】
上記図３のフローチャートのステップＳ４により、矩形範囲内の処理対象画素の色データ（Ｒ、Ｇ、Ｂ）が最多色ＡｖｅＣｏｌｏｒと略同等か否かを判定することができ、上記ステップＳ５により、処理対象画素の色データが最多色データと略同等とされた場合に、Ｒ、Ｇ、Ｂの最大値と処理対象画素の色データのＲ、Ｇ、Ｂの最大値との差を矩形範囲内の各画素のＲ、Ｇ、Ｂにそれぞれ同量ずつ加算することができるので、上記図３に示したフローチャートの動作により、処理対象の画素（座標（Ｘ、Ｙ））のＲ、Ｇ、Ｂの値と、その周囲の最多色のＲ、Ｇ、Ｂの値を比較することにより単に背景として補正処理してよいデータかそれとも文字などの前景データかを判定することができる。そして、ステップＳ７で画像データの光量を均一化して周辺減光等を排除するので、画像を鮮明にできる。
【００３２】
また、前景データについてはステップＳ８以降に示すように補正処理をバイパスするので、従来のように文字の欠損が発生せず図７（ｃ）に示すように文字等の情報が明瞭に表示され、見やすくなる。
【００３３】
また、上記ステップＳ１２により、デジタルカメラ等の撮像装置で撮影した文字イメージ（画像データ）を取り込んで減光対策等の画像修正処理により明瞭となった文字イメージを基に文字認識を行うことができるのでデジタルカメラ等で撮影した文字についても高い文字認識率が実現できる。
【００３４】
なお、上記ステップＳ２では画像データの原点（Ｘ＝０、Ｙ＝０）を画像の左上隅に定めるものとしたが、これに限定されない。左下隅を原点としてもよく、右上隅または右下隅を原点としてもよく、画像の中心を原点としてもよい。但し、原点の位置を左上隅以外とする場合はステップＳ９、Ｓ１１での加算値を原点の位置にしたがって変更する必要がある。
【００３５】
（サブプログラム１による動作）
図４は最大値を取得するためのサブプログラムによる動作例を示すフローチャートである。
まず、画像データの原点（Ｘ＝０、Ｙ＝０）を画像の左上隅に定め、ＲＡＭ１４の最大値ＭａｘＣｏｌｏｒ（Ｒ、Ｇ、Ｂ）エリアを０クリアし（ステップＳ１−１）、指定座標（Ｘ、Ｙ）の画素のＲ、Ｇ、Ｂの値をそれぞれ取得する。ここで、Ｒ＝Ｒｅｄ（Ｘ、Ｙ）、Ｇ＝Ｇｒｅｅｎ（Ｘ、Ｙ）、Ｂ＝Ｂｌｕｅ（Ｘ、Ｙ）である（ステップＳ１−２）。
【００３６】
次に、上記ステップＳ１−１で取得したＲ、Ｇ、Ｂの中の最大値ＭａｘＲＧＢを取得し（ステップＳ１−３）、ＭａｘＲＧＢとＭａｘＣｏｌｏｒとを比較し、ＭａｘＲＧＢ＜ＭａｘＣｏｌｏｒの場合にはステップＳ１−６に進み、そうでない場合にはステップＳ１−５に進む（ステップＳ１−４）。ＭａｘＲＧＢ＜ＭａｘＣｏｌｏｒでない場合はＭａｘＣｏｌｏｒにＭａｘＲＧＢを代入してステップＳ１−６に進む（ステップＳ１−５）。
【００３７】
次に、座標Ｘが全データ領域の横幅の値より小さいか否かを調べ、超えていない場合はステップＳ１−７に進み、超えている場合はステップＳ１−８に進む（ステップＳ１−６）。座標Ｘが全データ領域の横幅を超えていない場合は座標Ｘに１ビット幅の値を加えて次の矩形範囲の最多色を取得するためにステップＳ１−２に進む（ステップＳ１−７）。
【００３８】
座標Ｘが全データ領域の横幅の値を超えている場合は、座標Ｙが全データ領域の縦幅の値を超えているか否かを調べ、超えていない場合はステップＳ１−９に進み、超えている場合は全画像データ領域についてＭａｘＣｏｌｏｒ取得動作を終了したものとしてサブプログラムによる動作を終了して図３のフローチャートのステップＳ１に復帰する（ステップＳ１−８）。座標Ｙが全データ領域の縦幅の値を超えていない場合は座標Ｙに１ビット幅の値を加えてステップＳ１−２に進む（ステップＳ１−９）。
【００３９】
なお、上記ステップＳ１−１では画像データの原点（Ｘ＝０、Ｙ＝０）を画像の左上隅に定めるものとしたが、これに限定されない。左下隅を原点としてもよく、右上隅または右下隅を原点としてもよく、画像の中心を原点としてもよい。但し、原点の位置を左上隅以外とする場合はステップＳ１−７、Ｓ１−９での加算値を原点の位置にしたがって変更する必要がある。
【００４０】
（サブプログラム２による動作）
図５は矩形範囲中に出現する最多色を取得するためのサブプログラムによる動作例を示すフローチャートである。また、図６は処理対象データを中心とした任意の領域中（実施例では矩形範囲）で最も多く出現した色（最多色色データ）を調べるためのテーブルの構成例を示す図であり、符号６０−１、６０−２、・・はテーブル６０を構成するテーブルデータを示し、符号６１−１、６１−２、・・は各テーブルデータが示す色の出現頻度格納欄を示し、出現頻度格納欄６１−１、６１−２、・・・はカウンタとして機能する。また、各テーブルデータ６０−１、６０−２、・・のＲ、Ｇ、ＢはＲ、Ｇ、Ｂ値の格納欄を意味する。また、図５ではテーブル６０（図６）をＴａｂｌｅ［］として示している。テーブル６０はＲＡＭ１４のテーブル領域に生成される。なお、図６の構成ではテーブル６０は任意領域のサイズの４倍（出現頻度＋Ｒ、Ｇ、Ｂ）＋１（データ数）分の領域を必要とするが、テーブルは図６の構成に限定されない。
【００４１】
図３のフローチャートのステップＳ２からステップＳ３に進むとＣＰＵ１１はサブプログラム２に従い、図６に示すようなテーブル６０のテーブルデータ６０−１、６０−２、・・・を０クリアして初期化し（ステップ３−１）、指定座標（Ｘ、Ｙ）の画素のＲ、Ｇ、Ｂの値をそれぞれ取得する。ここで、Ｒ＝Ｒｅｄ（Ｘ、Ｙ）、Ｇ＝Ｇｒｅｅｎ（Ｘ、Ｙ）、Ｂ＝Ｂｌｕｅ（Ｘ、Ｙ）である（ステップＳ３−２）。
【００４２】
次に、テーブル６０のテーブルデータ６０−１、６０−２、・・・のうちすでに値が格納されているテーブルデータのＲ、Ｇ、Ｂ格納欄の値と閾値を比較し閾値以内のデータがあるか否かを調べ（ステップＳ３−３）、閾値以内のデータがない場合はステップＳ３−５に進み、ある場合はステップＳ３−６に進む（ステップＳ３−４）。
【００４３】
閾値以内のデータがない場合は、テーブル６０の空いたテーブルデータのＲ、Ｇ、Ｂ格納欄に上記ステップＳ３−２で取得した指定座標（Ｘ、Ｙ）の画素のＲ、Ｇ、Ｂの値をそれぞれ格納（＝記憶）し、ステップＳ３−７に進む（ステップＳ３−５）。
【００４４】
また、閾値以内のデータがある場合は存在した色のデータテーブルのカウンタ（出現頻度格納欄）に１を加えて更新し、ステップＳ３−７に進む（ステップＳ３−６）。
【００４５】
次に、矩形範囲内の画素について閾値以内のデータがあるか否かを調べ終わったか否かを調べ、調べ終わった場合にはステップＳ３−８に進み、そうでない場合はステップＳ３−２に戻る（ステップＳ３−７）。
【００４６】
次に、テーブル６０の各テーブルデータ６０−１、６０−２、・・・の出現頻度欄６１−１、６１−２、・・・をサーチして出現頻度が最大のテーブルデータが表す色（＝Ｒ、Ｇ、Ｂ）をＡｖｅＣｏｌｏｒ（Ｒ、Ｇ、Ｂ）にセットし、図３のステップＳ３に復帰する（ステップＳ３−８）。
【００４７】
図７はデジタル画像に発生した周辺減光や補正処理の結果等の説明図である。図７（ａ）は従来技術による周辺減光の説明図であり、背景色が同じ色である場合でも左下、右下に暗い領域７２、７３（斜線の部分）が背景色とは別の色で出現する。なお、正確に中心から同心円状に出現しないのは撮影時の外光が影響を及ぼすためである。また、図７（ｂ）は単純に同系色の輝度を上げたり、或いは置き換えを行った場合の例であり、文字７１が消え、符号７１’で表すように不明瞭なイメージとなる。
【００４８】
また、図７（ｃ）は本発明の画像修正処理後の画像であり、処理対象（文字７１）の画素の周囲の色データ（Ｒ、Ｇ、Ｂ）から処理対象の画素が背景なのか前景（文字）なのかを判断し、背景の場合に周囲の色データ（Ｒ、Ｇ、Ｂ）の分布状況を基に光量均一化を行ない、減光部の補正を行っているので、図示のように文字７１は消失せず明瞭なイメージとなる。
【００４９】
（変形例）
なお、上記実施例の説明では処理対象の画素（座標（Ｘ、Ｙ））のＲ、Ｇ、Ｂと、その周囲の最多色のＲ、Ｇ、Ｂを比較して単に背景として補正処理してよいデータかそれとも文字などの前景データかを判定するようにしたが、Ｒ、Ｇ、Ｂから輝度を導き出し、その輝度を比較に用いるようにしてもよい。この場合、図３のステップＳ１の前段にＲＡＭ１４に取り込まれた画像データの全画素のＲ、Ｇ、Ｂから輝度を導出するステップを設け、図３のフローチャートのステップＳ１以下、図４のフローチャートのステップＳ１−１以下、図５のフローチャートのステップＳ３−１以下及びそれらステップの説明でＲ、Ｇ、Ｂを輝度に、色データを輝度データに置き換えればよい。また、Ｒ、Ｇ、Ｂから輝度を導出する方法は公知の方法（変換式）を用いて行うことができる。
また、Ｒ、Ｇ、Ｂのデータに限定せず、映像信号ＹＵＶデータのＹデータを利用してもよい。
以上、本発明の一実施例及び変形例について説明したが本発明は上記実施例及び変形例に限定されるものではなく、種々の変形実施が可能であることはいうまでもない。
【００５０】
【発明の効果】
上記説明したように、第１の発明の画像修正装置によれば、処理対象の画素（座標（Ｘ、Ｙ））のＲ、Ｇ、Ｂの値と、その周囲の最多色のＲ、Ｇ、Ｂの値を比較することにより単に背景として補正処理してよいデータかそれとも文字などの前景データかを判定し、画像データの光量を均一化して周辺減光等を排除するので、補正後も、従来のように周辺に暗部等が出現せず、文字のような線画像も明瞭なイメージとなる。
【００５１】
また、第２の発明の画像修正装置ではＲ、Ｇ、Ｂに代えて輝度を用いるので、第１の発明と同様の効果を得ることができる。
【００５２】
また、第３の発明の画像修正装置では、修正した画像データを基に文字認識を行うので、より認識制度を上げることができる。
【００５３】
また、第４の発明の画像修正装置では、処理対象の所定の領域を矩形としたので、コンピュータのプログラムが組みやすく演算も速いという効果がある。
【００５４】
また、第５の発明の文字認識方法によれば、単に背景として補正処理してよいデータかそれとも文字などの前景データかを判定し、画像データの光量を均一化して周辺減光等を排除するので、補正後も、従来のように周辺に暗部等が出現せず、文字のような線画像も明瞭なイメージとなり、より文字認識率を向上させることができる。
【００５５】
また、第６の発明の画像修正プログラムによれば、プログラムにより第１の発明と同様の効果を得ることができるので、画像修正プログラムをインストールすれば市販されているパソコンを画像修正装置として用いることができるので、デジタルカメラ等の撮像装置のエンドユーザも容易に画像の減光補正を行うことができる。
【図面の簡単な説明】
【図１】本発明に基く画像修正装置のハードウエア構成の一実施例を示すブロック図である。
【図２】本発明の画像修正装置の動作の概要を示すフローチャートである。
【図３】画像修正処理動作の一実施例を示すフローチャートである。
【図４】最大値を取得するためのサブプログラムによる動作例を示すフローチャートである。
【図５】矩形範囲中に出現する最多色を取得するためのサブプログラムによる動作例を示すフローチャートである。
【図６】矩形範囲中に出現する最多色を調べるためのテーブルの構成例を示す図である。
【図７】デジタル画像に発生した周辺減光や補正処理の結果等の説明図である。
【符号の説明】
１１ＣＰＵ（画像データ取り込み手段、最大値取得手段、輝度取得手段、最多色データ取得手段、判定手段、加算手段、最多輝度データ取得手段）
１４ＲＡＭ（メモリ）
１７画像入力部（画像データ取り込み手段）
１９外部インターフェイス（画像データ取り込み手段）
２０通信インターフェイス（画像データ取り込み手段）
６０テーブル（最多色データ取得手段、最多輝度データ取得手段）[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a technique for correcting an image captured by an imaging device such as a digital camera.
[0002]
[Prior art]
Digital image data obtained by photographing a plane such as a document with an electronic imaging device such as a digital camera is different from digital image data obtained by a flat head scanner. Due to the characteristics of the lens, a dark shadow 72 as shown in FIG. 73 occurs. This is because the amount of light that has passed through the periphery of the lens decreases, and it is known that when the object to be photographed is a character, the dimming effect makes recognition difficult. When such a dimming effect is considered only by the characteristics of the lens, correction can be made by adjusting the light amount on a concentric circle along the shape of the lens.
[0003]
For example, an electronic camera (Patent Literature 1) configured to correct peripheral dimming of image data based on lens information related to peripheral dimming of a photographing lens, acquires characteristics of a photographing lens, and sets the characteristics according to the characteristics of an image An image processing apparatus (Patent Literature 2) configured to perform peripheral dimming of an image by using the reduced light amount, and to reduce the dimming amount of the peripheral image for all pixels based on the distance from the center of the photographing lens Image processing in which the density of the peripheral image is weighted more than the density of the central image, and the intensity of the correction for the dimming is set based on the density balance of the entire image. An apparatus (Patent Literature 3) has a method in which information corresponding to the intensity of the peripheral dimming is stored in a photographic film at the time of photographing, and the information is read at the time of the peak to perform the peripheral dimming correction (Patent Literature 4).
[0004]
[Patent Document 1]
JP-A-2002-190979
[Patent Document 2]
JP 2000-41183 A
[Patent Document 3]
JP-A-11-355511
[Patent Document 4]
JP-A-10-13667
[0005]
[Problems to be solved by the invention]
However, in the conventional peripheral dimming countermeasures as disclosed in Patent Documents 1 to 4, since the amount of light is increased or decreased with emphasis on lens characteristics, unevenness may occur, and external There is a problem that a change in the light amount distribution due to the light cannot be absorbed.
[0006]
SUMMARY OF THE INVENTION The present invention has been made in order to solve the above-described conventional problem of vignetting, and an image correcting apparatus and an image correcting apparatus capable of improving the recognition rate based on the distribution of RGB data of a digital image. The purpose is to provide a fix. In particular, it is an object of the present invention to provide an image correction device and an image correction program suitable for being applied to a character recognition device that recognizes a photographed image as characters.
[0007]
[Means for Solving the Problems]
In order to solve the above-mentioned problems, an image correction apparatus according to a first aspect of the present invention includes an image data capturing unit that captures digital image data into a memory, and R, G, and B among all pixels of the digital image data captured into the memory. A maximum value acquisition unit that acquires the maximum value of the pixel data, a most frequent color data acquisition unit that acquires the color data that appears with the highest frequency among the color data of the pixels in the predetermined region in the image, and a processing target within the predetermined region. Determining means for determining whether or not the color data of the pixel is substantially equal to the color data obtained by the most frequent data obtaining means; and determining the color data of the pixel to be processed by the most frequent data obtaining means. When the values are substantially equal to each other, the difference between the maximum value of R, G, and B and the maximum value of R, G, and B of the color data of the pixel to be processed is calculated as the R, G, and B of each pixel in the predetermined area. Same amount to each And adding adding means One, characterized by comprising a.
[0008]
Further, the image correction device of the second invention includes an image data acquisition unit for acquiring digital image data into a memory, a luminance acquisition unit for acquiring the luminance of a pixel, and A maximum value acquisition unit that acquires a maximum value of the acquired luminance; a maximum luminance data acquisition unit that acquires luminance data that appears most frequently among luminance data of pixels in a predetermined region in an image; A determination unit that determines whether or not the luminance data of the pixel to be processed is substantially equal to the luminance data acquired by the maximum luminance data acquisition unit, and a luminance data of the pixel to be processed is determined by the maximum luminance data acquisition unit. When the luminance data is substantially equal to the acquired luminance data, the difference between the maximum luminance value and the maximum luminance value of the color data of the processing target pixel is calculated for each pixel within the predetermined area. Characterized by comprising adding means for adding the same amount each luminance, the.
[0009]
According to a third aspect of the present invention, there is provided the image correcting apparatus according to the first or second aspect, further comprising a character recognizing means for performing character recognition using the corrected image data.
[0010]
According to a fourth aspect of the present invention, in the image correction device according to any one of the first, second, and third aspects, the predetermined area is rectangular.
[0011]
The character recognition method according to a fifth aspect of the present invention includes the steps of: loading digital image data into a memory; acquiring the maximum values of R, G, and B from all pixels of the digital image data loaded into the memory; Acquiring the color data that appears most frequently among the color data of the pixels in the predetermined area in the image; and obtaining the color data of the pixel to be processed in the predetermined area by the most frequent data acquisition means. Determining whether the color data of the pixel to be processed is substantially equal to the acquired color data, and determining the maximum value of R, G, B and the R of the color data of the pixel to be processed. , G, and B, and adding the same amount to each of R, G, and B of each pixel in the predetermined area, and performing character recognition based on the image data obtained by the steps. With a The features.
[0012]
An image correction program according to a sixth aspect of the present invention is a program executed by an image correction device that corrects peripheral dimming of an image, and includes a step of reading digital image data and a step of loading the read digital image data into a memory. And obtaining a maximum value that can be taken independently for each of R, G, and B from all the pixels of the digital image data taken into the memory. Acquiring the color data that appears most frequently, and determining whether the color data of the pixel to be processed in the rectangular area is substantially equivalent to the color data acquired by the most multicolor data acquisition means, If it is determined by the determination that the color data of the pixel to be processed is substantially equivalent to the color data acquired by the multi-color data acquisition unit, R, Calculating the difference between the maximum value of R, G, and B of the color data of the pixel to be processed and the maximum value of R, G, and B for each pixel in the rectangular area. Adding by the amount.
[0013]
BEST MODE FOR CARRYING OUT THE INVENTION
FIG. 1 is a block diagram showing an embodiment of a hardware configuration of an image correction apparatus according to the present invention. The image correction apparatus 1 includes a CPU 11 for controlling the operation of the entire apparatus, and a key for a user to input data or give an operation instruction. An input unit 12, a display unit 13 for displaying images and the like, and a program temporarily resident, used as a temporary storage for input digital image data (hereinafter simply referred to as image data) and as a work area at the time of image correction. A storage unit 15 such as a magnetic disk reader, a storage medium 16 built in or attached to the storage unit 15, an image input unit 17 for inputting image data, and a printing unit 18 such as an image printer. Have.
[0014]
As shown in the drawing, a character recognition processing unit 22 for performing character recognition from an image (character image) and a dictionary 23 used for character recognition processing are added to the image correction device 1 as shown in FIG. Can be configured. In this case, the character recognition device with the image correction function can perform character recognition on an image (character image) on which the peripheral dimming countermeasures based on the present invention are applied. Character recognition can be performed based on a character image that has been clarified by image correction processing such as light reduction measures by taking in a character image (image data), so that character recognition with a high recognition rate can be realized.
[0015]
An area for temporarily storing image data transferred from the input buffer, a table area for temporarily storing a table (FIG. 6) for checking the most frequent colors in the rectangular range, an OS and image correction processing based on the present invention An area where a program such as a program resides, a constant storage area for holding constants, calculated values, acquired values, and the like are allocated.
[0016]
The storage unit 15 includes an image storage area for storing and storing the corrected image data, and a program resident area for storing a group of programs such as an OS and an image correction processing program for instructing execution of the operation as shown in FIG. Assigned.
[0017]
The image correction processing program includes a subprogram 1 (FIG. 4) for acquiring the maximum value MaxColor and a subprogram 2 (FIG. 5) for acquiring the most frequent colors appearing in the rectangular range. It is installed at the time of initial setting of the image correction device 1 or at an arbitrary time and stored in the program storage area. Further, when the image correction apparatus 1 is started, it is read from the storage medium 16 and resides in the execution program residence area of the RAM 14.
[0018]
The image input unit 17 includes an image recording medium reading device such as a memory card reader or a CD-ROM reader that reads image data written on a memory card taken out of an imaging device such as a digital camera.
[0019]
Instead of or in addition to the image input unit 17, an external interface 19 for inputting image data by a cable connection or a short-range wireless connection such as infrared light with an imaging device such as the digital camera 100 as shown in the drawing, or a wired or A communication interface 20 for inputting image data transmitted from an imaging device such as the digital camera 200 via a wireless communication network may be provided.
[0020]
For example, the character recognition processing unit 22 performs character recognition by extracting a character image one by one from an image image, normalizing the character image, extracting a feature, and comparing the extracted feature with a feature of a character registered in the dictionary 23.
[0021]
The image correction processing of the present invention is configured by a program (image correction program) in the embodiment, and the CPU 11 executes the image correction processing according to the image correction program (FIGS. 3 to 5). May be provided.
[0022]
FIG. 2 is a flowchart showing the outline of the operation of the image correction apparatus of the present invention. When one frame of image data (digital data) is input from the image input unit 17 (or the external interface 19 or the communication interface 20). (Step P1), the CPU 11 transfers the input image data from the input buffer to the RAM 14 and develops (temporarily stores) it as a bitmap image. As an example of bitmap data, one pixel is represented by 24 bits, and each of the R, G, and B components is represented by 8 bits (step P2).
[0023]
When the development to the RAM 14 is completed, the CPU 11 compares the values of R, G, and B to be processed with the values of R, G, and B of the surrounding data by the operation shown in the flowchart of FIG. Then, it is determined whether the processing target is the target (background) or the character (foreground) of the dimming process, and the target of the dimming process is corrected by equalizing the amount of light (step P3).
If the image correction device 1 includes the character recognition processing section 22, the image correction device 1 further performs a character recognition process on the image corrected in step P3 (step P4).
[0024]
(Image correction processing operation)
FIG. 3 is a flowchart showing one embodiment of the image correction processing operation in step P3 shown in FIG.
In FIG. 3, the CPU 11 first operates in the manner shown in the flowchart of FIG. 4 according to the sub-program 1 to store all the data areas 入力 (Xn, Ym) to (X + n, Y + m) of the input image developed in the RAM 14. ) Is searched for, and the maximum value MaxColor (R or G or B) that can be independently obtained is acquired and stored (temporarily stored) in the RAM 14, and the process proceeds to step S2 (step S1).
[0025]
Next, the origin (X = 0, Y = 0) of the image data is determined at the upper left corner of the image (step S2), and the rectangular area developed on the RAM 14 by the operation shown in the flowchart of FIG. The most multicolor AveColor (R, G, B) appearing in {(Xn, Ym) to (X + n, Y + m)} is acquired and stored (temporarily stored) in the RAM 14, and the process proceeds to step S4 ( Step S3). Note that the best mode is to take a rectangular range here, but it is not limited to a rectangle such as an ellipse.
[0026]
Next, the value of the color data (R, G, B) of the most frequent color in the rectangular range {(Xn, Ym) to (X + n, Y + m)} acquired in step S3 and the processing target The absolute value abs (AveColor− (X, Y)) of the difference between the value of the color data (R, G, B) of the pixel (coordinates (X, Y)) is compared with the threshold, and the absolute value of the difference is larger than the threshold. If it is smaller, it is determined to be the background color, and the process proceeds to step S5; otherwise, it is determined to be a foreground color such as a character (line segment) and the process proceeds to step S8 (step S4).
[0027]
Next, the maximum value TopByte among the R, G, and B of the pixels (coordinates (X, Y)) is obtained (step S5), and the difference (R +, G +, B +) from MaxColor is calculated, and the process proceeds to step S7. (Step S6).
R, G, and B are corrected by adding the calculated difference values, R +, G +, and B +, to the R, G, and B values of each pixel to be processed, which is the background color, by the same amount. (Step S7).
[0028]
Next, it is determined whether or not the coordinate X exceeds the value of the width of the entire data area. If it does, the process proceeds to step S10, and if not, the process proceeds to step S9 (step S8). If the coordinate X does not exceed the value of the width of the entire data area, the process proceeds to step S3 in order to add the value of one dot width to the coordinate X and obtain the most frequent color appearing in the next rectangular range (step S9).
[0029]
When the coordinate X exceeds the value of the horizontal width of the entire data area, it is checked whether the coordinate Y exceeds the value of the vertical width of the entire data area. The image correction processing operation is terminated assuming that the processing has been completed, and if not exceeded, the process proceeds to step S11 (step S10). If the coordinate Y does not exceed the value of the vertical width of the entire data area, the process proceeds to step S3 to add the value of one dot width to the coordinate Y and obtain the most frequent color appearing in the next rectangular range (step S11). .
[0030]
When the image correcting apparatus 1 includes the character recognition processing unit 22, the character recognition processing unit 22 further cuts out the character image one by one from the corrected image image in steps S1 to S11 and normalizes the character image. Then, the feature of the character image after the normalization is extracted, compared with the feature of the character registered in the dictionary 23, character recognition is performed, and a recognition result is output (step S12).
[0031]
By the step S4 in the flowchart of FIG. 3, it can be determined whether or not the color data (R, G, B) of the pixel to be processed within the rectangular range is substantially equal to the most multicolor AveColor. When the color data of the target pixel is substantially equivalent to the maximum color data, the difference between the maximum value of R, G, B and the maximum value of R, G, B of the color data of the processing target pixel is determined within the rectangular range. Since the same amount can be added to each of R, G, and B of each pixel, the operation of the flowchart (FIG. 3) of the R, G, and B of the pixel to be processed (coordinates (X, Y)) can be performed. By comparing the value with the R, G, and B values of the most surrounding colors, it is possible to determine whether the data can be simply corrected as the background or the foreground data such as characters. Then, in step S7, the light amount of the image data is made uniform to eliminate peripheral dimming, so that the image can be sharpened.
[0032]
In addition, since the correction process is bypassed for the foreground data as shown in step S8 and thereafter, information such as characters is clearly displayed as shown in FIG. It will be easier to see.
[0033]
In addition, in step S12, a character image (image data) captured by an imaging device such as a digital camera can be captured and character recognition can be performed based on the character image that has been clarified by image correction processing such as dimming measures. Therefore, a high character recognition rate can be realized for characters photographed by a digital camera or the like.
[0034]
In step S2, the origin (X = 0, Y = 0) of the image data is set at the upper left corner of the image. However, the present invention is not limited to this. The origin may be the lower left corner, the upper right corner or the lower right corner may be the origin, or the center of the image may be the origin. However, when the position of the origin is not at the upper left corner, it is necessary to change the added value in steps S9 and S11 according to the position of the origin.
[0035]
(Operation by subprogram 1)
FIG. 4 is a flowchart showing an operation example of a subprogram for acquiring the maximum value.
First, the origin (X = 0, Y = 0) of the image data is set at the upper left corner of the image, the maximum value MaxColor (R, G, B) area of the RAM 14 is cleared to 0 (step S1-1), and the designated coordinates ( The values of R, G, and B of the pixel of (X, Y) are obtained, respectively. Here, R = Red (X, Y), G = Green (X, Y), and B = Blue (X, Y) (step S1-2).
[0036]
Next, the maximum value MaxRGB of the R, G, and B obtained in step S1-1 is obtained (step S1-3), and MaxRGB is compared with MaxColor. If MaxRGB <MaxColor, step S1- 6; otherwise, the process proceeds to step S1-5 (step S1-4). If MaxRGB <MaxColor is not satisfied, MaxRGB is substituted for MaxColor, and the process proceeds to step S1-6 (step S1-5).
[0037]
Next, it is checked whether or not the coordinate X is smaller than the value of the width of the entire data area. If not, the process proceeds to step S1-7. If not, the process proceeds to step S1-8 (step S1-6). . If the coordinate X does not exceed the horizontal width of the entire data area, the process proceeds to step S1-2 to add the value of one bit width to the coordinate X and acquire the most frequent color in the next rectangular range (step S1-7).
[0038]
If the coordinate X exceeds the value of the horizontal width of the entire data area, it is checked whether or not the coordinate Y exceeds the value of the vertical width of the entire data area. If the coordinate Y does not exceed the value, the process proceeds to step S1-9. If so, it is determined that the MaxColor acquisition operation has been completed for all image data areas, the operation by the subprogram ends, and the process returns to step S1 in the flowchart of FIG. 3 (step S1-8). If the coordinate Y does not exceed the value of the vertical width of the entire data area, a value of one bit width is added to the coordinate Y, and the process proceeds to step S1-2 (step S1-9).
[0039]
In the above step S1-1, the origin (X = 0, Y = 0) of the image data is set at the upper left corner of the image. However, the present invention is not limited to this. The origin may be the lower left corner, the upper right corner or the lower right corner may be the origin, or the center of the image may be the origin. However, when the position of the origin is other than the upper left corner, it is necessary to change the added value in steps S1-7 and S1-9 according to the position of the origin.
[0040]
(Operation by subprogram 2)
FIG. 5 is a flowchart showing an operation example of a subprogram for acquiring the most frequent colors appearing in a rectangular range. FIG. 6 is a diagram showing a configuration example of a table for examining the most frequently occurring color (most multicolor data) in an arbitrary area (rectangular range in the embodiment) centered on the processing target data. , 60-2,... Indicate table data constituting the table 60, reference numerals 61-1, 61-2,... Indicate the appearance frequency storage columns of the colors indicated by the respective table data, and the appearance frequency storage columns. .. Function as counters. .., R, G, and B in the table data 60-1, 60-2,. In FIG. 5, the table 60 (FIG. 6) is shown as Table []. The table 60 is generated in a table area of the RAM 14. In the configuration of FIG. 6, the table 60 requires an area of four times the size of the arbitrary area (appearance frequency + R, G, B) +1 (the number of data), but the table is not limited to the configuration of FIG.
[0041]
When the CPU 11 proceeds from step S2 to step S3 in the flowchart of FIG. 3, the CPU 11 clears the table data 60-1, 60-2,... Of the table 60 as shown in FIG. Step 3-1), R, G, and B values of the pixel at the designated coordinates (X, Y) are obtained. Here, R = Red (X, Y), G = Green (X, Y), and B = Blue (X, Y) (step S3-2).
[0042]
Next, among the table data 60-1, 60-2,... Of the table 60, the values in the R, G, and B storage columns of the table data in which the values have already been stored are compared with the threshold value. It is checked whether there is any data (step S3-3). If there is no data within the threshold, the process proceeds to step S3-5, and if there is, the process proceeds to step S3-6 (step S3-4).
[0043]
If there is no data within the threshold value, the R, G, and B values of the pixel at the designated coordinates (X, Y) obtained in step S3-2 are stored in the R, G, and B storage columns of the vacant table data in the table 60. Are stored (= stored), and the process proceeds to step S3-7 (step S3-5).
[0044]
If there is data within the threshold value, 1 is added to the counter of the data table of the existing color (appearance frequency storage column) to update, and the process proceeds to step S3-7 (step S3-6).
[0045]
Next, it is checked whether or not the data within the rectangular range has been checked for any data within the threshold. If the check has been completed, the process proceeds to step S3-8; otherwise, the process returns to step S3-2. (Step S3-7).
[0046]
Next, the appearance frequency columns 61-1, 61-2,... Of the table data 60-1, 60-2,. = R, G, B) is set in AveColor (R, G, B), and the process returns to step S3 in FIG. 3 (step S3-8).
[0047]
FIG. 7 is an explanatory diagram of peripheral dimming that has occurred in a digital image, the result of correction processing, and the like. FIG. 7A is an explanatory diagram of the peripheral dimming according to the related art. Even when the background color is the same color, dark regions 72 and 73 (shaded portions) at the lower left and lower right are different from the background color. Appears in The reason why the light does not appear concentrically from the center is because external light at the time of photographing has an influence. FIG. 7B shows an example in which the brightness of a similar color is simply increased or replaced, and the character 71 disappears and becomes an unclear image as represented by a reference numeral 71 ′.
[0048]
FIG. 7C shows an image after the image correction processing of the present invention. The color data (R, G, B) around the pixel of the processing target (character 71) determines whether the processing target pixel is a background or a foreground. (Character), and in the case of the background, the light amount is made uniform based on the distribution state of the surrounding color data (R, G, B) to correct the dimming part. The character 71 does not disappear and becomes a clear image.
[0049]
(Modification)
In the description of the above embodiment, R, G, and B of a pixel to be processed (coordinates (X, Y)) are compared with R, G, and B of the largest number of surrounding pixels, and correction processing is simply performed as a background. Although it is determined whether the data is good data or foreground data such as characters, the luminance may be derived from R, G, and B, and the luminance may be used for comparison. In this case, a step of deriving the luminance from R, G, and B of all the pixels of the image data taken into the RAM 14 is provided at a stage preceding the step S1 of FIG. 3, and steps S1 and subsequent steps of the flowchart of FIG. In steps S1-1 and subsequent steps, steps S3-1 and subsequent steps in the flowchart of FIG. 5 and the description of those steps, R, G, and B may be replaced with luminance, and the color data may be replaced with luminance data. The method of deriving the luminance from R, G, and B can be performed using a known method (conversion formula).
In addition, Y data of video signal YUV data may be used without being limited to R, G, and B data.
As mentioned above, although one Example and a modification were described, the present invention is not limited to the above-mentioned Example and a modification, and it cannot be overemphasized that various modifications are possible.
[0050]
【The invention's effect】
As described above, according to the image correction apparatus of the first invention, the R, G, and B values of the pixel to be processed (coordinates (X, Y)) and the R, G, and By comparing the value of B, it is determined whether the data can be simply corrected as the background or the foreground data such as characters, and the amount of image data is made uniform to eliminate peripheral dimming and the like. As in the conventional case, no dark portion or the like appears in the periphery, and a line image such as a character becomes a clear image.
[0051]
Further, in the image correcting apparatus of the second invention, since the luminance is used instead of R, G, and B, the same effect as that of the first invention can be obtained.
[0052]
Further, in the image correcting apparatus according to the third aspect of the invention, character recognition is performed based on the corrected image data, so that the recognition system can be further improved.
[0053]
Further, in the image correcting apparatus according to the fourth aspect of the present invention, since the predetermined area to be processed is rectangular, there is an effect that a computer program can be easily assembled and the calculation can be performed quickly.
[0054]
Further, according to the character recognition method of the fifth invention, it is determined whether the data can be simply corrected as the background or the foreground data such as a character, and the light amount of the image data is made uniform to eliminate peripheral dimming and the like. Therefore, even after the correction, a dark portion or the like does not appear in the periphery as in the related art, and a line image such as a character becomes a clear image, and the character recognition rate can be further improved.
[0055]
According to the image correction program of the sixth invention, the same effects as those of the first invention can be obtained by the program. Therefore, if the image correction program is installed, a commercially available personal computer can be used as the image correction device. Therefore, the end user of the imaging device such as a digital camera can easily perform the light reduction correction of the image.
[Brief description of the drawings]
FIG. 1 is a block diagram showing one embodiment of a hardware configuration of an image correction device based on the present invention.
FIG. 2 is a flowchart showing an outline of the operation of the image correction device of the present invention.
FIG. 3 is a flowchart illustrating an embodiment of an image correction processing operation.
FIG. 4 is a flowchart showing an operation example of a subprogram for acquiring a maximum value.
FIG. 5 is a flowchart illustrating an operation example of a subprogram for acquiring the most frequent colors appearing in a rectangular range.
FIG. 6 is a diagram showing a configuration example of a table for checking the most frequent colors appearing in a rectangular range.
FIG. 7 is an explanatory diagram of peripheral dimming occurring in a digital image, a result of a correction process, and the like.
[Explanation of symbols]
11 CPU (image data acquisition means, maximum value acquisition means, luminance acquisition means, most-color data acquisition means, determination means, addition means, most-luminance data acquisition means)
14 RAM (memory)
17 Image input unit (image data capturing means)
19 External interface (image data capturing means)
20 Communication interface (image data capturing means)
60 tables (most color data acquisition means, most luminance data acquisition means)

Claims

Image data capturing means for capturing digital image data into a memory;
Maximum value acquisition means for acquiring the maximum value of R, G, B from among all the pixels of the digital image data taken into the memory;
Among the color data of the pixels in the predetermined area in the image, the most frequent data acquisition means for acquiring the color data that appears most frequently,
Determining means for determining whether or not the color data of the pixel to be processed in the predetermined area is substantially equivalent to the color data obtained by the most multicolor data obtaining means;
When the color data of the pixel to be processed is substantially equivalent to the color data obtained by the multicolor data obtaining unit, the maximum value of the R, G, and B and the color of the pixel to be processed are determined by the determination unit. Adding means for adding the same amount of difference between the maximum value of R, G, and B to the R, G, and B of each pixel in the predetermined area;
An image correction device comprising:

Image data capturing means for capturing digital image data into a memory;
Brightness acquisition means for acquiring the brightness of the pixel;
Maximum value acquisition means for acquiring the maximum value of the acquired luminance from among all the pixels of the digital image data taken into the memory,
Among the luminance data of the pixels in the predetermined area in the image, the most frequent luminance data acquisition means for acquiring the luminance data that appears most frequently,
Determining means for determining whether the luminance data of the processing target pixel in the predetermined area is substantially equivalent to the luminance data acquired by the maximum luminance data acquiring means,
When the luminance data of the pixel to be processed is substantially equivalent to the luminance data acquired by the maximum luminance data acquiring unit, the maximum value of the luminance and the luminance of the color data of the pixel to be processed are determined by the determination unit. Addition means for adding the same amount to the difference between the maximum value and the luminance of each pixel in the predetermined area,
An image correction device comprising:

3. The apparatus according to claim 1, further comprising a character recognizing unit, wherein character recognition is performed using the corrected image data.

The apparatus according to claim 1, wherein the predetermined area is a rectangle.

Capturing digital image data into a memory;
Obtaining the maximum value of R, G, B from among all the pixels of the digital image data taken into the memory;
Obtaining color data that appears most frequently among the color data of the pixels in the predetermined area in the image;
Determining whether the color data of the pixel to be processed in the predetermined area is substantially equivalent to the color data acquired by the multicolor data acquisition means,
When the color data of the pixel to be processed is substantially equivalent to the acquired color data, the maximum value of the R, G, B and the maximum value of R, G, B of the color data of the pixel to be processed are calculated. Adding the same difference to the R, G, and B of each pixel in the predetermined area by the same amount,
Performing character recognition based on the image data obtained in the step,
A character recognition method comprising:

A program executed by an image correction device that corrects peripheral dimming of an image,
Reading digital image data, loading the read digital image data into a memory,
Acquiring a maximum value that can be taken independently of each of R, G, and B from among all the pixels of the digital image data taken into the memory;
Acquiring color data that appears most frequently among the color data of the pixels in the rectangular area in the image;
Determining whether or not the color data of the processing target pixel in the rectangular area is substantially equivalent to the color data acquired by the multicolor data acquisition unit;
When the color data of the pixel to be processed is determined to be substantially equivalent to the color data acquired by the multi-color data acquisition unit, the maximum value of the R, G, and B and the pixel of the pixel to be processed are determined. Calculating a difference between the maximum value of R, G, and B of the color data;
Adding the calculated difference to the R, G, and B of each pixel in the rectangular area by the same amount,
An image correction program characterized by comprising: