JP2004062459A - Image processor, image processing method, image processing program, and computer readable recording medium recorded with the program

Info

Publication number: JP2004062459A
Application number: JP2002218851A
Authority: JP
Inventors: Shinobu Yamamoto; 山本　忍
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2002-07-26
Filing date: 2002-07-26
Publication date: 2004-02-26

Abstract

PROBLEM TO BE SOLVED: To reduce errors in identifying photograph areas in a document image.

SOLUTION: A preprocessing means 202 reduces the number of colors in an input image to a prescribed number. A background color extraction means 203 specifies the background color of the document image, and a partial area extraction means 204 extracts partial areas in which pixels of colors different from the background color are connected. A pixel color number extraction means 205 obtains the number of pixel colors in each partial area, and a photograph area judgement means 206 compares that number with a predetermined value to identify whether or not the partial area is a photograph area. Photograph areas are thus accurately identified in a document image input as a multi-level image.

COPYRIGHT: (C)2004, JPO

Description

【０００１】
【発明の属する技術分野】
この発明は、文字、写真、図、表等が混在する文書画像から、特に写真の領域を識別する画像処理装置、画像処理方法、画像処理プログラム及びそのプログラムを記録したコンピュータ読み取り可能な記録媒体に関する。
【０００２】
【従来の技術】
近年、カラーやグレースケールといった多値画像を入出力できるスキャナ、デジタルカメラ、プリンタ等の機器の普及や、記録媒体の低価格化により、画像を多値で保存、処理、出力する用途が増加している。文書画像においても多値画像で扱うことが増えてきている。従来から主に二値の文書画像に対して実行されてきた、領域識別、タイトル文字列の抽出、表処理、ＯＣＲ等の処理においても、多値の画像に対して実行したい要望が出てきている。
【０００３】
二値の文書画像に対しては、既にさまざまな方法が存在するため、多値の文書画像を上手に二値化すれば、二値画像に対する既存の処理が適用可能と考えられていた。図６は、文書内容を示す図である。文書画像６００上には文字領域６０１，表領域（不図示），図領域６０２，写真領域６０３等、異なった属性をもつ領域が混在している。これらのうち、文字領域６０１と、表領域と、図領域６０２においては、それらを記述するのに用いられている色は少数であることが多い。一つの文字そのものが多色で記述されていたり、表の一つのセルが多色であったりすることは稀である。
【０００４】
これらを前提とすれば、例えば、特開２０００−０４０１５３号公報に開示されている画像処理方法のように、多値の文書画像に対して、部分的に適切なしきい値を見つけて二値化すれば、二値画像（文字領域６０１，表領域，図領域６０２等）に対する既存の処理は有効となる。しかし、写真領域６０３においては、写真の内容によるが画素の色の種類が二値画像に比してはるかに多くなる。この写真領域６０３を他の領域と同様にしきい値処理で二値化してしまうと、写真の内容が判別不能になるだけでなく、二値化した後の画像を処理する中で、写真領域６０３を誤って他の領域に分類してしまうことがあった。
【０００５】
多値の文書画像において、予め写真領域６０３だけを識別しておき、写真領域６０３に関して二値化とそれに続く処理の適用を除外しておけば、写真領域６０３を二値化したことによる誤りを防ぐことができるため、多値の文書画像において写真領域６０３を抽出することが必要である。
【０００６】
このような、多値画像に対して写真等の部分領域を分類する方法として、特開２００１−０８６３２８号公報に開示された画像処理方法がある。これは、二値化のしきい値を複数用意し、二値化後の複数の画像に対して矩形情報を抽出し、複数の画像の矩形情報を比較して、予め定めてあった条件をもとに部分領域を分類するものである。
【０００７】
【発明が解決しようとする課題】
しかし、この方法では、複数の二値画像において、矩形情報の位置や大きさのずれが顕著な部分領域を写真領域としている。このため、中間色の文字領域では矩形情報のずれが大きくなるため誤って写真領域と分類されたり、輝度が偏った写真領域では矩形情報がある程度安定するため誤って文字領域と分類されることがあった。また、複数の二値画像を作成するための最適なしきい値と、その数が不定であることから、適切に分類が可能となる矩形情報を抽出できるかどうかは、画像の内容に大きく影響を受けてしまうという欠点があった。
【０００８】
この発明は、上述した従来技術による問題点を解消するため、文書画像上での写真領域を誤りなく識別できる画像処理装置、画像処理方法、画像処理プログラム及びそのプログラムを記録したコンピュータ読み取り可能な記録媒体を提供することを目的とする。
【０００９】
【課題を解決するための手段】
上述した課題を解決し、目的を達成するため、請求項１の発明に係る画像処理装置は、多値画像として入力された文書画像から写真領域を識別する機能を有した画像処理装置において、前記文書画像の背景色を特定する背景色抽出手段と、前記特定した背景色と異なる色の画素が連結した部分領域を抽出する部分領域抽出手段と、前記部分領域の画素色の数を求める画素色数抽出手段と、前記画素色の数と、予め定められた所定の値とを比較して、前記部分領域が写真領域であるか否かを識別する写真領域判定手段と、前記各手段を制御する制御手段とを有することを特徴とする。
【００１０】
この請求項１の発明によれば、多値で入力された文書画像に対し、部分領域の画素色の数に基づき写真領域を判定することができ、多値画像の写真領域を簡単かつ高精度に誤りなく識別することができる。
【００１１】
また、請求項２の発明に係る画像処理装置は、請求項１に記載の発明において、前記画素色数抽出手段は、色空間において距離が所定の値未満のとき、同色とみなす処理を行うことを特徴とする。
【００１２】
この請求項２の発明によれば、色空間上の距離が所定の値未満であるような色同士を同色とみなすことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができ、処理効率の向上を図れるようになる。
【００１３】
また、請求項３の発明に係る画像処理装置は、請求項１に記載の発明において、入力される前記文書画像の画素色を、所定の数に減色する前処理手段を備えたことを特徴とする。
【００１４】
この請求項３の発明によれば、背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるようになる。
【００１５】
また、請求項４の発明に係る画像処理装置は、請求項１〜３のいずれか一つに記載の発明において、前記制御手段は、前記写真領域判定手段により写真領域と識別された部分領域を入力画像とみなして各手段による処理を実行させ、再度写真領域を識別することを特徴とする。
【００１６】
この請求項４の発明によれば、再帰的に写真領域を識別することによって、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるようになる。
【００１７】
また、請求項５の発明に係る画像処理方法は、多値画像として入力された文書画像から写真領域を識別する画像処理方法において、前記文書画像の背景色を特定する背景色抽出工程と、前記特定した背景色と異なる色の画素が連結した部分領域を抽出する部分領域抽出工程と、前記部分領域の画素色の数を求める画素色数抽出工程と、前記画素色の数と、予め定められた所定の値とを比較して、前記部分領域が写真領域であるか否かを識別する写真領域判定工程とを含むことを特徴とする。
【００１８】
この請求項５の発明によれば、多値で入力された文書画像に対し、部分領域の画素色の数に基づき写真領域を判定することができ、多値画像の写真領域を簡単かつ高精度に誤りなく識別することができる。
【００１９】
また、請求項６の発明に係る画像処理方法は、請求項５に記載の発明において、前記画素色数抽出工程は、色空間において距離が所定の値未満のとき、同色とみなす処理を行うことを特徴とする。
【００２０】
この請求項６の発明によれば、色空間上の距離が所定の値未満であるような色同士を同色とみなすことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができ、処理効率の向上を図れるようになる。
【００２１】
また、請求項７の発明に係る画像処理方法は、請求項５に記載の発明において、入力される前記文書画像の画素色を、所定の数に減色して前記背景色抽出工程に渡す前処理工程を含むことを特徴とする。
【００２２】
この請求項７の発明によれば、背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるようになる。
【００２３】
また、請求項８の発明に係る画像処理方法は、請求項５〜７のいずれか一つに記載の発明において、前記写真領域判定工程により写真領域と識別された後の部分領域を入力画像とみなし、再度前記各工程の処理を実行して、前記写真領域を再識別することを特徴とする。
【００２４】
この請求項８の発明によれば、再帰的に写真領域を識別することによって、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるようになる。
【００２５】
また、請求項９の発明に係る画像処理プログラムは、多値画像として入力された文書画像から写真領域を識別する画像処理プログラムであって、該画像処理プログラムはコンピュータに、前記文書画像の背景色を特定させ、前記特定した背景色と異なる色の画素が連結した部分領域を抽出させ、前記部分領域の画素色の数を求めさせ、前記画素色の数と、予め定められた所定の値とを比較して、前記部分領域が写真領域であるか否かを識別させることを特徴とする。
【００２６】
この請求項９の発明によれば、コンピュータ処理時に多値で入力された文書画像に対し、部分領域の画素色の数に基づき写真領域を判定することができ、多値画像の写真領域を簡単かつ高精度に誤りなく識別することができる。
【００２７】
また、請求項１０の発明に係る画像処理プログラムは、請求項９に記載の発明において、前記画素色数を求める際に、色空間において距離が所定の値未満のとき、同色とみなす処理を行わせることを特徴とする。
【００２８】
この請求項１０の発明によれば、コンピュータ処理時に色空間上の距離が所定の値未満であるような色同士を同色とみなすことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができ、処理効率の向上を図れるようになる。
【００２９】
また、請求項１１の発明に係る画像処理プログラムは、請求項９に記載の発明において、入力される前記文書画像の画素色を、所定の数に減色してから前記背景色の抽出を行わせることを特徴とする。
【００３０】
この請求項１１の発明によれば、コンピュータ処理時に背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるようになる。
【００３１】
また、請求項１２の発明に係る画像処理プログラムは、請求項９〜１１のいずれか一つに記載の発明において、前記写真領域の識別により写真領域と識別された後の部分領域を入力画像とみなし、再度前記各処理を実行させ、前記写真領域を再識別させることを特徴とする。
【００３２】
この請求項１２の発明によれば、コンピュータ処理時に再帰的に写真領域を識別することによって、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるようになる。
【００３３】
また、請求項１３の発明に係るコンピュータ読み取り可能な記録媒体は、請求項９〜１２のいずれか一つに記載のプログラムを記録しコンピュータ読み取り可能なことを特徴とする。
【００３４】
【発明の実施の形態】
以下に添付図面を参照して、この発明に係る画像処理装置、画像処理方法、画像処理プログラム及びそのプログラムを記録したコンピュータ読み取り可能な記録媒体の好適な実施の形態を詳細に説明する。
【００３５】
図１は、本発明の画像処理装置の全体構成を示すブロック図である。画像処理装置は、ＣＰＵ（中央処理演算装置）１０１，メモリ１０２，スキャナやデジタルカメラ等の画像入力装置１０３，通信装置１０４，モニタ等の表示装置１０５，キーボード１０６，ハードディスク等の固定記録装置１０７，ＣＤ−ＲＡＭ／ＲＯＭやＤＶＤ−ＲＡＭ／ＲＯＭ、フレキシブルディスク等の記録媒体（記録メディア）１０８を用いる外部記録再生装置１０９を有して構成されている。
【００３６】
ＣＰＵ１０１は全体を制御し、メモリ１０２はＣＰＵ１０１で処理するデータやプログラムを保持する。通信装置１０４はＬＡＮ等のネットワークに接続するための装置である。キーボード１０６はデータやユーザーの指示を入力する。固定記録装置１０７や外部記録再生装置１０９は、データやプログラムなどを記録メディア１０８にセーブ、ロードするためのものである。
【００３７】
図２は、画像処理装置における画像処理手段の内部構成を示すブロック図である。この画像処理手段２００の各機能は、ＣＰＵ１０１に対し画像処理プログラムを実行することにより得ることができる。画像処理手段２００は、カラースキャナ等の画像入力装置１０３から多値の文書画像を入力する画像入力手段２０１と、入力された画像に対して画素色の減色を行う前処理手段２０２と、入力された多値文書画像の背景色を特定する背景色抽出手段２０３と、背景色と異なる画素が連結した領域を抽出する部分領域抽出手段２０４と、部分領域の画素色の種類数を抽出する画素色数抽出手段２０５と、得られた画素色数から部分領域画像が写真領域かどうか判定する写真領域判定手段２０６と、抽出したデータを保存しておくメモリ２０７と、処理の流れを制御する制御手段２０８からなる。メモリ２０７は、図１記載のメモリ１０２を用いることができる。
【００３８】
（実施の形態１）
次に、上記構成による画像処理を説明する。図３は、本発明の実施の形態１による画像処理手順を示すフローチャートである。まず、画像入力処理を行い（ステップＳ３０１）、入力された文書画像に対して背景色を特定する（ステップＳ３０２）。例えば、特開２００１−２２２７１７号公報に記載された方法等、汎用の方法を用いて背景色を特定できる。
【００３９】
次に、背景色以外の画素が連結した部分領域を抽出する（ステップＳ３０３）。連結した画素の抽出は、従来から広く知られた既存の処理を用いて行える。次に、部分領域を構成する画素色の種類の数を求める（ステップＳ３０４）。そして、この画素色の種類の数に基づき写真領域を判定する（ステップＳ３０５）。この写真領域の判定は、予め定めてあったしきい値と比較し、しきい値よりも大きければ、当該部分領域を写真領域とする。しきい値は、実験的に求めておけばよい。
【００４０】
一般に、文字を構成する画素色は１種類、ないしアウトラインや影付き文字であったとしてもせいぜい数種類程度であることが多い。一方、写真ははるかに多くの種類の画素色で構成されている。また、図や表についても、文字を構成する画素の色よりは多いが、１種類のベタな色の図形の組み合わせであることが多く、写真領域ほど画素色の種類が多くはならない。このように、部分領域の画素色の数を用いれば、簡単に高精度で多値画像から写真領域を識別することができる。
【００４１】
・画素色種類のカウントについて
一般に、カラー画像で使用されているＴｒｕｅＣｏｌｏｒにおいては、画素ごとに赤（Ｒ），緑（Ｇ），青（Ｂ）の値を８ビットずつ持っており、２５６×２５６×２５６＝１６７７７２１６色のどれかが割り当てられることになる。従って、画素色数抽出手段２０５における最も単純な色の数のカウント方法は、１６７７７２１６種類の色に対して画素数のヒストグラムを取ることに基づき行える。そして、画素数が０でない色を数え上げれば、色の数を決定できる。
【００４２】
（実施の形態２）
この発明の実施の形態２は、実施の形態１において部分領域を構成する画素色の種類の数を求める際（ステップＳ３０４）、色空間上の距離が所定の値未満であるような色同士は同色とみなして、画素色の種類の数を求めるものである。例えば、カラースキャナ等の入力装置で一般的なＴｒｕｅＣｏｌｏｒ（約１６７７万色）で画像を入力すると、走査における僅かな照明（輝度）のずれでも敏感に反応してしまい、得られた画像の中の本来１種類となるべき文字を構成する画素色のＲＧＢ成分のいずれかの輝度が僅かにずれて異なる種類の色であると判断されることがあり、これを回避するものである。なお、予め同色とみなす色空間上の距離を実験的に求めておく。
【００４３】
色空間上の距離が所定の値未満であるような色同士は同色とみなすことにより、僅かな輝度のずれに伴って画素の色がずれる、ということを防ぐことができるようになる。
【００４４】
・色空間と距離としきい値について
上述した色空間とは、ＲＧＢそれぞれが０〜２５５の整数値をもつＴｒｕｅＣｏｌｏｒを示しており、距離とはいわゆるユークリッド距離である。画素１の色を（ｒ１，ｇ１，ｂ１）、画素２の色を（ｒ２，ｇ２，ｂ２）で表したとき、それらのユークリッド距離ｄは
【００４５】
ｄ＝ｓｑｒｔ（（ｒ１−ｒ２）^２＋（ｇ１−ｇ２）^２＋（ｂ１−ｂ２）^２）
【００４６】
で表される。上記のｓｑｒｔ（）の表記は、平方根を示している。同一種類の色と見なす距離のしきい値として代表的な値は２４を用いる。
【００４７】
（実施の形態３）
図４は、本発明の実施の形態３による画像処理手順を示すフローチャートである。実施の形態１と同一の処理過程には同一のステップ番号を附している。この実施の形態３では、ステップＳ３０１で入力された文書画像に対し、画素色を前処理手段２０２で減色する処理（ステップＳ４０１）を追加して、入力された文書画像の画素色を予め定めた数の種類まで減らした後、背景色の特定（ステップＳ３０２），部分領域抽出（ステップＳ３０３），画素色数抽出（ステップＳ３０４），写真領域判定（ステップＳ３０５）を実行する。
【００４８】
実施の形態２においても説明したが、例えば、カラースキャナ等の入力装置で一般的なＴｒｕｅＣｏｌｏｒ（約１６７７万色）で画像を入力すると、走査における僅かな照明（輝度）のずれでも敏感に反応してしまい、得られた画像の中の本来１種類となるべき文字を構成する画素色のＲＧＢ成分のいずれかの輝度が僅かにずれて異なる種類の色であると判断されることがあり、これを回避するものである。減色後の色数は、人が目で見て１種類の色であるような画素の集合が、量子化によって１種類の色となるように、実験的に求めておけばよい。
【００４９】
このように、背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるようになる。
【００５０】
・画素色の減色について
上記のように、前処理手段２０２で減色させる具体例を説明すると、ＲＧＢそれぞれに各８ビット割り当てられていたのを、各３ビットにする。つまり、ＲＧＢ各々の値が０〜２５５までの値であったものを、それぞれ３２で割って小数部分を切り捨て、０〜７までの値に変換する。これにより、８×８×８＝５１２色に減色できる。このようにＲＧＢをそれぞれ８ビットから３ビットに変換（減色）するに限らず、２ビットや４ビットに変換（減色）する構成としても良い。
【００５１】
（実施の形態４）
図５は、本発明の実施の形態４による画像処理手順を示すフローチャートである。実施の形態４では、実施の形態１におけるステップＳ３０５で識別された写真領域を、新たに入力された画像とみなして、再帰的に写真領域を識別する構成である。
【００５２】
図５のフローチャート上では、再帰判定処理（ステップＳ３０６）が追加され、初回は再帰時であるとして（ステップＳ３０６：Ｙｅｓ）、ステップＳ３０２に復帰し、再度一連の処理（ステップＳ３０２〜ステップＳ３０５）を実行した後（ステップＳ３０６：Ｎｏ）、終了する。これによって、文書画像上で、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるようになる。
【００５３】
なお、本実施の形態で説明した画像処理方法は、予め用意されたプログラムをパーソナル・コンピュータやワークステーション等のコンピュータで実行することにより実現することができる。このプログラムは、ハードディスク、フレキシブルディスク、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ等のコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行される。またこのプログラムは、インターネット等のネットワークを介して配布することが可能な伝送媒体であってもよい。
【００５４】
【発明の効果】
以上説明したように、請求項１に記載の発明によれば、多値画像として入力された文書画像から写真領域を識別する機能を有した画像処理装置において、前記文書画像の背景色を特定する背景色抽出手段と、前記特定した背景色と異なる色の画素が連結した部分領域を抽出する部分領域抽出手段と、前記部分領域の画素色の数を求める画素色数抽出手段と、前記画素色の数と、予め定められた所定の値とを比較して、前記部分領域が写真領域であるか否かを識別する写真領域判定手段と、前記各手段を制御する制御手段とを有するので、多値で入力された文書画像に対し、部分領域の画素色の数に基づき写真領域を判定することができ、多値画像の写真領域を簡単かつ高精度に誤りなく識別することができるという効果を奏する。
【００５５】
また、請求項２に記載の発明によれば、請求項１に記載の発明において、前記画素色数抽出手段は、色空間において距離が所定の値未満のとき、同色とみなす処理を行うので、色空間上の距離が所定の値未満であるような色同士を同色とみなすことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができ、処理効率の向上が図れるという効果を奏する。
【００５６】
また、請求項３に記載の発明によれば、請求項１に記載の発明において、入力される前記文書画像の画素色を、所定の数に減色する前処理手段を備えたので、背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるという効果を奏する。
【００５７】
また、請求項４に記載の発明によれば、請求項１〜３のいずれか一つに記載の発明において、前記制御手段は、前記写真領域判定手段により写真領域と識別された部分領域を入力画像とみなして各手段による処理を実行させ、再度写真領域を識別するので、再帰的に写真領域を識別することによって、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるという効果を奏する。
【００５８】
また、請求項５に記載の発明によれば、多値画像として入力された文書画像から写真領域を識別する画像処理方法において、前記文書画像の背景色を特定する背景色抽出工程と、前記特定した背景色と異なる色の画素が連結した部分領域を抽出する部分領域抽出工程と、前記部分領域の画素色の数を求める画素色数抽出工程と、前記画素色の数と、予め定められた所定の値とを比較して、前記部分領域が写真領域であるか否かを識別する写真領域判定工程とを含むので、多値で入力された文書画像に対し、部分領域の画素色の数に基づき写真領域を判定することができ、多値画像の写真領域を簡単かつ高精度に誤りなく識別することができるという効果を奏する。
【００５９】
また、請求項６に記載の発明によれば、請求項５に記載の発明において、前記画素色数抽出工程は、色空間において距離が所定の値未満のとき、同色とみなす処理を行うので、色空間上の距離が所定の値未満であるような色同士を同色とみなすことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができ、処理効率の向上が図れるという効果を奏する。
【００６０】
また、請求項７に記載の発明によれば、請求項５に記載の発明において、入力される前記文書画像の画素色を、所定の数に減色して前記背景色抽出工程に渡す前処理工程を含むので、背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるという効果を奏する。
【００６１】
また、請求項８に記載の発明によれば、請求項５〜７のいずれか一つに記載の発明において、前記写真領域判定工程により写真領域と識別された後の部分領域を入力画像とみなし、再度前記各工程の処理を実行して、前記写真領域を再識別するので、再帰的に写真領域を識別することによって、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるようになるという効果を奏する。
【００６２】
また、請求項９に記載の発明によれば、多値画像として入力された文書画像から写真領域を識別する画像処理プログラムであって、該画像処理プログラムはコンピュータに、前記文書画像の背景色を特定させ、前記特定した背景色と異なる色の画素が連結した部分領域を抽出させ、前記部分領域の画素色の数を求めさせ、前記画素色の数と、予め定められた所定の値とを比較して、前記部分領域が写真領域であるか否かを識別させるので、コンピュータ処理時に多値で入力された文書画像に対し、部分領域の画素色の数に基づき写真領域を判定することができ、多値画像の写真領域を簡単かつ高精度に誤りなく識別することができるという効果を奏する。
【００６３】
また、請求項１０に記載の発明によれば、請求項９に記載の発明において、前記画素色数を求める際に、色空間において距離が所定の値未満のとき、同色とみなす処理を行わせるので、コンピュータ処理時に色空間上の距離が所定の値未満であるような色同士を同色とみなすことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができ、処理効率の向上を図れるようになるという効果を奏する。
【００６４】
また、請求項１１に記載の発明によれば、請求項９に記載の発明において、入力される前記文書画像の画素色を、所定の数に減色してから前記背景色の抽出を行わせるので、コンピュータ処理時に背景色の特定、及び写真領域の識別のときに、予め色の数を減らしておくことで、僅かな輝度のずれに伴って画素の色がずれてしまうことを防ぐことができるという効果を奏する。
【００６５】
また、請求項１２に記載の発明によれば、請求項９〜１１のいずれか一つに記載の発明において、前記写真領域の識別により写真領域と識別された後の部分領域を入力画像とみなし、再度前記各処理を実行させ、前記写真領域を再識別させるので、コンピュータ処理時に再帰的に写真領域を識別することによって、写真領域が他の領域に包含されているような場合でも、正しく写真領域のみを抽出できるようになるという効果を奏する。
【００６６】
また、請求項１３に記載の発明によれば、請求項９〜１２のいずれか一つに記載のプログラムを記録しコンピュータ読み取り可能としたので、コンピュータに対し請求項９〜１２のいずれか一つに記載の動作を実現できるようになるという効果を奏する。
【図面の簡単な説明】
【図１】この発明の本実施の形態に係る画像処理装置の全体構成を示すブロック図である。
【図２】画像処理装置における画像処理手段の内部構成を示すブロック図である。
【図３】本発明の実施の形態１による画像処理手順を示すフローチャートである。
【図４】本発明の実施の形態３による画像処理手順を示すフローチャートである。
【図５】本発明の実施の形態４による画像処理手順を示すフローチャートである。
【図６】文書内容を示す図である。
【符号の説明】
１０１　ＣＰＵ（中央処理演算装置）
１０２　メモリ
１０３　画像入力装置
１０４　通信装置
１０５　表示装置
１０６　キーボード
１０７　固定記録装置
１０８　記録媒体（記録メディア）
１０９　外部記録再生装置
２００　画像処理手段
２０１　画像入力手段
２０２　前処理手段
２０３　背景色抽出手段
２０４　部分領域抽出手段
２０５　画素色数抽出手段
２０６　写真領域判定手段
２０７　メモリ
２０８　制御手段
[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to an image processing apparatus, an image processing method, an image processing program, and a computer-readable recording medium on which the program is recorded, which identify, in particular, photograph areas in a document image containing a mixture of characters, photographs, figures, tables, and the like.
[0002]
[Prior art]
In recent years, devices such as scanners, digital cameras, and printers that can input and output multi-valued images (color or grayscale) have become widespread, and recording media have become cheaper, so applications that store, process, and output images in multi-valued form are increasing. Document images, too, are increasingly handled as multi-valued images. There is now demand to run on multi-valued images the processes that have conventionally been performed mainly on binary document images, such as area identification, title character string extraction, table processing, and OCR.
[0003]
Because various methods already exist for binary document images, it was thought that existing processing for binary images could be applied if a multi-valued document image were binarized well. FIG. 6 shows the contents of a document. On the document image 600, areas with different attributes are mixed, such as a character area 601, a table area (not shown), a figure area 602, and a photograph area 603. Of these, the character area 601, the table area, and the figure area 602 are usually rendered in only a few colors. It is rare for a single character to be drawn in multiple colors or for a single table cell to be multicolored.
[0004]
On this premise, existing processing for binary images remains effective (for the character area 601, the table area, the figure area 602, and so on) if the multi-valued document image is binarized with locally appropriate thresholds, as in the image processing method disclosed in Japanese Patent Application Laid-Open No. 2000-040153. In the photograph area 603, however, the number of kinds of pixel colors is far larger than in a binary image, depending on the content of the photograph. If the photograph area 603 is binarized by threshold processing in the same way as the other areas, not only does the content of the photograph become unrecognizable, but the photograph area 603 was also sometimes misclassified as another kind of area during subsequent processing of the binarized image.
[0005]
If only the photograph area 603 is identified in the multi-valued document image in advance and is excluded from binarization and the processing that follows, the errors caused by binarizing the photograph area 603 can be prevented. It is therefore necessary to extract the photograph area 603 from the multi-valued document image.
[0006]
One method of classifying partial areas such as photographs in a multi-valued image is the image processing method disclosed in Japanese Patent Application Laid-Open No. 2001-086328. That method prepares a plurality of binarization thresholds, extracts rectangle information from each of the resulting binarized images, compares the rectangle information across the images, and classifies the partial areas based on predetermined conditions.
[0007]
[Problems to be solved by the invention]
However, this method treats as a photograph area any partial area whose rectangle information shifts markedly in position or size across the binary images. As a result, a character area in an intermediate color can be misclassified as a photograph area because its rectangle information shifts greatly, while a photograph area with a skewed luminance distribution can be misclassified as a character area because its rectangle information stays fairly stable. In addition, because the optimal thresholds for creating the binary images, and the number of thresholds, are not fixed, whether rectangle information that permits correct classification can be extracted depends heavily on the content of the image, which is a drawback.
[0008]
To solve the above-described problems of the prior art, an object of the present invention is to provide an image processing apparatus, an image processing method, an image processing program, and a computer-readable recording medium on which the program is recorded, which can identify photograph areas in a document image without error.
[0009]
[Means for Solving the Problems]
In order to solve the above-described problems and achieve the object, an image processing apparatus according to the first aspect of the present invention is an image processing apparatus having a function of identifying a photograph area in a document image input as a multi-valued image, comprising: background color extracting means for specifying the background color of the document image; partial area extracting means for extracting a partial area in which pixels of colors different from the specified background color are connected; pixel color number extracting means for obtaining the number of pixel colors in the partial area; photograph area determining means for comparing the number of pixel colors with a predetermined value to identify whether or not the partial area is a photograph area; and control means for controlling each of these means.
[0010]
According to the first aspect of the present invention, a photograph area can be determined in a document image input as a multi-valued image based on the number of pixel colors in a partial area, so the photograph areas of the multi-valued image can be identified simply, accurately, and without error.
[0011]
Further, in the image processing apparatus according to the second aspect of the present invention, in the first aspect, the pixel color number extracting means treats colors as the same color when their distance in the color space is less than a predetermined value.
[0012]
According to the second aspect of the present invention, treating colors whose distance in the color space is less than the predetermined value as the same color prevents pixel colors from shifting as a result of slight shifts in luminance, and processing efficiency can be improved.
[0013]
The image processing apparatus according to the third aspect of the present invention, in the first aspect, further comprises preprocessing means for reducing the pixel colors of the input document image to a predetermined number.
[0014]
According to the third aspect of the present invention, reducing the number of colors in advance, before the background color is specified and the photograph area is identified, prevents pixel colors from shifting as a result of slight shifts in luminance.
[0015]
Further, in the image processing apparatus according to the fourth aspect of the present invention, in any one of the first to third aspects, the control means treats a partial area identified as a photograph area by the photograph area determining means as an input image, has each means process it, and identifies the photograph area again.
[0016]
According to the fourth aspect of the present invention, by recursively identifying a photograph area, only a photograph area can be correctly extracted even when the photograph area is included in another area.
[0017]
The image processing method according to the fifth aspect of the present invention is an image processing method for identifying a photograph area in a document image input as a multi-valued image, comprising: a background color extracting step of specifying the background color of the document image; a partial area extracting step of extracting a partial area in which pixels of colors different from the specified background color are connected; a pixel color number extracting step of obtaining the number of pixel colors in the partial area; and a photograph area determining step of comparing the number of pixel colors with a predetermined value to identify whether or not the partial area is a photograph area.
[0018]
According to the fifth aspect of the present invention, a photograph area can be determined in a document image input as a multi-valued image based on the number of pixel colors in a partial area, so the photograph areas of the multi-valued image can be identified simply, accurately, and without error.
[0019]
In the image processing method according to the sixth aspect of the present invention, in the fifth aspect, the pixel color number extracting step treats colors as the same color when their distance in the color space is less than a predetermined value.
[0020]
According to the sixth aspect of the present invention, treating colors whose distance in the color space is less than the predetermined value as the same color prevents pixel colors from shifting as a result of slight shifts in luminance, and processing efficiency can be improved.
[0021]
The image processing method according to the seventh aspect of the present invention, in the fifth aspect, includes a preprocessing step of reducing the pixel colors of the input document image to a predetermined number and passing the result to the background color extracting step.
[0022]
According to the seventh aspect of the present invention, reducing the number of colors in advance, before the background color is specified and the photograph area is identified, prevents pixel colors from shifting as a result of slight shifts in luminance.
[0023]
In the image processing method according to the eighth aspect of the present invention, in any one of the fifth to seventh aspects, a partial area identified as a photograph area in the photograph area determining step is treated as an input image, and the processing of each step is executed again to re-identify the photograph area.
[0024]
According to the eighth aspect of the present invention, by recursively identifying a photograph region, only a photograph region can be correctly extracted even if the photograph region is included in another region.
[0025]
An image processing program according to the ninth aspect of the present invention is an image processing program for identifying a photograph area in a document image input as a multi-valued image, the program causing a computer to specify the background color of the document image, extract a partial area in which pixels of colors different from the specified background color are connected, obtain the number of pixel colors in the partial area, and compare the number of pixel colors with a predetermined value to identify whether or not the partial area is a photograph area.
[0026]
According to the ninth aspect of the present invention, during computer processing, a photograph area can be determined in a document image input as a multi-valued image based on the number of pixel colors in a partial area, so the photograph areas of the multi-valued image can be identified simply, accurately, and without error.
[0027]
The image processing program according to the tenth aspect of the present invention, in the ninth aspect, causes the computer, when obtaining the number of pixel colors, to treat colors as the same color when their distance in the color space is less than a predetermined value.
[0028]
According to the tenth aspect of the present invention, treating colors whose distance in the color space is less than the predetermined value as the same color during computer processing prevents pixel colors from shifting as a result of slight shifts in luminance, and processing efficiency can be improved.
[0029]
The image processing program according to the eleventh aspect of the present invention, in the ninth aspect, causes the computer to reduce the pixel colors of the input document image to a predetermined number before extracting the background color.
[0030]
According to the eleventh aspect of the present invention, reducing the number of colors in advance during computer processing, before the background color is specified and the photograph area is identified, prevents pixel colors from shifting as a result of slight shifts in luminance.
[0031]
The image processing program according to the twelfth aspect of the present invention, in any one of the ninth to eleventh aspects, causes the computer to treat a partial area identified as a photograph area as an input image, execute each of the processes again, and re-identify the photograph area.
[0032]
According to the twelfth aspect of the present invention, by recursively identifying photograph areas during computer processing, only the photograph area can be correctly extracted even when it is contained within another area.
[0033]
According to a thirteenth aspect of the present invention, there is provided a computer-readable recording medium which records the program according to any one of the ninth to twelfth aspects and is readable by a computer.
[0034]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, preferred embodiments of an image processing apparatus, an image processing method, an image processing program, and a computer-readable recording medium storing the program according to the present invention will be described in detail with reference to the accompanying drawings.
[0035]
FIG. 1 is a block diagram showing the overall configuration of the image processing apparatus of the present invention. The image processing apparatus comprises a CPU (central processing unit) 101, a memory 102, an image input device 103 such as a scanner or digital camera, a communication device 104, a display device 105 such as a monitor, a keyboard 106, a fixed recording device 107 such as a hard disk, and an external recording/reproducing device 109 that uses a recording medium (recording media) 108 such as a CD-RAM/ROM, DVD-RAM/ROM, or flexible disk.
[0036]
The CPU 101 controls the entire system, and the memory 102 stores data and programs to be processed by the CPU 101. The communication device 104 is a device for connecting to a network such as a LAN. The keyboard 106 is used to input data and user instructions. The fixed recording device 107 and the external recording / reproducing device 109 are for saving and loading data, programs, and the like on the recording medium 108.
[0037]
FIG. 2 is a block diagram showing the internal configuration of the image processing means in the image processing apparatus. Each function of the image processing means 200 is obtained by having the CPU 101 execute an image processing program. The image processing means 200 consists of image input means 201 for inputting a multi-valued document image from the image input device 103 such as a color scanner, preprocessing means 202 for reducing the pixel colors of the input image, background color extracting means 203 for specifying the background color of the input multi-valued document image, partial area extracting means 204 for extracting areas in which pixels different from the background color are connected, pixel color number extracting means 205 for extracting the number of kinds of pixel colors in a partial area, photograph area determining means 206 for determining from the obtained number of pixel colors whether or not the partial area image is a photograph area, a memory 207 for storing the extracted data, and control means 208 for controlling the processing flow. The memory 102 shown in FIG. 1 can be used as the memory 207.
[0038]
(Embodiment 1)
Next, image processing according to the above configuration will be described. FIG. 3 is a flowchart showing an image processing procedure according to the first embodiment of the present invention. First, an image input process is performed (step S301), and a background color is specified for the input document image (step S302). For example, the background color can be specified using a general-purpose method such as the method described in JP-A-2001-222717.
[0039]
Next, partial areas in which pixels of colors other than the background color are connected are extracted (step S303). Connected pixels can be extracted using widely known existing processing. Next, the number of kinds of pixel colors constituting each partial area is obtained (step S304). A photograph area is then determined based on this number (step S305): the number is compared with a predetermined threshold, and if it is larger than the threshold, the partial area is taken to be a photograph area. The threshold may be determined experimentally.
[0040]
In general, the pixels that make up a character have one color, or at most a few colors even for outlined or shadowed characters. Photographs, on the other hand, are made up of far more kinds of pixel colors. Figures and tables use more colors than characters do, but they are usually combinations of shapes each filled with a single solid color, so they do not have as many kinds of pixel colors as a photograph area. Using the number of pixel colors in a partial area therefore makes it possible to identify photograph areas in a multi-valued image simply and with high accuracy.
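Steps S302 to S305 can be sketched as follows. This is only a minimal illustration under stated assumptions, not the implementation described in the publication: the background color is taken to be the most frequent color (the text instead cites JP-A-2001-222717 for this step), partial areas are found with 4-connectivity, and all function names and the threshold value are hypothetical.

```python
from collections import Counter, deque


def find_background_color(image):
    """Step S302 (assumption): treat the most frequent color as the background."""
    counts = Counter(pixel for row in image for pixel in row)
    return counts.most_common(1)[0][0]


def extract_partial_areas(image, background):
    """Step S303: group non-background pixels into 4-connected partial areas."""
    height, width = len(image), len(image[0])
    seen = [[False] * width for _ in range(height)]
    areas = []
    for y in range(height):
        for x in range(width):
            if seen[y][x] or image[y][x] == background:
                continue
            seen[y][x] = True
            queue, area = deque([(y, x)]), []
            while queue:
                cy, cx = queue.popleft()
                area.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and image[ny][nx] != background):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            areas.append(area)
    return areas


def count_pixel_colors(image, area):
    """Step S304: number of distinct colors among the pixels of one area."""
    return len({image[y][x] for y, x in area})


def find_photograph_areas(image, color_threshold):
    """Step S305: keep areas whose color count exceeds the threshold."""
    background = find_background_color(image)
    return [area for area in extract_partial_areas(image, background)
            if count_pixel_colors(image, area) > color_threshold]
```

An image here is a list of rows of `(r, g, b)` tuples. As the text notes, the threshold passed as `color_threshold` would have to be chosen experimentally.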
[0041]
・Counting pixel color types
In TrueColor, which is commonly used for color images, each pixel has 8 bits each for red (R), green (G), and blue (B), so it is assigned one of 256 × 256 × 256 = 16777216 colors. The simplest way for the pixel color number extracting means 205 to count colors is therefore to build a histogram of pixel counts over the 16777216 colors; counting the colors whose pixel count is not 0 gives the number of colors.
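The histogram-based count can be illustrated with a short sketch. It assumes pixels are given as `(r, g, b)` tuples and uses a sparse histogram (`collections.Counter`) instead of a dense table with 16777216 entries; the names are hypothetical.

```python
from collections import Counter

# Number of colors representable with 8 bits per RGB channel.
TRUECOLOR_COLORS = 256 ** 3


def color_histogram(pixels):
    """Histogram of pixel counts per color, storing only colors that occur."""
    return Counter(pixels)


def count_colors(pixels):
    """Count the colors whose pixel count in the histogram is not 0."""
    histogram = color_histogram(pixels)
    return sum(1 for pixel_count in histogram.values() if pixel_count > 0)
```

A sparse histogram gives the same count as the dense one, because colors that never occur simply have no entry.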
[0042]
(Embodiment 2)
In the second embodiment of the present invention, when the number of kinds of pixel colors constituting a partial area is obtained in the first embodiment (step S304), colors whose distance in the color space is less than a predetermined value are treated as the same color. For example, when an image is input in TrueColor (about 16.77 million colors), which is common for input devices such as color scanners, the result responds sensitively to even a slight shift in illumination (luminance) during scanning: the luminance of one of the RGB components of the pixels making up a character that should be a single color may shift slightly, so that the pixels are judged to be different colors. This embodiment avoids that problem. The color-space distance within which colors are treated as the same is determined experimentally in advance.
[0043]
By regarding colors whose distance in the color space is less than a predetermined value as the same color, it is possible to prevent pixels from being treated as different colors because of a slight luminance shift.
[0044]
・ About color space, distance and threshold
The color space referred to above is TrueColor, in which each of the R, G, and B components takes an integer value from 0 to 255, and the distance is the Euclidean distance. When the color of pixel 1 is (r1, g1, b1) and the color of pixel 2 is (r2, g2, b2), their Euclidean distance d is given by
[0045]
d = sqrt((r1 - r2)² + (g1 - g2)² + (b1 - b2)²)
[0046]
where sqrt() denotes the square root. A representative value of 24 is used as the distance threshold below which two colors are regarded as the same type of color.
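A sketch of counting colors with this distance rule is given below. The text specifies only the Euclidean distance and the representative threshold of 24; it does not say how colors are grouped, so the greedy grouping against representative colors, the sorted iteration order, and the name `count_merged_colors` are assumptions made for illustration.

```python
import math

def count_merged_colors(pixels, threshold=24.0):
    """Count colors, treating colors closer than `threshold` as one color."""
    representatives = []
    # Sorting only makes the greedy result reproducible; it is an assumption
    # of this sketch, not something stated in the source.
    for color in sorted(set(pixels)):
        # Keep the color as a new representative only if no existing
        # representative lies within the threshold distance.
        if all(math.dist(color, rep) >= threshold for rep in representatives):
            representatives.append(color)
    return len(representatives)

# Hypothetical scan: near-black pixels and near-red pixels with small shifts.
scanned = [(0, 0, 0), (3, 2, 1), (10, 10, 10), (200, 0, 0), (205, 3, 0)]
print(count_merged_colors(scanned))  # 2
```

Because "closer than the threshold" is not transitive, the count from a greedy pass can depend on the order in which colors are visited; a different grouping rule could give a different number for borderline colors.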
[0047]
(Embodiment 3)
FIG. 4 is a flowchart showing an image processing procedure according to the third embodiment of the present invention. The same processing steps as those in the first embodiment are denoted by the same step numbers. In the third embodiment, a color reduction process performed by the pre-processing unit 202 (step S401) is added for the document image input in step S301. After the pixel colors of the input document image have been reduced to a predetermined number of types, identification of the background color (step S302), extraction of partial areas (step S303), extraction of the number of pixel colors (step S304), and determination of photograph areas (step S305) are executed.
[0048]
As described in the second embodiment, when an image is input in ordinary TrueColor (about 16.77 million colors) using an input device such as a color scanner, the result is sensitive to even slight variations in illumination (luminance) during scanning. In the captured image, the luminance of one of the RGB components of the pixels making up a character that should be a single color may shift slightly, so that the pixels are judged to be different colors. The color reduction is intended to avoid this. The number of colors after reduction may be determined experimentally so that quantization maps a set of pixels that visually appear to be one color to a single color.
[0049]
As described above, reducing the number of colors in advance of specifying the background color and identifying the photograph area makes it possible to prevent pixels from being treated as different colors because of a slight luminance shift.
[0050]
・ About color reduction of pixel color
A specific example of the color reduction performed by the pre-processing unit 202 is as follows. Instead of 8 bits, 3 bits are assigned to each of R, G, and B. That is, each RGB value, which ranges from 0 to 255, is divided by 32 and the fractional part is discarded, giving a value from 0 to 7. This reduces the number of colors to 8 × 8 × 8 = 512. The configuration is not limited to reducing each RGB component from 8 bits to 3 bits; the components may instead be reduced to 2 bits or 4 bits.
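The color reduction just described can be sketched as below. The function name `reduce_color` and the `bits` parameter are hypothetical; the right shift by 8 − bits is equivalent to dividing by 32 and discarding the fraction when bits is 3.

```python
def reduce_color(pixel, bits=3):
    """Quantize an 8-bit-per-channel (r, g, b) pixel to `bits` bits per channel."""
    shift = 8 - bits  # for bits=3, shifting right by 5 equals integer division by 32
    r, g, b = pixel
    return (r >> shift, g >> shift, b >> shift)

# Two slightly different scanned grays fall into the same reduced color.
print(reduce_color((100, 100, 100)))  # (3, 3, 3)
print(reduce_color((103, 98, 101)))   # (3, 3, 3)
print(reduce_color((255, 128, 31)))   # (7, 4, 0)
```

Values that straddle a bin boundary, such as 95 and 96, still map to different reduced values (2 and 3), so color reduction alone does not absorb every small shift.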
[0051]
(Embodiment 4)
FIG. 5 is a flowchart showing an image processing procedure according to the fourth embodiment of the present invention. In the fourth embodiment, the photograph area identified in step S305 in the first embodiment is regarded as a newly input image, and the photograph area is recursively identified.
[0052]
In the flowchart of FIG. 5, a recursion determination process (step S306) is added. On the first pass, it is determined that recursion is to be performed (step S306: Yes), and the process returns to step S302 to execute the series of processes (steps S302 to S305) again. After this second execution (step S306: No), the process ends. This makes it possible to correctly extract only the photograph area even when the photograph area is contained in another area on the document image.
[0053]
The image processing method described in the present embodiment can be realized by executing a prepared program on a computer such as a personal computer or a workstation. This program is recorded on a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, or a DVD, and is executed by being read from the recording medium by the computer. The program may also be provided as a transmission medium that can be distributed via a network such as the Internet.
[0054]
[Effects of the invention]
As described above, according to the invention of claim 1, an image processing apparatus having a function of identifying a photograph area from a document image input as a multi-valued image comprises: background color extraction means for specifying the background color of the document image; partial area extraction means for extracting a partial area in which pixels of colors different from the specified background color are connected; pixel color number extraction means for obtaining the number of pixel colors in the partial area; photograph area determination means for comparing the number of pixel colors with a predetermined value to determine whether the partial area is a photograph area; and control means for controlling each of these means. The photograph area can therefore be determined from the number of pixel colors in the partial area of the multi-valued input document image, which has the effect that the photograph area of a multi-valued image can be identified easily and accurately without error.
[0055]
According to the invention of claim 2, in the invention of claim 1, the pixel color number extraction means regards colors whose distance in the color space is less than a predetermined value as the same color. This prevents pixels from being treated as different colors because of a slight luminance shift, which has the effect of improving processing efficiency.
[0056]
According to the invention of claim 3, the invention of claim 1 further comprises pre-processing means for reducing the pixel colors of the input document image to a predetermined number. Reducing the number of colors in advance of specifying the background color and identifying the photograph area has the effect of preventing pixels from being treated as different colors because of a slight luminance shift.
[0057]
According to the invention of claim 4, in the invention of any one of claims 1 to 3, the control means regards the partial area identified as a photograph area by the photograph area determination means as an input image, causes each of the means to process it, and identifies the photograph area again. Identifying the photograph area recursively in this way has the effect that only the photograph area can be extracted correctly even when it is contained in another area.
[0058]
According to the invention of claim 5, an image processing method for identifying a photograph area from a document image input as a multi-valued image comprises: a background color extraction step of specifying the background color of the document image; a partial area extraction step of extracting a partial area in which pixels of colors different from the specified background color are connected; a pixel color number extraction step of obtaining the number of pixel colors in the partial area; and a step of comparing the number of pixel colors with a predetermined value to determine whether the partial area is a photograph area. This has the effect that the photograph area of a multi-valued image can be identified easily and accurately without error.
[0059]
According to the invention of claim 6, in the invention of claim 5, the pixel color number extraction step regards colors whose distance in the color space is less than a predetermined value as the same color. This prevents pixels from being treated as different colors because of a slight luminance shift, which has the effect of improving processing efficiency.
[0060]
According to the invention of claim 7, the invention of claim 5 further comprises a pre-processing step of reducing the pixel colors of the input document image to a predetermined number and passing the result to the background color extraction step. Reducing the number of colors in advance of specifying the background color and identifying the photograph area has the effect of preventing pixels from being treated as different colors because of a slight luminance shift.
[0061]
According to the invention of claim 8, in the invention of any one of claims 5 to 7, the partial area identified as a photograph area in the photograph area determination step is regarded as an input image, and each of the steps is executed again to re-identify the photograph area. Identifying the photograph area recursively in this way has the effect that only the photograph area can be extracted correctly even when it is contained in another area.
[0062]
According to the invention of claim 9, an image processing program for identifying a photograph area from a document image input as a multi-valued image causes a computer to specify the background color of the document image, extract a partial area in which pixels of colors different from the specified background color are connected, obtain the number of pixel colors in the partial area, and compare the number of pixel colors with a predetermined value to identify whether the partial area is a photograph area. During computer processing, the photograph area can therefore be determined from the number of pixel colors in the partial area of the multi-valued input document image, which has the effect that the photograph area of a multi-valued image can be identified easily and accurately without error.
[0063]
According to the invention of claim 10, in the invention of claim 9, colors whose distance in the color space is less than a predetermined value are regarded as the same color when the number of pixel colors is obtained. During computer processing, this prevents pixels from being treated as different colors because of a slight luminance shift, which has the effect of improving processing efficiency.
[0064]
According to the invention of claim 11, in the invention of claim 9, the pixel colors of the input document image are reduced to a predetermined number before the background color is extracted. During computer processing, reducing the number of colors in advance of specifying the background color and identifying the photograph area has the effect of preventing pixels from being treated as different colors because of a slight luminance shift.
[0065]
According to the invention of claim 12, in the invention of any one of claims 9 to 11, the partial area identified as a photograph area is regarded as an input image, each of the processes is executed again, and the photograph area is re-identified. Because the photograph area is identified recursively during computer processing, this has the effect that only the photograph area can be extracted correctly even when it is contained in another area.
[0066]
According to the invention of claim 13, the program according to any one of claims 9 to 12 is recorded in a form that can be read by a computer, so that the effects described above can be achieved.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating an overall configuration of an image processing apparatus according to an embodiment of the present invention.
FIG. 2 is a block diagram illustrating an internal configuration of an image processing unit in the image processing apparatus.
FIG. 3 is a flowchart illustrating an image processing procedure according to the first embodiment of the present invention.
FIG. 4 is a flowchart showing an image processing procedure according to a third embodiment of the present invention.
FIG. 5 is a flowchart illustrating an image processing procedure according to a fourth embodiment of the present invention.
FIG. 6 is a diagram showing the contents of a document.
[Explanation of symbols]
101 CPU (central processing unit)
102 Memory
103 Image input device
104 Communication device
105 Display device
106 Keyboard
107 Fixed recording device
108 Recording medium
109 External recording and playback device
200 Image processing means
201 Image input means
202 Preprocessing means
203 Background color extraction means
204 Partial area extraction means
205 Pixel color number extraction means
206 Photo area judgment means
207 Memory
208 Control means

Claims

1. An image processing apparatus having a function of identifying a photograph area from a document image input as a multi-valued image, comprising:
background color extraction means for specifying a background color of the document image;
partial area extraction means for extracting a partial area in which pixels of colors different from the specified background color are connected;
pixel color number extraction means for obtaining the number of pixel colors in the partial area;
photograph area determination means for comparing the number of pixel colors with a predetermined value to determine whether the partial area is a photograph area; and
control means for controlling each of the means.

2. The image processing apparatus according to claim 1, wherein the pixel color number extraction means regards colors whose distance in a color space is less than a predetermined value as the same color.

3. The image processing apparatus according to claim 1, further comprising preprocessing means for reducing the pixel colors of the input document image to a predetermined number.

4. The image processing apparatus according to any one of claims 1 to 3, wherein the control means regards the partial area identified as a photograph area by the photograph area determination means as an input image, causes each of the means to process it, and identifies the photograph area again.

5. An image processing method for identifying a photograph area from a document image input as a multi-valued image, comprising:
a background color extraction step of specifying a background color of the document image;
a partial area extraction step of extracting a partial area in which pixels of colors different from the specified background color are connected;
a pixel color number extraction step of obtaining the number of pixel colors in the partial area; and
a photograph area determination step of comparing the number of pixel colors with a predetermined value to determine whether the partial area is a photograph area.

6. The image processing method according to claim 5, wherein the pixel color number extraction step regards colors whose distance in a color space is less than a predetermined value as the same color.

7. The image processing method according to claim 5, further comprising a pre-processing step of reducing the pixel colors of the input document image to a predetermined number and passing the result to the background color extraction step.

8. The image processing method according to any one of claims 5 to 7, wherein the partial area identified as a photograph area in the photograph area determination step is regarded as an input image, and each of the steps is executed again to re-identify the photograph area.

9. An image processing program for identifying a photograph area from a document image input as a multi-valued image, the image processing program causing a computer to:
specify a background color of the document image;
extract a partial area in which pixels of colors different from the specified background color are connected;
obtain the number of pixel colors in the partial area; and
compare the number of pixel colors with a predetermined value to identify whether the partial area is a photograph area.

10. The image processing program according to claim 9, wherein, when the number of pixel colors is obtained, colors whose distance in a color space is less than a predetermined value are regarded as the same color.

11. The image processing program according to claim 9, wherein the background color is extracted after the pixel colors of the input document image have been reduced to a predetermined number.

12. The image processing program according to any one of claims 9 to 11, wherein the partial area identified as a photograph area is regarded as an input image, each of the processes is executed again, and the photograph area is re-identified.

13. A computer-readable recording medium on which the image processing program according to any one of claims 9 to 12 is recorded.