JP4036009B2

JP4036009B2 - Image data classification device

Info

Publication number: JP4036009B2
Application number: JP2002036237A
Authority: JP
Inventors: 仁岡本
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2002-02-14
Filing date: 2002-02-14
Publication date: 2008-01-23
Anticipated expiration: 2022-02-14
Also published as: JP2003242160A

Description

【０００１】
【発明の属する技術分野】
本発明は、画像データの分類に関する。
【０００２】
【従来の技術】
従来、登録された画像データを分類および検索するために、画像データと共にキーワードなどのコード情報を付加するのが一般的であった。このコード情報は、画像データを登録する際、またはすでに登録された画像データを分類、整理する際に、キーボードなどから入力されていた。しかし、このような方法では、分類あるいは検索対象となる画像データを登録する際に、コード情報を人手によって入力しなければならず、多大な作業量を要するという問題があった。
【０００３】
そこで、画像データから様々な特徴量を抽出して特徴ベクトルを構成し、これをコード情報に代わる情報として分類、検索に用いる方法が提案されている。また、画像データ間の類似度は、画像データから抽出した特徴ベクトル間の距離の近さに応じて定義されているが、特徴ベクトルの分布やベクトル距離計算の定義に類似度が依存してしまい、望ましい類似度が求められない可能性が指摘されていた。この特徴ベクトルの分布による依存性を排除するために、特開平９−２６５５２９号公報に記載された技術においては、特徴ベクトルの分布により分類結果を補正することにより、分布による影響を排除している。
【０００４】
また、特徴ベクトルによる分類を行う場合、様々な側面から画像データを捉えるために多くの種類の特徴量を抽出しており、特徴ベクトルが高次元となっていた。特徴ベクトルを用いて分類、検索を行う場合、これが高次元であると、ベクトルの照合の処理時間が増大するという問題があった。
【０００５】
そこで、多くの場合、蓄積された特徴ベクトルに対して主成分分析を行うことで、特徴ベクトルの直交性を保証し、分散の小さい空間軸を排除することにより、特徴ベクトルの次元数を抑えている。また、特開平１１−２１９３４７号公報では、蓄積された特徴ベクトルを階層的に記述しておき、検索あるいは分類の際には、可能性の低い階層の照合を省略することにより、処理の高速化を図っている。
【０００６】
【発明が解決しようとする課題】
しかし、前述の公報に記載された技術によっても、特徴ベクトルにより分類あるいは検索された結果が、妥当性のあるものであるのか保証することはできなかった。つまり、特徴ベクトルにより分類された結果が、ユーザの望むような分類となっていない場合があった。
【０００７】
また、蓄積されている特徴ベクトルの空間軸数が非常に多かったり、蓄積されている特徴ベクトルの分散が大きい場合には、主成分分析を行っても特徴ベクトルの空間軸数を十分に減らすことができない場合があった。
【０００８】
本発明は、このような問題点を考慮してなされたものであり、画像データから抽出した特徴ベクトルによる分類が、ユーザの望む分類に近くなるようにし、また処理負担の軽減することを目的とする。
【０００９】
【課題を解決するための手段】
前述の課題を解決するために、本発明にかかる画像データ分類装置においては、まず、あらかじめ複数の参照群に分類された画像データの特徴ベクトルに基づき、２種類以上の距離定義により、個々の画像データ間のベクトル区間距離を算出し、この算出されたベクトル間距離により、前記参照群に属する画像データの分類を行う。そして、この分類結果を前記参照群の分類のされ方と比較して、参照群の分類と類似した分類結果が得られた距離定義を得る。この距離定義を適用して未分類の画像データの分類を行う。
【００１０】
また、前記距離算出手段によるベクトル間距離の算出は、前記特徴ベクトルの分散、共分散および密度の少なくとも一つに基づいて行うことができる。
【００１１】
また、前記距離算出手段によるベクトル間距離の算出は、前記特徴ベクトルの所定の空間軸に関する出現確率に基づいて行うことができる。
【００１２】
また、前記画像データの特徴ベクトルは、当該画像データの登録に関する空間軸を含み、前記距離算出手段によるベクトル間距離の算出は、前記画像の登録に関する空間軸に対応した出現確率に基づいて行うことができる。
【００１３】
また、他の画像データ分類装置は、あらかじめ複数の参照群に分類された画像データのｎ次元特徴ベクトルに基づき、前記参照群間のベクトル間距離を算出する。また、このｎ次元特徴ベクトルの各空間軸の、ベクトル間距離に関する寄与度を算出し、この寄与度の高い空間軸を、ｎより小さいｍ個を抽出し、ｍ次元の特徴ベクトル空間を抽出する。そして、未分類の画像データに対し、前記ｍ次元特徴ベクトル空間において、ベクトル間距離に基づき、前記未分類の画像データの分類を行う。
【００１４】
前記寄与度の算出にあたっては、前記ｎ次元特徴ベクトルの少なくとも一つの空間軸を排除した特徴ベクトルに基づき参照群間の距離を算出し、この距離とｎ次元特徴ベクトルに基づく距離との比較により、前記排除した空間軸の寄与度を算出することができる。
【００１５】
【発明の実施の形態】
以下、本発明の実施の形態（以下実施形態という）を、図面に従って説明する。図１は、本実施形態の画像データ分類装置が適用されるネットワークの概略構成を示す図である。ネットワークには、インストールされた所定のプログラムによって本実施形態の画像データ分析装置として機能するコンピュータであるクライアント１、紙などの印刷媒体上の画像を取り込むスキャナ２、データベースを提供するデータベースサーバー３およびネットワークの管理を行うウェブサーバー４が接続されている。また、ノート型コンピュータ、携帯情報端末などの外部端末５とネットワークを接続するルータ６も設けられている。画像データは、前記のスキャナの他、ネット上から、外部端末５から、さらにはクライアント１の外部記憶媒体用のドライブから読み込むことができる。
【００１６】
図２は、本実施形態の機能構成図であって、特に画像データを分類するための距離定義の選択にかかる構成を示している。画像データ１０は、二つの参照群１２，１４に、あらかじめ分類されて、記憶されている。この分類は、ユーザにとって好ましいものとなっている。特徴ベクトル抽出手段１６により、第１の参照群１２と第２の参照群１４に分類されている各画像データより特徴ベクトルを抽出する。抽出された各特徴ベクトル間の距離を距離算出手段１８により算出する。距離算出手段１８は、Ｎ個の距離定義に基づきぞれぞれの距離を算出する第１距離算出部１８-1、第２距離算出部１８-2、・・・第Ｎ距離算出部１８-Nを含む。距離算出手段１８で算出された複数の距離は、それぞれ評価手段２０によりその距離定義の妥当性が評価される。評価手段２０は、Ｎ個の距離算出部１８-1〜１８-Nに１対１に対応してＮ個の評価部、すなわち第１評価部２０-1、第２評価部２０-2、・・・第Ｎ評価部２０-Nを含んでいる。各評価部２０-1〜２０-Nは、対応する距離算出部１８-1〜１８-Nが各々算出した距離に基づき、二つの参照群１２，１４に属する画像データを再度分類する。そして、この再分類が、好ましく分類された参照群の分類と、どの程度類似するかを評価する。各評価部２０-1〜２０-Nの評価結果に基づき、距離定義選択手段２２は、適切な距離定義を選択する。距離定義の選択は、その距離定義によって分類された画像データが、参照群１２，１４と同じように、またはどの程度類似して分類されているかに基づき行うことができる。
【００１７】
図３は、本実施形態の機能構成図であって、特に、好ましい分類を行うために選択され定義された距離に基づく未分類の画像データの分類にかかる構成を示している。未分類画像データ２４のそれぞれの特徴ベクトルが、特徴ベクトル抽出手段２６によって抽出される。さらに、距離算出手段２８によって、抽出された各特徴ベクトル間の距離が算出される。このときの距離算出に用いられる距離定義は、前述の距離定義選択手段２２により選択されたものである。分類手段３０により、算出された各ベクトル間距離に基づき未分類の各画像データ２４の分類を行い、分類された画像データを、データベース３２に登録する。
【００１８】
図４は、本実施形態の、画像データの距離定義の選択に係る処理フローチャートである。まず、すでに好ましい分類が行われている画像データを入力し（Ｓ１００）、特徴ベクトルの抽出を行い（Ｓ１０２）、データベース３２に登録する。このときの好ましい分類とは、例えばユーザによる分類およびその評価を事前に行ったものであり、客観評価の他、主観評価による分類も含まれる。また、特徴ベクトルとは、画像データにより定量的に取り出すことのできる値からなるｎ次元ベクトルである。例えば、画像データに含まれる色情報やエッジ情報など、画像処理によって得られるｎ個の特徴量からなる。上記一連の処理を分析対象となる画像データがなくなるまで繰り返す（Ｓ１０４）。
【００１９】
次に、所定の複数の距離定義から一つを選び（Ｓ１０６）、ステップＳ１００で入力された画像データの分類された群（参照群）間で、特徴ベクトル間距離を求める（Ｓ１０８）。ここでいう距離定義とは、ベクトル間の距離（類似度または非類似度としてもよい）を求める上での定義のことで、例えば、チェビシェフ距離、特徴ベクトルの分散、共分散を考慮に入れたマハラノビス距離あるいはユークリッド距離などを用いる。独自に定義したもの、例えば各分類の密度を考慮に入れた距離や画像データの登録者、登録順序、登録間隔および登録日時などの少なくとも一つに対応した出現確率を反映させた距離などを用いてもよい。出現確率とは、例えば、ある画像データに、登録日時が近い画像データが来たときに、後者の画像データが、前者の画像データに同一の群に分類される確率を示す。登録者が同じであれば、同傾向の画像データが登録される傾向が強いと考えられる。また、登録順序や登録日時が近い画像であれば、一つのイベント（例えば運動会）の各シーンの映像である可能性が高いと考えられる。
【００２０】
計算された距離に基づき、特徴ベクトルの分類を行う（Ｓ１１０）。ここで分類とは、特徴ベクトル間の距離を基に、その距離の近いものをまとめていくというものである。一般にクラスタ分析などといわれる方法を用いる。クラスタ解析には階層的手法と非階層的手法があるが、ここではどちらを用いることもできる。全特徴ベクトルの分類が終了したならば（Ｓ１１２）、事前の分類、すなわち参照群を分けた分類との比較を行う（Ｓ１１４）。上記ステップＳ１０６〜Ｓ１１４を、評価対象となる距離定義がなくなるまで繰り返し（Ｓ１１６）、全ての距離定義について評価が終了したら、最終的に距離定義の選択を行う（Ｓ１１８）。この距離定義の選択は、各距離定義に対してなされた評価の結果を比較し、事前に好ましく分類された参照群に、より類似する分類がなされた距離定義を選択する。
【００２１】
図５は、本実施形態の、未分類画像データの分類に係る処理を示すフローチャートである。まず、これから分類する（未分類の）画像データを取り込み（Ｓ１２０）、これらの各画像データの特徴ベクトルを抽出し、これをデータベース３２に登録する（Ｓ１２２）。この特徴ベクトルは、図４に示すチャートのステップＳ１０２で抽出したものと同等のベクトルであり、画像データより定量的に取り出すことができる値からなるｎ次元ベクトルである。特徴ベクトルの抽出を分類対象となる画像データがなくなるまで繰り返す（Ｓ１２４）。
【００２２】
次に、図４のチャートのステップＳ１１８で選択された距離定義を呼び出し（Ｓ１２６）、この距離計算方法に基づき未分類画像データの各特徴ベクトル間の距離の算出を行う（Ｓ１２８）。算出された距離に基づき、図４のチャートのステップＳ１１０と同様の分類手法により、特徴ベクトルの、すなわち未分類画像データの分類を行う（Ｓ１３０）。分類が終了した（Ｓ１３２）ならば、分類結果をデータベース３２に登録する（Ｓ１３８）。
【００２３】
以上のように、本実施形態においては、すでに分類されている画像データと同様または類似の分類となるような距離定義に基づき新たな画像データの分類を行う。したがって、新たな画像データを分類する際に、すでに分類されている画像データを参照して、この分類と同等または類似の分類を自動的に行うことが可能となる。
【００２４】
図６は、本発明の関連技術の機能構成図であって、特に画像データを分類するための特徴ベクトルの抽出にかかる構成を示している。画像データ５０は、二つの参照群５２，５４に、あらかじめ分類されて、記憶されている。この分類は、ユーザにとって好ましいものとなっている。第１の参照群５２と第２の参照群５４に分類されている各画像データは、ｎ次元の特徴ベクトルを有している。この特徴ベクトルの各空間軸が表す特徴量を一つずつ特徴量抽出手段５８により抽出する。そして、その特徴量の一つずつに対して特徴量評価手段６０にてその特徴量が画像データ５０を二つの参照群５２，５４に分けるに当たってどの程度寄与しているかを評価する。具体的には、特徴量抽出手段５８は、前記ｎ次元特徴ベクトルを構成する第１〜第ｎの空間軸のうち、それぞれ一つを抽出する第１〜第ｎ特徴量抽出部５８-1〜５８-nを含む。特徴量評価手段６０は、前述の第１〜第ｎ特徴量抽出部５８-1〜５８-nに１対１に対応して第１〜第ｎ特徴量評価部６０-1〜６０-nを含む。各特徴量評価部６０-1〜６０-nは、それぞれ対応する各特徴量抽出部５８-1〜５８-nで抽出された特徴量に対して参照群５２，５４の分類の寄与度を評価する。具体的には、抽出された一つの特徴量（空間軸）を除いた（ｎ−１）次元の特徴ベクトルを形成し、この特徴ベクトルに基づき参照群５２，５４に属する画像データ５０を再分類する。この再分類が、二つの参照群５２，５４と同様または類似の分類となるのであれば、除いた特徴量は参照群５２，５４の分類への寄与が低いことが判断できる。逆に、再分類において、大きく分類が異なるようになれば、除いた特徴量が参照群５２，５４の分類への寄与が高いことが判断できる。各空間軸に対して評価を行い、特徴ベクトル構成手段６２により寄与度の高いｍ個（ｎ＞ｍ）の空間軸によりｍ次特徴ベクトルを構成する。
【００２５】
図７は、本発明の関連技術の機能構成図であって、特に、好ましい分類を行うために抽出されたｍ次元特徴ベクトルに基づく未分類の画像データの分類にかかる構成を示している。未分類画像データ６４のそれぞれのｍ個の特徴量が特徴量抽出手段６６により抽出される。特徴量抽出手段は、ｍ個の第１〜第ｍ特徴量抽出部６６-1〜６６-mにより構成され、一つの特徴量に対し、一つの抽出部が対応している。抽出された特徴量より、特徴ベクトル生成手段６８にて、ｍ次元特徴ベクトルを生成する。このｍ次元特徴ベクトル空間において、分類手段７０にて分類を行い、分類結果をデータベース７２に登録する。
【００２６】
図８は、本発明の関連技術の、分類のための特徴量の抽出に係る処理フローチャートである。まず、すでに好ましい分類が行われている画像データを入力し（Ｓ２００）、各画像データごとにｎ個の特徴量を抽出し、データベース７２に登録する（Ｓ２０２）。ここで、特徴量とは、画像データより定量的に取り出すことのできる値であり、例えば、画像データに含まれる色情報やエッジ情報など、画像処理によって得られる値である。この処理を分析対象となる画像データがなくなるまで繰り返す（Ｓ２０４）。
【００２７】
次に、ｎ次元特徴ベクトル空間から、一つの特徴量（空間軸）を除いた特徴量で構成される（ｎ−１）次元特徴ベクトル空間を取り出し（Ｓ２０６）、参照群５２，５４間の距離を測る（Ｓ２０８）。この際（ｎ−１）次元特徴ベクトル空間における参照群の重心を求め、この重心間の距離を測り、参照群間の距離とする。こうして得られた（ｎ−１）次元特徴ベクトル空間における参照群間の距離と、ｎ次元特徴ベクトル空間における参照群間の距離からウイルクスのΛ統計量を求めて比較し、取り除かれた特徴量が分類に寄与しているかを示す寄与度を求める（Ｓ２１０）。この処理をｎ個の特徴量全てに対して行う（Ｓ２１２）。上記の寄与度に基づき、実際の分類に用いるｍ次元特徴ベクトル空間を選択する。この際、統計的に有意と認められる特徴量を選択し、あるいは所定の条件で抽出した寄与度の上位ｉ個または上位ｊ％の特徴量を選択する（Ｓ２１４）。
【００２８】
図９は、本発明の関連技術の、未分類画像データの分類に係る処理を示すフローチャートである。まず、これから分類する（未分類の）画像データを取り込み（Ｓ２１６）、これらの各画像データの特徴量を抽出し、これをデータベース７２に登録する（Ｓ２１８）。この特徴量は、図８に示すチャートのステップＳ２０２で抽出したものと同等のベクトルであり、画像データより定量的に取り出すことができる値からなるｎ個の特徴量である。特徴ベクトルの抽出を分類対象となる画像データがなくなるまで繰り返す（Ｓ２２０）。
【００２９】
次に、図８のチャートのステップＳ２１４で決定した特徴ベクトル空間を用い（Ｓ２２２）、未分類画像データの各特徴ベクトル間の距離の算出を行う（Ｓ２２４）。算出された距離に基づき、特徴ベクトルの分類を行う（Ｓ２２６）。この分類は、特徴ベクトル間の距離を基に、その距離の近いものをまとめていくというものである。一般にクラスタ分析などといわれる方法を用いる。クラスタ解析には、階層的手法と非階層的手法があるが、どちらを用いてもよい。特徴ベクトルの分類が終了したら（Ｓ２２８）、分類結果をデータベースに登録し（Ｓ２３０）、終了する。
【００３０】
上記関連技術によれば、すでに分類されている画像データの分類に関し、寄与度の高い特徴量により構成される特徴量ベクトルにより、新たな画像データの分類を行う。よって、新たな画像データの分類をする際に、すでに分類されている画像データの分類と同様または類似する分類を自動的に行うことができる。また、寄与度の高い特徴量を選び出していることにより、分類に用いる特徴量ベクトルの次元を低くすることができ、処理負担が軽減される。
【図面の簡単な説明】
【図１】ネットワーク環境の一例を示す図である。
【図２】本実施形態の機能を示すブロック図であり、特に分類に用いる距離定義の決定に係る構成を示す図である。
【図３】本実施形態の機能を示すブロック図であり、特に未分類の画像データの分類に係る構成を示す図である。
【図４】本実施形態の処理を示すフローチャートであり、特に分類に用いる距離定義の決定に係るチャートである。
【図５】本実施形態の処理を示すフローチャートであり、特に未分類の画像データの分類に係るチャートである。
【図６】の機能を示すブロック図であり、特に分類に用いる特徴量の抽出に係る構成を示す図である。
【図７】本発明の関連技術の機能を示すブロック図であり、特に未分類の画像データの分類に係る構成を示す図である。
【図８】本発明の関連技術の処理を示すフローチャートであり、特に分類に用いる特徴量の抽出に係る構成を示す図である。
【図９】本発明の関連技術の処理を示すフローチャートであり、特に未分類の画像データの分類に係るチャートである。
【符号の説明】
１０，５０画像データ（分類済み）、１２，５２第１参照群、１４，５４第２参照群、１６特徴ベクトル抽出手段、１８距離算出手段、２０評価手段、２２距離定義選択手段、２４，６４画像データ（未分類）、２６特徴ベクトル抽出手段、２８距離算出手段、３０分類手段、３２，７２データベース、５８特徴量抽出手段、６０特徴量評価手段、６２特徴ベクトル構成手段、６６特徴量抽出手段、６８特徴ベクトル生成手段、７０分類手段。[0001]
BACKGROUND OF THE INVENTION
The present invention relates to image data classification.
[0002]
[Prior art]
Conventionally, in order to classify and search registered image data, it has been common to add code information such as keywords together with the image data. This code information has been input from a keyboard or the like when registering image data or classifying and organizing already registered image data. However, in such a method, when registering image data to be classified or searched, there is a problem that code information must be manually input, and a large amount of work is required.
[0003]
In view of this, a method has been proposed in which a feature vector is constructed by extracting various feature amounts from image data, and this is used for classification and retrieval as information replacing code information. The similarity between image data is defined according to the distance between feature vectors extracted from the image data. However, the similarity depends on the distribution of feature vectors and the definition of vector distance calculation. It was pointed out that the desired similarity may not be required. In order to eliminate the dependency due to the feature vector distribution, the technique described in Japanese Patent Laid-Open No. 9-265529 eliminates the influence of the distribution by correcting the classification result based on the feature vector distribution. .
[0004]
In addition, when performing classification based on feature vectors, many types of feature quantities are extracted in order to capture image data from various aspects, and the feature vectors are high-dimensional. When performing classification and retrieval using feature vectors, there is a problem that the processing time of vector matching increases if this is high-dimensional.
[0005]
Therefore, in many cases, by performing principal component analysis on the accumulated feature vectors, the orthogonality of the feature vectors is guaranteed, and by eliminating the spatial axis with small variance, the number of feature vector dimensions can be reduced. Yes. In Japanese Patent Laid-Open No. 11-219347, the accumulated feature vectors are described hierarchically, and when searching or classifying, the collation of the unlikely hierarchy is omitted, thereby speeding up the processing. I am trying.
[0006]
[Problems to be solved by the invention]
However, even with the technique described in the above-mentioned publication, it has not been possible to guarantee whether the result classified or searched by the feature vector is valid. That is, there are cases where the result classified by the feature vector is not the classification desired by the user.
[0007]
Also, if the number of stored feature vectors has a large number of spatial axes, or if the distribution of stored feature vectors is large, the number of feature vectors can be reduced sufficiently even if principal component analysis is performed. There was a case that could not be.
[0008]
The present invention has been made in consideration of such problems, and it is an object of the present invention to make classification based on feature vectors extracted from image data close to classification desired by a user and to reduce processing load. To do.
[0009]
[Means for Solving the Problems]
In order to solve the above-described problem, in the image data classification device according to the present invention, first, based on the feature vectors of image data previously classified into a plurality of reference groups, individual images are defined by two or more types of distance definitions. A vector section distance between data is calculated, and image data belonging to the reference group is classified based on the calculated distance between vectors. Then, the classification result is compared with the classification method of the reference group to obtain a distance definition that provides a classification result similar to the classification of the reference group. This distance definition is applied to classify unclassified image data.
[0010]
Further, the calculation of the distance between vectors by the distance calculation means can be performed based on at least one of variance, covariance and density of the feature vector.
[0011]
Further, the calculation of the distance between vectors by the distance calculation means can be performed based on the appearance probability of the feature vector with respect to a predetermined spatial axis.
[0012]
Further, the feature vector of the image data includes a spatial axis related to registration of the image data, and the calculation of the inter-vector distance by the distance calculation means is performed based on the appearance probability corresponding to the spatial axis related to the registration of the image. Can do.
[0013]
Another image data classification device calculates the inter-vector distance between the reference groups based on the n-dimensional feature vectors of the image data previously classified into a plurality of reference groups. In addition, the degree of contribution related to the inter-vector distance of each space axis of the n-dimensional feature vector is calculated, m space axes having a high degree of contribution are extracted, and m-dimensional feature vector spaces are extracted. . Then, the unclassified image data is classified in the m-dimensional feature vector space based on the inter-vector distance.
[0014]
In calculating the contribution, a distance between reference groups is calculated based on a feature vector excluding at least one spatial axis of the n-dimensional feature vector, and a comparison between the distance and a distance based on the n-dimensional feature vector is performed. The degree of contribution of the excluded space axis can be calculated.
[0015]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention (hereinafter referred to as embodiments) will be described with reference to the drawings. FIG. 1 is a diagram showing a schematic configuration of a network to which the image data classification device of this embodiment is applied. The network includes a client 1 that is a computer functioning as an image data analysis apparatus according to the present embodiment by a predetermined program installed, a scanner 2 that captures an image on a print medium such as paper, a database server 3 that provides a database, and a network Is connected to a web server 4 for managing the above. A router 6 is also provided for connecting a network with an external terminal 5 such as a notebook computer or a portable information terminal. In addition to the scanner, the image data can be read from the network, from the external terminal 5, and from the client 1 external storage medium drive.
[0016]
FIG. 2 is a functional configuration diagram of the present embodiment, and particularly shows a configuration related to selection of a distance definition for classifying image data. The image data 10 is classified and stored in advance in two reference groups 12 and 14. This classification is preferable for the user. A feature vector extraction unit 16 extracts feature vectors from the image data classified into the first reference group 12 and the second reference group 14. The distance between the extracted feature vectors is calculated by the distance calculation means 18. The distance calculating means 18 includes a first distance calculating unit 18-1, a second distance calculating unit 18-2,..., An Nth distance calculating unit 18− that calculate each distance based on N distance definitions. Contains N. Each of the plurality of distances calculated by the distance calculation means 18 is evaluated by the evaluation means 20 for the validity of the distance definition. The evaluation means 20 has N evaluation units corresponding to the N distance calculation units 18-1 to 18-N on a one-to-one basis, that is, the first evaluation unit 20-1, the second evaluation unit 20-2,. ..Nth evaluation unit 20-N is included. Each of the evaluation units 20-1 to 20-N classifies image data belonging to the two reference groups 12 and 14 again based on the distances calculated by the corresponding distance calculation units 18-1 to 18-N. Then, the degree of similarity between the reclassification and the classification of the preferably classified reference group is evaluated. Based on the evaluation results of the evaluation units 20-1 to 20-N, the distance definition selecting unit 22 selects an appropriate distance definition. The selection of the distance definition can be performed based on how the image data classified by the distance definition is classified in the same manner as the reference groups 12 and 14 or how similar.
[0017]
FIG. 3 is a functional configuration diagram of the present embodiment, and particularly shows a configuration related to classification of unclassified image data based on a distance selected and defined for performing preferable classification. Each feature vector of the unclassified image data 24 is extracted by the feature vector extraction means 26. Furthermore, the distance calculation means 28 calculates the distance between the extracted feature vectors. The distance definition used for the distance calculation at this time is selected by the distance definition selection means 22 described above. The classifying unit 30 classifies the unclassified image data 24 based on the calculated distance between the vectors, and registers the classified image data in the database 32.
[0018]
FIG. 4 is a process flowchart relating to selection of a distance definition of image data according to the present embodiment. First, image data that has already been classified preferably is input (S100), feature vectors are extracted (S102), and registered in the database 32. The preferable classification at this time is, for example, classification performed by the user and its evaluation in advance, and includes classification based on subjective evaluation in addition to objective evaluation. A feature vector is an n-dimensional vector composed of values that can be quantitatively extracted from image data. For example, it consists of n feature quantities obtained by image processing, such as color information and edge information included in image data. The above series of processing is repeated until there is no image data to be analyzed (S104).
[0019]
Next, one is selected from a plurality of predetermined distance definitions (S106), and a distance between feature vectors is obtained between the classified groups (reference groups) of the image data input in step S100 (S108). The distance definition here is a definition for obtaining a distance between vectors (which may be a similarity or dissimilarity). For example, the Chebyshev distance, feature vector variance, and covariance are taken into account. Use Mahalanobis distance or Euclidean distance. For example, a distance that takes into account the density of each classification, a distance that reflects the probability of appearance corresponding to at least one of the registrant of image data, registration order, registration interval, registration date, etc. May be. The appearance probability indicates, for example, the probability that the image data of the latter is classified into the same group as the former image data when the image data having a similar registration date and time comes to certain image data. If the registrants are the same, it is considered that image data having the same tendency is strongly registered. In addition, if the registration order and the registration date and time are close to each other, it is highly possible that the images are scene images of one event (for example, athletic meet).
[0020]
Based on the calculated distance, the feature vectors are classified (S110). Here, the classification is to group together those having a short distance based on the distance between feature vectors. In general, a method called cluster analysis is used. There are a hierarchical method and a non-hierarchical method for cluster analysis, but either can be used here. When the classification of all feature vectors is completed (S112), the comparison with the prior classification, that is, the classification into which the reference group is divided is performed (S114). The above steps S106 to S114 are repeated until there is no distance definition to be evaluated (S116). When the evaluation is completed for all the distance definitions, the distance definition is finally selected (S118). The selection of the distance definition compares the results of evaluations made for each distance definition, and selects a distance definition that has been classified more similar to a reference group that has been preferably classified in advance.
[0021]
FIG. 5 is a flowchart showing processing relating to classification of unclassified image data according to the present embodiment. First, image data to be classified (unclassified) is fetched (S120), feature vectors of these image data are extracted and registered in the database 32 (S122). This feature vector is a vector equivalent to that extracted in step S102 of the chart shown in FIG. 4, and is an n-dimensional vector composed of values that can be extracted quantitatively from the image data. The feature vector extraction is repeated until there is no image data to be classified (S124).
[0022]
Next, the distance definition selected in step S118 of the chart of FIG. 4 is called (S126), and the distance between each feature vector of unclassified image data is calculated based on this distance calculation method (S128). Based on the calculated distance, the feature vector, that is, the unclassified image data is classified by the same classification method as in step S110 of the chart of FIG. 4 (S130). If the classification is completed (S132), the classification result is registered in the database 32 (S138).
[0023]
As described above, in this embodiment, new image data is classified based on a distance definition that is similar to or similar to already classified image data. Therefore, when classifying new image data, it is possible to automatically perform classification equivalent to or similar to this classification with reference to already classified image data.
[0024]
FIG. 6 is a functional configuration diagram of the related art of the present invention , and particularly shows a configuration related to extraction of feature vectors for classifying image data. The image data 50 is classified and stored in advance in two reference groups 52 and 54. This classification is preferable for the user. Each image data classified into the first reference group 52 and the second reference group 54 has an n-dimensional feature vector. The feature quantity represented by each spatial axis of the feature vector is extracted one by one by the feature quantity extraction means 58. Then, the feature amount evaluation means 60 evaluates how much the feature amount contributes to dividing the image data 50 into the two reference groups 52 and 54 for each feature amount. Specifically, the feature quantity extraction unit 58 extracts first to n-th feature quantity extraction units 58-1 to 58-1 for extracting one of the first to n-th spatial axes constituting the n-dimensional feature vector. Includes 58-n. The feature quantity evaluation means 60 has the first to nth feature quantity evaluation units 60-1 to 60-n corresponding to the first to nth feature quantity extraction units 58-1 to 58-n on a one-to-one basis. Including. Each of the feature quantity evaluation units 60-1 to 60-n evaluates the contribution of the classification of the reference groups 52 and 54 to the feature quantities extracted by the corresponding feature quantity extraction units 58-1 to 58-n. To do. Specifically, an (n−1) -dimensional feature vector excluding one extracted feature quantity (spatial axis) is formed, and the image data 50 belonging to the reference groups 52 and 54 is reclassified based on this feature vector. To do. If this reclassification is similar to or similar to the two reference groups 52 and 54, it can be determined that the excluded feature amount has a low contribution to the classification of the reference groups 52 and 54. On the other hand, if the classification is greatly different in the reclassification, it can be determined that the excluded feature amount has a high contribution to the classification of the reference groups 52 and 54. Evaluation is performed for each spatial axis, and an m-th order feature vector is configured by m (n> m) spatial axes having a high degree of contribution by the feature vector configuration unit 62.
[0025]
FIG. 7 is a functional configuration diagram of the related art of the present invention , and particularly shows a configuration related to classification of unclassified image data based on an m-dimensional feature vector extracted for performing preferable classification. The m feature amounts of each of the unclassified image data 64 are extracted by the feature amount extraction unit 66. The feature quantity extraction means is composed of m first to m-th feature quantity extraction units 66-1 to 66-m, and one extraction unit corresponds to one feature quantity. An m-dimensional feature vector is generated by the feature vector generation means 68 from the extracted feature amount. In this m-dimensional feature vector space, classification is performed by the classification means 70, and the classification result is registered in the database 72.
[0026]
FIG. 8 is a flowchart of processing related to feature amount extraction for classification according to the related art of the present invention . First, image data that has been preferably classified is input (S200), n feature quantities are extracted for each image data, and registered in the database 72 (S202). Here, the feature amount is a value that can be quantitatively extracted from the image data, for example, a value obtained by image processing such as color information and edge information included in the image data. This process is repeated until there is no image data to be analyzed (S204).
[0027]
Next, an (n−1) -dimensional feature vector space composed of features obtained by removing one feature (space axis) is extracted from the n-dimensional feature vector space (S206), and the distance between the reference groups 52 and 54 is extracted. Is measured (S208). At this time, the centroids of the reference groups in the (n−1) -dimensional feature vector space are obtained, and the distance between the centroids is measured to obtain the distance between the reference groups. The Λ statistic of Wilkes is obtained from the distance between the reference groups in the (n−1) -dimensional feature vector space thus obtained and the distance between the reference groups in the n-dimensional feature vector space, and is compared. A contribution indicating whether it contributes to the classification is obtained (S210). This process is performed for all n feature values (S212). Based on the above-mentioned contribution degree, an m-dimensional feature vector space used for actual classification is selected. At this time, a feature amount that is statistically significant is selected, or the top i or top j% feature amount of the degree of contribution extracted under a predetermined condition is selected (S214).
[0028]
FIG. 9 is a flowchart showing processing relating to classification of unclassified image data according to the related art of the present invention . First, the image data to be classified (unclassified) is fetched (S216), and the feature amount of each image data is extracted and registered in the database 72 (S218). This feature quantity is a vector equivalent to that extracted in step S202 of the chart shown in FIG. 8, and is n feature quantities having values that can be extracted quantitatively from the image data. The feature vector extraction is repeated until there is no image data to be classified (S220).
[0029]
Next, using the feature vector space determined in step S214 of the chart of FIG. 8 (S222), the distance between each feature vector of unclassified image data is calculated (S224). Based on the calculated distance, the feature vectors are classified (S226). This classification is based on the distances between feature vectors, and those with close distances are collected. In general, a method called cluster analysis is used. Cluster analysis includes a hierarchical method and a non-hierarchical method, either of which may be used. When the feature vector classification is completed (S228), the classification result is registered in the database (S230), and the process ends.
[0030]
According to the related technology , new image data is classified based on feature quantity vectors composed of feature quantities having a high contribution degree with respect to classification of already classified image data. Therefore, when new image data is classified, classification similar to or similar to the classification of already classified image data can be automatically performed. Further, by selecting feature amounts having a high contribution, the dimension of the feature vector used for classification can be reduced, and the processing burden is reduced.
[Brief description of the drawings]
FIG. 1 is a diagram illustrating an example of a network environment.
FIG. 2 is a block diagram illustrating functions of the present embodiment, and particularly a configuration related to determination of a distance definition used for classification.
FIG. 3 is a block diagram illustrating functions of the present embodiment, and particularly a configuration related to classification of unclassified image data.
FIG. 4 is a flowchart showing processing of the present embodiment, and is a chart relating to determination of a distance definition used for classification in particular.
FIG. 5 is a flowchart showing processing of the present embodiment, particularly a chart relating to classification of unclassified image data.
FIG. 6 is a block diagram showing the function of, and is a diagram showing a configuration related to extraction of feature amounts used for classification in particular.
FIG. 7 is a block diagram showing functions of a related technique of the present invention , and particularly a diagram showing a configuration relating to classification of unclassified image data.
FIG. 8 is a flowchart showing processing according to a related technique of the present invention , and particularly shows a configuration related to extraction of feature amounts used for classification.
FIG. 9 is a flowchart showing processing according to the related art of the present invention , particularly a chart relating to classification of unclassified image data.
[Explanation of symbols]
10, 50 Image data (classified), 12, 52 First reference group, 14, 54 Second reference group, 16 Feature vector extraction means, 18 Distance calculation means, 20 Evaluation means, 22 Distance definition selection means, 24, 64 Image data (unclassified), 26 feature vector extraction means, 28 distance calculation means, 30 classification means, 32, 72 database, 58 feature quantity extraction means, 60 feature quantity evaluation means, 62 feature vector construction means, 66 feature quantity extraction means 68 Feature vector generation means, 70 Classification means.

Claims

An image data classification device that classifies the image data based on feature vectors extracted from the image data,
Distance calculating means for calculating a distance between vectors of individual image data based on two or more types of distance definitions based on feature vectors of image data classified in advance into a plurality of reference groups;
The image data belonging to the reference group is classified based on the calculated inter-vector distance, the classification result is compared with the reference group, the distance definition for calculating the inter-vector distance is evaluated, and the reference group A distance definition selection means for selecting a distance definition having a classification similar to the classification;
Applying the selected distance definition to unclassified image data, classifying means for classifying the unclassified image data,
An image data classification device having

2. The image data classification device according to claim 1, wherein the calculation of the inter-vector distance by the distance calculation means is performed based on at least one of variance, covariance and density of the feature vector. .

The image data classification apparatus according to claim 1, wherein the calculation of the distance between vectors by the distance calculation unit is performed based on an appearance probability of the feature vector with respect to a predetermined spatial axis.

2. The image data classification apparatus according to claim 1, wherein the feature vector of the image data includes a spatial axis related to registration of the image data, and the calculation of the inter-vector distance by the distance calculation means relates to registration of the image. An image data classification device, which is performed based on an appearance probability corresponding to a spatial axis.