JP2002140332A

JP2002140332A - Feature quantity importance calculation method, and keyword image feature quantity expression database generation and image database retrieval using the same

Info

Publication number: JP2002140332A
Application number: JP2000336374A
Authority: JP
Inventors: Noburo Taniguchi; 展郎谷口
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2000-11-02
Filing date: 2000-11-02
Publication date: 2002-05-17

Abstract

PROBLEM TO BE SOLVED: To enable a high-precision retrieval even from an image database which is not given proper keywords. SOLUTION: Provided are an image database 120 in which keywords and feature quantity sets are related to each other and stored by every image, and a keyword image feature quantity expression database 130 in which image feature quantity expressions (feature quantity set and importance of each feature quantity) are stored by the keywords. An image retrieving device 140 performs retrieval from the keywords image feature quantity expression database 130 according to an inputted keyword to obtain the image feature quantity expression of the keyword, performs similar retrieval from the image database 120 according to the image feature quantity expression to obtain an image having large similarity, and further perform retrieval from the image database 120 according to the inputted keyword to obtain an image corresponding to the keyword, so that a final retrieval result image is determined among those obtained images and outputted.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明、画像処理システム、
画像検索システム等において画像間の類似度の算出に用
いられる画像特徴量（特徴ベクトル）の重み付け技術、
それを用いたキーワード画像特徴量表現データベース作
成技術および画像データベース検索技術等に関する。TECHNICAL FIELD The present invention relates to an image processing system,
Image feature amount (feature vector) weighting technique used for calculating the similarity between images in an image search system or the like;
The present invention relates to a keyword image feature amount expression database creation technology and an image database search technology using the same.

【０００２】[0002]

【従来の技術】画像検索の従来技術は、検索したい画像
のサンプル画像を入力し、それと類似した画像を画像デ
ータベースから検索する方式及び検索したい画像の特徴
を表すキーワードを入力し、それを表す画像を画像デー
タベースから検索する方式に大別される。また、類似検
索で画像間の類似度の算出に用いられる画像特徴量の重
要度（重み係数）の算出法の従来技術は、分散あるいは
標準偏差の逆数を利用する方法及び、面積の逆数を利用
する方法に大別される。以下、これらについて概要を説
明する。2. Description of the Related Art In a conventional image search technique, a sample image of an image to be searched is input, a method of searching for an image similar to the sample image from an image database, and a keyword representing the feature of the image to be searched are input, and an image representing the image is input. Is roughly divided into a method of searching from an image database. The prior art of calculating the importance (weight coefficient) of an image feature used for calculating the similarity between images in the similarity search includes a method using a reciprocal of a variance or a standard deviation and a method using a reciprocal of an area. The method is roughly divided. Hereinafter, these are outlined.

【０００３】図１は従来のサンプル画像による画像検索
システムの概略構成を示す。図１において、画像検索装
置１０は特徴量抽出部１１、類似度算出部１２、画像出
力部１３からなる。画像データベース２０には、図２に
示すように、各画像とその特徴量セットが関連づけられ
て蓄積されている。特徴量セットは複数の特徴量（色、
形、模様など）の組からなり、各特徴量は複数次元の特
徴ベクトルで表わされる。特徴量抽出部１１は、入力さ
れたサンプル画像から特徴量（特徴量セット）を抽出す
る。類似度算出部１２は、抽出されたサンプル画像の特
徴量セットと画像データベース２０に格納された各画像
の特徴量セットとの類似度を、予め定めた特徴量毎の重
要度（重み係数）を加味して計算し、類似度が大きい順
に一定数あるいは一定割合の画像を画像データベース２
０から読み出す。画像出力部１３は、この読み出された
画像データを検索結果として出力する。FIG. 1 shows a schematic configuration of a conventional image retrieval system using sample images. In FIG. 1, the image search device 10 includes a feature amount extraction unit 11, a similarity calculation unit 12, and an image output unit 13. As shown in FIG. 2, each image and its feature set are stored in the image database 20 in association with each other. A feature set consists of multiple features (color,
Shape, pattern, etc.), and each feature amount is represented by a multi-dimensional feature vector. The feature amount extraction unit 11 extracts a feature amount (feature amount set) from the input sample image. The similarity calculating unit 12 calculates the similarity between the extracted feature amount set of the sample image and the feature amount set of each image stored in the image database 20, and calculates a predetermined importance (weight coefficient) for each feature amount. The calculation is performed in consideration of this, and a certain number or a certain percentage of images are stored in the image database 2 in descending order of similarity.
Read from 0. The image output unit 13 outputs the read image data as a search result.

【０００４】図３は従来のキーワードによる画像検索シ
ステムの概略構成を示す。図３において、画像検索装置
３０はキーワード検索部３１、画像出力部３２からな
る。画像データベース４０には、図４に示すように、各
画像とその特徴を表わす１以上のキーワードが関連づけ
られて蓄積されている。利用者が検索したい画像の特徴
を表すキーワード（リンゴ、花瓶など）を入力すると、
キーワード検索部３１は、入力されたキーワードと画像
データベース４０に格納された各画像のキーワードを照
合し、入力キーワードと一致するキーワードを持つ画像
を画像データベース４０から読み出す昼画像出力部３２
は、この読み出された画像データを検索結果として出力
する。FIG. 3 shows a schematic configuration of a conventional image retrieval system using a keyword. 3, the image search device 30 includes a keyword search unit 31 and an image output unit 32. As shown in FIG. 4, each image and one or more keywords representing its characteristics are stored in the image database 40 in association with each other. When a user enters a keyword (apple, vase, etc.) that describes the characteristics of the image he or she wants to search,
The keyword search unit 31 compares the input keyword with the keyword of each image stored in the image database 40 and reads out an image having a keyword that matches the input keyword from the image database 40.
Outputs the read image data as a search result.

【０００５】次に、類似検索で画像間の類似度の算出に
用いられる画像特徴量の重要度の従来の算出方法につい
て説明する。Next, a description will be given of a conventional method of calculating the importance of an image feature amount used for calculating the similarity between images in the similarity search.

【０００６】一方の画像ｉの特徴量（Ａ，Ｂ，…）の組
をＶｉ＝｛ｖｉＡ，ｖｉＢ，…｝とし、他方の画像ｊの
特徴量｛Ａ，Ｂ，…｝の組をＶｊ＝｛ｖｊＡ，ｖｊＢ，
…｝とすると、ＶｉとＶｊの類似度Ｓ（Ｖｉ，Ｖｊ）
は、Ｓ（Ｖｉ，Ｖｊ）＝ｗＡ×ｄ（ｖｉＡ，ｖｊＡ）＋
ｗＢ×ｄ（ｖｉＢ，ｖｊＢ）＋…として求まる。ここ
で、ｗＡ，ｗＢ，…は特徴量毎の重要度（重み係数）、
ｄ（＊，＊）は特徴量（特徴ベクトル）毎の類似度であ
る。A set of feature quantities (A, B,...) Of one image i is defined as Vi = {viA, viB,...}, And a set of feature quantities {A, B,. ｛VjA, vjB,
..., the similarity S (Vi, Vj) between Vi and Vj
Is S (Vi, Vj) = wA × d (viA, vjA) +
wB × d (viB, vjB) +... Here, wA, wB,... Represent importance (weight coefficient) for each feature amount,
d (*, *) is the similarity for each feature amount (feature vector).

【０００７】特徴量毎の重要度（重み係数）を求める従
来技術の第１の方法は、特徴量毎のベクトル群（特徴量
ベクトル群）の分散（あるいは標準偏差）の逆数を利用
するものである。一般にベクトル群Ｖ＝｛ｖ₁，…ｖ_n｝
が与えられたとき、その分散ｓおよび標準偏差ｄは、Ｖ
の重心ベクトルをＶｇとして、ｓ＝（｜ｖ₁ｖ_g｜²＋…＋｜ｖ_nｖ_g｜²）／ｎｄ＝√ｓで求まる。ｓあるいはｄが小さければ小さいほど、ベク
トル群Ｖ内の各ベクトルは空間中のある特定の領域に集
中して存在し、逆に大きければ大きいほどばらばらに散
らばって存在しているといえる。図５は、この様子を示
したもので、丸印がベクトル群内の各ベクトル、バツ印
がベクトル群の重心である。A first prior art method for obtaining the importance (weight coefficient) for each feature value utilizes the reciprocal of the variance (or standard deviation) of a vector group (feature value vector group) for each feature value. is there. Generally vector set _{V = {v 1, ... v} n}
, The variance s and standard deviation d are given by V
S = (| v ₁ v _g | ² +... + | V _n v _g | ² ) / ^nd = √s It can be said that the smaller the s or d is, the more concentrated each vector in the vector group V is in a specific area in the space, and conversely, the larger the s or d is, the more scattered the vector is. FIG. 5 shows this state, in which a circle represents each vector in the vector group, and a cross represents the center of gravity of the vector group.

【０００８】いま、あるキーワードＷに対応する特徴量
毎のベクトル群が、特徴量ＡにおいてＶ_A＝｛ｖ_A1，ｖ
_A2，…｝、特徴量ＢにおいてＶ_B＝｛ｖ_B1，ｖ_B2，…｝
として与えられるものとする。Now, a vector group for each feature amount corresponding to a certain keyword W is V _A = ｛v _A1 , v
_{_{A2, ...}, V B =}} {v B1, v B2 in the feature amount B, ...}
Shall be given as

【０００９】与えられるベクトル群の分散（あるいは標
準偏差）が小さい場合は、その特徴量は、キーワードと
他のキーワードを識別するうえで重要であるということ
を意味する（図５（Ａ）のケース）。逆に分散（あるい
は標準偏差）が大きい場合は、その特徴量は、そのキー
ワードと他のキーワードを識別するうえであまり重要で
ないといえる（図５（Ｂ）のケース）。これに着目し
て、例えば、特徴量Ａのベクトル群Ｖ_Aの分散（あるい
は標準偏差がｓ_A（あるいはｄ_A）、特徴量Ｂのベクトル
群Ｖ_Bの分散（あるいは標準偏差）がｓ_B（あるいは
ｄ_B）であるとき、検索時に各特徴量Ａ，Ｂに与える重
要度（重み係数）ｗ_A、ｗ_Bをそれぞれ１／ｓ_A（あるい
は１／ｄ_A）、１／ｓ_B（あるいは１／ｄ_B）とするもの
である。他の特徴量についても同様である。When the variance (or standard deviation) of a given vector group is small, it means that the feature amount is important in discriminating a keyword from other keywords (the case of FIG. 5A). ). Conversely, when the variance (or standard deviation) is large, it can be said that the feature amount is not so important in discriminating the keyword from other keywords (the case of FIG. 5B). Focusing on this, for example, variance (or standard deviation s _A (or d _A of the vector group V _A of the feature amounts A), dispersion (or standard deviation of the vector group V _B of the feature B) is s _B ( Or d _B ), the importance (weighting factor) w _A , w _B given to each of the feature amounts A, B at the time of retrieval is 1 / s _A (or 1 / d _A ), 1 / s _B (or 1 / D _B ) The same applies to other feature values.

【００１０】特徴量毎の重要度（重み係数）を求める従
来技術の第２の方法は、特徴量毎のベクトル群の面積の
逆数を利用するものである。同様に、ベクトル群Ｖ＝
｛ｖ₁，…ｖ_n｝が与えられたとき、その外接多次元矩形
Ｒあるいは外接多次元円Ｓなどの面積をａとすると、一
般に、ａが小さければ小さいほど、ベクトル群Ｖ内の各
ベクトルは空間中のある特定の領域に集中して存在し、
逆に大きければ大きいほどばらばらに散らばって存在し
ているといえる。図６及び図７は、それぞれ外接多次元
矩形と外接多次元円について、この様子を示したもので
ある。A second prior art method for determining the importance (weight coefficient) for each feature uses the reciprocal of the area of the vector group for each feature. Similarly, the vector group V =
Given {v ₁ ,... V _n }, if the area of the circumscribed multidimensional rectangle R or the circumscribed multidimensional circle S is a, in general, the smaller a is, the more each vector in the vector group V is Are concentrated in a certain area of space,
Conversely, it can be said that the larger the size, the more they are scattered. FIGS. 6 and 7 show this state of the circumscribed multidimensional rectangle and the circumscribed multidimensional circle, respectively.

【００１１】これに着目して、例えば、特徴量Ａのベク
トル群ＶAの面積がａ_A、特徴量Ｂのベクトル群Ｖ_Bの面
積がａ_Bであるとき、検索時に各特徴量Ａ，Ｂに与える
重要度（重み係数）ｗ_A，ｗ_Bを、それぞれ１／ａ_A，１
／ａ_Bとするものである。他の特徴量についても同様で
ある。[0011] Focusing on this, for example, feature amounts A vector group VA of the area a _A a, when the area of the vector group V _B of the feature B is a _B, the feature amounts A when searching the B The given importance (weight coefficient) w _A , w _B is 1 / a _A , 1
/ A _B. The same applies to other feature values.

【００１２】[0012]

【発明が解決しようとする課題】従来技術において、サ
ンプル画像を用いる画像検索は、利用者にとって、サン
プルとなる画像を入力したり指定するのがかなり煩わし
い問題がある。In the prior art, the image retrieval using a sample image has a problem that it is quite troublesome for a user to input or specify a sample image.

【００１３】また、この類似検索時、入力されたサンプ
ル画像と画像データベースに格納された画像間の類似度
の算出に用いられる特徴量毎の重要度（重み係数）とし
て、分散や標準偏差を利用した場合、一般に分散や標準
偏差は、ベクトルが存在する空間の次元数が大きくなる
ほど大きな値をとる傾向があるため、異なる次元数の特
徴量空間における分散（あるいは標準偏差）を単純に同
列に扱うことはできず、次元数の違いを無視して用いる
と、低次元の特徴量を重視しすぎることになり、検索度
の低下を導びく問題がある。同様に、面積を利用した場
合も、次元数の異なる空間における値を単純に同列に扱
うことはできない。更に、外接矩形面積については、あ
る特定の次元軸の辺長がゼロになった場合、たとえ他の
次元軸における散らばりが大きくても、面積自体は必ず
ゼロになるという問題がある。In this similarity search, variance and standard deviation are used as importance (weighting factor) for each feature used for calculating the similarity between the input sample image and the image stored in the image database. In general, the variance and the standard deviation tend to take a larger value as the number of dimensions of the space in which the vector exists becomes larger. Therefore, the variances (or the standard deviations) in the feature amount space having different dimensions are simply handled in the same column. However, if the difference in the number of dimensions is ignored and used, too much emphasis will be placed on the low-dimensional features, which leads to a problem of lowering the search degree. Similarly, when an area is used, values in a space having a different number of dimensions cannot be simply handled in the same column. Further, with respect to the area of the circumscribed rectangle, there is a problem that when the side length of a specific dimension axis becomes zero, the area itself always becomes zero even if the dispersion in other dimension axes is large.

【００１４】従来技術において、キーワードを用いる画
像検索は、必ずしも画像データベース内の全ての画像
に、必要なキーワードのすべてが付与されているとは限
らず、また、キーワードを付与する人間によって、キー
ワードにばらつきが生じ、高精度の検索を期待できない
問題がある。In the prior art, in an image search using a keyword, not all images in an image database are necessarily provided with all necessary keywords. There is a problem that variation occurs and high-precision search cannot be expected.

【００１５】本発明は、上述したような従来技術の問題
点を解決した画像特徴量の重要度算出方法、それを用い
たキーワード画像特徴量表現データベース作成、画像デ
ータベース構築及び画像検索システム等を提供すること
を目的とする。The present invention provides a method for calculating the importance of an image feature, which solves the above-mentioned problems of the prior art, a keyword image feature expression database creation, an image database construction, and an image search system using the method. The purpose is to do.

【００１６】[0016]

【課題を解決するための手段】本発明の特徴量の重要度
算出方法は、特徴量毎にベクトル群が与えられたとき、
その外接多次元矩形面積を基本とするが、辺長がゼロに
なる次元は除外し、全ての辺長がゼロになる場合を除い
て、必ず外接矩形面積ｓがｓ＞０となるようにする。そ
して、この辺長がゼロでない次元の数をｎとして、ｗ＝
１／ｓ^(1/n)つまり、外接多次元矩形面積ｓのｎ乗根の
逆数を、特徴量毎の重要度（重み係数）とする。これ
は、辺長がゼロでない次元のみで構成された空間（これ
を有効部分空間と称す）上の多次元矩形を多次元正方形
と見做したときの一辺の長さに相当する。図８は、この
様子を示したものである。According to the method of calculating the importance of a feature value according to the present invention, when a vector group is given for each feature value,
Basically, the circumscribed multidimensional rectangular area is used, but the dimension where the side length is zero is excluded, and the circumscribed rectangular area s is always s> 0 except when all the side lengths become zero. . Then, assuming that the number of dimensions whose side length is not zero is n, w =
1 / s ^{(1 / n),} that is, the reciprocal of the nth root of the circumscribed multidimensional rectangular area s is defined as the importance (weight coefficient) for each feature amount. This corresponds to the length of one side when a multidimensional rectangle in a space composed of only dimensions whose side lengths are not zero (this is referred to as an effective subspace) is regarded as a multidimensional square. FIG. 8 shows this state.

【００１７】本発明の特徴量毎の重要度算出方法では、
辺長がゼロでない次元のみで構成された有効部分空間の
次元数で冪開しているので、元々の特徴量空間の次元数
の行程の影響を受けにくい重要度（重み係数）の生成が
可能である。In the method of calculating the importance for each feature value according to the present invention,
Since the exponent is expanded with the number of dimensions of the effective subspace composed of only non-zero dimensions, it is possible to generate importance (weighting factor) that is not easily affected by the process of the number of dimensions of the original feature space. It is.

【００１８】次に、本発明は、各画像に対して１以上の
キーワードと特徴量セットが付与された画像データベー
スから、上記特徴量の重要度算出方法を用いて、キーワ
ード毎の画像特徴量表現（特徴量セットとその各特徴量
の重要度）を算出して、キーワードを特徴量に変換する
ためのキーワード画像特徴量表現データベースを自動構
築することを特徴とする。また、本発明は、このような
キーワード画像特徴量表現データベース等を用いて部分
画像にキーワードを付与し、画像データベースを再構築
することを特徴とする。Next, the present invention provides an image feature amount expression for each keyword from an image database to which one or more keywords and a feature amount set are assigned to each image by using the above-described feature amount importance calculation method. (A feature amount set and the importance of each feature amount) are calculated, and a keyword image feature amount expression database for converting a keyword into a feature amount is automatically constructed. Further, the present invention is characterized in that a keyword is assigned to a partial image using such a keyword image feature expression database or the like, and the image database is reconstructed.

【００１９】さらに、本発明は、キーワード画像特徴量
表現データベースと画像データベースを組み合せ、入力
されたキーワードをキーワード画像特徴量表現データベ
ースを用いて特徴量表現に変換して、キーワードと特徴
量表現の両者を用いて画像データベースを検索する画像
検索システムを構築することを特徴とする。これによ
り、サンプル画像が不要であり、また、適切なキーワー
ドが付与されない画像データベースでも、高精度の画像
検索が可能である。Further, according to the present invention, the keyword image feature amount expression database and the image database are combined, and the inputted keyword is converted into the feature amount expression using the keyword image feature amount expression database. And an image retrieval system for retrieving an image database by using the method. As a result, a sample image is unnecessary, and a high-precision image search can be performed even in an image database to which an appropriate keyword is not assigned.

【００２０】[0020]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面により詳しく説明する。〈特徴量の重要度算出〉図９は、本発明による特徴量の
重要度算出手順を示す。一例として、元々の特徴量空間
の次元数を３（３次元：ｘ，ｙ，ｚ）とし、特徴量Ａに
おけるベクトル群として、特徴量Ａ１＝（ｘ_A1，ｙ_A1，
ｚ_A1）、特徴量Ａ２＝（ｘ_A2，ｙ_A2，ｚ_A2）、特徴量Ａ
３＝（ｘ_A3，ｙ_A3，ｚ_A3）が与えられたとする。Embodiments of the present invention will be described below in detail with reference to the drawings. <Calculation of Feature Value Importance> FIG. 9 shows a procedure for calculating feature value importance according to the present invention. As an example, the number of dimensions of the original feature space is 3 (three dimensions: x, y, z), and the feature amount A1 = (x _A1 , y _A1 ,
z _A1 ), feature A2 = (x _A2 , y _A2 , z _A2 ), feature A
_Suppose that 3 = (x _A3 , y _A3 , z _A3 ) is given.

【００２１】まず、特徴量空間の各次元における座標値
の最大値、最小値を求める（ステップ１０１）。例で
は、ｘ_A1，ｘ_A2，ｘ_A3中の最大値ｘ_maxと最小値ｘ_min，
ｙ_A1，ｙ_A2，ｙ_A3中の最大値ｙ_maxと最小値ｙ_min，
ｚ_A1，ｚ_A2，ｚ_A3中の最大値ｚ_maxと最小値ｚ_minを求め
る。First, the maximum value and the minimum value of the coordinate values in each dimension of the feature space are obtained (step 101). In the example, the maximum value x _max and the minimum value x _{min of} x _A1 , x _A2 , x _A3 are
The maximum value y _max and the minimum value y _{min of} y _A1 , y _A2 , y _A3
Request z _A1, z _A2, the maximum value z _max and the minimum value z _min in z _A3.

【００２２】次に、座標値の最大値と最小値が等しい次
元を求める（ステップ１０２）。すなわち、辺長がゼロ
になる次元を求める。ここでは、ｙ_max＝ｙ_minとし、ｙ
次元が辺長ゼロであったとする。次に、座標値の最大値
＝最小値の次元、すなわち、辺長がゼロとなる次元を除
去し、残った特徴量空間（有効部分空間）の次元数ｎを
求める（ステップ１０３）。例では、ｎ＝２である。Next, a dimension where the maximum value and the minimum value of the coordinate values are equal is obtained (step 102). That is, a dimension in which the side length becomes zero is obtained. Here, y _max = y _min and y
Assume that the dimension has a side length of zero. Next, the dimension of the maximum value = minimum value of the coordinate values, that is, the dimension of which the side length is zero is removed, and the dimension number n of the remaining feature amount space (effective subspace) is obtained (step 103). In the example, n = 2.

【００２３】次に、有効部分空間における特徴量分布の
外接多次元長矩形の面積ｓを算出する（ステップ１０
４）。例では、ｓ＝（ｘ_max−ｘ_min）・（ｚ_max−
ｚ_min）として求まる。従来技術では、ｙ_max＝ｙ_minの
場合、ｓ＝（ｘ_max−ｘ_miｎ）・（ｙ_max−ｙ_min）・
（ｚ_max−ｚ_min）＝０となり、たとえｘ_max≠ｘ_min，ｚ
_max≠ｚ_minであっても、面積ｓは必ずゼロになるという
問題が生じる。Next, the area s of the circumscribed multidimensional long rectangle of the feature distribution in the effective subspace is calculated (step 10).
4). In the example, s = (x _max −x _min ) · (z _max −
z _min ). In the prior art, if the _{_{y max = y min, s =}} (x max -x mi n) · (y max -y min) ·
(Z _max −z _min ) = 0, for example, x _max ≠ x _min , z
_{Even if max} ≠ z _min , there is a problem that the area s always becomes zero.

【００２４】最後に、面積ｓのｎ乗根の逆数を求め、各
特徴量の重要度（重み係数）ｗとする（ステップ１０
５）。例では、ｗ_A＝１／ｓ^(1/2)を特徴量Ａ１，Ａ２，
Ａ３に対する重要度（重み係数）とする。Finally, the reciprocal of the n-th root of the area s is obtained, and is set as the importance (weight coefficient) w of each feature (step 10).
5). In the example, w _A = 1 / s ^{(1/2) is set} to the feature amounts A1, A2,
The importance (weight coefficient) for A3.

【００２５】ここでは、分かり易いように、特徴量（ベ
クトル）を３次元とし、対応する特徴量もＡ１，Ａ２，
Ａ３の３個のみとしたが、実際には特徴量は多次元であ
り、対応する特徴量はそれぞれかなりの数になることは
云うまでもない。Here, for the sake of simplicity, the feature quantity (vector) is three-dimensional, and the corresponding feature quantities are A1, A2,
Although only three A3s are used, it goes without saying that the feature amounts are actually multidimensional and the corresponding feature amounts are considerably large.

【００２６】〈キーワード画像特徴量表現データベース
の作成〉図１０は、本発明によるキーワード画像特徴量
表現データベース作成システムの構成例を示す。図１０
において、キーワード画像特徴量表現データベース作成
装置１１０はキーワード抽出部１１１、特徴量セット抽
出部１１２、特徴量セット算出部１１３、重要度算出部
１１４、データ格納部１１５からなる。画像データベー
ス１２０には、図１１に示すように、各画像と、その特
徴を表わす１以上のキーワード及び特徴量セットが関連
づけられて蓄積されている。キーワード特徴量表現デー
タベース１３０には、図１２の下段に示すように、キー
ワード毎に作成された画像特徴量表現データが蓄積され
る。特徴量表現データは特徴量セットとその各特徴量の
重要度（重み係数）の組で構成される。<Creation of Keyword Image Feature Expression Database> FIG. 10 shows a configuration example of a keyword image feature expression database creation system according to the present invention. FIG.
, The keyword image feature expression database creation device 110 includes a keyword extractor 111, a feature set extractor 112, a feature set calculator 113, an importance calculator 114, and a data storage 115. As shown in FIG. 11, the image database 120 stores each image, one or more keywords representing the characteristics thereof, and a feature amount set in association with each other. As shown in the lower part of FIG. 12, the image feature amount expression data created for each keyword is accumulated in the keyword feature amount expression database 130. The feature quantity expression data is composed of a set of a feature quantity set and the importance (weight coefficient) of each feature quantity.

【００２７】キーワード抽出部１１１は、画像データベ
ース１２０から全てのキーワードを抽出する。特徴量セ
ット抽出部１１２は、キーワード抽出部１１１で抽出さ
れた各キーワードにより画像データベース１２０を検索
して、各キーワード毎に、当該キーワードが付与された
全ての画像の特徴量セットを抽出する。図１２の上段
は、キーワード１に対して画像１、画像２、画像３、…
の特徴量セット１、特徴量セット２、特徴量セット３、
…が抽出されることを示している。The keyword extracting unit 111 extracts all keywords from the image database 120. The feature amount set extraction unit 112 searches the image database 120 based on each keyword extracted by the keyword extraction unit 111, and extracts, for each keyword, a feature amount set of all images to which the keyword is assigned. 12, the image 1, image 2, image 3,...
Feature quantity set 1, feature quantity set 2, feature quantity set 3,
.. Are extracted.

【００２８】特徴量セット算出部１１３は、各キーワー
ド毎に、当該キーワードの画像特徴量表現における特徴
量セットを算出する。具体的には、特徴量セット抽出部
１１２で抽出された当該キーワードに対する各画像の特
徴量セットから、対応する特徴量毎、各次元対応に平均
値を求める。例えば、図１２の下段のキーワード１の画
像特徴量表現において、特徴量セット中の特徴量Ａは、
図１２の上段の当該キーワード１に対する画像１、画像
２、画像３、…の特徴量セット１、特徴量セット２、特
徴量セット３、…における対応する特徴量Ａ１、Ａ２、
Ａ３、…について、各次元対応に平均値を算出して得ら
れる。特徴量Ｂ、Ｃ、Ｄについても同様である。The feature value set calculation unit 113 calculates, for each keyword, a feature value set in the image feature value expression of the keyword. Specifically, from the feature amount set of each image corresponding to the keyword extracted by the feature amount set extracting unit 112, an average value is obtained for each corresponding feature amount and for each dimension. For example, in the image feature amount expression of keyword 1 in the lower part of FIG. 12, the feature amount A in the feature amount set is:
The corresponding feature amounts A1, A2, in feature amount set 1, feature amount set 2, feature amount set 3,... Of image 1, image 2, image 3,.
A3,... Are obtained by calculating an average value for each dimension. The same applies to the feature amounts B, C, and D.

【００２９】重要度算出部１１４は、各キーワード毎
に、当該キーワードの画像量特徴量表現における各特徴
量の重要度を算出する。具体的には、特徴量セット抽出
部１１２で抽出された当該キーワードに対する各画像の
特徴量セットから、本発明による特徴量重要度算出方法
を用いて各特徴量の重要度（重み係数）を算出する。例
えば、図１２の下段のキーワード１の画像特徴量表現に
おいて、特徴量Ａの重要度ｗAは、特徴量Ａ１、Ａ２、
Ａ３、…を対象ベクトル群として、図９の手順で算出す
る。特徴量Ｂ、Ｃ、Ｄの各重要度ｗ_B、ｗ_C、ｗ_Dについ
ても同様である。The importance calculating section 114 calculates, for each keyword, the importance of each feature amount in the image amount feature amount expression of the keyword. Specifically, the importance (weighting coefficient) of each feature is calculated from the feature set of each image corresponding to the keyword extracted by the feature set extraction unit 112 using the feature importance calculation method according to the present invention. I do. For example, in the image feature amount expression of keyword 1 in the lower part of FIG. 12, the importance wA of the feature amount A is the feature amount A1, A2,
A3,... Are calculated as a target vector group in the procedure of FIG. The same applies to the degrees of importance w _B , w _C , and w _D of the feature amounts B, C, and D.

【００３０】なお、キーワードの画像特徴量表現の各特
徴量の重要度は、図９で説明した有効部分空間における
外接長矩形の面積の代わりに、従来技術の分散や標準偏
差あるいは外接矩形面積や外接円面積を用いて算出して
もよい。It should be noted that the importance of each feature amount in the image feature amount expression of the keyword is determined by the variance or standard deviation or the circumscribed rectangle area of the conventional technique instead of the area of the circumscribed rectangle in the effective subspace described with reference to FIG. It may be calculated using the circumscribed circle area.

【００３１】データ格納部１１５は、特徴量セット算出
部１１３および重要度算出部１１４の算出結果を組み合
わせ、順次、キーワードごとの画像特徴量表現（特徴量
セットと各特徴量の重要度）をキーワード特徴量表現デ
ータベース１３０に格納する。この結果、キーワード画
像特徴量表現データベース１３０には、図１２の下段に
示すようなキーワードの画像特徴量表現が、当該キーワ
ードと関連づけられて蓄積される。The data storage unit 115 combines the calculation results of the feature set calculation unit 113 and the importance calculation unit 114, and sequentially stores the image feature expression (the feature set and the importance of each feature) for each keyword. It is stored in the feature quantity expression database 130. As a result, the keyword image feature amount expression as shown in the lower part of FIG. 12 is stored in the keyword image feature amount expression database 130 in association with the keyword.

【００３２】〈画像検索システム〉図１３は、本発明に
よる画像検索システムの構成例を示す。本画像検索シス
テムは画像データベース１２０、キーワード画像特徴量
表現データベース１３０、画像検索装置１４０などを備
える。画像データベース１２０は、図１１に示したよう
に、各画像と、その特徴量を表わす１以上のキーワード
と特徴量セットを関連づけて蓄積している。キーワード
画像特徴量表現データベース１３０は、図１２の下段に
示したように、キーワードごとの画像特徴量表現（特徴
量セットと各特徴量の重要度）を、当該キーワードと関
連づけて蓄積している。画像検索装置１４０はキーワー
ド入力部１４１、キーワード特徴量変換部１４２、特徴
量検索部１４３、キーワード検索部１４４、検索結果統
合部１４５からなる。<Image Search System> FIG. 13 shows an example of the configuration of an image search system according to the present invention. The image search system includes an image database 120, a keyword image feature expression database 130, an image search device 140, and the like. As shown in FIG. 11, the image database 120 stores each image, one or more keywords representing the feature amount thereof, and a feature amount set in association with each other. As shown in the lower part of FIG. 12, the keyword image feature expression database 130 accumulates image feature expression (feature set and importance of each feature) for each keyword in association with the keyword. The image search device 140 includes a keyword input unit 141, a keyword feature amount conversion unit 142, a feature amount search unit 143, a keyword search unit 144, and a search result integration unit 145.

【００３３】利用者は、検索したい画像の特徴を表わす
キーワードを入力する。キーワード入力部１４１は、入
力されたキーワードをキーワード特徴量変換部１４２と
キーワード検索部１４４に送る。キーワード特徴量変換
部１４２は、入力されたキーワードを用いてキーワード
画像特徴量表現データベース１３０を検索し、該入力さ
れたキーワードに対応する画像特徴量表現（特徴量セッ
トと各特徴量の重要度）を取得する。特徴量検索部１４
３は、キーワード特徴量変換部１４２で取得した画像特
徴量表現により画像データベース１２０を検索する。具
体的には、特徴量検索部１４３では、画像データベース
１２０の各画像の特徴量セットと画像特徴量表現の特徴
量セットとの類似度を、画像特徴量表現の各特徴量の重
要度（重み係数）を用いて算出し、類似度が大きい順に
一定数あるいは一定割合の画像データを画像データベー
ス１２０から読み出す。キーワード検索部１４４は、キ
ーワード入力部１４０からの入力キーワードにより画像
データベース１２０を検索し、入力キーワードと一致す
るキーワードを持つ１つあるいは複数の画像データを読
み出す。検索結果統合出力１４５は、特徴量検索部１４
３による検索結果とキーワード検索部１４４による検索
結果から最終的な検索結果を決定し、その画像データを
出力する。The user inputs a keyword representing a feature of an image to be searched. The keyword input unit 141 sends the input keyword to the keyword feature amount conversion unit 142 and the keyword search unit 144. The keyword feature value conversion unit 142 searches the keyword image feature value expression database 130 using the input keyword, and expresses an image feature value expression (feature value set and importance of each feature value) corresponding to the input keyword. To get. Feature amount search unit 14
3 searches the image database 120 based on the image feature expression obtained by the keyword feature conversion unit 142. Specifically, the feature amount search unit 143 determines the similarity between the feature amount set of each image in the image database 120 and the feature amount set of the image feature amount expression by using the importance (weight) of each feature amount of the image feature amount expression. , And a certain number or a certain percentage of image data is read out from the image database 120 in descending order of similarity. The keyword search unit 144 searches the image database 120 based on the input keyword from the keyword input unit 140, and reads out one or a plurality of image data having a keyword that matches the input keyword. The search result integrated output 145 is output to the feature amount search unit 14.
3 and the keyword search unit 144 to determine a final search result, and output the image data.

【００３４】ここで、検索結果統合出力部１４５から出
力される画像データとしては、（１）特徴量検索部とキ
ーワード検索部での検索結果のすべて（ＯＲ条件）、
（２）特徴量検索部とキーワード検索部の両方での検索
結果（ＡＮＤ条件）、（３）特徴量検索部での検索結果
のみ、（４）キーワード検索部での検索結果のみ、
（５）（１）〜（４）について予め定めた順に出力す
る、などが考えられる。そして、これらのいずれを利用
するかは、利用者が任意に選択できるようにする。Here, the image data output from the search result integration output unit 145 includes (1) all of the search results (OR condition) in the feature amount search unit and the keyword search unit,
(2) search results (AND conditions) in both the feature amount search unit and the keyword search unit, (3) only search results in the feature amount search unit, (4) only search results in the keyword search unit,
(5) It is possible to output (1) to (4) in a predetermined order. The user can arbitrarily select which of these is to be used.

【００３５】〈画像データベースの構築〉一つの画像内
には複数の物体が写っているなど、複数の部分画像に分
けることができる。例えば、林檎と梨と花瓶が置かれた
テーブルの昼の情景の画像に、「林檎」、「梨」、「テ
ーブル」、「静物」、「昼」というキーワードがつけら
れている場合、具象物である「林檎」、「梨」、「テー
ブル」は、それぞれ画像の特定の部分（部分画像）を表
わしている。したがって、部分画像とキーワードを結び
つけることができれば、より柔軟かつ精度の高い画像デ
ータベースやキーワード画像特徴量表現データベース、
およびそれを用いた画像検索システムを構築できる。<Construction of Image Database> One image can be divided into a plurality of partial images, such as a plurality of objects appearing in one image. For example, if the image of a daytime scene on a table with apples, pears, and vases has the keywords "apple", "pear", "table", "still life", "day", “Apple”, “pear”, and “table” respectively represent specific portions (partial images) of the image. Therefore, if partial images and keywords can be linked, a more flexible and accurate image database and keyword image feature expression database,
And an image retrieval system using the same.

【００３６】図１４は、このような画像データベース構
築の概念図を示したものである。図１４の左側は、画像
データベースの既存のデータ構成であり、画像に対し
て、１以上のキーワードと特徴量セットが付与され、ま
た、部分画像に対してはその特徴量セットが付与されて
いる。ここでは、この部分画像に対してキーワードを追
加し、図１４の右側のようなデータ構成とするものであ
る。なお、部分画像には、すでに一部のキーワードが付
与されていてもよい。FIG. 14 shows a conceptual diagram of such an image database construction. The left side of FIG. 14 shows an existing data structure of the image database, in which one or more keywords and a feature amount set are assigned to an image, and the feature amount set is assigned to a partial image. . Here, a keyword is added to this partial image, and the data configuration is as shown on the right side of FIG. Note that partial keywords may already be assigned to partial images.

【００３７】図１４の右側に示すデータ構成の画像デー
タベースを適用することにより、図１５に示すように、
より適切なキーワードごとの画像特徴量表現（部分画像
特徴量表現）を作成することができる。また、このキー
ワードの部分画像特徴量表現は、後述するように、さら
に部分画像にキーワードを追加する際にも利用できる。By applying the image database having the data structure shown on the right side of FIG. 14, as shown in FIG.
It is possible to create a more appropriate image feature expression (a partial image feature expression) for each keyword. Further, as described later, the partial image feature amount expression of the keyword can be used when a keyword is further added to the partial image.

【００３８】図１６に、部分画像にキーワードを追加す
る方法の一例を示す。これは、図１４に示したように、
画像データベースの既存のデータを直接利用するもので
ある。まず、利用者が、追加しようとするキーワードを
入力することで、画像データベースから該キーワードの
付与された画像を検索する（ステップ２０１）。検索結
果の画像は表示装置に表示される。次に、利用者は、こ
の検索結果の画像から、当該キーワードに関係すると思
われる部分を指定し（ステップ２０２）、画像データベ
ースから、その部分近辺に位置する部分画像を取得する
（ステップ２０３）。次に、利用者は、この取得された
部分画像の中から該キーワードに関係すると思われるも
のを指定する（ステップ２０４）。この結果、画像デー
タベースの当該画像に対して、指定された部分画像のキ
ーワードリストに該キーワードが追加される（ステップ
２０５）。なお、追加するキーワードは、最初の検索に
利用したものと同一でなくてもよく、利用者が追加すべ
きキーワードを別に入力することでもよい。FIG. 16 shows an example of a method for adding a keyword to a partial image. This is, as shown in FIG.
It directly uses existing data in the image database. First, when a user inputs a keyword to be added, the image database is searched for an image assigned with the keyword (step 201). The image of the search result is displayed on the display device. Next, the user specifies a portion considered to be related to the keyword from the image of the search result (step 202), and acquires a partial image located near the portion from the image database (step 203). Next, the user designates one of the obtained partial images that is considered to be related to the keyword (step 204). As a result, the keyword is added to the keyword list of the specified partial image for the image in the image database (step 205). The keyword to be added may not be the same as that used for the first search, and the user may input a keyword to be added separately.

【００３９】図１７に、部分画像にキーワードを追加す
る方法の別の一例を示す。これは、図１５に示すよう
な、すでに一部作成ずみのキーワードの部分画像特徴量
表現を利用するものである。FIG. 17 shows another example of a method for adding a keyword to a partial image. This uses a partial image feature amount expression of a keyword that has already been partially created as shown in FIG.

【００４０】まず、利用者が、追加しようとするキーワ
ードを入力することで、キーワード画像特徴量表現デー
タベースから該キーワードの部分画像特徴量表現（特徴
量セットと各特徴量の重要度）を取得し（ステップ３０
１）、該キーワードの部分画像特徴量表現を利用して画
像データベースの部分画像との類似検索を実行する（ス
テップ３０２）。検索結果の部分画像は表示装置に表示
される。次に、利用者は、この検索結果の部分画像の中
から、該キーワードに関係すると思われるもののうちか
ら未だ該キーワードが設定されていないものを指定する
（ステップ３０３）。この結果、画像データベースのデ
ータについて、該指定された部分画像のキーワードとし
て該キーワードが追加され（ステップ３０４）、該指定
された部分画像を含む親画像のキーワードとして該キー
ワードが追加される（ステップ３０５）。この場合も、
追加されるキーワードは、最初にキーワード画像特徴量
表現データベースの検索に利用したものと同一でなくて
もよく、利用者が追加すべきキーワードを別に入力する
ことでもよい。First, when a user inputs a keyword to be added, a partial image feature amount expression (feature amount set and importance of each feature amount) of the keyword is obtained from the keyword image feature amount expression database. (Step 30
1) A similarity search with a partial image in an image database is executed using the partial image feature expression of the keyword (step 302). The partial image of the search result is displayed on the display device. Next, from among the partial images of the search result, the user designates, from among the partial images considered to be related to the keyword, those for which the keyword has not been set yet (step 303). As a result, for the data in the image database, the keyword is added as a keyword of the specified partial image (step 304), and the keyword is added as a keyword of a parent image including the specified partial image (step 305). ). Again,
The keyword to be added may not be the same as that used for the search of the keyword image feature amount expression database at first, and the user may input a keyword to be added separately.

【００４１】このように部分画像にキーワードを追加し
た画像データベースを利用して、再び図１５に示すよう
にキーワードの部分画像特徴量表現を作成し、キーワー
ド画像特徴量表現データベースに蓄積する。Using the image database in which the keyword is added to the partial image, a partial image feature amount expression of the keyword is created again as shown in FIG. 15, and stored in the keyword image feature amount expression database.

【００４２】新規の画像データを画像データベースに登
録する場合、図１５の下方のようなキーワードの部分画
像特徴量表現を蓄積したキーワード画像特徴量表現デー
タベースを利用することにより、次のようにして、図１
４の右側のような画像／部分画像キーワード付きデータ
を自動的に構築することができる。まず、新規に登録す
る画像、その部分画像の特徴量を抽出して、画像の特徴
量セット、部分画像の特徴量セットを取得する。次に、
この取得した画像の特徴量セットとキーワード画像特徴
量表現データベースにおける各キーワードの画像特徴量
表現の特徴量セットとの類似度を算出して、類似度の大
きいもののキーワードを該画像に付与する。次に、同様
に、取得した部分画像の特徴量セットとキーワード画像
特徴量表現データベースにおける各キーワードの部分画
像特徴量表現の特徴量セットとの類似度を算出して、類
似度の大きいもものキーワードを該部分画像に付与す
る。When registering new image data in the image database, a keyword image feature expression database in which keyword partial image feature expressions are stored as shown in the lower part of FIG. 15 is used as follows. FIG.
4, the image / partial image keyword data as shown on the right side can be automatically constructed. First, an image to be newly registered and a feature amount of the partial image are extracted, and a feature amount set of the image and a feature amount set of the partial image are obtained. next,
The degree of similarity between the acquired feature amount set of the image and the feature amount set of the image feature amount expression of each keyword in the keyword image feature amount expression database is calculated, and a keyword having a large similarity is assigned to the image. Next, similarly, the similarity between the acquired feature amount set of the partial image and the feature amount set of the partial image feature amount expression of each keyword in the keyword image feature amount expression database is calculated, and a keyword having a large similarity is calculated. Is given to the partial image.

【００４３】これを実現するシステムは、基本的に図１
３に示したような画像検索システムに特徴量抽出部を追
加することで可能である。また、部分画像にキーワード
を付加することで、図１３で説明したと同様の処理で、
部分画像の検索が可能である。The system for realizing this is basically the system shown in FIG.
It is possible by adding a feature amount extraction unit to the image search system as shown in FIG. Also, by adding a keyword to the partial image, processing similar to that described with reference to FIG.
Partial image search is possible.

【００４４】以上、本発明の種々の実施形態について説
明したが、本発明は、ここで述べたものに限定されるも
のでないことは云うまでもない。また、ここで述べたよ
うな特徴量の重要度算出手順、あるいはキーワード画像
特徴量表現データベース作成手順や画像検索手順は、コ
ンピュータで実行可能なプログラム言語で記述し、あら
かじめコンピュータで読み取り可能な記録媒体、例え
ば、フロッピー（登録商標）ディスク、ＣＤ−ＲＯＭ、
メモリカード等に記録して提供することも可能である。
これをコンピュータにロードすることにより、所期の機
能が実現する。Although various embodiments of the present invention have been described above, it is needless to say that the present invention is not limited to the above-described embodiments. In addition, the procedure for calculating the importance of the feature amount, the procedure for creating the keyword image feature amount expression database, and the image search procedure described above are described in a computer-executable program language, and are stored in a computer-readable recording medium in advance. For example, a floppy (registered trademark) disk, a CD-ROM,
It is also possible to provide it by recording it on a memory card or the like.
By loading this into a computer, the desired function is realized.

【００４５】[0045]

【発明の効果】以上説明したように、本発明による主な
効果を挙げれば以下の通りである。（１）特徴量の重み付けを算出する際に、特徴量空間
（特徴量の分布空間）において、次元数で冪開している
ので、特徴量空間の次元数の影響を受けにくい。そのた
め、画像等の検索において、特徴量ごとの重要性に基づ
いた特徴量の類似性を判定する際に、精度の高い類似性
判定を行うことができる。（２）画像に適切なキーワードが付与されていないよ
うな画像データベースからでも、キーワード画像特徴量
表現（特徴量セットと各特徴量の重要度）からなるデー
タベースを構築できる。さらに、画像を構成する部分画
像ごとに適切なキーワードを付与することができる。（３）キーワードを特徴量に変換して、キーワードと
特徴量の両者を用いて画像データベースを検索できるの
で、適切なキーワードが付与されていないような画像デ
ータベースでも精度の高い検索を行うことができる。As described above, the main effects of the present invention are as follows. (1) When calculating the weighting of the feature quantity, since the feature quantity space (feature quantity distribution space) is evolved with the number of dimensions, it is hardly affected by the dimension number of the feature quantity space. Therefore, in the search of an image or the like, when determining the similarity of the feature amounts based on the importance of each feature amount, a highly accurate similarity determination can be performed. (2) Even from an image database in which an appropriate keyword is not assigned to an image, a database composed of keyword image feature expression (feature set and importance of each feature) can be constructed. Furthermore, an appropriate keyword can be assigned to each of the partial images constituting the image. (3) Since a keyword can be converted into a feature value and an image database can be searched using both the keyword and the feature value, a highly accurate search can be performed even in an image database to which an appropriate keyword is not assigned. .

[Brief description of the drawings]

【図１】従来の画像検索システムの構成例を示す図であ
る。FIG. 1 is a diagram illustrating a configuration example of a conventional image search system.

【図２】図１の画像データベースのデータ構成を示す図
である。FIG. 2 is a diagram showing a data configuration of an image database of FIG. 1;

【図３】従来の画像検索システムの別の構成例を示す図
である。FIG. 3 is a diagram showing another configuration example of a conventional image search system.

【図４】図３の画像データベースのデータ構成を示す図
である。FIG. 4 is a diagram showing a data configuration of an image database of FIG. 3;

【図５】従来の特徴量の重要度算出方法を説明する図で
ある。FIG. 5 is a diagram illustrating a conventional method of calculating the importance of a feature amount.

【図６】従来の特徴量の重要度算出の別の方法を説明す
る図である。FIG. 6 is a diagram illustrating another method of calculating the importance of a feature amount in the related art.

【図７】従来の特徴量の重要度算出の更に別の方法を説
明する図である。FIG. 7 is a diagram illustrating still another conventional method of calculating the importance of a feature amount.

【図８】本発明による特徴量の重要度算出方法の原理を
示す図である。FIG. 8 is a diagram showing the principle of a feature value importance calculation method according to the present invention.

【図９】本発明による特徴量の重要度算出手順の一例を
示す図である。FIG. 9 is a diagram showing an example of a procedure for calculating importance of a feature amount according to the present invention.

【図１０】本発明によるキーワード画像特徴量表現デー
タベース作成システムの構成例を示す図である。FIG. 10 is a diagram showing a configuration example of a keyword image feature amount expression database creation system according to the present invention.

【図１１】図１０の画像データベースのデータ構成を示
す図である。FIG. 11 is a diagram showing a data configuration of an image database of FIG. 10;

【図１２】キーワードの画像特徴量表現作成の概念図で
ある。FIG. 12 is a conceptual diagram of creating an image feature amount expression of a keyword.

【図１３】本発明による画像検索システムの構成例を示
す図である。FIG. 13 is a diagram showing a configuration example of an image search system according to the present invention.

【図１４】部分画像へのキーワードの追加を説明する図
である。FIG. 14 is a diagram illustrating addition of a keyword to a partial image.

【図１５】キーワードの部分画像特徴量表現作成の概念
図である。FIG. 15 is a conceptual diagram of creating a partial image feature quantity expression of a keyword.

【図１６】部分画像にキーワードを追加する方法の一例
を示す図である。FIG. 16 is a diagram illustrating an example of a method for adding a keyword to a partial image.

【図１７】部分画像にキーワードを追加する方法の別の
例を示す図である。FIG. 17 is a diagram illustrating another example of a method of adding a keyword to a partial image.

[Explanation of symbols]

１１０キーワード画像特徴量表現作成装置１１１キーワード抽出部１１２特徴量セット抽出部１１３特徴量セット算出部１１４重要度算出部１１５データ格納部１２０画像データベース１３０キーワード画像特徴量表現データベース１４０画像検索装置１４１キーワード入力部１４２キーワード特徴量変換部１４３特徴量検索部１４４キーワード検索部１４５検索結果統合出力部 110 Keyword Image Feature Expression Creation Device 111 Keyword Extraction Unit 112 Feature Amount Set Extraction Unit 113 Feature Amount Set Calculation Unit 114 Importance Calculation Unit 115 Data Storage Unit 120 Image Database 130 Keyword Image Feature Amount Expression Database 140 Image Search Device 141 Keyword Input Unit 142 keyword feature amount conversion unit 143 feature amount search unit 144 keyword search unit 145 search result integration output unit

Claims

[Claims]

1. A feature amount importance calculating method for calculating the importance of a feature amount (feature vector) used in calculating the similarity between images, comprising: a maximum value in each dimension of a target feature amount vector group And the minimum value is determined, the dimension in which the maximum value and the minimum value are equal is removed, and the area of the circumscribed long rectangle in the feature quantity vector group including the remaining dimensions (the number of dimensions is n) of the removed dimension is determined. A feature value importance calculating method, wherein the reciprocal of the nth root of the area is used as the feature value importance.

2. A recording medium readable by a computer, wherein a program for executing the method of calculating the feature value importance according to claim 1 on a computer is recorded.

3. An image feature amount expression (feature amount set and importance of each feature amount) for each keyword is determined for each image by using an image database in which one or more keywords are associated with a feature amount set. A keyword image feature amount expression creation method to be created, wherein a keyword is extracted from the image database, and a feature amount set group of an image to which the keyword is assigned is acquired. An average value is calculated for each feature amount, and a feature amount set of an image feature amount expression of the keyword is used.
A keyword image feature amount expression creating method, wherein the importance is calculated by applying the described feature amount importance degree calculation method, and the importance is calculated for each feature amount of the image feature amount expression of the keyword.

4. The keyword image feature amount expression creating method according to claim 3, wherein a keyword is assigned to the image database for each partial image of the image, in addition to the image feature amount expression of the keyword. A method for creating a keyword image feature amount expression, wherein an image feature amount expression is also created.

5. An image database construction method for reconstructing an image database by adding a keyword to a partial image for an image database in which one or more keywords and partial images are stored in association with each other for each image, Searching the image database using keywords to be specified, specifying a desired part of the image of the search result, obtaining a partial image located in the vicinity thereof, and specifying a desired partial image from the obtained partial images And adding a keyword to the partial image.

6. For an image database in which one or more keywords and a feature amount set are associated with each other and a partial image and a feature amount set are stored in association with each other, a keyword is added to the partial image to reconstruct the image database. A database construction method using a keyword image feature expression database in which partial image feature expression (a feature set and the importance of each feature) for each keyword is stored in advance, and a keyword image feature based on a keyword to be added. A partial expression feature amount expression of the keyword is retrieved by searching the quantitative expression database, and a similarity search with the characteristic amount set of each partial image in the image database is performed based on the acquired partial image characteristic amount expression. And obtain a desired partial image from the acquired partial images. In Rukoto, image database building method characterized by adding a keyword to the partial image.

7. An image retrieval method for retrieving an image by inputting a keyword from an image database in which one or more keywords and a feature amount set are associated with each other for each image, comprising: Using a keyword image feature amount expression database in which expressions (feature amount sets and the importance of each feature amount) are accumulated, a keyword image feature amount expression database is searched by the input keyword, and an image feature amount expression of the keyword is obtained. The similarity between the acquired feature amount set of the image feature amount expression and the feature amount set of each image in the image database is calculated using the importance degree of the image feature amount expression, and an image having a large similarity ( (Hereinafter referred to as a first image), and an image database is searched by the input keyword, and the image corresponding to the keyword is searched. Image (hereinafter, referred to as a second image) acquires image search method characterized in that determines and outputs a first image and a final search result image from the second image which the acquired.

8. A recording medium readable by a computer, wherein a program for executing the image search method according to claim 7 on a computer is recorded.

9. A keyword image feature amount expression database creation system for creating a keyword image feature amount expression database storing image feature amount expressions for each keyword (importance of each feature amount of a feature amount set), An image database in which one or more keywords and a feature amount set are stored in association with each other; a means for extracting a keyword from the image database and acquiring a feature amount set group of an image to which the keyword is assigned; Means for calculating an average value for each feature amount from the feature amount set group to calculate a feature amount set; means for calculating the importance of each feature amount from the feature amount set group; and the calculated feature A method of storing a set of a quantity set and importance of each feature quantity as a keyword image feature quantity expression in a keyword image feature quantity expression database. Keywords image feature amount representing database creation system characterized by having, when.

10. An image database in which one or more keywords are associated with a feature amount set for each image, and image feature amount expressions (feature amount sets and importance of each feature amount) for each keyword are accumulated. A keyword image feature expression database; means for inputting a keyword representing an image to be searched; means for searching the keyword image feature expression database based on the input keyword to obtain an image feature expression for the keyword; The similarity between the acquired feature amount set of the image feature amount expression and the feature amount set of each image in the image database is calculated using the importance of the image feature amount expression, and an image having a large similarity (first image) is calculated. Means for acquiring the image) and searching the image database by the input keyword, and corresponding to the keyword. Image (second image)
And a means for determining and outputting a final search result image from the obtained first image and second image.