JP2007304771A

JP2007304771A - Subject recognition device, subject recognition program and image retrieval service provision method

Info

Publication number: JP2007304771A
Application number: JP2006131308A
Authority: JP
Inventors: Keiichi Nitta; 啓一新田
Original assignee: Nikon Corp
Current assignee: Nikon Corp
Priority date: 2006-05-10
Filing date: 2006-05-10
Publication date: 2007-11-22
Anticipated expiration: 2026-05-10
Also published as: JP4725408B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a technology for achieving the efficiency of the subject recognition of image data. <P>SOLUTION: This subject recognition device is provided with an image input part, a subject dictionary part and a processing part. An image input part captures image data. A subject dictionary part classifies subject conditions for specifying the type of the subject into hierarchies for storage. A processing part successively collates the subject conditions according to the hierarchy, and narrows down subject conditions to be collated, and searches the type of the subject projected on the image data. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、被写体認識装置、被写体認識プログラム、および画像検索サービス提供方法に関する。 The present invention relates to a subject recognition device, a subject recognition program, and an image search service providing method.

従来、画像の顔認識技術のように、画像データに映っている顔部分を認識する技術が知られている。
また、下記の特許文献１には、画像データから被写体を認識し、画像データと共に被写体の情報を表示する装置が開示されている。
一方、インターネット上などの検索サイトでは、画像データに付随するテキスト情報（画像ファイル名、撮影日時情報など）をキーワード検索することにより、画像検索サービスを提供している。
特開2001−113097号公報 2. Description of the Related Art Conventionally, a technique for recognizing a face portion shown in image data, such as an image face recognition technique, is known.
Patent Document 1 below discloses an apparatus that recognizes a subject from image data and displays subject information together with the image data.
On the other hand, search sites on the Internet provide an image search service by searching for text information (image file name, shooting date / time information, etc.) attached to image data.
Japanese Patent Laid-Open No. 2001-113097

上述した顔認識技術のように、画像データから被写体を認識する技術は日々進歩しており、それに伴って、認識すべき被写体種類が将来にわたって増加することが予想される。 Like the face recognition technology described above, a technology for recognizing a subject from image data is progressing day by day, and it is expected that the types of subjects to be recognized will increase in the future.

一方、従来の特許文献１では、被写体の種類１つ１つについて、被写体認識を逐一行う必要がある。そのため、認識すべき被写体種類が膨大になると、被写体認識の処理回数が膨大になり、被写体認識に係る総処理時間が長くなってしまう。 On the other hand, in the conventional patent document 1, it is necessary to perform subject recognition one by one for each kind of subject. For this reason, if the types of subjects to be recognized become enormous, the number of times of subject recognition processing becomes enormous and the total processing time for subject recognition becomes longer.

そこで、本発明では、画像データの被写体認識を効率良く行う技術を提供することを目的とする。 Therefore, an object of the present invention is to provide a technique for efficiently recognizing a subject of image data.

《１》本発明の被写体認識装置は、画像入力部、被写体辞書部、および処理部を備える。
画像入力部は、画像データを取り込む。
被写体辞書部は、被写体種類を特定するための被写体条件を記憶する。
処理部は、画像データを被写体条件に照合して、画像データに映っている被写体種類を特定する。
特に、本発明では、被写体辞書部は、被写体条件を階層に分類して記憶する。処理部は、上位階層に位置する被写体条件と画像データとの照合結果に基づいて、下位階層に位置する被写体条件を絞り込みつつ、画像データと下位階層の被写体条件との照合を行う。
《２》なお好ましくは、処理部は、認識された被写体種類に関連して予め定められたキーワード情報を、画像データに対応付けて記録する。さらに、処理部は、入力されるキーワードを、このキーワード情報と照合することによって画像検索を行う。
《３》なお好ましくは、被写体辞書部は、被写体種類Ｋｄおよび被写体条件Ｃｄを追加登録する機能を有する。処理部は、この追加登録に応じて、被写体種類Ｋｄを包含する上位階層の被写体種類Ｋｕを求め、被写体種類Ｋｕを検出済みの画像データについて、被写体条件Ｃｄとの照合を追加実施する。
《４》また好ましくは、被写体辞書部は、被写体条件のバージョン情報を管理する機能を有する。処理部は、被写体種類の特定に使用した被写体条件のバージョン情報を、画像データに対応付けて記録する。
《５》本発明の被写体認識プログラムは、コンピュータを、請求項１ないし請求項４のいずれか１項に記載の被写体認識装置として機能させるためのプログラムである。
《６》本発明の画像検索サービス提供方法は、コンピュータが画像検索サービスを実施する方法であって、次のステップを備える。
(1)被写体種類を特定するための被写体条件を階層に分類して記憶するステップ。
(2)画像データを階層に従って被写体条件と順次照合することにより、画像データに映っている被写体種類を求めるステップ。
(3)認識された被写体種類に関連して予め定められたキーワード情報を、画像データに対応付けて記録するステップ。
(4)入力されるキーワードを、画像データに対応するキーワード情報と照合して、画像検索を行うステップ。 << 1 >> The subject recognition apparatus of the present invention includes an image input unit, a subject dictionary unit, and a processing unit.
The image input unit captures image data.
The subject dictionary unit stores subject conditions for specifying the subject type.
The processing unit collates the image data with the subject condition, and identifies the subject type shown in the image data.
In particular, in the present invention, the subject dictionary unit stores subject conditions classified into hierarchies. The processing unit collates the image data with the subject condition of the lower layer while narrowing down the subject conditions located at the lower layer based on the collation result between the subject condition located at the upper layer and the image data.
<< 2 >> Preferably, the processing unit records keyword information predetermined in association with the recognized subject type in association with the image data. Further, the processing unit performs an image search by comparing the input keyword with this keyword information.
<< 3 >> Preferably, the subject dictionary unit has a function of additionally registering the subject type Kd and the subject condition Cd. In response to this additional registration, the processing unit obtains a higher-level subject type Ku that includes the subject type Kd, and additionally performs collation with the subject condition Cd for the image data for which the subject type Ku has been detected.
<< 4 >> Also preferably, the subject dictionary unit has a function of managing version information of subject conditions. The processing unit records the version information of the subject condition used for specifying the subject type in association with the image data.
<< 5 >> The subject recognition program of the present invention is a program for causing a computer to function as the subject recognition device according to any one of claims 1 to 4.
<< 6 >> The image search service providing method of the present invention is a method in which a computer executes an image search service, and includes the following steps.
(1) A step of classifying and storing subject conditions for specifying a subject type in a hierarchy.
(2) A step of obtaining a subject type shown in the image data by sequentially comparing the image data with subject conditions according to the hierarchy.
(3) A step of recording keyword information predetermined in relation to the recognized subject type in association with the image data.
(4) A step of performing an image search by collating the input keyword with keyword information corresponding to the image data.

本発明では、階層に分類された被写体条件を上位階層から下位階層に辿りながら、画像データの被写体認識を実施する。この場合、被写体条件を絞り込むように被写体認識が実施され、可能性の低い下位階層の被写体認識を省くことができる。その結果、被写体認識の処理回数を低減し、多数の被写体種類について効率の良い被写体認識を行うことができる。 In the present invention, subject recognition of image data is performed while tracing subject conditions classified into hierarchies from an upper hierarchy to a lower hierarchy. In this case, subject recognition is performed so as to narrow down the subject condition, and it is possible to omit subject recognition of a lower hierarchy with a low possibility. As a result, the number of subject recognition processes can be reduced, and efficient subject recognition can be performed for many types of subjects.

［構成説明］
図１は、画像検索システム１１の構成を示すブロック図である。
図１において、画像検索システム１１は、被写体認識装置１２、画像データベース１３、および端末装置１４（ユーザーのパソコンなど）を、インターネットなどの通信回線により通信接続して構成される。 [Description of configuration]
FIG. 1 is a block diagram showing the configuration of the image search system 11.
In FIG. 1, an image search system 11 is configured by communicably connecting a subject recognition device 12, an image database 13, and a terminal device 14 (such as a user's personal computer) via a communication line such as the Internet.

なお、この画像検索システム１１を、コンピュータ単体で実現することにより、コンピュータ内に記憶される画像データを高速検索するシステムを構築してもよい。 In addition, you may construct | assemble the system which searches the image data memorize | stored in a computer at high speed by implement | achieving this image search system 11 with a computer single-piece | unit.

さらに、上記の被写体認識装置１２は、画像入力部２１、被写体辞書部２２、および処理部２３を備えて構成される。さらに、この処理部２３は、特徴量演算部３１、特徴量記憶部３２、制御部３３、および被写体キーワード記憶部３４を備えて構成される。
なお、このような被写体認識装置１２の一部または全部を、コンピュータ上で被写体認識プログラムを実行することによって、ソフトウェア的に実現してもよい。また、被写体認識装置１２の一部または全部を、ハードウェア的に実現してもよい。
以下、画像検索システム１１の動作について、具体的に説明する。 Further, the subject recognition device 12 includes an image input unit 21, a subject dictionary unit 22, and a processing unit 23. Further, the processing unit 23 includes a feature amount calculation unit 31, a feature amount storage unit 32, a control unit 33, and a subject keyword storage unit 34.
Note that a part or all of the subject recognition apparatus 12 may be realized in software by executing a subject recognition program on a computer. Further, part or all of the subject recognition device 12 may be realized by hardware.
Hereinafter, the operation of the image search system 11 will be specifically described.

［新規画像に対する被写体認識］
図２は、新規の画像データに対する被写体認識を説明する流れ図である。以下、この図２に示すステップ番号に沿って、動作説明を行う。 [Subject recognition for new images]
FIG. 2 is a flowchart illustrating subject recognition for new image data. Hereinafter, the operation will be described along the step numbers shown in FIG.

ステップＳ１：画像入力部２１は、画像データベース１３のデータ更新を定期的に監視する。画像入力部２１は、この監視によって、画像データベース１３内にキーワード未付与の新規な画像データを発見すると、この画像データを取り込む。 Step S1: The image input unit 21 periodically monitors the data update of the image database 13. When the image input unit 21 finds new image data to which no keyword is assigned in the image database 13 by this monitoring, the image input unit 21 takes in the image data.

ステップＳ２：特徴量演算部３１は、被写体認識を効率良く行うため、取り込んだ画像データから画像特徴量を抽出する。この画像特徴量は、後述する被写体条件との照合に使用される。抽出された画像特徴量は、特徴量記憶部３２に記憶される。
この種の画像特徴量としては、例えば、下記のようなデータが有効である。 Step S2: The feature amount calculation unit 31 extracts an image feature amount from the captured image data in order to perform subject recognition efficiently. This image feature amount is used for collation with subject conditions described later. The extracted image feature quantity is stored in the feature quantity storage unit 32.
For example, the following data is effective as this kind of image feature amount.

(1)画像の色相ヒストグラム…風景被写体／人物被写体などを色傾向により認識する際に使用する。
(2)画像の抽出エッジ…エッジ形状に基づいて特定被写体の認識に使用する。
(3)輪郭が閉曲線を示している領域・・顔や物などの閉領域の認識に使用する。
(4)顔（肌色）部分の部分画像…特定個人の認識に使用する。
(5)唇付近の色…男女の認識に使用する。
(6)髪付近の色…男女や高齢者の認識に使用する。
(7)画像内のテクスチャー…目／鼻／口／衣服の模様／草原／樹林などの認識に使用する。
(8)縦ラインの頻度…建物／樹林などの認識に使用する。
(9)画面を横断する横ライン…水平線／地平線などの認識に使用する。
(10)光源の推定色温度…室内／屋外などの認識に使用する。
(11)画像データの部分画像・・パターンマッチングに使用する。
(12)ＧＰＳ情報、撮影日時、被写体距離、露出情報、音声データ、画像の縦、横方向を示す情報、焦点制御情報などの付随情報 (1) Hue histogram of image: Used when recognizing landscape subject / person subject by color tendency.
(2) Image extraction edge: Used to recognize a specific subject based on the edge shape.
(3) Used for recognizing a closed region such as a face or an object where the contour shows a closed curve.
(4) Partial image of the face (skin color) part ... Used for recognition of specific individuals.
(5) Color near the lips: Used to recognize men and women.
(6) Color near hair: Used to recognize men and women and the elderly.
(7) Texture in image: Used to recognize eyes / nose / mouth / clothing pattern / meadow / forest.
(8) Vertical line frequency: Used to recognize buildings / forests.
(9) Horizontal line across the screen: Used to recognize horizontal / horizontal lines.
(10) Estimated color temperature of light source: Used to recognize indoor / outdoor conditions.
(11) Used for partial image pattern matching of image data.
(12) GPS information, shooting date and time, subject distance, exposure information, audio data, information indicating the vertical and horizontal directions of the image, accompanying information such as focus control information

ステップＳ３：被写体辞書部２２の被写体辞書には、被写体種類を特定するための被写体条件が、図３に示すような、階層構造で予め登録されている。制御部３３は、この被写体辞書の階層構造の中から最上位の階層を選んで探索階層とする。 Step S3: In the subject dictionary of the subject dictionary unit 22, subject conditions for specifying the subject type are registered in advance in a hierarchical structure as shown in FIG. The control unit 33 selects the highest hierarchy from the hierarchical structure of the subject dictionary and sets it as the search hierarchy.

ステップＳ４：制御部３３は、探索階層に属する被写体種類１つ１つの被写体条件を被写体辞書部２２から読み出し、画像特徴量と１つずつ照合する。
例えば、図３では、最上位の探索階層であれば、人物・風景といった並列概念（最上位階層からの深さが等しい）の被写体種類が定義されている。
例えば、人物については、色相ヒストグラムに肌色の高頻度分布が現れる等の被写体条件が成立すれば、画像データ内に人物被写体が映っていると判断する。
また、風景については、色相ヒストグラムに青色や緑色の高頻度分布が現れ、かつ被写体距離が遠方である等の被写体条件が成立すれば、画像データ内に風景被写体が映っていると判断する。
以上のような動作により、探索階層に属ずる並列概念の被写体種類の１つ１つについて、画像データ内に映っているか否かの条件判定が行われる。 Step S4: The control unit 33 reads out the subject conditions for each subject type belonging to the search hierarchy from the subject dictionary unit 22, and collates them one by one with the image feature amount.
For example, in FIG. 3, in the highest search hierarchy, a subject type of a parallel concept such as a person / landscape (the depth from the highest hierarchy is equal) is defined.
For example, for a person, if a subject condition such as the appearance of a high frequency distribution of skin color appears in the hue histogram, it is determined that the person subject appears in the image data.
For a landscape, if a subject condition such as a high-frequency distribution of blue or green appears in the hue histogram and the subject distance is far away, it is determined that a landscape subject is reflected in the image data.
With the operation as described above, whether each of the parallel concept subject types belonging to the search hierarchy is reflected in the image data is determined.

ステップＳ５：制御部３３は、条件成立した被写体種類に関連するキーワードを、画像データに対応付けて、被写体キーワード記憶部３４に記録する。 Step S5: The control unit 33 records a keyword related to the subject type that satisfies the condition in the subject keyword storage unit 34 in association with the image data.

ステップＳ６：制御部３３は、条件不成立となった被写体種類に包含される下位階層を被写体辞書の参照範囲から省く。例えば、人物の被写体条件が不成立となった場合、人物被写体に包含される下位階層（図３では、特定個人，男性，女性など）を、被写体辞書の参照範囲から省く。このように被写体辞書の参照範囲を、上位階層の被写体認識結果に従って限定することにより、可能性の高い被写体種類に絞り込みながら、効率良く被写体認識を実行することができる。 Step S6: The control unit 33 omits the lower hierarchy included in the subject type that does not satisfy the condition from the reference range of the subject dictionary. For example, when a person's subject condition is not satisfied, a lower hierarchy included in the person subject (in FIG. 3, a specific individual, male, female, etc.) is omitted from the reference range of the subject dictionary. In this way, by limiting the reference range of the subject dictionary according to the subject recognition result of the upper layer, subject recognition can be performed efficiently while narrowing down to highly likely subject types.

ステップＳ７：ここで、制御部３３は、被写体辞書の探索階層が、最下位の階層に到達したか否かを判断する。
まだ、探索階層の下に下位階層が残っている場合、制御部３３は、この下位階層について被写体認識を行うため、ステップＳ８に動作を移行する。
一方、探索階層が既に最下位の階層に到達している場合、制御部３３は、新規の画像データについて、全ての被写体認識を完了したと見なして、ステップＳ９に動作を移行する。 Step S7: Here, the control unit 33 determines whether or not the search hierarchy of the subject dictionary has reached the lowest hierarchy.
If the lower hierarchy still remains below the search hierarchy, the control unit 33 shifts the operation to step S8 in order to perform subject recognition for this lower hierarchy.
On the other hand, if the search hierarchy has already reached the lowest hierarchy, the control unit 33 regards the new image data as having been subjected to all subject recognition, and proceeds to step S9.

ステップＳ８：制御部３３は、探索階層を１つ下げて、ステップＳ４に動作を戻す。 Step S8: The control unit 33 lowers the search hierarchy by one and returns the operation to step S4.

ステップＳ９：制御部３３は、被写体辞書部２２から被写体辞書のバージョン情報を取得し、画像データに対応付けて被写体キーワード記憶部３４に記録する。
以上の動作により、キーワード未付与の画像データに対する被写体認識と、キーワード付与の処理が完了する。 Step S9: The control unit 33 acquires the version information of the subject dictionary from the subject dictionary unit 22, and records it in the subject keyword storage unit 34 in association with the image data.
With the above operation, the subject recognition and keyword assignment processing for image data to which no keyword is assigned are completed.

［被写体辞書の更新に伴う処理］
図４は、辞書更新に伴う処理を説明する流れ図である。以下、この図４に示すステップ番号に沿って、動作説明を行う。 [Process associated with subject dictionary update]
FIG. 4 is a flowchart for explaining processing associated with dictionary update. Hereinafter, the operation will be described along the step numbers shown in FIG.

ステップＳ２１：被写体辞書部２２は、下記の項目を被写体辞書に追加登録する機能を有する。 Step S21: The subject dictionary unit 22 has a function of additionally registering the following items in the subject dictionary.

（１）被写体種類Ｋｄ
（２）その被写体種類Ｋｄを特定するための被写体条件Ｃｄ
（３）その被写体種類Ｋｄに関連するキーワード
（４）その被写体種類Ｋｄの被写体辞書内での階層位置 (1) Subject type Kd
(2) Subject condition Cd for specifying the subject type Kd
(3) Keyword related to the subject type Kd (4) Hierarchy position in the subject dictionary of the subject type Kd

ユーザーは、端末装置１４を介してこれら項目の追加登録を行うことにより、被写体辞書をカスタマイズすることができる。また、画像検索システム１１の管理者側で、この追加登録を実施することにより、被写体辞書を定期的にバージョンアップすることもできる。 The user can customize the subject dictionary by additionally registering these items via the terminal device 14. Moreover, the subject dictionary can be periodically upgraded by performing this additional registration on the manager side of the image search system 11.

なお、階層位置の登録については、被写体辞書の階層構造の知識が必要になるため、これを省略してもよい。また、被写体条件Ｃｄの登録については、被写体種類Ｋｄに該当する代表的な画像の一部または全部を登録することで簡易に済ませてもよい。
一方、被写体辞書部２２側では、このような追加登録のたびに、被写体辞書の更新内容を履歴として保存し、被写体辞書の版数を示すバージョン情報を更新する。 Note that the registration of the hierarchical position may be omitted because knowledge of the hierarchical structure of the subject dictionary is required. Registration of the subject condition Cd may be simplified by registering a part or all of a representative image corresponding to the subject type Kd.
On the other hand, each time such additional registration is performed, the subject dictionary unit 22 stores the updated contents of the subject dictionary as a history and updates version information indicating the version number of the subject dictionary.

ステップＳ２２：制御部３３は、被写体種類Ｋｄについて、階層位置が追加登録されたか否かを判定する。
ここで、階層位置の登録が省略されていた場合、制御部３３は、被写体種類Ｋｄの階層位置を調べるため、ステップＳ２４に動作を移行する。
一方、被写体種類Ｋｄの階層位置が登録されている場合、制御部３３はステップＳ２３に動作を移行する。 Step S22: The control unit 33 determines whether or not a layer position is additionally registered for the subject type Kd.
Here, when the registration of the hierarchy position is omitted, the control unit 33 shifts the operation to step S24 in order to check the hierarchy position of the subject type Kd.
On the other hand, when the layer position of the subject type Kd is registered, the control unit 33 shifts the operation to step S23.

ステップＳ２３：制御部３３は、登録された階層位置に従って、被写体種類Ｋｄを包含する一階層上の被写体種類Ｋｕを求める。この動作の後、制御部３３はステップＳ２７に動作を移行する。 Step S23: The control unit 33 obtains a subject type Ku one level higher than the subject type Kd according to the registered hierarchical position. After this operation, the control unit 33 shifts the operation to step S27.

ステップＳ２４：制御部３３は、被写体種類Ｋｄの階層位置が不明なため、まずテスト的に被写体認識を行う。すなわち、制御部３３は、過去に被写体認識を済ませている所定数の画像データ（テスト画像）について、画像特徴量と被写体条件Ｃｄとの照合を一通り実施する。 Step S24: Since the hierarchical position of the subject type Kd is unknown, the control unit 33 first performs subject recognition as a test. That is, the control unit 33 performs a comparison between the image feature amount and the subject condition Cd for a predetermined number of image data (test images) that have been subject-recognized in the past.

ステップＳ２５：制御部３３は、ステップＳ２４における照合結果と、テスト画像の過去の被写体認識結果とに基づいて、被写体種類Ｋｄを論理的に包含する被写体種類Ｋｕを求める。 Step S25: The control unit 33 obtains a subject type Ku that logically includes the subject type Kd based on the collation result in step S24 and the past subject recognition result of the test image.

ステップＳ２６：制御部３３は、被写体辞書部２２の被写体辞書を書き換え、被写体種類Ｋｄの階層位置を被写体種類Ｋｕの下位階層とする。 Step S26: The control unit 33 rewrites the subject dictionary in the subject dictionary unit 22 and sets the layer position of the subject type Kd as the lower layer of the subject type Ku.

ステップＳ２７：制御部３３は、過去の被写体認識において被写体種類Ｋｕが映っていると判断された画像データ群を選択する。 Step S27: The control unit 33 selects an image data group that is determined to include the subject type Ku in past subject recognition.

ステップＳ２８：制御部３３は、ステップＳ２７で選択された画像データ群の画像特徴量を、被写体条件Ｃｄと順次に照合し、条件成立する画像データを選別する。 Step S28: The control unit 33 sequentially collates the image feature amount of the image data group selected in Step S27 with the subject condition Cd, and selects image data that satisfies the condition.

ステップＳ２９：制御部３３は、ステップＳ２８で条件成立した画像データに対応付けて、被写体種類Ｋｄに関連するキーワードを被写体キーワード記憶部２４に記録する。 Step S29: The control unit 33 records a keyword related to the subject type Kd in the subject keyword storage unit 24 in association with the image data that satisfies the condition in step S28.

ステップＳ３０：制御部３３は、被写体辞書部２２から被写体辞書のバージョン情報を取得し、ステップＳ２７で選択された画像データに対応付けて、被写体キーワード記憶部３４に記録する。
以上の動作により、被写体辞書の更新に応じて、画像データに新たな被写体キーワードを追加的に付与することができる。 Step S30: The control unit 33 acquires the version information of the subject dictionary from the subject dictionary unit 22, and records it in the subject keyword storage unit 34 in association with the image data selected in step S27.
With the above operation, a new subject keyword can be additionally given to the image data in accordance with the update of the subject dictionary.

［画像検索の動作］
図５は、本実施形態における画像検索を説明する流れ図である。以下、図５に示すステップ番号に沿って、この動作を説明する。 [Image search operation]
FIG. 5 is a flowchart illustrating image search in the present embodiment. Hereinafter, this operation will be described along the step numbers shown in FIG.

ステップＳ４１：ユーザーは、所望の被写体が映っている画像データを検索するため、その被写体に関連するキーワードを端末装置１４に入力する。端末装置１４は、この入力キーワードを制御部３３に伝達する。 Step S41: The user inputs a keyword related to the subject to the terminal device 14 in order to search for image data showing the desired subject. The terminal device 14 transmits this input keyword to the control unit 33.

ステップＳ４２：制御部３３は、被写体キーワード記憶部３４に入力キーワードを照会し、入力キーワードに合致または類似する画像データを選び出す。 Step S42: The control unit 33 queries the subject keyword storage unit 34 for the input keyword, and selects image data that matches or is similar to the input keyword.

ステップＳ４３：制御部３３は、ステップＳ４２で選出された画像データのサムネイルを並べて、リンク付きサムネイル頁を作成する。制御部３３は、作成したサムネイル頁のデータを、画像検索結果として端末装置１４に転送する。ユーザーは、端末装置１４を用いてこのサムネイル頁を閲覧し、サムネイル頁のリンクを辿ることで、探索結果である画像データにアクセスすることができる。
この動作により、画像データ群の中から、入力キーワードに関連する被写体種類の映っているものを迅速に検索することが可能になる。 Step S43: The control unit 33 arranges the thumbnails of the image data selected in Step S42 and creates a linked thumbnail page. The control unit 33 transfers the created thumbnail page data to the terminal device 14 as an image search result. The user can access the image data as the search result by browsing the thumbnail page using the terminal device 14 and following the link of the thumbnail page.
With this operation, it is possible to quickly search the image data group for the subject type related to the input keyword.

［実施形態の効果など］
以上説明したように、本実施形態では、階層構造の被写体辞書を使用するため、被写体種類を絞り込むように被写体認識を実施することができる。そのため、上位階層の認識状況に従って、可能性の低い下位階層の被写体認識を柔軟に省くことができる。その結果、認識すべき被写体種類が多くなっても、被写体認識の回数を効率良く減らし、被写体認識に所要する総処理時間を短縮することができる。 [Effects of the embodiment, etc.]
As described above, in the present embodiment, since a subject dictionary having a hierarchical structure is used, subject recognition can be performed so as to narrow down subject types. Therefore, it is possible to flexibly omit the lower-level subject recognition with low possibility according to the recognition status of the upper layer. As a result, even if the number of types of subjects to be recognized increases, the number of times of subject recognition can be efficiently reduced, and the total processing time required for subject recognition can be shortened.

さらに、本実施形態は、画像データに映っている被写体種類を予め認識し、その被写体種類に関連するキーワードを記録しておく。そのため、キーワードの照合によって、所望の被写体が映っている画像データを簡易に検索することが可能になる。 Further, in the present embodiment, the subject type reflected in the image data is recognized in advance, and a keyword related to the subject type is recorded. Therefore, it is possible to easily search for image data showing a desired subject by matching keywords.

なお、本実施形態は、追加登録された被写体種類Ｋｄの照合範囲を、上位階層の被写体種類Ｋｕに過去分類された画像データ群に絞る。その結果、追加登録のたびに、過去の全ての画像データに対して被写体認識を再実行する必要がなくなり、被写体辞書のバージョンアップに伴う処理負荷を少なくすることができる。 In this embodiment, the collation range of the additionally registered subject type Kd is narrowed down to the image data group that has been previously classified into the higher-level subject type Ku. As a result, it is not necessary to re-execute subject recognition on all past image data every time additional registration is performed, and the processing load associated with version upgrade of the subject dictionary can be reduced.

［実施形態の補足事項］
なお、上述した実施形態では、画像検索の用途を中心に説明した。しかしながら、この用途に限定されるものではない。例えば、画像データに映っている被写体種類を判別し、その被写体種類に対応する画像処理（画質調整を含む）を画像データに施してもよい。この動作により、被写体種類に適した画像処理を選択実行する画像処理装置を実現することができる。 [Supplementary items of the embodiment]
In the above-described embodiment, the description is focused on the use of image search. However, it is not limited to this application. For example, the subject type shown in the image data may be determined, and image processing (including image quality adjustment) corresponding to the subject type may be performed on the image data. With this operation, an image processing apparatus that selects and executes image processing suitable for the type of subject can be realized.

また、上述した実施形態では、画像データから画像特徴量を抽出している。しかしながら、これに限定されるものではない。被写体種類を示すテンプレート画像（被写体条件に相当）と画像データとをパターンマッチングで照合することにより、画像データ内に被写体種類が映っているか否かを判定してもよい。 In the above-described embodiment, the image feature amount is extracted from the image data. However, the present invention is not limited to this. It may be determined whether or not the subject type is reflected in the image data by collating a template image indicating the subject type (corresponding to the subject condition) and image data by pattern matching.

さらに、上述した実施形態において、画像データを複数の分割領域に分け、これら分割領域ごとに被写体認識を実施してもよい。 Furthermore, in the above-described embodiment, the image data may be divided into a plurality of divided areas, and subject recognition may be performed for each of the divided areas.

また、上述した実施形態では、上下階層の包含関係に基づいて被写体種類を絞り込み、被写体認識の回数を省いている。しかしながら、これに限定されるものではない。 Further, in the above-described embodiment, subject types are narrowed down based on the inclusion relationship of the upper and lower layers, and the number of subject recognition is omitted. However, the present invention is not limited to this.

例えば、人物および女性について被写体認識を行った場合、人物であって女性でないと判定された被写体を、男性その他にそのまま分類することが可能である。このように、並列概念の排他関係に基づいて被写体種類の可能性を絞り込むことで、被写体認識の回数を効率良く省くことが可能である。 For example, when subject recognition is performed for a person and a woman, it is possible to classify a subject that is a person and is determined not to be a woman as male or other. In this way, by narrowing down the possibility of the subject type based on the parallel concept exclusive relationship, the number of times of subject recognition can be efficiently eliminated.

なお、上述した実施形態では、被写体キーワード記憶部３４に画像データのキーワードやバージョン情報を記録している。しかしながら、これに限定されるものではない。画像データのファイル内（例えばEXIFデータなど）に、キーワード情報および／またはバージョン情報を記録してもよい。 In the embodiment described above, the keyword and version information of the image data are recorded in the subject keyword storage unit 34. However, the present invention is not limited to this. Keyword information and / or version information may be recorded in a file of image data (for example, EXIF data).

また、上述した実施形態では、静止画像を前提に説明したが、これに限定されるものではない。動画像の被写体認識にも同様に適用することができる。例えば、１つの動画像ファイルのうち、人物が映っているシーンを判別し、さらに、人物シーンから女性が映っているシーンを判別することができる。 In the above-described embodiment, the description has been made on the assumption of a still image, but the present invention is not limited to this. It can be similarly applied to subject recognition of moving images. For example, it is possible to determine a scene in which a person is shown in one moving image file, and further to determine a scene in which a woman is shown from the person scene.

以上説明したように、本発明は、画像検索システムや被写体認識装置などに利用可能な技術である。 As described above, the present invention is a technique that can be used for an image search system, a subject recognition device, and the like.

画像検索システム１１の構成を示すブロック図である。1 is a block diagram illustrating a configuration of an image search system 11. FIG. 新規の画像データに対する被写体認識を説明する流れ図である。6 is a flowchart for explaining subject recognition for new image data. 被写体辞書の階層構造の一例を示す図である。It is a figure which shows an example of the hierarchical structure of a to-be-photographed dictionary. 辞書更新に伴う処理を説明する流れ図である。It is a flowchart explaining the process accompanying a dictionary update. 画像検索の動作を説明する流れ図である。It is a flowchart explaining operation | movement of an image search.

Explanation of symbols

１１…画像検索システム，１２…被写体認識装置，１３…画像データベース，１４…端末装置，２１…画像入力部，２２…被写体辞書部，２３…処理部，２４…被写体キーワード記憶部，３１：…特徴演算部，３１…特徴量演算部，３２…特徴量記憶部，３３…制御部，３４…被写体キーワード記憶部
DESCRIPTION OF SYMBOLS 11 ... Image search system, 12 ... Subject recognition apparatus, 13 ... Image database, 14 ... Terminal device, 21 ... Image input part, 22 ... Subject dictionary part, 23 ... Processing part, 24 ... Subject keyword memory | storage part, 31: ... Feature Calculation unit, 31... Feature amount calculation unit, 32... Feature amount storage unit, 33... Control unit, 34.

Claims

An image input unit for capturing image data;
A subject dictionary section for storing subject conditions for specifying a subject type;
A processing unit that collates the image data with the subject condition and identifies the subject type shown in the image data,
The subject dictionary unit classifies and stores the subject conditions in a hierarchy,
The subject recognition apparatus characterized in that the processing unit performs collation while narrowing down subject conditions located in a lower layer based on a collation result of subject conditions located in an upper layer.

The subject recognition apparatus according to claim 1,
The processor is
Predetermined keyword information related to the recognized subject type is recorded in association with the image data,
A subject recognition apparatus, wherein an image search is performed by comparing an input keyword with the keyword information corresponding to the image data.

The subject recognition apparatus according to any one of claims 1 to 2,
The subject dictionary unit has a function of additionally registering the subject type Kd and the subject condition Cd,
The processing unit obtains a higher-level subject type Ku that includes the subject type Kd according to the additional registration in the subject dictionary unit, and the image data for which the subject type Ku has been detected is compared with the subject condition Cd. An object recognition apparatus characterized by additionally performing collation.

The subject recognition apparatus according to any one of claims 1 to 3,
The subject dictionary unit has a function of managing version information of the subject condition,
The processing unit records version information of the subject condition used for specifying the subject type in association with the image data.

A subject recognition program for causing a computer to function as the subject recognition device according to any one of claims 1 to 4.

A method for a computer to implement an image search service, comprising:
Categorizing and storing subject conditions for specifying a subject type in a hierarchy;
Obtaining a subject type shown in the image data by sequentially comparing the image data with the subject condition according to the hierarchy;
Recording predetermined keyword information related to the recognized subject type in association with the image data;
A method for providing an image search service, comprising: comparing an input keyword with the keyword information corresponding to the image data and performing an image search.