JP2014211895A

JP2014211895A - Image processing apparatus and method, information processing apparatus and method, and program

Info

Publication number: JP2014211895A
Application number: JP2014135789A
Authority: JP
Inventors: 児嶋　環; Tamaki Kojima; 環児嶋; 祥弘山口; Sachihiro Yamaguchi; 幹夫酒本; Mikio Sakamoto; 竹松克浩; Katsuhiro Takematsu; 克浩竹松
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2014-07-01
Filing date: 2014-07-01
Publication date: 2014-11-13
Anticipated expiration: 2026-02-01
Also published as: JP6109118B2

Abstract

PROBLEM TO BE SOLVED: To easily search for a desired image in a device with relatively small processing capability.SOLUTION: An image analysis unit analyzes an image by applying image processing to the image. A similar feature database stores data showing features of the image obtained as a result of the image processing of the image in the image analysis unit. A transmission control unit controls transmission of features to be recorded in a digital still camera for recording information related to the image as data of the same structure as structure of the similar feature database to the digital still camera. The present invention can be applied to a hard disk recorder or a game device mounting a CPU and a hard disk thereon to which a personal computer and the digital still camera can be connected.

Description

本開示は、画像処理装置および方法、情報処理装置および方法、並びにプログラムに関し、特に、画像の特徴を抽出できるようにした画像処理装置および方法、情報処理装置および方法、並びにプログラムに関する。 The present disclosure relates to an image processing device and method, an information processing device and method, and a program, and more particularly to an image processing device and method, an information processing device and method, and a program that can extract image features.

特許文献１に、デジタルスチルカメラなどの小型のCE（consumer electronic）機器において、顔を検出したり、画像の特徴を抽出し、画像を検索する機能が各種提案されている。 Patent Document 1 proposes various functions for detecting a face, extracting image features, and searching for an image in a small CE (consumer electronic) device such as a digital still camera.

特開２００４−６２８６８号公報JP 2004-62868 A

しかしながら、小型のCE機器においては、搭載されているプロセッサの能力が限られていることから、実際のところ、限られた範囲でしか画像の解析ができない。そのため、十分な解析ができず、それにより解析の結果の用途も、その精度も限られたものとなってしまう。 However, in a small CE device, since the capability of the installed processor is limited, the image can actually be analyzed only in a limited range. Therefore, sufficient analysis cannot be performed, thereby limiting the use of the analysis result and its accuracy.

顔の検出においては、解析に使用できる画像の解像度を極めて低くしなければ、解析に極めて長い時間が必要とされ、処理の時間が、ユーザが待つことのできる時間を大幅に超えてしまう。画像の解像度を極めて低くすると、小さく写った顔、特に、集合写真における顔を検出することができなくなり、集合写真を検索したいなどのニーズに応えることができない。 In face detection, unless the resolution of an image that can be used for analysis is extremely low, analysis requires a very long time, and the processing time greatly exceeds the time that the user can wait for. If the resolution of the image is extremely low, it is impossible to detect a small face, particularly a face in a group photo, and it is not possible to meet the needs such as searching for a group photo.

また、デジタルスチルカメラなどにこのような処理を行わせると、デジタルスチルカラに処理が集中するので、デジタルスチルカメラのプロセッサで消費される電力も増えて、デジタルスチルカメラの本来の目的である、撮影できる時間が短くなったり、撮影できる画像の枚数が減ってしまうなどの弊害も生じてしまう。 In addition, if such processing is performed on a digital still camera or the like, the processing is concentrated on the digital still camera, so the power consumed by the processor of the digital still camera is increased, which is the original purpose of the digital still camera. There are also disadvantages such as the time that can be taken is shortened and the number of images that can be taken is reduced.

一方で、デジタルスチルカメラの普及や、携帯電話機へのスチルカメラ機能の搭載が進み、日常生活の中で、写真（静止画像）を撮影する機会は着実に増えている。しかし、撮影した画像をデジタルスチルカメラ本体で閲覧しようとする場合に画像を検索する方法は、縮小した画像（いわゆる、サムネイル画像）を撮影順に表示し閲覧する程度でしかない。デジタルスチルカメラの検索の利便性は、パーソナルコンピュータなどで実行される画像管理プログラムにおける利便性に遙かに劣る。 On the other hand, with the spread of digital still cameras and the mounting of still camera functions in mobile phones, opportunities for taking photographs (still images) are steadily increasing in daily life. However, the method of searching for an image when the photographed image is to be browsed on the digital still camera body is only to display and browse reduced images (so-called thumbnail images) in the order of shooting. The convenience of searching for a digital still camera is far inferior to the convenience of an image management program executed on a personal computer or the like.

このようなことから、大容量のストレージおよび写真アルバム機能を備えるデジタルスチルカメラにおいて、使用者の見たい画像を簡単に探し出す機能が必要とされている。 For this reason, a digital still camera having a large-capacity storage and a photo album function is required to have a function of easily searching for an image that the user wants to see.

本発明は、このような状況に鑑みてなされたものであり、処理能力の比較的小さい機器において、簡単に、所望の画像を検索することができるようにするものである。 The present invention has been made in view of such a situation, and makes it possible to easily search for a desired image in a device having a relatively small processing capability.

本開示の第１の側面の画像処理装置は、画像を解析し、前記画像に含まれる顔の画像に関する顔情報を抽出する特徴抽出手段と、前記特徴抽出手段により前記画像から抽出された顔情報に基づき、前記画像に関連付けられたメタデータを生成する生成手段と、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータとを送信する送信手段とを備え、前記メタデータは、外部機器において、前記外部機器の使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 An image processing apparatus according to a first aspect of the present disclosure includes a feature extraction unit that analyzes an image and extracts face information related to a face image included in the image, and the face information extracted from the image by the feature extraction unit. Generating means for generating metadata associated with the image, a reduced image selected from the reduced images respectively corresponding to the plurality of images, and the image corresponding to the selected reduced image Transmission means for transmitting the associated metadata, and the metadata is configured to allow a user of the external device to search for an image based on the metadata in the external device.

前記送信手段は、前記外部機器の使用者により選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータとを送信することができる。 The transmission unit can transmit a reduced image selected by a user of the external device and metadata associated with the image corresponding to the selected reduced image.

前記特徴量抽出手段は、前記画像に含まれる色情報を抽出し、前記生成手段は、前記色情報に基づき、前記メタデータを生成し、前記メタデータは、前記画像の色に関する情報を含むことができる。 The feature amount extraction unit extracts color information included in the image, the generation unit generates the metadata based on the color information, and the metadata includes information on the color of the image. Can do.

前記メタデータは、前記外部機器において画像の検索に用いるメタデータが選択可能に構成される。 The metadata is configured such that metadata used for image search in the external device can be selected.

前記メタデータは、画像に含まれる顔の幅および高さの情報を含むことができる。 The metadata may include information on the width and height of a face included in the image.

前記メタデータは、文字列を含むコメント情報を含むことができる。 The metadata may include comment information including a character string.

前記メタデータは、グループを特定するデータであるグループＩＤを含むことができる。 The metadata may include a group ID that is data for specifying a group.

前記顔情報は、画像に含まれる顔の数の情報を含むことができる。 The face information may include information on the number of faces included in the image.

前記外部機器とネットワークを介して接続されている。 It is connected to the external device via a network.

本開示の第１の側面の画像処理方法は、画像処理装置が、画像を解析し、前記画像に含まれる顔の画像に関する顔情報を抽出し、前記画像から抽出された顔情報に基づき、前記画像に関連付けられたメタデータを生成し、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータとを送信し、前記メタデータは、外部機器において、前記外部機器の使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 In the image processing method according to the first aspect of the present disclosure, the image processing apparatus analyzes an image, extracts face information regarding a face image included in the image, and based on the face information extracted from the image, Metadata associated with an image is generated, and a reduced image selected from a plurality of reduced images respectively corresponding to the plurality of images, and metadata associated with the image corresponding to the selected reduced image The metadata transmitted is configured to allow a user of the external device to search for an image based on the metadata in the external device.

本開示の第１の側面のプログラムは、画像を解析し、前記画像に含まれる顔の画像に関する顔情報を抽出する特徴抽出手段と、前記特徴抽出手段により前記画像から抽出された顔情報に基づき、前記画像に関連付けられたメタデータを生成する生成手段と、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータとを送信する送信手段として、コンピュータを機能させ、前記メタデータは、外部機器において、前記外部機器の使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 A program according to a first aspect of the present disclosure is based on feature extraction means that analyzes an image and extracts face information related to a face image included in the image, and face information extracted from the image by the feature extraction means. Generation means for generating metadata associated with the image, a reduced image selected from the reduced images respectively corresponding to the plurality of images, and the image corresponding to the selected reduced image. The computer functions as a transmission means for transmitting the received metadata, and the metadata allows the user of the external device to search for an image based on the metadata in the external device. .

本開示の第２の側面の情報処理装置は、画像を解析し、前記画像から抽出された前記画像に含まれる顔の画像に関する顔情報に基づき、前記画像に関連付けられたメタデータを生成するサーバから、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータに基づいて、検索された画像を提示する提示手段とを備え、前記メタデータは、使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 An information processing apparatus according to a second aspect of the present disclosure is a server that analyzes an image and generates metadata associated with the image based on face information related to a face image included in the image extracted from the image The retrieved image is presented based on the reduced image selected from the reduced images respectively corresponding to the plurality of images and the metadata associated with the image corresponding to the selected reduced image. Presenting means, and the metadata is configured to allow a user to search for an image based on the metadata.

前記受信手段は、前記情報処理装置の使用者により選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータとを受信することができる。 The receiving unit can receive a reduced image selected by a user of the information processing apparatus and metadata associated with the image corresponding to the selected reduced image.

画像を撮像する画像撮像手段をさらに備えることができる。 An image capturing unit that captures an image can be further provided.

前記メタデータは、前記情報処理装置において、前記情報処理装置の使用者が前記特メタデータに基づいて画像を検索することを可能とする構成である。 The metadata is configured to allow a user of the information processing apparatus to search for an image based on the special metadata in the information processing apparatus.

前記メタデータは、前記サーバにおいて前記画像から抽出された色情報に基づき生成されたデータであり、前記画像の色に関する情報を含むことができる。 The metadata is data generated based on color information extracted from the image in the server, and may include information on the color of the image.

本開示の第２の側面の情報処理方法は、情報処理装置が、画像を解析し、前記画像から抽出された前記画像に含まれる顔の画像に関する顔情報に基づき、前記画像に関連付けられたメタデータを生成するサーバから、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータを受信し、受信された前記メタデータに基づいて、検索された画像を提示し、前記メタデータは、使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 In the information processing method according to the second aspect of the present disclosure, an information processing apparatus analyzes an image, and based on face information related to a face image included in the image extracted from the image, a meta information associated with the image. A reduced image selected from a reduced image corresponding to each of the plurality of images and metadata associated with the image corresponding to the selected reduced image are received from a server that generates data and received. The retrieved image is presented based on the metadata, and the metadata is configured to allow a user to retrieve an image based on the metadata.

本開示の第２の側面のプログラムは、サーバにより、画像が解析されて、前記画像から抽出された前記画像に含まれる顔の画像に関する顔情報に基づき生成された、前記画像に関連付けられたメタデータを受信する受信手段と、前記受信手段により受信された前記メタデータを記録する記録手段と、前記メタデータに関係付けられた画像を検索する検索手段と、前記検索手段により検索された画像を提示する提示手段として、コンピュータを機能させ、前記メタデータは、使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 The program according to the second aspect of the present disclosure includes a meta data associated with the image generated by analyzing an image by a server and generated based on face information regarding a face image included in the image extracted from the image. Receiving means for receiving data; recording means for recording the metadata received by the receiving means; search means for searching for an image related to the metadata; and images searched by the searching means A computer is made to function as the presenting means for presenting, and the metadata is configured to allow a user to search for an image based on the metadata.

本開示の第１の側面においては、画像が解析され、前記画像に含まれる顔の画像に関する顔情報が抽出され、前記画像から抽出された顔情報に基づき、前記画像に関連付けられたメタデータが生成される。また、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータとが送信される。そして、前記メタデータは、外部機器において、前記外部機器の使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 In the first aspect of the present disclosure, an image is analyzed, face information related to a face image included in the image is extracted, and metadata associated with the image is based on the face information extracted from the image. Generated. Further, a reduced image selected from the reduced images respectively corresponding to the plurality of images, and metadata associated with the image corresponding to the selected reduced image are transmitted. The metadata is configured to allow a user of the external device to search for an image based on the metadata in the external device.

本開示の第２の側面においては、画像を解析し、前記画像から抽出された前記画像に含まれる顔の画像に関する顔情報に基づき、前記画像に関連付けられたメタデータを生成するサーバから、複数の前記画像にそれぞれ対応する縮小画像の中から選択された縮小画像と、前記選択された縮小画像に対応する前記画像に関連付けられたメタデータが受信される。また、受信された前記メタデータに基づいて、検索された画像が提示される。そして、前記メタデータは、使用者が前記メタデータに基づいて画像を検索することを可能とする構成である。 In the second aspect of the present disclosure, a plurality of analysis results are obtained from a server that generates metadata associated with the image based on face information regarding the face image included in the image extracted from the image. A reduced image selected from the reduced images corresponding to the images, and metadata associated with the image corresponding to the selected reduced image. In addition, the retrieved image is presented based on the received metadata. The metadata is configured to allow a user to search for an image based on the metadata.

本開示によれば、機器において画像を検索することができる。特に、処理能力の比較的小さい機器において、簡単に、所望の画像を検索することができる。 According to the present disclosure, it is possible to search for an image in a device. In particular, it is possible to easily search for a desired image in a device having a relatively small processing capability.

本発明の一実施の形態の画像処理システムの構成を示す図である。It is a figure which shows the structure of the image processing system of one embodiment of this invention. デジタルスチルカメラの構成の例を示すブロック図である。It is a block diagram which shows the example of a structure of a digital still camera. サーバの構成の例を示すブロック図である。It is a block diagram which shows the example of a structure of a server. プログラムを実行するMPUにより実現される機能の構成を示す図である。It is a figure which shows the structure of the function implement | achieved by MPU which performs a program. プログラムを実行するCPUにより実現される機能の構成を示す図である。It is a figure which shows the structure of the function implement | achieved by CPU which performs a program. 画像解析部の構成の例を示すブロック図である。It is a block diagram which shows the example of a structure of an image analysis part. 撮影の処理を説明するフローチャートである。It is a flowchart explaining the process of imaging | photography. 本画像と縮小画像との関係付けを示す図である。It is a figure which shows correlation with a main image and a reduced image. バックアップの処理を説明するフローチャートである。It is a flowchart explaining the process of backup. 画像解析の処理を説明するフローチャートである。It is a flowchart explaining the process of image analysis. 色ヒストグラムの生成を説明する図である。It is a figure explaining the production | generation of a color histogram. 垂直成分ヒストグラムおよび水平成分ヒストグラムの生成を説明する図である。It is a figure explaining the production | generation of a vertical component histogram and a horizontal component histogram. 垂直成分ヒストグラムおよび水平成分ヒストグラムの生成を説明する図である。It is a figure explaining the production | generation of a vertical component histogram and a horizontal component histogram. 画像のバックアップとメタデータの書き戻しを説明する図である。It is a figure explaining the backup of an image, and the writing-back of metadata. メタデータの具体例を示す図である。It is a figure which shows the specific example of metadata. コンテンツデータベースまたはコンテンツデータベースに格納されているメタデータの構成を示す図である。It is a figure which shows the structure of the metadata stored in a content database or a content database. コンテンツデータベースに格納されているメタデータおよび類似特徴データベースに格納されているメタデータの構造を示す図である。It is a figure which shows the structure of the metadata stored in the metadata stored in the content database, and the similar feature database. 類似特徴アイテムの構造を示す図である。It is a figure which shows the structure of a similar feature item. 画像の取得の処理を説明するフローチャートである。It is a flowchart explaining the process of acquisition of an image. 画像の取得とメタデータの書き込みとを説明する図である。It is a figure explaining acquisition of an image and writing of metadata. 検索の処理を説明するフローチャートである。It is a flowchart explaining the process of a search. デジタルスチルカメラおよびサーバにおいて共通するメタデータと画像との関係付けを説明する図である。It is a figure explaining the correlation with the metadata and image which are common in a digital still camera and a server. 検索の処理を説明するフローチャートである。It is a flowchart explaining the process of a search. 縮小画像の表示の例を示す図である。It is a figure which shows the example of a display of a reduction image. 縮小画像の表示の例を示す図である。It is a figure which shows the example of a display of a reduction image. 類似する画像の検索の処理を説明するフローチャートである。It is a flowchart explaining the search process of a similar image. メタデータおよび距離の構造を示す図である。It is a figure which shows the structure of metadata and distance. コンテンツデータベース、類似結果データベース、および時間グループデータベースのそれぞれのレコードの関係付けを示す図である。It is a figure which shows the correlation of each record of a content database, a similar result database, and a time group database. 類似の順の表示の例を示す図である。It is a figure which shows the example of the display of a similar order. 類似の順の表示と、時系列の表示との切り替えを説明する図である。It is a figure explaining the switch of the display of a similar order, and the display of a time series. 検索の処理を説明するフローチャートである。It is a flowchart explaining the process of a search. 類似の順の表示と、時系列の表示との切り替えを説明する図である。It is a figure explaining the switch of the display of a similar order, and the display of a time series. 色特徴抽出部の構成の例を示すブロック図である。It is a block diagram which shows the example of a structure of a color feature extraction part. 関連度抽出部対応保持部に記録されている対応情報の例を示す図である。It is a figure which shows the example of the correspondence information currently recorded on the association degree extraction part correspondence holding | maintenance part. 抽出特徴保持部に記録される関連度の論理構造を示す図である。It is a figure which shows the logical structure of the relevance degree recorded on an extraction feature holding part. 色特徴抽出の処理の詳細を説明するフローチャートである。It is a flowchart explaining the detail of the process of color feature extraction. 関連度抽出の処理の詳細を説明するフローチャートである。It is a flowchart explaining the detail of the process of relevance extraction. RGBの色空間を示す図である。It is a figure which shows the RGB color space. Ｌ*ａ*ｂ*空間を示す図である。It is a figure which shows L * a * b * space. 白のサブ空間および黒のサブ空間の例を示す図である。It is a figure which shows the example of a white subspace and a black subspace. 彩度境界および輝度境界の例を示す図である。It is a figure which shows the example of a saturation boundary and a brightness | luminance boundary. 緑、青、赤、および黄のそれぞれのサブ空間の例を示す図である。It is a figure which shows the example of each subspace of green, blue, red, and yellow. 関連度抽出処理の詳細の他の例を説明するフローチャートである。It is a flowchart explaining the other example of the detail of a relevance degree extraction process. 関連度抽出処理の詳細のさらに他の例を説明するフローチャートである。It is a flowchart explaining the further another example of the detail of a relevance degree extraction process. 判断データの例を示す図である。It is a figure which shows the example of judgment data. 関連度抽出処理の詳細のさらに他の例を説明するフローチャートである。It is a flowchart explaining the further another example of the detail of a relevance degree extraction process. 検索の処理を説明するフローチャートである。It is a flowchart explaining the process of a search. 検索におけるGUIの画像の例を示す図である。It is a figure which shows the example of the image of GUI in a search. 検索された画像の例を示す図である。It is a figure which shows the example of the searched image.

以下、本開示を実施するための形態（以下実施の形態とする）について説明する。 Hereinafter, modes for carrying out the present disclosure (hereinafter referred to as embodiments) will be described.

図１は、本発明の一実施の形態の画像処理システムの構成を示す図である。機器の一例であるデジタルスチルカメラ１１は、画像を撮影して、撮影した画像を画像処理装置の一例であるサーバ１３に供給する。機器の一例である携帯電話機１２は、画像を撮影して、撮影した画像をサーバ１３に供給する。この場合、デジタルスチルカメラ１１および携帯電話機１２は、撮影した画像から、その画像を縮小した縮小画像を生成する。 FIG. 1 is a diagram showing a configuration of an image processing system according to an embodiment of the present invention. A digital still camera 11 that is an example of a device captures an image and supplies the captured image to a server 13 that is an example of an image processing apparatus. The mobile phone 12, which is an example of a device, captures an image and supplies the captured image to the server 13. In this case, the digital still camera 11 and the mobile phone 12 generate a reduced image obtained by reducing the image from the captured image.

なお、デジタルスチルカメラ１１、携帯電話機１２、またはサーバ１３は、表示制御装置の一例でもある。 The digital still camera 11, the mobile phone 12, or the server 13 is also an example of a display control device.

サーバ１３は、パーソナルコンピュータ、据え置き型のレコーダ、ゲーム機器、または専用機器などからなり、デジタルスチルカメラ１１または携帯電話機１２から供給された画像を記録する。また、サーバ１３は、デジタルスチルカメラ１１または携帯電話機１２から供給された画像を画像処理し、画像の特徴を抽出する。サーバ１３は、その結果得られたデータをデジタルスチルカメラ１１または携帯電話機１２に供給する。 The server 13 includes a personal computer, a stationary recorder, a game device, a dedicated device, or the like, and records an image supplied from the digital still camera 11 or the mobile phone 12. In addition, the server 13 performs image processing on an image supplied from the digital still camera 11 or the mobile phone 12 and extracts image features. The server 13 supplies the data obtained as a result to the digital still camera 11 or the mobile phone 12.

さらに、サーバ１３は、ネットワーク１４を介してWebサーバ１５−１またはWebサーバ１５−２から画像を取得して、取得した画像を記録する。また、サーバ１３は、Webサーバ１５−１またはWebサーバ１５−２から取得した画像を画像処理するとともに、取得した画像から、その画像を縮小した縮小画像を生成する。サーバ１３は、画像処理の結果得られたデータを、縮小画像と共にデジタルスチルカメラ１１または携帯電話機１２に供給する。 Furthermore, the server 13 acquires an image from the Web server 15-1 or the Web server 15-2 via the network 14, and records the acquired image. In addition, the server 13 performs image processing on the image acquired from the Web server 15-1 or the Web server 15-2, and generates a reduced image obtained by reducing the image from the acquired image. The server 13 supplies the data obtained as a result of the image processing to the digital still camera 11 or the mobile phone 12 together with the reduced image.

デジタルスチルカメラ１１または携帯電話機１２は、サーバ１３から供給された、画像処理の結果得られたデータを基に、記録している画像から、所望の画像を検索する。また、サーバ１３は、画像処理の結果得られたデータを基に、記録している画像から、所望の画像を検索する。 The digital still camera 11 or the mobile phone 12 searches for a desired image from the recorded images based on the data obtained as a result of the image processing supplied from the server 13. Further, the server 13 searches for a desired image from the recorded images based on the data obtained as a result of the image processing.

デジタルスチルカメラ１１、携帯電話機１２、およびサーバ１３において、画像処理の結果得られた同じデータを基に画像を検索するので、所望の画像が同様に検索できる。 Since the digital still camera 11, the mobile phone 12, and the server 13 search for an image based on the same data obtained as a result of image processing, a desired image can be similarly searched.

図２は、デジタルスチルカメラ１１の構成を示すブロック図である。デジタルスチルカメラ１１は、撮影レンズ３１、絞り３２、撮像デバイス３３、アナログ信号処理部３４、A/D（Analog to Digital）コンバータ３５、デジタル信号処理部３６、MPU（Micro Processing Unit）３７、メモリ３８、D/A（Digital to Analog）コンバータ３９、モニタ４０、圧縮伸張部４１、カードI/F（インタフェース）４２、メモリカード４３、AF（auto focus）モータズームモータ４４、コントロール回路４５、EEPROM（Electrically Erasable Programmable Read Only Memory）４６、通信部４７、通信部４８、および入力部４９から構成される。 FIG. 2 is a block diagram showing a configuration of the digital still camera 11. The digital still camera 11 includes a photographing lens 31, an aperture 32, an imaging device 33, an analog signal processing unit 34, an A / D (Analog to Digital) converter 35, a digital signal processing unit 36, an MPU (Micro Processing Unit) 37, and a memory 38. , D / A (Digital to Analog) converter 39, monitor 40, compression / decompression unit 41, card I / F (interface) 42, memory card 43, AF (auto focus) motor zoom motor 44, control circuit 45, EEPROM (Electrically (Erasable Programmable Read Only Memory) 46, communication unit 47, communication unit 48, and input unit 49.

撮影レンズ３１は、絞り３２を介して、被写体の光学的な像を撮像デバイス３３の受光面に結像させる。撮影レンズ３１は、１枚又は複数枚のレンズで構成される。撮影レンズ３１は、単焦点レンズでもよいし、ズームレンズ等の焦点距離可変のものでもよい。 The photographing lens 31 forms an optical image of the subject on the light receiving surface of the imaging device 33 via the diaphragm 32. The photographing lens 31 is composed of one or a plurality of lenses. The taking lens 31 may be a single focus lens or a variable focal length such as a zoom lens.

絞り３２は、撮像デバイス３３の受光面に結像される光学的な像の光量を調整する。 The diaphragm 32 adjusts the amount of optical image formed on the light receiving surface of the imaging device 33.

撮像デバイス３３は、CCD（Charge Coupled Device）またはCMOS（complementary metal oxide semiconductor）センサなどからなり、受光面に結像した光学的な像を電気信号に変換する。撮像デバイス３３は、変換により得られた電気信号をアナログ信号処理部３４に供給する。 The imaging device 33 includes a charge coupled device (CCD) or a complementary metal oxide semiconductor (CMOS) sensor, and converts an optical image formed on the light receiving surface into an electrical signal. The imaging device 33 supplies the electrical signal obtained by the conversion to the analog signal processing unit 34.

アナログ信号処理部３４は、サンプリングホールド回路、色分離回路、ゲイン調整回路等を含み、撮像デバイス３３からの電気信号に相関二重サンプリング（ＣＤＳ）処理を適用すると共に、電気信号をＲ，Ｇ，Ｂの各色信号に分離し、各色信号の信号レベルを調整（プリホワイトバランス処理）する。アナログ信号処理部３４は、色信号をA/Dコンバータ３５に供給する。 The analog signal processing unit 34 includes a sampling hold circuit, a color separation circuit, a gain adjustment circuit, and the like, applies a correlated double sampling (CDS) process to the electrical signal from the imaging device 33, and converts the electrical signal to R, G, Separated into B color signals, the signal level of each color signal is adjusted (pre-white balance processing). The analog signal processing unit 34 supplies the color signal to the A / D converter 35.

A/Dコンバータ３５は、色信号のそれぞれをデジタル信号に変換し、デジタル信号をデジタル信号処理部３６に供給する。 The A / D converter 35 converts each color signal into a digital signal and supplies the digital signal to the digital signal processing unit 36.

デジタル信号処理部３６は、輝度・色差信号生成回路、シャープネス補正回路、コントラスト補正回路、ホワイトバランス補正回路等を含み、MPU３７の制御に基づいて、デジタル信号を、輝度信号（Ｙ信号）および色差信号（Ｃr,Ｃb信号）に変換する。デジタル信号処理部３６は、各種の処理を適用したデジタル信号をメモリ３８に供給する。 The digital signal processing unit 36 includes a luminance / color difference signal generation circuit, a sharpness correction circuit, a contrast correction circuit, a white balance correction circuit, and the like. Based on the control of the MPU 37, the digital signal is converted into a luminance signal (Y signal) and a color difference signal. (Cr, Cb signal). The digital signal processing unit 36 supplies a digital signal to which various processes are applied to the memory 38.

MPU３７は、組込型のプロセッサであり、プログラムを実行して、デジタルスチルカメラ１１の全体を制御する。 The MPU 37 is an embedded processor, and executes a program to control the entire digital still camera 11.

メモリ３８は、DRAM（Dynamic Random Access Memory）などからなり、MPU３７の制御に基づいて、デジタル信号処理部３６から供給されたデジタル信号を一時的に記憶する。D/Aコンバータ３９は、メモリ３８からデジタル信号を読み出して、読み出したデジタル信号をアナログ信号に変換して、モニタ４０に供給する。モニタ４０は、LCD（Liquid Crystal Display）または有機EL（Electro Luminescence）ディスプレイなどからなり、D/Aコンバータ３９から供給されたアナログ信号に基づいて画像を表示する。 The memory 38 includes a DRAM (Dynamic Random Access Memory) or the like, and temporarily stores the digital signal supplied from the digital signal processing unit 36 based on the control of the MPU 37. The D / A converter 39 reads a digital signal from the memory 38, converts the read digital signal into an analog signal, and supplies the analog signal to the monitor 40. The monitor 40 includes an LCD (Liquid Crystal Display) or an organic EL (Electro Luminescence) display, and displays an image based on an analog signal supplied from the D / A converter 39.

撮像デバイス３３から出力される電気信号によってメモリ３８のデジタル信号が定期的に書き換えられ、そのデジタル信号から生成されるアナログ信号がモニタ４０に供給されることにより、撮像デバイス３３に結像される画像がリアルタイムにモニタ４０に表示される。 The digital signal in the memory 38 is periodically rewritten by the electrical signal output from the imaging device 33, and the analog signal generated from the digital signal is supplied to the monitor 40, whereby the image formed on the imaging device 33. Is displayed on the monitor 40 in real time.

モニタ４０にGUI（Graphical User Interface）の画像を表示させる場合には、MPU３７は、GUIの画像を表示させるための画像データをメモリ３８に書き込んで、D/Aコンバータ３９に画像データをアナログ信号に変換させ、モニタ４０に、そのアナログ信号に基づいてGUIの画像を表示させる。 When displaying a GUI (Graphical User Interface) image on the monitor 40, the MPU 37 writes image data for displaying the GUI image in the memory 38, and the D / A converter 39 converts the image data into an analog signal. The GUI image is displayed on the monitor 40 based on the analog signal.

圧縮伸張部４１は、MPU３７の制御の基に、メモリ３８に記憶されているデジタル信号をJPEG（Joint Photographic Experts Group）またはJPEG2000などの方式で符号化する。圧縮伸張部４１は、符号化により得られた画像データを、カードI/F（インタフェース）４２を介してメモリカード４３に供給する。メモリカード４３は、半導体メモリまたはHDD（Hard Disk Drive）などを内蔵し、着脱自在に、デジタルスチルカメラ１１に装着され、デジタルスチルカメラ１１に装着されている場合、カードI/F４２と電気的に接続する。メモリカード４３は、カードI/F４２から供給される画像データを記録する。 Under the control of the MPU 37, the compression / decompression unit 41 encodes the digital signal stored in the memory 38 using a scheme such as JPEG (Joint Photographic Experts Group) or JPEG2000. The compression / decompression unit 41 supplies the image data obtained by the encoding to the memory card 43 via the card I / F (interface) 42. The memory card 43 has a built-in semiconductor memory or HDD (Hard Disk Drive) and is detachably attached to the digital still camera 11. When the memory card 43 is attached to the digital still camera 11, the memory card 43 is electrically connected to the card I / F 42. Connecting. The memory card 43 records the image data supplied from the card I / F 42.

カードI/F４２は、MPU３７からの指令に応じて、電気的に接続されているメモリカード４３への画像データの記録、およびメモリカード４３からの画像データの読み出しを制御する。 The card I / F 42 controls recording of image data to the electrically connected memory card 43 and reading of image data from the memory card 43 in accordance with a command from the MPU 37.

メモリカード４３に記録されている画像データは、カードI/F４２を介して、読み出されて、圧縮伸張部４１において、デジタル信号に復号される。 The image data recorded on the memory card 43 is read out via the card I / F 42 and decoded into a digital signal by the compression / decompression unit 41.

AFモータズームモータ４４は、コントロール回路４５によって駆動され、撮影レンズ３１の焦点や焦点距離を変更するように、撮像デバイス３３に対して撮影レンズ３１（を構成するレンズ）を移動させる。コントロール回路４５は、MPU３７からの指令に応じて、AFモータズームモータ４４を駆動するとともに、絞り３２や撮像デバイス３３を制御する。 The AF motor zoom motor 44 is driven by the control circuit 45 and moves the photographic lens 31 (a lens constituting the photographic lens 31) with respect to the imaging device 33 so as to change the focal point and focal length of the photographic lens 31. The control circuit 45 drives the AF motor zoom motor 44 and controls the aperture 32 and the imaging device 33 in accordance with a command from the MPU 37.

EEPROM４６は、MPU３７により実行されるプログラムや各種のデータを記憶する。 The EEPROM 46 stores programs executed by the MPU 37 and various data.

通信部４７は、USB（Universal Serial Bus）またはIEEE（Institute of Electrical and Electronic Engineers）1394などの規格に準拠するように構成され、有線の伝送媒体を介して、サーバ１３と各種のデータを送受信する。 The communication unit 47 is configured to comply with a standard such as USB (Universal Serial Bus) or IEEE (Institute of Electrical and Electronic Engineers) 1394, and transmits / receives various data to / from the server 13 via a wired transmission medium. .

通信部４８は、IEEE802.11a、IEEE802.11b、若しくはIEEE802.11g、またはブルートゥースなどの規格に準拠するように構成され、無線の伝送媒体を介して、サーバ１３と各種のデータを送受信する。 The communication unit 48 is configured to comply with a standard such as IEEE802.11a, IEEE802.11b, IEEE802.11g, or Bluetooth, and transmits / receives various data to / from the server 13 via a wireless transmission medium.

入力部４９は、スイッチ、ボタン、またはタッチパネルなどからなり、使用者から加えられた操作に応じた信号をMPU３７に供給する。 The input unit 49 includes a switch, a button, a touch panel, or the like, and supplies a signal corresponding to an operation applied by the user to the MPU 37.

なお、メモリカード４３に画像データが記録されると説明したが、画像データが記録される媒体は、半導体メモリまたは磁気ディスクに限るものではなく、光ディスクまたは光磁気ディスクなどでもよく、電子的、磁気的、光学的、若しくは量子的、またはこれらの組み合わせによる方式に従って読み書き可能な種々の媒体を用いることができる。これらの媒体は、デジタルスチルカメラ１１に内蔵するようにしてもよい。 Although it has been described that the image data is recorded on the memory card 43, the medium on which the image data is recorded is not limited to a semiconductor memory or a magnetic disk, and may be an optical disk or a magneto-optical disk. Various media that can be read and written can be used in accordance with a method based on optical, optical, or quantum, or a combination thereof. These media may be built in the digital still camera 11.

以下、画像データを単に画像とも称する。 Hereinafter, the image data is also simply referred to as an image.

図３は、サーバ１３の構成の例を示すブロック図である。CPU（Central Processing Unit）７１は、ROM（Read Only Memory）７２、または記憶部７８に記憶されているプログラムに従って各種の処理を実行する。RAM（Random Access Memory）７３には、CPU７１が実行するプログラムやデータなどが適宜記憶される。これらのCPU７１、ROM７２、およびRAM７３は、バス７４により相互に接続されている。 FIG. 3 is a block diagram illustrating an example of the configuration of the server 13. A CPU (Central Processing Unit) 71 executes various processes according to a program stored in a ROM (Read Only Memory) 72 or a storage unit 78. A RAM (Random Access Memory) 73 appropriately stores programs executed by the CPU 71 and data. These CPU 71, ROM 72, and RAM 73 are connected to each other by a bus 74.

CPU７１にはまた、バス７４を介して入出力インタフェース７５が接続されている。入出力インタフェース７５には、キーボード、マウス、マイクロホンなどよりなる入力部７６、ディスプレイ、スピーカなどよりなる出力部７７が接続されている。CPU７１は、入力部７６から入力される指令に対応して各種の処理を実行する。そして、CPU７１は、処理の結果を出力部７７に出力する。 An input / output interface 75 is also connected to the CPU 71 via a bus 74. The input / output interface 75 is connected to an input unit 76 such as a keyboard, mouse, and microphone, and an output unit 77 such as a display and a speaker. The CPU 71 executes various processes in response to commands input from the input unit 76. Then, the CPU 71 outputs the processing result to the output unit 77.

入出力インタフェース７５に接続されている記憶部７８は、例えばハードディスクからなり、CPU７１が実行するプログラムや各種のデータを記憶する。通信部７９は、USBまたはIEEE1394などの規格に準拠するように構成され、有線の伝送媒体を介して、デジタルスチルカメラ１１または携帯電話機１２と各種のデータを送受信するか、または、IEEE802.11a、IEEE802.11b、若しくはIEEE802.11g、またはブルートゥースなどの規格に準拠するように構成され、無線の伝送媒体を介して、デジタルスチルカメラ１１または携帯電話機１２と各種のデータを送受信する。通信部８０は、インターネットやローカルエリアネットワークなどのネットワーク１４を介してWebサーバ１５−１またはWebサーバ１５−２と通信する。 The storage unit 78 connected to the input / output interface 75 includes, for example, a hard disk, and stores a program executed by the CPU 71 and various data. The communication unit 79 is configured to comply with a standard such as USB or IEEE1394, and transmits / receives various data to / from the digital still camera 11 or the mobile phone 12 via a wired transmission medium, or IEEE802.11a, It is configured to comply with standards such as IEEE802.11b, IEEE802.11g, or Bluetooth, and transmits / receives various data to / from the digital still camera 11 or the mobile phone 12 via a wireless transmission medium. The communication unit 80 communicates with the Web server 15-1 or the Web server 15-2 via the network 14 such as the Internet or a local area network.

また、通信部８０を介してプログラムを取得し、記憶部７８に記憶してもよい。 A program may be acquired via the communication unit 80 and stored in the storage unit 78.

入出力インタフェース７５に接続されているドライブ８１は、磁気ディスク、光ディスク、光磁気ディスク、或いは半導体メモリなどのリムーバブルメディア８２が装着されたとき、それらを駆動し、そこに記録されているプログラムやデータなどを取得する。取得されたプログラムやデータは、必要に応じて記憶部７８に転送され、記憶される。 The drive 81 connected to the input / output interface 75 drives a removable medium 82 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, and drives the program or data recorded therein. Get etc. The acquired program and data are transferred to and stored in the storage unit 78 as necessary.

図４は、プログラムを実行するMPU３７により実現される機能の構成を示す図である。MPU３７は、プログラムを実行することにより、撮影制御部１０１、縮小画像生成部１０２、メタデータ生成部１０３、エントリ生成部１０４、記録制御部１０５、表示制御部１０６、検索部１０７、送信制御部１０８、受信制御部１０９、画像保持部１１０、コンテンツデータベース１１１、類似特徴データベース１１２、類似結果データベース１１３、時間グループデータベース１１４、および検索結果保持部１１５を実現する。 FIG. 4 is a diagram illustrating a configuration of functions realized by the MPU 37 that executes a program. By executing the program, the MPU 37 performs shooting control unit 101, reduced image generation unit 102, metadata generation unit 103, entry generation unit 104, recording control unit 105, display control unit 106, search unit 107, and transmission control unit 108. A reception control unit 109, an image holding unit 110, a content database 111, a similar feature database 112, a similar result database 113, a time group database 114, and a search result holding unit 115.

撮影制御部１０１は、撮影レンズ３１乃至デジタル信号処理部３６およびメモリ３８乃至コントロール回路４５を制御することで、デジタルスチルカメラ１１における撮影を制御する。撮影制御部１０１は、撮影した画像を、画像保持部１１０としてのメモリカード４３の記録領域に記録させる。 The photographing control unit 101 controls photographing in the digital still camera 11 by controlling the photographing lens 31 through the digital signal processing unit 36 and the memory 38 through the control circuit 45. The shooting control unit 101 records the shot image in the recording area of the memory card 43 serving as the image holding unit 110.

縮小画像生成部１０２は、撮影された画像のデジタル信号をメモリ３８から読み出して、撮影された画像を縮小し、縮小画像を生成する。生成された縮小画像は、カードI/F４２を介してメモリカード４３に供給され、画像保持部１１０としてのメモリカード４３の記録領域に記録される。 The reduced image generation unit 102 reads out a digital signal of the captured image from the memory 38, reduces the captured image, and generates a reduced image. The generated reduced image is supplied to the memory card 43 via the card I / F 42 and recorded in a recording area of the memory card 43 as the image holding unit 110.

例えば、撮影制御部１０１の制御に基づいて、画素の数が３００万乃至４００万である高解像度の画像が撮影されると、縮小画像生成部１０２は、撮影された画像から、デジタルスチルカメラ１１で閲覧するのに適した６４０画素×４８０画素のVGA（Video Graphics Array）と同じか、またはこれと同等のサイズの縮小画像を生成する。 For example, when a high-resolution image having 3 to 4 million pixels is captured based on the control of the imaging control unit 101, the reduced image generation unit 102 uses the digital still camera 11 from the captured image. A reduced image having the same size as or equivalent to a 640 pixel × 480 pixel VGA (Video Graphics Array) suitable for browsing with the above method is generated.

なお、縮小画像生成部１０２は、画像保持部１１０から画像を読み出して、読み出した画像を縮小し、縮小画像を生成するようにしてもよい。 Note that the reduced image generation unit 102 may read an image from the image holding unit 110, reduce the read image, and generate a reduced image.

以下、縮小画像と、撮影された画像とを区別するために、撮影された画像を本画像と称する。なお、縮小画像と本画像を区別する必要がないとき、単に画像と称する。 Hereinafter, in order to distinguish between a reduced image and a captured image, the captured image is referred to as a main image. Note that when there is no need to distinguish between the reduced image and the main image, they are simply referred to as images.

詳細は、後述するが、本画像と縮小画像とは、コンテンツデータベース１１１に記録されているデータによって紐付けされる。 Although details will be described later, the main image and the reduced image are linked by data recorded in the content database 111.

メタデータ生成部１０３は、本画像についてのメタデータを生成する。例えば、メタデータ生成部１０３は、JEIDA（Japanese Electronic Industry Development Association）によって規格化されているEXIF（Exchangeable Image File Format）方式のデータに格納されるメタデータを生成する。 The metadata generation unit 103 generates metadata about the main image. For example, the metadata generation unit 103 generates metadata stored in EXIF (Exchangeable Image File Format) data standardized by JEIDA (Japanese Electronic Industry Development Association).

エントリ生成部１０４は、データベースマネジメントシステム（Database Management System）として構成され、本画像が撮影されたとき、本画像および縮小画像のエントリを生成する。生成されたエントリは、コンテンツデータベース１１１に格納される。 The entry generation unit 104 is configured as a database management system, and generates an entry for the main image and the reduced image when the main image is captured. The generated entry is stored in the content database 111.

記録制御部１０５は、本画像および縮小画像の画像保持部１１０への記録を制御する。 The recording control unit 105 controls recording of the main image and the reduced image in the image holding unit 110.

表示制御部１０６は、縮小画像およびGUIの画像のモニタ４０への表示を制御する。 The display control unit 106 controls the display of the reduced image and the GUI image on the monitor 40.

検索部１０７は、コンテンツデータベース１１１、類似特徴データベース１１２、類似結果データベース１１３、または時間グループデータベース１１４に格納されているデータを基に、画像保持部１１０に記録されている縮小画像または本画像から、所望の縮小画像または本画像を検索する。検索部１０７は、検索の結果に応じたデータを、検索結果保持部１１５に格納させる。 Based on the data stored in the content database 111, the similar feature database 112, the similar result database 113, or the time group database 114, the search unit 107 uses the reduced image or the main image recorded in the image holding unit 110. A desired reduced image or main image is searched. The search unit 107 causes the search result holding unit 115 to store data corresponding to the search result.

検索部１０７は、距離計算部１２１を含む。距離計算部１２１は、類似特徴データベース１１２に格納されている画像の特徴を示すデータから、２つの画像の類似の度合いを示す距離を計算する。距離計算部１２１は、計算した距離を類似結果データベース１１３に記録させる。 The search unit 107 includes a distance calculation unit 121. The distance calculation unit 121 calculates a distance indicating the degree of similarity between two images from data indicating image features stored in the similar feature database 112. The distance calculation unit 121 records the calculated distance in the similarity result database 113.

送信制御部１０８は、通信部４７を制御して、通信部４７による本画像または縮小画像のサーバ１３への送信を制御する。受信制御部１０９は、通信部４７を制御して、通信部４７による、サーバ１３から送信されてくる、画像に各種の画像処理を適用して得られた画像の特徴の受信を制御する。 The transmission control unit 108 controls the communication unit 47 to control transmission of the main image or the reduced image to the server 13 by the communication unit 47. The reception control unit 109 controls the communication unit 47 to control reception of image features obtained by applying various types of image processing to images transmitted from the server 13 by the communication unit 47.

画像保持部１１０は、メモリカード４３の記録空間に構築され、本画像または縮小画像を記録する。 The image holding unit 110 is constructed in the recording space of the memory card 43 and records a main image or a reduced image.

コンテンツデータベース１１１、類似特徴データベース１１２、類似結果データベース１１３、および時間グループデータベース１１４は、メモリカード４３の所定の記録空間およびそれぞれのデータベースマネジメントシステムから構成される。 The content database 111, the similar feature database 112, the similar result database 113, and the time group database 114 are composed of a predetermined recording space of the memory card 43 and respective database management systems.

コンテンツデータベース１１１は、画像を特定するデータおよびこれに対応させて画像の各種のメタデータを格納する。類似特徴データベース１１２は、サーバ１３における画像の画像処理の結果得られた、画像の特徴を示すデータを格納する。 The content database 111 stores data for specifying an image and various metadata of the image corresponding to the data. The similar feature database 112 stores data indicating image features obtained as a result of image processing of images in the server 13.

類似結果データベース１１３は、検索部１０７の距離計算部１２１において計算された、２つの画像の類似の度合いを示す距離を格納する。 The similarity result database 113 stores a distance indicating the degree of similarity between two images calculated by the distance calculation unit 121 of the search unit 107.

時間グループデータベース１１４は、使用者が画像をグループに分類した場合の、それぞれのグループに属する画像を特定する情報を格納する。 The time group database 114 stores information for specifying images belonging to each group when the user classifies the images into groups.

検索結果保持部１１５は、検索の結果に応じたデータを記録する。例えば、検索結果保持部１１５は、画像の画素の色を基に抽出された、画像が所定の色名によって想起される度合いを示す関連度と、使用者からの操作に応じて入力された、色名で表される色の重みとから検索された、重みに応じた色の画像の検索結果を記録する。 The search result holding unit 115 records data according to the search result. For example, the search result holding unit 115 is input based on the relevance indicating the degree to which the image is recalled by a predetermined color name extracted based on the color of the pixel of the image and the operation from the user. The search result of the image of the color corresponding to the weight retrieved from the color weight represented by the color name is recorded.

関連度の詳細は、後述する。 Details of the relevance will be described later.

図５は、プログラムを実行するCPU７１により実現される機能の構成を示す図である。CPU７１は、プログラムを実行することにより、画像解析部１３１、縮小画像生成部１３２、メタデータ生成部１３３、エントリ生成部１３４、記録制御部１３５、表示制御部１３６、検索部１３７、送信制御部１３８−１および送信制御部１３８−２、受信制御部１３９−１および受信制御部１３９−２、画像保持部１４０、コンテンツデータベース１４１、類似特徴データベース１４２、類似結果データベース１４３、時間グループデータベース１４４、関連度抽出部対応保持部１４５、抽出特徴保持部１４６、並びに検索結果保持部１４７を実現する。 FIG. 5 is a diagram illustrating a configuration of functions realized by the CPU 71 that executes a program. By executing the program, the CPU 71 executes an image analysis unit 131, a reduced image generation unit 132, a metadata generation unit 133, an entry generation unit 134, a recording control unit 135, a display control unit 136, a search unit 137, and a transmission control unit 138. -1 and transmission control unit 138-2, reception control unit 139-1 and reception control unit 139-2, image holding unit 140, content database 141, similar feature database 142, similar result database 143, time group database 144, relevance An extraction unit correspondence holding unit 145, an extraction feature holding unit 146, and a search result holding unit 147 are realized.

画像解析部１３１は、画像の特徴を抽出する。すなわち、画像解析部１３１は、画像に画像処理を適用して、画像を解析する。画像解析部１３１は、画像処理の結果得られた、画像の特徴を類似特徴データベース１４２または送信制御部１３８−１に供給する。 The image analysis unit 131 extracts image features. That is, the image analysis unit 131 analyzes the image by applying image processing to the image. The image analysis unit 131 supplies the image features obtained as a result of the image processing to the similar feature database 142 or the transmission control unit 138-1.

図６は、画像解析部１３１の構成の例を示すブロック図である。画像解析部１３１は、顔画像検出部１６１および類似特徴量抽出部１６２から構成される。 FIG. 6 is a block diagram illustrating an example of the configuration of the image analysis unit 131. The image analysis unit 131 includes a face image detection unit 161 and a similar feature amount extraction unit 162.

顔画像検出部１６１は、画像に含まれる顔の画像に関する情報である画像の特徴を抽出する。例えば、顔画像検出部１６１は、画像に含まれる顔の画像の数、画像における顔の画像の位置、顔の画像の大きさ、または顔の画像における顔の向きなどである画像の特徴を抽出する。 The face image detection unit 161 extracts image features that are information related to the face image included in the image. For example, the face image detection unit 161 extracts image features such as the number of face images included in the image, the position of the face image in the image, the size of the face image, or the orientation of the face in the face image. To do.

類似特徴量抽出部１６２は、画像の類似の度合いを求めるための画像の特徴量を抽出する。類似特徴量抽出部１６２は、類似特徴ベクトル算出部１７１および色特徴抽出部１７２から構成される。類似特徴ベクトル算出部１７１は、２つの画像のそれぞれの特徴からその２つの画像の類似の度合いが計算される特徴を抽出する。色特徴抽出部１７２は、画像から、画像の画素の色を基に、画像が所定の色名によって想起される度合いを示す関連度を抽出する。言い換えれば、色特徴抽出部１７２は、画像の画素のうち、画素の色が所定の名前の色に分類される画素の数を示す特徴を抽出する。 The similar feature amount extraction unit 162 extracts the feature amount of the image for obtaining the degree of similarity of the images. The similar feature amount extraction unit 162 includes a similar feature vector calculation unit 171 and a color feature extraction unit 172. The similar feature vector calculation unit 171 extracts a feature whose degree of similarity between the two images is calculated from the features of the two images. The color feature extraction unit 172 extracts, from the image, a degree of association indicating the degree to which the image is recalled by a predetermined color name based on the color of the pixel of the image. In other words, the color feature extraction unit 172 extracts a feature indicating the number of pixels that are classified into a color with a predetermined name among the pixels of the image.

図５に戻り、縮小画像生成部１３２は、受信制御部１３９−２の制御の基に、ネットワーク１４を介してWebサーバ１５−１またはWebサーバ１５−２から取得した本画像を縮小し、縮小画像を生成する。生成された縮小画像は、画像保持部１４０に記録される。 Returning to FIG. 5, the reduced image generation unit 132 reduces and reduces the main image acquired from the Web server 15-1 or the Web server 15-2 via the network 14 under the control of the reception control unit 139-2. Generate an image. The generated reduced image is recorded in the image holding unit 140.

なお、縮小画像生成部１３２は、画像保持部１４０から画像を読み出して、読み出した画像を縮小し、縮小画像を生成するようにしてもよい。 Note that the reduced image generation unit 132 may read an image from the image holding unit 140, reduce the read image, and generate a reduced image.

メタデータ生成部１３３は、本画像についてのメタデータを生成する。例えば、メタデータ生成部１３３は、JEIDAによって規格化されているEXIF方式のデータに格納されるメタデータを生成する。 The metadata generation unit 133 generates metadata about the main image. For example, the metadata generation unit 133 generates metadata stored in EXIF format data standardized by JEIDA.

エントリ生成部１３４は、データベースマネジメントシステムとして構成され、受信制御部１３９−１の制御の基に、デジタルスチルカメラ１１から取得された本画像のエントリを生成する。または、エントリ生成部１３４は、受信制御部１３９−２の制御の基に、ネットワーク１４を介してWebサーバ１５−１またはWebサーバ１５−２から本画像が取得され、本画像から縮小画像が生成された場合、本画像および縮小画像のエントリを生成する。生成されたエントリは、コンテンツデータベース１４１に格納される。 The entry generation unit 134 is configured as a database management system, and generates an entry of the main image acquired from the digital still camera 11 under the control of the reception control unit 139-1. Alternatively, the entry generation unit 134 acquires a main image from the Web server 15-1 or the Web server 15-2 via the network 14 under the control of the reception control unit 139-2, and generates a reduced image from the main image. If so, entries for the main image and the reduced image are generated. The generated entry is stored in the content database 141.

記録制御部１３５は、本画像および縮小画像の画像保持部１４０への記録を制御する。 The recording control unit 135 controls recording of the main image and the reduced image in the image holding unit 140.

表示制御部１３６は、ディスプレイである出力部７７への、本画像およびGUIの画像の表示を制御する。 The display control unit 136 controls display of the main image and the GUI image on the output unit 77 which is a display.

検索部１３７は、コンテンツデータベース１４１、類似特徴データベース１４２、または時間グループデータベース１４４に格納されているデータを基に、画像保持部１４０に記録されている本画像または縮小画像から、所望の本画像または縮小画像を検索する。または、検索部１３７は、抽出特徴保持部１４６に格納されているデータを基に、画像保持部１４０に記録されている本画像または縮小画像から、所望の本画像または縮小画像を検索する。検索部１３７は、検索の結果に応じたデータを、検索結果保持部１４７に格納する。 Based on the data stored in the content database 141, the similar feature database 142, or the time group database 144, the search unit 137 selects a desired main image or reduced image from the main image or the reduced image recorded in the image holding unit 140. Search for reduced images. Alternatively, the search unit 137 searches for a desired main image or reduced image from the main image or reduced image recorded in the image holding unit 140 based on the data stored in the extracted feature holding unit 146. The search unit 137 stores data corresponding to the search result in the search result holding unit 147.

検索部１３７は、距離計算部１５１を含む。距離計算部１５１は、類似特徴データベース１４２に格納されている画像の特徴を示すデータから、２つの画像の類似の度合いを示す距離を計算する。距離計算部１５１は、計算した距離を類似結果データベース１４３に記録させる。 The search unit 137 includes a distance calculation unit 151. The distance calculation unit 151 calculates a distance indicating the degree of similarity between two images from data indicating image features stored in the similar feature database 142. The distance calculation unit 151 records the calculated distance in the similarity result database 143.

送信制御部１３８−１は、通信部７９を制御して、通信部７９に、画像解析部１３１において画像処理の結果得られた、画像の特徴をデジタルスチルカメラ１１宛てに送信させる。受信制御部１３９−１は、通信部７９を制御して、通信部７９に、デジタルスチルカメラ１１から送信されてくる本画像および縮小画像を受信させる。 The transmission control unit 138-1 controls the communication unit 79 to cause the communication unit 79 to transmit the image characteristics obtained as a result of image processing in the image analysis unit 131 to the digital still camera 11. The reception control unit 139-1 controls the communication unit 79 to cause the communication unit 79 to receive the main image and the reduced image transmitted from the digital still camera 11.

送信制御部１３８−２は、通信部８０を制御する。送信制御部１３８−２は、通信部８０に、ネットワーク１４を介して、画像の要求をWebサーバ１５−１またはWebサーバ１５−２宛てに送信させる。受信制御部１３９−２は、通信部８０を制御して、通信部８０に、Webサーバ１５−１またはWebサーバ１５−２から送信されてくる本画像を受信させる。 The transmission control unit 138-2 controls the communication unit 80. The transmission control unit 138-2 causes the communication unit 80 to transmit an image request to the Web server 15-1 or the Web server 15-2 via the network 14. The reception control unit 139-2 controls the communication unit 80 to cause the communication unit 80 to receive the main image transmitted from the Web server 15-1 or the Web server 15-2.

画像保持部１４０は、ハードディスクなどからなる記憶部７８の記録空間に構築され、本画像または縮小画像を記録する。画像保持部１４０は、ドライブ８１に装着される、磁気ディスク、光ディスク、光磁気ディスク、或いは半導体メモリなどのリムーバブルメディア８２の記録空間に構築するようにしてもよい。 The image holding unit 140 is constructed in a recording space of the storage unit 78 composed of a hard disk or the like, and records a main image or a reduced image. The image holding unit 140 may be constructed in a recording space of a removable medium 82 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory that is mounted on the drive 81.

コンテンツデータベース１４１、類似特徴データベース１４２、類似結果データベース１４３、および時間グループデータベース１４４は、記憶部７８の所定の記録空間およびそれぞれのデータベースマネジメントシステムから構成される。 The content database 141, the similar feature database 142, the similar result database 143, and the time group database 144 are configured by a predetermined recording space of the storage unit 78 and respective database management systems.

コンテンツデータベース１４１は、画像を特定するデータおよびこれに対応させて画像の各種のメタデータを格納する。類似特徴データベース１４２は、画像解析部１３１における画像の画像処理の結果得られた、画像の特徴を示すデータを格納する。 The content database 141 stores data specifying an image and various metadata of the image corresponding to the data. The similar feature database 142 stores data indicating image features obtained as a result of image processing of the image by the image analysis unit 131.

類似結果データベース１１３は、検索部１３７の距離計算部１５１において計算された、２つの画像の類似の度合いを示す距離を格納する。 The similarity result database 113 stores a distance indicating the degree of similarity between two images calculated by the distance calculation unit 151 of the search unit 137.

時間グループデータベース１４４は、使用者が画像をグループに分類した場合の、それぞれのグループに属する画像を特定する情報を格納する。 The time group database 144 stores information for specifying images belonging to each group when the user classifies the images into groups.

関連度抽出部対応保持部１４５は、色特徴抽出部１７２における、色名と、色毎に関連度を抽出する関連度抽出部（詳細は図３３を参照して後述する）との対応を示す対応情報を記録する。 The association degree extraction unit correspondence holding unit 145 indicates the correspondence between the color name in the color feature extraction unit 172 and the association degree extraction unit that extracts the association degree for each color (details will be described later with reference to FIG. 33). Record correspondence information.

抽出特徴保持部１４６は、色特徴抽出部１７２において抽出された、画像が所定の色名によって想起される度合いを示す関連度を保持する。 The extracted feature holding unit 146 holds a degree of association indicating the degree to which an image is recalled by a predetermined color name extracted by the color feature extracting unit 172.

検索結果保持部１４７は、画像の画素の色を基に抽出された、画像が所定の色名によって想起される度合いを示す関連度と、使用者からの操作に応じて入力された検索条件とから検索された、検索条件に応じた色の画像の検索結果を記録する。例えば、検索結果保持部１４７は、関連度と、色名で表される色の重みである検索条件とから検索された、重みに応じた色の画像の検索結果を記録する。 The search result holding unit 147 extracts the degree of relevance indicating the degree to which the image is recalled by a predetermined color name, extracted based on the color of the pixel of the image, and the search condition input according to the operation from the user. The search result of the image of the color corresponding to the search condition retrieved from is recorded. For example, the search result holding unit 147 records the search result of the color image corresponding to the weight searched from the relevance and the search condition that is the weight of the color represented by the color name.

次に、画像から特徴を抽出して、抽出した特徴をサーバ１３およびデジタルスチルカメラ１１において記録する処理について説明する。 Next, processing for extracting features from an image and recording the extracted features in the server 13 and the digital still camera 11 will be described.

まず、図７のフローチャートを参照して、デジタルスチルカメラ１１の撮影の処理を説明する。 First, with reference to the flowchart of FIG. 7, the photographing process of the digital still camera 11 will be described.

ステップＳ１１において、撮影制御部１０１は、撮影レンズ３１乃至デジタル信号処理部３６、メモリ３８、AFモータズームモータ４４、およびコントロール回路４５を制御し、被写体を撮影させる。ステップＳ１２において、撮影制御部１０１は、圧縮伸張部４１に、メモリ３８に記憶されているデジタル信号をJPEGまたはJPEG2000などの方式で符号化させて、画像データである本画像を生成させる。撮影制御部１０１は、本画像を画像保持部１１０に記録させる。 In step S11, the photographing control unit 101 controls the photographing lens 31 through the digital signal processing unit 36, the memory 38, the AF motor zoom motor 44, and the control circuit 45 to photograph the subject. In step S12, the imaging control unit 101 causes the compression / decompression unit 41 to encode the digital signal stored in the memory 38 using a method such as JPEG or JPEG2000 to generate a main image that is image data. The imaging control unit 101 causes the image holding unit 110 to record the main image.

また、メタデータ生成部１０３は、本画像についてのメタデータを生成する。例えば、メタデータ生成部１０３は、JEIDAによって規格化されているEXIF方式のデータに格納される、本画像の撮影時刻または撮影条件などのメタデータを生成する。 In addition, the metadata generation unit 103 generates metadata about the main image. For example, the metadata generation unit 103 generates metadata such as the shooting time or shooting condition of the main image stored in EXIF format data standardized by JEIDA.

ステップＳ１３において、縮小画像生成部１０２は、撮影された画像のデジタル信号をメモリ３８から読み出して、撮影された画像を縮小し、縮小画像を生成する。縮小画像生成部１０２は、縮小画像を画像保持部１１０に記録させる。 In step S 13, the reduced image generation unit 102 reads a digital signal of the captured image from the memory 38, reduces the captured image, and generates a reduced image. The reduced image generation unit 102 causes the image holding unit 110 to record the reduced image.

ステップＳ１４において、エントリ生成部１０４は、本画像および縮小画像のエントリを生成する。エントリ生成部１０４は、生成されたエントリを、メタデータ生成部１０３において生成したメタデータに関係付けて、コンテンツデータベース１１１に追加（格納）し、処理は終了する。 In step S 14, the entry generation unit 104 generates entries for the main image and the reduced image. The entry generation unit 104 adds (stores) the generated entry to the content database 111 in association with the metadata generated by the metadata generation unit 103, and the process ends.

コンテンツデータベース１１１に、撮影時刻または撮影条件などのメタデータが格納されるので、撮影時刻または撮影条件により本画像または縮小画像を検索することができる。 Since metadata such as shooting time or shooting conditions is stored in the content database 111, the main image or the reduced image can be searched based on the shooting time or shooting conditions.

携帯電話機１２においても、図７のフローチャートで示される撮影の処理と同様の処理が実行される。 Also in the mobile phone 12, the same processing as the photographing processing shown in the flowchart of FIG. 7 is executed.

このようにすることで、図８で示されるように、デジタルスチルカメラ１１または携帯電話機１２において、画像が撮影されると、本画像２０１に関係付けられたメタデータがコンテンツデータベース１１１に格納されると共に、本画像２０１を縮小した縮小画像２０２が生成され、本画像２０１に関係付けられたメタデータであって、コンテンツデータベース１１１に格納されているメタデータと縮小画像２０２とが関係付けられる。 In this way, as shown in FIG. 8, when the digital still camera 11 or the mobile phone 12 captures an image, the metadata associated with the main image 201 is stored in the content database 111. At the same time, a reduced image 202 obtained by reducing the main image 201 is generated, and the metadata stored in the content database 111 and the reduced image 202 are related to each other.

次に、図９のフローチャートを参照して、デジタルスチルカメラ１１において撮影された画像をサーバ１３にバックアップする場合の、サーバ１３のバックアップの処理を説明する。サーバ１３のバックアップの処理は、例えば、デジタルスチルカメラ１１に一端が接続されているUSBケーブルがサーバ１３に接続されるとプログラムが起動されることにより開始される。 Next, a backup process of the server 13 when an image captured by the digital still camera 11 is backed up to the server 13 will be described with reference to a flowchart of FIG. The backup process of the server 13 is started by starting the program when a USB cable, one end of which is connected to the digital still camera 11, is connected to the server 13, for example.

ステップＳ３１において、サーバ１３の送信制御部１３８−１および受信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１と接続させる。 In step S31, the transmission control unit 138-1 and the reception control unit 138-1 of the server 13 cause the communication unit 79 to connect to the digital still camera 11.

ステップＳ３２において、サーバ１３の送信制御部１３８−１および受信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１から本画像２０１および縮小画像２０２を取得させる。例えば、ステップＳ３２において、送信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１宛てに本画像２０１および縮小画像２０２の送信要求を送信させる。すると、デジタルスチルカメラ１１が本画像２０１および縮小画像２０２を送信してくるので、受信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１から送信されてきた本画像２０１および縮小画像２０２を受信させる。受信制御部１３８−１は、取得した（受信した）本画像２０１および縮小画像２０２を画像保持部１４０に供給する。 In step S 32, the transmission control unit 138-1 and the reception control unit 138-1 of the server 13 cause the communication unit 79 to acquire the main image 201 and the reduced image 202 from the digital still camera 11. For example, in step S 32, the transmission control unit 138-1 causes the communication unit 79 to transmit a transmission request for the main image 201 and the reduced image 202 to the digital still camera 11. Then, since the digital still camera 11 transmits the main image 201 and the reduced image 202, the reception control unit 138-1 sends the main image 201 and the reduced image 202 transmitted from the digital still camera 11 to the communication unit 79. To receive. The reception control unit 138-1 supplies the acquired (received) main image 201 and reduced image 202 to the image holding unit 140.

ステップＳ３３において、画像保持部１４０は、デジタルスチルカメラ１１から取得した本画像２０１および縮小画像２０２を記録する。 In step S 33, the image holding unit 140 records the main image 201 and the reduced image 202 acquired from the digital still camera 11.

ステップＳ３４において、画像解析部１３１は、画像保持部１４０に記録された画像を解析する。 In step S 34, the image analysis unit 131 analyzes the image recorded in the image holding unit 140.

なお、画像解析部１３１は、本画像２０１を解析するようにしてもよく、縮小画像２０２を解析するようにしてもよい。 Note that the image analysis unit 131 may analyze the main image 201 or the reduced image 202.

ステップＳ３４の画像の解析の処理の詳細を、図１０のフローチャートを参照して説明する。 Details of the image analysis processing in step S34 will be described with reference to the flowchart of FIG.

ステップＳ４１において、画像解析部１３１の顔画像検出部１６１は、画像から顔画像を検出する。すなわち、ステップＳ４１において、顔画像検出部１６１は、画像に含まれる顔の画像に関する情報である画像の特徴を抽出する。例えば、ステップＳ４１において、顔画像検出部１６１は、画像に含まれる顔の画像の数、画像における顔の画像の位置、顔の画像の大きさ、または顔の画像における顔の向きである画像の特徴を抽出する。 In step S41, the face image detection unit 161 of the image analysis unit 131 detects a face image from the image. That is, in step S41, the face image detection unit 161 extracts image features that are information related to the face image included in the image. For example, in step S41, the face image detection unit 161 detects the number of face images included in the image, the position of the face image in the image, the size of the face image, or the orientation of the face in the face image. Extract features.

より具体的には、例えば、顔画像検出部１６１は、画像の画素のうち、人の肌の色に対応する所定の色の範囲に属する色を示す画素値を有する画素を特定する。そして、顔画像検出部１６１は、色によって特定された画素のうち、所定の数以上、相互に隣接している画素により構成される領域を顔の画像とする。 More specifically, for example, the face image detection unit 161 identifies a pixel having a pixel value indicating a color belonging to a predetermined color range corresponding to the color of human skin among the pixels of the image. Then, the face image detection unit 161 sets, as a face image, an area formed by pixels adjacent to each other by a predetermined number or more among the pixels specified by color.

顔画像検出部１６１は、検出された顔の画像の数を数える。さらに、顔画像検出部１６１は、画像の全体の高さおよび全体の幅をそれぞれ１とした場合、画像における顔の画像の位置として、画像の全体に対する相対的な、顔の画像の縦方向の位置および横方向の位置を検出する。 The face image detection unit 161 counts the number of detected face images. Further, the face image detection unit 161 assumes that the position of the face image in the image is relative to the entire image in the vertical direction of the face image when the overall height and width of the image are each 1. Detect position and lateral position.

また、顔画像検出部１６１は、画像の全体の高さおよび全体の幅をそれぞれ１とした場合、画像における顔の画像の大きさとして、画像の全体に対する相対的な、顔の画像の高さおよび幅を検出する。 Further, the face image detection unit 161 sets the height of the face image relative to the entire image as the size of the face image in the image when the overall height and the overall width of the image are 1, respectively. And detect width.

そして、顔画像検出部１６１は、予め定義されている、想定される顔の方向ごとの複数のパターンと、選択された顔の画像と一致するか否かを判定し、顔の向きを、顔の画像と一致するパターンに対応する向きとすることで、顔の向きを検出する。この場合、顔画像検出部１６１は、選択された顔の画像について、顔の向きとして、顔のロール角、ピッチ角、およびヨー角を検出する。 Then, the face image detection unit 161 determines whether or not the plurality of patterns for each assumed face direction that are defined in advance match the selected face image, and determines the face orientation as the face orientation. The orientation of the face is detected by setting the orientation corresponding to the pattern that matches the image. In this case, the face image detection unit 161 detects the roll angle, pitch angle, and yaw angle of the face as the face direction for the selected face image.

ステップＳ４２において、画像解析部１３１の類似特徴量抽出部１６２の類似特徴ベクトル算出部１７１は、画像の類似の度合いを求める特徴量である類似特徴ベクトルを算出する。すなわち、ステップＳ４２において、類似特徴ベクトル算出部１７１は、２つの画像のそれぞれの特徴からその２つの画像の類似の度合いが計算される特徴を抽出する。 In step S42, the similar feature vector calculation unit 171 of the similar feature amount extraction unit 162 of the image analysis unit 131 calculates a similar feature vector that is a feature amount for obtaining the degree of similarity of images. That is, in step S42, the similar feature vector calculation unit 171 extracts features for which the degree of similarity between the two images is calculated from the features of the two images.

例えば、類似特徴ベクトル算出部１７１は、色ヒストグラムである類似特徴ベクトルを算出する。 For example, the similar feature vector calculation unit 171 calculates a similar feature vector that is a color histogram.

より具体的には、例えば、図１１で示されるように、類似特徴ベクトル算出部１７１は、２４ビットRGBの本画像２０１の１６７７７２１６１色の色を、３２色に減色し、３２色に減色した減色画像２２１を生成する。すなわち、５ビットRGBの減色画像２２１が生成される。例えば、類似特徴ベクトル算出部１７１は、本画像２０１の各画素の画素値から、所定の上位のビットを抽出することで、減色画像２２１を生成する。 More specifically, for example, as illustrated in FIG. 11, the similar feature vector calculation unit 171 reduces the color of 1677772161 colors of the 24-bit RGB main image 201 to 32 colors and reduces the color to 32 colors. An image 221 is generated. That is, a 5-bit RGB reduced color image 221 is generated. For example, the similar feature vector calculation unit 171 generates a reduced color image 221 by extracting a predetermined upper bit from the pixel value of each pixel of the main image 201.

そして、類似特徴ベクトル算出部１７１は、RGBで表される減色画像２２１の各画素の色を、Ｌ*ａ*ｂ*で表すように変換する。すなわち、類似特徴ベクトル算出部１７１は、減色画像２２１の各画素の色を示すＬ*ａ*ｂ*空間上の位置を特定する。言い換えれば、減色画像２２１の画素のそれぞれについて、減色画像２２１の各画素で示される３２色のいずれかの色（Ｌ*ａ*ｂ*空間上の位置）が特定される。 Then, the similar feature vector calculation unit 171 converts the color of each pixel of the subtractive color image 221 represented by RGB so as to represent L * a * b *. That is, the similar feature vector calculation unit 171 specifies a position in the L * a * b * space indicating the color of each pixel of the color-reduced image 221. In other words, for each pixel of the reduced color image 221, any one of the 32 colors (positions in the L * a * b * space) indicated by each pixel of the reduced color image 221 is specified.

さらに、類似特徴ベクトル算出部１７１は、減色画像２２１について、３２色の色毎の画素の数、すなわち、色毎の頻度を求めて、色ヒストグラムを生成する。色ヒストグラムの尺度は、色を示し、色ヒストグラムの度数は、その色の画素の数（頻度）を示す。 Further, the similar feature vector calculation unit 171 determines the number of pixels for each of the 32 colors, that is, the frequency for each color, and generates a color histogram for the subtractive color image 221. The scale of the color histogram indicates a color, and the frequency of the color histogram indicates the number (frequency) of pixels of the color.

また、例えば、類似特徴ベクトル算出部１７１は、垂直成分ヒストグラムおよび水平成分ヒストグラムである類似特徴ベクトルを算出する。 For example, the similar feature vector calculation unit 171 calculates similar feature vectors that are a vertical component histogram and a horizontal component histogram.

この場合、まず、図１２で示されるように、類似特徴ベクトル算出部１７１は、本画像２０１を、１６画素×１６画素のブロック２４１に分割し、それぞれのブロック２４１に、垂直方向（縦）および水平方向（横）にDFT（Discrete Fourier Transform）の処理を適用する。 In this case, first, as shown in FIG. 12, the similar feature vector calculation unit 171 divides the main image 201 into blocks 241 each having 16 pixels × 16 pixels, and each block 241 has a vertical direction (vertical) and a vertical direction. DFT (Discrete Fourier Transform) processing is applied in the horizontal direction (horizontal).

すなわち、類似特徴ベクトル算出部１７１は、各ブロック２４１の縦１列に並ぶ１６の画素にDFTの処理を適用し、縦１列の１６の画素の画像の周波数成分を抽出する。各ブロック２４１には、１６の画素からなる列が、１６並んでいるので、類似特徴ベクトル算出部１７１は、それぞれのブロック２４１についての垂直方向（縦）のDFTの処理によって、１６の画像の周波数成分を抽出することになる。 That is, the similar feature vector calculation unit 171 applies DFT processing to 16 pixels arranged in one vertical column of each block 241 and extracts frequency components of the image of 16 pixels in one vertical column. Since each block 241 includes 16 columns of 16 pixels, the similar feature vector calculation unit 171 performs the frequency of 16 images by performing vertical (vertical) DFT processing for each block 241. The component will be extracted.

そして、類似特徴ベクトル算出部１７１は、各ブロック２４１に垂直方向（縦）のDFTの処理を適用した結果得られた画像の周波数成分を、周波数毎に積算（加算）する。類似特徴ベクトル算出部１７１は、各ブロック２４１に垂直方向（縦）のDFTの処理を適用した結果を積算した値のうち、DC成分を除く、８つのより低い周波数の成分の中から、最大の成分を抽出する。この場合、最大値が予め定めた閾値に満たないときには、そのブロック２４１の処理の結果は破棄される。 Then, the similar feature vector calculation unit 171 accumulates (adds) the frequency components of the image obtained as a result of applying the vertical (vertical) DFT processing to each block 241 for each frequency. The similar feature vector calculation unit 171 calculates the maximum value from eight lower frequency components excluding the DC component among the values obtained by integrating the results of applying the vertical (vertical) DFT processing to each block 241. Extract ingredients. In this case, when the maximum value is less than the predetermined threshold value, the processing result of the block 241 is discarded.

類似特徴ベクトル算出部１７１は、画像について、このように求められたブロック２４１毎の最大値を８つの周波数ごとに積算することで、図１３で示すように、８つの周波数に対する最大値の頻度を示す垂直成分ヒストグラムを生成する。垂直成分ヒストグラムの尺度は、画像の周波数を示し、垂直成分ヒストグラムの度数は、その周波数の成分が最大となる数（頻度）を示す。 The similar feature vector calculation unit 171 accumulates the maximum value for each block 241 in this way for each of the eight frequencies for the image, thereby obtaining the frequency of the maximum value for the eight frequencies as shown in FIG. A vertical component histogram is generated. The scale of the vertical component histogram indicates the frequency of the image, and the frequency of the vertical component histogram indicates the number (frequency) at which the frequency component is maximum.

同様に、類似特徴ベクトル算出部１７１は、各ブロック２４１の横１行に並ぶ１６の画素にDFTの処理を適用し、横１行の１６の画素の画像の周波数成分を抽出する。各ブロック２４１には、１６の画素からなる行が、１６並んでいるので、類似特徴ベクトル算出部１７１は、それぞれのブロック２４１についての水平方向（横）のDFTの処理によって、１６の画像の周波数成分を抽出することになる。 Similarly, the similar feature vector calculation unit 171 applies DFT processing to 16 pixels arranged in one horizontal row of each block 241 and extracts frequency components of the image of 16 pixels in one horizontal row. Since each block 241 includes 16 rows of 16 pixels, the similar feature vector calculation unit 171 performs the frequency of 16 images by performing horizontal (horizontal) DFT processing for each block 241. The component will be extracted.

そして、類似特徴ベクトル算出部１７１は、各ブロック２４１に水平方向（横）にDFTの処理を適用した結果得られた画像の周波数成分を、周波数毎に積算（加算）する。類似特徴ベクトル算出部１７１は、各ブロック２４１に水平方向（横）のDFTの処理を適用した結果を積算した値のうち、DC成分を除く、８つのより低い周波数の成分の中から、最大の成分を抽出する。この場合、最大値が予め定めた閾値に満たないときには、そのブロック２４１の処理の結果は破棄される。 Then, the similar feature vector calculation unit 171 accumulates (adds) the frequency components of the image obtained as a result of applying the DFT processing to each block 241 in the horizontal direction (lateral) for each frequency. The similar feature vector calculation unit 171 calculates the maximum value from eight lower frequency components excluding the DC component among the values obtained by integrating the results of applying the horizontal (horizontal) DFT processing to each block 241. Extract ingredients. In this case, when the maximum value is less than the predetermined threshold value, the processing result of the block 241 is discarded.

類似特徴ベクトル算出部１７１は、画像について、このように求められたブロック２４１毎の最大値を８つの周波数ごとに積算することで、図１３で示すように、８つの周波数に対する最大値の頻度を示す水平成分ヒストグラムを生成する。水平成分ヒストグラムの尺度は、画像の周波数を示し、水平成分ヒストグラムの度数は、その周波数の成分が最大となる数（頻度）を示す。 The similar feature vector calculation unit 171 accumulates the maximum value for each block 241 in this way for each of the eight frequencies for the image, thereby obtaining the frequency of the maximum value for the eight frequencies as shown in FIG. A horizontal component histogram is generated. The scale of the horizontal component histogram indicates the frequency of the image, and the frequency of the horizontal component histogram indicates the number (frequency) at which the frequency component is maximum.

このように、類似特徴ベクトル算出部１７１は、画像について、垂直成分ヒストグラムおよび水平成分ヒストグラムを生成する。 As described above, the similar feature vector calculation unit 171 generates a vertical component histogram and a horizontal component histogram for an image.

例えば、ステップＳ４２において、類似特徴ベクトル算出部１７１は、２つの画像のそれぞれの特徴からその２つの画像の類似の度合いが計算される特徴として、色ヒストグラム、垂直成分ヒストグラム、および水平成分ヒストグラムを抽出する。 For example, in step S42, the similar feature vector calculation unit 171 extracts a color histogram, a vertical component histogram, and a horizontal component histogram as features for calculating the degree of similarity between the two images from the features of the two images. To do.

図１０に戻り、ステップＳ４３において、画像解析部１３１の類似特徴量抽出部１６２の色特徴抽出部１７２は、画像に色特徴抽出の処理を適用して、処理は終了する。色特徴抽出の処理によって、画像から、画像の画素の色を基に、画像が所定の色名によって想起される度合いを示す関連度が抽出される。色特徴抽出の処理の詳細は、図３６のフローチャートを参照して後述する。 Returning to FIG. 10, in step S43, the color feature extraction unit 172 of the similar feature amount extraction unit 162 of the image analysis unit 131 applies the color feature extraction process to the image, and the process ends. Through the color feature extraction process, a degree of association indicating the degree to which the image is recalled by a predetermined color name is extracted from the image based on the color of the pixel of the image. Details of the color feature extraction processing will be described later with reference to the flowchart of FIG.

このように、ステップＳ３４において、画像解析部１３１によって、画像保持部１４０に記録された画像が解析されて、画像の特徴が抽出される。 As described above, in step S34, the image analysis unit 131 analyzes the image recorded in the image holding unit 140, and extracts the feature of the image.

ステップＳ３５において、メタデータ生成部１３３は、ステップＳ３４において抽出された画像の特徴を含む画像のメタデータを生成する。ステップＳ３６において、エントリ生成部１３４は、本画像２０１および縮小画像２０２のエントリを生成する。エントリ生成部１３４は、生成したエントリを、ステップＳ３５において生成されたメタデータに関係付けて、コンテンツデータベース１４１および類似特徴データベース１４２に追加（格納）する。コンテンツデータベース１４１および類似特徴データベース１４２は、サーバ１３において抽出された画像の特徴を含むメタデータを記録する。 In step S35, the metadata generation unit 133 generates image metadata including the image features extracted in step S34. In step S 36, the entry generation unit 134 generates entries for the main image 201 and the reduced image 202. The entry generation unit 134 adds (stores) the generated entry to the content database 141 and the similar feature database 142 in association with the metadata generated in step S35. The content database 141 and the similar feature database 142 record metadata including image features extracted by the server 13.

ステップＳ３７において、送信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１のコンテンツデータベース１１１および類似特徴データベース１１２に、抽出された画像の特徴を含むメタデータを記入させる。すなわち、ステップＳ３７において、送信制御部１３８−１は、コンテンツデータベース１１１および類似特徴データベース１１２への記入の指令と共に、ステップＳ３５において生成されたメタデータを、通信部７９に、デジタルスチルカメラ１１宛てに送信させる。デジタルスチルカメラ１１の受信制御部１０９は、通信部４７に、メタデータとコンテンツデータベース１１１および類似特徴データベース１１２への記入の指令とを受信させると、メタデータとコンテンツデータベース１１１および類似特徴データベース１１２への記入の指令とをコンテンツデータベース１１１および類似特徴データベース１１２に供給する。コンテンツデータベース１１１および類似特徴データベース１１２は、記入の指令を受けると、サーバ１３において抽出された画像の特徴を含むメタデータを記録する。 In step S 37, the transmission control unit 138-1 causes the communication unit 79 to enter metadata including the extracted image features in the content database 111 and the similar feature database 112 of the digital still camera 11. That is, in step S37, the transmission control unit 138-1 sends the metadata generated in step S35 to the digital still camera 11 to the communication unit 79 together with instructions for filling the content database 111 and the similar feature database 112. Send it. The reception control unit 109 of the digital still camera 11 causes the communication unit 47 to receive the metadata and a command to fill in the content database 111 and the similar feature database 112, and then to the metadata, the content database 111, and the similar feature database 112. The content database 111 and the similar feature database 112 are supplied. When the content database 111 and the similar feature database 112 receive an entry command, the content database 111 and the similar feature database 112 record metadata including image features extracted by the server 13.

このように、コンテンツデータベース１４１および類似特徴データベース１４２と、コンテンツデータベース１１１および類似特徴データベース１１２とは、サーバ１３において抽出された画像の特徴を含む同じメタデータを記録する。 As described above, the content database 141 and the similar feature database 142 and the content database 111 and the similar feature database 112 record the same metadata including the image features extracted in the server 13.

ステップＳ３８において、サーバ１３の送信制御部１３８−１および受信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１との接続を切断させ、処理は終了する。 In step S38, the transmission control unit 138-1 and the reception control unit 138-1 of the server 13 cause the communication unit 79 to disconnect from the digital still camera 11, and the process ends.

なお、サーバ１３は、携帯電話機１２に対して、携帯電話機１２で撮影された画像について、図９のフローチャートで示されるバックアップの処理と同様に処理を実行することができる。 Note that the server 13 can execute the same processing as the backup processing shown in the flowchart of FIG. 9 on the image captured by the mobile phone 12 with respect to the mobile phone 12.

図１４で示されるように、デジタルスチルカメラ１１または携帯電話機１２で撮影された画像がサーバ１３−１またはサーバ１３−２にバックアップされると、サーバ１３−１またはサーバ１３−２は、バックアップされた画像を解析して、画像の特徴を抽出し、抽出した画像の特徴を記述したメタデータ２６１をデジタルスチルカメラ１１または携帯電話機１２に書き戻す。 As shown in FIG. 14, when an image taken with the digital still camera 11 or the mobile phone 12 is backed up to the server 13-1 or the server 13-2, the server 13-1 or the server 13-2 is backed up. The image is analyzed to extract the feature of the image, and the metadata 261 describing the extracted feature of the image is written back to the digital still camera 11 or the mobile phone 12.

図１５は、本画像２０１および縮小画像２０２に関係付けられた、抽出した画像の特徴を記述したメタデータ２６１の具体例を示す図である。 FIG. 15 is a diagram illustrating a specific example of the metadata 261 describing the characteristics of the extracted image related to the main image 201 and the reduced image 202.

メタデータ２６１は、例えば、XML（eXtensible Mark-up Language）方式で記述される。 The metadata 261 is described in, for example, an XML (eXtensible Mark-up Language) method.

<photo>タグおよび</photo>タグの間には、本画像２０１および縮小画像２０２との関係付けを示す情報並びに本画像２０１および縮小画像２０２の特徴を示す情報が配置される。 Between the <photo> tag and the </ photo> tag, information indicating the relationship between the main image 201 and the reduced image 202 and information indicating the characteristics of the main image 201 and the reduced image 202 are arranged.

<guid>タグおよび</guid>タグの間には、このメタデータ２６１に関係付けられている本画像２０１および縮小画像２０２を特定する特定情報であるコンテンツIDが配置される。例えば、コンテンツIDは、１２８ビットとされる。コンテンツIDは、本画像２０１と、その本画像２０１を縮小した縮小画像２０２とに共通とされる。 Between the <guid> tag and the </ guid> tag, a content ID which is specific information for specifying the main image 201 and the reduced image 202 related to the metadata 261 is arranged. For example, the content ID is 128 bits. The content ID is common to the main image 201 and the reduced image 202 obtained by reducing the main image 201.

<FullImgPath>タグおよび</FullImgPath>タグの間には、画像データである本画像２０１が格納されているファイルのパスおよび画像データである本画像２０１が格納されているファイルのファイル名が配置される。<CacheImgPath>タグおよび</CacheImgPath>タグの間には、画像データである縮小画像２０２が格納されているファイルのパスおよび画像データである縮小画像２０２が格納されているファイルのファイル名が配置される。 Between the <FullImgPath> tag and the </ FullImgPath> tag, the path of the file storing the main image 201 as image data and the file name of the file storing the main image 201 as image data are arranged. The Between the <CacheImgPath> tag and the </ CacheImgPath> tag, the path of the file storing the reduced image 202 as image data and the file name of the file storing the reduced image 202 as image data are arranged. The

<TimeStamp>タグおよび</TimeStamp>タグの間に配置されている2003:03:31 06:52:32は、本画像２０１が、２００３年３月３１日６時５２分３２秒に撮影されたことを示すタイムスタンプである。 In 2003: 03: 31 06:52:32 placed between the <TimeStamp> tag and the </ TimeStamp> tag, the main image 201 was taken at 6:52:32 on March 31, 2003 It is a time stamp indicating that.

<Faceinfo>タグおよび</Faceinfo>タグの間には、コンテンツIDで特定される本画像２０１および縮小画像２０２に含まれる顔の画像に関する情報が配置される。 Between the <Faceinfo> tag and the </ Faceinfo> tag, information related to the face image included in the main image 201 and the reduced image 202 specified by the content ID is arranged.

<TotalFace>タグおよび</TotalFace>タグの間に配置されている1は、コンテンツIDで特定される本画像２０１または縮小画像２０２に含まれる顔の画像の数が１つであることを示す。すなわち、<TotalFace>タグおよび</TotalFace>タグの間に配置されている値は、コンテンツIDで特定される本画像２０１または縮小画像２０２に含まれる顔の画像の総数を示す。 1 arranged between the <TotalFace> tag and the </ TotalFace> tag indicates that the number of face images included in the main image 201 or the reduced image 202 specified by the content ID is one. That is, the value arranged between the <TotalFace> tag and the </ TotalFace> tag indicates the total number of face images included in the main image 201 or the reduced image 202 specified by the content ID.

<FaceEntry>タグおよび</FaceEntry>タグの間には、１つの顔の画像についての情報が配置される。図１５に例示されるメタデータ２６１における顔の画像の総数が１なので、１組の<FaceEntry>タグおよび</FaceEntry>タグが配置されることになる。 Information about one face image is arranged between the <FaceEntry> tag and the </ FaceEntry> tag. Since the total number of face images in the metadata 261 illustrated in FIG. 15 is 1, a set of <FaceEntry> tags and </ FaceEntry> tags are arranged.

<x>タグおよび</x>タグの間に配置されている値は、コンテンツIDで特定される本画像２０１または縮小画像２０２における顔の画像の横方向の位置を示す。図１５において、<x>タグおよび</x>タグの間に配置されている0.328767は、本画像２０１または縮小画像２０２の左端を0.0とし、本画像２０１または縮小画像２０２の右端を1.0とした場合に、顔の画像の右端の横方向の位置が、0.328767であることを示す。 The value arranged between the <x> tag and the </ x> tag indicates the horizontal position of the face image in the main image 201 or the reduced image 202 specified by the content ID. In FIG. 15, 0.328767 arranged between the <x> tag and the </ x> tag sets 0.0 as the left end of the main image 201 or the reduced image 202 and 1.0 as the right end of the main image 201 or the reduced image 202. In this case, the horizontal position of the right end of the face image is 0.328767.

<y>タグおよび</y>タグの間に配置されている値は、コンテンツIDで特定される本画像２０１または縮小画像２０２における顔の画像の縦方向の位置を示す。図１５において、<y>タグおよび</y>タグの間に配置されている0.204082は、本画像２０１または縮小画像２０２の上端を0.0とし、本画像２０１または縮小画像２０２の下端を1.0とした場合に、顔の画像の上端の縦方向の位置が、0.204082であることを示す。 The value arranged between the <y> tag and the </ y> tag indicates the position in the vertical direction of the face image in the main image 201 or the reduced image 202 specified by the content ID. In FIG. 15, 0.204082 arranged between the <y> tag and the </ y> tag sets the upper end of the main image 201 or the reduced image 202 to 0.0 and sets the lower end of the main image 201 or the reduced image 202 to 1.0. In this case, the vertical position of the upper end of the face image is 0.204082.

すなわち、<x>タグおよび</x>タグの間には、顔の画像の正規化された横方向の位置が配置され、<y>タグおよび</y>タグの間には、顔の画像の正規化された縦方向の位置が配置される。 That is, the normalized horizontal position of the face image is placed between the <x> tag and the </ x> tag, and the face image is placed between the <y> tag and the </ y> tag. The normalized vertical position of the image is placed.

<width>タグおよび</width>タグの間に配置されている値は、コンテンツIDで特定される本画像２０１または縮小画像２０２における顔の画像の幅（横方向のサイズ）を示す。図１５において、<width>タグおよび</width>タグの間に配置されている0.408163は、本画像２０１または縮小画像２０２の幅を1.0とした場合に、顔の画像の幅が、0.408163であることを示す。 A value arranged between the <width> tag and the </ width> tag indicates the width (lateral size) of the face image in the main image 201 or the reduced image 202 specified by the content ID. In FIG. 15, 0.408163 arranged between the <width> tag and the </ width> tag is 0.408163 when the width of the main image 201 or the reduced image 202 is 1.0. It shows that.

<height>タグおよび</height>タグの間に配置されているは、コンテンツIDで特定される本画像２０１または縮小画像２０２における顔の画像の高さ（縦方向のサイズ）を示す。図１５において、<height>タグおよび</height>タグの間に配置されている0.273973は、本画像２０１または縮小画像２０２の高さを1.0とした場合に、顔の画像の高さが、0.273973であることを示す。 Arranged between the <height> tag and the </ height> tag indicates the height (vertical size) of the face image in the main image 201 or the reduced image 202 specified by the content ID. In FIG. 15, 0.273973 arranged between the <height> tag and the </ height> tag indicates that the height of the face image is 0.273973 when the height of the main image 201 or the reduced image 202 is 1.0. Indicates that

すなわち、<width>タグおよび</width>タグの間には、顔の画像の正規化された幅が配置され、<height>タグおよび</height>タグの間には、顔の画像の正規化された高さが配置される。 That is, the normalized width of the face image is placed between the <width> tag and the </ width> tag, and the face image normalization is placed between the <height> tag and the </ height> tag. The height is arranged.

<roll>タグおよび</roll>タグの間に配置されている値は、顔の画像における顔のロール角を示す。図１５において、<roll>タグおよび</roll>タグの間に配置されている0.000000は、顔の画像における顔のロール角が、0.000000度であることを示す。 The value arranged between the <roll> tag and the </ roll> tag indicates the roll angle of the face in the face image. In FIG. 15, 0.000000 arranged between the <roll> tag and the </ roll> tag indicates that the face roll angle in the face image is 0.000000 degrees.

<pitch>タグおよび</pitch>タグの間に配置されている値は、顔の画像における顔のピッチ角を示す。図１５において、<pitch>タグおよび</pitch>タグの間に配置されている0.000000は、顔の画像における顔のピッチ角が、0.000000度であることを示す。 The value arranged between the <pitch> tag and the </ pitch> tag indicates the pitch angle of the face in the face image. In FIG. 15, 0.000000 arranged between the <pitch> tag and the </ pitch> tag indicates that the face pitch angle in the face image is 0.000000 degrees.

<yaw>タグおよび</yaw>タグの間に配置されている値は、顔の画像における顔のヨー角を示す。図１５において、<yaw>タグおよび</yaw>タグの間に配置されている0.000000は、顔の画像における顔のヨー角が、0.000000度であることを示す。 The value arranged between the <yaw> tag and the </ yaw> tag indicates the face yaw angle in the face image. In FIG. 15, 0.000000 arranged between the <yaw> tag and the </ yaw> tag indicates that the face yaw angle in the face image is 0.000000 degrees.

ここで、ロール角は、顔の前後方向の位置を示す前後軸（x軸）の周りの移動角である。ピッチ角は、顔の左右方向の位置を示す横軸（y軸）の周りの移動角である。ヨー角は、顔の上下方向の位置を示す垂直軸（z軸）の周りの移動角である。 Here, the roll angle is a movement angle around the front-rear axis (x-axis) indicating the position of the face in the front-rear direction. The pitch angle is a movement angle around the horizontal axis (y axis) indicating the position of the face in the left-right direction. The yaw angle is a movement angle around the vertical axis (z axis) indicating the vertical position of the face.

<Similarityinfo>タグおよび</Similarityinfo>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２と他の画像との類似の度合いを求める場合に用いる、コンテンツIDで特定される本画像２０１および縮小画像２０２の特徴量が配置される。 Between the <Similarityinfo> tag and the </ Similarityinfo> tag, a book specified by the content ID used when obtaining the degree of similarity between the master image 201 specified by the content ID or the reduced image 202 and another image The feature amounts of the image 201 and the reduced image 202 are arranged.

図１５に示す例において、<Similarityinfo>タグおよび</Similarityinfo>タグの間には、本画像２０１または縮小画像２０２が所定の色名によって想起される度合いを示す関連度、および色または画像の周波数成分などの類似の度合いを計算するための特徴量が配置される。 In the example shown in FIG. 15, between the <Similarityinfo> tag and the </ Similarityinfo> tag, the degree of association indicating the degree to which the main image 201 or the reduced image 202 is recalled by a predetermined color name, and the frequency of the color or image A feature amount for calculating the degree of similarity such as a component is arranged.

<ColorInfo>タグおよび</ColorInfo>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から抽出された、本画像２０１または縮小画像２０２の画素の色を基に、本画像２０１または縮小画像２０２が所定の色名によって想起される度合いを示す関連度が配置される。 Between the <ColorInfo> tag and the </ ColorInfo> tag, the main image is extracted based on the pixel color of the main image 201 or the reduced image 202 extracted from the main image 201 or the reduced image 202 specified by the content ID. A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by a predetermined color name is arranged.

<ColorWhite>タグおよび</ColorWhite>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から、本画像２０１または縮小画像２０２の画素の色を基に抽出された、本画像２０１または縮小画像２０２が白である色名によって想起される度合いを示す関連度が配置される。図１５において、<ColorWhite>タグおよび</ColorWhite>タグの間に配置されている０は、本画像２０１または縮小画像２０２が白である色名によって想起される度合いを示す関連度が０であることを示す。 Between the <ColorWhite> tag and the </ ColorWhite> tag, the main image extracted from the main image 201 or the reduced image 202 specified by the content ID based on the color of the pixel of the main image 201 or the reduced image 202 A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by the color name of white is arranged. In FIG. 15, 0 arranged between the <ColorWhite> tag and the </ ColorWhite> tag has a relevance level of 0 indicating the degree to which the main image 201 or the reduced image 202 is recalled by a white color name. It shows that.

<ColorBlack>タグおよび</ColorBlack>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から、本画像２０１または縮小画像２０２の画素の色を基に抽出された、本画像２０１または縮小画像２０２が黒である色名によって想起される度合いを示す関連度が配置される。図１５において、<ColorBlack>タグおよび</ColorBlack>タグの間に配置されている０は、本画像２０１または縮小画像２０２が黒である色名によって想起される度合いを示す関連度が０であることを示す。 Between the <ColorBlack> tag and the </ ColorBlack> tag, the main image extracted from the main image 201 or the reduced image 202 specified by the content ID based on the color of the pixel of the main image 201 or the reduced image 202 A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by the color name of black is arranged. In FIG. 15, 0 arranged between the <ColorBlack> tag and the </ ColorBlack> tag has a relevance degree of 0 indicating the degree to which the main image 201 or the reduced image 202 is recalled by a black color name. It shows that.

<ColorRed>タグおよび</ColorRed>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から、本画像２０１または縮小画像２０２の画素の色を基に抽出された、本画像２０１または縮小画像２０２が赤である色名によって想起される度合いを示す関連度が配置される。図１５において、<ColorRed>タグおよび</ColorRed>タグの間に配置されている０は、本画像２０１または縮小画像２０２が赤である色名によって想起される度合いを示す関連度が０であることを示す。 Between the <ColorRed> tag and the </ ColorRed> tag, the main image extracted from the main image 201 or the reduced image 202 specified by the content ID based on the pixel color of the main image 201 or the reduced image 202 A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by the color name of red is arranged. In FIG. 15, 0 arranged between the <ColorRed> tag and the </ ColorRed> tag has a relevance of 0 indicating the degree to which the main image 201 or the reduced image 202 is recalled by a red color name. It shows that.

<ColorYellow>タグおよび</ColorYellow>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から、本画像２０１または縮小画像２０２の画素の色を基に抽出された、本画像２０１または縮小画像２０２が黄である色名によって想起される度合いを示す関連度が配置される。図１５において、<ColorYellow>タグおよび</ColorYellow>タグの間に配置されている０は、本画像２０１または縮小画像２０２が黄である色名によって想起される度合いを示す関連度が０であることを示す。 Between the <ColorYellow> tag and the </ ColorYellow> tag, the main image extracted from the main image 201 or the reduced image 202 specified by the content ID based on the color of the pixel of the main image 201 or the reduced image 202 A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by the color name of yellow is arranged. In FIG. 15, 0 arranged between the <ColorYellow> tag and the </ ColorYellow> tag has a relevance of 0 indicating the degree to which the main image 201 or the reduced image 202 is recalled by a yellow color name. It shows that.

<ColorGreen>タグおよび</ColorGreen>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から、本画像２０１または縮小画像２０２の画素の色を基に抽出された、本画像２０１または縮小画像２０２が緑である色名によって想起される度合いを示す関連度が配置される。図１５において、<ColorGreen>タグおよび</ColorGreen>タグの間に配置されている１２は、本画像２０１または縮小画像２０２が緑である色名によって想起される度合いを示す関連度が０．１２であることを示す。すなわち、ここでは関連度が％（パーセント）表記にて記録されている。 Between the <ColorGreen> tag and the </ ColorGreen> tag, the main image extracted from the main image 201 or the reduced image 202 specified by the content ID based on the pixel color of the main image 201 or the reduced image 202 A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by a color name of green is arranged. In FIG. 15, 12 arranged between the <ColorGreen> tag and the </ ColorGreen> tag has a relevance of 0.12 indicating the degree to which the main image 201 or the reduced image 202 is recalled by a green color name. Indicates that That is, here, the degree of association is recorded in% (percent) notation.

<ColorBlue>タグおよび</ColorBlue>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２から、本画像２０１または縮小画像２０２の画素の色を基に抽出された、本画像２０１または縮小画像２０２が青である色名によって想起される度合いを示す関連度が配置される。図１５において、<ColorBlue>タグおよび</ColorBlue>タグの間に配置されている０は、本画像２０１または縮小画像２０２が黄である色名によって想起される度合いを示す関連度が０であることを示す。 Between the <ColorBlue> tag and the </ ColorBlue> tag, the main image extracted from the main image 201 or the reduced image 202 specified by the content ID based on the pixel color of the main image 201 or the reduced image 202 A degree of association indicating the degree to which 201 or the reduced image 202 is recalled by the color name of blue is arranged. In FIG. 15, 0 arranged between the <ColorBlue> tag and the </ ColorBlue> tag has a relevance of 0 indicating the degree to which the main image 201 or the reduced image 202 is recalled by the color name being yellow. It shows that.

<VectorInfo>タグおよび</VectorInfo>タグの間には、コンテンツIDで特定される本画像２０１または縮小画像２０２と他の画像との類似の度合いを求めるための、コンテンツIDで特定される本画像２０１または縮小画像２０２についての特徴が配置される。 Between the <VectorInfo> tag and the </ VectorInfo> tag, the main image specified by the content ID for obtaining the degree of similarity between the main image 201 specified by the content ID or the reduced image 202 and another image. Features for 201 or reduced image 202 are placed.

<VectorInfo>タグおよび</VectorInfo>タグの１つの組の間は、コンテンツIDで特定される本画像２０１または縮小画像２０２についての、それぞれ１つの特徴が配置される。図１５のメタデータ２６１の例には、<VectorInfo>タグおよび</VectorInfo>タグの３つの組が記述されている。 Between one set of <VectorInfo> tag and </ VectorInfo> tag, one feature of the main image 201 or the reduced image 202 specified by the content ID is arranged. In the example of the metadata 261 in FIG. 15, three sets of <VectorInfo> tag and </ VectorInfo> tag are described.

それぞれの<VectorInfo>タグおよび</VectorInfo>タグの間には、<method>タグと</method>タグ、および<vector>タグと</vector>タグが配置される。<method>タグおよび</method>タグの間には、類似の度合いを求めるための特徴の方式が記述され、<vector>タグおよび</vector>タグの間には、その特徴の量が記述される。<vector>タグおよび</vector>タグの間に記述される特徴量は、ベクトルとされる。 Between each <VectorInfo> tag and </ VectorInfo> tag, a <method> tag and a </ method> tag, and a <vector> tag and a </ vector> tag are arranged. Between the <method> tag and the </ method> tag, a feature method for determining the degree of similarity is described. Between the <vector> tag and the </ vector> tag, the amount of the feature is described. Is done. The feature amount described between the <vector> tag and the </ vector> tag is a vector.

図１５において、最も上の<VectorInfo>タグおよび</VectorInfo>タグの間の、<method>タグおよび</method>タグの間に配置されているColor Featureは、その次の<vector>タグおよび</vector>タグの間に配置されている特徴量が、色の特徴量であることを示す。色の特徴量は、例えば、図１１を参照して説明した色ヒストグラムで示される特徴量である。 In FIG. 15, the Color Feature arranged between the <method> tag and the </ method> tag between the top <VectorInfo> tag and the </ VectorInfo> tag is the next <vector> tag and The feature quantity arranged between the </ vector> tags is a color feature quantity. The color feature amount is, for example, a feature amount indicated by the color histogram described with reference to FIG.

図１５において、上から２番目の<VectorInfo>タグおよび</VectorInfo>タグの間の、<method>タグおよび</method>タグの間に配置されているTexture Featureは、その次の<vector>タグおよび</vector>タグの間に配置されている特徴量が、模様の特徴量であることを示す。模様の特徴量は、例えば、図１２および図１３を参照して説明した垂直成分ヒストグラムおよび水平成分ヒストグラムからなる周波数成分のヒストグラムで示される特徴量である。 In FIG. 15, the Texture Feature arranged between the <method> tag and the </ method> tag between the <VectorInfo> tag and the </ VectorInfo> tag from the second top is the next <vector>. The feature amount arranged between the tag and the </ vector> tag indicates the feature amount of the pattern. The feature amount of the pattern is, for example, a feature amount indicated by a frequency component histogram including a vertical component histogram and a horizontal component histogram described with reference to FIGS. 12 and 13.

メタデータ２６１は、全体として、デジタルスチルカメラ１１において、コンテンツデータベース１１１と類似特徴データベース１１２とに格納され、サーバ１３において、コンテンツデータベース１４１と類似特徴データベース１４２とに格納される。すなわち、メタデータ２６１は、適宜分割されて、デジタルスチルカメラ１１において、その一部分がコンテンツデータベース１１１に格納され、残りの部分が類似特徴データベース１１２に格納され、サーバ１３において、コンテンツデータベース１１１に格納されている部分と同じ部分がコンテンツデータベース１４１に格納され、類似特徴データベース１１２に格納されている部分と同じ部分が類似特徴データベース１４２に格納される。 As a whole, the metadata 261 is stored in the content database 111 and the similar feature database 112 in the digital still camera 11, and is stored in the content database 141 and the similar feature database 142 in the server 13. That is, the metadata 261 is appropriately divided, and the digital still camera 11 stores a part thereof in the content database 111, the remaining part is stored in the similar feature database 112, and is stored in the content database 111 in the server 13. The same part as the stored part is stored in the content database 141, and the same part as the part stored in the similar feature database 112 is stored in the similar feature database 142.

図１６は、コンテンツデータベース１１１またはコンテンツデータベース１４１に格納されているメタデータ（の部分）の構成を示す図である。 FIG. 16 is a diagram illustrating a configuration of (parts of) metadata stored in the content database 111 or the content database 141.

コンテンツデータベース１１１またはコンテンツデータベース１４１に格納されているメタデータは、コンテンツID、撮影時刻、パス名、ファイル名、グループID、画像に含まれる顔の画像に関する情報（以下、顔画像情報と称する）、ラベルID、およびコメントなどからなる。 The metadata stored in the content database 111 or the content database 141 includes a content ID, a shooting time, a path name, a file name, a group ID, information on a face image included in the image (hereinafter referred to as face image information), It consists of a label ID and a comment.

コンテンツIDは、画像に固有のIDであり、画像を特定する。コンテンツIDによって、本画像２０１および縮小画像２０２が特定される。コンテンツIDは、GUIDであるプロパティとされ、文字列の型で表現される。画像が撮影された日時を示す撮影時刻は、協定世界時およびローカルタイムで表現される。協定世界時で表される撮影時刻は、UTCdateであるプロパティとされ、日付の型で表現される。協定世界時で表される撮影時刻は、EXIF方式のデータのDate Time Originalに記入される撮影時刻（UTC(Universal Coordinated Time)）と同じである。 The content ID is an ID unique to the image and identifies the image. The main image 201 and the reduced image 202 are specified by the content ID. The content ID is a property that is a GUID, and is represented by a character string type. The shooting time indicating the date and time when the image was shot is expressed in coordinated universal time and local time. The shooting time expressed in Coordinated Universal Time is a property that is UTCdate and is expressed in the date type. The shooting time represented in Coordinated Universal Time is the same as the shooting time (UTC (Universal Coordinated Time)) written in the Date Time Original of the EXIF data.

ローカルタイムで表される撮影時刻は、dateであるプロパティとされ、日付の型で表現される。ローカルタイムで表される撮影時刻は、EXIF方式のデータのDate Time Originalに記入される撮影時刻（Local time）と同じである。 The shooting time expressed in local time is a property that is date, and is expressed in a date type. The photographing time represented by the local time is the same as the photographing time (Local time) written in Date Time Original of the EXIF data.

パス名は、ms/DCIM/XXXXX/など、本画像２０１のファイルのディレクトリ名（ファイル名を含まず）を示す。パス名は、pathであるプロパティとされ、文字列の型で表現される。 The path name indicates the directory name (not including the file name) of the file of the main image 201 such as ms / DCIM / XXXXX /. The path name is a property that is path, and is expressed as a string type.

ファイル名は、DSC00001.JPGなど、画像データである本画像２０１が格納されているファイルの名前を示す。ファイル名は、DCFnameであるプロパティとされ、文字列の型で表現される。 The file name indicates the name of a file that stores the main image 201 that is image data, such as DSC00001.JPG. The file name is a property that is DCFname and is represented by a string type.

縮小画像２０２のパス名およびファイル名は、/DATA/EVENTIMAGE/000000000001.JPGなど、縮小画像２０２のファイルのディレクトリ名およびファイル名を示す。縮小画像２０２のパス名およびファイル名は、vgaCachePathであるプロパティとされ、文字列の型で表現される。 The path name and file name of the reduced image 202 indicate the directory name and file name of the file of the reduced image 202 such as /DATA/EVENTIMAGE/000000000001.JPG. The path name and file name of the reduced image 202 are set as a property of vgaCachePath and are expressed in a character string type.

グループIDは、画像が所属するグループを特定するデータである。画像は、使用者によって、所望のグループに分類される。グループIDは、画像が分類されたグループを特定する。例えば、画像が撮影されたイベント（旅行や運動会などの行事や催し）毎に、グループを造り、そのイベントで撮影された画像を、イベントに対応するグループに分類することができる。 The group ID is data for specifying the group to which the image belongs. The images are classified into a desired group by the user. The group ID specifies a group into which images are classified. For example, it is possible to create a group for each event (an event or event such as a trip or athletic meet) in which an image is taken, and classify the images taken at that event into groups corresponding to the event.

グループIDは、groupIDであるプロパティとされ、数値の型で表現される。 The group ID is a property that is groupID, and is expressed by a numeric type.

例えば、顔画像情報は、画像が、風景画（顔が写っていない画像）、少人数の人物画（１乃至５人の顔が写っている画像）、または大人数の人物画（６人以上の顔が写っている画像）のいずれかであることを示す。例えば、１である顔画像情報は、画像が風景画であることを示し、２である顔画像情報は、画像が少人数の人物画であることを示し、３である顔画像情報は、画像が大人数の人物画であることを示す。顔画像情報は、faceExistenceであるプロパティとされ、数値の型で表現される。 For example, the face image information may be a landscape image (image without a face), a small number of person images (images with 1 to 5 faces), or a large number of person images (6 or more people). Image showing the face of). For example, face image information of 1 indicates that the image is a landscape image, face image information of 2 indicates that the image is a small number of person images, and face image information of 3 indicates that the image is an image. Indicates that it is a portrait of a large number of people. The face image information is a property which is faceExistence and is expressed by a numerical value type.

顔画像情報は、画像に含まれる顔の画像の数、画像における顔の画像の位置、顔の画像の大きさ、または顔の画像における顔の向きを示すようにしてもよい。 The face image information may indicate the number of face images included in the image, the position of the face image in the image, the size of the face image, or the orientation of the face in the face image.

ラベルIDは、画像に付されたラベルを示す。ラベルIDは、labelsであるプロパティとされ、数値の配列の型で表現される。 The label ID indicates a label attached to the image. The label ID is a property that is labels, and is expressed as a numeric array type.

コメントは、commentであるプロパティとされ、文字列の型で表現される。 A comment is a property that is a comment and is represented by a string type.

プロテクト状態は、消去付加などのその画像の保護の状態を示す。プロテクト状態は、protectであるプロパティとされ、論理データの型で表現される。 The protected state indicates the state of protection of the image such as deletion and addition. The protect state is a property that is protect and is expressed by a logical data type.

エクスチェンジ／インポートフラグは、その画像が交換されたか、または画像がインポートされたことを示す。エクスチェンジ／インポートフラグは、exchangeOrImportFlagであるプロパティとされ、論理データの型で表現される。 The exchange / import flag indicates that the image has been exchanged or has been imported. The exchange / import flag is a property which is exchangeOrImportFlag, and is represented by a logical data type.

Trueであるメタイネーブルフラグは、サーバ１３によりその画像のメタデータが生成されたことを示す。メタイネーブルフラグは、metaEnableFlagであるプロパティとされ、論理データの型で表現される。 A meta enable flag that is true indicates that the server 13 has generated metadata for the image. The meta enable flag is a property that is metaEnableFlag, and is represented by a logical data type.

Trueであるバックアップフラグは、サーバ１３によりその画像がバックアップされたことを示す。バックアップフラグは、backUpFlagであるプロパティとされ、論理データの型で表現される。 A backup flag that is True indicates that the image has been backed up by the server 13. The backup flag is a property which is backUpFlag and is expressed by a logical data type.

図１７は、コンテンツデータベース１１１に格納されているメタデータ（の部分）および類似特徴データベース１１２に格納されているメタデータ（の部分）の構造を示す図である。 FIG. 17 is a diagram illustrating the structures of metadata (parts) stored in the content database 111 and metadata (parts) stored in the similar feature database 112.

コンテンツデータベース１１１には、画像毎のコンテンツアイテムが格納される。コンテンツアイテムは、メタデータ２６１の一部分のデータからなる。 The content database 111 stores content items for each image. The content item is a part of the metadata 261.

例えば、コンテンツアイテム２８１−１は、格納されているコンテンツIDで特定される１つの画像に対応し、コンテンツID、本画像２０１のパス名およびファイル名（図１７中のPath）、縮小画像２０２のパス名およびファイル名、グループID、ローカルタイムで表される撮影時刻、および顔画像情報などからなり、コンテンツアイテム２８１−２は、他の画像に対応し、コンテンツID、本画像２０１のパス名およびファイル名（図１７中のPath）、縮小画像２０２のパス名およびファイル名、グループID、ローカルタイムで表される撮影時刻、および顔画像情報などからなる。 For example, the content item 281-1 corresponds to one image specified by the stored content ID, the content ID, the path name and file name of the main image 201 (Path in FIG. 17), and the reduced image 202. The content item 281-2 corresponds to other images, and includes a content ID, a path name of the main image 201, and a path name and file name, a group ID, a shooting time represented by local time, and face image information. The file name (Path in FIG. 17), the path name and file name of the reduced image 202, the group ID, the shooting time expressed in local time, face image information, and the like.

以下、コンテンツアイテム２８１−１およびコンテンツアイテム２８１−２を個々に区別する必要がないとき、単に、コンテンツアイテム２８１と称する。 Hereinafter, when there is no need to distinguish between the content item 281-1 and the content item 281-2, they are simply referred to as the content item 281.

類似特徴データベース１１２には、画像毎の類似特徴アイテムが格納される。類似特徴アイテムは、メタデータ２６１を構成するデータのうち、コンテンツアイテム２８１を構成する部分以外の部分のデータからなる。ただし、類似特徴アイテムは、コンテンツIDを含む。 The similar feature database 112 stores similar feature items for each image. The similar feature item includes data of a part other than the part constituting the content item 281 in the data constituting the metadata 261. However, the similar feature item includes a content ID.

例えば、類似特徴アイテム２８２−１は、格納されているコンテンツIDで特定されるコンテンツアイテム２８１−１に対応し、すなわち、格納されているコンテンツIDで特定される１つの画像に対応し、コンテンツID、色ヒストグラム、および周波数成分のヒストグラムなどからなる。 For example, the similar feature item 282-1 corresponds to the content item 281-1 specified by the stored content ID, that is, corresponds to one image specified by the stored content ID, and the content ID , A color histogram, and a histogram of frequency components.

色ヒストグラムは、画像の３２の色毎の頻度を示し、histogramであるプロパティとされる。周波数成分のヒストグラムは、垂直成分ヒストグラムと水平成分ヒストグラムとからなり、画像の縦方向および横方向のそれぞれについての、８つの周波数に対する周波数成分の最大値の頻度を示し、textureであるプロパティとされる。 The color histogram indicates the frequency for each of the 32 colors of the image and is a property that is a histogram. The frequency component histogram is composed of a vertical component histogram and a horizontal component histogram, and indicates the frequency of the maximum value of the frequency component with respect to eight frequencies in each of the vertical direction and the horizontal direction of the image, and is a property which is a texture. .

同様に、例えば、類似特徴アイテム２８２−２は、格納されているコンテンツIDで特定されるコンテンツアイテム２８１−２に対応し、すなわち、格納されているコンテンツIDで特定される１つの画像に対応し、コンテンツID、色ヒストグラム、および周波数成分のヒストグラムなどからなる。 Similarly, for example, the similar feature item 282-2 corresponds to the content item 281-2 specified by the stored content ID, that is, corresponds to one image specified by the stored content ID. , Content ID, color histogram, and frequency component histogram.

以下、類似特徴アイテム２８２−１および類似特徴アイテム２８２−２を個々に区別する必要がないとき、単に、類似特徴アイテム２８２と称する。 Hereinafter, when there is no need to distinguish between the similar feature item 282-1 and the similar feature item 282-2, they are simply referred to as the similar feature item 282.

このように、類似特徴データベース１１２には、コンテンツデータベース１１１に格納されているコンテンツアイテム２８１に対応した類似特徴アイテム２８２が格納される。 As described above, the similar feature database 112 stores the similar feature item 282 corresponding to the content item 281 stored in the content database 111.

図１８は、類似特徴アイテム２８２の構造を示す図である。類似特徴アイテム２８２は、アイテム２９１、アイテム２９２−１乃至アイテム２９２−３２、およびアイテム２９３から構成されている。アイテム２９１は、コンテンツID、アイテム２９２−１乃至アイテム２９２−３２を示すポインタ、およびアイテム２９３を示すポインタから構成される。アイテム２９２−１乃至アイテム２９２−３２を示すポインタは、色ヒストグラムに対応している。アイテム２９３を示すポインタは、周波数成分のヒストグラムに対応している。 FIG. 18 is a diagram illustrating the structure of the similar feature item 282. The similar feature item 282 includes an item 291, an item 292-1 to an item 292-32, and an item 293. The item 291 includes a content ID, a pointer indicating the items 292-1 to 292-32, and a pointer indicating the item 293. The pointers indicating the items 292-1 to 292-32 correspond to the color histogram. A pointer indicating the item 293 corresponds to a histogram of frequency components.

アイテム２９２−１乃至アイテム２９２−３２は、それぞれ、色ヒストグラムの頻度、すなわち、Ｌ*ａ*ｂ*で表される色のそれぞれと、それぞれの色が画像内で占有している割合（例えば、３２色の色毎の画素の数）を示す。アイテム２９２−１は、Ｌ*ａ*ｂ*で表される色であって、３２色のうちの第１の色と、第１の色が画像内で占有している割合を示す。アイテム２９２−２は、Ｌ*ａ*ｂ*で表される色であって、３２色のうちの第２の色と、第２の色が画像内で占有している割合を示す。 Each of the items 292-1 to 292-32 is the frequency of the color histogram, that is, each of the colors represented by L * a * b * and the ratio of each color occupied in the image (for example, The number of pixels for each of the 32 colors). The item 292-1 is a color represented by L * a * b *, and indicates the first color of the 32 colors and the ratio occupied by the first color in the image. The item 292-2 is a color represented by L * a * b *, and indicates the second color of the 32 colors and the ratio occupied by the second color in the image.

アイテム２９２−３乃至アイテム２９２−３２は、それぞれ、Ｌ*ａ*ｂ*で表される色であって、３２色のうちの第３の色乃至第３２の色のそれぞれと、第３の色乃至第３２の色のそれぞれが画像内で占有している割合を示す。 The items 292-3 to 292-32 are colors represented by L * a * b *, respectively, and the third to thirty-second colors of the thirty-two colors and the third color The ratios of the thirty-second to thirty-second colors are indicated in the image.

すなわち、アイテム２９２−１乃至アイテム２９２−３２は、全体として、１つの画像の色ヒストグラムを示す。色ヒストグラムは、色特徴ベクトルCvとして表すこともできる。色特徴ベクトルCvは、Cv={(c1,r1),・・・,(c32,r32)}とも表現される。ここで、(c1,r1)乃至(c32,r32)のそれぞれは、c1乃至c32のいずれかで表される３２色のうちのいずれかの、画像内で占有している割合を示す。 That is, the items 292-1 to 292-32 indicate a color histogram of one image as a whole. The color histogram can also be expressed as a color feature vector Cv. The color feature vector Cv is also expressed as Cv = {(c1, r1),..., (C32, r32)}. Here, each of (c1, r1) to (c32, r32) represents a ratio occupied in any one of the 32 colors represented by any of c1 to c32.

アイテム２９３は、垂直成分ヒストグラムおよび水平成分ヒストグラムを示す。垂直成分ヒストグラムおよび水平成分ヒストグラムは、それぞれ、８つの頻度を示す。 Item 293 shows a vertical component histogram and a horizontal component histogram. Each of the vertical component histogram and the horizontal component histogram shows eight frequencies.

垂直成分ヒストグラムおよび水平成分ヒストグラムを合わせてなる周波数成分のヒストグラムは、周波数成分ベクトルTvとしても表すこともできる。周波数成分ベクトルTvは、Tv={(t1,1),・・・,(t8,1),(t9,1),・・・,(t16,1)}とも表現される。ここで、(t1,1)乃至(t16,1)のそれぞれは、t1乃至t16のいずれかで表される周波数成分の最大となる数（頻度）を示す。 The frequency component histogram formed by combining the vertical component histogram and the horizontal component histogram can also be expressed as a frequency component vector Tv. The frequency component vector Tv is also expressed as Tv = {(t1,1), ..., (t8,1), (t9,1), ..., (t16,1)}. Here, each of (t1,1) to (t16,1) indicates the maximum number (frequency) of frequency components represented by any of t1 to t16.

次に、図１９のフローチャートを参照して、Webサーバ１５−１若しくはWebサーバ１５−２またはその他の機器から画像を取得する、サーバ１３の画像の取得の処理を説明する。以下、Webサーバ１５−１から画像を取得する場合を例に説明する。 Next, an image acquisition process of the server 13 for acquiring an image from the Web server 15-1, the Web server 15-2, or other devices will be described with reference to a flowchart of FIG. Hereinafter, a case where an image is acquired from the Web server 15-1 will be described as an example.

ステップＳ６１において、サーバ１３の送信制御部１３８−２および受信制御部１３８−２は、ネットワーク１４を介して、通信部８０に、Webサーバ１５−１から本画像２０１を取得させる。 In step S61, the transmission control unit 138-2 and the reception control unit 138-2 of the server 13 cause the communication unit 80 to acquire the main image 201 from the Web server 15-1 via the network 14.

例えば、ステップＳ６１において、送信制御部１３８−２および受信制御部１３８−２は、通信部８０に、ネットワーク１４を介してWebサーバ１５−１と接続させる。そして、送信制御部１３８−２は、通信部８０に、ネットワーク１４を介して、Webサーバ１５−１宛てに本画像２０１の送信要求を送信させる。Webサーバ１５−１が要求された本画像２０１をネットワーク１４を介して送信してくるので、受信制御部１３８−２は、通信部８０に、Webサーバ１５−１から送信されてきた本画像２０１を受信させる。受信制御部１３８−２は、受信することによって取得した本画像２０１を画像保持部１４０に供給する。 For example, in step S61, the transmission control unit 138-2 and the reception control unit 138-2 cause the communication unit 80 to connect to the Web server 15-1 via the network 14. Then, the transmission control unit 138-2 causes the communication unit 80 to transmit a transmission request for the main image 201 to the Web server 15-1 via the network 14. Since the Web server 15-1 transmits the requested master image 201 via the network 14, the reception control unit 138-2 sends the master image 201 transmitted from the Web server 15-1 to the communication unit 80. To receive. The reception control unit 138-2 supplies the main image 201 acquired by reception to the image holding unit 140.

ステップＳ６２において、縮小画像生成部１３２は、受信した本画像２０１から縮小画像２０２を生成する。例えば、縮小画像生成部１３２は、本画像２０１から画素を間引きすることにより縮小画像２０２を生成する。または、縮小画像生成部１３２は、本画像２０１の互いに隣接する複数の画素の画素値の平均値を、その複数の画素に対応する１つの画素の画素値とすることにより、縮小画像２０２を生成する。 In step S 62, the reduced image generation unit 132 generates a reduced image 202 from the received main image 201. For example, the reduced image generation unit 132 generates the reduced image 202 by thinning pixels from the main image 201. Alternatively, the reduced image generation unit 132 generates the reduced image 202 by using the average value of the pixel values of a plurality of adjacent pixels of the main image 201 as the pixel value of one pixel corresponding to the plurality of pixels. To do.

縮小画像生成部１３２は、生成した縮小画像２０２を画像保持部１４０に供給する。 The reduced image generation unit 132 supplies the generated reduced image 202 to the image holding unit 140.

ステップＳ６３において、画像保持部１４０は、受信した本画像２０１および縮小画像生成部１３２において生成された縮小画像２０２を記録する。 In step S 63, the image holding unit 140 records the received main image 201 and the reduced image 202 generated by the reduced image generation unit 132.

なお、縮小画像生成部１３２は、画像保持部１４０から本画像２０１を読み出して、読み出した本画像２０１から縮小画像２０２を生成するようにしてもよい。 Note that the reduced image generation unit 132 may read the main image 201 from the image holding unit 140 and generate the reduced image 202 from the read main image 201.

ステップＳ６４において、画像解析部１３１は、画像保持部１４０に記録された画像を解析する。ステップＳ６４の画像の解析の処理は、図１０のフローチャートを参照して説明した処理と同様なので、その説明は省略する。 In step S 64, the image analysis unit 131 analyzes the image recorded in the image holding unit 140. The image analysis processing in step S64 is the same as the processing described with reference to the flowchart of FIG.

ステップＳ６５において、メタデータ生成部１３３は、ステップＳ６４において抽出された画像の特徴を含む画像のメタデータを生成する。ステップＳ６６において、エントリ生成部１３４は、本画像２０１および縮小画像２０２のエントリを生成する。エントリ生成部１３４は、生成したエントリを、ステップＳ６５において生成されたメタデータに関係付けて、コンテンツデータベース１４１（および類似特徴データベース１４２）に追加（格納）する。 In step S65, the metadata generation unit 133 generates image metadata including the image features extracted in step S64. In step S 66, the entry generation unit 134 generates entries for the main image 201 and the reduced image 202. The entry generation unit 134 adds (stores) the generated entry to the content database 141 (and the similar feature database 142) in association with the metadata generated in step S65.

ステップＳ６７において、送信制御部１３８−１および受信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１と接続させる。 In step S67, the transmission control unit 138-1 and the reception control unit 138-1 cause the communication unit 79 to connect to the digital still camera 11.

ステップＳ６８において、検索部１３７は、デジタルスチルカメラ１１から送信されてくるデータを基に、画像保持部１４０に記録されている縮小画像２０２のうち、デジタルスチルカメラ１１に持ち出す縮小画像２０２を選択する。検索部１３７は、画像保持部１４０から選択した縮小画像２０２を読み出して、読み出した縮小画像２０２を送信制御部１３８−１に供給する。 In step S 68, the search unit 137 selects the reduced image 202 to be brought out to the digital still camera 11 from the reduced images 202 recorded in the image holding unit 140 based on the data transmitted from the digital still camera 11. . The search unit 137 reads the reduced image 202 selected from the image holding unit 140 and supplies the read reduced image 202 to the transmission control unit 138-1.

ステップＳ６９において、送信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１宛てに選択された縮小画像２０２を送信させる。 In step S 69, the transmission control unit 138-1 causes the communication unit 79 to transmit the selected reduced image 202 to the digital still camera 11.

ステップＳ７０において、送信制御部１３８−１は、ステップＳ３７と同様の処理で、通信部７９に、デジタルスチルカメラ１１のコンテンツデータベース１１１および類似特徴データベース１１２に、送信された縮小画像２０２のメタデータであって、抽出された画像の特徴を含むメタデータを記入させる。 In step S70, the transmission control unit 138-1 performs processing similar to that in step S37, with the metadata of the reduced image 202 transmitted to the content database 111 and the similar feature database 112 of the digital still camera 11 to the communication unit 79. Then, the metadata including the extracted image features is entered.

ステップＳ７２において、サーバ１３の送信制御部１３８−１および受信制御部１３８−１は、通信部７９に、デジタルスチルカメラ１１との接続を切断させ、処理は終了する。 In step S72, the transmission control unit 138-1 and the reception control unit 138-1 of the server 13 cause the communication unit 79 to disconnect from the digital still camera 11, and the process ends.

図２０で示されるように、サーバ１３−１またはサーバ１３−２が、ネットワーク１４を介して、Webサーバ１５−１若しくはWebサーバ１５−２またはその他の機器から本画像２０１を取得し、取得した本画像２０１を記録すると、サーバ１３−１またはサーバ１３−２は、本画像２０１から縮小画像２０２を生成し、本画像２０１を解析して、本画像２０１の特徴を抽出する。そして、サーバ１３−１またはサーバ１３−２は、抽出した本画像２０１の特徴を記述したメタデータ２６１と共に縮小画像２０２をデジタルスチルカメラ１１または携帯電話機１２に書き込む。 As illustrated in FIG. 20, the server 13-1 or the server 13-2 acquires the master image 201 from the Web server 15-1, the Web server 15-2, or other devices via the network 14 and acquires the master image 201. When the main image 201 is recorded, the server 13-1 or the server 13-2 generates a reduced image 202 from the main image 201, analyzes the main image 201, and extracts features of the main image 201. Then, the server 13-1 or the server 13-2 writes the reduced image 202 in the digital still camera 11 or the mobile phone 12 together with the metadata 261 describing the characteristics of the extracted main image 201.

次に、図２１のフローチャートを参照して、デジタルスチルカメラ１１における検索の処理を説明する。ステップＳ８１において、検索部１０７は、コンテンツデータベース１１１または類似特徴データベース１１２に記録されているメタデータのうち、検索に用いるメタデータを選択する。例えば、検索部１０７は、検索に用いるメタデータとして、使用者の操作に応じた入力部４９からの信号を基に、撮影時刻若しくは撮影条件、顔の画像に関する情報、所定の色名によって想起される度合いを示す関連度、または色若しくは画像の周波数成分などの類似の度合いを計算するための特徴のうちのいずれかを選択する。 Next, search processing in the digital still camera 11 will be described with reference to the flowchart of FIG. In step S 81, the search unit 107 selects metadata to be used for search from the metadata recorded in the content database 111 or the similar feature database 112. For example, the search unit 107 is recalled as the metadata used for the search based on the signal from the input unit 49 according to the user's operation based on the shooting time or shooting condition, information on the face image, and a predetermined color name. Or a feature for calculating a degree of similarity such as a color or a frequency component of an image.

また、ステップＳ８１において、検索部１０７は、使用者の操作に応じた入力部４９からの信号を基に、画像保持部１１０に記録されている本画像２０１または縮小画像２０２の検索する範囲を選択する。 In step S81, the search unit 107 selects a search range of the main image 201 or the reduced image 202 recorded in the image holding unit 110 based on a signal from the input unit 49 according to a user operation. To do.

ステップＳ８２において、検索部１０７は、使用者の操作に応じた入力部４９から供給される信号としての、検索開始の指示を取得する。 In step S82, the search unit 107 acquires a search start instruction as a signal supplied from the input unit 49 according to the user's operation.

ステップＳ８３において、検索部１０７は、コンテンツデータベース１１１または類似特徴データベース１１２から、検索する範囲の本画像２０１または縮小画像２０２のメタデータ２６１を順に読み込む。 In step S83, the search unit 107 sequentially reads the metadata 261 of the main image 201 or reduced image 202 in the search range from the content database 111 or the similar feature database 112.

ステップＳ８４において、検索部１０７は、メタデータ２６１が存在するか否か、すなわち、メタデータ２６１がヌル（null）であるか否かを判定し、メタデータ２６１が存在すると判定された場合、ステップＳ８５に進み、検索部１０７は、メタデータ２６１から、検索結果表示制御データを生成する。 In step S84, the search unit 107 determines whether or not the metadata 261 exists, that is, whether or not the metadata 261 is null, and if it is determined that the metadata 261 exists, In step S85, the search unit 107 generates search result display control data from the metadata 261.

例えば、ステップＳ８５において、検索部１０７の距離計算部１２１は、色または画像の周波数成分などの類似の度合いを計算するための特徴を示すベクトルであるメタデータを基に、選択された画像（基準となる画像）についてのベクトルであるメタデータと、検索する範囲の画像についてのベクトルであるメタデータとから、ベクトルの距離を計算し、ベクトルの距離である検索結果表示制御データを生成する。 For example, in step S85, the distance calculation unit 121 of the search unit 107 selects a selected image (standard) based on metadata that is a vector indicating characteristics for calculating the degree of similarity such as a color or a frequency component of an image. The vector distance is calculated from the metadata that is the vector for the image and the metadata that is the vector for the image in the search range, and the search result display control data that is the vector distance is generated.

このベクトルの距離は、短いほど画像同士が似ていることを示すので、ベクトルの距離である検索結果表示制御データを用いることで、より類似している画像を読み出して、画像を類似している順に表示することができる。 The shorter the vector distance, the more similar the images are. Therefore, by using the search result display control data, which is the vector distance, a more similar image is read and the images are similar. They can be displayed in order.

例えば、ステップＳ８５において、検索部１０７は、所定の色名によって想起される度合いを示す関連度であるメタデータを基に、入力された閾値と関連度とを比較し、入力された閾値以上の関連度であることを示す検索結果表示制御データを生成する。 For example, in step S85, the search unit 107 compares the input threshold value with the relevance level based on metadata that is the relevance level indicating the degree recalled by a predetermined color name, and exceeds the input threshold value. Search result display control data indicating the relevance is generated.

入力された閾値以上の関連度であることを示す検索結果表示制御データを用いることで、その色名によって想起される度合いの大きい画像、すなわち、その色名の色を多く含む画像だけを読み出して、その色名の色を多く含む画像だけを表示することができる。 By using the search result display control data indicating that the degree of relevance is greater than or equal to the input threshold, only images with a high degree of recollection by the color name, that is, images containing many colors of the color name are read out. , Only an image including many colors of the color name can be displayed.

または、例えば、検索部１０７は、所定の色名によって想起される度合いを示す関連度であるメタデータを基に、入力された閾値と関連度との距離を計算することで、距離である検索結果表示制御データを生成する。 Alternatively, for example, the search unit 107 calculates the distance between the input threshold value and the degree of association based on the metadata that is the degree of association indicating the degree recalled by a predetermined color name. Generate result display control data.

入力された閾値と関連度との距離である検索結果表示制御データを用いることで、所望の色名の色を所望の量だけ含む画像を読み出して、所望の色名の色を所望の量だけ含む画像を表示することができる。 By using the search result display control data, which is the distance between the input threshold value and the degree of association, an image including a desired color name in a desired amount is read, and the desired color name in a desired amount An image including it can be displayed.

なお、検索結果表示制御データには、コンテンツIDが含まれ、これにより、検索結果表示制御データに対応する本画像２０１または縮小画像２０２が特定される。 Note that the search result display control data includes the content ID, thereby identifying the main image 201 or the reduced image 202 corresponding to the search result display control data.

ステップＳ８６において、検索部１０７は、生成した検索結果表示制御データを検索結果保持部１１５に格納する。 In step S 86, the search unit 107 stores the generated search result display control data in the search result holding unit 115.

ステップＳ８７において、検索部１０７は、検索する範囲の全ての本画像２０１または縮小画像２０２の処理を終了したか否かを判定し、検索する範囲の全ての本画像２０１または縮小画像２０２の処理を終了していないと判定された場合、ステップＳ８３に戻り、検索部１０７は、コンテンツデータベース１１１または類似特徴データベース１１２から、検索する範囲の次の本画像２０１または縮小画像２０２のメタデータ２６１を読み込み、上述した処理を繰り返す。 In step S87, the search unit 107 determines whether or not the processing of all the main images 201 or reduced images 202 in the search range has been completed, and performs the processing of all the main images 201 or reduced images 202 in the search range. When it is determined that the search has not ended, the process returns to step S83, and the search unit 107 reads the metadata 261 of the next main image 201 or reduced image 202 in the search range from the content database 111 or the similar feature database 112, The above processing is repeated.

ステップＳ８４において、メタデータ２６１が存在しないと判定された場合、すなわち、メタデータ２６１がヌル（null）であると判定された場合、ステップＳ８３に戻り、検索部１０７は、コンテンツデータベース１１１または類似特徴データベース１１２から、検索する範囲の次の本画像２０１または縮小画像２０２のメタデータ２６１を読み込み、上述した処理を繰り返す。 If it is determined in step S84 that the metadata 261 does not exist, that is, if it is determined that the metadata 261 is null, the process returns to step S83, and the search unit 107 selects the content database 111 or similar feature. The metadata 261 of the next main image 201 or reduced image 202 in the search range is read from the database 112, and the above-described processing is repeated.

ステップＳ８７において、検索する範囲の全ての本画像２０１または縮小画像２０２の処理を終了したと判定された場合、ステップＳ８８に進み、表示制御部１０６は、検索結果保持部１１５から、検索結果表示制御データを読み出す。ステップＳ８９において、表示制御部１０６は、検索結果表示制御データを基に、画像保持部１１０から本画像２０１または縮小画像２０２を読み出して、本画像２０１または縮小画像２０２をモニタ４０に表示させて、処理は終了する。 If it is determined in step S87 that the processing of all the main images 201 or reduced images 202 in the search range has been completed, the process proceeds to step S88, and the display control unit 106 controls the search result display control from the search result holding unit 115. Read data. In step S89, the display control unit 106 reads the main image 201 or the reduced image 202 from the image holding unit 110 based on the search result display control data, displays the main image 201 or the reduced image 202 on the monitor 40, and The process ends.

例えば、ステップＳ８５において、色または画像の周波数成分などの類似の度合いを計算するための特徴を示すベクトルの距離である検索結果表示制御データが生成された場合、ステップＳ８９において、表示制御部１０６は、本画像２０１または縮小画像２０２を、基準となる画像との類似の順にモニタ４０に表示させる。 For example, when search result display control data that is a vector distance indicating a feature for calculating the degree of similarity such as a color or a frequency component of an image is generated in step S85, the display control unit 106 in step S89 Then, the main image 201 or the reduced image 202 is displayed on the monitor 40 in the order similar to the reference image.

また、例えば、ステップＳ８５において、所定の色名によって想起される度合いを示す関連度が入力された閾値以上であることを示す検索結果表示制御データが生成された場合、ステップＳ８９において、表示制御部１０６は、その色名の色を多く含む本画像２０１または縮小画像２０２をモニタ４０に表示させる。 For example, when search result display control data indicating that the relevance indicating the degree recalled by the predetermined color name is equal to or greater than the input threshold value is generated in step S85, the display control unit is determined in step S89. 106 displays the main image 201 or the reduced image 202 including many colors of the color name on the monitor 40.

さらに、例えば、ステップＳ８５において、所定の色名によって想起される度合いを示す関連度と入力された閾値との距離である検索結果表示制御データが生成された場合、ステップＳ８９において、表示制御部１０６は、所望の色名の色を所望の量だけ含む本画像２０１または縮小画像２０２をモニタ４０に表示させる。 Further, for example, when search result display control data that is the distance between the degree of association indicating the degree recalled by the predetermined color name and the input threshold value is generated in step S85, the display control unit 106 in step S89. Causes the monitor 40 to display the main image 201 or the reduced image 202 including a desired amount of the color of the desired color name.

携帯電話機１２は、図２１のフローチャートを参照して説明した検索の処理と同様の処理を実行する。サーバ１３は、図２１のフローチャートを参照して説明した検索の処理と同様の処理を実行する。 The mobile phone 12 executes the same process as the search process described with reference to the flowchart of FIG. The server 13 executes processing similar to the search processing described with reference to the flowchart of FIG.

その結果、図２２で示されるように、サーバ１３−１またはサーバ１３−２における、例えば、コンテンツデータベース１４１および類似特徴データベース１４２に格納されているメタデータ２６１を基にした本画像２０１の検索と同様に、デジタルスチルカメラ１１または携帯電話機１２において、縮小画像２０２を、コンテンツデータベース１１１および類似特徴データベース１１２に格納されているメタデータ２６１を基にして検索することができる。 As a result, as shown in FIG. 22, the search of the master image 201 based on the metadata 261 stored in the content database 141 and the similar feature database 142, for example, in the server 13-1 or the server 13-2. Similarly, in the digital still camera 11 or the mobile phone 12, the reduced image 202 can be searched based on the metadata 261 stored in the content database 111 and the similar feature database 112.

次に、デジタルスチルカメラ１１による、より具体的な検索の処理について説明する。 Next, more specific search processing by the digital still camera 11 will be described.

図２３は、デジタルスチルカメラ１１による検索の処理の他の例を示すフローチャートである。ステップＳ１０１において、表示制御部１０６は、モニタ４０に、時系列に縮小画像２０２を表示させる。すなわち、ステップＳ１０１において、画像保持部１１０は、記録している縮小画像２０２のうち、使用者の操作に応じた入力部４９からの信号に応じた所定の範囲の縮小画像２０２を表示制御部１０６に供給する。また、コンテンツデータベース１１１は、表示制御部１０６に供給された所定の範囲の縮小画像２０２のメタデータ２６１のうち、撮影時刻のメタデータを表示制御部１０６に供給する。そして、表示制御部１０６は、モニタ４０に、撮影時刻を基に、撮影された順の時系列に縮小画像２０２を表示させる。 FIG. 23 is a flowchart illustrating another example of search processing by the digital still camera 11. In step S101, the display control unit 106 causes the monitor 40 to display the reduced image 202 in time series. That is, in step S101, the image holding unit 110 displays the reduced image 202 in a predetermined range according to the signal from the input unit 49 according to the user's operation among the recorded reduced images 202. To supply. Also, the content database 111 supplies the shooting control metadata to the display control unit 106 among the metadata 261 of the reduced image 202 within a predetermined range supplied to the display control unit 106. Then, the display control unit 106 causes the monitor 40 to display the reduced images 202 in time series in the order of shooting based on the shooting time.

例えば、図２４で示されるように、表示制御部１０６は、モニタ４０に、グループIDで特定されるグループ毎に、撮影された順の時系列に縮小画像２０２を表示させる。図２４の右側における四角は、１つの縮小画像２０２を示し、四角の中の数字は、撮影された順序を示す。すなわち、例えば、表示制御部１０６は、グループ毎に、撮影された順にラスタスキャン順に縮小画像２０２をモニタ４０に表示させる。 For example, as illustrated in FIG. 24, the display control unit 106 causes the monitor 40 to display the reduced images 202 in time series in the order of shooting for each group specified by the group ID. The square on the right side of FIG. 24 shows one reduced image 202, and the numbers in the squares indicate the order of shooting. That is, for example, the display control unit 106 causes the monitor 40 to display the reduced images 202 in the raster scan order in the order of shooting for each group.

なお、ステップＳ１０１において、画像保持部１１０は、モニタ４０に、クラスタリングした画像を表示させるようにしてもよい。 In step S101, the image holding unit 110 may display the clustered image on the monitor 40.

ここで、時刻ｔ1乃至ｔ12のそれぞれのタイミングにおいて撮影された画像ｐ1乃至ｐ12がクラスタリングの対象とされている場合を例に説明する。例えば、クラスタを規定する条件として条件Ａと条件Ｂが設定され、そのうちの条件Ａにより、画像ｐ1乃至ｐ12全体からなる１つのクラスタが規定される。ここで、条件Ａは粒度の低い（粗い）クラスタを規定する条件であり、条件Ｂは条件Ａより粒度の高い（細かい）クラスタを規定する条件である。例えば、条件Ａにより規定されたクラスタにはイベント名「結婚式」が設定される。 Here, a case will be described as an example where images p1 to p12 photographed at respective timings of times t1 to t12 are targeted for clustering. For example, conditions A and B are set as conditions for defining clusters, and one of the images p1 to p12 is defined by condition A among them. Here, the condition A is a condition for defining a cluster having a low particle size (coarse), and the condition B is a condition for defining a cluster having a particle size higher than that by the condition A (fine). For example, the event name “wedding” is set in the cluster defined by the condition A.

「結婚式」のイベント名が設定されているクラスタは、例えば、画像ｐ1乃至ｐ12のそれぞれの画像の撮影時刻の時間間隔のばらつきの程度が、ある閾値より小さいことなどから規定されたものである。 The cluster in which the event name “wedding” is set is defined, for example, because the degree of variation in the time interval of the shooting time of each of the images p1 to p12 is smaller than a certain threshold value. .

また、条件Ｂにより、画像ｐ1乃至ｐ12のうちの画像ｐ1乃至ｐ3から１つのクラスタが規定され、画像ｐ4乃至ｐ7から１つのクラスタが規定される。また、画像ｐ8乃至ｐ12から１つのクラスタが規定される。 Also, according to the condition B, one cluster is defined from the images p1 to p3 of the images p1 to p12, and one cluster is defined from the images p4 to p7. One cluster is defined from the images p8 to p12.

画像ｐ1乃至ｐ3からなるクラスタには「教会での挙式」、画像ｐ4乃至ｐ7からなるクラスタには「披露宴」、画像ｐ8乃至ｐ12からなるクラスタには「二次会」のイベント名がそれぞれ設定される。 “Church Ceremony” is set for the cluster composed of the images p1 to p3, “Reception” is set for the cluster composed of the images p4 to p7, and “Secondary Party” is set for the cluster composed of the images p8 to p12.

「教会での挙式」のイベント名が設定されているクラスタは、それを構成する画像ｐ1乃至ｐ3のそれぞれの撮影時刻の時間間隔のばらつきの程度が近いものであるのに対し、画像ｐ3と、次に（時間軸上で次に）撮影時刻の時間間隔のばらつきの程度が近い画像のまとまりである画像ｐ4乃至ｐ7のうちの最初の画像である画像ｐ4との時間間隔が比較的大きく、その部分で、撮影の頻度に変化があったと判断されたことから規定される。 In the cluster in which the event name “Church Ceremony” is set, the image p3 and the images p1 to p3 constituting the cluster have the same degree of variation in the time interval of the photographing time, whereas the image p3, Next (next on the time axis), the time interval with the image p4 which is the first image among the images p4 to p7 which is a group of images with a similar degree of variation in the time interval of the photographing time is relatively large. This is defined in part because it is determined that the frequency of shooting has changed.

また、「披露宴」のイベント名が設定されているクラスタは、それを構成する画像ｐ4乃至ｐ7のそれぞれの撮影時刻の時間間隔のばらつきの程度が近いものであるのに対し、画像ｐ7と、次に撮影時刻の時間間隔のばらつきの程度が近い画像のまとまりである画像ｐ8乃至ｐ12のうちの最初の画像である画像ｐ8との時間間隔が比較的大きく、その部分で、撮影の頻度に変化があったと判断されたことから規定される。 Also, in the cluster in which the event name “Reception Party” is set, the degree of variation in the time interval of the shooting time of each of the images p4 to p7 constituting the event is close, while the image p7 and the next In addition, the time interval with the image p8 which is the first image among the images p8 to p12, which is a group of images having a close variation in the time interval of the shooting time, is relatively large, and the frequency of shooting changes at that portion. It is stipulated because it was judged that there was.

「二次会」のイベント名が設定されているクラスタは、それを構成する画像ｐ8乃至ｐ12のそれぞれの撮影時刻の時間間隔のばらつきの程度が近いものであるのに対し、画像ｐ1 2と、次に撮影時刻の時間間隔のばらつきの程度が近い画像のまとまりのうちの最初の画像との時間間隔が比較的大きく、その部分で、撮影の頻度に変化があったと判断されたことから規定される。 In the cluster in which the event name “secondary party” is set, the images p8 to p12 constituting the cluster have the same degree of variation in the time interval of the shooting time, whereas the image p12 and the next This is defined by the fact that the time interval with the first image of a group of images with similar variations in the time interval of the shooting time is relatively large, and that it has been determined that the shooting frequency has changed in that portion.

なお、「結婚式」、「教会での挙式」、「披露宴」、「二次会」のそれぞれのイベント名は、例えば、ユーザにより手動で設定される。 The event names of “wedding”, “ceremony at church”, “banquet”, and “second party” are manually set by the user, for example.

このように、同じ対象の画像をクラスタリングする条件として複数の条件が設定され、それぞれの条件に基づいて、異なる粒度のクラスタが規定される。 As described above, a plurality of conditions are set as conditions for clustering the same target image, and clusters having different granularities are defined based on the respective conditions.

以上のようにして規定されたそれぞれのクラスタに含まれる画像は、階層構造を有する形でユーザに提示される。 The images included in each cluster defined as described above are presented to the user in a form having a hierarchical structure.

また、ステップＳ１０１において、画像保持部１１０は、モニタ４０に、日付毎に表示領域を区分して、区分された領域の日付と画像の撮影された日付とが一致するように、所定の領域に縮小画像２０２を表示させるようにしてもよい。すなわち、ステップＳ１０１において、画像保持部１１０は、カレンダ表示によって、縮小画像２０２を表示させるようにしてもよい。 In step S101, the image holding unit 110 divides the display area for each date on the monitor 40, and sets the date of the divided area to the predetermined area so that the date when the image is captured matches. The reduced image 202 may be displayed. That is, in step S101, the image holding unit 110 may display the reduced image 202 by calendar display.

ステップＳ１０２において、検索部１０７は、使用者の操作に応じた入力部４９からの信号を基に、モニタ４０に表示されている縮小画像２０２の中から、１つの縮小画像２０２を選択する。 In step S 102, the search unit 107 selects one reduced image 202 from the reduced images 202 displayed on the monitor 40 based on a signal from the input unit 49 according to the user's operation.

この場合、図２４で示されるように、時系列に表示された縮小画像２０２のいずれかが選択された場合、表示制御部１０６は、選択された縮小画像２０２をハイライト表示するか、選択された縮小画像２０２の縁を強調表示する。 In this case, as shown in FIG. 24, when any one of the reduced images 202 displayed in time series is selected, the display control unit 106 highlights or selects the selected reduced image 202. The edge of the reduced image 202 is highlighted.

また、この場合、図２５で示されるように、時系列に表示された縮小画像２０２のいずれかが選択された場合、表示制御部１０６は、選択された縮小画像２０２を拡大してモニタ４０に表示するようにしてもよい。 In this case, as shown in FIG. 25, when any one of the reduced images 202 displayed in time series is selected, the display control unit 106 enlarges the selected reduced image 202 and displays it on the monitor 40. You may make it display.

ステップＳ１０３において、検索部１０７は、類似する画像の検索の処理を実行する。 In step S103, the search unit 107 executes a process for searching for similar images.

図２６は、ステップＳ１０３に対応する、類似する画像の検索の処理の詳細を説明するフローチャートである。ステップＳ１３１において、検索部１０７は、使用者の操作に応じた入力部４９からの信号を取得することにより、モニタ４０に表示されたメニューの中の「類似検索」の項目の選択による類似検索の指示を取得する。 FIG. 26 is a flowchart for explaining the details of the similar image search processing corresponding to step S103. In step S131, the search unit 107 acquires a signal from the input unit 49 according to the user's operation, thereby performing a similar search by selecting an item of “similar search” in the menu displayed on the monitor 40. Get instructions.

ステップＳ１３２において、検索部１０７は、使用者の操作に応じた入力部４９からの信号を取得することにより、検索開始の指示を取得する。 In step S132, the search unit 107 acquires a search start instruction by acquiring a signal from the input unit 49 according to the user's operation.

ステップＳ１３３において、検索部１０７は、類似特徴データベース１１２から、ステップＳ１０２において選択された縮小画像２０２のコンテンツIDに対応する類似特徴ベクトルを読み込む。ここで、類似特徴ベクトルは、色特徴ベクトルCvであるか、または周波数成分ベクトルTvである。 In step S133, the search unit 107 reads a similar feature vector corresponding to the content ID of the reduced image 202 selected in step S102 from the similar feature database 112. Here, the similar feature vector is the color feature vector Cv or the frequency component vector Tv.

ステップＳ１３４において、検索部１０７は、類似特徴データベース１１２から、検索する範囲の１つの縮小画像２０２のコンテンツIDに対応する類似特徴ベクトルを読み込む。 In step S134, the search unit 107 reads a similar feature vector corresponding to the content ID of one reduced image 202 in the search range from the similar feature database 112.

この場合、ステップＳ１３３において色特徴ベクトルCvである類似特徴ベクトルが読み出された場合、ステップＳ１３４において、色特徴ベクトルCvである類似特徴ベクトルが読み出される。また、ステップＳ１３３において周波数成分ベクトルTvである類似特徴ベクトルが読み出された場合、ステップＳ１３４において、周波数成分ベクトルTvである類似特徴ベクトルが読み出される。 In this case, when the similar feature vector that is the color feature vector Cv is read in step S133, the similar feature vector that is the color feature vector Cv is read in step S134. When the similar feature vector that is the frequency component vector Tv is read in step S133, the similar feature vector that is the frequency component vector Tv is read in step S134.

ステップＳ１３５において、検索部１０７は、検索する範囲の縮小画像２０２の類似特徴ベクトルと選択された縮小画像２０２の類似特徴ベクトルとの距離を算出する。 In step S135, the search unit 107 calculates the distance between the similar feature vector of the reduced image 202 in the search range and the similar feature vector of the selected reduced image 202.

ここで、それぞれ、３２の要素を持つ色特徴ベクトルCv１={(c1_1,r1_1),・・・,(c32_1,r32_1)}と色特徴ベクトルCv2={(c1_2,r1_2),・・・,(c32_2,r32_2)}と距離を例に、距離の算出について説明する。 Here, the color feature vector Cv1 = {(c1_1, r1_1),..., (C32_1, r32_1)} and the color feature vector Cv2 = {(c1_2, r1_2),. c32_2, r32_2)} and distance as an example, the calculation of distance will be described.

まず、ground distance dij=d(c1i,c2j)という概念を導入する。ground distance dijは、色特徴ベクトルの要素の間の距離を表し、この例の場合、２つの色のユークリッド距離（Ｌ*ａ*ｂ*の３軸空間における距離）なので、dij=‖c1i−c2j‖と表される。 First, the concept of ground distance dij = d (c1i, c2j) is introduced. The ground distance dij represents the distance between the elements of the color feature vector. In this example, since the Euclidean distance between the two colors (distance in the triaxial space of L * a * b *), dij = ‖c1i−c2j It is expressed as ‖.

すると、色特徴ベクトルCv１と色特徴ベクトルCv2との間のEMD（Earth Movers Distance）は、それぞれ、色特徴ベクトルCv１を供給地、色特徴ベクトルCv2を需要地、dijを単位輸送コストに対応付けて、色特徴ベクトルCv１から色特徴ベクトルCv2へのフローF={Fji}を決定する輸送問題の解を用いて計算される。 Then, the EMD (Earth Movers Distance) between the color feature vector Cv1 and the color feature vector Cv2 associates the color feature vector Cv1 with the supply location, the color feature vector Cv2 with the demand location, and dij with the unit transportation cost, respectively. , Using the solution of the transport problem to determine the flow F = {Fji} from the color feature vector Cv1 to the color feature vector Cv2.

すなわち、EMDは、輸送問題の最適値（輸送コストの総計の最小値）をフローの数で割り算して正規化することにより、式（１）により求められる。

・・・（１）
このとき、

とされる。 That is, the EMD is obtained by the equation (1) by dividing the optimum value of the transportation problem (the minimum value of the total transportation cost) by the number of flows and normalizing it.

... (1)
At this time,

It is said.

式（１）により求められるEMDが、色特徴ベクトルCv１と色特徴ベクトルCv2との距離とされる。 The EMD obtained by Expression (1) is the distance between the color feature vector Cv1 and the color feature vector Cv2.

周波数成分ベクトルTvの距離は、色特徴ベクトルCvの距離と同様に求められる。 The distance of the frequency component vector Tv is obtained in the same manner as the distance of the color feature vector Cv.

なお、重みWcを色特徴ベクトルCvの距離に対して決めると共に、重みWtを周波数成分ベクトルTvの距離に対して決めて、式（２）から最終的な距離（distance）を求めるようにしてもよい。

・・・（２） Note that the weight Wc is determined with respect to the distance of the color feature vector Cv, and the weight Wt is determined with respect to the distance of the frequency component vector Tv, so that the final distance is obtained from the equation (2). Good.

... (2)

使用者が重みWcおよび重みWtを決めるようにしても、重みWcおよび重みWtを固定としてもよい。例えば、より具体的には、重みWcおよび重みWtをそれぞれ0.5として、最終的な距離を、色特徴ベクトルCvの距離と周波数成分ベクトルTvの距離の平均とするようにしてもよい。 Even if the user determines the weight Wc and the weight Wt, the weight Wc and the weight Wt may be fixed. For example, more specifically, the weight Wc and the weight Wt may be set to 0.5, respectively, and the final distance may be an average of the distance of the color feature vector Cv and the distance of the frequency component vector Tv.

なお、ベクトルの距離計算に、Y. Rubner, C. Tomasi, and L. J. Guibas. A Metric for Distributions with Applications to Image Databases. Proceedings of the 1998 IEEE International Conference on Computer Vision, Bombay, India, January 1998, pp. 59-66に記載されているEMD（Earth Movers Distance）を用いた例を説明したが、これに限らず、例えば、Euclidean distanceやHausdorff distanceのほか、小早川倫広、星守著、「ウェーブレット変換を用いた対話的類似画像検索システム」、「コンピュータサイエンス誌bit １２月号」、（1999年12月１日）、共立出版（株）発行、30頁乃至41頁や、呉君錫、金子邦彦、牧之内顕文、上野敦子著、「自己組織化特徴マップに基づいた類似画像検索システムの設計・実装と性能評価」、「電子情報通信学会技術研究報告 Vol.100 No.31」、（2000年５月２日）、（社）電子情報通信学会発行、９頁乃至16頁等の文献等に記載されているような手法を用いてもよい。 For vector distance calculation, Y. Rubner, C. Tomasi, and LJ Guibas.A Metric for Distributions with Applications to Image Databases.Proceedings of the 1998 IEEE International Conference on Computer Vision, Bombay, India, January 1998, pp. The example using EMD (Earth Movers Distance) described in 59-66 was explained, but not limited to this. "Interactive similar image search system used", "December issue of computer science magazine" (December 1, 1999), published by Kyoritsu Shuppan Co., Ltd., pages 30-41, Kuni Kim, Kunihiko Kaneko, Akifumi Makinouchi, Atsuko Ueno, “Design, Implementation and Performance Evaluation of Similar Image Retrieval System Based on Self-Organizing Feature Map”, “Technical Report of IEICE Vol.100 No.31”, (May 2000) 2), The Institute of Electronics, Information and Communication Engineers Line, may be used techniques such as those described in the literature such as page 9 to page 16.

ステップＳ１３６において、検索部１０７は、検索する範囲の画像に関係付けて、距離を類似結果データベース１１３に格納する。例えば、ステップＳ１３６において、検索部１０７は、検索する範囲の画像のコンテンツIDと共に距離を類似結果データベース１１３に格納する。 In step S 136, the search unit 107 stores the distance in the similarity result database 113 in association with the image in the search range. For example, in step S136, the search unit 107 stores the distance in the similarity result database 113 together with the content ID of the image in the range to be searched.

図２７は、コンテンツデータベース１１１および類似特徴データベース１１２に格納されているメタデータ並びに類似結果データベース１１３に格納されている距離の構造を示す図である。 FIG. 27 is a diagram illustrating the metadata stored in the content database 111 and the similar feature database 112 and the structure of the distance stored in the similar result database 113.

図２７において、データベースレコード３０１−１は、コンテンツアイテム２８１−１およびコンテンツアイテム２８１−１に対応し、データベースレコード３０１−２は、コンテンツアイテム２８１−２およびコンテンツアイテム２８１−２に対応する。 In FIG. 27, the database record 301-1 corresponds to the content item 281-1 and the content item 281-1, and the database record 301-2 corresponds to the content item 281-2 and the content item 281-2.

すなわち、データベースレコード３０１−１およびータベースレコード３０１−２は、それぞれ、コンテンツID、類似特徴ベクトル、本画像２０１のパス名およびファイル名、グループID、撮影時刻、およびその他のプロパティからなる。 That is, the database record 301-1 and the database record 301-2 each include a content ID, a similar feature vector, a path name and file name of the main image 201, a group ID, a shooting time, and other properties.

距離レコード３０２は、類似結果データベース１１３に格納され、コンテンツIDと選択された画像からの距離とからなる。距離レコード３０２は、コンテンツIDによって、データベースレコード３０１−１およびータベースレコード３０１−２に関係付けられる。 The distance record 302 is stored in the similarity result database 113 and includes a content ID and a distance from the selected image. The distance record 302 is related to the database record 301-1 and the database record 301-2 by the content ID.

以下、データベースレコード３０１−１およびータベースレコード３０１−２を個々に区別する必要がない場合、単に、データベースレコード３０１と称する。 Hereinafter, when it is not necessary to distinguish the database record 301-1 and the database record 301-2 from each other, they are simply referred to as the database record 301.

距離レコード３０２における距離は、distanceであるプロパティとされる。 The distance in the distance record 302 is a property that is distance.

また、時間グループレコード３０３は、時間グループデータベース１１４に格納され、グループに固有の（グループを特定するための）グループIDと、グループIDで特定されるグループに属する画像を特定するコンテンツIDの配列とからなる。時間グループレコード３０３におけるコンテンツIDの配列は、PhotoIdArrayであるプロパティとされる。 The time group record 303 is stored in the time group database 114, and includes a group ID unique to the group (for specifying the group) and an array of content IDs for specifying images belonging to the group specified by the group ID. Consists of. An array of content IDs in the time group record 303 is a property that is PhotoIdArray.

図２８で示されるように、コンテンツデータベース１１１、類似結果データベース１１３、および時間グループデータベース１１４のそれぞれのレコードが関係付けられる。コンテンツデータベース１１１および類似特徴データベース１１２（図示せず）には、１または複数のデータベースレコード３０１が格納され、類似結果データベース１１３には、１または複数の距離レコード３０２が格納され、時間グループデータベース１１４には、１または複数の時間グループレコード３０３が格納される。 As shown in FIG. 28, the records of the content database 111, the similarity result database 113, and the time group database 114 are related to each other. The content database 111 and the similar feature database 112 (not shown) store one or a plurality of database records 301, the similarity result database 113 stores one or a plurality of distance records 302, and the time group database 114 stores them. Stores one or more time group records 303.

図２６に戻り、ステップＳ１３７において、検索部１０７は、検索する範囲の全ての画像について処理を終了したか否かを判定し、処理を終了していないと判定された場合、ステップＳ１３４に戻り、類似特徴データベース１１２から、検索する範囲の次の縮小画像２０２のコンテンツIDに対応する類似特徴ベクトルを読み込んで、上述した処理を繰り返す。 Returning to FIG. 26, in step S137, the search unit 107 determines whether or not the processing has been completed for all the images in the search range. If it is determined that the processing has not been completed, the processing returns to step S134. The similar feature vector corresponding to the content ID of the next reduced image 202 in the search range is read from the similar feature database 112, and the above-described processing is repeated.

ステップＳ１３７において、処理を終了したと判定された場合、ステップＳ１３８に進み、検索部１０７は、類似特徴データベース１１２から、検索する範囲の画像に関係付けられた距離を読み出す。例えば、ステップＳ１３８において、検索部１０７は、類似特徴データベース１１２から、検索する範囲の画像を特定するコンテンツIDと共に、距離を読み出す。 If it is determined in step S137 that the processing has been completed, the process proceeds to step S138, and the search unit 107 reads the distance associated with the image in the range to be searched from the similar feature database 112. For example, in step S138, the search unit 107 reads the distance from the similar feature database 112 together with the content ID that specifies the image in the search range.

ステップＳ１３９において、検索部１０７は、ステップＳ１３８で読み出した距離で、検索する範囲の画像を類似順にソートし、処理は終了する。例えば、ステップＳ１３９において、検索部１０７は、距離の順に、検索する範囲の画像を特定するコンテンツIDをソートすることで、検索する範囲の画像を類似順にソートする。 In step S139, the search unit 107 sorts the images in the search range in the order of similarity based on the distance read in step S138, and the process ends. For example, in step S139, the search unit 107 sorts the images in the search range in the order of similarity by sorting the content IDs that specify the images in the search range in the order of distance.

図２３に戻り、ステップＳ１０４において、表示制御部１０６は、モニタ４０に、類似の順に縮小画像２０２を表示させる。すなわち、ステップＳ１０４において、表示制御部１０６は、画像保持部１１０から縮小画像２０２を読み出して、ステップＳ１３９においてソートされた類似の順に縮小画像２０２をモニタ４０に表示させる。 Returning to FIG. 23, in step S104, the display control unit 106 causes the monitor 40 to display the reduced images 202 in the similar order. That is, in step S104, the display control unit 106 reads the reduced image 202 from the image holding unit 110, and causes the monitor 40 to display the reduced image 202 in the similar order sorted in step S139.

例えば、図２９で示されるように、表示制御部１０６は、モニタ４０に、ステップＳ１０２で選択された縮小画像２０２に類似する縮小画像２０２を、類似の順に表示させる。例えば、表示制御部１０６は、モニタ４０の表示領域の左上にステップＳ１０２で選択された縮小画像２０２（図２９中のキー画像）を表示させ、その右側の領域に、キー画像に類似する縮小画像２０２を類似する順でラスタスキャン順に表示させる。図２９の右側における四角は、１つの縮小画像２０２を示し、四角の中のアルファベットは、類似する順を示す。 For example, as illustrated in FIG. 29, the display control unit 106 causes the monitor 40 to display the reduced images 202 similar to the reduced image 202 selected in step S102 in the similar order. For example, the display control unit 106 displays the reduced image 202 (key image in FIG. 29) selected in step S102 at the upper left of the display area of the monitor 40, and a reduced image similar to the key image in the right area thereof. 202 are displayed in a raster scan order in a similar order. A square on the right side of FIG. 29 shows one reduced image 202, and alphabets in the square indicate a similar order.

ステップＳ１０５において、検索部１０７は、使用者の操作に応じた入力部４９からの信号を基に、モニタ４０に表示されている縮小画像２０２の中から、１つの縮小画像２０２を選択する。 In step S 105, the search unit 107 selects one reduced image 202 from the reduced images 202 displayed on the monitor 40 based on a signal from the input unit 49 according to the user's operation.

例えば、図２９で示されるように、モニタ４０に、類似する順でラスタスキャン順に表示されている縮小画像２０２のうち、Ｂのアルファベットが付された縮小画像２０２が選択された場合、選択された縮小画像２０２をハイライト表示するか、または縁を強調表示すると共に、表示制御部１０６は、モニタ４０の表示領域のキー画像の下に、選択された縮小画像２０２を拡大して表示する。 For example, as shown in FIG. 29, when the reduced image 202 with the alphabet B is selected from the reduced images 202 displayed in the raster scan order in a similar order on the monitor 40, the selected image is selected. The reduced image 202 is highlighted or the edge is highlighted, and the display control unit 106 enlarges and displays the selected reduced image 202 below the key image in the display area of the monitor 40.

ステップＳ１０６において、検索部１０７は、使用者の操作に応じた入力部４９からの信号を基に、キャンセルするか否かを判定し、キャンセルしないと判定された場合、ステップＳ１０７に進み、さらに、決定するか否かを判定する。 In step S106, the search unit 107 determines whether or not to cancel based on the signal from the input unit 49 according to the user's operation. If it is determined not to cancel, the search unit 107 proceeds to step S107. It is determined whether or not to decide.

ステップＳ１０７において、決定すると判定された場合、ステップＳ１０８に進み、検索部１０７は、コンテンツデータベース１１１から、ステップＳ１０５の処理で、選択されている縮小画像２０２のグループIDを取得する。すなわち、検索部１０７は、コンテンツデータベース１１１から、ステップＳ１０５の処理で、選択されている縮小画像２０２のコンテンツIDで特定されるメタデータ２６１を読み出して、読み出したメタデータ２６１から、選択されている縮小画像２０２が属するグループを特定するグループIDを抽出することで、選択されている縮小画像２０２のグループIDを取得する。 In step S107, if it is determined to be determined, the process proceeds to step S108, and the search unit 107 acquires the group ID of the selected reduced image 202 from the content database 111 in step S105. That is, the search unit 107 reads out the metadata 261 specified by the content ID of the selected reduced image 202 from the content database 111 in the process of step S105, and is selected from the read metadata 261. By extracting the group ID that identifies the group to which the reduced image 202 belongs, the group ID of the selected reduced image 202 is acquired.

ステップＳ１０９において、検索部１０７は、取得したグループIDで特定されるグループに属する縮小画像２０２を画像保持部１１０から読み出す。より具体的には、検索部１０７は、取得したグループIDで、時間グループデータベース１１４の時間グループレコード３０３を検索する。検索部１０７は、取得したグループIDと同じグループIDを含む時間グループレコード３０３から、グループIDで特定されるグループに属する画像を特定するコンテンツIDの配列を時間グループデータベース１１４から読み出す。そして、検索部１０７は、読み出したコンテンツIDの配列の要素であるコンテンツIDで特定される縮小画像２０２を画像保持部１１０から読み出す。検索部１０７は、読み出した縮小画像２０２を表示制御部１０６に供給する。 In step S 109, the search unit 107 reads out the reduced image 202 belonging to the group specified by the acquired group ID from the image holding unit 110. More specifically, the search unit 107 searches the time group record 303 in the time group database 114 with the acquired group ID. The search unit 107 reads from the time group database 114 an array of content IDs that specify images belonging to the group specified by the group ID from the time group record 303 including the same group ID as the acquired group ID. Then, the search unit 107 reads the reduced image 202 specified by the content ID that is an element of the read content ID array from the image holding unit 110. The search unit 107 supplies the read reduced image 202 to the display control unit 106.

ステップＳ１１０において、表示制御部１０６は、モニタ４０に、読み出した縮小画像２０２を、時系列に表示させ、処理は終了する。 In step S110, the display control unit 106 causes the monitor 40 to display the read reduced image 202 in time series, and the process ends.

なお、ステップＳ１１０において、表示制御部１０６は、モニタ４０に、クラスタリングした画像を表示させるようにしてもよく、また、カレンダ表示によって、縮小画像２０２を表示させるようにしてもよい。 In step S110, the display control unit 106 may display the clustered image on the monitor 40, or may display the reduced image 202 by calendar display.

ステップＳ１０７において、決定すると判定された場合、ステップＳ１０４に戻り、上述した処理を繰り返す。 If it is determined in step S107 that the determination is to be made, the process returns to step S104 and the above-described processing is repeated.

ステップＳ１０６において、キャンセルすると判定された場合、ステップＳ１０１に戻り、上述した処理を繰り返す。 If it is determined in step S106 to cancel, the process returns to step S101 and the above-described processing is repeated.

なお、ステップＳ１０１乃至ステップＳ１１０の処理において、ステップＳ１０２またはステップＳ１０５において、次の画像が選択されるまで、画像の選択の状態は維持される。ステップＳ１０１、ステップＳ１０４、またはステップＳ１１０において、画像が表示されると共に、選択されている画像の縁が強調して表示されるなど、使用者が選択されている画像を識別できるように、画像の選択が示される。 In the processing from step S101 to step S110, the image selection state is maintained until the next image is selected in step S102 or step S105. In step S101, step S104, or step S110, the image is displayed and the edge of the selected image is highlighted so that the user can identify the selected image. A selection is shown.

すなわち、画像の選択の状態を維持したまま、時系列の表示の状態と類似順の表示の状態との間で状態が遷移される。 That is, the state transitions between the time-series display state and the display state in the similar order while maintaining the image selection state.

このようにすることで、所定の画像に類似する画像が撮影された時刻に近い時刻に撮影された画像を即座に表示したり、所定の画像が撮影された時刻に近い時刻に撮影された画像に類似する画像を即座に表示したりすることができる。また、画像を、類似しているか、近い時刻に撮影されたかによって、順に画像を辿るように画像を検索することができる。 In this way, an image taken at a time close to the time when an image similar to the predetermined image is taken is immediately displayed, or an image taken at a time close to the time when the predetermined image is taken. An image similar to can be displayed immediately. Further, it is possible to search for an image so that the images are traced in order depending on whether the images are similar or taken at a close time.

表示画面の小さなデジタルスチルカメラ１１であっても、時間軸の検索と類似検索とを効果的に組み合わせることにより、人の記憶の支配的な要素である、画像の類似の概念と時間の概念とに応じた画像の検索と閲覧とが可能になる。 Even in the digital still camera 11 with a small display screen, by combining the search of the time axis and the similarity search effectively, the concept of similarity between images and the concept of time, which are dominant elements of human memory, It is possible to search and browse images according to the conditions.

また、類似を示す距離は、あくまでも統計的手法に基づく類似性を示すものであり、検索漏れが生じ、人の感覚からすれば似ていると捉えられる画像が検索されないこともあるが、このような検索漏れが生じたとしても、近接するイベントでの画像が一覧表示されるので、人の感覚からすれば似ていると捉えられる画像に到達することができるようになる。 In addition, the distance indicating similarity is merely a similarity based on a statistical method, search omissions occur, and images that are considered to be similar according to human sense may not be searched. Even if a search failure occurs, a list of images of events that are close to each other is displayed, so that it is possible to reach an image that can be regarded as similar from a human perspective.

また、花見の画像、花火の画像、バーベキューの画像など、毎年繰り返される行事や催し（イベント）の画像を、毎年、撮影している場合には、類似検索してから、時系列に瞬時に並び替えることができるので、年代順に同じような行事（イベント）の画像を表示することができ、記憶を思い起こすためのアルバムとして活用することができるようになる。 If images of events and events that are repeated every year, such as images of cherry blossoms, images of fireworks, and images of barbecue, are taken every year, similar images are searched and displayed in chronological order. Since it can be changed, images of similar events (events) can be displayed in chronological order, and can be used as an album for recalling memories.

なお、デジタルスチルカメラ１１は、図２３のフローチャートで示される処理で、本画像２０１を検索するようにしてもよい。 The digital still camera 11 may search for the main image 201 by the processing shown in the flowchart of FIG.

図２３のフローチャートの検索の処理によれば、例えば、図３０の上側に示されるように、まず、縮小画像２０２が、モニタ４０に、グループ毎に、時系列に表示される。例えば、時系列に表示されている縮小画像２０２のうち、Ａのアルファベットが付加された縮小画像２０２（キー画像）が選択されると、Ａのアルファベットが付加された縮小画像２０２の縁が強調して表示される。 According to the search processing in the flowchart of FIG. 23, for example, as shown in the upper side of FIG. 30, first, the reduced image 202 is displayed on the monitor 40 in time series for each group. For example, when the reduced image 202 (key image) to which the alphabet A is added is selected from the reduced images 202 displayed in time series, the edges of the reduced image 202 to which the alphabet A is added are emphasized. Displayed.

Ａのアルファベットが付加された縮小画像２０２（キー画像）が選択されて、類似する画像の検索の処理が実行されると、Ａのアルファベットが付加された縮小画像２０２に類似する縮小画像２０２が検索されて、類似する順にモニタ４０に表示させられる。 When the reduced image 202 (key image) to which the alphabet A is added is selected and a similar image search process is executed, the reduced image 202 similar to the reduced image 202 to which the alphabet A is added is searched. Then, they are displayed on the monitor 40 in a similar order.

この場合、モニタ４０には、Ａのアルファベットが付加された縮小画像２０２であるキー画像が拡大されて表示される。 In this case, the key image which is the reduced image 202 to which the alphabet A is added is enlarged and displayed on the monitor 40.

類似する順に表示されている縮小画像２０２のうち、Ｂのアルファベットが付加された縮小画像２０２が選択されると、モニタ４０には、Ｂのアルファベットが付加された縮小画像２０２であるキー画像が拡大されて表示される。 When the reduced image 202 to which the alphabet B is added is selected from the reduced images 202 displayed in a similar order, the key image that is the reduced image 202 to which the alphabet B is added is enlarged on the monitor 40. Displayed.

Ａのアルファベットが付加された縮小画像２０２に類似する縮小画像２０２が、類似する順にモニタ４０に表示させられている場合、キャンセルされると、時系列に縮小画像２０２を表示する状態に戻る。 When the reduced image 202 similar to the reduced image 202 to which the alphabet A is added is displayed on the monitor 40 in the order of similarity, when canceled, the state returns to the state of displaying the reduced image 202 in time series.

類似する順に表示されている縮小画像２０２のうち、Ｂのアルファベットが付加された縮小画像２０２が選択されて、決定キーが押下されると、Ｂのアルファベットが付加された縮小画像２０２が属するグループに属する縮小画像２０２が、モニタ４０に、時系列に表示される。この場合、Ｂのアルファベットが付加された縮小画像２０２の縁が強調して表示される。 When the reduced image 202 to which the alphabet B is added is selected from the reduced images 202 displayed in a similar order and the determination key is pressed, the reduced image 202 to which the alphabet B is added belongs to the group to which the alphabet belongs. The reduced image 202 to which it belongs is displayed on the monitor 40 in time series. In this case, the edge of the reduced image 202 to which the alphabet B is added is highlighted.

撮影された日付によって縮小画像２０２がグループ分けされている場合、モニタ４０には、Ｂのアルファベットが付加された縮小画像２０２が撮影された日付に近い日付の縮小画像２０２が、グループ毎に時系列で表示される。 When the reduced images 202 are grouped according to the shooting date, the reduced image 202 having a date close to the date when the reduced image 202 to which the alphabet B is added is recorded on the monitor 40 in time series. Is displayed.

次に、サーバ１３における検索の処理について説明する。図３１は、サーバ１３による検索の処理を説明するフローチャートである。ステップＳ１６１において、サーバ１３の表示制御部１３６は、ディスプレイである出力部７７に、時系列に本画像２０１を表示させる。すなわち、ステップＳ１６１において、画像保持部１４０は、記録している本画像２０１のうち、使用者の操作に応じた入力部７６からの信号に応じた所定の範囲の本画像２０１を表示制御部１３６に供給する。また、コンテンツデータベース１４１は、表示制御部１３６に供給された所定の範囲の本画像２０１のメタデータ２６１のうち、撮影時刻のメタデータを表示制御部１３６に供給する。そして、表示制御部１３６は、ディスプレイである出力部７７に、撮影時刻を基に、撮影された順の時系列に本画像２０１を表示させる。 Next, search processing in the server 13 will be described. FIG. 31 is a flowchart for explaining search processing by the server 13. In step S161, the display control unit 136 of the server 13 causes the output unit 77, which is a display, to display the main image 201 in time series. That is, in step S161, the image holding unit 140 displays the main image 201 in a predetermined range according to the signal from the input unit 76 according to the user's operation among the recorded main images 201. To supply. In addition, the content database 141 supplies the shooting control metadata to the display control unit 136 among the metadata 261 of the main image 201 within a predetermined range supplied to the display control unit 136. Then, the display control unit 136 causes the output unit 77, which is a display, to display the main image 201 in a time series in the order of shooting based on the shooting time.

例えば、図３２の右側に示されるように、表示制御部１３６は、ディスプレイである出力部７７に、撮影された順の時系列に本画像２０１を表示させる（時間軸表示される）。例えば、表示制御部１３６は、グループ毎に、撮影された順に本画像２０１をディスプレイである出力部７７に表示させる。 For example, as shown on the right side of FIG. 32, the display control unit 136 causes the output unit 77, which is a display, to display the main image 201 in time series in the order of shooting (displayed in time axis). For example, the display control unit 136 causes the output unit 77, which is a display, to display the main image 201 in the order in which the images were captured for each group.

ステップＳ１６２において、検索部１３７は、使用者の操作に応じた入力部７６からの信号を基に、ディスプレイである出力部７７に表示されている本画像２０１の中から、１つの本画像２０１を選択する。 In step S162, the search unit 137 selects one master image 201 from the master images 201 displayed on the output unit 77, which is a display, based on a signal from the input unit 76 according to the user's operation. select.

ステップＳ１６３において、検索部１３７は、類似する画像の検索の処理を実行する。ステップＳ１６３の類似する画像の検索の処理は、検索部１０７に代わり検索部１３７によって実行される点が異なるが、他の点は、図２６のフローチャートを参照して説明した処理と同様なのでその詳細な説明は省略する。 In step S163, the search unit 137 executes a process for searching for similar images. The similar image search process in step S163 is executed by the search unit 137 instead of the search unit 107, but the other points are the same as the process described with reference to the flowchart of FIG. The detailed explanation is omitted.

ステップＳ１６４において、表示制御部１３６は、ディスプレイである出力部７７に、類似の順に本画像２０１を表示させる。すなわち、ステップＳ１６４において、表示制御部１３６は、ソートされた類似の順に本画像２０１をディスプレイである出力部７７に表示させる。 In step S164, the display control unit 136 causes the output unit 77, which is a display, to display the main image 201 in a similar order. That is, in step S164, the display control unit 136 causes the output unit 77, which is a display, to display the main image 201 in the sorted similar order.

例えば、図３２の左側に示されるように、表示制御部１３６は、ディスプレイである出力部７７に、ステップＳ１６２で選択された本画像２０１に類似する本画像２０１を、類似の順に表示させる。 For example, as shown on the left side of FIG. 32, the display control unit 136 causes the output unit 77, which is a display, to display the main image 201 similar to the main image 201 selected in step S162 in the order of similarity.

ステップＳ１６５において、検索部１３７は、使用者の操作に応じた入力部４９からの信号を基に、ディスプレイである出力部７７に表示されている本画像２０１の中から、１つの本画像２０１を選択する。 In step S165, the search unit 137 selects one master image 201 from the master images 201 displayed on the output unit 77, which is a display, based on a signal from the input unit 49 according to the user's operation. select.

ステップＳ１６６において、検索部１３７は、使用者の操作に応じた入力部４９からの信号を基に、時系列に表示するか否かを判定する。例えば、検索部１３７は、ディスプレイである出力部７７に表示されている切換ボタン３５１または切換ボタン３５２のクリックに応じた、入力部７６からの信号を基に、時系列に表示するか否かを判定する。 In step S166, the search unit 137 determines whether to display in time series based on the signal from the input unit 49 according to the operation of the user. For example, the search unit 137 determines whether or not to display in time series based on a signal from the input unit 76 in response to a click on the switching button 351 or the switching button 352 displayed on the output unit 77 that is a display. judge.

例えば、ディスプレイである出力部７７に表示されている、時系列順の表示を指示する切換ボタン３５１がクリックされた場合、ステップＳ１６６において、時系列に表示すると判定されるので、時系列に表示すると判定されたとき、手続きは、ステップＳ１６７に進む。 For example, when the switch button 351 displayed on the output unit 77 that is a display and instructing display in time series order is clicked, in step S166, it is determined to display in time series. If so, the procedure proceeds to step S167.

ステップＳ１６７において、検索部１３７は、コンテンツデータベース１４１から、選択されている本画像２０１のグループIDを取得する。すなわち、検索部１３７は、コンテンツデータベース１４１から、選択されている本画像２０１のコンテンツIDで特定されるメタデータ２６１を読み出して、読み出したメタデータ２６１から、選択されている本画像２０１が属するグループを特定するグループIDを抽出することで、選択されている本画像２０１のグループIDを取得する。 In step S167, the search unit 137 acquires the group ID of the selected master image 201 from the content database 141. That is, the search unit 137 reads the metadata 261 specified by the content ID of the selected master image 201 from the content database 141, and the group to which the selected master image 201 belongs from the read metadata 261. The group ID of the selected master image 201 is acquired by extracting the group ID that identifies

ステップＳ１６８において、検索部１３７は、取得したグループIDで特定されるグループに属する本画像２０１を画像保持部１４０から読み出す。より具体的には、検索部１３７は、取得したグループIDで、時間グループデータベース１４４の時間グループレコード３０３を検索する。検索部１３７は、取得したグループIDと同じグループIDを含む時間グループレコード３０３から、グループIDで特定されるグループに属する画像を特定するコンテンツIDの配列を時間グループデータベース１４４から読み出す。そして、検索部１３７は、読み出したコンテンツIDの配列の要素であるコンテンツIDで特定される本画像２０１を画像保持部１４０から読み出す。検索部１３７は、読み出した本画像２０１を表示制御部１３６に供給する。 In step S168, the search unit 137 reads the main image 201 belonging to the group specified by the acquired group ID from the image holding unit 140. More specifically, the search unit 137 searches the time group record 303 in the time group database 144 with the acquired group ID. The search unit 137 reads from the time group database 144 an array of content IDs that specify images belonging to the group specified by the group ID from the time group record 303 including the same group ID as the acquired group ID. Then, the search unit 137 reads the main image 201 specified by the content ID that is an element of the read content ID array from the image holding unit 140. The search unit 137 supplies the read main image 201 to the display control unit 136.

ステップＳ１６９において、表示制御部１３６は、ディスプレイである出力部７７に、読み出した本画像２０１を、時系列に表示させる。例えば、ステップＳ１６９において、表示制御部１３６は、ディスプレイである出力部７７に、読み出した本画像２０１を、グループ毎に、時系列に表示させる。 In step S169, the display control unit 136 causes the output unit 77, which is a display, to display the read main image 201 in time series. For example, in step S169, the display control unit 136 causes the output unit 77, which is a display, to display the read main image 201 in time series for each group.

ステップＳ１７０において、検索部１３７は、使用者の操作に応じた入力部７６からの信号を基に、ディスプレイである出力部７７に表示されている本画像２０１の中から、１つの本画像２０１を選択する。 In step S170, the search unit 137 selects one master image 201 from the master images 201 displayed on the output unit 77, which is a display, based on a signal from the input unit 76 according to the user's operation. select.

ステップＳ１７１において、検索部１３７は、使用者の操作に応じた入力部４９からの信号を基に、時系列に表示するか否かを判定する。例えば、検索部１３７は、ディスプレイである出力部７７に表示されている切換ボタン３５１または切換ボタン３５２のクリックに応じた、入力部７６からの信号を基に、時系列に表示するか否かを判定する。 In step S171, the search unit 137 determines whether or not to display in time series based on the signal from the input unit 49 according to the user's operation. For example, the search unit 137 determines whether or not to display in time series based on a signal from the input unit 76 in response to a click on the switching button 351 or the switching button 352 displayed on the output unit 77 that is a display. judge.

例えば、ディスプレイである出力部７７に表示されている、類似順の表示を指示する切換ボタン３５２がクリックされた場合、ステップＳ１７１において、類似順に表示すると判定されるので、時系列に表示すると判定されたとき、手続きは、ステップＳ１６３に戻り、上述した処理を繰り返す。 For example, when the switching button 352 for instructing the display in the similar order displayed on the output unit 77, which is a display, is clicked, since it is determined in step S171 that the display is performed in the similar order, it is determined that the display is performed in time series. If so, the procedure returns to step S163 and repeats the above-described processing.

また、例えば、ディスプレイである出力部７７に表示されている、時系列の表示を指示する切換ボタン３５１がクリックされた場合、ステップＳ１７１において、類似順に表示しないと判定されるので、時系列に表示しないと判定されたとき、手続きは、ステップＳ１６７に戻り、上述した処理を繰り返す。 Further, for example, when the switch button 351 displayed on the output unit 77 that is a display and instructing the display in time series is clicked, it is determined in step S171 that the images are not displayed in the similar order. When it is determined not to do so, the procedure returns to step S167 and repeats the above-described processing.

ステップＳ１６６において、例えば、ディスプレイである出力部７７に表示されている、類似順の表示を指示する切換ボタン３５２がクリックされた場合、時系列に表示しないと判定されるので、手続きは、ステップＳ１６３に戻り、上述した処理を繰り返す。 In step S166, for example, when the switching button 352 displayed on the output unit 77 that is a display and instructing the display in the order of similarity is clicked, it is determined that the display is not performed in time series, so the procedure is step S163. Returning to FIG.

このように、例えば、ディスプレイである出力部７７に表示されている切換ボタン３５１または切換ボタン３５２のクリックに応じて、類似順の表示と時系列の表示とを任意に切り換えることができる。 Thus, for example, according to the click of the switching button 351 or the switching button 352 displayed on the output unit 77 that is a display, the display in the similar order and the time-series display can be arbitrarily switched.

次に、サーバ１３における関連度の抽出について説明する。 Next, the extraction of the degree of association in the server 13 will be described.

デジタルスチルカメラ１１、携帯電話機１２、およびサーバ１３は、画像の特徴として、色名とその色名に対する関連度を用いて、画像を検索する。サーバ１３は、画像の特徴の１つとして、画像から所定の色名に対する関連度を抽出する。 The digital still camera 11, the mobile phone 12, and the server 13 search for an image using the color name and the degree of association with the color name as image features. The server 13 extracts the degree of association with a predetermined color name from the image as one of the features of the image.

ここで、色名に対する関連度とは、ある画像が、特定の色名によって想起される度合いを意味する。換言すれば、関連度は、ある画像において、特定の色名であると想定できる色が含まれる割合を言う。 Here, the degree of association with the color name means the degree to which an image is recalled by a specific color name. In other words, the degree of association refers to a ratio in which a color that can be assumed to be a specific color name is included in a certain image.

ここで、色名は、例えば、赤、青、黄、白、黒、緑などである。 Here, the color names are, for example, red, blue, yellow, white, black, green, and the like.

図３３は、色名に対する関連度を抽出する色特徴抽出部１７２の構成の例を示すブロック図である。色特徴抽出部１７２は、画像入力部４０１、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、”黄”関連度抽出部４０４、および抽出特徴記録部４０５から構成される。 FIG. 33 is a block diagram illustrating an example of the configuration of the color feature extraction unit 172 that extracts the degree of association with color names. The color feature extraction unit 172 includes an image input unit 401, a “red” association degree extraction unit 402, a “blue” association degree extraction unit 403, a “yellow” association degree extraction unit 404, and an extraction feature recording unit 405.

なお、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、および”黄”関連度抽出部４０４は、一例であり、任意の色についての関連度を抽出する任意の数の関連度抽出部が設けられる。すなわち、関連度抽出部は、色名毎に用意される。 Note that the “red” relevance extraction unit 402, the “blue” relevance extraction unit 403, and the “yellow” relevance extraction unit 404 are examples, and an arbitrary number of relevances for extracting relevance for an arbitrary color. A degree extracting unit is provided. That is, the relevance degree extraction unit is prepared for each color name.

以下、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、および”黄”関連度抽出部４０４が設けられている場合を例に説明する。 Hereinafter, a case where a “red” association degree extraction unit 402, a “blue” association degree extraction unit 403, and a “yellow” association degree extraction unit 404 are provided will be described as an example.

画像入力部４０１は、画像保持部１４０から、関連度の抽出の対象となる本画像２０１を取得する。また、画像入力部４０１は、関連度抽出部対応保持部１４５から、色名と、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、または”黄”関連度抽出部４０４との対応を示す対応情報を取得する。 The image input unit 401 acquires, from the image holding unit 140, the main image 201 that is a target for extracting the degree of association. In addition, the image input unit 401 receives the color name, the “red” association degree extraction unit 402, the “blue” association degree extraction unit 403, or the “yellow” association degree extraction unit 404 from the association degree extraction unit correspondence holding unit 145. Correspondence information indicating the correspondence of.

図３４の例で示されるように、関連度抽出部対応保持部１４５に記録されている対応情報には、色名とその色名に対する関連度を抽出する”赤”関連度抽出部４０２、”青”関連度抽出部４０３、または”黄”関連度抽出部４０４のいずれかを特定する情報が配置されている。例えば、図３４に示される対応情報の例において、”赤”である色名と、”赤”関連度抽出部４０２との対応が示され、”青”である色名と、”青”関連度抽出部４０３との対応が示され、”黄”である色名と、”黄”関連度抽出部４０４との対応が示されている。 As shown in the example of FIG. 34, the association information recorded in the association degree extraction unit correspondence holding unit 145 includes a “red” association degree extraction unit 402 that extracts a color name and a degree of association with the color name. Information for specifying either the “blue” relevance extraction unit 403 or the “yellow” relevance extraction unit 404 is arranged. For example, in the example of the correspondence information shown in FIG. 34, the correspondence between the color name “red” and the “red” association degree extraction unit 402 is shown, and the color name “blue” and the association “blue”. The correspondence with the degree extraction unit 403 is shown, and the correspondence between the color name “yellow” and the “yellow” association degree extraction unit 404 is shown.

画像入力部４０１は、対応情報に基づいて、画像保持部１４０から取得した本画像２０１を、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、および”黄”関連度抽出部４０４に供給する。 Based on the correspondence information, the image input unit 401 converts the main image 201 acquired from the image holding unit 140 into a “red” association degree extraction unit 402, a “blue” association degree extraction unit 403, and a “yellow” association degree extraction unit. 404 is supplied.

”赤”関連度抽出部４０２は、画像入力部４０１から供給された本画像２０１から、本画像２０１が赤である色名によって想起される度合いを示す関連度を抽出する。”赤”関連度抽出部４０２は、本画像２０１から抽出した、赤である色名によって想起される度合いを示す関連度を、抽出特徴記録部４０５に供給する。 The “red” relevance degree extraction unit 402 extracts, from the main image 201 supplied from the image input unit 401, a relevance degree indicating the degree to which the main image 201 is recalled by a color name that is red. The “red” association degree extraction unit 402 supplies the extraction feature recording unit 405 with the association degree that is extracted from the main image 201 and indicates the degree that is recalled by the color name that is red.

”青”関連度抽出部４０３は、画像入力部４０１から供給された本画像２０１から、本画像２０１が青である色名によって想起される度合いを示す関連度を抽出する。”青”関連度抽出部４０３は、本画像２０１から抽出した、青である色名によって想起される度合いを示す関連度を、抽出特徴記録部４０５に供給する。 The “blue” relevance extraction unit 403 extracts, from the main image 201 supplied from the image input unit 401, a relevance indicating the degree to which the main image 201 is recalled by a blue color name. The “blue” association degree extraction unit 403 supplies the extraction feature recording unit 405 with the association degree that is extracted from the main image 201 and indicates the degree that is recalled by the blue color name.

”黄”関連度抽出部４０４は、画像入力部４０１から供給された本画像２０１から、本画像２０１が黄である色名によって想起される度合いを示す関連度を抽出する。”黄”関連度抽出部４０４は、本画像２０１から抽出した、黄である色名によって想起される度合いを示す関連度を、抽出特徴記録部４０５に供給する。 The “yellow” relevance degree extraction unit 404 extracts, from the main image 201 supplied from the image input unit 401, a relevance degree indicating the degree to which the main image 201 is recalled by a color name that is yellow. The “yellow” association degree extraction unit 404 supplies the extraction feature recording unit 405 with the association degree that is extracted from the main image 201 and indicates the degree that is recalled by the color name that is yellow.

抽出特徴記録部４０５は、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、および”黄”関連度抽出部４０４のそれぞれから供給された、赤である色名によって想起される度合いを示す関連度、青である色名によって想起される度合いを示す関連度、および黄である色名によって想起される度合いを示す関連度を、本画像２０１に関係付けて、抽出特徴保持部１４６に記録させる。 The extracted feature recording unit 405 is recalled by the color name that is red supplied from each of the “red” association degree extraction unit 402, the “blue” association degree extraction unit 403, and the “yellow” association degree extraction unit 404. The extracted feature storage unit associates the relevance level indicating the level, the relevance level indicating the level recalled by the blue color name, and the relevance level indicating the level recalled by the yellow color name with the main image 201. 146 to record.

例えば、この場合、図３５に示されるように、抽出特徴保持部１４６は、本画像２０１を特定するコンテンツIDと共に、赤である色名によって想起される度合いを示す関連度、青である色名によって想起される度合いを示す関連度、および黄である色名によって想起される度合いを示す関連度を記録する。 For example, in this case, as illustrated in FIG. 35, the extraction feature holding unit 146 has a content ID that identifies the master image 201 and a relevance level that indicates the degree recalled by the color name that is red, and a color name that is blue The degree of association indicating the degree recalled by the color name and the degree of association indicating the degree recalled by the color name being yellow are recorded.

なお、上述の例においては、画像保持部１４０に記録された本画像２０１が画像入力部４０１より入力される例を示したが、本画像２０１に限らず、縮小画像２０２または減色された画像２２１が入力される構成として、縮小画像２０２または減色された画像２２１を処理の対象とするようにしてもよい。また、画像の代わりに、上述した、各関連度を抽出しようとする画像に対応づけられた色ヒストグラムを画像入力部４０１から入力し、各関連度抽出部（例えば、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、および”黄”関連度抽出部４０４）においては該色ヒストグラムから各関連度を抽出する構成としてもよい。 In the above-described example, an example in which the main image 201 recorded in the image holding unit 140 is input from the image input unit 401 is shown. However, the image is not limited to the main image 201, and the reduced image 202 or the reduced color image 221 is displayed. May be input to the reduced image 202 or the color-reduced image 221. Further, instead of the image, the above-described color histogram associated with the image for which each degree of association is to be extracted is input from the image input unit 401, and each degree of association extraction unit (for example, “red” degree of association extraction unit) is input. 402, the “blue” relevance extraction unit 403, and the “yellow” relevance extraction unit 404) may extract each relevance from the color histogram.

図３５は、抽出特徴保持部１４６に記録される関連度の論理構造を示す図である。図３５に示される例において、抽出特徴保持部１４６は、０００であるコンテンツIDに対応させて、０００であるコンテンツIDで特定される本画像２０１から抽出された、０．８０である、赤である色名によって想起される度合いを示す関連度、０．００である、青である色名によって想起される度合いを示す関連度、および０．１０である黄である色名によって想起される度合いを示す関連度を記録する。また、抽出特徴保持部１４６は、００１であるコンテンツIDに対応させて、００１であるコンテンツIDで特定される本画像２０１から抽出された、０．００である、赤である色名によって想起される度合いを示す関連度、０．２５である、青である色名によって想起される度合いを示す関連度、および０．２０である黄である色名によって想起される度合いを示す関連度を記録する。さらに、抽出特徴保持部１４６は、００２であるコンテンツIDに対応させて、００２であるコンテンツIDで特定される本画像２０１から抽出された、０．１５である、赤である色名によって想起される度合いを示す関連度、０．０５である、青である色名によって想起される度合いを示す関連度、および０．００である黄である色名によって想起される度合いを示す関連度を記録する。 FIG. 35 is a diagram illustrating a logical structure of the degree of association recorded in the extracted feature holding unit 146. In the example shown in FIG. 35, the extraction feature holding unit 146 corresponds to the content ID “000” and is extracted from the main image 201 identified by the content ID “000”, which is 0.80 in red. Relevance indicating the degree recalled by a color name, 0.00, relevance indicating the degree recalled by a blue color name, and the degree recalled by a color name yellow of 0.10 Record the relevance indicating. In addition, the extracted feature storage unit 146 is recalled by a color name of red, which is 0.00, extracted from the main image 201 identified by the content ID of 001, corresponding to the content ID of 001. The degree of association indicating the degree of color, the degree of association indicating 0.25, the degree of recalling by the color name of blue, and the degree of association indicating the degree of recall of the color name being 0.20 are recorded. To do. Further, the extracted feature holding unit 146 is recalled by the color name of red, which is 0.15, extracted from the main image 201 identified by the content ID of 002, corresponding to the content ID of 002. Relevance indicating the degree to which the color name is 0.05, relevance indicating the degree that is recalled by the color name that is blue, and relevance indicating the degree that is recalled by the color name that is 0.00 that is yellow To do.

また、抽出特徴記録部４０５は、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、および”黄”関連度抽出部４０４のそれぞれから供給された、赤である色名によって想起される度合いを示す関連度、青である色名によって想起される度合いを示す関連度、および黄である色名によって想起される度合いを示す関連度を、メタデータ２６１として本画像２０１に関係付けて、類似特徴データベース１４２に記録させる。 The extracted feature recording unit 405 is recalled by the color name that is red supplied from each of the “red” association degree extraction unit 402, the “blue” association degree extraction unit 403, and the “yellow” association degree extraction unit 404. Are related to the main image 201 as metadata 261. The association degree indicating the degree recalled by the blue color name and the association degree indicating the degree recalled by the yellow color name are associated with the master image 201 as metadata 261. And recorded in the similar feature database 142.

なお、関連度は、EXIF方式のデータである本画像２０１の所定の領域に格納するようにしてもよい。 The degree of association may be stored in a predetermined area of the main image 201 that is EXIF data.

検索部１３７は、本画像２０１の特徴として、色名とその色名に対する関連度を用いて、本画像２０１を検索する。この場合、例えば、検索部１３７は、検索条件入力部４２１および条件照合部４２２から構成される。 The search unit 137 searches the main image 201 using the color name and the degree of association with the color name as a feature of the main image 201. In this case, for example, the search unit 137 includes a search condition input unit 421 and a condition matching unit 422.

検索条件入力部４２１は、使用者の操作に応じた入力部７６からの信号を基に、関連度についての検索の条件を入力する。検索条件入力部４２１は、関連度についての検索の条件を条件照合部４２２に供給する。 The search condition input unit 421 inputs a search condition for the degree of relevance based on a signal from the input unit 76 according to the user's operation. The search condition input unit 421 supplies the search condition for the degree of relevance to the condition matching unit 422.

条件照合部４２２は、検索条件入力部４２１から供給された検索の条件と、抽出特徴保持部１４６に記録されている関連度とを照合する。条件照合部４２２は、照合の結果、検索の条件を満たす関連度に対応するコンテンツIDを検索結果保持部１４７に格納する。 The condition collation unit 422 collates the search condition supplied from the search condition input unit 421 with the relevance recorded in the extracted feature holding unit 146. The condition matching unit 422 stores, in the search result holding unit 147, the content ID corresponding to the degree of relevance that satisfies the search condition as a result of the matching.

図３６は、ステップＳ４３に対応する、色特徴抽出の処理の詳細を説明するフローチャートである。ステップＳ２０１において、画像入力部４０１は、画像保持部１４０から、関連度の抽出の対象となる画像である本画像２０１を入力する。また、画像入力部４０１は、関連度抽出部対応保持部１４５から、対応情報を入力する。 FIG. 36 is a flowchart for explaining the details of the color feature extraction processing corresponding to step S43. In step S 201, the image input unit 401 inputs the main image 201, which is an image whose relevance is to be extracted, from the image holding unit 140. Further, the image input unit 401 inputs correspondence information from the association degree extraction unit correspondence holding unit 145.

ステップＳ２０２において、画像入力部４０１は、色名を入力する。ステップＳ２０３において、画像入力部４０１は、対応情報を基に、入力した色名に対応する、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、または”黄”関連度抽出部４０４のいずれかを特定する。 In step S202, the image input unit 401 inputs a color name. In step S203, the image input unit 401, based on the correspondence information, the “red” association degree extraction unit 402, the “blue” association degree extraction unit 403, or the “yellow” association degree extraction unit corresponding to the input color name. One of 404 is specified.

例えば、ステップＳ２０３において、画像入力部４０１は、ステップＳ２０２において、”赤”である色名が入力された場合、対応情報を基に”赤”関連度抽出部４０２を特定する。 For example, in step S203, when the color name “red” is input in step S202, the image input unit 401 specifies the “red” association degree extraction unit 402 based on the correspondence information.

画像入力部４０１は、特定された”赤”関連度抽出部４０２、”青”関連度抽出部４０３、または”黄”関連度抽出部４０４のいずれかに、入力した本画像２０１を供給する。 The image input unit 401 supplies the input main image 201 to any one of the identified “red” relevance extraction unit 402, “blue” relevance extraction unit 403, and “yellow” relevance extraction unit 404.

ステップＳ２０４において、ステップＳ２０３で特定された、”赤”関連度抽出部４０２、”青”関連度抽出部４０３、または”黄”関連度抽出部４０４のいずれかは、関連度抽出処理を実行する。関連度抽出処理の詳細は後述する。 In step S204, any one of the “red” relevance extraction unit 402, the “blue” relevance extraction unit 403, and the “yellow” relevance extraction unit 404 specified in step S203 executes the relevance extraction process. . Details of the association degree extraction process will be described later.

抽出された関連度は、抽出特徴記録部４０５に供給される。 The extracted degree of association is supplied to the extracted feature recording unit 405.

ステップＳ２０５において、抽出特徴記録部４０５は、関連度の抽出の対象となった本画像２０１に対応させて、抽出した関連度を色特徴ベクトルとして抽出特徴保持部１４６に記録させる。 In step S205, the extracted feature recording unit 405 causes the extracted feature holding unit 146 to record the extracted degree of association as a color feature vector in association with the main image 201 from which the degree of association is extracted.

ステップＳ２０６において、画像入力部４０１は、色名が終わりであるか否か、すなわち、全ての色名について本画像２０１から関連度を抽出したか否かを判定し、色名が終わりでないと判定された場合、まだ抽出していない色名についての関連度があるので、ステップＳ２０２に戻り、次の色名を入力して、上述した処理を繰り返す。 In step S206, the image input unit 401 determines whether or not the color name is the end, that is, whether or not all the color names have been extracted from the main image 201, and determines that the color name is not the end. If the color name has not been extracted, there is a degree of association with the color name that has not yet been extracted, so the process returns to step S202, the next color name is input, and the above-described processing is repeated.

ステップＳ２０６において、色名が終わりである、すなわち、全ての色名について本画像２０１から関連度を抽出したと判定された場合、処理は終了する。 If it is determined in step S206 that the color name is the end, that is, it is determined that the relevance level has been extracted from the main image 201 for all color names, the process ends.

図３７は、図３６のステップＳ２０４に対応する、ステップＳ２０３で”赤”関連度抽出部４０２が特定された場合の関連度抽出処理の詳細の例を説明するフローチャートである。 FIG. 37 is a flowchart for explaining an example of details of the relevance level extraction process when the “red” relevance level extraction unit 402 is identified in step S203, corresponding to step S204 of FIG.

ステップＳ２２１において、”赤”関連度抽出部４０２は、内蔵しているカウンタをクリアする。最初に実行されるステップＳ２２２において、”赤”関連度抽出部４０２は、本画像２０１の画素のうち、最初の画素の色、すなわち、画素値を入力する。ステップＳ２２３において、”赤”関連度抽出部４０２は、画素の色に対応する、色空間上の位置を計算する。 In step S221, the “red” association degree extraction unit 402 clears the built-in counter. In step S 222 that is first executed, the “red” association degree extraction unit 402 inputs the color of the first pixel among the pixels of the main image 201, that is, the pixel value. In step S223, the “red” association degree extraction unit 402 calculates a position in the color space corresponding to the color of the pixel.

ステップＳ２２４において、”赤”関連度抽出部４０２は、計算された色空間上の位置が、赤である色名に対応するサブ空間内であるか否かを判定する。 In step S224, the “red” association degree extraction unit 402 determines whether or not the calculated position on the color space is in the subspace corresponding to the color name that is red.

ここで、画素の色に対応して計算される、色空間上の位置について説明する。 Here, the position in the color space that is calculated corresponding to the color of the pixel will be described.

例えば、本画像２０１のそれぞれの画素の画素値は、RGBで表現される。この場合、画素値は、Rの値、Gの値、およびBの値からなる。RGBの色空間は、図３８で示されるように、R軸、G軸、およびB軸が相互に直交する空間である。１つの画素値によって、RGBの色空間上の１つの位置が決まる。 For example, the pixel value of each pixel of the main image 201 is expressed in RGB. In this case, the pixel value includes an R value, a G value, and a B value. The RGB color space is a space in which the R axis, the G axis, and the B axis are orthogonal to each other, as shown in FIG. One position in the RGB color space is determined by one pixel value.

RGBの色空間において、人間が所定の色名の色であると認識する色の位置を１つの領域で表現することは困難である（表現しづらい）。 In the RGB color space, it is difficult to represent the position of a color that a human recognizes as a color of a predetermined color name in one area (it is difficult to express).

そこで、Ｌ*ａ*ｂ*空間の位置で、画素の色を表すことを考える。Ｌ*ａ*ｂ*空間は、図３９で示されるように、相互に直交するＬ*軸、ａ*軸、およびｂ*軸で表現される。Ｌ*ａ*ｂ*空間において、Ｌ*軸方向の値であるＬ*が大きくなるに従って、輝度が高くなり、Ｌ*が小さくなるに従って、輝度が低くなる。Ｌ*が一定である場合、Ｌ*軸に近づくに従って、彩度が低くなる。 Therefore, consider representing the color of a pixel at a position in the L * a * b * space. As shown in FIG. 39, the L * a * b * space is expressed by the L * axis, the a * axis, and the b * axis that are orthogonal to each other. In the L * a * b * space, the luminance increases as L * that is the value in the L * axis direction increases, and the luminance decreases as L * decreases. When L * is constant, the saturation decreases as it approaches the L * axis.

１つの画素値によって、Ｌ*ａ*ｂ*空間上の１つの位置が決まる。 One pixel value determines one position in the L * a * b * space.

Ｌ*ａ*ｂ*空間においては、人間が所定の色名の色であると認識する色の位置が１つの領域で表現できる。人間が所定の色名の色であると認識する色の位置を含む領域をサブ空間と称する。サブ空間は、例えば、Ｌ*ａ*ｂ*空間において広がりをもった領域である。 In the L * a * b * space, the position of a color that a human recognizes as a color having a predetermined color name can be expressed by one area. A region including a color position that a human recognizes as a color having a predetermined color name is referred to as a subspace. The subspace is, for example, a region having a spread in the L * a * b * space.

まず、白および黒に対するサブ空間の例を説明する。 First, examples of subspaces for white and black will be described.

図４０は、白のサブ空間および黒のサブ空間の例を示す図である。白のサブ空間４４１は、楕円体の１つの軸がＬ*軸と一致する楕球であって、図形的中心がＬ*ａ*ｂ*空間の最も上の位置（Ｌ*軸上の最大値を示す位置）と一致する楕球の内側の空間と、Ｌ*ａ*ｂ*空間とが重なる空間である。白のサブ空間４４１は、彩度が低く、輝度の高い色を示す空間である。サブ空間４４１内の位置で示される色は、人間に白であると認識される。 FIG. 40 is a diagram illustrating an example of a white subspace and a black subspace. The white subspace 441 is an ellipse in which one axis of the ellipsoid coincides with the L * axis, and the graphic center is the uppermost position in the L * a * b * space (the maximum value on the L * axis). The space inside the ellipse that coincides with the L * a * b * space. The white sub-space 441 is a space showing a color with low saturation and high luminance. The color indicated by the position in the sub space 441 is recognized as white by humans.

黒のサブ空間４４２は、楕円体の１つの軸がＬ*軸と一致する楕球であって、図形的中心がＬ*ａ*ｂ*空間の最も下の位置（Ｌ*軸上の最小値を示す位置）と一致する楕球の内側の空間と、Ｌ*ａ*ｂ*空間とが重なる空間である。黒のサブ空間４４２は、彩度が低く、輝度の低い色を示す空間である。サブ空間４４２内の位置で示される色は、人間に黒であると認識される。 The black subspace 442 is an ellipsoid in which one axis of the ellipsoid coincides with the L * axis, and the graphic center is the lowest position in the L * a * b * space (the minimum value on the L * axis). The space inside the ellipse that coincides with the L * a * b * space. The black sub-space 442 is a space that shows a color with low saturation and low luminance. The color indicated by the position in the sub space 442 is recognized as black by humans.

次に、赤、黄、緑、および青に対するサブ空間の例を説明する。 Next, examples of subspaces for red, yellow, green, and blue will be described.

赤、黄、緑、および青は、有彩色なので、Ｌ*ａ*ｂ*空間から、図４１で示される彩度境界４６１の内側の領域、輝度下限境界４６２の下側の領域、および輝度上限境界４６３の上側の領域を除外する。彩度境界４６１の内側の領域は、彩度の低い色を示す。彩度境界４６１は、その内側の領域で示される色の彩度が低く、その色が人間には、赤、黄、緑、または青と認識されない位置に設けられる。 Since red, yellow, green, and blue are chromatic colors, from the L * a * b * space, the region inside the saturation boundary 461 shown in FIG. 41, the region below the luminance lower limit boundary 462, and the luminance upper limit The region above the boundary 463 is excluded. A region inside the saturation boundary 461 indicates a color with low saturation. The saturation boundary 461 is provided at a position where the saturation of the color indicated by the inner region is low and the color is not recognized by humans as red, yellow, green, or blue.

輝度下限境界４６２の下側の領域は、輝度の低い色を示す。輝度下限境界４６２は、その下側の領域で示される色の輝度が低く、その色が人間には、赤、黄、緑、または青と認識されない位置に設けられる。 The region below the lower luminance limit boundary 462 indicates a color with low luminance. The luminance lower limit boundary 462 is provided at a position where the luminance of the color indicated in the lower region is low and the color is not recognized by humans as red, yellow, green, or blue.

輝度上限境界４６３の上側の領域は、輝度の高い色を示す。輝度上限境界４６３は、その上側の領域で示される色の輝度が高く、その色が人間には、赤、黄、緑、または青と認識されない位置に設けられる。 A region above the luminance upper limit boundary 463 indicates a color with high luminance. The luminance upper limit boundary 463 is provided at a position where the luminance of the color indicated by the upper region is high and the color is not recognized by humans as red, yellow, green, or blue.

従って、Ｌ*ａ*ｂ*空間から、彩度境界４６１の内側の領域、輝度下限境界４６２の下側の領域、および輝度上限境界４６３の上側の領域を除外した空間は、その空間で示される色が、赤、黄、緑、または青などと人間に認識される位置からなることになる。 Accordingly, a space obtained by excluding the area inside the saturation boundary 461, the area below the luminance lower limit boundary 462, and the area above the luminance upper limit boundary 463 from the L * a * b * space is indicated by the space. The color consists of positions that are recognized by humans as red, yellow, green, or blue.

そして、Ｌ*ａ*ｂ*空間から、彩度境界４６１の内側の領域、輝度下限境界４６２の下側の領域、および輝度上限境界４６３の上側の領域を除外した空間が、図４２で示されるように、ａ*軸とｂ*軸とからなる平面に対して垂直であって、Ｌ*軸を中心とした放射状の境界で分割される。例えば、Ｌ*ａ*ｂ*空間をＬ*軸の上側から見た場合、緑のサブ空間４８１は、マイナス側のａ*軸の上側の境界と、マイナス側のａ*軸の下側の境界とで囲まれる、ａ*軸側の空間である。サブ空間４８１内の位置で示される色は、人間に緑であると認識される。 FIG. 42 shows a space obtained by excluding the area inside the saturation boundary 461, the area below the luminance lower limit boundary 462, and the area above the luminance upper limit boundary 463 from the L * a * b * space. As described above, it is perpendicular to the plane composed of the a * axis and the b * axis, and is divided at a radial boundary centered on the L * axis. For example, when the L * a * b * space is viewed from the upper side of the L * axis, the green subspace 481 includes the upper boundary of the negative a * axis and the lower boundary of the negative a * axis. The space on the a * axis side surrounded by The color indicated by the position in the sub space 481 is recognized as green by humans.

また、Ｌ*ａ*ｂ*空間をＬ*軸の上側から見た場合、青のサブ空間４８２は、マイナス側のｂ*軸の右側の境界と、マイナス側のｂ*軸の左側の境界とで囲まれる、ｂ*軸側の空間である。サブ空間４８２内の位置で示される色は、人間に青であると認識される。 Further, when the L * a * b * space is viewed from the upper side of the L * axis, the blue subspace 482 includes a boundary on the right side of the negative b * axis and a boundary on the left side of the negative b * axis. The space on the b * axis side surrounded by. The color indicated by the position in the subspace 482 is recognized by humans as blue.

同様に、例えば、Ｌ*ａ*ｂ*空間をＬ*軸の上側から見た場合、赤のサブ空間４８３は、プラス側のａ*軸の上側の境界と、プラス側のａ*軸の下側の境界とで囲まれる、ａ*軸側の空間である。サブ空間４８３内の位置で示される色は、人間に赤であると認識される。例えば、Ｌ*ａ*ｂ*空間をＬ*軸の上側から見た場合、黄のサブ空間４８４は、プラス側のｂ*軸の右側の境界と、プラス側のｂ*軸の左側の境界とで囲まれる、ｂ*軸側の空間である。サブ空間４８４内の位置で示される色は、人間に黄であると認識される。 Similarly, for example, when the L * a * b * space is viewed from the upper side of the L * axis, the red subspace 483 includes the upper boundary of the positive a * axis and the lower side of the positive a * axis. The space on the a * axis side surrounded by the boundary on the side. The color indicated by the position in the subspace 483 is recognized as red by humans. For example, when the L * a * b * space is viewed from the upper side of the L * axis, the yellow subspace 484 includes the right boundary of the positive b * axis and the left boundary of the positive b * axis. The space on the b * axis side surrounded by. The color indicated by the position in the subspace 484 is recognized by humans as yellow.

すなわち、ステップＳ２２３において、”赤”関連度抽出部４０２は、画素の色に対応する、Ｌ*ａ*ｂ*空間上の位置を計算する。そして、ステップＳ２２４において、”赤”関連度抽出部４０２は、計算されたＬ*ａ*ｂ*空間上の位置が、赤である色名に対応するサブ空間４８３内であるか否かを判定する。すなわち、ステップＳ２２４において、”赤”関連度抽出部４０２は、画素の色が人間に赤であると認識される色であるか否かを判定する。 That is, in step S223, the “red” association degree extraction unit 402 calculates a position in the L * a * b * space corresponding to the color of the pixel. In step S224, the “red” association degree extraction unit 402 determines whether the calculated position in the L * a * b * space is in the subspace 483 corresponding to the color name that is red. To do. That is, in step S224, the “red” relevance extraction unit 402 determines whether or not the pixel color is a color that is recognized by humans as red.

ステップＳ２２４において、計算されたＬ*ａ*ｂ*空間上の位置が、赤である色名に対応するサブ空間４８３内であると判定された場合、画素の色が人間に赤であると認識される色なので、ステップＳ２２５に進み、”赤”関連度抽出部４０２は、カウンタを１だけインクリメントし、手続きは、ステップＳ２２６に進む。 If it is determined in step S224 that the calculated position in the L * a * b * space is within the subspace 483 corresponding to the color name that is red, the pixel color is recognized as red by humans. In step S225, the “red” relevance extraction unit 402 increments the counter by 1, and the procedure proceeds to step S226.

ステップＳ２２４において、計算されたＬ*ａ*ｂ*空間上の位置が、赤である色名に対応するサブ空間４８３内でないと判定された場合、画素の色が人間に赤であると認識されない色なので、ステップＳ２２５をスキップして、カウンタをインクリメントしないで、手続きは、ステップＳ２２６に進む。 If it is determined in step S224 that the calculated position in the L * a * b * space is not within the subspace 483 corresponding to the color name that is red, the color of the pixel is not recognized by human beings as red. Since it is a color, skip step S225 and do not increment the counter, and the procedure proceeds to step S226.

ステップＳ２２６において、”赤”関連度抽出部４０２は、画素が終わりであるか否か、すなわち、本画像２０１の画素の全てについて処理を適用したか否かを判定し、画素が終わりでないと判定された場合、ステップＳ２２２に戻り、本画像２０１の画素のうち、次の画素の色、すなわち、次の画素の画素値を入力して、上述した処理を繰り返す。 In step S226, the “red” association degree extraction unit 402 determines whether or not the pixel is the end, that is, whether or not the process has been applied to all the pixels of the main image 201, and determines that the pixel is not the end. If YES in step S222, the color of the next pixel among the pixels of the main image 201, that is, the pixel value of the next pixel is input, and the above-described processing is repeated.

ステップＳ２２６において、画素が終わりである、すなわち、本画像２０１の画素の全てについて処理を適用したと判定された場合、ステップＳ２２７に進み、”赤”関連度抽出部４０２は、カウンタの数（値）を本画像２０１の画素の数で除算する。その結果、本画像２０１において、赤であると想定できる色が含まれる割合が求められることになる。 If it is determined in step S226 that the pixel is the end, that is, it is determined that the processing has been applied to all the pixels of the main image 201, the process proceeds to step S227, and the “red” relevance extraction unit 402 determines the number of counters (value ) Is divided by the number of pixels of the main image 201. As a result, in the main image 201, a ratio in which a color that can be assumed to be red is included is obtained.

ステップＳ２２８において、”赤”関連度抽出部４０２は、除算の結果を赤の関連度とし、関連度を抽出特徴記録部４０５に赤の関連度を供給して、処理は終了する。 In step S228, the “red” relevance degree extraction unit 402 sets the result of division as red relevance, supplies the relevance degree to the extracted feature recording unit 405, and ends the process.

なお、Ｌ*ａ*ｂ*空間におけるサブ空間を例に説明したが、Ｌ*ａ*ｂ*空間に限らず、所定の色名の色を１つの領域で表現される色空間を用いて、そのサブ空間を基に関連度を求めるようにしてもよい。 The subspace in the L * a * b * space has been described as an example. However, the color space is not limited to the L * a * b * space, and a color space in which a color of a predetermined color name is expressed by one region, The degree of association may be obtained based on the subspace.

図３７を参照して説明した関連度抽出処理においては、画素毎の色がサブ空間の内側であるか否かの２値判断を行ったが、サブ空間の中心に近いのか、それともサブ空間の境界に近いのか（境界ぎりぎりなのか）を関連度に反映させることも考えられる。 In the relevance extraction process described with reference to FIG. 37, a binary determination is made as to whether or not the color for each pixel is inside the subspace. It may be possible to reflect the degree of relevance whether it is close to the boundary (whether it is just the boundary).

次に、この場合の関連度抽出処理を説明する。 Next, the association degree extraction process in this case will be described.

図４３は、図３６のステップＳ２０４に対応する、ステップＳ２０３で”赤”関連度抽出部４０２が特定された場合の関連度抽出処理の詳細の他の例を説明するフローチャートである。ステップＳ２４１において、”赤”関連度抽出部４０２は、記憶している関連度をクリアする。最初に実行されるステップＳ２４２において、”赤”関連度抽出部４０２は、本画像２０１の画素のうち、最初の画素の色、すなわち、画素値を入力する。ステップＳ２４３において、”赤”関連度抽出部４０２は、画素の色に対応する、色空間上の位置を計算する。 FIG. 43 is a flowchart for explaining another example of the details of the relevance level extraction process when the “red” relevance level extraction unit 402 is identified in step S203, corresponding to step S204 of FIG. In step S241, the “red” association degree extraction unit 402 clears the stored association degree. In step S 242 that is first executed, the “red” association degree extraction unit 402 inputs the color of the first pixel among the pixels of the main image 201, that is, the pixel value. In step S243, the “red” association degree extraction unit 402 calculates a position in the color space corresponding to the color of the pixel.

ステップＳ２２４において、”赤”関連度抽出部４０２は、計算された色空間上の位置について、色名に対応するサブ空間に属する確信度を算出する。すなわち、ステップＳ２２４において、”赤”関連度抽出部４０２は、計算された色空間上の位置について、赤である色名に対応するサブ空間４８３に属する確信度を算出する。 In step S224, the “red” association degree extraction unit 402 calculates a certainty factor belonging to the subspace corresponding to the color name for the calculated position in the color space. That is, in step S224, the “red” association degree extraction unit 402 calculates a certainty factor belonging to the subspace 483 corresponding to the color name that is red for the calculated position in the color space.

確信度は、サブ空間の中心に近いのか、それともサブ空間の境界に近いのかを示す、サブ空間の内側から外側に向かって１から０に連続的に変化する指標値である。 The certainty factor is an index value that continuously changes from 1 to 0 from the inner side to the outer side of the sub space, indicating whether it is close to the center of the sub space or the boundary of the sub space.

例えば、ステップＳ２２４において、”赤”関連度抽出部４０２は、計算された色空間上の位置がサブ空間４８３の中心により近い場合、１により近い確信度を算出し、計算された色空間上の位置がサブ空間４８３の境界により近い場合、０により近い確信度を算出する。 For example, in step S224, if the calculated position in the color space is closer to the center of the subspace 483, the “red” association degree extraction unit 402 calculates a certainty factor closer to 1, and calculates the calculated color space. When the position is closer to the boundary of the subspace 483, a certainty factor closer to 0 is calculated.

ステップＳ２４５において、”赤”関連度抽出部４０２は、関連度に確信度を加算する。ステップＳ２４６において、”赤”関連度抽出部４０２は、画素が終わりであるか否か、すなわち、本画像２０１の画素の全てについて処理を適用したか否かを判定し、画素が終わりでないと判定された場合、ステップＳ２４２に戻り、本画像２０１の画素のうち、次の画素の色、すなわち、次の画素の画素値を入力して、上述した処理を繰り返す。 In step S245, the “red” association degree extraction unit 402 adds the certainty factor to the association degree. In step S246, the “red” association degree extraction unit 402 determines whether or not the pixel is the end, that is, whether or not the process has been applied to all the pixels of the main image 201, and determines that the pixel is not the end. If YES in step S242, the color of the next pixel among the pixels of the main image 201, that is, the pixel value of the next pixel is input, and the above-described processing is repeated.

ステップＳ２２６において、画素が終わりである、すなわち、本画像２０１の画素の全てについて処理を適用したと判定された場合、関連度を抽出特徴記録部４０５に赤の関連度を供給して、処理は終了する。 If it is determined in step S226 that the pixel is the end, that is, it is determined that the process has been applied to all the pixels of the main image 201, the degree of association is supplied to the extraction feature recording unit 405, and the process is performed. finish.

確信度を基に関連度を算出した場合には、人の感覚により近い関連度を求めることができるようになる。特に、画像が、サブ空間の境界に近い色を多く含む場合であっても、より的確な関連度を求めることができる。 When the degree of relevance is calculated based on the certainty level, the degree of relevance closer to the human sense can be obtained. In particular, even when the image includes many colors close to the boundary of the subspace, a more accurate degree of association can be obtained.

図３７を参照して説明した関連度抽出処理におけるステップＳ２２４の処理は、画素の色が特定の色名の色と判定されるか否かの２クラス分類問題であり、種々のパターン認識の手法に置き換えることができる。 The process of step S224 in the relevance extraction process described with reference to FIG. 37 is a two-class classification problem as to whether or not a pixel color is determined to be a color of a specific color name, and various pattern recognition methods. Can be replaced.

図４４は、図３６のステップＳ２０４に対応する、ステップＳ２０３で”赤”関連度抽出部４０２が特定された場合の関連度抽出処理の詳細の他の例を説明するフローチャートである。ステップＳ２６１およびステップＳ２６２の処理は、それぞれ、図３７のステップＳ２２１およびステップＳ２２２の処理と同様なので、その説明は省略する。 FIG. 44 is a flowchart for explaining another example of the details of the relevance level extraction process corresponding to step S204 of FIG. 36 when the “red” relevance level extraction unit 402 is specified in step S203. Since the processing of step S261 and step S262 is the same as the processing of step S221 and step S222 of FIG. 37, respectively, description thereof is omitted.

ステップＳ２６３において、”赤”関連度抽出部４０２は、画素の色をパターン認識する。 In step S263, the “red” association degree extraction unit 402 recognizes the color of the pixel.

例えば、ステップＳ２６３において、”赤”関連度抽出部４０２は、ニューラルネットワークにより、画素の色をパターン認識する。ニューラルネットワークによるパターン認識は、例えば、鳥脇純一郎著、認識工学 −パターン認識とその応用−、コロナ社などに記載されている。 For example, in step S 263, the “red” association degree extraction unit 402 recognizes the color of the pixel using a neural network. Pattern recognition using a neural network is described in, for example, Junichiro Toriwaki, recognition engineering-pattern recognition and its application, Corona Co., etc.

パターン認識させる場合には、特定の色値（Ｌ*,ａ*,ｂ*）の色が特定の色名の色であるかどうかを示す判断データを予め人手により複数集めておき、集めた判断データを基に、ニューラルネットワークの学習を行い、識別に必要なパラメータを生成しておく。 In the case of pattern recognition, a plurality of judgment data indicating whether or not the color of a specific color value (L *, a *, b *) is a color of a specific color name is collected in advance by hand, and the collected judgment Based on the data, neural network learning is performed to generate parameters necessary for identification.

図４５は、青の色であるかどうかを示す判断データの例である。図４５の判断データの例は、例えば、０．０２であるＬ*、０．０４であるａ*、および０．１０であるｂ*で特定される色は、青ではなく、０．７２であるＬ*、０．００であるａ*、および０．１２であるｂ*で特定される色は、青であり、０．２８であるＬ*、−０．０２であるａ*、および０．１５であるｂ*で特定される色は、青ではないことを示す。 FIG. 45 is an example of determination data indicating whether the color is blue. In the example of the determination data in FIG. 45, for example, the color specified by L * being 0.02, a * being 0.04, and b * being 0.10 is not blue, but 0.72. The color specified by a certain L *, a * that is 0.00, and b * that is 0.12 is blue, L * that is 0.28, a * that is −0.02, and 0. .15 indicates that the color specified by b * is not blue.

ニューラルネットワークによれば、画素の色に対して、このように生成されたパラメータに従って特定の色名の色であるか否かが判定される。 According to the neural network, it is determined whether the color of a pixel is a color of a specific color name according to the parameters generated in this way.

なお、パターン認識の手法は、画素の色が、所定の色名の色であるか否かを判別できるものであればよく、SVM（Support Vector Machine）などいずれの手法であってもよい。 The pattern recognition method may be any method that can determine whether the color of a pixel is a color of a predetermined color name, and may be any method such as SVM (Support Vector Machine).

ステップＳ２６４において、”赤”関連度抽出部４０２は、認識の結果、画素の色が、赤に属するか否かを判定する。ステップＳ２２４において、画素の色が、赤に属すると判定された場合、ステップＳ２６５に進み、”赤”関連度抽出部４０２は、カウンタを１だけインクリメントし、手続きは、ステップＳ２６６に進む。 In step S264, the “red” association degree extraction unit 402 determines whether the pixel color belongs to red as a result of recognition. When it is determined in step S224 that the color of the pixel belongs to red, the process proceeds to step S265, the “red” relevance extraction unit 402 increments the counter by 1, and the procedure proceeds to step S266.

ステップＳ２６４において、画素の色が、赤に属しないと判定された場合、ステップＳ２６５をスキップして、カウンタをインクリメントしないで、手続きは、ステップＳ２６６に進む。 If it is determined in step S264 that the color of the pixel does not belong to red, step S265 is skipped and the procedure proceeds to step S266 without incrementing the counter.

ステップＳ２６６乃至ステップＳ２６８の処理は、それぞれ、図３７のステップＳ２２６乃至ステップＳ２２８の処理と同様なので、その説明は省略する。 Since the processing from step S266 to step S268 is the same as the processing from step S226 to step S228 in FIG. 37, description thereof will be omitted.

さらに、パターン認識の手法により、確信度を求めるようにしてもよい。 Further, the certainty factor may be obtained by a pattern recognition technique.

図４６は、図３６のステップＳ２０４に対応する、ステップＳ２０３で”赤”関連度抽出部４０２が特定された場合の関連度抽出処理の詳細の他の例を説明するフローチャートである。ステップＳ２８１の処理は、図４３のステップＳ２４１の処理と同様なので、その説明は省略する。ステップＳ２８２およびステップＳ２８３の処理は、それぞれ、図４４のステップＳ２６２およびステップＳ２６３の処理と同様なので、その説明は省略する。 FIG. 46 is a flowchart for explaining another example of the degree of association degree extraction process corresponding to step S204 of FIG. 36 when the “red” degree of association extraction unit 402 is identified in step S203. Since the process of step S281 is the same as the process of step S241 of FIG. 43, the description thereof is omitted. The processes in step S282 and step S283 are the same as the processes in step S262 and step S263 of FIG.

ステップＳ２８４において、”赤”関連度抽出部４０２は、認識の結果としての、色名に属すると判定する確信度を算出する。すなわち、ステップＳ２８４において、”赤”関連度抽出部４０２は、認識の結果としての、画素の色が赤に属すると判定する確信度を算出する。例えば、確信度として、ニューラルネットワークの出力層に入力される値を用いることができる。 In step S284, the “red” association degree extraction unit 402 calculates a certainty factor for determining that the color name belongs as a recognition result. That is, in step S284, the “red” association degree extraction unit 402 calculates a certainty factor for determining that the color of the pixel belongs to red as a result of recognition. For example, a value input to the output layer of the neural network can be used as the certainty factor.

ステップＳ２８５およびステップＳ２８６の処理は、それぞれ、図４３のステップＳ２４５およびステップＳ２４６の処理と同様なのでその説明は省略する。 Since the processing of step S285 and step S286 is the same as the processing of step S245 and step S246 of FIG. 43, the description thereof is omitted.

なお、図３６のステップＳ２０４に対応する、ステップＳ２０３で”青”関連度抽出部４０３が特定された場合、またはステップＳ２０３で”黄”関連度抽出部４０４が特定された場合の関連度抽出処理の詳細は、”赤”関連度抽出部４０２に代わり”青”関連度抽出部４０３または”黄”関連度抽出部４０４が処理を実行する点またはサブ空間などが異なるが、その他の点は、図３７、図４３、図４４、または図４６を参照して説明した処理と同様なので、その説明は省略する。 Note that, when “blue” relevance extraction unit 403 is identified in step S203, or when “yellow” relevance extraction unit 404 is identified in step S203, corresponding relevance extraction processing corresponding to step S204 in FIG. The details of are different in that the “blue” relevance degree extraction unit 403 or the “yellow” relevance degree extraction unit 404 executes processing or subspace instead of the “red” relevance degree extraction part 402, but other points are The processing is the same as that described with reference to FIG. 37, FIG. 43, FIG. 44, or FIG.

図４７は、検索の処理を説明するフローチャートである。ステップＳ３１１において、検索条件入力部４２１は、使用者の操作に応じた入力部７６からの信号を基に、関連度についての検索の条件を取得する。検索条件入力部４２１は、関連度についての検索の条件を条件照合部４２２に供給する。 FIG. 47 is a flowchart for explaining search processing. In step S 311, the search condition input unit 421 acquires a search condition for the degree of association based on a signal from the input unit 76 according to the user's operation. The search condition input unit 421 supplies the search condition for the degree of relevance to the condition matching unit 422.

例えば、図４８で示されるように、ディスプレイである出力部７７に、GUI（Graphical User Interface）の画像が表示される。図４８で示される例において、使用者の操作されるスライドバー４９１は、検索の条件である、色名毎の粒度（閾値）を指定する。色名に対応するチェックボックス４９２が使用者によってチェックされている場合、その色名のスライドバー４９１で指定された、その色名についての粒度が、検索条件としてステップＳ３１１において、取得される。 For example, as shown in FIG. 48, a GUI (Graphical User Interface) image is displayed on the output unit 77 which is a display. In the example shown in FIG. 48, the slide bar 491 operated by the user specifies the granularity (threshold value) for each color name, which is a search condition. When the check box 492 corresponding to the color name is checked by the user, the granularity for the color name specified by the slide bar 491 of the color name is acquired as a search condition in step S311.

例えば、黒のチェックボックス４９２、赤のチェックボックス４９２、緑のチェックボックス４９２がチェックされている場合、黒のスライドバー４９１で指定された、黒の粒度、赤のスライドバー４９１で指定された、赤の粒度、および緑のスライドバー４９１で指定された、緑の粒度が検索条件としてステップＳ３１１において、取得される。 For example, when a black check box 492, a red check box 492, and a green check box 492 are checked, the black granularity specified by the black slide bar 491, the red slide bar 491, In step S311, the red granularity and the green granularity designated by the green slide bar 491 are acquired as search conditions.

なお、AND検索ラジオボタン４９３がオンされている場合、スライドバー４９１で指定された、色名毎の粒度の論理積が最終的な検索条件とされ、OR検索ラジオボタン４９４がオンされている場合、スライドバー４９１で指定された、色名毎の粒度の論理和が最終的な検索条件とされる。 When the AND search radio button 493 is turned on, the logical product of the granularity for each color name specified by the slide bar 491 is the final search condition, and the OR search radio button 494 is turned on. The logical sum of the granularity for each color name designated by the slide bar 491 is the final search condition.

より具体的には、例えば、ステップＳ３１１において、検索条件入力部４２１は、（“赤”＞０．５）AND（“青”≧０．３）AND（“緑”＜０．１）などの、複数の色名に対する論理式で示される検索の条件を取得する。 More specifically, for example, in step S311, the search condition input unit 421 sets (“red”> 0.5) AND (“blue” ≧ 0.3) AND (“green” <0.1) or the like. The search condition indicated by the logical expression for a plurality of color names is acquired.

例えば、使用者は、青空の写った画像を検索したい場合、“青”≧０．３である検索の条件を入力し、ステップＳ３１１において、検索条件入力部４２１は、“青”≧０．３である検索の条件を取得する。 For example, when the user wants to search for an image showing a blue sky, the user inputs a search condition of “blue” ≧ 0.3. In step S311, the search condition input unit 421 reads “blue” ≧ 0.3. Get the search condition that is.

また、使用者は、例えば、イチゴ狩りの画像を検索したい場合には、（“赤”＞０．１）AND（“緑”≧０．３）である検索の条件を入力し、ステップＳ３１１において、検索条件入力部４２１は、（“赤”＞０．１）AND（“緑”≧０．３）である検索の条件を取得する。 For example, when the user wants to search for an image of strawberry picking, the user inputs a search condition of (“red”> 0.1) AND (“green” ≧ 0.3), and in step S311 The search condition input unit 421 acquires search conditions that are (“red”> 0.1) AND (“green” ≧ 0.3).

なお、検索の条件における、色の名前は、定義済み（関連度抽出部が用意されている）全ての色名である必要はなく、すなわち、検索の条件における、色の名前は、定義済みの色名の一部であってもよく、１つの色名であってもよい。 It should be noted that the color names in the search condition need not be all defined color names (relationship extraction units are prepared), that is, the color names in the search condition are defined. It may be a part of the color name or one color name.

また、色名毎に、直接数値を入力し、取得するようにしてもよい。 In addition, a numerical value may be directly input and acquired for each color name.

ステップＳ３１２において、条件照合部４２２は、抽出特徴保持部１４６から、検索の対象となる本画像２０１の色特徴ベクトルを取得する。 In step S 312, the condition matching unit 422 acquires the color feature vector of the main image 201 to be searched from the extracted feature holding unit 146.

ステップＳ３１３において、条件照合部４２２は、取得した色特徴ベクトルが検索の条件に一致するか否かを判定する。例えば、ステップＳ３１３において、条件照合部４２２は、取得した色特徴ベクトルのそれぞれの要素のうち、チェックされているチェックボックス４９２に対応する色名の要素と、スライドバー４９１で指定された、その色名についての粒度とが比較され、色特徴ベクトルの色名の要素が指定された粒度以上である場合、色特徴ベクトルが検索の条件に一致すると判定する。 In step S313, the condition matching unit 422 determines whether or not the acquired color feature vector matches the search condition. For example, in step S313, the condition matching unit 422 selects the color name element corresponding to the check box 492 that is checked and the color specified by the slide bar 491 from among the elements of the acquired color feature vector. The granularity of the name is compared, and if the element of the color name of the color feature vector is equal to or greater than the designated granularity, it is determined that the color feature vector matches the search condition.

また、例えば、色名毎の粒度の論理積が最終的な検索条件とされている場合、ステップＳ３１３において、条件照合部４２２は、チェックされているチェックボックス４９２に対応する色名の要素のすべてにおいて、色特徴ベクトルの色名の要素が指定された粒度以上である場合、色特徴ベクトルが検索の条件に一致すると判定する。例えば、色名毎の粒度の論理和が最終的な検索条件とされている場合、ステップＳ３１３において、条件照合部４２２は、チェックされているチェックボックス４９２に対応する色名の要素のいずれかにおいて、色特徴ベクトルの色名の要素が指定された粒度以上である場合、色特徴ベクトルが検索の条件に一致すると判定する。 For example, when the logical product of the granularity for each color name is the final search condition, in step S313, the condition matching unit 422 determines all the elements of the color name corresponding to the checked check box 492. When the element of the color name of the color feature vector is equal to or greater than the specified granularity, it is determined that the color feature vector matches the search condition. For example, when the logical sum of the granularity for each color name is the final search condition, in step S313, the condition matching unit 422 determines whether any of the color name elements corresponding to the check box 492 being checked. If the color name element of the color feature vector is equal to or greater than the specified granularity, it is determined that the color feature vector matches the search condition.

ステップＳ３１３において、取得した色特徴ベクトルが検索の条件に一致すると判定された場合、ステップＳ３１４に進み、条件照合部４２２は、検索結果保持部１４７に、ステップＳ３１２において取得した色特徴ベクトルに対応する本画像２０１を特定するコンテンツIDを追加して、ステップＳ３１５に進む。 If it is determined in step S313 that the acquired color feature vector matches the search condition, the process proceeds to step S314, and the condition matching unit 422 corresponds to the color feature vector acquired in step S312 in the search result holding unit 147. A content ID for specifying the main image 201 is added, and the process proceeds to step S315.

ステップＳ３１３において、取得した色特徴ベクトルが検索の条件に一致しないと判定された場合、ステップＳ３１４の処理はスキップされ、検索結果保持部１４７にコンテンツIDを追加しないで、ステップＳ３１５に進む。 If it is determined in step S313 that the acquired color feature vector does not match the search condition, the process of step S314 is skipped, and the process proceeds to step S315 without adding the content ID to the search result holding unit 147.

ステップＳ３１５において、検索条件入力部４２１は、画像が終わりであるか否か、すなわち、全ての画像について検索したか否かを判定し、画像が終わりでない、すなわち、まだ、全ての画像について検索していないと判定された場合、ステップＳ３１２に戻り、次の本画像２０１の色特徴ベクトルを取得して、上述した処理を繰り返す。 In step S315, the search condition input unit 421 determines whether or not the image is the end, that is, whether or not all the images have been searched, and the image is not the end, that is, has searched for all the images yet. If it is determined that it is not, the process returns to step S312 to acquire the color feature vector of the next main image 201, and the above-described processing is repeated.

ステップＳ３１５において、画像が終わりである、すなわち、全ての画像について検索したと判定された場合、処理は終了する。 If it is determined in step S315 that the image is the end, that is, all the images have been searched, the process ends.

この処理により、検索結果保持部１４７には、検索の条件を満たす本画像２０１を特定するコンテンツIDが格納されることになる。 As a result of this processing, the search result holding unit 147 stores the content ID that identifies the main image 201 that satisfies the search conditions.

図４９は、ディスプレイである出力部７７に表示される、検索結果保持部１４７に格納されたコンテンツIDで特定される本画像２０１の例を示す図である。例えば、緑のチェックボックス４９２がチェックされ、緑のスライドバー４９１で粒度が指定された場合、図４９の左上に示されるように、緑を多く含む本画像２０１が、ディスプレイである出力部７７に表示される。また、例えば、緑のチェックボックス４９２がチェックされ、緑のスライドバー４９１で粒度が指定され、赤のチェックボックス４９２がチェックされ、赤のスライドバー４９１で粒度が指定され、AND検索ラジオボタン４９３がオンされている場合、図４９の右上に示されるように、緑と赤を多く含む本画像２０１が、ディスプレイである出力部７７に表示される。 FIG. 49 is a diagram illustrating an example of the master image 201 identified by the content ID stored in the search result holding unit 147 and displayed on the output unit 77 serving as a display. For example, when the green check box 492 is checked and the granularity is designated by the green slide bar 491, as shown in the upper left of FIG. 49, the main image 201 containing a lot of green is displayed on the output unit 77 which is a display. Is displayed. Further, for example, the green check box 492 is checked, the granularity is designated by the green slide bar 491, the red check box 492 is checked, the granularity is designated by the red slide bar 491, and the AND search radio button 493 is selected. When turned on, as shown in the upper right of FIG. 49, the main image 201 containing a lot of green and red is displayed on the output unit 77 which is a display.

例えば、青のチェックボックス４９２がチェックされ、青のスライドバー４９１で粒度が指定された場合、図４９の左下に示されるように、青を多く含む本画像２０１が、ディスプレイである出力部７７に表示される。また、例えば、青のチェックボックス４９２がチェックされ、青のスライドバー４９１で粒度が指定され、白のチェックボックス４９２がチェックされ、白のスライドバー４９１で粒度が指定され、AND検索ラジオボタン４９３がオンされている場合、図４９の右下に示されるように、青と白を多く含む本画像２０１が、ディスプレイである出力部７７に表示される。 For example, when the blue check box 492 is checked and the granularity is specified by the blue slide bar 491, as shown in the lower left of FIG. 49, the main image 201 containing a lot of blue is displayed on the output unit 77 which is a display. Is displayed. Also, for example, the blue check box 492 is checked, the granularity is designated by the blue slide bar 491, the white check box 492 is checked, the granularity is designated by the white slide bar 491, and the AND search radio button 493 is selected. When turned on, as shown in the lower right of FIG. 49, the main image 201 including a large amount of blue and white is displayed on the output unit 77 that is a display.

使用者にとって、所望の画像がどのような色をどのくらい含んでいるかを推測することは容易であり、所望の画像を簡単に検索することができるようになる。 It is easy for the user to guess what color the desired image contains and how much the desired image can be retrieved.

さらに、検索の結果に応じて、条件を広げたり狭めたりするなどの任意の粒度に変更して、再度、検索することができる。これにより、さらに簡単に、所望の画像を検索することができる。 Furthermore, the search can be performed again by changing the granularity to an arbitrary granularity such as expanding or narrowing the conditions according to the search result. Thereby, a desired image can be retrieved more easily.

このように、使用者の持っている画像の色のイメージや雰囲気から直感的に画像を検索することができるようになる。 In this way, it is possible to intuitively search for an image from the color image or atmosphere of the image possessed by the user.

画像の全体の集合に対して様々な条件を組み合わせた検索の条件を決めることができるので、検索時に、任意の粒度で、画像である検索結果を取り出すことができる。 Since a search condition combining various conditions can be determined for the entire set of images, a search result that is an image can be taken out at an arbitrary granularity during the search.

画像について、関連度からなる色特徴ベクトルを予め抽出し、関連度との大小の比較または論理演算により画像を検索することができるので、迅速に画像を検索することができる。 A color feature vector composed of the degree of relevance is extracted in advance, and the image can be retrieved by comparing with the degree of relevance or by a logical operation, so that the image can be retrieved quickly.

関連度は、比較的桁数少ない数値で表現することができるので、色特徴ベクトルのデータ量は、より小さくすることができる。従って、色特徴ベクトルの記録に要する記録空間の容量は、比較的小さなもので足りる。 Since the degree of association can be expressed by a numerical value having a relatively small number of digits, the data amount of the color feature vector can be further reduced. Therefore, the capacity of the recording space required for recording the color feature vector may be relatively small.

なお、機器の例として、デジタルスチルカメラ１１および携帯電話機１２を挙げたが、これに限らず、機器は画像を取り扱うものであればよく、携帯型のプレーヤまたはビュワーなどであってもよい。 In addition, although the digital still camera 11 and the mobile phone 12 were mentioned as an example of an apparatus, it is not restricted to this, The apparatus should just handle an image and may be a portable player or a viewer.

このように、画像のメタデータを記録するようにした場合には、機器において画像を検索することができる。また、機器において、画像を撮影し、画像に関係する情報を、画像に関係付けて、所定の構造のデータとして記録し、画像処理装置への画像の送信を制御し、画像処理装置において、機器から送信されてくる画像の受信を制御し、受信した画像の特徴を抽出し、画像から抽出した特徴を、画像に関係付けて、機器における構造と同じ構造のデータとして記録し、特徴の機器への送信を制御するようにした場合には、処理能力の比較的小さい機器において、簡単に、所望の画像を検索することができる。 As described above, when the metadata of the image is recorded, the image can be searched in the device. In addition, the device captures an image, records information related to the image as data of a predetermined structure in association with the image, and controls transmission of the image to the image processing device. Controls the reception of images transmitted from the camera, extracts the features of the received image, associates the features extracted from the image with the image, records them as data with the same structure as the device, and sends them to the device with the features When the transmission of the image is controlled, a desired image can be easily retrieved with a device having a relatively small processing capability.

また、画像のメタデータを記録するようにした場合には、機器において画像を検索することができる。また、画像の特徴を抽出し、画像から抽出した特徴を、画像に関係付けて、所定の構造のデータとして記録させ、構造と同じ構造のデータとして、画像に関係する情報を記録する機器に記録させる特徴の機器への送信を制御するようにした場合には、処理能力の比較的小さい機器において、簡単に、所望の画像を検索することができる。 Further, when image metadata is recorded, an image can be searched for in the device. Also, the features of the image are extracted, the features extracted from the image are related to the image and recorded as data of a predetermined structure, and the information related to the image is recorded as data having the same structure as the structure When transmission to a device having a characteristic to be controlled is controlled, a desired image can be easily searched for in a device having a relatively small processing capability.

上述した一連の処理は、ハードウエアにより実行させることもできるし、ソフトウエアにより実行させることもできる。一連の処理をソフトウエアにより実行させる場合には、そのソフトウエアを構成するプログラムが、専用のハードウエアに組み込まれているコンピュータ、または、各種のプログラムをインストールすることで、各種の機能を実行することが可能な、例えば汎用のパーソナルコンピュータなどに、プログラム記録媒体からインストールされる。 The series of processes described above can be executed by hardware or can be executed by software. When a series of processing is executed by software, a program constituting the software executes various functions by installing a computer incorporated in dedicated hardware or various programs. For example, it is installed from a program recording medium in a general-purpose personal computer or the like.

コンピュータにインストールされ、コンピュータによって実行可能な状態とされるプログラムを記録する記録媒体は、図２または図３に示すように、磁気ディスク（フレキシブルディスクを含む）、光ディスク（CD-ROM(Compact Disc-Read Only Memory),DVD(Digital Versatile Disc)を含む）、光磁気ディスクを含む）、もしくは半導体メモリなどよりなるパッケージメディアであるリムーバブルメディア８２、または、プログラムが一時的もしくは永続的に格納されるROM７２またはEEPROM４６や、記憶部７８を構成するハードディスクなどにより構成される。プログラム記録媒体へのプログラムの格納は、必要に応じてルータ、モデムなどのインタフェースである通信部４７、通信部４８、通信部７９、または通信部８０を介して、ローカルエリアネットワーク、インターネット、デジタル衛星放送といった、有線または無線の通信媒体を利用して行われる。 As shown in FIG. 2 or FIG. 3, a recording medium for recording a program that is installed in a computer and can be executed by the computer is a magnetic disk (including a flexible disk), an optical disk (CD-ROM (Compact Disc- (Including Read Only Memory), DVD (Digital Versatile Disc), magneto-optical disk), or removable media 82 which is a package medium made of semiconductor memory or the like, or ROM 72 in which a program is temporarily or permanently stored Or it is comprised by the hard disk etc. which comprise EEPROM46 and the memory | storage part 78. FIG. The program is stored in the program recording medium via a local area network, the Internet, a digital satellite via the communication unit 47, the communication unit 48, the communication unit 79, or the communication unit 80, which is an interface such as a router or a modem, as necessary It is performed using a wired or wireless communication medium such as broadcasting.

なお、本明細書において、プログラム記録媒体に格納されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。 In the present specification, the step of describing the program stored in the program recording medium is not limited to the processing performed in time series in the order described, but is not necessarily performed in time series. Or the process performed separately is also included.

また、本明細書において、システムとは、複数の装置により構成される装置全体を表すものである。 Further, in this specification, the system represents the entire apparatus constituted by a plurality of apparatuses.

なお、本発明の実施の形態は、上述した実施の形態に限定されるものではなく、本発明の要旨を逸脱しない範囲において種々の変更が可能である。 The embodiment of the present invention is not limited to the above-described embodiment, and various modifications can be made without departing from the gist of the present invention.

１１デジタルスチルカメラ，１２携帯電話機，１３サーバ，１４ネットワーク，３１撮影レンズ，３２絞り，３３撮像デバイス，３４アナログ信号処理部，３５ A/Dコンバータ，３６デジタル信号処理部，３７ MPU，３８メモリ，４０モニタ，４１圧縮伸張部，４３メモリカード，４６ EEPROM，４７通信部，４８通信部，４９入力部，７１ＣＰＵ，７２ＲＯＭ，７３ＲＡＭ，７６入力部，７７出力部，７８記憶部，７９通信部，８０通信部，８２リムーバブルメディア，１０１撮影制御部，１０２縮小画像生成部，１０３メタデータ生成部，１０４エントリ生成部，１０５記録制御部，１０６表示制御部，１０７検索部，１０８送信制御部，１０９受信制御部，１１０画像保持部，１１１コンテンツデータベース，１１２類似特徴データベース，１１３類似結果データベース，１１４時間グループデータベース，１１５検索結果保持部，１２１距離計算部，１３１画像解析部，１３２縮小画像生成部，１３３メタデータ生成部，１３４エントリ生成部，１３５記録制御部，１３６表示制御部，１３７検索部，１３８−１送信制御部，１３８−２送信制御部，１３９−１受信制御部，１３９−２受信制御部，１４０画像保持部，１４１コンテンツデータベース，１４２類似特徴データベース，１４３類似結果データベース，１４４時間グループデータベース，１４５関連度抽出部対応保持部，１４６抽出特徴保持部，１４７検索結果保持部，１５１距離計算部，１６１顔画像検出部，１６２類似特徴量抽出部，１７１類似特徴ベクトル算出部，１７２色特徴抽出部，２０１本画像，２０２縮小画像，２６１メタデータ，４０１画像入力部，４０２ ”赤”関連度抽出部，４０３ ”青”関連度抽出部，４０４ ”黄”関連度抽出部，４０５抽出特徴記録部，４２１検索条件入力部，４２２条件照合部 11 Digital Still Camera, 12 Mobile Phone, 13 Server, 14 Network, 31 Shooting Lens, 32 Aperture, 33 Imaging Device, 34 Analog Signal Processing Unit, 35 A / D Converter, 36 Digital Signal Processing Unit, 37 MPU, 38 Memory, 40 monitor, 41 compression / decompression unit, 43 memory card, 46 EEPROM, 47 communication unit, 48 communication unit, 49 input unit, 71 CPU, 72 ROM, 73 RAM, 76 input unit, 77 output unit, 78 storage unit, 79 communication Unit, 80 communication unit, 82 removable media, 101 shooting control unit, 102 reduced image generation unit, 103 metadata generation unit, 104 entry generation unit, 105 recording control unit, 106 display control unit, 107 search unit, 108 transmission control unit , 09 reception control unit, 110 image holding unit, 111 content database, 112 similar feature database, 113 similar result database, 114 time group database, 115 search result holding unit, 121 distance calculation unit, 131 image analysis unit, 132 reduced image generation unit 133 metadata generation unit, 134 entry generation unit, 135 recording control unit, 136 display control unit, 137 search unit, 138-1 transmission control unit, 138-2 transmission control unit, 139-1 reception control unit, 139-2 Reception control unit, 140 image holding unit, 141 content database, 142 similar feature database, 143 similar result database, 144 time group database, 145 association degree extracting unit correspondence holding unit, 146 extraction feature Holding unit, 147 search result holding unit, 151 distance calculation unit, 161 face image detection unit, 162 similar feature amount extraction unit, 171 similar feature vector calculation unit, 172 color feature extraction unit, 201 main image, 202 reduced image, 261 meta Data, 401 image input unit, 402 “red” relevance extraction unit, 403 “blue” relevance extraction unit, 404 “yellow” relevance extraction unit, 405 extraction feature recording unit, 421 search condition input unit, 422 condition collation unit

Claims

A feature extraction unit that analyzes the image and extracts face information related to a face image included in the image;
Generating means for generating metadata associated with the image based on the face information extracted from the image by the feature extracting means;
Transmission means for transmitting a reduced image selected from a reduced image corresponding to each of the plurality of images and metadata associated with the image corresponding to the selected reduced image;
The metadata is an image processing apparatus that allows a user of the external device to search for an image based on the metadata in the external device.

The image processing apparatus according to claim 1, wherein the transmission unit transmits a reduced image selected by a user of the external device and metadata associated with the image corresponding to the selected reduced image.

The feature amount extraction unit extracts color information included in the image,
The generation unit generates the metadata based on the color information,
The image processing apparatus according to claim 1, wherein the metadata includes information related to a color of the image.

The image processing apparatus according to claim 1, wherein the metadata is configured such that metadata used for image search in the external device can be selected.

The image processing apparatus according to claim 1, wherein the metadata includes information on a width and a height of a face included in the image.

The image processing apparatus according to claim 1, wherein the metadata includes comment information including a character string.

The image processing apparatus according to claim 1, wherein the metadata includes a group ID that is data for specifying a group.

The image processing apparatus according to claim 1, wherein the face information includes information on the number of faces included in the image.

The image processing apparatus according to claim 1, wherein the image processing apparatus is connected to the external device via a network.

The image processing device
Analyzing the image, extracting face information about the face image included in the image,
Based on the face information extracted from the image, generate metadata associated with the image,
Transmitting a reduced image selected from the reduced images respectively corresponding to the plurality of images, and metadata associated with the image corresponding to the selected reduced image;
The metadata is an image processing method that allows a user of the external device to search for an image based on the metadata in the external device.

A feature extraction unit that analyzes the image and extracts face information related to a face image included in the image;
Generating means for generating metadata associated with the image based on the face information extracted from the image by the feature extracting means;
The computer functions as a transmission unit that transmits a reduced image selected from the reduced images respectively corresponding to the plurality of images and metadata associated with the image corresponding to the selected reduced image. Let
The metadata is a configuration that allows a user of the external device to search for an image based on the metadata in the external device.

Analyzing the image and generating reduced metadata corresponding to each of the plurality of images from a server that generates metadata associated with the image based on face information regarding the face image included in the image extracted from the image. Receiving means for receiving a reduced image selected from among the images and metadata associated with the image corresponding to the selected reduced image;
Presenting means for presenting a searched image based on the metadata received by the receiving means; and
The metadata is configured to allow a user to search for an image based on the metadata.

The information processing apparatus according to claim 12, wherein the receiving unit receives a reduced image selected by a user of the information processing apparatus and metadata associated with the image corresponding to the selected reduced image. .

The information processing apparatus according to claim 12, further comprising an image capturing unit that captures an image.

The information processing apparatus according to claim 12, wherein the metadata is configured to allow a user of the information processing apparatus to search for an image based on the metadata in the information processing apparatus.

The information processing apparatus according to claim 12, wherein the metadata is data generated based on color information extracted from the image in the server, and includes information regarding a color of the image.

Information processing device
Analyzing the image and generating reduced metadata corresponding to each of the plurality of images from a server that generates metadata associated with the image based on face information regarding the face image included in the image extracted from the image. Receiving a reduced image selected from within and metadata associated with the image corresponding to the selected reduced image;
Presenting the retrieved image based on the received metadata;
The metadata is configured to allow a user to search for an image based on the metadata.

Analyzing the image and generating reduced metadata corresponding to each of the plurality of images from a server that generates metadata associated with the image based on face information regarding the face image included in the image extracted from the image. Receiving means for receiving a reduced image selected from among the images and metadata associated with the image corresponding to the selected reduced image;
Based on the metadata received by the receiving means, the computer functions as a presentation means for presenting the searched image,
The metadata is a program that allows a user to search for an image based on the metadata.