JP2001229382A

JP2001229382A - Information storage device, information retrieval device, information storage method, information retrieval method and recording medium with these methods stored therein

Info

Publication number: JP2001229382A
Application number: JP2000038998A
Authority: JP
Inventors: Takeshi Nishimura; 剛西村; Ryoji Kataoka; 良治片岡
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2000-02-17
Filing date: 2000-02-17
Publication date: 2001-08-24

Abstract

PROBLEM TO BE SOLVED: To provide the information storage device/method, information retrieval device/method and a recording medium storing these methods which can shorten the time required for calculating similarity of objects and also can present the information on the photographed objects to users. SOLUTION: Plural images obtained by photographing previously an object in plural conditions are inputted and the feature value of images are extracted (steps 11 and 12), and the information on the object and its relative objects are generated and stored in a data base with each feature value relative to the same object (steps 13-15). At the same time, the feature value relative to the images having high similarity about the same object are selected or summarized, When a user inputs the image of a desired object for acquiring its information, the user retrieves the similarity for each feature value stored in the object in response to the feature value acquired from the image. Thus, the information on an object stored in the object that is coincident with the retrieved feature value or the feature value having high similarity is presented to the user.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、利用者が与えた物
体を撮影した画像，映像等を基に、その物体に関する情
報を利用者に提示するための情報蓄積装置，情報検索装
置および情報蓄積方法，情報検索方法並びにこれらの方
法を記録した記録媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information storage device, an information retrieval device, and an information storage device for presenting information on an object to a user based on an image, video, or the like of the object given by the user. The present invention relates to a method, an information search method, and a recording medium on which these methods are recorded.

【０００２】[0002]

【従来の技術】予めＣＤ−ＲＯＭやネットワーク上に蓄
積された辞書や図鑑等に対して、キーボード等の入力機
器を用いて検索語を基に検索を行い、検索語と一致ある
いは関連の高い情報を利用者に提示する情報検索装置が
多数存在する。一方、検索語では表現することが難しい
検索対象に対応するものとして、画像を入力として用
い、類似した画像あるいは画像に関連の深い情報を検索
する、類似オブジェクト検索に関する研究・開発が広く
行われている。また、画像入力機器を用いて予め複数の
オブジェクトから何らかの特徴を抽出して、オブジェク
トに関連する情報と関連付けてデータベース（ＤＢ）に
格納しておき、利用者が実世界の物体から算出した特徴
を基に類似検索を行う方法が広く知られている。2. Description of the Related Art A dictionary or picture book stored in advance on a CD-ROM or a network is searched based on a search term using an input device such as a keyboard. There are a large number of information search devices that present the information to the user. On the other hand, research and development on similar object search, which uses an image as an input and searches for similar images or information closely related to images, are widely performed in response to search targets that are difficult to express with search terms. I have. In addition, some features are extracted in advance from a plurality of objects using an image input device, stored in a database (DB) in association with information related to the objects, and the features calculated by the user from real-world objects are stored. A method of performing a similarity search on a basis is widely known.

【０００３】利用者が実世界の物体から算出した特徴を
基に類似検索を行う第１の方法として、実現困難な幾何
学的特徴を抽出することなく、物体を撮影した濃淡画像
を多次元ベクトルデータと見なして、予め物体の構図を
変化させながら連続して撮影した同一物体の映像の特徴
量を固有ベクトル空間にマッピングして軌跡を求め、こ
の軌跡を基に類似検索を行う方法がある（例えば、特開
平８−３２９２４７号公報に開示されている動画像認識
装置の高速固有ベクトル計算装置）。As a first method for performing a similarity search based on features calculated from objects in the real world by a user, a gray-scale image obtained by photographing an object without extracting difficult-to-implement geometric features is converted into a multidimensional vector. There is a method in which the feature amount of a video of the same object continuously photographed while changing the composition of the object in advance is mapped to an eigenvector space to determine a trajectory, and a similarity search is performed based on the trajectory (for example, And a high-speed eigenvector calculation device for a moving image recognition device disclosed in Japanese Patent Application Laid-Open No. 8-329247).

【０００４】また、第２の方法として、類似オブジェク
トの検索において、画像から抽出して与えられた検索キ
ーオブジェクトとＤＢ内のオブジェクトとの間の類似度
を、キーオブジェクトおよび個々のＤＢ内オブジェクト
から抽出した特徴量のベクトル間の類似度によって判定
する方法がある（例えば、特開平１０−１５４１４９号
公報に開示されている類似オブジェクト検索装置）。こ
の際、オブジェクトを数値化するのに用いる特徴量の種
類には、例えばオブジェクトが画像であれば色調，明
度，彩度や画中の線の方向など、さまざまなものが用い
られている。As a second method, in searching for similar objects, a similarity between a search key object extracted from an image and given and an object in the DB is determined from a key object and individual DB objects. There is a method of making a determination based on the degree of similarity between vectors of extracted feature amounts (for example, a similar object search device disclosed in Japanese Patent Application Laid-Open No. H10-154149). At this time, various types of feature amounts used for digitizing the object, such as color tone, brightness, saturation, and the direction of a line in an image, when the object is an image, are used.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記第
１の方法では、固有ベクトルを計算し、類似検索におい
ても輪郭を伸長可変することによって類似度を求める
が、一般に多次元データを対象とした固有ベクトルの計
算はかなりの計算時間を必要とする。このため、この方
法は濃淡画像を入力とし、特定の物体の位置や方向は検
出や、入力画像と限定した物体の類似度の導出、あるい
は物体の動きを基にした類似する動きの検出等の方式に
主に用いられ、色や形状の異なる多くの候補の中から類
似した物体に関する情報を提示する方式には適さない。However, in the first method, the eigenvector is calculated, and the similarity is obtained by expanding the contour in the similarity search. In general, the eigenvector of the multidimensional data is obtained. The calculation requires considerable calculation time. For this reason, this method takes a grayscale image as input, and detects the position and direction of a specific object, derives the similarity between the input image and the limited object, or detects similar movement based on the movement of the object. It is mainly used for the method, and is not suitable for a method of presenting information on a similar object from many candidates having different colors and shapes.

【０００６】また、上記第２の方法は、検索の対象とな
る物体を限定し、予め複数の物体を撮影して得た画像か
ら抽出した特徴量をＤＢに格納し、別に撮影した画像か
ら抽出した特徴量を基に検索することで、ある程度の物
体識別が高速に実現できる。しかし、一般的に撮影する
構図によって形状や色等が異なる物体、あるいは形状が
変化する物体の場合、画像から抽出される特徴量は状況
に応じて大きく変化する。このため、たとえ同一の物体
を撮影した画像であっても、異なる構図あるいは形状で
撮影した画像から抽出した特徴量間の類似度は著しく低
くなることも多く、利用者が任意の構図あるいは形状で
物体を撮影した画像を用いて、該当する物体あるいは類
似した物体に関する情報を提示する方式には適さない。In the second method, an object to be searched is limited, a feature amount extracted from an image obtained by photographing a plurality of objects in advance is stored in a DB, and extracted from an image photographed separately. By retrieving based on the obtained feature amount, a certain degree of object identification can be realized at high speed. However, in general, in the case of an object having a different shape, color, or the like depending on a composition to be photographed, or an object whose shape changes, a feature amount extracted from an image greatly changes depending on a situation. For this reason, even if the images of the same object are captured, the similarity between the feature amounts extracted from the images captured with different compositions or shapes is often extremely low, and the user can select an arbitrary composition or shape. It is not suitable for a method of presenting information on a corresponding object or a similar object using an image of the object.

【０００７】また、前記第１の方法のように、予め物体
の構図や形状を変化させながら物体に関する画像を多様
な構図，形状等で連続的に撮影し、得た画像すべてに関
して前記第２の方法を用いることで、その中のどれかの
画像の類似度が高くなることを期待する方法が考えられ
る。しかし、この方法では、同一の物体に関し、差異の
少ない多量の特徴量が発生することとなり、管理が煩雑
になる、検索にかかる時間が増える、同一の物体に関す
る特徴量が大量にヒットするため検索結果の集約が困難
になる、という問題が生じる。このため、同一の物体に
関して多量に抽出した特徴量を効率的に管理・格納する
方法が必要となる。Further, as in the first method, images of the object are continuously photographed in various compositions and shapes while changing the composition and shape of the object in advance, and the second image is obtained for all the obtained images. By using the method, a method of expecting that the degree of similarity of any of the images becomes high can be considered. However, in this method, a large amount of features with a small difference is generated for the same object, which makes management complicated and increases the time required for searching. There is a problem that it is difficult to aggregate results. For this reason, a method for efficiently managing and storing a large amount of feature amounts extracted for the same object is required.

【０００８】本発明は、上記事情に鑑みてなされたもの
で、その目的とするところは、撮影する構図によって形
状や色等が異なるオブジェクト、あるいは形状の変化す
るオブジェクトを含み得る実世界の物体を対象とした画
像入力機器を用いた情報検索を実現し、多種の物体に関
し、オブジェクトの色情報，形状情報等、多様な特徴量
を抽出して効率的に管理・格納し、物体の類似度算出に
かかる計算時間を低減して、撮影された物体に関する情
報を利用者に提示することを可能とする情報蓄積装置，
情報検索装置および情報蓄積方法，情報検索方法並びに
これらの方法を記録した記録媒体を提供することにあ
る。SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and has as its object the object of the real world which may include an object having a different shape or color depending on a composition to be photographed, or an object having a different shape. Realizes information retrieval using the target image input device, extracts and manages and stores various features of various types of objects, such as object color information and shape information, and calculates similarity of objects. An information storage device that can reduce the calculation time required for and can present information on a photographed object to a user;
An object of the present invention is to provide an information search device, an information storage method, an information search method, and a recording medium on which these methods are recorded.

【０００９】[0009]

【課題を解決するための手段】上記目的を達成するた
め、本発明に係る情報蓄積装置は、物体を撮影した２次
元画像データおよび物体名を入力する第１の画像入力手
段と、該第１の画像入力手段により入力された各２次元
画像データから物体の領域を抽出し、得られた領域画像
データ値から、１以上の画像特徴量を抽出する第１の特
徴量抽出手段と、撮影した物体に関する物体情報を入力
する物体情報入力手段と、前記第１の画像入力手段によ
って得た物体名，前記第１の特徴量抽出手段によって得
た画像特徴量および前記物体情報入力手段によって得た
物体情報を物体ごとに関連づけて物体オブジェクトを生
成する物体情報管理・格納手段と、該物体情報管理・格
納手段が生成した物体オブジェクトを蓄積する物体情報
蓄積手段とを有することを特徴とする。In order to achieve the above object, an information storage apparatus according to the present invention comprises: first image input means for inputting two-dimensional image data of an object and an object name; A first feature value extracting means for extracting a region of the object from each two-dimensional image data input by the image input device, and extracting one or more image feature values from the obtained region image data value; Object information input means for inputting object information relating to the object, an object name obtained by the first image input means, an image feature quantity obtained by the first feature quantity extraction means, and an object obtained by the object information input means Object information management / storage means for generating an object object by associating information for each object, and object information storage means for storing the object object generated by the object information management / storage means And wherein the door.

【００１０】ここで、前記物体情報管理・格納手段は、
前記物体情報蓄積手段が蓄積する物体オブジェクトを管
理し、前記第１の画像入力手段により入力された物体名
を含む物体オブジェクトが前記物体情報蓄積手段に蓄積
されているかどうかを検索する手段と、前記第１の画像
入力手段によって入力された物体名を含む物体オブジェ
クトが前記物体情報蓄積手段に蓄積されている場合に
は、前記第１の特徴量抽出手段の抽出した複数の画像特
徴量を、前記検索手段により検索された前記物体オブジ
ェクトと関連づける手段とを有することが好ましい。Here, the object information management / storage means comprises:
Means for managing the object objects stored by the object information storage means and searching whether or not an object object including the object name input by the first image input means is stored in the object information storage means; When the object object including the object name input by the first image input unit is stored in the object information storage unit, the plurality of image feature amounts extracted by the first feature amount extraction unit are Means for associating the object with the object searched by the search means.

【００１１】また、前記物体情報管理・格納手段は、前
記物体情報蓄積手段に蓄積される物体オブジェクトに関
して、同一の物体オブジェクトとして関連付けられた画
像特徴量の中に類似度の高い画像特徴量が存在する場
合、当該物体オブジェクトに関連付ける画像特徴量を選
別あるいは集約する手段を有することが好ましい。ま
た、前記物体情報蓄積手段に蓄積される物体オブジェク
トは、同一の物体に関して、物体を複数の条件で撮影し
た画像からそれぞれ抽出した画像特徴量と、物体に関す
るテキストや画像，音声等の情報を関連付けたものであ
ることが好ましい。The object information management / storage means may include, for the object objects stored in the object information storage means, an image feature amount having a high degree of similarity among image feature amounts associated as the same object object. In such a case, it is preferable to have a means for selecting or collecting image feature amounts to be associated with the object object. Further, the object object stored in the object information storage means associates image feature amounts respectively extracted from images obtained by photographing the object under a plurality of conditions with respect to the same object, and information such as text, images, and voices on the object. It is preferred that it is.

【００１２】一方、本発明に係る情報蓄積方法は、物体
を撮影した２次元画像および物体名を取り込み、取り込
んだ各２次元画像データから物体の領域を抽出し、得ら
れた領域画像データ値から１以上の画像特徴量を抽出
し、撮影した物体に関する物体情報を入力されると、前
記取り込んだ物体名，前記抽出した画像特徴量および前
記入力された物体情報を物体ごとに関連づけて物体オブ
ジェクトを生成して、この物体オブジェクトを蓄積する
ことを特徴とする。On the other hand, the information storage method according to the present invention captures a two-dimensional image of an object and an object name, extracts an area of the object from each of the captured two-dimensional image data, When one or more image features are extracted and object information relating to the photographed object is input, an object object is associated with the captured object name, the extracted image feature amount, and the input object information for each object. It is characterized in that the object is generated and stored.

【００１３】また、上述のような、本発明に係る情報蓄
積方法を記録した記録媒体は、これを商品として流通さ
せることが可能であり、本発明の権利範囲は上記情報蓄
積方法を実現するためのプログラムを記録した記録媒体
にも及ぶものである。Further, the above-described recording medium on which the information storage method according to the present invention is recorded can be distributed as a product, and the scope of the rights of the present invention is to realize the information storage method. The present invention extends to a recording medium on which the above program is recorded.

【００１４】また、前記目的を達成するため、本発明に
係る情報検索装置は、検索の入力となる物体が含まれる
２次元画像を取り込む第２の画像入力手段と、該第２の
画像入力手段により入力された各２次元画像データから
物体の領域を抽出し、得られた画像データ値から１以上
の画像特徴量を抽出する第２の特徴量抽出手段と、該第
２の特徴抽出手段によって抽出された画像特徴量をキー
に用いて、前述の各物体情報蓄積装置が蓄積する物体オ
ブジェクトを検索し、所望の物体情報を検索する物体情
報検索手段と、該物体情報検索手段によって得られた物
体情報を利用者に提示する検索結果出力手段とを具備す
ることを特徴とする。In order to achieve the above object, an information retrieval apparatus according to the present invention comprises: a second image input means for capturing a two-dimensional image including an object to be searched; and a second image input means. Extracting a region of the object from each of the two-dimensional image data input according to the above, extracting one or more image feature values from the obtained image data values, and a second feature extraction device. Using the extracted image feature amount as a key, an object information stored in each of the object information storage devices described above is searched, and object information search means for searching for desired object information is obtained. And a search result output means for presenting the object information to the user.

【００１５】ここで、前記物体情報検索手段は、前記第
２の特徴量抽出手段を用いて抽出した１以上の画像特徴
量をキーとして、前記物体情報蓄積装置が蓄積する物体
オブジェクトに含まれる複数の画像特徴量を検索し、類
似度の高い画像特徴量と関連づけられた物体名および物
体情報を検索結果として得る手段を有することが好まし
い。Here, the object information retrieving means uses the one or more image feature quantities extracted by the second feature quantity extracting means as a key to retrieve a plurality of object objects contained in the object object stored by the object information storage apparatus. It is preferable to have a means for searching for the image feature amount of (1) and obtaining, as a search result, an object name and object information associated with the image feature amount having a high similarity.

【００１６】一方、本発明に係る情報検索方法は、検索
の入力となる物体が含まれる２次元画像を取り込み、取
り込んだ各２次元画像データから物体の領域を抽出し、
得られた画像データ値から１以上の画像特徴量を抽出
し、抽出された画像特徴量をキーに用いて、請求項１〜
４のいずれか１項に記載の物体情報蓄積装置が蓄積する
物体オブジェクトを検索して、所望の物体情報を検索
し、検索された物体情報を利用者に提示することを特徴
とする。On the other hand, the information retrieval method according to the present invention captures a two-dimensional image including an object to be input for retrieval, extracts an object region from each of the captured two-dimensional image data,
At least one image feature value is extracted from the obtained image data value, and the extracted image feature value is used as a key.
5. The object information storage device according to any one of the items 4, wherein the object information storage device searches for an object object, searches for desired object information, and presents the searched object information to a user.

【００１７】また、上述のような、本発明に係る情報検
索方法を記録した記録媒体は、これを商品として流通さ
せることが可能であり、本発明の権利範囲は上記情報検
索方法を実現するためのプログラムを記録した記録媒体
にも及ぶものである。Further, the recording medium on which the information search method according to the present invention is recorded as described above can be distributed as a product, and the scope of the rights of the present invention is to realize the information search method. The present invention extends to a recording medium on which the above program is recorded.

【００１８】上述のように構成される本発明によれば、
撮影する構図によって形状や色等が異なるオブジェク
ト、あるいは形状の変化するオブジェクトを含み得る実
世界の物体を対象とした画像入力機器を用いた情報蓄積
並びに検索方式を実現し、多種の物体に関し、オブジェ
クトの色情報，形状情報等、多様な特徴量を抽出して効
率的に管理・格納し、物体の類似度算出にかかる計算時
間を低減して、撮影された物体に関する情報を利用者に
提示することができるという効果がある。According to the present invention configured as described above,
An information storage and retrieval method using an image input device for an object having a shape, a color, or the like depending on a composition to be photographed or an object in the real world that may include an object having a change in shape is realized. Extracts various features such as color information, shape information, etc., efficiently manages and stores them, reduces the calculation time required to calculate the similarity of the object, and presents information on the photographed object to the user. There is an effect that can be.

【００１９】なお、前述の通り、本発明に係る情報蓄積
および／または検索方法を記録した記録媒体は、これを
商品として流通させることが可能であり、本発明の権利
範囲は上記情報検索および／または検索方法を実現する
ためのプログラムを記録した記録媒体にも及ぶものであ
る。As described above, a recording medium on which the information storage and / or retrieval method according to the present invention is recorded can be distributed as a product, and the scope of the present invention is limited to the information retrieval and / or retrieval method. Alternatively, the present invention extends to a recording medium on which a program for realizing a search method is recorded.

【００２０】[0020]

【発明の実施の形態】以下、本発明の実施の形態を、図
面に示す好適実施例に基づいて、詳細に説明する。Embodiments of the present invention will be described below in detail with reference to preferred embodiments shown in the drawings.

【００２１】図１は、本発明の一実施例に係る情報検索
装置を示す構成図である。図１において、本実施例に係
る情報検索装置は、物体情報格納部，物体情報蓄積部お
よび物体情報検索部からなっている。物体情報格納部
は、複数の物体に関する物体情報および物体から取得し
た特徴量をそれぞれ物体オブジェクトとして関連付ける
モジュールであり、物体情報蓄積部は、前記物体情報格
納部の関連付けた物体オブジェクトを蓄積するモジュー
ルであり、物体情報検索部は、利用者の用意したキー画
像を基に画像から特徴量を取得し、物体情報蓄積部に蓄
積された物体オブジェクトに対して類似検索を行うこと
で、キー画像に類似した特徴量を持つ物体の情報を利用
者に提示するモジュールである。FIG. 1 is a block diagram showing an information retrieval apparatus according to one embodiment of the present invention. In FIG. 1, the information search device according to the present embodiment includes an object information storage unit, an object information storage unit, and an object information search unit. The object information storage unit is a module that associates, as an object object, object information relating to a plurality of objects and feature amounts obtained from the objects, and the object information storage unit is a module that stores the associated object objects of the object information storage unit. Yes, the object information search unit obtains a feature amount from the image based on the key image prepared by the user, and performs a similarity search on the object object stored in the object information storage unit to obtain a similarity to the key image. It is a module that presents information of an object having the obtained feature amount to the user.

【００２２】物体情報格納部は、第１の画像入力手段１
０１，第１の特徴量抽出手段１０２，物体情報入力手段
１０３，物体情報管理・格納手段１０４からなってい
る。物体情報蓄積部は、物体情報蓄積手段２０１からな
っている。物体情報検索部は、画像入力手段３０１，特
徴量抽出手段３０２，物体情報検索手段３０３，検索結
果出力手段３０２からなっている。The object information storage unit includes a first image input unit 1
01, a first feature amount extraction unit 102, an object information input unit 103, and an object information management / storage unit 104. The object information storage unit includes an object information storage unit 201. The object information search unit includes an image input unit 301, a feature amount extraction unit 302, an object information search unit 303, and a search result output unit 302.

【００２３】物体情報格納部の第１の画像入力手段１０
１は、デジタルカメラあるいはイメージスキャナ等の画
像入力機器によって物体を撮影した画像を、画像データ
として取得する。また、第１の特徴量抽出手段１０２
は、上記第１の画像入力手段１０１の取得した画像デー
タから、物体部分に関する複数種類の特徴量、例えば、
色調，明度，彩度等を算出する。物体情報入力手段１０
３は、物体に関する情報を入力する。物体情報管理・格
納手段１０４は、撮影対象である物体毎に、物体に関す
る情報と、同一種類の物体を異なる条件で複数撮影した
画像から得られる上記第１の特徴量抽出手段１０２の算
出した複数種類の特徴量とを、関連付けて物体オブジェ
クトを生成する。First image input means 10 of object information storage unit
1 acquires an image of an object captured by an image input device such as a digital camera or an image scanner as image data. Further, the first feature amount extracting means 102
Are obtained from the image data acquired by the first image input means 101, a plurality of types of feature amounts regarding the object portion, for example,
Calculate color tone, lightness, saturation, etc. Object information input means 10
3 inputs information on the object. The object information management / storage unit 104 includes, for each object to be photographed, information on the object and a plurality of objects calculated by the first feature amount extraction unit 102 obtained from images of the same type of object photographed under different conditions. An object is generated by associating the type of feature with the feature.

【００２４】物体情報蓄積部の物体情報蓄積手段２０１
は、物体情報格納部における物体情報管理・格納手段１
０４の生成した物体オブジェクトを格納する。物体情報
検索部の第２の画像入力手段３０１は、デジタルカメラ
あるいはイメージスキャナ等の画像入力機器によって、
利用者が撮影した物体に関する画像を画像データとして
取得する。また、第２の特徴量抽出手段３０２は、画像
入力手段３０１の取得した画像データから物体部分に関
する複数種類の特徴量を算出する。物体情報検索手段３
０３は、第２の特徴量抽出手段３０２の算出した複数種
類の特徴量を検索キーとして、物体情報蓄積部における
物体情報蓄積手段２０１に蓄積された物体オブジェクト
の中から物体情報を取得する。検索結果出力手段３０４
は、物体情報検索手段３０３の取得した上記物体情報を
最終的に利用者に提示する。The object information storage means 201 of the object information storage unit
Is the object information management / storage means 1 in the object information storage unit.
04 is stored. The second image input unit 301 of the object information search unit is operated by an image input device such as a digital camera or an image scanner.
An image of the object photographed by the user is acquired as image data. Further, the second feature amount extraction unit 302 calculates a plurality of types of feature amounts regarding the object portion from the image data acquired by the image input unit 301. Object information search means 3
03 acquires object information from the object objects stored in the object information storage unit 201 in the object information storage unit, using the plurality of types of feature amounts calculated by the second feature amount extraction unit 302 as search keys. Search result output means 304
Finally presents the object information obtained by the object information search means 303 to the user.

【００２５】以下、図２を用いて、物体オブジェクトを
生成する方法を説明する。図２は、物体情報格納部の処
理の流れを示すフロー図である。予め検索の対象となる
物体を、デジタルカメラあるいはビデオカメラ等の撮影
機器により、１枚撮影する。そして、第１の画像入力手
段１０１により、上述のデジタルカメラあるいはビデオ
カメラ等の機器により撮影された画像を、また、物体情
報入力手段１０３により物体名を入力する（ステップ１
１）。Hereinafter, a method for generating an object will be described with reference to FIG. FIG. 2 is a flowchart illustrating a flow of a process of the object information storage unit. An object to be searched is photographed in advance by a photographing device such as a digital camera or a video camera. Then, an image captured by the above-described device such as a digital camera or a video camera is input by the first image input unit 101, and an object name is input by the object information input unit 103 (step 1).
1).

【００２６】第１の画像入力手段１０１で入力された画
像データからは、第１の特徴量抽出手段１０２により、
必要に応じて平滑化あるいはキャリブレーションを行っ
た後に、輪郭抽出を行い、物体の撮影されている部分だ
けを切り出される。この物体部分が切り出された画像か
らは、第１の特徴量抽出手段１０２により、例えばＲＧ
Ｂそれぞれの出現画素数あるいは輪郭形状等の特徴量を
抽出される（ステップ１２）。From the image data input by the first image input means 101, the first feature amount extracting means 102
After performing smoothing or calibration as necessary, contour extraction is performed, and only the part of the object where the image is captured is cut out. From the image in which the object portion is cut out, the first feature amount extracting means 102 outputs, for example, RG
A feature amount such as the number of appearance pixels or the outline shape of each of B is extracted (step 12).

【００２７】物体情報管理・格納手段１０４は、ステッ
プ１１で入力された物体名を基に、処理している画像が
初めての物体名であるか否かを、物体情報蓄積部におけ
る物体情報蓄積手段２０１が蓄積している物体オブジェ
クトを検索して調べる（ステップ１３）。入力された物
体名を含む物体オブジェクトが存在せず、初めての物体
名である場合には、物体情報入力手段１０３において物
体情報（例えば、金額（価格）や主な調理法，映像等の
ＨＴＭＬ文章）を入力し（ステップ１４）、物体情報管
理・格納手段１０４は、新しく物体オブジェクトを生成
し、物体情報と物体名とを関連付ける（ステップ１
５）。The object information management / storage means 104 determines whether or not the image being processed is the first object name based on the object name input in step 11 in the object information storage unit. The object object 201 is searched for and examined (step 13). If there is no object object including the input object name and the object name is the first object name, the object information input unit 103 outputs the object information (eg, the amount of money (price), the main cooking method, HTML text such as video, etc.). ) (Step 14), the object information management / storage means 104 generates a new object and associates the object information with the object name (step 1).
5).

【００２８】また、入力された物体名を含む物体オブジ
ェクトが存在し、初めての物体名でなかった場合には、
物体情報管理・格納手段１０４は、上述の既存物体オブ
ジェクトに関連付けられている特徴量とステップ１２に
おいて抽出した同一特徴量との類似度を求める（ステッ
プ１６）。ステップ１７では、類似度が予め定められた
値以上であるか否かを調べる。類似度が予め定められた
値未満であった場合は、処理を終了する。物体情報管理
・格納手段１０４は、ステップ１３において抽出した複
数の特徴量を、物体情報，物体名と関連付けて物体オブ
ジェクトのサブクラスとして格納する（ステップ１
８）。If an object object including the input object name exists and is not the first object name,
The object information management / storage unit 104 obtains a similarity between the feature amount associated with the existing object object and the same feature amount extracted in step 12 (step 16). In step 17, it is checked whether or not the similarity is equal to or greater than a predetermined value. If the similarity is less than the predetermined value, the process ends. The object information management / storage unit 104 stores the plurality of feature amounts extracted in step 13 as a subclass of the object object in association with the object information and the object name (step 1).
8).

【００２９】上述の処理を複数の物体に関して、それぞ
れ複数の条件で撮影した画像を入力に用いて繰り返すこ
とで、物体情報蓄積手段２０１で蓄積される物体オブジ
ェクト・データベースを構築する。By repeating the above-described processing for a plurality of objects using images captured under a plurality of conditions, respectively, as input, an object object database accumulated by the object information accumulation means 201 is constructed.

【００３０】図３は、上述の物体情報蓄積手段２０１に
おいて蓄積される物体オブジェクト構成を示す図であ
り、（ａ）は概念図（イメージ名：例えば、画像ファイ
ル名）、（ｂ）はこれをオブジェクト志向データベース
的に表現したものである。物体情報蓄積手段２０１にお
いては、物体オブジェクトが、例えば図３に例示される
ような階層構造で蓄積される。FIGS. 3A and 3B are diagrams showing the structure of the object stored in the above-mentioned object information storage means 201. FIG. 3A is a conceptual diagram (image name: for example, image file name), and FIG. It is expressed like an object-oriented database. In the object information storage unit 201, the object objects are stored in a hierarchical structure as illustrated in FIG. 3, for example.

【００３１】図４は、物体情報検索部における処理の流
れを示すフロー図である。情報検索装置の利用者は、検
索のキーとなる物体を、デジタルカメラあるいはビデオ
カメラ等の撮影機器により撮影する（ステップ２１）。
第２の画像入力手段３０１で入力された画像データから
は、第２の特徴量抽出手段３０２により、必要に応じて
平滑化あるいはキャリブレーションを行った後に、輪郭
抽出を行い、物体の撮影されている部分だけを切り出さ
れる。物体部分が切り出された画像からは、第２の特徴
量抽出手段１０２により、例えばＲＧＢそれぞれの出現
画素数あるいは輪郭形状等の、複数の特徴量を抽出され
る（ステップ２２）。FIG. 4 is a flowchart showing the flow of processing in the object information search unit. The user of the information retrieval apparatus photographs an object serving as a retrieval key using a photographing device such as a digital camera or a video camera (step 21).
From the image data input by the second image input means 301, after smoothing or calibration is performed as necessary by the second feature amount extracting means 302, contour extraction is performed, and the object is photographed. Only the part that is is cut out. From the image from which the object portion has been cut out, a plurality of feature amounts, such as the number of appearing pixels of RGB or the outline shape, are extracted by the second feature amount extracting means 102 (step 22).

【００３２】物体情報検索手段３０３は、特徴量抽出手
段３０２により抽出した複数の特徴量をキーにして、一
定値以上の類似度を持つ物体名を検索し（ステップ２
３）、同じ物体名が複数判断されたか否かを調べる（ス
テップ２４）。検索した結果、同じ物体名が複数ヒット
した場合には、類似度の低い検索結果を削除する（ステ
ップ２５）。物体情報検索手段３０３は、物体名より物
体情報手段２０１の蓄積している物体オブジェクトから
物体情報を検索する（ステップ２６）。検索結果出力手
段３０４は、検索結果として、類似検索でヒットした物
体名と情報名を、例えばパーソナルコンピュータ（Ｐ
Ｃ）のディスプレイに表示することで、利用者に提示す
る（ステップ２７）。The object information retrieving means 303 retrieves an object name having a similarity of a certain value or more by using a plurality of feature amounts extracted by the feature amount extracting means 302 as a key (step 2).
3) It is checked whether a plurality of the same object names have been determined (step 24). As a result of the search, if the same object name is hit a plurality of times, the search result having a low similarity is deleted (step 25). The object information search means 303 searches object information from the object stored in the object information means 201 based on the object name (step 26). The search result output means 304 outputs, as a search result, an object name and an information name hit in a similar search, for example, to a personal computer (P
The information is presented to the user by displaying it on the display of C) (step 27).

【００３３】〔実施例〕物体情報作成者（以下、作成者
という）は、まず、例えばスーパーマーケットに並ぶ野
菜や植物園の花というような、この検索装置の利用者が
「それについて何かを知りたい」と思う物体を、自身で
それぞれ複数の構図によりデジタルカメラ等を用いて撮
影する。以下、具体例として、半分に切られたスイカに
関する情報を、物体情報格納部を用いて格納する場合の
処理の流れを記す。[Embodiment] An object information creator (hereinafter referred to as a creator) firstly asks the user of this search apparatus, for example, vegetables or flowers in a botanical garden in a supermarket, that he wants to know something about it. Is photographed by using a digital camera or the like in a plurality of compositions. Hereinafter, as a specific example, a description will be given of a processing flow when information on a half cut watermelon is stored using the object information storage unit.

【００３４】作成者は、半分に切られたスイカを切り口
側から撮影し、第１の画像入力手段１０１によって、撮
影した物体名であるスイカという名前とともにコンピュ
ータに入力する。第１の画像入力手段１０１は、入力さ
れた画像に対し、例えば入力された年月日時秒を基に生
成した数字列等、一意に決定される画像ＩＤを割り振
り、例えば６４０×４８０×２４bitのビットマップ形
式等で、第１の特徴量抽出手段１０２の扱う画像データ
に変換する。The creator photographs the watermelon cut in half from the cut side, and inputs the photographed watermelon to the computer by the first image input means 101 together with the name of the photographed watermelon. The first image input unit 101 assigns a uniquely determined image ID such as a numeric string generated based on the input date, month, day, hour, and second to the input image, for example, a 640 × 480 × 24 bit. The image data is converted into image data handled by the first feature amount extraction unit 102 in a bitmap format or the like.

【００３５】第１の特徴量抽出手段１０２は、第１の画
像入力手段１０１から画像ＩＤの割り振られた画像デー
タを受け取り、例えば、動的輪郭モデル（Ｓnakes），
画素結合法、あるいはヒストグラム法等の画像内オブジ
ェクト抽出方法を用いて、画像のうちスイカが写ってい
る部分を切り出す。なお、これに関しては、例えば、下
記の文献に記載されている方法が有効に用い得る。（１）美濃，天野による「Ｓnakes ：現在・過去・未
来」（信学技報ＰＲＭＵ97-184(1997-12)，pp81-88）（２）高木，下田（監修）による「画像解析ハンドブッ
ク」（東京大学出版会，1991.pp.692-693）The first feature amount extracting means 102 receives the image data to which the image ID is assigned from the first image input means 101 and, for example, receives a dynamic contour model (Snakes),
Using a pixel combining method or an object extraction method in an image such as a histogram method, a portion where a watermelon appears in the image is cut out. In this connection, for example, a method described in the following document can be effectively used. (1) "Snakes: Present, Past, and Future" by Mino and Amano (IEICE PRMU97-184 (1997-12), pp81-88) (2) "Image Analysis Handbook" by Takagi and Shimoda (supervised) ( The University of Tokyo Press, 1991.pp.692-693)

【００３６】また、第１の特徴量抽出手段１０２は、ス
イカの画像から、例えばＲＧＢの各色について８ｂｉｔ
で構成された画素値に関して、値が０−１２７のときを
０、１２８−２５５のときを１、とした場合、（Ｒ，
Ｇ，Ｂ）の値が、（０，０，０）、（１，０，０）、
（０，１，０）、（０，０，１）、（１，１，０）、
（１，０，１）、（０，１，１）、（１，１，１）のい
ずれに属するかの累計度数を求め、これを８次元ベクト
ルデータに変換する。The first feature amount extracting means 102 converts, for example, 8 bits for each of RGB colors from the watermelon image.
With respect to the pixel value composed of the following, when the value is 0-127, 0, and when the value is 128-255, 1, the value is (R,
G, B) are (0, 0, 0), (1, 0, 0),
(0,1,0), (0,0,1), (1,1,0),
A cumulative frequency of any of (1,0,1), (0,1,1), and (1,1,1) is obtained, and this is converted into 8-dimensional vector data.

【００３７】物体情報管理・格納手段１０３は、物体情
報蓄積部における物体情報蓄積手段２０１に相当するオ
ブジェクト指向データベースを管理しており、例えば図
５に示すようなデータ構造を扱うものとする。物体情報
管理・格納手段１０３は、まず、物体名スイカを含む物
体オブジェクトがすでにデータベースに格納されている
か否かを調べる。The object information management / storage unit 103 manages an object-oriented database corresponding to the object information storage unit 201 in the object information storage unit, and handles a data structure as shown in FIG. 5, for example. First, the object information management / storage unit 103 checks whether or not the object object including the object name watermelon is already stored in the database.

【００３８】物体名スイカがデータベースに存在してい
ない場合は、作成者にスイカに関する情報を入力するこ
とを求める。物体情報入力手段１０４において、利用者
は、例えば、値段あるいはフルーツパンチ等のスイカを
利用した主な料理の名前等のスイカに関する情報あるい
はスイカの映像等を、ＨＴＭＬ文書等の形式を用いて記
述し、物体情報として入力する。物体情報管理・格納手
段１０３は、物体クラスのＯｂｊｅｃｔオブジェクトを
新しく作成し、物体名スイカおよび物体情報をＯｂｊｅ
ｃｔオブジェクトに格納した後、新しく子オブジェクト
として画像オブジェクトＩmageおよび画像特徴量Ｉmage
Ｆeatureを作成し、特徴量抽出手段１０２の抽出特徴
量および画像ＩＤを関連付けて格納する。If the object name watermelon does not exist in the database, the creator is requested to input information about the watermelon. In the object information input means 104, the user describes information about the watermelon such as the name of the main dish using the watermelon such as the price or fruit punch or the image of the watermelon using a format such as an HTML document. , As object information. The object information management / storage unit 103 newly creates an object object of the object class and stores the object name watermelon and the object information in the object.
After storing in the ct object, the image object Image and the image feature amount Image are newly set as child objects.
A feature is created, and the extracted feature amount of the feature amount extracting unit 102 and the image ID are stored in association with each other.

【００３９】一方、物体名スイカを含む物体オブジェク
ト「スイカ」が存在した場合、物体オブジェクトの子オ
ブジェクトである全ての画像特徴量オブジェクトに関し
て、特徴量抽出手段１０２の抽出した特徴量と画像特徴
量オブジェクトに格納されている特徴量ＲＧＢｈｉｓｔ
との間の類似度を算出し、類似度が一定値以上であった
場合、処理を終了する。On the other hand, when there is an object object “watermelon” including an object name watermelon, the feature amount extracted by the feature amount extracting means 102 and the image feature amount object are extracted for all image feature amount objects which are child objects of the object object. Feature amount RGBhist stored in
Is calculated, and if the similarity is equal to or greater than a certain value, the process ends.

【００４０】このとき、特徴量Ａと特徴量Ｂの類似度
は、例えば一般化した楕円距離（generalized ellipsoi
d distance：Ｔ.Ｓeidl and Ｈ.Ｐ.Ｋriegel，”Ｅffci
ent Ｕser-Ａdaptable Ｓimilarity Ｓearch in Ｌarge
Ｍultimedia Ｄatabases”．in Ｐroc. of ＶＬＤＢ，
pp.506-515，Ａthens，Ｇreece，Ａug．1997）に対し、At this time, the similarity between the feature amount A and the feature amount B is, for example, a generalized ellipse distance (generalized ellipsois).
d distance: T. Seidl and HP Kriegel, "Effci
ent User-Adaptable Similarity Search in Large
Multimedia Databases ”in Proc. Of VLDB,
pp. 506-515, Athens, Greece, Aug. 1997)

【数１】（このとき、ｓは類似度、ｄは一般化した楕円距離を表
わす）などという式により変換を行うことで得られ、通
常０＜ｓ＜１（１が最も類似しており、０が最もかけ離
れている）といった範囲をとる。(Equation 1) (At this time, s represents the similarity, d represents the generalized elliptical distance), etc., and is usually obtained by 0 <s <1 (1 is most similar, and 0 is most distant. ).

【００４１】類似度が一定値未満であった場合、物体オ
ブジェクト「スイカ」の子オブジェクトとして新しく画
像オブジェクトＩmageおよび画像特徴量オブジェクトＩ
mageＦeatureを追加し、第１の特徴量抽出手段１０２の
抽出した複数の特徴量および画像ＩＤを関連付けて格納
する。以上の処理を、他にも例えばスイカを切り口の反
対側から撮影した画像、あるいは真横から見た画像等、
他の角度から撮影した画像に対しても行うことで、スイ
カに関する物体オブジェクトを作成する。If the similarity is less than a certain value, the image object Image and the image feature amount object I are newly created as child objects of the object "watermelon".
A mageFeature is added, and the plurality of feature amounts extracted by the first feature amount extraction unit 102 and the image ID are stored in association with each other. In addition to the above processing, for example, an image of watermelon taken from the opposite side of the cut, or an image viewed from the side, etc.
An object object related to a watermelon is created by performing the process on an image captured from another angle.

【００４２】同様に、スイカ以外の青果についても、特
徴量を格納する処理を繰り返すことで、マルチメディア
図鑑に相当する複数の物体オブジェクトを作成する。以
下、例として利用者がスイカに関する情報を求めて物体
情報検索部を用いるときの処理の流れを示す。なお、予
め前記物体情報格納部によって、例えば、スイカ，リン
ゴ，キュウリのそれぞれ全体，半分に切った部分等を、
さまざまな角度から撮影した画像が入力され、物体情報
蓄積部における物体情報蓄積手段２０１に蓄積されてい
るものとする。Similarly, for fruits and vegetables other than watermelon, a plurality of object objects corresponding to a multimedia pictorial book are created by repeating the process of storing the characteristic values. Hereinafter, as an example, a flow of processing when a user uses the object information search unit to obtain information about watermelon will be described. In addition, the object information storage unit previously stores, for example, watermelon, apple, and cucumber in whole and in half, respectively.
It is assumed that images captured from various angles are input and stored in the object information storage unit 201 of the object information storage unit.

【００４３】利用者は、例えばスイカを任意の方向から
デジタルカメラ等の画像入力機器を用いて撮影し、画像
入力手段３０１によって、コンピュータに入力する。画
像は例えば６４０×４８０×２４bitのビットマップ形
式等、特徴量抽出手段３０２の扱える画像データに変換
される。特徴量抽出手段３０２は、前記物体情報格納部
における第１の特徴量抽出手段１０２同様、画像入力手
段３０１から画像データを受け取り、例えば前述の動的
輪郭モデル（Ｓnakes），画素結合法あるいはヒストグ
ラム等の画像内オブジェクト抽出方法を用いて、画像の
うちスイカが写っている部分を切り出す。The user shoots watermelon from an arbitrary direction using an image input device such as a digital camera, and inputs the watermelon to the computer by the image input means 301. The image is converted into image data, such as a 640 × 480 × 24 bit bitmap format, that can be handled by the feature amount extraction unit 302. Similar to the first feature amount extracting unit 102 in the object information storage unit, the feature amount extracting unit 302 receives image data from the image input unit 301 and, for example, receives the above-described active contour model (Snakes), a pixel combination method, a histogram, or the like. The portion of the image where the watermelon is shown is cut out using the in-image object extraction method.

【００４４】また、特徴量抽出手段３０２は、前述の物
体情報格納部における第１の特徴量抽出手段１０２と同
様の方法で特徴量を抽出し、８次元ベクトルデータを抽
出する。物体情報検索手段３０３は、上述の特徴量抽出
手段３０２の抽出した特徴量を検索キーにして、物体情
報蓄積部の物体情報蓄積手段２０１に蓄積された全ての
物体オブジェクトを検索し、物体情報格納部の物体情報
管理・格納手段１０３と同様の方法を用いて類似度を算
出し、類似度の高い特徴量が抽出された画像ＩＤおよび
その類似度を求め、その類似度および画像ＩＤの親クラ
スにある物体名を結果として得る。The feature quantity extracting means 302 extracts feature quantities in the same manner as the first feature quantity extracting means 102 in the above-mentioned object information storage section, and extracts eight-dimensional vector data. The object information search means 303 searches all the object objects stored in the object information storage means 201 of the object information storage unit using the feature amount extracted by the feature amount extraction means 302 as a search key, and stores the object information. The similarity is calculated using the same method as that of the object information management / storage means 103 of the part, and the image ID from which the feature amount having a high similarity is extracted and the similarity are obtained. The similarity and the parent class of the image ID As a result.

【００４５】ここでは、例えば予め類似度に対して閾値
を決めておき、その値より高い類似度を持つ物体名を類
似度の高い物体名とする。このとき、閾値は前記物体情
報格納部が物体オブジェクトを格納する際（図２：ステ
ップ１７）に用いた閾値と同じ値を用いる。なお、検索
の結果、同一の物体名が複数得られた場合には、一番高
い類似度をその物体の類似度と見なし、他の類似度は除
外する。例えば、（スイカ，０．６７８），（スイカ，
０．３２８），（スイカ，０．５６３），（メロン，
０．８３４），（メロン，０．４６３）という結果が得
られた場合、（スイカ，０．６７８）および（メロン，
０．８３４）を有効な結果とする。Here, for example, a threshold value is determined in advance for the similarity, and an object name having a higher similarity is set as an object name having a higher similarity. At this time, the same threshold value is used as the threshold value used when the object information storage unit stores the object object (FIG. 2: Step 17). When a plurality of identical object names are obtained as a result of the search, the highest similarity is regarded as the similarity of the object, and other similarities are excluded. For example, (watermelon, 0.678), (watermelon,
0.328), (watermelon, 0.563), (melon,
0.834), (melon, 0.463), (watermelon, 0.678) and (melon,
0.834) is a valid result.

【００４６】例えば、利用者の入力した画像が緑地と黒
の縞で構成されるスイカの表面を撮影したものであり、
その画像から抽出された特徴量は８次元ベクトルデータ
のうち、（０，０，０）および（０，１，０）のベクト
ル要素の値が多いものとする。この場合、前記物体情報
蓄積手段に蓄積される物体オブジェクトの画像オブジェ
クトのうち、画像特徴量ＲＧＢhistにおいて、（０，
０，０）および（０，１，０）のベクトル要素の値が多
い傾向を持つ特徴量との類似度が高くなり、例えばスイ
カやキュウリの表面を撮影した画像の類似度が高くな
る。一方、同じスイカを撮影した画像であっても、例え
ば半分に切ったスイカの切り口を写した画像は、赤色が
多く含まれるため、類似度は低くなる。For example, the image input by the user is a photograph of the surface of a watermelon composed of green ground and black stripes.
It is assumed that the feature quantity extracted from the image has a large value of the vector elements (0, 0, 0) and (0, 1, 0) in the eight-dimensional vector data. In this case, among the image objects of the object objects stored in the object information storage means, (0,
The degree of similarity to a feature amount having a tendency to have large values of the vector elements (0, 0) and (0, 1, 0) increases, and for example, the degree of similarity of an image obtained by capturing the surface of a watermelon or cucumber increases. On the other hand, even if an image of the same watermelon is captured, for example, an image of a cut watermelon cut in half contains a large amount of red, and thus has a low degree of similarity.

【００４７】逆に、例えば利用者の入力した画像が半分
に切られたスイカの赤い切り口を写したものであり、そ
の画像から抽出された特徴量は８次元ベクトルデータの
うち、（１，０，０）および（１，１，１）のベクトル
要素の値が多いものとする。この場合、前記物体情報蓄
積手段に蓄積される物体オブジェクトの画像オブジェク
トのうち、画像特徴量ＲＧＢhistにおいて、（１，０，
０）および（１，１，１）のベクトル要素の値が多い傾
向を持つ特徴量との類似度が高くなり、例えば赤いリン
ゴの表面を撮影した画像、あるいは半分に切ったスイカ
の切り口を撮影した画像の類似度が高くなる。Conversely, for example, the image input by the user is a photograph of a red cut of a watermelon cut in half, and the feature amount extracted from the image is (1, 0) of the eight-dimensional vector data. , 0) and (1, 1, 1) have large values of vector elements. In this case, of the image objects of the object objects stored in the object information storage unit, (1, 0,
The degree of similarity to the feature quantity having a tendency to have large values of the vector elements of (0) and (1,1,1) is increased, and for example, an image of the surface of a red apple or a cut of a watermelon cut in half is shot. The similarity of the obtained image is increased.

【００４８】従って、利用者の入力した画像がスイカの
表面を撮影した画像であっても、切り口を撮影した画像
であっても、物体オブジェクト「スイカ」に含まれる画
像オブジェクトのいずれかの類似度が高くなることで、
所望の検索結果であるスイカが検索結果の一つとして返
されることになる。物体情報検索手段３０３は、有効な
物体名を基に、物体情報蓄積手段２０１に蓄積された物
体オブジェクトから物体名object nameと関連づけられ
た物体情報object informationを取り出す。検索結果出
力手段３０４は、物体情報検索手段３０３の検索結果を
基に、類似度の高い順に、この場合ではスイカとメロン
に関する情報、例えば金額や主な調理法，映像等のＨＴ
ＭＬ文書などを情報端末の表面に表示するなどの方法を
用いて、利用者に提示する。Therefore, regardless of whether the image input by the user is an image of the surface of a watermelon or an image of a cut, the similarity of any of the image objects included in the object “watermelon” Is higher,
The watermelon that is the desired search result will be returned as one of the search results. The object information search unit 303 extracts object information object information associated with the object name object name from the object objects stored in the object information storage unit 201 based on the valid object name. The search result output means 304 is based on the search results of the object information search means 303 and, in this case, information on watermelon and melon, for example, HT such as amount, main cooking method, video, etc.
The ML document is presented to the user using a method such as displaying the ML document on the surface of the information terminal.

【００４９】上記実施例によれば、撮影する構図などに
よって形状や色等が異なるオブジェクト、あるいは形状
の変化するオブジェクトを含み得る実世界の物体を対象
とした画像入力機器を用いた情報蓄積並びに検索におい
て、多種の物体に関し、オブジェクトの色情報，形状情
報等、多様な特徴量を抽出して効率的に蓄積・管理・格
納することが可能になり、また、物体の類似度算出にか
かる計算時間を低減して、撮影された物体に関する情報
を利用者に提示することが可能な、情報蓄積および情報
検索技法を提供できる。なお、上記実施例は本発明の一
例を示すものであり、本発明はこれに限定されるべきも
のではない。According to the above embodiment, information storage and retrieval using an image input device for an object having a shape, a color, or the like depending on a composition to be photographed, or an object in the real world which may include an object having a change in shape. , It becomes possible to extract, store, manage and store various features such as color information and shape information of an object with respect to various kinds of objects efficiently, and to calculate the time required to calculate the similarity of the objects. And an information storage and information retrieval technique capable of presenting information about a photographed object to a user. Note that the above embodiments are merely examples of the present invention, and the present invention is not limited to these embodiments.

【００５０】[0050]

【発明の効果】以上、詳細に説明したように、本発明に
よれば、１つの物体オブジェクトに関して複数の画像を
作成し、それらの複数の画像を集約して意味のあるもの
だけを選別し、これらの複数の画像から物体に関する特
徴量を抽出して、それらを基になった画像と関連付けて
一緒に蓄積するようにしたので、撮影する構図によって
形状や色等が異なったり、あるいは形状の変化するオブ
ジェクトを含み得る実世界の物体を対象とした画像入力
機器を用いた情報検索を実現し、多種の物体に関して、
オブジェクトの色情報，形状情報等、多様な特徴量を抽
出してこれを適宜集約することで効率的に蓄積・管理・
格納することができると共に、物体の類似度算出にかか
る計算時間を低減して、撮影された物体に関する情報を
利用者に提示することができるという効果が得られる。As described above in detail, according to the present invention, a plurality of images are created for one object object, and the plurality of images are aggregated to select only those that are significant. Since the feature values related to the object are extracted from these multiple images and stored together with the images based on them, the shape, color, etc. differ depending on the composition to be shot, or the shape changes. Realize information retrieval using image input equipment targeting real world objects that may include objects that
By extracting various features such as color information and shape information of objects and collecting them as appropriate, efficient accumulation / management /
In addition to being able to store the information, it is possible to reduce the calculation time required for calculating the similarity of the object, and to provide the user with information on the photographed object.

[Brief description of the drawings]

【図１】本発明の一実施例に係る情報検索装置を示す構
成図である。FIG. 1 is a configuration diagram illustrating an information search device according to an embodiment of the present invention.

【図２】物体情報格納部の処理の流れを示すフロー図で
ある。FIG. 2 is a flowchart showing a flow of processing of an object information storage unit.

【図３】物体オブジェクト構成の概念を説明する図であ
り、（ａ）は概念図、（ｂ）はこれをオブジェクト志向
データベース的に表現したものである。3A and 3B are diagrams for explaining the concept of an object configuration, in which FIG. 3A is a conceptual diagram, and FIG. 3B is a diagram representing this in an object-oriented database.

【図４】物体情報検索部の処理の流れを示すフロー図で
ある。FIG. 4 is a flowchart showing a flow of processing of an object information search unit.

【図５】物体情報蓄積手段に蓄積される物体オブジェク
トのデータ構造例を示す図である。FIG. 5 is a diagram illustrating an example of a data structure of an object stored in an object information storage unit.

【符号の説明】１０１第１の画像入力手段１０２第１の特徴量抽出手段１０３物体情報入力手段１０４物体情報管理・格納手段２０１物体情報蓄積手段３０１第２の画像入力手段３０２第２の特徴量抽出手段３０３物体情報検索手段３０４検索結果出力手段DESCRIPTION OF SYMBOLS 101 First image input means 102 First feature value extraction means 103 Object information input means 104 Object information management / storage means 201 Object information storage means 301 Second image input means 302 Second feature value Extraction means 303 Object information search means 304 Search result output means

Claims

[Claims]

1. First image input means for inputting two-dimensional image data of an object and an object name, and extracting an area of the object from each two-dimensional image data input by the first image input means. A first feature amount extraction unit for extracting one or more image feature amounts from the obtained region image data value; an object information input unit for inputting object information on a photographed object; and the first image input unit Object name obtained by
Object information management and storage means for generating an object object by associating the image feature amount obtained by the feature amount extraction means with the object information obtained by the object information input means for each object, and the object information management and storage means An information storage device, comprising: object information storage means for storing a selected object object.

2. The object information management / storage means manages an object object stored by the object information storage means,
Means for searching whether or not an object object including the object name input by the first image input means is stored in the object information storage means, and including an object name input by the first image input means When an object object is stored in the object information storage unit, the plurality of image feature amounts extracted by the first feature amount extraction unit are
2. The information storage device according to claim 1, further comprising: means for associating the object with the object searched by the search means.

3. The object information managing / storing means includes, for an object object stored in the object information storing means, an image feature amount having a high similarity among image feature amounts associated as the same object object. If you do
2. The image processing apparatus according to claim 1, further comprising: means for selecting or aggregating image features associated with the object.
Or the information storage device according to 2.

4. An object object stored in the object information storage means includes: image feature amounts extracted from images of the same object taken under a plurality of conditions;
4. The information storage device according to claim 3, wherein information such as a text, an image, and a sound related to the object is associated.

5. A two-dimensional image obtained by capturing an object and an object name are taken in, an area of the object is extracted from each of the taken two-dimensional image data, and one or more image feature values are extracted from the obtained area image data value. When object information relating to a photographed object is input, an object object is generated by associating the captured object name, the extracted image feature quantity and the input object information for each object, and accumulating the object object. An information storage method characterized by:

6. A recording medium on which a program for realizing the information storage method according to claim 5 is recorded.

7. A second image input means for capturing a two-dimensional image including an object to be input for a search, and extracting an area of the object from each of the two-dimensional image data input by the second image input means. A second feature amount extracting means for extracting one or more image feature amounts from the obtained image data value; and using the image feature amount extracted by the second feature extracting portion as a key. 4. searching for the object object stored by the object information storage device according to any one of 4.
An information search device comprising: an object information search unit that searches for desired object information; and a search result output unit that presents object information obtained by the object information search unit to a user.

8. The method according to claim 8, wherein the object information search unit uses the one or more image feature amounts extracted by the second feature amount extraction unit as a key to store a plurality of image objects included in the object object stored by the object information storage device. 8. The information retrieval apparatus according to claim 7, further comprising means for retrieving an image feature amount and obtaining, as a search result, an object name and object information associated with the image feature amount having a high degree of similarity.

9. A two-dimensional image including an object to be input for retrieval is fetched, an area of the object is extracted from each fetched two-dimensional image data, and one or more image feature values are extracted from the obtained image data value. And using the extracted image feature amount as a key to search for an object stored by the object information storage device according to any one of claims 1 to 4, and search for desired object information. An information retrieval method, characterized by presenting selected object information to a user.

10. A recording medium on which a program for realizing the information search method according to claim 9 is recorded.