JP4427714B2

JP4427714B2 - Image recognition apparatus, image recognition processing method, and image recognition program

Info

Publication number: JP4427714B2
Application number: JP2003320733A
Authority: JP
Inventors: 真智子瀬川; 太後藤; 敏博渡邉; 文武趙; 誠村田; 圭吾井原
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-09-13
Filing date: 2003-09-12
Publication date: 2010-03-10
Anticipated expiration: 2023-09-12
Also published as: JP2004127285A

Description

本発明は、例えば、人物や店舗などに掲示されたロゴマークなどを撮像し、その対象画像から個人や法人を特定する画像認識装置、画像認識処理方法および画像認識プログラムに関する。 The present invention relates to an image recognition apparatus, an image recognition processing method, and an image recognition program for capturing, for example, a logo mark posted on a person or a store and identifying an individual or a corporation from the target image.

カメラで撮像した画像中に写る顔が誰の顔であるかを識別して人物同定する顔認識技術は、個人を識別したり認証するセキュリティ分野に用いられる他、例えば犬等の動物を模した外観形状を有するペットロボットのヒューマンインタフェースにも利用されている。 Face recognition technology that identifies a person by identifying who is the face in the image captured by the camera is used in the security field to identify and authenticate an individual, for example, imitating an animal such as a dog It is also used for the human interface of pet robots with external shapes.

ヒューマンインタフェースに使用される顔認識技術としては、例えばサポートベクタマシン（ＳＶＭ）を用いて撮像画像中から人の顔に相当する濃淡パターンをパターン識別して顔検出し、これにて得られる入力顔画像データと、予めデータベース登録される各人毎の登録顔画像データとの差分値を算出し、その算出結果に基づいて、差分値が最も小さい登録顔画像データに対応付けられた人物を撮像画像中に写る人物と同定する手法が知られており、その詳細については例えば特許文献１に開示されている。 As a face recognition technique used for a human interface, for example, a support vector machine (SVM) is used to detect a face by patterning a shading pattern corresponding to a human face from a captured image, and an input face obtained thereby A difference value between the image data and registered face image data for each person registered in the database in advance is calculated, and a person associated with the registered face image data having the smallest difference value is captured based on the calculation result. A technique for identifying a person appearing inside is known, and details thereof are disclosed in Patent Document 1, for example.

特許公開２００２−１５７５９６号公報Japanese Patent Publication No. 2002-157596

ところで、上述した従来の顔認識技術では、登録顔画像データを記憶管理するデータベースに予め登録されている参照画像のサンプル数が多くなると、必然的に相似する顔も増えることから誤認識が生じ易くなり認識率低下を招致するという問題がある。 By the way, in the conventional face recognition technology described above, if the number of reference image samples registered in advance in the database for storing and managing registered face image data increases, the number of similar faces inevitably increases, and thus erroneous recognition is likely to occur. There is a problem that invites a decrease in recognition rate.

そこで本発明は、このような事情に鑑みてなされたもので、データベースに登録される参照画像のサンプル数が多い場合でも認識率を向上させることができる画像認識装置、画像認識処理方法および画像認識プログラムを提供することを目的としている。 Therefore, the present invention has been made in view of such circumstances, and an image recognition apparatus, an image recognition processing method, and image recognition that can improve the recognition rate even when the number of reference image samples registered in the database is large. The purpose is to provide a program.

上記目的を達成するため、請求項１に記載の発明は、認識すべき対象画像を撮像する撮像手段と、前記対象画像に対するユーザとの親密度を推定する推定手段と、前記親密度に依存する複数の属性毎に分類され、各属性に分類された参照画像と、各参照画像に各々対応して一意に付与された認識ＩＤとをそれぞれ記憶する複数の属性別画像データベース手段と、前記複数の属性別画像データベース手段の内から、現在、前記推定手段によって推定された親密度に対応した属性別画像データベース手段を選択する選択手段と、前記選択手段によって選択された属性別画像データベース手段に記憶された参照画像を参照して、前記撮像手段によって撮像された対象画像に対応した認識ＩＤを特定する対象特定手段とを具備し、前記推定手段は、前記撮像手段によって撮像される全領域に対する対象画像の占有面積比、または過去において前記対象特定手段によって前記対象画像が特定された回数、または前記占有面積比と前記回数の組み合わせに基づいて、前記対象画像に対するユーザとの親密度を推定することを特徴とする。 In order to achieve the above object, the invention according to claim 1 depends on the familiarity, an imaging means for photographing a target image to be recognized, an estimation means for estimating a familiarity with a user for the target image, and the familiarity. A plurality of attribute-specific image database means for storing a reference image classified into a plurality of attributes, each of which is classified into each attribute, and a recognition ID uniquely assigned to each reference image; A selection means for selecting an attribute-specific image database means corresponding to the familiarity estimated by the estimation means from among the attribute-specific image database means, and an attribute-specific image database means selected by the selection means. reference image with reference to the to and a target specifying means for specifying a recognition ID corresponding to the target image captured by the imaging means, said estimating means, before The target image based on the occupation area ratio of the target image with respect to the entire region imaged by the imaging unit, the number of times the target image is specified by the target specifying unit in the past, or the combination of the occupation area ratio and the number of times The intimacy with the user is estimated .

請求項２に記載の発明は、請求項１に加えて、現在の日時を計数する日時計数手段をさらに備え、前記属性別画像データベース手段は、前記対象画像に対するユーザとの親密度と、前記撮像手段によって撮像される時刻とに依存する属性に分類され、前記選択手段は、前記推定手段によって推定された親密度と、前記日時計数手段によって計数された現在日時とに基づいて、前記複数の属性別画像データベース手段の内から、現在、前記撮像手段によって撮像された対象画像に対するユーザとの親密度と、現在の日時とに対応した属性別画像データベース手段を選択することを特徴とする。 Invention according to claim 2, in addition to claim 1, further comprising a sundial number means for counting the current time, the attribute-based image database means includes a degree of intimacy with the user with respect to the target image, the The selection unit is categorized into attributes depending on the time taken by the imaging unit, and the selection unit is configured to select the plurality based on the familiarity estimated by the estimation unit and the current date and time counted by the date and time counting unit. The attribute-specific image database means corresponding to the user's intimacy with the target image currently captured by the imaging means and the current date and time is selected from the attribute-specific image database means .

請求項３に記載の発明は、請求項１に加えて、現在の位置を測位する測位手段をさらに備え、前記属性別画像データベース手段は、前記対象画像に対するユーザとの親密度と、前記撮像手段によって撮像される位置とに依存する属性によって分類され、前記選択手段は、前記推定手段によって推定された親密度と、前記測位手段によって測位された現在の位置とに基づいて、前記複数の属性別画像データベース手段の内から、現在、前記撮像手段によって撮像された対象画像に対するユーザとの親密度と、現在の位置とに対応した属性別画像データベース手段を選択することを特徴とする。 Invention of claim 3, in addition to claim 1, further comprising a positioning means for positioning a current position, the attribute-based image database means includes a degree of intimacy with the user with respect to the target image, the image pickup means are classified by attributes that depend on the position to be imaged by said selection means, said a parent density estimated by the estimating means, on the basis of the positioning has been in the current position by the positioning means, by said plurality of attributes An attribute-specific image database means corresponding to the user's intimacy with the target image currently captured by the imaging means and the current position is selected from the image database means .

請求項４に記載の発明は、請求項１に加えて、現在の日時を計数する日時計数手段と、現在の位置を測位する測位手段とをさらに備え、前記属性別画像データベース手段は、前記対象画像に対するユーザとの親密度と、前記撮像手段によって撮像される時刻、および位置とに依存する属性によって分類され、前記選択手段は、前記推定手段によって推定された親密度と、前記日時計数手段によって計数された現在日時、および前記測位手段によって測位された現在の位置とに基づいて、前記複数の属性別画像データベース手段の内から、現在、前記撮像手段によって撮像された対象画像に対するユーザとの親密度と、現在の日時、および現在の位置とに対応した属性別画像データベース手段を選択することを特徴とする。 Invention of claim 4, in addition to claim 1, a sundial number means for counting the current time and date, further comprising a positioning means for positioning a current position, the attribute-based image database means, wherein and closeness of the user with respect to the target image, the time to be captured by the imaging unit, and is classified by the attributes depending on the position, the selection means includes a closeness estimated by said estimating means, said sundial number Based on the current date and time counted by the means and the current position measured by the positioning means, the user for the target image currently picked up by the image pickup means is selected from the plurality of attribute-specific image database means. The attribute-specific image database means corresponding to the intimacy, the current date and time, and the current position is selected .

請求項５及び９に記載の発明は、認識すべき対象画像を撮像する撮像ステップと、前記対象画像に対するユーザとの親密度を推定する推定ステップと、前記親密度に依存する複数の属性毎に分類され、各属性に分類された参照画像と、各参照画像に各々対応して一意に付与された認識ＩＤとをそれぞれ記憶する複数の属性別画像データベースの内から、現在、前記推定ステップの処理によって推定されたユーザとの親密度に対応した属性別画像データベースを選択する選択処理ステップと、前記選択処理ステップの処理によって選択された属性別画像データベースに記憶された参照画像を参照して、前記撮像ステップの処理で撮像された対象画像に対応した認識ＩＤを特定する対象特定処理ステップとを具備し、前記推定ステップの処理は、前記撮像ステップの処理によって撮像される全領域に対する対象画像の占有面積比、または過去において前記対象特定ステップの処理によって前記対象画像が特定された回数、または前記占有面積比と前記回数の組み合わせに基づいて、前記対象画像に対するユーザとの親密度を推定することを特徴とする。 The invention according to claims 5 and 9 includes an imaging step of capturing a target image to be recognized, an estimation step of estimating a closeness of the user with respect to the target image, and a plurality of attributes depending on the closeness. are categorized, and the reference images classified into each attribute, each corresponding uniquely granted recognition ID and from the plurality of demographic image database for storing the respective reference images, now, the process of the estimating step A selection process step of selecting an attribute-specific image database corresponding to the intimacy with the user estimated by the above, and referring to a reference image stored in the attribute-specific image database selected by the process of the selection process step, ; and a target specifying process step of identifying the recognition ID corresponding to the target image captured by the process of the imaging step, the process of the estimation step, Based on the occupation area ratio of the target image with respect to the entire region imaged by the processing of the imaging step, the number of times the target image has been identified in the past by the processing of the target identification step, or a combination of the occupation area ratio and the number of times Then, the closeness with the user for the target image is estimated .

請求項６及び１０に記載の発明は、請求項５及び９に加えて、現在の日時を計数する日時計数処理ステップをさらに備え、前記属性別画像データベースは、前記対象画像に対するユーザとの親密度と、前記撮像ステップの処理によって撮像される時刻とに依存する属性に分類され、前記選択処理ステップの処理は、前記推定処理ステップの処理によって推定された親密度と、前記日時計数処理ステップの処理によって計数された現在日時とに基づいて、前記複数の属性別画像データベースの内から、現在、前記撮像ステップの処理によって撮像された対象画像に対するユーザとの親密度と、現在の日時とに対応した属性別画像データベースを選択することを特徴とする。 In addition to the fifth and ninth aspects, the invention described in claims 6 and 10 further includes a date and time counting processing step for counting the current date and time, and the attribute-specific image database is a parent to the user for the target image. and density, are classified to the attributes that depend on the time taken by the processing of the image pickup step, said processing of the selected process step, the parent density estimated by the processing of the estimation processing step, the sundial number of processing steps based on the current date and time counted by the processing, from among the plurality of attribute-based image database, now the closeness of the user with respect to the captured target image by the processing of the imaging step, on the current date and time A corresponding attribute-specific image database is selected .

請求項７及び１１に記載の発明は、請求項５及び９に加えて、現在の位置を測位する測位処理ステップをさらに備え、前記属性別画像データベースは、前記対象画像に対するユーザとの親密度と、前記撮像ステップの処理によって撮像される位置とに依存する属性によって分類され、前記選択処理ステップの処理は、前記推定処理ステップの処理によって推定された親密度と、前記測位処理ステップの処理によって測位された現在の位置とに基づいて、前記複数の属性別画像データベースの内から、現在、前記撮像ステップの処理によって撮像された対象画像に対する親密度と、現在の位置とに対応した属性別画像データベースを選択することを特徴とする。 In addition to the fifth and ninth aspects, the invention according to the seventh and eleventh aspects further includes a positioning processing step for positioning a current position, and the attribute-specific image database includes a closeness with the user for the target image. are classified by attribute that depends on the position taken by the processing of the image pickup step, said processing of the selected process step, the parent density estimated by the processing of the estimation process step, positioning the processing of the positioning processing step based on the has been the current position, from the plurality of demographic image database, now and familiarity for the captured target image by the processing of the image pickup step, the attribute-based image database corresponding to the current position It is characterized by selecting .

請求項８及び１２に記載の発明は、請求項５及び９に加えて、現在の日時を計数する日時計数処理ステップと、現在の位置を測位する測位処理ステップとをさらに備え、前記属性別画像データベースは、前記対象画像に対するユーザとの親密度と、前記撮像ステップの処理によって撮像される時刻、および位置とに依存する属性によって分類され、前記選択処理ステップの処理は、前記推定処理ステップの処理によって推定された親密度と、前記日時計数処理ステップの処理によって計数された現在日時、および前記測位処理ステップによって測位された現在の位置とに基づいて、前記複数の属性別画像データベースの内から、現在、前記撮像ステップの処理によって撮像された対象画像に対する親密度と、現在の日時、および現在の位置とに対応した属性別画像データベースを選択することを特徴とする。 In addition to claims 5 and 9 , the invention described in claims 8 and 12 further comprises a date and time counting processing step for counting the current date and time, and a positioning processing step for positioning the current position, and each attribute is classified. image database, and closeness to the user with respect to the target image, the time taken by the processing of the image pickup step, and is classified by the attributes depending on the position, the processing of the selected processing steps of the estimation process step Based on the familiarity estimated by the process , the current date and time counted by the process of the date and time counting process step , and the current position measured by the positioning process step, the plurality of attribute-specific image databases from now, the closeness to the target image captured by the process of the imaging step, and the current date, and the current position And selects the corresponding attribute-based image database.

本発明によれば、例えば、人物や店舗などに掲示されたロゴマークなどを撮像し、その対象画像から個人や法人等を特定する際に、認識が行われる日時や場所、対象画像に対する親密度などの、その場の状況に対応した属性の画像データベースが自動的に選択され、選択された属性の画像データベースに記憶された参照画像を参照して、撮像された画像から個人や法人等を特定するようにしたので、画像データベースに登録される参照画像の総数が増加しても、属性別に画像データベースの細分化が図られ、１つの属性に対応する画像データベースに登録される参照画像の個数を少なく抑えることができ、しかも認識が行われる日時や場所や親密度などの状況に対応して、最適な属性の画像データベースが自動的に選択されるので、ユーザが意図的に選択操作を行わなくても、参照画像の個数が絞り込まれると共に、状況という付加的な情報によって参照画像が絞り込まれるので、認識率の向上を図ることができる。 According to the present invention, for example, when a logo mark or the like posted on a person or a store is imaged and an individual or a corporation is identified from the target image, the date and place where the recognition is performed, the familiarity with the target image An image database with an attribute corresponding to the situation at the moment is automatically selected, and a reference image stored in the image database with the selected attribute is referenced to identify an individual or a corporation from the captured image. As a result, even if the total number of reference images registered in the image database increases, the image database is segmented by attribute, and the number of reference images registered in the image database corresponding to one attribute is determined. The image database with the most appropriate attributes is automatically selected according to the situation such as date, place, familiarity, etc. Without diagrammatically perform selection operation, with the number of the reference image is narrowed, since the reference image is narrowed by the additional information that situation, it is possible to improve the recognition rate.

以下、図面を参照して本発明の実施の一形態について説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

（１）概要
図１（ａ）は、本発明の実施の一形態である縫い包み１００の外観を示す外観図である。縫い包み１００は、ネコを模したキャラクタ人形であり、その内部には顔認識装置２０（後述する）が内蔵される。顔認識装置２０は、縫い包み１００がユーザの肩に乗せられた状態（同図（ｂ）参照）で「肩乗せモード」として動作し、一方、同図（ｃ）に示すように、ユーザの膝あるいは机の上に乗せられた状態で「膝乗せモード」として動作する。これらのモードは、後述するモード切替スイッチ９によって自動的に切り替えられる。 (1) Outline FIG. 1A is an external view showing an external appearance of a sewing packet 100 according to an embodiment of the present invention. The stitch wrap 100 is a character doll that imitates a cat, and a face recognition device 20 (described later) is built therein. The face recognition device 20 operates as a “shouldering mode” in a state in which the stitching 100 is placed on the user's shoulder (see FIG. 5B), while the user's face recognition device 20 operates as shown in FIG. It operates as “knee-mounting mode” while it is placed on the knee or desk. These modes are automatically switched by a mode switch 9 described later.

「肩乗せモード」では撮像した画像中に写る顔が誰の顔であるかを識別して人物同定し、その人物の名前をユーザにネコ語（鳴き声）で教えたり、撮像した画像中に写る顔を新たにデータベース登録したり、撮像した画像を、後述するアルバム閲覧処理で表示される画像として順次記録する処理を実行する。 In “shoulder-mounting mode”, the person in the captured image is identified by identifying who the face is, and the name of the person is taught to the user in a cat (scream) or captured in the captured image. Processing for newly registering a face in the database and sequentially recording captured images as images to be displayed in an album browsing process described later is executed.

一方、「膝乗せモード」では図１（ｃ）に図示するように、縫い包み１００の尻尾部分に設けられたビデオ出力ポート１０（後述する）に、外部モニタＭのケーブルＣＡを接続し、上記「肩乗せモード」下で撮像した一連の顔画像を外部モニタＭにおいてアルバムとして閲覧させる処理を実行する。これら各モードの処理動作については追って詳述する。 On the other hand, in the “knee-mounting mode”, as shown in FIG. 1C, the cable CA of the external monitor M is connected to the video output port 10 (described later) provided in the tail portion of the sewing wrap 100. A process of browsing a series of face images taken under the “shouldering mode” as an album on the external monitor M is executed. The processing operation in each mode will be described in detail later.

（２）顔認識装置２０の構成
次に、図２を参照して顔認識装置２０の構成について説明する。図２において、ＣＰＵ１はＲＯＭ２に格納される制御プログラム（ＢＩＯＳ）を実行して装置各部の入出力インタフェースを確立させた後、ＨＤＤ４（ハードディスク装置）に格納されるＯＳプログラムをＲＡＭ３にロードして起動させる。ＯＳ（オペレーティングシステム）プログラム起動後、ＣＰＵ１はユーザ操作によって実行指示されるアプリケーションプログラムをＨＤＤ４から読み出してＲＡＭ３にロードして実行する。 (2) Configuration of Face Recognition Device 20 Next, the configuration of the face recognition device 20 will be described with reference to FIG. In FIG. 2, the CPU 1 executes a control program (BIOS) stored in the ROM 2 to establish an input / output interface for each part of the device, and then loads an OS program stored in the HDD 4 (hard disk device) into the RAM 3 and starts it. Let After the OS (operating system) program is started, the CPU 1 reads an application program instructed to be executed by a user operation from the HDD 4, loads it into the RAM 3, and executes it.

なお、ここで言うアプリケーションプログラムは、後述する「データベース選択処理」、「人物登録処理」および「名前お知らせ処理」などから構成されるメインルーチンを含んで構成されている。 The application program referred to here includes a main routine including “database selection processing”, “person registration processing”, “name notification processing”, and the like, which will be described later.

ＲＡＭ３は、各種プログラムデータを格納するプログラムエリア、ＣＰＵ１の演算結果を一時記憶するワークエリアおよびＣＣＤカメラ７によって撮像される画像を一時記憶する画像データエリアを備える。ＨＤＤ４は、各種プログラムを記憶すると共に、各種データベース、テーブルデータおよび音声ファイルを記憶する。 The RAM 3 includes a program area for storing various program data, a work area for temporarily storing the calculation result of the CPU 1, and an image data area for temporarily storing an image captured by the CCD camera 7. The HDD 4 stores various programs and various databases, table data, and audio files.

ＨＤＤ４に記憶される各種データベースとは、顔認識時に参照される複数の画像データベースＩＤＢおよび親密度データベースＦＤＢを指す。複数の画像データベースＩＤＢとは、顔認識が行われる状況に対応した複数の属性毎に設けられ、各属性に含まれる人物の参照顔画像をそれぞれ記憶するデータベースである。 The various databases stored in the HDD 4 indicate a plurality of image database IDBs and closeness databases FDB that are referred to during face recognition. The plurality of image database IDBs are databases that are provided for each of a plurality of attributes corresponding to a situation where face recognition is performed, and store a reference face image of a person included in each attribute.

具体的には、例えば図３に図示する一例のように、ユーザが勤務する会社に属する人物（上司や同僚、部下など）の参照顔画像を登録した画像データベースＩＤＢ１や、友人・知人の参照顔画像を登録した画像データベースＩＤＢ２等、ユーザの公私にわたる人間関係を属性別に分けて人物の参照顔画像を登録したデータベースである。 Specifically, as in the example illustrated in FIG. 3, for example, an image database IDB1 in which reference face images of persons belonging to a company where the user works (such as a boss, a colleague, and a subordinate) are registered, or reference faces of friends / acquaintances This is a database in which a person's public / private human relationship is classified by attribute and registered as a reference face image of a person, such as an image database IDB2 in which images are registered.

これら画像データベースＩＤＢは、後述するように、顔認識が行われる状況に応じて自動的に選択される。各画像データベースＩＤＢは、少なくとも認識ＩＤと、それに対応付けられた人物の参照顔画像データとから構成されるレコードを複数記憶管理する。認識率の低下を避けるため、１つの画像データベース当り１０人程度の参照顔画像データを登録することが好ましい。 These image database IDBs are automatically selected according to the situation where face recognition is performed, as will be described later. Each image database IDB stores and manages a plurality of records including at least a recognition ID and a reference face image data of a person associated therewith. In order to avoid a decrease in the recognition rate, it is preferable to register reference face image data of about 10 people per image database.

親密度データベースＦＤＢとは、各画像データベースＩＤＢと連係するリレーショナルデータベースであって、各画像データベースＩＤＢに登録された人物の認識ＩＤ毎の親密度、後述するCCDカメラ７によって撮像される全領域に対する対象画像の占有面積比、および過去において同じ対象画像として特定された認識回数を記憶管理するものであり、その一例を図４に示す。 The intimacy database FDB is a relational database linked to each image database IDB, and the intimacy for each person's recognition ID registered in each image database IDB, the target for all areas imaged by the CCD camera 7 described later The occupation area ratio of the image and the number of times of recognition identified as the same target image in the past are stored and managed, and an example is shown in FIG.

ここで言う親密度とは、顔認識された人物の顔領域の大きさおよび認識回数に応じて定まる値である。例えば、図１１（ａ）に示す様に認識した顔領域Ａ１が大きく、かつ同一人物として認識した回数も多ければ、ユーザと親密な関係にあるとして親密度が高く、これに対し、例えば図１１（ｂ）に示す様に顔領域Ａ２が小さく、かつ同一人物として認識した回数も少なければ、ユーザとあまり親密な関係でないとして親密度が低くなるよう定義されている。 The familiarity referred to here is a value determined according to the size of the face area of the person whose face is recognized and the number of times of recognition. For example, as shown in FIG. 11 (a), if the recognized face area A1 is large and the number of times of recognition as the same person is large, the closeness is high because it is intimately related to the user. As shown in (b), if the face area A2 is small and the number of times of recognition as the same person is small, it is defined that the intimacy is low because it is not so close to the user.

ここで、顔認識装置２０で定義された親密度について、より具体的に説明する。図１１（ａ）に示す顔領域Ａ１とは、後述する輝度パターンによって顔を検出する際に算出される領域で、個人を特定するために必要な顔面の内の眼、鼻、口をほぼ内包する矩形領域を意味する。そして、顔領域Ａ１が大きいとは、撮像される全領域に対する顔領域Ａ１（対象画像）が占める割合、すなわち占有面積比が大であることを意味し、この場合、対象となる人物との距離が近く、親密度がより高いものとの推定が成り立つ。 Here, the familiarity defined by the face recognition device 20 will be described more specifically. The face area A1 shown in FIG. 11A is an area calculated when a face is detected based on a luminance pattern described later, and substantially includes eyes, nose, and mouth in the face necessary for specifying an individual. Means a rectangular area. The large face area A1 means that the ratio of the face area A1 (target image) to the entire imaged area, that is, the occupied area ratio is large. In this case, the distance from the target person Is close, and it is estimated that the intimacy is higher.

一方、図１１（ｂ）に示す様に顔領域Ａ２が小さいとは、撮像される全領域に対する顔領域Ａ２（対象画像）が占める割合、すなわち占有面積比が小であることを意味し、この場合、対象となる人物との距離が遠く、親密度がより低いものとの推定が成り立つ。 On the other hand, as shown in FIG. 11B, the small face area A2 means that the ratio of the face area A2 (target image) to the entire imaged area, that is, the occupied area ratio is small. In this case, it is estimated that the distance to the target person is far and the intimacy is lower.

以上の推定に基づき、親密度Ｒは、例えば、以下の算出式（１）に基づいて算出される。
Ｒ=a×(N／Nmax)＋（1−a）×（N／Nfull）‥‥‥（１） Based on the above estimation, the familiarity R is calculated based on the following calculation formula (1), for example.
R = a × (N / Nmax) + (1−a) × (N / Nfull) (1)

ここで、aは0.0から1.0の範囲内で任意に設定される重み付け定数、Ｎは現在認識している顔領域Ａ１に関して過去同一人物として認識した認識回数、Ｎmaxは親密度データベースＦＤＢに蓄積された認識回数の内の最大値(図４に示す例では、10回)、Nは現在認識している顔領域Ａ１の面積、Nfullは撮像される全領域の面積である。上述した、算出式（１）によって、図４に示す親密度Ｒが算出され、新たな親密度Ｒが算出される度に、親密度データベースＦＤＢが更新される。なお、顔領域Ａ１としては、顔面の内の眼、鼻、口をほぼ内包する矩形領域ではなく、顔面が内接する矩形領域の面積等、対象となる人物との距離に応じて変化するパラメータを利用しても勿論構わない。 Here, a is a weighting constant arbitrarily set within the range of 0.0 to 1.0, N is the number of recognitions recognized as the same person in the past with respect to the currently recognized face area A1, and Nmax is stored in the familiarity database FDB. The maximum number of times of recognition (in the example shown in FIG. 4, 10 times), N is the area of the currently recognized face area A1, and Nfull is the area of the entire area to be imaged. The intimacy R shown in FIG. 4 is calculated by the calculation formula (1) described above, and the intimacy database FDB is updated each time a new intimacy R is calculated. Note that the face area A1 is not a rectangular area that substantially includes the eyes, nose, and mouth in the face, but parameters that change according to the distance from the target person, such as the area of the rectangular area that the face inscribes. Of course, you can use it.

ＨＤＤ４に記憶されるテーブルデータとは、データベース選択テーブルＤＳＴおよび名前お知らせテーブルＮＩＴを指す。データベース選択テーブルＤＳＴは、現在日時およびユーザの現在位置に応じて、上述した複数の画像データベースＩＤＢのいずれを選択するかを指定するテーブルデータである。 The table data stored in the HDD 4 refers to the database selection table DST and the name notification table NIT. The database selection table DST is table data that specifies which of the plurality of image database IDBs described above is selected according to the current date and time and the current position of the user.

このデータベース選択テーブルＤＳＴでは、現在の日時および現在位置に対応させてユーザが任意に指定値を登録することが可能になっており、例えば日時が平日の日中で、現在位置が会社である場合には上述の画像データベースＩＤＢ１を選択する指定値を登録し、日時が土曜日または日曜日で、現在位置を特定しない場合には上述の画像データベースＩＤＢ２を選択する指定値を登録する。 In this database selection table DST, the user can arbitrarily register a specified value in correspondence with the current date and time and the current position. For example, when the date and time is a weekday day and the current position is a company Is registered with a designated value for selecting the above-mentioned image database IDB1, and when the date and time is Saturday or Sunday and the current position is not specified, the designated value for selecting the above-mentioned image database IDB2 is registered.

名前お知らせテーブルＮＩＴとは、顔認識にて同定された人物の認識ＩＤと、音声ファイルとを対応付けたテーブルデータであり、同定された人物の認識ＩＤに対応する音声ファイルを選択する際に用いられる。 The name notification table NIT is table data in which a recognition ID of a person identified by face recognition is associated with an audio file, and is used when selecting an audio file corresponding to the identified ID of the person. It is done.

サウンドシステム５は、ＣＰＵ１が再生指示する音声ファイルからＰＣＭ波形データを読み出し、それをＤ／Ａ変換して音声出力する。マウス６は、ユーザ操作に応じたポインティング信号やスイッチイベントを発生するものであり、縫い包み１００（図１参照）の右手部分ＲＨに設けられる。ＣＣＤカメラ７は、縫い包み１００の左目部分ＬＥに設けられ、ＣＰＵ１の制御の下に撮像して画像データを発生する。位置検出部８は、ＣＰＵ１の制御の下にＧＰＳ（Global Positioning System）信号を受信して現在位置を測位し、測位結果位置データをとして発生する。 The sound system 5 reads PCM waveform data from the audio file that the CPU 1 instructs to reproduce, D / A converts it, and outputs the audio. The mouse 6 generates a pointing signal or a switch event corresponding to a user operation, and is provided in the right hand portion RH of the sewing package 100 (see FIG. 1). The CCD camera 7 is provided in the left eye part LE of the stitching 100 and takes an image under the control of the CPU 1 to generate image data. The position detection unit 8 receives a GPS (Global Positioning System) signal under the control of the CPU 1, measures the current position, and generates position result position data.

モード切替スイッチ９は、縫い包み１００の腰部に設けられ、腰の曲げ伸ばしに応じたモード切替イベントを発生する。すなわち、図１（ｂ）に図示するように、縫い包み１００がユーザの肩に乗せられた場合には「肩乗せモード」を表すスイッチイベントを発生し、同図（ｃ）に示すように、ユーザの膝あるいは机の上に乗せられた場合には「膝乗せモード」を表すスイッチイベントを発生する。ビデオ出力ポート（ＶＧＡコネクタ）１０は、縫い包み１００の尻尾に設けられ、表示制御信号を出力する。 The mode changeover switch 9 is provided at the waist of the sewn wrap 100, and generates a mode changeover event corresponding to the bending and stretching of the waist. That is, as shown in FIG. 1B, when the stitching 100 is placed on the shoulder of the user, a switch event indicating “shouldering mode” is generated, and as shown in FIG. When it is placed on the user's knee or desk, a switch event indicating the “knee-mounting mode” is generated. A video output port (VGA connector) 10 is provided at the tail of the stitching 100 and outputs a display control signal.

なお、マウス６、ＣＣＤカメラ７、モード切替スイッチ９およびビデオ出力ポート９以外の構成要件は装置本体として縫い包み１００の胴体部分に内蔵されるようになっている。 The constituent elements other than the mouse 6, the CCD camera 7, the mode switch 9 and the video output port 9 are built in the body portion of the sewn wrap 100 as the apparatus body.

（３）顔認識アルゴリズムの詳細
上述した顔認識装置２０で使用される顔認識アルゴリズムの詳細については、本出願人が先に提案した特許文献１（特許公開２００２−１５７５９６号公報（対応米国特許公開２００３／００５９０９２号公報））に開示されている。 (3) Details of Face Recognition Algorithm Details of the face recognition algorithm used in the above-described face recognition device 20 are disclosed in Patent Document 1 (Patent Publication 2002-157596 (patent publication No. 2002-157596) previously proposed by the present applicant). 2003/0059092)).

すなわち、顔認識装置２０においては、次の３つの技術により顔認識を実現している。 That is, in the face recognition device 20, face recognition is realized by the following three techniques.

（i）複雑なシーンからの顔の検出
（ii）顔の実時間トラッキング
（iii）顔の識別 (I) Face detection from complex scenes (ii) Real-time tracking of faces (iii) Face identification

顔の検出手法は、対象物の識別に色、動き及びパターンを使うものに大別できるが、複雑なシーンの中から精度良く顔を切り出すためには顔のパターンを使うのが最も高性能である。しかしながら、シーン全体にわたり全てのスケールの顔を探索するのは、非常に処理が重いため、従来より、この手法は静止画に対してしか用いられていない。 Face detection methods can be broadly classified into those that use colors, movements, and patterns to identify objects, but using the face pattern is the most powerful method for accurately extracting faces from complex scenes. is there. However, searching for faces of all scales throughout the scene is very heavy and so far this technique has been used only for still images.

その一方で、実時間で顔を検出するシステムのほとんどは肌色の検出を行っている。しかし、色は照明条件により変化してしまうし、肌色にも人種や個人差があるため、単純な肌色認識のみでは有効な手段となり得ない。 On the other hand, most systems that detect faces in real time detect skin color. However, the color changes depending on the illumination conditions, and the skin color also has races and individual differences. Therefore, simple skin color recognition alone cannot be an effective means.

そこで、検出した顔パターンに含まれる色分布を元に顔の実時間トラッキングを行い、その動的変化に顔検出を適応していく手法を取る。また、推定した色分布から求めた顔領域に対してのみ、顔パターンの探索を行う。これにより、顔検出における演算時間の短縮を図る。 Therefore, a method is adopted in which real-time tracking of the face is performed based on the color distribution included in the detected face pattern, and face detection is adapted to the dynamic change. In addition, the face pattern is searched only for the face area obtained from the estimated color distribution. This shortens the calculation time in face detection.

さらに、顔の識別には、パターン探索により切り出した顔画像を用いて行う。そして、トラッキングに成功している間は同じ顔の識別結果として扱うことで複数の識別結果から総合的な判断を下すことを可能としている。 Furthermore, the face is identified using a face image cut out by pattern search. Then, while tracking is successful, it is possible to make a comprehensive judgment from a plurality of identification results by treating it as the identification result of the same face.

例えば、顔識別のための処理は、（i）複雑なシーンからの顔の検出を、輝度パターンによる顔の検出（顔認識）により行い、（ii）顔の実時間トラッキングを、色による顔の追跡（顔のトラッキング）により行い、そして、（iii）顔の識別を、差分顔を利用した人物識別を行っている。 For example, the processing for face identification is as follows: (i) face detection from a complex scene is performed by face detection (face recognition) using a luminance pattern, and (ii) real-time tracking of a face is performed using face detection by color. Tracking (face tracking) is performed, and (iii) face identification is performed by person identification using a differential face.

例えば、顔認識装置２０におけるこのような各処理は、モジュール或いはオブジェクトとして実現される。すなわち、顔認識装置２０は、顔の追跡モジュール、顔の検出モジュール及び顔の識別モジュールを備える。ここで、顔の追跡モジュールは、ＣＣＤカメラ７に撮像された画像内で変化する顔を追跡する顔追跡手段として機能し、顔の検出モジュールは、顔の追跡モジュールによる顔の追跡情報に基づいて、ＣＣＤカメラ７により撮像された画像内の顔の顔データを検出する顔データ検出手段として機能し、顔の識別モジュールは、顔の検出モジュールが検出した顔データに基づいて、特定顔を識別する顔識別手段として機能する。 For example, each process in the face recognition apparatus 20 is realized as a module or an object. That is, the face recognition device 20 includes a face tracking module, a face detection module, and a face identification module. Here, the face tracking module functions as a face tracking unit that tracks a face that changes in an image captured by the CCD camera 7, and the face detection module is based on the face tracking information by the face tracking module. The face identification module functions as face data detection means for detecting face data of a face in the image captured by the CCD camera 7, and the face identification module identifies a specific face based on the face data detected by the face detection module. It functions as a face identification means.

ここで、輝度パターンによる顔の検出では、入力画像中から顔を検出（認識）する処理を行う。具体的には、この顔の検出では、サポートベクタマシン（ＳＶＭ）による顔、非顔の識別を行う。この処理は、通常、環境変化に強く、計算量が多く、姿勢変化に弱いといった特徴がある。ここで、環境変化としては、例えば、周囲の照明の変化が挙げられる。 Here, in the face detection based on the luminance pattern, processing for detecting (recognizing) the face from the input image is performed. Specifically, in this face detection, a face or non-face is identified by a support vector machine (SVM). This processing is usually characterized by being resistant to environmental changes, requiring a large amount of calculation, and being vulnerable to posture changes. Here, as an environmental change, the change of ambient illumination is mentioned, for example.

また、色による顔の追跡では、入力画像中の顔を追跡する処理を行う。具体的には、この顔の追跡では、顔の色分布の推定、顔領域の推定を行う。この処理は、通常、環境変化に弱く、計算量が少なく、姿勢変化に強いといった特徴がある。 In the face tracking by color, a process for tracking the face in the input image is performed. Specifically, in this face tracking, estimation of the face color distribution and estimation of the face area are performed. This processing is usually characterized by being weak against environmental changes, having a small amount of calculation, and being strong against posture changes.

また、人物識別では、上述の顔の検出により認識された顔を特定の顔として識別する処理を行う。具体的には、この人物識別では、目、鼻の位置同定から位置合わせ（モーフィング）を行い、差分顔から同一人物判定を行う。 In the person identification, the face recognized by the above-described face detection is identified as a specific face. Specifically, in this person identification, positioning (morphing) is performed from the position identification of eyes and nose, and the same person is determined from the difference face.

顔識別のシステムでは、以上のような処理を顔識別における各工程として適宜分担させ、相互に補完しあう関係とすることで、精度を高くした顔検出を可能としている。例えば、次のように各処理が補完しあう関係とされている。 In the face identification system, the processing described above is appropriately shared as each step in face identification, and a mutually complementary relationship is achieved, thereby enabling face detection with high accuracy. For example, each process complements each other as follows.

例えば、色による顔の追跡では環境変化に弱いが、輝度パターンによる顔の検出では環境に強いことを利用することで補完している。逆に、輝度パターンによる顔の検出では計算量が多く、姿勢変化に弱いが、色による顔の追跡が計算量が少なく、姿勢変化に強いことを利用することが補完している。 For example, the tracking of a face by color is weak against environmental changes, but the detection of a face by a luminance pattern is complemented by utilizing the fact that it is strong to the environment. On the other hand, the detection of a face using a luminance pattern requires a large amount of calculation and is vulnerable to changes in posture. However, the use of the fact that tracking of a face by color has a small amount of calculation and is strong against posture change is supplemented.

すなわち、概略すれば次のように言える。もともと計算量が多い処理とされる顔の検出を実時間で行うことは困難である。しかし、所定のタイミングにより一定期間行うとすれば、計算量の負担は軽減される。一方で、各タイミングにおいて入力画像内から顔の位置まで毎回検出したのでは、負担が大きい。 In summary, the following can be said. It is difficult to detect in real time a face that is originally a process with a large amount of calculation. However, if it is performed for a certain period at a predetermined timing, the burden of calculation amount is reduced. On the other hand, if it is detected every time from the input image to the face position at each timing, the burden is large.

そこで、計算量が少なく、姿勢変化に強い処理を利用して、入力画像内の顔の変化を実時間で追跡して、この追跡結果から得られる入力画像内の顔の推定位置についてだけ顔を検出する処理をすることとすれば、顔の位置を特定した状態での顔の検出が可能になる。すなわち、大雑把ではあるが早い処理と、信頼性は高いが遅い処理とを組み合わせて、役割を分担することで、システム全体においては各処理相互間で補完させ、これにより、協調して実時間による顔検出を可能としている。 Therefore, using a process that requires a small amount of computation and is resistant to posture changes, the face change in the input image is tracked in real time, and the face is detected only for the estimated face position in the input image obtained from the tracking result. If the detection process is performed, the face can be detected with the face position specified. In other words, by combining rough but fast processing with high reliability but slow processing and sharing roles, the entire system is complemented between each processing, so that in real time in cooperation Face detection is possible.

これにより、多くの顔検出結果を短時間で獲得でき、この獲得した顔検出結果に基づいて顔の識別を行い、そのような処理を統計的に処理することで、高精度の顔識別を可能にしている。 As a result, many face detection results can be acquired in a short time, face identification is performed based on the acquired face detection results, and such processing is processed statistically, thereby enabling highly accurate face identification. I have to.

顔認識装置２０は、このような顔識別のシステムにより、シーン中から人間を見つけ出し（顔検出処理）、それを注視し（顔のトラッキング処理）、そこから得られた情報を用いた顔の識別による人物の特定（顔識別処理）に至るまで、全ての処理を自動的に行い、信頼性の高い顔識別を実現している。 The face recognition device 20 uses such a face identification system to find a person in the scene (face detection process), gaze at it (face tracking process), and identify the face using the information obtained therefrom. All processes are automatically performed until the person is identified (face identification process) by, thereby realizing highly reliable face identification.

（４）顔認識装置２０の動作
次に、図５〜図９を参照して上記構成による顔認識装置２０の動作について説明する。以下では、最初にメインルーチンの動作について説明した後、そのメインルーチンを構成するデータベース選択処理、人物登録処理および名前お知らせ処理の各動作を説明する。 (4) Operation of Face Recognition Device 20 Next, the operation of the face recognition device 20 configured as described above will be described with reference to FIGS. In the following, the operation of the main routine will be described first, and then the operations of database selection processing, person registration processing, and name notification processing that constitute the main routine will be described.

［１］メインルーチンの動作
ユーザが装置電源を投入して図５に図示するメインルーチンを実行させると、顔認識装置２０は、ステップＳＡ１に処理を進め、「肩乗せモード」下にあるか否かを判断する。ここで、図１（ｂ）に図示したように、縫い包み１００がユーザの肩に乗せられていると、モード切替スイッチ９が「肩乗せモード」を表すスイッチイベントを発生するので、判断結果は「ＹＥＳ」となり、ステップＳＡ２に進む。 [1] Operation of Main Routine When the user turns on the apparatus power and causes the main routine shown in FIG. 5 to be executed, the face recognition apparatus 20 proceeds to step SA1 and determines whether or not it is under “shouldering mode”. Determine whether. Here, as illustrated in FIG. 1B, when the stitching 100 is placed on the shoulder of the user, the mode change switch 9 generates a switch event indicating “shouldering mode”. "YES" is determined, and the process proceeds to Step SA2.

ステップＳＡ２では、ＣＣＤカメラ７に対して撮像実行を指示し、続くステップＳＡ３では、撮像した画像中から人の顔に相当する濃淡パターンをパターン識別して顔検出する。次いで、ステップＳＡ４では、顔画像の撮像条件（人物撮影条件）を満たしているか否か、すなわち先に撮像した画像中から検出した顔領域が所定の大きさを超えているかどうか、より具体的には、撮像される全領域に対する顔領域Ａ１（対象画像）が占める割合として算出される占有面積比が、所定値よりも大であるか否かを判断する。 In step SA2, the CCD camera 7 is instructed to execute imaging, and in the subsequent step SA3, a gray pattern corresponding to a human face is identified from the captured image to detect the face. Next, in step SA4, more specifically, whether or not a face image capturing condition (person photographing condition) is satisfied, that is, whether or not the face area detected from the previously captured image exceeds a predetermined size is determined. Determines whether the ratio of the occupied area calculated as the ratio of the face area A1 (target image) to the entire imaged area is larger than a predetermined value.

撮像した画像中から検出した顔領域が所定の大きさを超えていなければ、撮像条件を満たしていないとして判断結果は「ＮＯ」となり、上記ステップＳＡ２に処理を戻す。以後、撮像した画像中から検出した顔領域が所定の大きさを超えるまで撮像および顔検出を随時繰り返す。 If the face area detected from the captured image does not exceed a predetermined size, the determination result is “NO” because the imaging condition is not satisfied, and the process returns to step SA2. Thereafter, imaging and face detection are repeated as needed until the face area detected from the captured image exceeds a predetermined size.

そして、撮像した画像中から検出した顔領域が所定の大きさを超えて撮像条件を満たすと、ステップＳＡ４の判断結果が「ＹＥＳ」となり、顔画像を取得すべくＣＣＤカメラ７に対して撮像実行を指示する。これにより得られる顔画像データは、いったんＲＡＭ３の画像データエリアに一時記憶される。 If the face area detected from the captured image exceeds a predetermined size and satisfies the image capturing condition, the determination result in step SA4 is “YES”, and the CCD camera 7 is imaged to acquire the face image. Instruct. The face image data thus obtained is temporarily stored in the image data area of the RAM 3 once.

なお、上記ステップＳＡ４では、撮像した画像中から検出した顔領域が所定の大きさを超えているか否かを撮像条件としたが、これに限らず、例えば顔検出してから一定時間経過後に対象人物の顔画像を撮像する態様としても構わない。 In step SA4, whether or not the face area detected from the captured image exceeds a predetermined size is set as an imaging condition. However, the present invention is not limited to this. It does not matter as a mode for capturing a human face image.

さて、こうして顔画像データを取得すると、顔認識装置２０はステップＳＡ６に処理を進め、データベース選択処理を実行する。データベース選択処理では、ＨＤＤ４に格納される複数の画像データベース、すなわちユーザの公私にわたる人間関係を属性別に分け、各属性に含まれる人物の顔画像をそれぞれ登録した複数の画像データベースの内から、顔認識する場面に最も適する属性の画像データベースを選択する。具体的には、現在の日時およびユーザの現在位置に応じて、前述したデータベース選択テーブルＤＳＴから該当する画像データベースを選択する。 Now, when the face image data is acquired in this way, the face recognition device 20 advances the process to step SA6 and executes the database selection process. In the database selection process, face recognition is performed from among a plurality of image databases stored in the HDD 4, that is, from the plurality of image databases in which the human relations between the user's public and private are divided according to attributes and the face images of persons included in each attribute are respectively registered. Select an image database with attributes that best suit the scene you want to play. Specifically, the corresponding image database is selected from the database selection table DST described above according to the current date and time and the current position of the user.

次に、ステップＳＡ７ではユーザが縫い包み１００の右手部分ＲＨに設けられたマウス６の右ボタンをクリックして登録指示イベントを発生させた場合に人物登録処理を実行する。この処理では、上記ステップＳＡ５にて撮像した顔画像データに認識ＩＤを付与して、上記ステップＳＡ６にて選択された画像データベースに新規登録したり、名前お知らせテーブルＮＩＴに新規認識ＩＤに対応する音声ファイルをアサインする。 Next, in step SA7, the person registration process is executed when the user clicks the right button of the mouse 6 provided in the right hand portion RH of the stitching 100 to generate a registration instruction event. In this process, a recognition ID is assigned to the face image data imaged in step SA5 and newly registered in the image database selected in step SA6, or the voice corresponding to the new recognition ID in the name notification table NIT. Assign a file.

ステップＳＡ８では、上記ステップＳＡ６にて選択された画像データベースに登録される各人毎の登録顔画像データと、撮像して新たに得られた顔画像データとの差分値が各々算出され、この差分値が最も小さい登録顔画像データに対応付けられた個人を撮像画像中に写る人物と同定する顔認識処理を実行する。 In step SA8, a difference value between the registered face image data for each person registered in the image database selected in step SA6 and the face image data newly obtained by imaging is calculated. A face recognition process is performed for identifying an individual associated with the registered face image data having the smallest value as a person shown in the captured image.

次いで、ステップＳＡ９では、顔認識結果に基づき、前述した親密度データベースＦＤＢの内容を更新する。すなわち、顔認識にて同定された人物の認識ＩＤに対応する親密度を認識回数および顔領域の大きさに応じて更新する。また、このステップＳＡ９では、顔認識が完了した顔画像データをＲＡＭ３の画像データエリアから読み出してＨＤＤ４のアルバムフォルダ下に格納する。 In step SA9, the contents of the familiarity database FDB described above are updated based on the face recognition result. That is, the familiarity corresponding to the recognition ID of the person identified by face recognition is updated according to the number of recognitions and the size of the face area. In step SA9, the face image data for which face recognition has been completed is read from the image data area of the RAM 3 and stored in the album folder of the HDD 4.

なお、親密度を更新する態様としては、上記ステップＳＡ９の処理に限らず、時間の概念を採り入れることも可能である。すなわち、図４に図示した親密度データベースＦＤＢにおいて、顔認識された日時もデータベース項目として記憶管理しておき、前回顔認識した日時と今回顔認識した日時とから認識間隔を求め、求めた認識間隔が短ければ親密度を上げ、長ければ親密度を下げるようにすることもできる。 Note that the manner of updating the intimacy is not limited to the process of step SA9, and the concept of time can be adopted. That is, in the familiarity database FDB shown in FIG. 4, the date and time of face recognition are also stored and managed as database items, the recognition interval is obtained from the date and time of previous face recognition and the date and time of current face recognition, and the obtained recognition interval. It is possible to increase the intimacy if the length is short, and decrease the intimacy if the length is long.

次に、ステップＳＡ１０では、ユーザが縫い包み１００の右手部分ＲＨに設けられたマウス６の左ボタンをクリックして名前お知らせ指示イベントを発生させた場合に名前お知らせ処理を実行する。この処理では、名前お知らせテーブルＮＩＴを参照して、顔認識にて同定された人物の認識ＩＤに対応する音声ファイルを選択して再生する。 Next, in step SA10, name notification processing is executed when the user clicks the left button of the mouse 6 provided in the right hand portion RH of the stitching 100 to generate a name notification instruction event. In this process, referring to the name notification table NIT, an audio file corresponding to the recognition ID of the person identified by face recognition is selected and reproduced.

この後、前述のステップＳＡ１に処理を戻し、以後、ユーザが縫い包み１００を肩に乗せている状態ではステップＳＡ２〜ＳＡ１０から構成される肩乗せモードの動作を繰り返す。 Thereafter, the processing is returned to the above-described step SA1, and thereafter, in the state where the user puts the stitching 100 on the shoulder, the operation in the shoulder mode composed of steps SA2 to SA10 is repeated.

そして、図１（ｃ）に図示したように、縫い包み１００の尻尾部分に設けられたビデオ出力ポート（ＶＧＡコネクタ）１０に、外部モニタＭのケーブルＣＡに接続した状態で、その縫い包み１００をユーザの膝に乗せたりあるいは机に乗せると、モード切替スイッチ９が「膝乗せモード」を表すスイッチイベントを発生するので、ステップＳＡ２の判断結果が「ＮＯ」となり、ステップＳＡ１１に進み、アルバム閲覧処理を実行する。 Then, as shown in FIG. 1C, the sewn packet 100 is connected to the video output port (VGA connector) 10 provided at the tail portion of the sewn packet 100 while being connected to the cable CA of the external monitor M. When the user is placed on the knee or placed on the desk, the mode change switch 9 generates a switch event indicating “knee-mounting mode”, so that the determination result in step SA2 is “NO”, the process proceeds to step SA11, and the album browsing process is performed. Execute.

アルバム閲覧処理では、図６に図示するアルバム閲覧ウインドウＷを生成し、外部モニタＭに表示する。このアルバム閲覧ウインドウＷは、ＨＤＤ４のアルバムフォルダ内に格納された顔画像データをインデックス表示（サムネイル表示）するもので、これらインデックス表示された中から選択された顔画像データにフレーム（画面枠）を付加して一画面表示させる機能も有している。 In the album browsing process, an album browsing window W shown in FIG. 6 is generated and displayed on the external monitor M. This album browsing window W is an index display (thumbnail display) of face image data stored in the album folder of the HDD 4, and a frame (screen frame) is added to the face image data selected from the index display. In addition, it has a function to display one screen.

例えば、アルバム閲覧ウインドウＷ内の日付指定ボタン２０をマウス操作でクリックすると、ＨＤＤ４のアルバムフォルダ内に格納された全ての顔画像データに対応した撮影日付のリストが日付リストウインドウ２１に表示される。 For example, when the date designation button 20 in the album browsing window W is clicked by a mouse operation, a list of shooting dates corresponding to all face image data stored in the album folder of the HDD 4 is displayed in the date list window 21.

一方、アルバム閲覧ウインドウＷ内の人物指定ボタン２２をマウス操作でクリックすると、ＨＤＤ４のアルバムフォルダ内に格納された全ての顔画像データに対応した人物の認識ＩＤ（例えば、００１，００２，００３，…等）のリストが人物リストウインドウ２３に表示される。 On the other hand, when the person designation button 22 in the album browsing window W is clicked by a mouse operation, the person recognition IDs (for example, 001, 002, 003...) Corresponding to all the face image data stored in the album folder of the HDD 4 are clicked. Etc.) is displayed in the person list window 23.

これら日付リストウインドウ２１に表示された任意の日付、または人物リストウインドウ２３に表示された任意の人物の認識ＩＤをマウス操作でクリックして指定した後、表示ボタン２４をマウス操作でクリックすると、該当する日付もしくは人物ＩＤに対応する顔画像データがＨＤＤ４から読み出され、インデックス画像Ｐ１，Ｐ２，Ｐ３，Ｐ４としてサムネイル表示エリア２５に一覧表示される。 When an arbitrary date displayed in the date list window 21 or a recognition ID of an arbitrary person displayed in the person list window 23 is specified by clicking with the mouse, the corresponding button is clicked with the mouse. The face image data corresponding to the date or person ID is read from the HDD 4 and displayed as a list in the thumbnail display area 25 as index images P1, P2, P3, P4.

さらに、このサムネイル表示エリア２５に一覧表示されたインデックス画像Ｐ１，Ｐ２，Ｐ３，Ｐ４の中から任意の画像をマウス操作でクリックして指定すると、選択された顔画像データにフレームを付加した画像を表示する。 Further, when an arbitrary image is clicked and designated from among the index images P1, P2, P3, and P4 displayed as a list in the thumbnail display area 25, an image obtained by adding a frame to the selected face image data is selected. indicate.

その際、画面表示する顔画像データの認識ＩＤに基づき親密度データベースＦＤＢから該当人物の親密度を検索し、検索した親密度に対応したフレーム（画面枠）を選択する。例えば、親密度が高い人物の顔画像データを画面表示する場合には、図１２（ａ）に示す様に華やかなフレームを付与し、一方、親密度が低い人物の顔画像データを画面表示する場合には図１２（ｂ）に示す様に暗い感じのフレームを付与する等のエンターテイメント性を提供するようになっている。 At that time, the closeness of the corresponding person is searched from the closeness database FDB based on the recognition ID of the face image data displayed on the screen, and a frame (screen frame) corresponding to the searched closeness is selected. For example, when face image data of a person with a high degree of closeness is displayed on the screen, a gorgeous frame is added as shown in FIG. 12A, while face image data of a person with a low degree of closeness is displayed on the screen. In such a case, as shown in FIG. 12B, entertainment such as providing a dark frame is provided.

［２］データベース選択処理の動作
次に、図７を参照してデータベース選択処理の動作について説明する。上述したステップＳＡ６を介して本処理が実行されると、顔認識装置２０は図７に示すステップＳＢ１に処理を進め、稼働中のＯＳプログラム側から現在の日時データを取得する。次いで、ステップＳＢ２に進み、現在日時に対応する予定の有無を判断する。すなわち、常駐ソフトウェアとして稼働しているスケジュール帳（スケジュール管理ソフト）に、現在日時に対応した予定が登録されているかどうかを判断する。 [2] Operation of Database Selection Process Next, the operation of the database selection process will be described with reference to FIG. When this process is executed via step SA6 described above, the face recognition apparatus 20 advances the process to step SB1 shown in FIG. 7, and acquires the current date and time data from the operating OS program side. Next, the process proceeds to step SB2 to determine whether or not there is a plan corresponding to the current date and time. That is, it is determined whether or not a schedule corresponding to the current date and time is registered in a schedule book (schedule management software) operating as resident software.

そして、現在日時に対応した予定が登録されていれば、判断結果は「ＹＥＳ」となり、次のステップＳＢ３に処理を進め、スケジュール帳に設定されているデータベース指定値に対応した画像データベースを選択する。 If a schedule corresponding to the current date and time is registered, the determination result is “YES”, the process proceeds to the next step SB3, and an image database corresponding to the database designated value set in the schedule book is selected. .

一方、現在日時に対応した予定がスケジュール帳に登録されていない場合には、上記ステップＳＢ２の判断結果が「ＮＯ」となり、ステップＳＢ４に進む。ステップＳＢ４では、位置検出部８が発生するＧＰＳ位置情報から現在の場所を決定する。なお、ＧＰＳ信号を受信できず現在の場所を特定できない状態では、ＧＰＳ信号をロストした時点（例えば、屋内に入った時など）のＧＰＳ位置情報に基づき現在の場所を決定する。 On the other hand, if the schedule corresponding to the current date and time is not registered in the schedule book, the determination result in step SB2 is “NO”, and the flow advances to step SB4. In step SB4, the current location is determined from the GPS position information generated by the position detector 8. In a state where the GPS signal cannot be received and the current location cannot be specified, the current location is determined based on GPS position information at the time when the GPS signal is lost (for example, when entering the indoors).

続いて、ステップＳＢ５では、現在の日時およびユーザの現在位置に応じて、前述したデータベース選択テーブルＤＳＴから該当する画像データベースを選択する。これにより、顔認識する場面に最も適合した属性の画像データベースが選択される。具体的には、例えば日時が平日の日中で場所が会社であれば、前述の画像データベースＩＤＢ１が選択され、日時が土曜日または日曜日で場所を特定しない場合には前述の画像データベースＩＤＢ２が選択される。 Subsequently, in step SB5, the corresponding image database is selected from the database selection table DST described above according to the current date and time and the current position of the user. Thereby, the image database having the attribute most suitable for the scene for face recognition is selected. Specifically, for example, if the date is a weekday and the place is a company, the image database IDB1 is selected, and if the date is Saturday or Sunday and the location is not specified, the image database IDB2 is selected. The

［３］人物登録処理の動作
次に、図８を参照して人物登録処理の動作について説明する。前述したステップＳＡ７を介して本処理が実行されると、顔認識装置２０は図８に示すステップＳＣ１に処理を進め、登録指示イベントの有無を判断する。登録指示イベントが無ければ、判断結果は「ＮＯ」となり、何も処理せずにメインルーチン（図３参照）に復帰する。 [3] Operation of Person Registration Process Next, the operation of the person registration process will be described with reference to FIG. When this process is executed via step SA7 described above, face recognition device 20 advances the process to step SC1 shown in FIG. 8, and determines whether there is a registration instruction event. If there is no registration instruction event, the determination result is “NO”, and the processing returns to the main routine (see FIG. 3) without performing any processing.

一方、ユーザが縫い包み１００の右手部分ＲＨに設けられたマウス６の右ボタンをクリックして登録指示イベントを発生させると、判断結果が「ＹＥＳ」となり、次のステップＳＣ２に処理を進める。ステップＳＣ２では、メインルーチンのステップＳＡ５において撮像した顔画像データに新規認識ＩＤを付与して、上記データベース選択処理にて選択された画像データベースに新規登録する。 On the other hand, when the user clicks the right button of the mouse 6 provided in the right hand portion RH of the sewing wrap 100 to generate a registration instruction event, the determination result is “YES”, and the process proceeds to the next step SC2. In step SC2, a new recognition ID is assigned to the face image data captured in step SA5 of the main routine, and the face image data is newly registered in the image database selected in the database selection process.

次いで、ステップＳＣ３では、登録終了指示イベントが発生するまで待機する。そして、ユーザが縫い包み１００の右手部分ＲＨに設けられたマウス６の右ボタンを再度クリックして登録終了指示イベントを発生させると、判断結果が「ＹＥＳ」となり、ステップＳＣ４に進む。 Next, in step SC3, the process waits until a registration end instruction event occurs. When the user clicks again on the right button of the mouse 6 provided on the right hand portion RH of the sewn wrap 100 to generate a registration end instruction event, the determination result is “YES”, and the flow proceeds to step SC4.

ステップＳＣ４では、名前お知らせテーブルＮＩＴ中の未使用の音声ファイルに、新規認識ＩＤを対応付けて登録する。そして、ステップＳＣ５では、その新規認識ＩＤに対応付けた音声ファイルを再生する。これにより、画像データベースに新規登録された人物の名前を知らせる音声ファイルの内容（例えば「にゃ〜ん」というネコの鳴き声）がユーザに通知される。 In step SC4, a new recognition ID is registered in association with an unused voice file in the name notification table NIT. In step SC5, an audio file associated with the new recognition ID is reproduced. As a result, the user is notified of the contents of the audio file that informs the name of the person newly registered in the image database (for example, the cat's cry of “Nyan”).

［４］名前お知らせ処理の動作
次に、図９を参照して名前お知らせ処理の動作について説明する。前述したステップＳＡ１０を介して本処理が実行されると、顔認識装置２０は図９に示すステップＳＤ１に処理を進め、名前お知らせ指示イベントの有無を判断する。名前お知らせ指示イベントが無ければ、判断結果は「ＮＯ」となり、何も処理せずにメインルーチン（図５参照）に復帰する。 [4] Operation of Name Notification Process Next, the operation of the name notification process will be described with reference to FIG. When this process is executed via step SA10 described above, face recognition device 20 advances the process to step SD1 shown in FIG. 9, and determines whether there is a name notification instruction event. If there is no name notification instruction event, the determination result is “NO”, and no processing is performed, and the process returns to the main routine (see FIG. 5).

一方、ユーザが縫い包み１００の右手部分ＲＨに設けられたマウス６の左ボタンをクリックして名前お知らせ指示イベントを発生させると、判断結果が「ＹＥＳ」となり、ステップＳＤ２に進み、名前お知らせテーブルＮＩＴを参照して、顔認識にて同定された人物の認識ＩＤに対応する音声ファイルを選択して再生する。これにより、縫い包み１００は顔認識にて同定された人物の名前をネコ語（鳴き声）で知らせる。 On the other hand, when the user clicks the left button of the mouse 6 provided in the right hand portion RH of the sewing packet 100 to generate a name notification instruction event, the determination result is “YES”, the process proceeds to step SD2, and the name notification table NIT Referring to FIG. 4, an audio file corresponding to the recognition ID of the person identified by face recognition is selected and reproduced. As a result, the stitching 100 informs the name of the person identified by the face recognition in the cat language (scream).

以上のように、本実施の形態によれば、顔認識が行われる状況に対応した複数の属性、言い換えればユーザの公私にわたる人間関係を日時あるいは場所などの属性別に分け、各属性に含まれる人物の顔画像をそれぞれ登録した複数の画像データベースを設けておき、これら画像データベースの内から顔認識する場面に最も適する属性の画像データベースを選択し、選択した画像データベースを参照して撮像画像シーン中に写る顔が誰の顔であるかを識別して人物同定するようになっている。 As described above, according to the present embodiment, a plurality of attributes corresponding to the situation in which face recognition is performed, in other words, the personal relationship between the user's public and private is divided according to attributes such as date / time or place, and the persons included in each attribute A plurality of image databases each of which are registered face images are selected, an image database having an attribute most suitable for the face recognition scene is selected from these image databases, and the selected image database is referred to in the captured image scene. A person is identified by identifying who the face is in.

この為、データベース登録される人数が多い場合であっても、属性別に画像データベースを細分化するから、１つの画像データベース当りに登録される人数を最適化でき、しかも顔認識する場面に最も適合する属性の画像データベースを選択してそれに基づき顔認識する結果、認識率向上を図ることが可能になる。 For this reason, even if there are many people registered in the database, the image database is subdivided by attribute, so the number of people registered per image database can be optimized, and it is most suitable for face recognition situations. As a result of selecting an image database of attributes and performing face recognition based on the selected image database, it is possible to improve the recognition rate.

（５）変形例
上述した実施の一形態では、縫い包み１００に顔認識装置２０を内蔵させて撮像画像シーン中に写る顔が誰の顔であるか人物同定し、同定した人物の名前を知らせるようにしたが、本発明の要旨はこうした態様に限定されず種々変形が可能である。 (5) Modification In the embodiment described above, the face recognition device 20 is incorporated in the stitching 100 to identify the person who is the face in the captured image scene, and inform the name of the identified person. However, the gist of the present invention is not limited to such an embodiment, and various modifications are possible.

例えば図１０に示すように、縫い包み１００に替えて顔認識装置２０の本体部分をショルダーバックＢに収納し、マウス６やＣＣＤカメラ７を当該バックのショルダーベルトＳＢに配設させる形態にすることもできる。 For example, as shown in FIG. 10, the main body portion of the face recognition device 20 is accommodated in the shoulder back B in place of the sewing wrap 100, and the mouse 6 and the CCD camera 7 are arranged on the shoulder belt SB of the back. You can also.

また、本実施の一形態では、日時あるいは場所に応じて、顔認識する場面に最も適する画像データベースを選択するようにしたが、これに限らず、前述した親密度データベースＦＤＢ（図４参照）を用いて顔認識する場面に適した画像データベースを作成することも可能である。 In the present embodiment, the image database most suitable for the face recognition scene is selected according to the date and time or the place. However, the present invention is not limited to this, and the above-described familiarity database FDB (see FIG. 4) is used. It is also possible to create an image database suitable for scenes that are used for face recognition.

すなわち、顔検出時に得られる顔領域の大きさに基づき、その顔領域の大きさに対応した親密度を有する人物の認識ＩＤを親密度データベースＦＤＢから検索し、検索した認識ＩＤに対応する登録顔画像データを各画像データベースから抽出して新たな画像データベースを作成し、それを用いて顔認識すれば、撮像画像中から検出される顔領域の大きさに対応した親密度を有する人物に限定して顔認識が行われる為、認識率向上を図ることができる。 That is, based on the size of the face area obtained at the time of face detection, a recognition ID of a person having a closeness corresponding to the size of the face area is searched from the closeness database FDB, and the registered face corresponding to the searched recognition ID If image data is extracted from each image database to create a new image database and face recognition is performed using it, it is limited to persons with closeness corresponding to the size of the face area detected in the captured image. Therefore, the recognition rate can be improved.

また、上述の実施の形態においては、人物の顔から個人を特定する場合を一例として説明したが、本発明はこれに限らず、例えば、店舗などに掲示されたロゴマークなどを撮像し、そのロゴマークの画像から店舗名や法人等を特定する画像認識装置に適用し、認識が行われる日時や場所、対象となるロゴマーク画像に対する親密度などの、その場の状況に対応した属性の画像データベースを自動的に選択し、選択された属性の画像データベースに記憶された参照画像を参照して、撮像されたロゴマークの画像から店舗名や法人等を特定するようにしても構わない。 In the above-described embodiment, the case where an individual is specified from the face of a person has been described as an example. However, the present invention is not limited thereto, and for example, a logo mark posted at a store or the like is imaged. Applicable to image recognition devices that identify store names, corporations, etc. from logo mark images, images with attributes corresponding to the current situation, such as date and time of recognition, familiarity with the target logo mark image, etc. A database may be automatically selected, and a store name, a corporation, or the like may be specified from the captured logo mark image with reference to a reference image stored in the image database of the selected attribute.

さらに、本発明の要旨は、上述の実施の形態のみならず、撮像機能やＧＰＳ位置検出機能を備えた携帯電話もしくは撮像機能、ＧＰＳ位置検出機能および無線通信機能を有する携帯端末にも適用可能である。その場合、携帯電話あるいは携帯端末はＣＰＵの演算処理能力が充分でないものが多い為、端末側で撮像した画像および撮像位置を、ネットワークを介して顔認識処理するサーバ側へ送出し、サーバ側が顔認識した結果を端末側に返送する形態とすれば良い。 Further, the gist of the present invention can be applied not only to the above-described embodiment, but also to a mobile phone having an imaging function or a GPS position detection function or a mobile terminal having an imaging function, a GPS position detection function, and a wireless communication function. is there. In that case, since many mobile phones or mobile terminals do not have sufficient CPU processing capability, the image captured at the terminal side and the imaging position are sent to the server side that performs face recognition processing via the network. The recognized result may be returned to the terminal side.

本発明による実施の一形態である縫い包み１００の外観およびその動作モードを説明するための図である。It is a figure for demonstrating the external appearance and the operation mode of the sewing packet 100 which are one Embodiment by this invention. 顔認識装置２０の構成を示すブロック図である。2 is a block diagram illustrating a configuration of a face recognition device 20. FIG. 画像データベースＩＤＢ１，ＩＤＢ２の概念を示す概念図である。It is a conceptual diagram which shows the concept of image database IDB1, IDB2. 親密度データベースＦＤＢの概念を示す概念図である。It is a conceptual diagram which shows the concept of the familiarity database FDB. メインルーチンの動作を示すフローチャートである。It is a flowchart which shows operation | movement of a main routine. アルバム閲覧処理に画面表示されるＧＵＩ画面の一例を示す図である。It is a figure which shows an example of the GUI screen displayed on a screen by an album browsing process. データベース選択処理の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a database selection process. 人物登録処理の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a person registration process. 名前お知らせ処理の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a name notification process. 変形例を示す図である。It is a figure which shows a modification. 親密度の定義を説明するための図である。It is a figure for demonstrating the definition of intimacy. アルバム閲覧処理にて画面表示される顔画像データに付加されるフレームの一例を示す図である。It is a figure which shows an example of the flame | frame added to the face image data displayed on a screen by an album browsing process.

Explanation of symbols

１ＣＰＵ，２ＲＯＭ，３ＲＡＭ，４ＨＤＤ，５サウンドシステム，６マウス，７ＣＣＤカメラ，８位置検出部，９モード切替スイッチ，１０ビデオ出力ポート，２０顔認識装置，１００縫い包み 1 CPU, 2 ROM, 3 RAM, 4 HDD, 5 Sound system, 6 Mouse, 7 CCD camera, 8 Position detection unit, 9 Mode switch, 10 Video output port, 20 Face recognition device, 100 Sewing

Claims

Imaging means for imaging a target image to be recognized;
Estimating means for estimating a closeness with a user for the target image;
A plurality of attribute-specific image databases each storing a reference image classified into a plurality of attributes depending on the familiarity, and a recognition ID uniquely assigned to each reference image. Means,
A selection means for selecting an attribute-specific image database means corresponding to the intimacy estimated by the estimation means from the plurality of attribute-specific image database means;
With reference to a reference image stored in the attribute-specific image database means selected by the selection means, and a target specifying means for specifying a recognition ID corresponding to the target image imaged by the imaging means,
The estimation means is based on the occupation area ratio of the target image with respect to the entire region imaged by the imaging means, or the number of times the target image has been identified by the target identification means in the past, or a combination of the occupation area ratio and the number of times. An image recognition apparatus characterized by estimating a closeness with a user for the target image .

It further comprises a date and time counting means for counting the current date and time,
The attribute-specific image database means is classified into attributes depending on a closeness with the user for the target image and a time when the image is picked up by the imaging means, and the selecting means is a closeness estimated by the estimating means. And, based on the current date and time counted by the date and time counting means, from among the plurality of attribute-specific image database means, the familiarity with the user for the target image currently captured by the imaging means, the image recognition apparatus according to claim 1, wherein the selecting the demographic image database means corresponding to the date and time.

A positioning means for positioning the current position;
The attribute-specific image database unit is classified according to an attribute depending on a closeness with a user with respect to the target image and a position captured by the imaging unit, and the selection unit is determined by the closeness estimated by the estimation unit If, on the basis of the current position as the positioning by said positioning means, from among the plurality of attribute-based image database unit, now the closeness of the user with respect to the captured target image by the imaging means, the current the image recognition apparatus according to claim 1, wherein the selecting demographic image database means corresponding to the position.

A date and time counting means for counting the current date and time;
A positioning means for positioning the current position;
The attribute-based image database means includes a degree of intimacy with the user with respect to the target image, are classified by the attributes that depend on the time and location to be imaged by said imaging means, said selection means is estimated by said estimating means Based on the familiarity, the current date and time counted by the date and time counting means , and the current position measured by the positioning means, the imaging means is currently selected from the plurality of attribute-specific image database means. image recognition apparatus familiarity and the current date and time, and claim 1, wherein the selecting demographic image database means corresponding to the current position of the user with respect to the captured target image by.

An imaging step of imaging a target image to be recognized ;
An estimation step for estimating a closeness with the user for the target image;
A plurality of attribute-specific image databases each storing a reference image classified into a plurality of attributes depending on the familiarity, and a recognition ID uniquely assigned to each reference image. from among a selection processing step of selecting the current, the attribute-image database corresponding to closeness of the user estimated by the processing of said estimating step,
A target identification processing step of identifying a recognition ID corresponding to the target image captured in the processing of the imaging step with reference to a reference image stored in the attribute-specific image database selected by the processing of the selection processing step; Equipped ,
The process of the estimation step is the ratio of the occupied area of the target image to the entire region imaged by the process of the imaging step, or the number of times the target image is specified by the process of the target specifying step in the past, or the occupied area ratio And the intimacy with the user for the target image based on the combination of the number of times and the number of times
Image recognition processing method characterized by.

A date and time counting step for counting the current date and time,
The attribute-specific image database is categorized into attributes that depend on a closeness with the user for the target image and a time imaged by the imaging step process, and the selection process step process includes the estimation process step . Based on the closeness estimated by the process and the current date and time counted by the process of the date and time counting process step , the image is currently captured by the process of the imaging step from the plurality of attribute-specific image databases. 6. The image recognition processing method according to claim 5 , wherein an attribute-specific image database corresponding to the closeness with the user for the target image and the current date and time is selected .

It further includes a positioning processing step for positioning the current position,
The attribute-specific image database is classified according to an attribute depending on a closeness with a user with respect to the target image and a position imaged by the imaging step processing, and the selection processing step processing is performed by the estimation processing step . a closeness estimated by processing, on the basis of the the process the current position as the positioning by the positioning process step, from among the plurality of attribute-based image database, currently taken by the processing of the imaging step object 6. The image recognition processing method according to claim 5 , wherein an attribute-specific image database corresponding to the closeness to the image and the current position is selected .

A date and time counting process step for counting the current date and time;
A positioning process step for positioning the current position;
The demographic image database, and closeness to the user with respect to the target image, the time taken by the processing of the image pickup step, and is classified by the attributes depends on the position, the processing of the selection processing step, the estimated Based on the closeness estimated by the processing of the processing step , the current date and time counted by the processing of the date and time counting processing step , and the current position measured by the positioning processing step, the plurality of attribute-specific images from among the database, the current, and familiarity for the subject image picked up by the processing in the imaging step, claim 5, characterized in that to select the current date and time, and the attribute-based image database corresponding to the current position The image recognition processing method described.

An imaging step of imaging a target image to be recognized ;
An estimation step for estimating a closeness with the user for the target image;
A plurality of attribute-specific image databases each storing a reference image classified into a plurality of attributes depending on the familiarity, and a recognition ID uniquely assigned to each reference image. from among a selection processing step of selecting the current, the attribute-image database corresponding to closeness of the user estimated by the processing of said estimating step,
A target identification processing step of identifying a recognition ID corresponding to the target image captured in the processing of the imaging step with reference to a reference image stored in the attribute-specific image database selected by the processing of the selection processing step; Equipped ,
The process of the estimation step is the ratio of the occupied area of the target image to the entire region imaged by the process of the imaging step, or the number of times the target image is specified by the process of the target specifying step in the past, or the occupied area ratio And the intimacy with the user for the target image based on the combination of the number of times and the number of times
An image recognition program characterized by that .

A date and time counting step for counting the current date and time,
The attribute-specific image database is categorized into attributes that depend on a closeness with the user for the target image and a time imaged by the imaging step process , and the selection process step process includes the estimation process step . Based on the closeness estimated by the process and the current date and time counted by the process of the date and time counting process step , the image is currently captured by the process of the imaging step from the plurality of attribute-specific image databases. The image recognition program according to claim 9 , wherein an attribute-specific image database corresponding to a user's familiarity with the target image and the current date and time is selected .

It further includes a positioning processing step for positioning the current position,
The attribute-specific image database is classified according to an attribute depending on a closeness with a user with respect to the target image and a position imaged by the imaging step processing , and the selection processing step processing is performed by the estimation processing step . a closeness estimated by processing, on the basis of the the process the current position as the positioning by the positioning process step, from among the plurality of attribute-based image database, currently taken by the processing of the imaging step object The image recognition processing program according to claim 9 , wherein an image database classified by attribute corresponding to a user's familiarity with an image and a current position is selected .

A date and time counting process step for counting the current date and time;
A positioning process step for positioning the current position;
The demographic image database, and closeness to the user with respect to the target image, the time taken by the processing of the image pickup step, and is classified by the attributes depends on the position, the processing of the selection processing step, the estimated The plurality of attributes based on the closeness estimated by the processing of the processing step , the current date and time counted by the processing of the date and time counting processing step , and the current position measured by the processing of the positioning processing step from among the different image database, now to the closeness to the target image captured by the process of the imaging step, and selects the current date and time, and the attribute-based image database corresponding to the current position according Item 10. The image recognition program according to Item 9 .