JP5267695B2

JP5267695B2 - Imaging apparatus, face recognition method and program thereof

Info

Publication number: JP5267695B2
Application number: JP2012045142A
Authority: JP
Inventors: 一記喜多
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2012-03-01
Filing date: 2012-03-01
Publication date: 2013-08-21
Anticipated expiration: 2027-08-31
Also published as: JP2012157016A

Abstract

PROBLEM TO BE SOLVED: To realize an imaging device which can improve face recognition accuracy, and to realize its program. SOLUTION: An imaging device determines whether or not a human face detection mode is set (S31), and if it is determined that the human face detection mode is set, the imaging device performs processing for detecting a human face by two-dimensional face detection processing (S32). If it is determined that there is no human face by the two-dimensional face detection processing, the imaging device performs the processing for detecting the human face by three-dimensional face detection processing (S35 and S36). On the other hand, if an animal face detection mode is set, the imaging device performs processing for detecting an animal-type face in the configured detection mode by the three-dimensional face detection processing (S36). COPYRIGHT: (C)2012,JPO&INPIT

Description

本発明は、撮像装置、顔認識方法及びそのプログラムに係り、詳しくは、顔を認識する機能を有した撮像装置、顔認識方法及びそのプログラムに関する。 The present invention relates to an imaging device , a face recognition method, and a program thereof, and more particularly, to an imaging device having a function of recognizing a face, a face recognition method, and a program thereof.

近年、電子カメラ等の撮像装置においては、被写体の顔を認識する技術が登場したことにより、該認識された顔に対してピントを合わせたり、顔が適正露出となるように露出条件を設定する技術がある（特許文献１）。 In recent years, in imaging devices such as electronic cameras, a technique for recognizing the face of a subject has appeared, so that the recognized face is focused and exposure conditions are set so that the face is properly exposed. There is technology (Patent Document 1).

公開特許公報特開２００７−０８１９９１JP Patent Publication No. 2007-081991

しかしながら、従来の顔認識は、人間の顔、それも正面の顔などある特定の向きから撮像された場合のみを対象としており、横顔やサングラスの装着時などは認識することができなかった。また、人間以外の顔、例えば、猫や犬などの顔は認識することができなかった。 However, the conventional face recognition is intended only when the human face is taken from a specific direction such as a front face, and cannot be recognized when wearing a side face or sunglasses. Further, faces other than humans, such as cats and dogs, could not be recognized.

そこで本発明は、かかる従来の問題点に鑑みてなされたものであり、顔認識の精度を向上させることができる撮像装置、顔認識方法及びそのプログラムを提供することを目的とする。 Therefore, the present invention has been made in view of such conventional problems, and an object thereof is to provide an imaging apparatus , a face recognition method, and a program thereof that can improve the accuracy of face recognition .

上記目的達成のため、請求項１記載の発明による撮像装置は、被写体を撮像する撮像手段と、被写体の顔を認識する際に実行する顔認識処理の内容が各々異なる複数の顔認識モードを備え、該複数の顔認識モードの中から何れかの顔認識モードを設定する顔認識モード設定手段と、前記撮像手段により撮像された画像データに対して、前記顔認識モード設定手段により設定された顔認識モードに応じた顔認識処理を実行することにより、該撮像された画像データ内にある顔を認識する顔認識手段と、を備え、前記顔認識モード設定手段は、設定可能な顔認識モードとして、認識対象となる被写体の種類の違いと、被写体の種類以外の条件の違いとの組み合わせに応じた複数の顔認識モードを含み、前記顔認識手段は、前記撮像手段により撮像される画像データに基づいて自動的に前記被写体の種類の違いと前記被写体の種類以外の条件の違いとを識別して実行する顔認識処理を選択することを特徴とする。 For the purposes achieved, the imaging apparatus according to a first aspect of the present invention, comprises imaging means for imaging an object, a plurality of face recognition mode, the contents of each of the different face recognition processing to be executed when recognizing the face of a subject a face recognition mode setting means for setting one of the face recognition mode from a said plurality of facial recognition mode, the image data captured by the imaging unit, set by the facial recognition mode setting means face Face recognition means for recognizing a face in the captured image data by executing face recognition processing according to the recognition mode, and the face recognition mode setting means is a settable face recognition mode. A plurality of face recognition modes corresponding to a combination of a difference in the type of subject to be recognized and a difference in conditions other than the type of subject, and the face recognition unit captures an image by the imaging unit And selects the face recognition process performed to identify the differences between automatically the non kinds of kinds of differences between the object of the subject condition based on image data.

また、例えば、請求項２に記載されているように、前記被写体の種類以外の条件の違いは、顔認識処理の条件の違いであり、前記顔認識手段は、前記撮像手段により撮像された画像データに対して、前記顔認識モード設定手段により設定された顔認識モードに対応する条件で顔認識処理を実行することにより、該撮像された画像データ内にある顔の中で、該顔認識モードに対応する種類の被写体の顔を認識するようにしてもよい。 Further, for example, as described in claim 2, the difference in conditions other than the type of the subject is a difference in conditions of face recognition processing, and the face recognition unit is an image captured by the imaging unit. to the data, by executing the face recognition mode setting means by the setting face recognition mode to the face recognition processing in the corresponding conditions, in the face in the image pickup has been the image data, said pigment recognition mode It is also possible to recognize the face of the type of subject corresponding to.

また、例えば、請求項３に記載されているように、前記顔認識処理の条件は、２次元的に顔を認識する２次元顔認識処理と、３次元的に顔を認識する３次元顔認識処理を含むようにしてもよい。 Further, for example, as described in claim 3, the condition for the face recognition processing includes two-dimensional face recognition processing for two-dimensional face recognition and three-dimensional face recognition for three-dimensional face recognition. Processing may be included.

また、例えば、請求項４に記載されているように、前記顔認識モード設定手段は、ユーザによって選択された顔認識モードを設定するようにしてもよい。 Further, for example, as described in claim 4, wherein the face recognition mode setting means may set the face recognition mode selected by the user.

また、例えば、請求項５に記載されているように、前記顔認識モード設定手段は、顔認識モードを自動的に選択して設定するようにしてもよい。 For example, as described in claim 5, the face recognition mode setting means may automatically select and set the face recognition mode .

また、例えば、請求項６に記載されているように、前記被写体の種類の違いは、人の顔と人以外の顔であり、前記顔認識手段は、前記顔認識モード設定手段により人の顔を対象とした第１の顔認識モードが設定された場合は、人の顔を認識する顔認識処理実行することにより、前記撮像された画像データ内にある人の顔を認識し、前記顔認識モード設定手段により人以外の顔を対象とした第２の顔認識モードが設定された場合は、人以外の種類の顔を認識するようにしてもよい。 Further, for example, as described in claim 6, the difference in the type of the subject is a human face and a face other than a person, and the face recognition means uses the face recognition mode setting means to When the first face recognition mode for the target is set, a face recognition process for recognizing a person's face is executed to recognize a person's face in the captured image data, and the face recognition When the second face recognition mode for a face other than a person is set by the mode setting means, a face other than a person may be recognized .

また、例えば、請求項７に記載されているように、前記顔認識処理の条件は、２次元的に顔を認識する条件と、３次元的に顔を認識する条件とを含み、前記顔認識手段は、前記顔認識モード設定手段により２次元的に顔を認識する条件に対応した第３の顔認識モードが設定された場合は、２次元的に顔を認識する２次元顔認識処理を実行することにより、前記撮像された画像データ内にある顔を認識し、前記顔認識モード設定手段により３次元的に顔を認識する条件に対応した第４の顔認識モードが設定された場合は、３次元的に顔を認識する３次元顔認識処理を実行することにより、前記撮像された画像データ内にある顔を認識するようにしてもよい。 Further, for example, as described in claim 7, the face recognition processing conditions include a condition for recognizing a face in two dimensions and a condition for recognizing a face in three dimensions, The means executes a two-dimensional face recognition process for two-dimensionally recognizing the face when the third face recognition mode corresponding to the condition for recognizing the face two-dimensionally is set by the face recognition mode setting means. When a fourth face recognition mode corresponding to a condition for recognizing a face in three dimensions is set by the face recognition mode setting means by recognizing a face in the captured image data, You may make it recognize the face in the said imaged image data by performing the three-dimensional face recognition process which recognizes a face three-dimensionally .

また、例えば、請求項８に記載されているように、前記複数の顔認識モードは、顔認識処理を切り換えて実行する第５の顔認識モードを含み、前記顔認識手段は、前記顔認識モード設定手段により前記第５の顔認識モードが設定された場合は、人の顔を認識する顔認識処理を実行することにより、前記撮像された画像データ内にある人の顔を認識し、この顔認識処理により人の顔が認識されなかった場合に、人以外の種類の顔を認識する顔認識処理に切り換えて実行することにより、前記撮像された画像データ内にある人以外の種類の顔を認識するようにしてもよい。 Further, for example, as described in claim 8, the plurality of face recognition modes includes a fifth face recognition mode for switching and executing face recognition processing, and the face recognition means includes the face recognition mode. When the fifth face recognition mode is set by the setting unit, a face recognition process for recognizing a person's face is executed to recognize a person's face in the captured image data. When a face of a person is not recognized by the recognition process, a face other than a person in the captured image data is changed to a face recognition process for recognizing a face other than a person. You may make it recognize .

また、例えば、請求項９に記載されているように、前記複数の顔認識モードは、顔認識処理を切り換えて実行する第６の顔認識モードを含み、前記顔認識手段は、前記顔認識モード設定手段により前記第６の顔認識モードが設定された場合は、２次元顔認識処理を実行することにより、前記撮像された画像データ内にある人の顔を認識し、この顔認識処理により人の顔が認識されなかった場合に、３次元顔認識処理に切り換えて実行することにより、前記撮像された画像データ内にある人以外の種類の顔を認識するようにしてもよい。 Further, for example, as described in claim 9, the plurality of face recognition modes includes a sixth face recognition mode for switching and executing face recognition processing, and the face recognition means includes the face recognition mode. When the sixth face recognition mode is set by the setting means, a two-dimensional face recognition process is executed to recognize a person's face in the captured image data. If the face is not recognized, a face other than a person in the captured image data may be recognized by switching to and executing the three-dimensional face recognition process .

また、例えば、請求項１０に記載されているように、前記顔認識手段は、前記顔認識モード設定手段により前記第１の顔認識モードが設定された場合は、前記撮像手段により撮像された画像データに対して、２次元顔認識処理を実行して、該撮像された画像データ内にある人の顔を認識し、前記顔認識モード設定手段により前記第２の顔認識モードが設定された場合は、前記撮像手段により撮像された画像データに基づいて、３次元顔認識処理を実行して、該撮像された画像データ内にある前記顔認識モード設定手段により設定された顔認識モードの種類の顔を認識するようにしてもよい。 Further, for example, as described in claim 10, when the first face recognition mode is set by the face recognition mode setting unit, the face recognition unit is an image captured by the imaging unit. for the data, by performing a two-dimensional face recognition processing to recognize a human face in the image pickup has been the image data, if the second face recognition mode is set by the face recognition mode setting means Performs a three-dimensional face recognition process based on the image data picked up by the image pickup means, and sets the type of the face recognition mode set by the face recognition mode setting means in the picked-up image data. You may make it recognize a face.

また、例えば、請求項１１に記載されているように、前記第２の顔認識モードは、更に、人以外の複数種類の被写体の各々を認識対象とする複数の顔認識モードに分割され、前記顔認識手段は、前記顔認識モード設定手段により前記第１の顔認識モードが設定された場合は、前記撮像手段により撮像された画像データに対して、２次元顔認識処理を実行して、該撮像された画像データ内にある人の顔を認識し、前記顔認識モード設定手段により前記第２の顔認識モードが設定された場合は、該設定された種類の顔検出モードに応じて、実行させる顔認識処理を、２次元顔認識処理と３次元顔認識処理とに切り替えることにより、前記撮像手段により撮像された画像データ内にある前記顔認識モード設定手段により設定された顔認識モードの種類の顔を認識するようにしてもよい。 Further, for example, as described in claim 11, the second face recognition mode is further divided into a plurality of face recognition modes in which each of a plurality of types of subjects other than a person is a recognition target, When the first face recognition mode is set by the face recognition mode setting means, the face recognition means performs a two-dimensional face recognition process on the image data picked up by the image pickup means, When a face of a person in the captured image data is recognized and the second face recognition mode is set by the face recognition mode setting means, execution is performed according to the set type of face detection mode. By switching the face recognition processing to be performed between two-dimensional face recognition processing and three-dimensional face recognition processing, the type of face recognition mode set by the face recognition mode setting means in the image data picked up by the image pickup means You may recognize the face.

また、例えば、請求項１２に記載されているように、前記顔認識手段は、
２次元顔認識処理の実行により顔を認識することができなかった場合は、前記撮像手段により撮像された画像データに基づいて、３次元顔認識処理を実行して、該撮像された画像データ内にある顔を認識するようにしてもよい。 For example, as described in claim 12, the face recognition unit includes:
If the face cannot be recognized by the execution of the two-dimensional face recognition process, the three-dimensional face recognition process is executed based on the image data picked up by the image pickup means, and the inside of the picked-up image data You may make it recognize the face which exists in.

また、例えば、請求項１３に記載されているように、前記顔認識手段は、
前記撮像手段により撮像された画像データの全領域、又は、画像データの画角の中央領域、又は、ユーザによって指定された領域の画像データに基づいて、顔認識処理を実行して、該撮像された画像データ内にある顔を認識するようにしてもよい。 For example, as described in claim 13, the face recognition unit includes:
Based on the image data of the entire area of the image data imaged by the imaging means, the central area of the angle of view of the image data, or the area specified by the user, the face recognition process is executed and the image is captured. A face in the image data may be recognized.

また、例えば、請求項１４に記載されているように、前記顔認識手段は、
２次元顔認識処理の実行により、顔を認識することができなかったが、顔らしきものが認識できた場合は、前記撮像手段により撮像された画像データの該顔らしきものがあると認識された領域の画像データに基づいて、３次元顔認識処理を実行して、該撮像された画像データ内にある顔を認識するようにしてもよい。 For example, as described in claim 14, the face recognition means includes
Although the face could not be recognized by the execution of the two-dimensional face recognition process, but the face-like object could be recognized, it was recognized that there was the face-like image data captured by the imaging means. Based on the image data of the area, a three-dimensional face recognition process may be executed to recognize a face in the captured image data.

また、例えば、請求項１５に記載されているように、前記顔認識手段は、
前記撮像手段により撮像された画像データ内にある顔を検出するようにしてもよい。 For example, as described in claim 15, the face recognition unit includes:
You may make it detect the face in the image data imaged by the said imaging means.

また、例えば、請求項１６に記載されているように、前記顔認識手段は、
前記撮像手段により撮像された画像データ内にある顔が何であるかを具体的に識別するようにしてもよい。 Further, for example, as described in claim 16, the face recognition means includes:
You may make it identify specifically what the face is in the image data imaged by the said imaging means.

また、例えば、請求項１７に記載されているように、前記撮像手段により撮像された画像データ内にある顔を検出する２次元顔検出手段を備え、
前記顔認識手段は、
前記２次元顔検出手段により顔が検出された場合は、前記撮像手段により撮像された画像データの前記２次元顔検出手段により検出された顔領域の画像データに基づいて、３次元顔認識処理を実行することにより、該撮像された画像データ内にある顔を識別するようにしてもよい。 Further, for example, as described in claim 17, two-dimensional face detection means for detecting a face in the image data imaged by the imaging means is provided,
The face recognition means
When a face is detected by the two-dimensional face detection means, a three-dimensional face recognition process is performed based on the image data of the face area detected by the two-dimensional face detection means of the image data picked up by the image pickup means. By executing, a face in the captured image data may be identified.

また、例えば、請求項１８に記載されているように、前記顔認識モード設定手段は、
前記顔認識手段により顔が認識されると、前記撮像手段により撮像された画像データに基づいて、前記顔認識手段により認識された顔が画像データ内のどこにあるかを２次元的に検出していくことにより、該認識された顔に対して追従していく追従顔認識モードに切り替えて設定するようにしてもよい。 Further, for example, as described in claim 18, the face recognition mode setting means includes:
When the face is recognized by the face recognition means, based on the image data picked up by the image pickup means, two-dimensionally detecting where the face recognized by the face recognition means is in the image data is detected. By going, the following face recognition mode may be set to follow the recognized face.

また、例えば、請求項１９に記載されているように、前記顔認識モード設定手段は、
前記顔認識手段による追従顔認識モードに応じた顔認識処理により人の顔が認識されなくなった場合は、前記追従顔認識モードが設定される前に設定されていた顔認識モードを再設定するようにしてもよい。 Further, for example, as described in claim 19, the face recognition mode setting means includes:
When a human face is no longer recognized by the face recognition process according to the following face recognition mode by the face recognition means, the face recognition mode set before the following face recognition mode is set is reset. It may be.

上記目的達成のため、請求項２０記載の発明による顔認識方法は、コンピュータに、被写体の顔を認識する際に実行する顔認識処理の内容が各々異なる複数の顔認識モードを備え、該複数の顔認識モードの中から何れかの顔認識モードを設定する顔認識モード設定処理と、画像データに対して、前記顔認識モード設定処理により設定された顔認識モードに応じた顔認識処理を実行することにより、該撮像された画像データ内にある顔を認識する顔認識処理と、を実行させる顔認識方法であって、前記顔認識モード設定処理は、設定可能な顔認識モードとして、認識対象となる被写体の種類の違いと、被写体の種類以外の条件の違いとの組み合わせに応じた複数の顔認識モードを含み、前記顔認識処理は、前記画像データに基づいて自動的に前記被写体の種類の違いと前記被写体の種類以外の条件の違いとを識別して実行する顔認識処理を選択することを特徴とする。
上記目的達成のため、請求項２１記載の発明によるプログラムは、撮像手段を備えたコンピュータを、被写体の顔を認識する際に実行する顔認識処理の内容が各々異なる複数の顔認識モードを備え、該複数の顔認識モードの中から何れかの顔認識モードを設定する顔認識モード設定手段、前記撮像手段により撮像された画像データに対して、前記顔認識モード設定手段により設定された顔認識モードに応じた顔認識処理を実行することにより、該撮像された画像データ内にある顔を認識する顔認識手段、として機能させ、前記顔認識モード設定手段は、設定可能な顔認識モードとして、認識対象となる被写体の種類の違いと、被写体の種類以外の条件の違いとの組み合わせに応じた複数の顔認識モードを含み、前記顔認識手段は、前記撮像手段により撮像される画像データに基づいて自動的に前記被写体の種類の違いと前記被写体の種類以外の条件の違いとを識別して実行する顔認識処理を選択することを特徴とする。 In order to achieve the above object, a face recognition method according to a twentieth aspect of the invention is provided with a plurality of face recognition modes, each of which includes different face recognition processes executed when recognizing the face of a subject. A face recognition mode setting process for setting any one of the face recognition modes from among the face recognition modes and a face recognition process corresponding to the face recognition mode set by the face recognition mode setting process are performed on the image data. A face recognition method for recognizing a face in the imaged image data, wherein the face recognition mode setting process includes a recognition target as a settable face recognition mode. A plurality of face recognition modes according to a combination of a difference in subject type and a difference in conditions other than the subject type, and the face recognition processing is automatically performed based on the image data. And selects the face recognition process performed to identify the differences between conditions other than the type of the object and the difference type of subject.
To achieve the above object, a program according to the invention described in claim 21 includes a plurality of face recognition modes, each of which includes different face recognition processes executed when a computer including an imaging unit recognizes the face of a subject. Face recognition mode setting means for setting any one of the plurality of face recognition modes, and the face recognition mode set by the face recognition mode setting means for the image data picked up by the image pickup means By executing face recognition processing according to the function, it is made to function as a face recognition unit that recognizes a face in the captured image data, and the face recognition mode setting unit recognizes as a settable face recognition mode. A plurality of face recognition modes corresponding to a combination of a difference in the type of the subject to be processed and a difference in conditions other than the type of the subject, and the face recognition means includes the imaging hand And selects the face recognition process performed to identify the differences between automatically the non kinds of kinds of differences between the object of the subject condition based on image data captured by.

本願発明によれば、顔認識の精度を向上させることができる。 According to the present invention, the accuracy of face recognition can be improved.

本発明の実施の形態のデジタルカメラのブロック図である。1 is a block diagram of a digital camera according to an embodiment of the present invention. メモリ１２に記録されている顔データテーブルの様子を示す図である。It is a figure which shows the mode of the face data table currently recorded on the memory. 第１の実施の形態のデジタルカメラ１の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the digital camera 1 of 1st Embodiment. 第１の実施の形態のデジタルカメラ１の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the digital camera 1 of 1st Embodiment. 設定された顔検出モードの種類に応じた顔検出処理の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the face detection process according to the kind of set face detection mode. １つ目の３次元の顔検出処理の動作を示すフローチャートである。It is a flowchart which shows the operation | movement of the 1st three-dimensional face detection process. 生成された被写体（ここでは猫科の顔のみ）の３次元モデルを３角形ポリゴンで表したときの様子を示す図である。It is a figure which shows a mode when the generated three-dimensional model of the to-be-photographed object (here only a cat department face) is represented by the triangular polygon. ２つ目の３次元の顔検出処理の動作を示すフローチャートである。It is a flowchart which shows the operation | movement of the 2nd three-dimensional face detection process. 第２の実施の形態のデジタルカメラ１の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the digital camera 1 of 2nd Embodiment. 第２の実施の形態のデジタルカメラ１の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the digital camera 1 of 2nd Embodiment. 設定された顔識別モードの種類に応じた顔識別処理の動作を示すフローチャートである。It is a flowchart which shows the operation | movement of the face identification process according to the kind of set face identification mode.

以下、本実施の形態について、本発明の撮像装置をデジタルカメラに適用した一例として図面を参照して詳細に説明する。
[第１の実施の形態]
Ａ．デジタルカメラの構成
図１は、本発明の撮像装置を実現するデジタルカメラ１の電気的な概略構成を示すブロック図である。
デジタルカメラ１は、撮影レンズ２、レンズ駆動ブロック３、絞り４、ＣＣＤ５、ドライバ６、ＴＧ（timing generator）７、ユニット回路８、画像生成部９、ＣＰＵ１０、キー入力部１１、メモリ１２、ＤＲＡＭ１３、フラッシュメモリ１４、画像表示部１５、バス１６を備えている。 Hereinafter, the present embodiment will be described in detail with reference to the drawings as an example in which the imaging apparatus of the present invention is applied to a digital camera.
[First embodiment]
A. Configuration of Digital Camera FIG. 1 is a block diagram showing a schematic electrical configuration of a digital camera 1 that implements the imaging apparatus of the present invention.
The digital camera 1 includes a photographing lens 2, a lens driving block 3, an aperture 4, a CCD 5, a driver 6, a TG (timing generator) 7, a unit circuit 8, an image generation unit 9, a CPU 10, a key input unit 11, a memory 12, a DRAM 13, A flash memory 14, an image display unit 15, and a bus 16 are provided.

撮影レンズ２は、図示しない複数のレンズ群から構成されるフォーカスレンズ、ズームレンズ等を含む。そして、撮影レンズ２にはレンズ駆動ブロック３が接続されている。レンズ駆動ブロック３は、フォーカスレンズ、ズームレンズをそれぞれ光軸方向に沿って駆動させるフォーカスモータ、ズームモータと、ＣＰＵ１０から送られてくる制御信号にしたがって、フォーカスモータ、ズームモータを駆動させるフォーカスモータドライバ、ズームモータドライバから構成されている（図示略）。 The photographing lens 2 includes a focus lens, a zoom lens, and the like that are constituted by a plurality of lens groups (not shown). A lens driving block 3 is connected to the photographing lens 2. The lens driving block 3 includes a focus motor and a zoom motor that drive the focus lens and the zoom lens along the optical axis direction, respectively, and a focus motor driver that drives the focus motor and the zoom motor according to a control signal sent from the CPU 10. And a zoom motor driver (not shown).

絞り４は、図示しない駆動回路を含み、駆動回路はＣＰＵ１０から送られてくる制御信号にしたがって絞り４を動作させる。
絞り４とは、撮影レンズ２から入ってくる光の量を制御する機構のことをいう。 The diaphragm 4 includes a drive circuit (not shown), and the drive circuit operates the diaphragm 4 in accordance with a control signal sent from the CPU 10.
The diaphragm 4 is a mechanism that controls the amount of light that enters from the photographing lens 2.

ＣＣＤ５は、ドライバ６によって駆動され、一定周期毎に被写体像のＲＧＢ値の各色の光の強さを光電変換して撮像信号としてユニット回路８に出力する。このドライバ６、ユニット回路８の動作タイミングはＴＧ７を介してＣＰＵ１０により制御される。なお、ＣＣＤ５はベイヤー配列の色フィルターを有しており、電子シャッタとしての機能も有する。この電子シャッタのシャッタ速度は、ドライバ６、ＴＧ７を介してＣＰＵ１０によって制御される。 The CCD 5 is driven by the driver 6 and photoelectrically converts the intensity of light of each color of the RGB value of the subject image for every fixed period and outputs it to the unit circuit 8 as an imaging signal. The operation timing of the driver 6 and the unit circuit 8 is controlled by the CPU 10 via the TG 7. The CCD 5 has a Bayer color filter and also functions as an electronic shutter. The shutter speed of the electronic shutter is controlled by the CPU 10 via the driver 6 and TG7.

ユニット回路８には、ＴＧ７が接続されており、ＣＣＤ５から出力される撮像信号を相関二重サンプリングして保持するＣＤＳ（Correlated Double Sampling）回路、そのサンプリング後の撮像信号の自動利得調整を行なうＡＧＣ（Automatic Gain Control）回路、その自動利得調整後のアナログの撮像信号をデジタル信号に変換するＡ／Ｄ変換器から構成されており、ＣＣＤ５から出力された撮像信号はユニット回路８を経てデジタル信号として画像生成部９に送られる。 A TG 7 is connected to the unit circuit 8, a CDS (Correlated Double Sampling) circuit that holds the imaged signal output from the CCD 5 by correlated double sampling, and an AGC that performs automatic gain adjustment of the imaged signal after the sampling. An (Automatic Gain Control) circuit and an A / D converter that converts an analog image pickup signal after the automatic gain adjustment into a digital signal, the image pickup signal output from the CCD 5 passes through the unit circuit 8 as a digital signal. It is sent to the image generator 9.

画像生成部９は、ユニット回路８から送られてきた画像データに対してγ補正処理、ホワイトバランス処理などの処理を施すとともに、輝度色差信号（ＹＵＶデータ）を生成し、該生成された輝度色差信号の画像データはＤＲＡＭ１３（バッファメモリ）に記憶される。つまり、画像生成部９は、ＣＣＤ５から出力された画像データに対して画像処理を施す。 The image generation unit 9 performs processing such as γ correction processing and white balance processing on the image data sent from the unit circuit 8, generates a luminance color difference signal (YUV data), and generates the generated luminance color difference. The image data of the signal is stored in the DRAM 13 (buffer memory). That is, the image generation unit 9 performs image processing on the image data output from the CCD 5.

ＣＰＵ１０は、ＣＣＤ５への撮像制御、画像データの圧縮・伸張処理、フラッシュメモリ１４への記録処理、画像データの表示処理を行う機能を有するとともに、デジタルカメラ１の各部を制御するワンチップマイコンである。また、ＣＰＵ１０はクロック回路を含み、タイマーとしての機能も有する。 The CPU 10 is a one-chip microcomputer that has functions of performing imaging control on the CCD 5, image data compression / expansion processing, recording processing on the flash memory 14, and image data display processing, and controls each part of the digital camera 1. . Further, the CPU 10 includes a clock circuit and has a function as a timer.

特に、ＣＰＵ１０は、複数種類の顔検出モードの中から何れかの顔検出モードを設定する顔検出モード設定部１０１と、画像データ内にある顔を認識する顔認識部１０２とを有する。
この顔検出モード設定部１０１は、設定する顔検出モードの種類を記憶する記憶領域を備え、最新に設定された顔検出モードのみが記憶される。
また、この顔認識部１０２は、２次元のデータのみを用いて顔を認識する機能（２次元の顔認識処理）と、３次元のデータを用いて顔を認識する機能（３次元の顔認識処理）とを有する。また、この顔認識とは、画像データ内にある顔を検出したり、画像データ内にある顔が誰なのか？何であるか？を具体的に識別したりすることをいう。つまり、顔検出、顔識別を総称して顔認識という。 In particular, the CPU 10 includes a face detection mode setting unit 101 that sets any one of a plurality of face detection modes and a face recognition unit 102 that recognizes a face in the image data.
The face detection mode setting unit 101 includes a storage area for storing the type of face detection mode to be set, and stores only the most recently set face detection mode.
Further, the face recognition unit 102 has a function of recognizing a face using only two-dimensional data (two-dimensional face recognition processing) and a function of recognizing a face using three-dimensional data (three-dimensional face recognition). Processing). In addition, this face recognition detects a face in the image data and who is the face in the image data? What is it? Is specifically identified. That is, face detection and face identification are collectively referred to as face recognition.

キー入力部１１は、半押し全押し可能なシャッタボタン、モード切替キー、十字キー、ＳＥＴキー、被写体追従Ｏｎ／Ｏｆｆキー等の複数の操作キーを含み、ユーザのキー操作に応じた操作信号をＣＰＵ１０に出力する。 The key input unit 11 includes a plurality of operation keys such as a shutter button that can be half-pressed and fully pressed, a mode switching key, a cross key, a SET key, a subject tracking On / Off key, and an operation signal corresponding to a user's key operation. It outputs to CPU10.

メモリ１２には、ＣＰＵ１０が各部を制御するのに必要な制御プログラム、及び必要なデータ（被写体の種類毎（人、犬、猫等）の顔データを設定した顔データテーブル）が記録されており、ＣＰＵ１０は、該プログラムに従い動作する。なお、このメモリ１２は書き換え可能な不揮発性メモリである。 The memory 12 stores a control program necessary for the CPU 10 to control each unit and necessary data (a face data table in which face data is set for each type of subject (person, dog, cat, etc.)). The CPU 10 operates according to the program. The memory 12 is a rewritable nonvolatile memory.

図２は、メモリ１２に記録されている顔データテーブルの様子を示すものである。
図２を見ると分かるように、被写体の種類を大まかに人と動物に分け、更に、動物は、犬、猫、馬・・・、というように種類毎に分かれている。そして、各被写体の種類毎に顔データが記録されている。また、顔データは、２次元（平面）と３次元（立体）と２種類有り、該被写体の種類毎に、２次元、又は／及び３次元の顔データが設定されている。ここでは、人の顔データは、２次元と３次元ともに設定されており、動物の顔データは３次元のみである。つまり、動物の場合は２次元の顔データが設定されていない。動物の顔は見る角度によって極端に変わり、立体的であるためである。 FIG. 2 shows the face data table recorded in the memory 12.
As can be seen from FIG. 2, the types of subjects are roughly divided into humans and animals, and the animals are further divided into types such as dogs, cats, horses, and so on. Face data is recorded for each type of subject. There are two types of face data, two-dimensional (plane) and three-dimensional (solid), and two-dimensional and / or three-dimensional face data is set for each type of subject. Here, human face data is set for both 2D and 3D, and animal face data is only 3D. That is, in the case of animals, two-dimensional face data is not set. This is because the animal's face changes extremely depending on the viewing angle and is three-dimensional.

ＤＲＡＭ１３は、ＣＣＤ５によって撮像された後、ＣＰＵ１０に送られてきた画像データを一時記憶するバッファメモリとして使用されるとともに、ＣＰＵ１０のワーキングメモリとして使用される。
フラッシュメモリ１４は、圧縮された画像データを保存する記録媒体である。 The DRAM 13 is used as a buffer memory for temporarily storing image data sent to the CPU 10 after being imaged by the CCD 5 and also as a working memory for the CPU 10.
The flash memory 14 is a recording medium that stores compressed image data.

画像表示部１５は、カラーＬＣＤとその駆動回路を含み、撮影待機状態にあるときには、ＣＣＤ５によって撮像された被写体をスルー画像として表示し、記録画像の再生時には、フラッシュメモリ１４から読み出され、伸張された記録画像を表示させる。 The image display unit 15 includes a color LCD and its driving circuit, and displays the subject imaged by the CCD 5 as a through image when in a shooting standby state, and is read out from the flash memory 14 and decompressed when a recorded image is reproduced. The recorded image is displayed.

Ｂ．デジタルカメラ１の動作
第１の実施の形態におけるデジタルカメラ１の動作を図３及び図４のフローチャートに従って説明する。
ユーザのモード切替キーの操作により、顔検出静止画撮影モードに設定されると、ステップＳ１で、ＣＰＵ１０は、複数種類の顔検出モードを画像表示部１５に一覧表示させる。 B. Operation of Digital Camera 1 The operation of the digital camera 1 according to the first embodiment will be described with reference to the flowcharts of FIGS.
When the face detection still image shooting mode is set by the user's operation of the mode switching key, the CPU 10 causes the image display unit 15 to display a list of a plurality of types of face detection modes in step S1.

ここで、複数種類の顔検出モードは、大きく分けて、人の顔を検出する顔検出モード（人用の顔検出モード）と動物（犬、猫、馬等）の顔を検出する顔検出モード（動物用の顔検出モード）とに分かれ、更に、動物用の顔検出モードは、犬の顔を検出する顔検出モード（犬用の顔検出モード）、猫の顔を検出する顔検出モード（猫用の顔検出モード）、馬の顔を検出する顔検出モード（馬用の顔検出モード）というように動物の種類毎に分かれる。つまり、複数種類の顔検出モードは、顔検出の対象となる被写体の種類毎にあり、これらの顔検出モードが画像表示部１５に一覧表示される。このとき、カーソルを所定の種類の顔検出モードに合わせて表示させる。 Here, the plurality of types of face detection modes are roughly divided into a face detection mode for detecting a human face (face detection mode for humans) and a face detection mode for detecting the face of an animal (dog, cat, horse, etc.). (Face detection mode for animals). Furthermore, the face detection mode for animals includes a face detection mode for detecting the face of a dog (face detection mode for dog), and a face detection mode for detecting the face of a cat ( Cat face detection mode) and horse detection face detection mode (horse face detection mode). That is, there are a plurality of types of face detection modes for each type of subject to be face-detected, and these face detection modes are displayed in a list on the image display unit 15. At this time, the cursor is displayed in accordance with a predetermined type of face detection mode.

次いで、ステップＳ２で、ＣＰＵ１０は、ユーザによって何れかの顔検出モードが選択されたか否かを判断する。この判断はＳＥＴキーの操作に対応する操作信号がキー入力部１１から送られてきたか否かにより判断する。
このとき、ユーザは十字キーを操作することにより、選択したい種類の顔検出モードにカーソルを合わせることができ、該カーソルが合わさっている種類の顔検出モードでＯＫと思う場合はＳＥＴキーの操作を行なう。
ステップＳ２で、顔検出モードが選択されたと判断すると、ステップＳ３に進み、ＣＰＵ１０の顔検出モード設定部１０１は、該ＳＥＴキーの操作が行なわれたときにカーソルが合わさっている種類の顔検出モードに設定する。 Next, in step S2, the CPU 10 determines whether any face detection mode has been selected by the user. This determination is made based on whether or not an operation signal corresponding to the operation of the SET key is sent from the key input unit 11.
At this time, the user can move the cursor to the type of face detection mode to be selected by operating the cross key, and if the user is satisfied with the type of face detection mode in which the cursor is set, press the SET key. Do.
If it is determined in step S2 that the face detection mode has been selected, the process proceeds to step S3, where the face detection mode setting unit 101 of the CPU 10 selects the type of face detection mode in which the cursor is positioned when the SET key is operated. Set to.

顔検出モードの設定を行なうと、ステップＳ４に進み、ＣＰＵ１０は、ＣＣＤ５に所定のフレームレートで被写体を撮像させる処理を開始させ、ＣＣＤ５により順次撮像され画像生成部９によって生成された輝度色差信号のフレーム画像データ（ＹＵＶデータ）をバッファメモリ（ＤＲＡＭ１３）に記憶させていき、該記憶されたフレーム画像データに基づく画像を順次画像表示部１５に表示させていくという、いわゆるスルー画像表示を開始する。 When the face detection mode is set, the process proceeds to step S4, where the CPU 10 starts the process of causing the CCD 5 to image a subject at a predetermined frame rate, and sequentially captures the luminance and color difference signals generated by the image generation unit 9 after being sequentially imaged by the CCD 5. The frame image data (YUV data) is stored in the buffer memory (DRAM 13), and so-called through image display is started in which images based on the stored frame image data are sequentially displayed on the image display unit 15.

次いで、ステップＳ５で、ＣＰＵ１０は、被写体追従Ｏｎ／Ｏｆｆキーの操作が行なわれたか否かを判断する。
ステップＳ５で、被写体追従キーの操作が行われたと判断すると、ステップＳ６に進み、ＣＰＵ１０は、現在の被写体追従モードがＯｎであるか否かを判断する。 Next, in step S5, the CPU 10 determines whether or not the subject tracking On / Off key has been operated.
If it is determined in step S5 that the subject tracking key has been operated, the process proceeds to step S6, and the CPU 10 determines whether or not the current subject tracking mode is On.

ステップＳ６で、現在の被写体追従モードがＯｎでない、つまり、Ｏｆｆであると判断すると、ステップＳ７に進み、ＣＰＵ１０は、被写体追従モードをＯｆｆからＯｎに切り替えて、ステップＳ９に進む。
一方、ステップＳ６で、現在の被写体追従モードがＯｎであると判断すると、ステップＳ８に進み、ＣＰＵ１０は、被写体追従モードをＯｎからＯｆｆに切り替えて、ステップＳ９に進む。
また、ステップＳ５で、被写体追従Ｏｎ／Ｏｆｆキーの操作が行なわれていないと判断すると、そのままステップＳ９に進む。 If it is determined in step S6 that the current subject tracking mode is not On, that is, it is Off, the process proceeds to step S7, and the CPU 10 switches the subject tracking mode from Off to On and proceeds to step S9.
On the other hand, if it is determined in step S6 that the current subject tracking mode is On, the process proceeds to step S8, and the CPU 10 switches the subject tracking mode from On to Off, and proceeds to step S9.
If it is determined in step S5 that the subject tracking On / Off key is not operated, the process proceeds to step S9.

ステップＳ９に進むと、ＣＰＵ１０は、ユーザによってシャッタボタンが半押しされたか否かを判断する。この判断は、シャッタボタンの半押し操作に対応する操作信号がキー入力部１１から送られてきたか否かにより判断する。
このとき、ユーザは、撮影したい被写体（メイン被写体）を画角内に収めて構図を決めてから撮影の準備を指示するためシャッタボタンを半押し操作する。 In step S9, the CPU 10 determines whether or not the shutter button is half-pressed by the user. This determination is made based on whether or not an operation signal corresponding to a half-press operation of the shutter button is sent from the key input unit 11.
At this time, the user half-presses the shutter button in order to instruct preparation for photographing after placing the subject (main subject) to be photographed within the angle of view and determining the composition.

ステップＳ９で、シャッタボタンが半押しされていないと判断するとステップＳ５に戻り、シャッタボタンが半押しされたと判断すると、ステップＳ１０に進み、ＣＰＵ１０の顔認識部１０２は、ステップＳ３で設定された顔検出モードの種類に応じた顔検出処理を行う。この顔検出処理は後で詳細に説明する。この顔検出処理により設定された顔検出モードの種類の顔が検出されることになる。例えば、人用の顔検出モードが設定されている場合は、顔検出処理により人の顔が検出され、馬用の顔検出モードが設定されている場合は、顔検出処理により馬の顔が検出される。 If it is determined in step S9 that the shutter button is not half-pressed, the process returns to step S5. If it is determined that the shutter button is half-pressed, the process proceeds to step S10, where the face recognition unit 102 of the CPU 10 sets the face set in step S3. Face detection processing according to the type of detection mode is performed. This face detection process will be described in detail later. The face of the face detection mode type set by this face detection process is detected. For example, when the human face detection mode is set, the human face is detected by the face detection process, and when the horse face detection mode is set, the horse face is detected by the face detection process. Is done.

顔検出処理を行うと、ステップＳ１１に進み、ＣＰＵ１０は、現在の被写体追従モードがＯｎであるか否かを判断する。
ステップＳ１１で、被写体追従モードがＯｎでない、つまり、Ｏｆｆであると判断すると、ステップＳ１２に進み、ＣＰＵ１０は、ステップＳ１０の顔検出処理により顔を検出することができたか否かを判断する。 When face detection processing is performed, the process proceeds to step S11, and the CPU 10 determines whether or not the current subject tracking mode is On.
If it is determined in step S11 that the subject tracking mode is not On, that is, it is Off, the process proceeds to step S12, and the CPU 10 determines whether or not the face can be detected by the face detection process in step S10.

ステップＳ１２で、顔を検出することができたと判断すると、ステップＳ１３に進み、ＣＰＵ１０は、該検出した顔に対してコントラスト検出方式によるＡＦ処理を行って、ステップＳ１５に進む。このコントラスト検出方式によるＡＦ処理とは、周知技術なので詳しく説明しないが、フォーカス対象となる領域の画像データの高周波成分が最も大きくなるレンズ位置に、フォーカスレンズを合わせることにより、該フォーカス対象となる領域の画像のピントを合わせるというものである。 If it is determined in step S12 that a face has been detected, the process proceeds to step S13, and the CPU 10 performs an AF process using the contrast detection method on the detected face, and then proceeds to step S15. The AF processing by the contrast detection method is a well-known technique and will not be described in detail. However, by focusing the focus lens on the lens position where the high-frequency component of the image data in the focus target area is the largest, the focus target area This is to adjust the focus of the image.

一方、ステップＳ１２で、顔を検出することができなかったと判断すると、ステップＳ１４に進み、ＣＰＵ１０は、所定の領域（例えば、画角の中央領域）に対してコントラスト検出方式によるＡＦ処理を行って、ステップＳ１５に進む。
なお、ステップＳ１３、ステップＳ１４では、コントラスト検出方式によるＡＦ処理を行うようにしたが、他の方式（例えば、位相差方式）によってＡＦ処理を行うようにしてもよい。要は、ピントを合わせたい領域に対してＡＦ処理を行うことができればよい。 On the other hand, if it is determined in step S12 that the face could not be detected, the process proceeds to step S14, where the CPU 10 performs AF processing by a contrast detection method on a predetermined area (for example, the center area of the angle of view). The process proceeds to step S15.
In step S13 and step S14, the AF process is performed by the contrast detection method, but the AF process may be performed by another method (for example, a phase difference method). In short, it suffices if AF processing can be performed on an area to be focused.

ステップＳ１５に進むと、ＣＰＵ１０は、シャッタボタンが全押しされたか否かを判断する。この判断は、シャッタボタンの全押し操作に対応する操作信号がキー入力部１１から送られてきたか否かにより判断する。
ステップＳ１５で、シャッタボタンが全押しされていないと判断すると、シャッタボタンが全押しされるまでステップＳ１５に留まり、シャッタボタンが全押しされたと判断すると、ステップＳ１６に進み、ＣＰＵ１０は、静止画撮影処理を行い、該撮影処理により得られた静止画像データを圧縮してフラッシュメモリ１４に記録する処理を行う。 In step S15, the CPU 10 determines whether or not the shutter button has been fully pressed. This determination is made based on whether or not an operation signal corresponding to the full pressing operation of the shutter button is sent from the key input unit 11.
If it is determined in step S15 that the shutter button is not fully pressed, the process stays in step S15 until the shutter button is fully pressed. If it is determined that the shutter button is fully pressed, the process proceeds to step S16, and the CPU 10 captures a still image. Processing is performed, and still image data obtained by the photographing processing is compressed and recorded in the flash memory 14.

一方、ステップＳ１１で、被写体追従モードがＯｎであると判断すると、図４のステップＳ２１に進み、ＣＰＵ１０は、ステップＳ１０の顔検出処理により顔を検出することができたか否かを判断する。 On the other hand, if it is determined in step S11 that the subject tracking mode is On, the process proceeds to step S21 in FIG. 4, and the CPU 10 determines whether or not the face can be detected by the face detection process in step S10.

ステップＳ２１で、顔を検出することができたと判断すると、ステップＳ２２に進み、ＣＰＵ１０の顔認識部１０２は、該検出された顔を２次元的に検出していくことにより、該検出された顔に対して追従していく追従処理を開始して、ステップＳ２３に進む。ここで、２次元的に顔を検出するとは、２次元のデータのみを用いて顔検出することを指し、このステップＳ２２での、２次元的に顔を検出する処理は、前述のステップＳ１０の顔検出処理により検出された顔領域の２次元画像から抽出した特徴データを登録し、ブロックマッチング法などを用いて該登録した特徴データを有する顔を検出していく。このブロックマッチング法は、周知技術なので詳しく説明しないが、２枚の画像データを用いて、一方の画像データのある領域の画像データと最も相関度が高くなる他方の画像データの領域を検出する方法のことをいう。
一方、ステップＳ２１で、顔を検出することができないと判断すると、そのままステップＳ２３に進む。 If it is determined in step S21 that the face has been detected, the process proceeds to step S22, and the face recognition unit 102 of the CPU 10 detects the detected face by two-dimensionally detecting the detected face. Is started, and the process proceeds to step S23. Here, detecting a face in two dimensions means detecting a face using only two-dimensional data, and the process of detecting a face in two dimensions in step S22 is performed in step S10 described above. Feature data extracted from the two-dimensional image of the face area detected by the face detection process is registered, and a face having the registered feature data is detected using a block matching method or the like. Since this block matching method is a well-known technique, it will not be described in detail. However, a method of detecting the area of the other image data having the highest correlation with the image data of a certain area of one image data using two pieces of image data. I mean.
On the other hand, if it is determined in step S21 that a face cannot be detected, the process proceeds to step S23.

ステップＳ２３に進むと、ＣＰＵ１０は、シャッタボタンが全押しされたか否かを判断する。
ステップＳ２３で、シャッタボタンが全押しされていないと判断すると、ステップＳ２４に進み、ＣＰＵ１０は、該追従している顔を見失ったか否かを判断する。つまり、ステップＳ２２の２次元的に顔を検出する処理により顔を検出することができなくなったか否かを判断する。なお、ステップＳ２１で顔が検出できないと判断された場合、つまり、図３のステップＳ１０の顔検出処理により追従元となる顔がそもそも検出できないと判断された場合も顔を見失ったと判断する。 In step S23, the CPU 10 determines whether or not the shutter button has been fully pressed.
If it is determined in step S23 that the shutter button is not fully pressed, the process proceeds to step S24, and the CPU 10 determines whether or not the face being followed is lost. That is, it is determined whether the face cannot be detected by the two-dimensional face detection process in step S22. If it is determined in step S21 that the face cannot be detected, that is, if the face detection process in step S10 of FIG. 3 determines that the follow-up face cannot be detected in the first place, it is determined that the face has been lost.

ステップＳ２４で、顔を見失っていないと判断するとステップＳ２３に戻り、ステップＳ２４で、顔を見失ったと判断すると、図３のステップＳ１０に戻る。これにより、顔を見失った場合は再びステップＳ３で設定された顔検出モードの種類に応じた顔検出処理を行うことになる。 If it is determined in step S24 that the face is not lost, the process returns to step S23. If it is determined in step S24 that the face is lost, the process returns to step S10 in FIG. As a result, when the face is lost, face detection processing corresponding to the type of face detection mode set in step S3 is performed again.

一方、ステップＳ２３で、シャッタボタンが全押しされたと判断すると、ステップＳ２５に進み、ＣＰＵ１０は、該追従している顔に対してコントラスト検出方式によるＡＦ処理を行う。つまり、シャッタボタンが全押しされた直前に２次元的に検出された顔に対してＡＦ処理を行う。
次いで、ステップＳ２６で、ＣＰＵ１０は、静止画撮影処理を行い、該撮影処理により得られた静止画像データを圧縮してフラッシュメモリ１４に記録する処理を行う。 On the other hand, if it is determined in step S23 that the shutter button has been fully pressed, the process proceeds to step S25, and the CPU 10 performs an AF process using the contrast detection method on the following face. That is, AF processing is performed on a face that is two-dimensionally detected immediately before the shutter button is fully pressed.
Next, in step S 26, the CPU 10 performs still image shooting processing, and performs processing for compressing still image data obtained by the shooting processing and recording it in the flash memory 14.

Ｃ．設定された顔検出モードの種類に応じた顔検出処理について
次に、設定された顔検出モードの種類に応じた顔検出処理の動作を図５のフローチャートにしたがって説明する。
図３のステップＳ１０に進むと、図５のステップＳ３１に進み、ＣＰＵ１０の顔認識部１０２は、現在、人用の顔検出モードが設定されているか否かを判断する。 C. Next, the face detection process according to the set face detection mode will be described with reference to the flowchart of FIG.
When the process proceeds to step S10 in FIG. 3, the process proceeds to step S31 in FIG. 5, and the face recognition unit 102 of the CPU 10 determines whether or not the human face detection mode is currently set.

ステップＳ３１で、人用の顔検出モードが設定されていると判断すると、ステップＳ３２に進み、ＣＰＵ１０の顔認識部１０２は、２次元の顔検出処理により人の顔を検出する。
ここでの、２次元の顔検出処理は、直近に撮像されたフレーム画像データから全被写体の特徴データ（２次元の特徴データ）を算出し、該算出した被写体の特徴データと、図２に示す顔データテーブルに設定されている人の２次元の顔データとを比較照合することにより、人の顔がどこにあるかを検出する。 If it is determined in step S31 that the human face detection mode is set, the process proceeds to step S32, and the face recognition unit 102 of the CPU 10 detects a human face by two-dimensional face detection processing.
In this two-dimensional face detection process, feature data (two-dimensional feature data) of all subjects is calculated from frame image data captured most recently, and the calculated feature data of the subject is shown in FIG. The person's face is detected by comparing and collating with the two-dimensional face data of the person set in the face data table.

また、「全被写体」とは、ＣＣＤ５によって撮像された全ての被写体のことを指す。例えば、建物の前に立っているメイン被写体である人を撮像した場合は、全被写体は建物と人ということになる。つまり、画像データの全領域から被写体の特徴データを算出する。
また、全被写体の特徴データとは、例えば、撮影された全被写体が建物とその前に立っている人の場合は、建物及び人の特徴点を複数抽出し、該抽出した特徴点の座標位置や相対位置関係等を数値化したデータのことをいう。この特徴点の抽出により、目、鼻、口、顔の輪郭等から顔の特徴点も複数抽出される。つまり、全被写体の特徴データは、顔だけではなく撮像された全ての被写体の特徴点に基づいて数値化されたデータである。
また、この比較照合は、生成した全被写体の特徴データの中に顔データと一致する部分かあるか否かを判断するためのものである。したがって、この比較照合は、生成された全被写体の特徴データの各部と顔データとを比較照合していく（サーチしていく）ことにより、各部との一致度を得る。 “All subjects” refers to all subjects imaged by the CCD 5. For example, when a person who is a main subject standing in front of a building is imaged, all the subjects are a building and a person. That is, subject feature data is calculated from the entire area of the image data.
Further, the feature data of all subjects is, for example, in the case where all photographed subjects are a building and a person standing in front of it, a plurality of building and person feature points are extracted, and the coordinate positions of the extracted feature points are extracted. Or the relative positional relationship. By extracting the feature points, a plurality of facial feature points are also extracted from the eyes, nose, mouth, face outline, and the like. That is, the feature data of all subjects is data that is digitized based on the feature points of not only the face but all of the captured subjects.
This comparison and collation is for determining whether or not the generated feature data of all the subjects is a portion that matches the face data. Therefore, in this comparison and collation, the degree of coincidence with each part is obtained by comparing and collating (searching) each part of the generated feature data of all subjects with the face data.

ステップＳ３３で、ＣＰＵ１０の顔認識部１０２は、画像内に人の顔があるか否かを判断する。つまり、２次元の顔検出処理により人の顔を検出することができたか否かを判断する。つまり、算出した全被写体の特徴データの中に、顔データと所定値以上で（所定の範囲内で）一致する部分があるか否かを判断する。
ステップＳ３３で、人の顔があると判断すると、ステップＳ３８に進み、ＣＰＵ１０の顔認識部１０２は、顔を検出することができたと判断する。 In step S33, the face recognition unit 102 of the CPU 10 determines whether or not there is a human face in the image. That is, it is determined whether or not a human face can be detected by the two-dimensional face detection process. That is, it is determined whether or not the calculated feature data of all subjects includes a portion that matches the face data at a predetermined value or more (within a predetermined range).
If it is determined in step S33 that there is a human face, the process proceeds to step S38, and the face recognition unit 102 of the CPU 10 determines that the face has been detected.

一方、ステップＳ３３で、人の顔がないと判断すると、ステップＳ３４に進み、ＣＰＵ１０の顔認識部１０２は、顔らしきものがあるか否かを判断する。この顔らしきものとは、ステップＳ３２の全被写体の特徴データと、人の２次元の顔データとの比較照合により、口と鼻はあるが片目、若しくは両目が検出されなかったり（人が横を向いている場合、サングラスをしている場合）、目があるが鼻と口が検出されなかったり場合（マスクをしている場合）のことである。つまり、顔データに基づく顔のパーツが一部検出されなかった場合のことである。 On the other hand, if it is determined in step S33 that there is no human face, the process proceeds to step S34, and the face recognition unit 102 of the CPU 10 determines whether there is something that looks like a face. This face-like thing is the result of the comparison between the feature data of all the subjects in step S32 and the two-dimensional face data of the person, but there is a mouth and nose but one eye or both eyes are not detected (the person moves sideways). (If you are facing, wearing sunglasses), if you have eyes but nose and mouth are not detected (if you are wearing a mask). In other words, this is a case where some face parts based on the face data are not detected.

ステップＳ３４で、顔らしきものがあると判断されると、ステップＳ３５に進み、ＣＰＵ１０の顔認識部１０２は、該検出された顔らしきものがある領域に基づいて３次元の顔検出処理により人の顔を検出する処理を行って、ステップＳ３７に進む。この３次元の顔検出処理については後で説明する。これにより、３次元の顔検出処理による処理負担を軽減することができる。
一方、ステップＳ３１で、人用の顔検出モードが設定されていないと判断した場合、又は、ステップＳ３４で、顔らしきものがないと判断した場合は、ステップＳ３６に進み、ＣＰＵ１０の顔認識部１０２は、フレーム画像データの全領域に基づいて３次元の顔検出処理により、現在設定されている顔検出モードの種類の顔を検出する処理を行って、ステップＳ３７に進む。この３次元の顔検出処理については後で説明する。ここで、ステップＳ３４で顔らしきものがないと判断されることによりステップＳ３６に進んだ場合は、ステップＳ３６において、フレーム画像データの全領域に基づいて３次元の顔検出処理により人の顔を検出する処理が行われることになる。 If it is determined in step S34 that there is something that looks like a face, the process proceeds to step S35, and the face recognition unit 102 of the CPU 10 performs human face detection processing by three-dimensional face detection processing based on the detected area that appears to be a face. Processing for detecting a face is performed, and the process proceeds to step S37. This three-dimensional face detection process will be described later. Thereby, the processing load by the three-dimensional face detection process can be reduced.
On the other hand, if it is determined in step S31 that the human face detection mode is not set, or if it is determined in step S34 that there is no human face detection mode, the process proceeds to step S36, where the face recognition unit 102 of the CPU 10 is processed. Performs a process for detecting a face of the currently set face detection mode type by a three-dimensional face detection process based on the entire region of the frame image data, and the process proceeds to step S37. This three-dimensional face detection process will be described later. If it is determined in step S34 that there is no face-like object, the process proceeds to step S36. In step S36, a human face is detected by three-dimensional face detection processing based on the entire area of the frame image data. Will be performed.

ステップＳ３７に進むと、ＣＰＵ１０の顔検出部１０２は、画像内に人の顔があるか否かを判断する。つまり、３次元の顔検出処理により顔を検出することができたか否かを判断する。
ステップＳ３７で、画像内に顔があると判断すると、ステップＳ３８に進み、顔認識部１０２は、顔を検出することができたと判断し、ステップＳ３７で、顔がないと判断すると、ステップＳ３９に進み、ＣＰＵ１０の顔認識部１０２は、顔を検出することができないと判断する。 In step S37, the face detection unit 102 of the CPU 10 determines whether or not there is a human face in the image. That is, it is determined whether or not a face can be detected by the three-dimensional face detection process.
If it is determined in step S37 that there is a face in the image, the process proceeds to step S38. The face recognition unit 102 determines that a face has been detected. If it is determined in step S37 that there is no face, the process proceeds to step S39. The face recognition unit 102 of the CPU 10 determines that the face cannot be detected.

このステップＳ３８で、顔を検出することができたと判断すると、図３のステップＳ１２、図４のステップＳ２１で顔を検出することができたと判断し、ステップＳ３９で、顔を検出することができないと判断すると、図３のステップＳ１２、図４のステップＳ２１で顔を検出することができないと判断する。 If it is determined in step S38 that a face has been detected, it is determined that a face has been detected in step S12 in FIG. 3 and step S21 in FIG. 4, and a face cannot be detected in step S39. If it is determined that the face cannot be detected in step S12 in FIG. 3 and step S21 in FIG.

Ｄ．３次元の顔検出処理について
次に、３次元の顔検出処理の動作を説明する。２次元の顔検出処理は、２次元的に顔を検出するが、つまり、撮像されたフレーム画像データに基づく２次元の被写体の特徴データと２次元の顔データとを比較照合することにより、該フレーム画像データ内にある顔を検出していたが、３次元の顔検出処理は、３次元のデータを用いて顔を検出するというものである。
ここでは、３次元のデータを用いて顔を検出する動作を２つ紹介するが、これに限られるものではなく、要は、３次元データを用いて被写体の顔を検出するものであればよい。 D. Next, the operation of the three-dimensional face detection process will be described. The two-dimensional face detection process detects a face two-dimensionally, that is, by comparing and comparing feature data of a two-dimensional subject based on captured frame image data and two-dimensional face data, The face in the frame image data is detected, but the three-dimensional face detection process is to detect a face using three-dimensional data.
Here, two operations for detecting a face using three-dimensional data are introduced. However, the present invention is not limited to this, and any operation may be used as long as it detects a face of a subject using three-dimensional data. .

Ｄ−１．１つ目の３次元の顔検出処理について
まず、３次元の顔検出処理の動作を図６のフローチャートにしたがって説明する。
図５のステップＳ３５及びステップＳ３６で、３次元の顔検出処理を行うことにより顔を検出する場合は、図６のステップＳ５１に進み、ＣＰＵ１０の顔認識部１０２は、異なる撮影角度でメイン被写体の撮影を促す表示を行なう。例えば、「撮影したい被写体を異なる角度から撮影してください。」というような表示を行う。これにしたがって、ユーザは撮影角度を変えてメイン被写体を撮像することになる。なお、このときユーザは、メイン被写体となる顔の画像内での位置を変えずに撮影角度を変えることが好ましい。 D-1.1 First Three-dimensional Face Detection Process First, the operation of the three-dimensional face detection process will be described with reference to the flowchart of FIG.
When the face is detected by performing the three-dimensional face detection process in step S35 and step S36 in FIG. 5, the process proceeds to step S51 in FIG. 6, and the face recognition unit 102 of the CPU 10 determines the main subject at different shooting angles. Display to prompt shooting. For example, a display such as “Please photograph the subject to be photographed from different angles” is performed. In accordance with this, the user changes the shooting angle and images the main subject. At this time, the user preferably changes the shooting angle without changing the position in the image of the face that is the main subject.

次いで、ステップＳ５２で、ＣＰＵ１０の顔認識部１０２は、異なる撮影角度でのメイン被写体の撮影を促す表示を行なってから所定時間（例えば、１秒）が経過したか否かを判断する。このとき、ＣＰＵ１０は、スルー画像表示用に撮像された複数枚のフレーム画像データをバッファメモリに保持させておく。 Next, in step S52, the face recognition unit 102 of the CPU 10 determines whether or not a predetermined time (for example, 1 second) has elapsed since the display for prompting the main subject to be photographed at a different photographing angle. At this time, the CPU 10 holds a plurality of frame image data captured for through image display in the buffer memory.

ステップＳ５２で、所定時間が経過していないと判断すると所定時間が経過するまでステップＳ５２に留まり、所定時間が経過したと判断すると、ステップＳ５３に進み、ＣＰＵ１０の顔認識部１０２は、撮像された複数枚のフレーム画像データから、被写体の特徴データをそれぞれ算出する。
このとき、図５のステップＳ３６での３次元の顔検出処理の場合は、撮像されたフレーム画像データの全領域に基づいて被写体の特徴データ（全被写体の特徴データ）を算出する。
また、図５のステップＳ３５での３次元の顔検出処理の場合は、撮像された各フレーム画像データのうち、顔らしきものがあると判断された領域の画像データに基づいて被写体の特徴データ（顔らしき被写体の特徴データ）を算出する。 If it is determined in step S52 that the predetermined time has not elapsed, the process stays in step S52 until the predetermined time elapses. If it is determined that the predetermined time has elapsed, the process proceeds to step S53, and the face recognition unit 102 of the CPU 10 captures the image. Characteristic data of the subject is calculated from a plurality of frame image data.
At this time, in the case of the three-dimensional face detection processing in step S36 of FIG. 5, subject feature data (feature data of all subjects) is calculated based on the entire area of the captured frame image data.
Further, in the case of the three-dimensional face detection process in step S35 of FIG. 5, the subject feature data (based on the image data of the area determined to be a face among the captured frame image data). (Feature data of a subject that looks like a face) is calculated.

ここで、フレーム画像データから算出される全被写体の特徴データとは、例えば、撮影された全被写体が建物とその前に立っている人の場合は、建物及び人の特徴点を複数抽出し、該抽出した特徴点の座標位置や相対位置関係等を数値化したデータのことをいう。この特徴点の抽出により、目、鼻、口、顔の輪郭等から顔の特徴点も複数抽出される。つまり、全被写体の特徴データは、顔だけではなく撮像された全ての被写体の特徴点に基づいて数値化されたデータのことである。
また、顔らしき被写体の特徴データとは、顔らしきものがあると判断された領域内で撮像されている全ての被写体の特徴データに基づいて数値化されたデータのことである。 Here, the feature data of all subjects calculated from the frame image data is, for example, in the case where all the photographed subjects are a building and a person standing in front of it, extract a plurality of building and person feature points, This is data obtained by quantifying the coordinate positions and relative positional relationships of the extracted feature points. By extracting the feature points, a plurality of facial feature points are also extracted from the eyes, nose, mouth, face outline, and the like. That is, the feature data of all subjects is data that is digitized based on the feature points of not only the face but all of the captured subjects.
The feature data of a subject that looks like a face is data that has been digitized based on the feature data of all the subjects that have been imaged in an area where it is determined that there is something that looks like a face.

なお、ここでは、顔らしきものがあると判断された領域の位置、大きさを撮像された全てのフレーム画像データに対して同じにするようにしたが、所定時間が経過するまで各フレーム画像データに対して２次元の顔検出処理を行い続けることにより、各フレーム画像データ毎に顔らしき領域を検出し、各フレーム画像データ毎に該検出された領域の画像データから被写体の特徴データ（顔らしき被写体の特徴データ）を算出するようにしてもよい。この場合は、この２次元の顔検出処理により顔が検出された場合は、もはや３次元の顔検出処理を行う必要がないので図５のステップＳ３８に進むようにしてもよい。 Here, the position and size of the area determined to have a facial appearance are the same for all the captured frame image data, but each frame image data until a predetermined time elapses. By continuing to perform the two-dimensional face detection processing on the image, a face-like area is detected for each frame image data, and subject feature data (face-like data) is detected from the image data of the detected area for each frame image data. Subject feature data) may be calculated. In this case, if a face is detected by this two-dimensional face detection process, it is no longer necessary to perform the three-dimensional face detection process, and the process may proceed to step S38 in FIG.

各フレーム画像データから被写体の特徴データを算出すると、ステップＳ５４に進み、ＣＰＵ１０の顔認識部１０２は、該算出された各フレーム画像データの被写体の特徴データに基づいて、被写体の３次元モデルを生成する。
この３次元モデルの生成は、既に周知技術なので詳しく説明しないが、ある画像の特徴点と、該特徴点に対応する他の画像の特徴点とに基づいて、三角測量演算によって被写体の３次元モデルを生成する（モデリング処理）。 When the subject feature data is calculated from each frame image data, the process proceeds to step S54, and the face recognition unit 102 of the CPU 10 generates a three-dimensional model of the subject based on the calculated subject feature data of each frame image data. To do.
The generation of this three-dimensional model is already well-known technology and will not be described in detail. However, based on the feature points of a certain image and the feature points of another image corresponding to the feature points, the three-dimensional model of the subject is obtained by triangulation calculation. Is generated (modeling process).

また、図７は、生成された被写体（ここでは猫科の動物の顔のみ）の３次元モデルをサーフェスモデルの一種である多角形ポリゴン（ここでは、３角形ポリゴン）で表したときの様子を示す図である。
なお、サーフェスモデルでなく、ワイヤーフレームモデルやソリッドモデル等他の方法によって表すようにしてもよい。 FIG. 7 shows a state in which the generated three-dimensional model of the subject (here, only a cat face) is represented by a polygon polygon (here, a triangular polygon) which is a kind of surface model. FIG.
In addition, you may make it represent with other methods, such as a wire frame model and a solid model, instead of a surface model.

次いで、ステップＳ５５で、ＣＰＵ１０の顔認識部１０２は、該生成した被写体の３次元モデルと、図２に示す顔データテーブルに記録されている現在設定されている顔検出モードの種類の立体顔データ（３次元の顔データ）とを比較照合する。
例えば、図５のステップＳ３５の場合は、該生成した被写体の３次元モデルと、図２に示す顔データテーブルに設定されている人の立体顔データとを比較照合し、図５のステップＳ３６の場合は、たとえば、犬用の顔検出モードが設定されている場合は、該生成した被写体の３次元モデルと、図２に示す顔データテーブルに設定されている犬の立体顔データとを比較照合する。 Next, in step S55, the face recognition unit 102 of the CPU 10 generates the three-dimensional model of the generated subject and the three-dimensional face data of the currently set face detection mode type recorded in the face data table shown in FIG. (3D face data) is compared and collated.
For example, in the case of step S35 in FIG. 5, the generated three-dimensional model of the subject is compared and collated with the human three-dimensional face data set in the face data table shown in FIG. 2, and in step S36 in FIG. In this case, for example, when the face detection mode for dogs is set, the three-dimensional model of the generated subject is compared with the three-dimensional face data of the dog set in the face data table shown in FIG. To do.

また、この比較照合は、生成した被写体の３次元モデルの中に立体顔データと一致する部分があるか否かを判断するためのものである。したがって、この比較照合は、生成された３次元モデルの各部と立体顔データとを比較照合していく（サーチしていく）ことにより、各部との一致度を得る。 This comparison and collation is for determining whether or not there is a portion that matches the stereoscopic face data in the generated three-dimensional model of the subject. Therefore, in this comparison and collation, each part of the generated three-dimensional model is compared and collated with the 3D face data (searching) to obtain a degree of coincidence with each part.

次いで、ステップＳ５６で、ＣＰＵ１０の顔認識部１０２は、該生成した被写体の３次元モデルの中に、該比較照合を行なった立体顔データと所定値以上で一致する部分があるか否かを判断する。つまり、所定の範囲内で立体顔データと一致する部分があるか否かを判断することになる。この一致する部分とは、被写体の３次元モデルの中の１部であってもよいし、被写体の３次元モデルの全部であってもよい。要は、立体顔データと所定値以上で一致するものがあるかを判断できればよい。 Next, in step S56, the face recognition unit 102 of the CPU 10 determines whether or not there is a portion in the generated three-dimensional model of the subject that matches the stereoscopic face data subjected to the comparison and collation with a predetermined value or more. To do. That is, it is determined whether there is a portion that matches the stereoscopic face data within a predetermined range. The matching part may be a part of the three-dimensional model of the subject or the whole of the three-dimensional model of the subject. In short, it suffices if it is possible to determine whether there is a solid face data that matches a predetermined value or more.

ステップＳ５６で、所定値以上で一致する部分があると判断した場合は、ステップＳ５７に進み、ＣＰＵ１０の顔認識部１０２は、所定値以上で一致する部分を顔と判断し、該所定値以上で一致した部分に対応する撮像されたフレーム画像データ上の領域を顔領域とする。つまり、フレーム画像データ上の顔領域を検出する。 When it is determined in step S56 that there is a matching part with a predetermined value or more, the process proceeds to step S57, where the face recognition unit 102 of the CPU 10 determines that the matching part has a predetermined value or more as a face, An area on the captured frame image data corresponding to the matched portion is set as a face area. That is, the face area on the frame image data is detected.

この所定値以上で一致した部分に対応するフレーム画像データ上の領域とは、被写体の３次元モデルの中で、一致する部分（顔部分）の特徴データに対応するフレーム画像データ上の領域であり、該特徴データに対応するフレーム画像データは複数あるので（複数枚のフレーム画像データに基づいて被写体モデルを生成したため）、所定値以上で一致した部分に対応する、生成元となった複数のフレーム画像データのうち、直近に撮像されたフレーム画像データ上の領域であってもよいし、所定値以上で一致した部分に対応する生成元となった全てのフレーム画像データ上の領域から構成される領域であってもよい。 The area on the frame image data corresponding to the matching part at a predetermined value or more is the area on the frame image data corresponding to the feature data of the matching part (face part) in the three-dimensional model of the subject. Since there are a plurality of frame image data corresponding to the feature data (because a subject model is generated based on a plurality of frame image data), a plurality of frames that are the generation sources corresponding to portions that match at a predetermined value or more Of the image data, it may be an area on the most recently captured frame image data, or it may be composed of all the areas on the frame image data that are the generation sources corresponding to portions that match at a predetermined value or more. It may be a region.

一方、ステップＳ５６で、所定値以上で一致する部分がないと判断すると、ステップＳ５８に進み、ＣＰＵ１０の顔認識部１０２は、撮像されたフレーム画像データ内に顔がないと判断する。
このステップＳ５７で顔領域を検出すると、図５のステップＳ３８で顔を検出することができたと判断し、図３のステップＳ１２、図４のステップＳ２１で顔を検出できたと判断する。また、ステップＳ５８で顔がないと判断すると、図５のステップＳ３９で顔を検出することができないと判断し、図３のステップＳ１２、図４のステップＳ２１で顔を検出できないと判断する。 On the other hand, if it is determined in step S56 that there is no matching portion equal to or greater than the predetermined value, the process proceeds to step S58, and the face recognition unit 102 of the CPU 10 determines that there is no face in the captured frame image data.
When the face area is detected in step S57, it is determined that the face can be detected in step S38 of FIG. 5, and it is determined that the face can be detected in step S12 of FIG. 3 and step S21 of FIG. If it is determined in step S58 that there is no face, it is determined that a face cannot be detected in step S39 in FIG. 5, and a face cannot be detected in steps S12 in FIG. 3 and step S21 in FIG.

このように、１つ目の３次元の顔検出処理は、同一被写体を撮像した複数のフレーム画像データから、該被写体の３次元モデルを生成し、立体顔データと比較照合することにより顔を検出するので、顔検出の精度を向上させることができる。
例えば、被写体の顔が横を向いていたり、サングラスを装着している場合であっても検出することができる。
また、立体顔データと生成した３次元モデルとを比較照合するので、人の顔に限らず、犬や猫等の動物の顔等も検出することができる。例えば、馬など動物の顔は、見る角度が違うと極端に顔の形が変わってしまうが、３次元とすることにより、精度よく顔を検出することができる。 In this way, the first three-dimensional face detection process detects a face by generating a three-dimensional model of the subject from a plurality of frame image data obtained by imaging the same subject and comparing and comparing with the three-dimensional face data. Therefore, the accuracy of face detection can be improved.
For example, it can be detected even when the face of the subject is facing sideways or wearing sunglasses.
Further, since the three-dimensional face data and the generated three-dimensional model are compared and collated, it is possible to detect not only human faces but also animal faces such as dogs and cats. For example, the face of an animal such as a horse changes its shape extremely if the viewing angle is different, but the face can be detected with high accuracy by using three dimensions.

Ｄ−２．２つ目の３次元の顔検出処理について
次に、２つ目の３次元の顔検出処理の動作を図８のフローチャートにしたがって説明する。
図５のステップＳ３５及びステップＳ３６で、３次元の顔検出処理を行うことにより顔を検出する場合は、図８のステップＳ６１に進み、ＣＰＵ１０の顔認識部１０２は、顔の向きを正面と設定する。 D-2.2 Second Three-dimensional Face Detection Process Next, the operation of the second three-dimensional face detection process will be described with reference to the flowchart of FIG.
When the face is detected by performing the three-dimensional face detection process in step S35 and step S36 in FIG. 5, the process proceeds to step S61 in FIG. 8, and the face recognition unit 102 of the CPU 10 sets the face direction to the front. To do.

次いで、ステップＳ６２で、ＣＰＵ１０の顔認識部１０２は、図２の顔データテーブルに記録されている現在設定されている顔検出モードの種類の立体顔データに基づいて、現在設定されている向きから見た顔の平面（２次元）画像データを生成する（レンダリング処理）。ここでは、設定されている向きは正面なので、設定された顔検出モードの種類の顔を正面から見た顔の平面画像データを生成することになる。この立体的なものから平面画像データを生成する技術は周知技術なので詳しく説明しないが、陰面消去や陰線消去などを行なうことによって行なわれる。 Next, in step S62, the face recognition unit 102 of the CPU 10 starts from the currently set orientation based on the currently set face detection mode type of three-dimensional face data recorded in the face data table of FIG. Plane (two-dimensional) image data of the viewed face is generated (rendering process). Here, since the set direction is the front, plane image data of the face when the face of the set face detection mode type is viewed from the front is generated. The technology for generating planar image data from this three-dimensional image is well known and will not be described in detail, but is performed by performing hidden surface removal, hidden line removal, or the like.

たとえば、設定されている顔検出モードが人用である場合は、図２に記録されている人の立体顔データから現在設定されている向きから見た平面画像データを生成し、設定されている顔検出モードが猫用である場合は、図２に記録されている猫の立体顔データから現在設定されている向きからみた平面画像データを生成することになる。 For example, when the set face detection mode is for human use, plane image data viewed from the currently set direction is generated from the person's stereoscopic face data recorded in FIG. 2 and set. When the face detection mode is for cats, plane image data viewed from the currently set direction is generated from the cat's three-dimensional face data recorded in FIG.

次いで、ステップＳ６３で、ＣＰＵ１０の顔認識部１０２は、該生成した顔の平面画像データから顔特徴データを算出する。これにより、現在設定されている向きから見た、該設定されている顔検出モードの種類の顔特徴データが算出される。 Next, in step S63, the face recognition unit 102 of the CPU 10 calculates face feature data from the generated plane image data of the face. Thereby, the face feature data of the type of the set face detection mode as seen from the currently set direction is calculated.

次いで、ステップＳ６４で、ＣＰＵ１０の顔認識部１０２は、直近にスルー画像表示用に撮像されたフレーム画像データから被写体の特徴データを算出する。
このとき、図５のステップＳ３６での３次元の顔検出処理の場合は、撮像されたフレーム画像データの全領域に基づいて被写体の特徴データ（全被写体の特徴データ）を算出する。
また、図５のステップＳ３５での３次元の顔検出処理の場合は、撮像されたフレーム画像データのうち、顔らしきものがあると判断された領域の画像データに基づいて被写体の特徴データ（顔らしき被写体の特徴データ）を算出する。 Next, in step S64, the face recognition unit 102 of the CPU 10 calculates subject feature data from the frame image data recently captured for through image display.
At this time, in the case of the three-dimensional face detection processing in step S36 of FIG. 5, subject feature data (feature data of all subjects) is calculated based on the entire area of the captured frame image data.
Further, in the case of the three-dimensional face detection process in step S35 of FIG. 5, subject feature data (face data) based on image data of an area determined to have a facial appearance in the captured frame image data. Characteristic data of the subject).

次いで、ステップＳ６５で、ＣＰＵ１０の顔認識部１０２は、ステップＳ６３で算出された顔特徴データと、ステップＳ６４で算出された被写体の特徴データとを比較照合する。この比較照合は、生成した被写体の特徴データの中に顔特徴データと一致する部分かあるか否かを判断するためのものである。したがって、この比較照合は、生成された被写体の特徴データの各部と顔特徴データとを比較照合していく（サーチしていく）ことにより、各部との一致度を得る。 Next, in step S65, the face recognition unit 102 of the CPU 10 compares and compares the face feature data calculated in step S63 with the subject feature data calculated in step S64. This comparison and collation is for determining whether or not the generated feature data of the subject is a portion that matches the facial feature data. Therefore, in this comparison and collation, each part of the generated feature data of the subject and the face feature data are compared and collated (searched) to obtain a degree of coincidence with each part.

ステップＳ６６で、ＣＰＵ１０の顔認識部１０２は、ステップＳ６４で算出された被写体の特徴データの中に、ステップＳ６３で算出した顔特徴データと所定値以上で一致する部分があるか否かを判断する。つまり、所定の範囲内で顔特徴データと一致する部分があるか否かを判断することになる。この一致する部分とは、被写体の特徴データの中の１部であってもよいし、被写体の特徴データ全部であってもよい。要は、顔特徴データと所定値以上で一致するものがあるかを判断できればよい。 In step S66, the face recognition unit 102 of the CPU 10 determines whether or not the subject feature data calculated in step S64 has a portion that matches the face feature data calculated in step S63 at a predetermined value or more. . That is, it is determined whether or not there is a portion that matches the facial feature data within a predetermined range. This matching part may be a part of the feature data of the subject or all of the feature data of the subject. In short, it is only necessary to be able to determine whether there is a match with the face feature data at a predetermined value or more.

ステップＳ６６で、所定値以上で一致する部分があると判断した場合は、ステップＳ６７に進み、ＣＰＵ１０の顔認識部１０２は、該所定値以上で一致する部分を顔と判断し、該部分に対応する、被写体の特徴データの生成元となったフレーム画像データ上の領域を顔領域とする。つまり、フレーム画像データ上の顔領域を検出する。 If it is determined in step S66 that there is a matching part with a predetermined value or more, the process proceeds to step S67, and the face recognition unit 102 of the CPU 10 determines the matching part with the predetermined value or more as a face, and corresponds to the part. An area on the frame image data from which the subject feature data is generated is defined as a face area. That is, the face area on the frame image data is detected.

一方、ステップＳ６６で、所定値以上で一致する部分がないと判断すると、ステップＳ６８に進み、ＣＰＵ１０の顔認識部１０２は、現在設定されている向きが最後の向きであるか否かを判断する。つまり、予め決められた全ての向きが設定されたか否かを判断する。
ステップＳ６８で、設定されている向きが最後の向きでないと判断すると、ステップＳ６９に進み、ＣＰＵ１０の顔認識部１０２は、次の向きに設定してステップＳ６２に戻る。この「次の向き」とは、たとえば、現在設定されている向きから５度、右方向又は左方向に回転させた向きである。ここで、左、右とは、頭の天辺を上とし、顎や喉を下としたときを基準とする。なお、ここでは左右の方向に向きを回転させたが上下方向にも回転させるようにしてもよいし、左右方向と上下方向ともに回転させるようにしてもよい。 On the other hand, if it is determined in step S66 that there is no matching part equal to or greater than the predetermined value, the process proceeds to step S68, and the face recognition unit 102 of the CPU 10 determines whether or not the currently set direction is the last direction. . That is, it is determined whether all predetermined directions are set.
If it is determined in step S68 that the set orientation is not the last orientation, the process proceeds to step S69, where the face recognition unit 102 of the CPU 10 sets the next orientation and returns to step S62. The “next direction” is, for example, a direction rotated rightward or leftward by 5 degrees from the currently set direction. Here, “left” and “right” are based on the top of the head as the upper side and the chin and throat as the lower side. Although the direction is rotated in the left-right direction here, it may be rotated in the up-down direction, or may be rotated in both the left-right direction and the up-down direction.

一方、ステップＳ６８で、設定されている向きが最後の向きであると判断すると、ステップＳ７０に進み、顔認識部１０２は、撮像されたフレーム画像データ内に顔がないと判断する。
このステップＳ６７で顔領域を検出すると、図５のステップＳ３８で顔を検出することができたと判断し、図３のステップＳ１２、図４のステップＳ２１で顔を検出できたと判断する。また、ステップＳ７０で顔がないと判断すると、図５のステップＳ３９で顔を検出することができないと判断し、図３のステップＳ１２、図４のステップＳ２１で顔を検出できないと判断する。 On the other hand, if it is determined in step S68 that the set direction is the last direction, the process proceeds to step S70, and the face recognition unit 102 determines that there is no face in the captured frame image data.
When the face area is detected in step S67, it is determined that the face can be detected in step S38 of FIG. 5, and it is determined that the face can be detected in step S12 of FIG. 3 and step S21 of FIG. If it is determined in step S70 that there is no face, it is determined that a face cannot be detected in step S39 in FIG. 5, and a face cannot be detected in steps S12 in FIG. 3 and step S21 in FIG.

このように、２つ目の３次元の顔検出処理は、被写体を撮像した１枚のフレーム画像データから算出された被写体の特徴データと、予め記録されている立体顔データから異なる向きから見た被写体の顔の平面画像データを生成していき、該生成された平面画像データから算出された被写体の特徴データと、を比較照合することにより顔を検出するので、顔検出の精度を向上させることができる。
例えば、被写体が犬や猫等の動物であったり、被写体が横を向いている場合であっても検出することができる。
また、１つ目の３次元の顔検出処理のように、撮像された複数のフレーム画像データから被写体の３次元モデルを生成する必要は無いので処理負担を軽減させることができる。 As described above, the second three-dimensional face detection process is viewed from different directions from the subject feature data calculated from one frame image data obtained by imaging the subject and the pre-recorded stereoscopic face data. Since the face is detected by generating the plane image data of the face of the subject and comparing the feature data of the subject calculated from the generated plane image data, the accuracy of face detection is improved. Can do.
For example, even if the subject is an animal such as a dog or a cat, or the subject is facing sideways, it can be detected.
Further, unlike the first three-dimensional face detection process, it is not necessary to generate a three-dimensional model of a subject from a plurality of captured frame image data, so that the processing load can be reduced.

なお、最初は、顔の向きを正面に設定し（ステップＳ６１）、その後、顔が検出されるまで顔の向きを変えて設定するようにしたが（ステップＳ６９）、最初に設定する顔の向きは正面でなくてもよい。要は、顔が検出されるまで異なる向きに設定していくものであればよい。 Initially, the face orientation is set to the front (step S61), and then the face orientation is changed until the face is detected (step S69). May not be the front. In short, it is only necessary to set different orientations until a face is detected.

以上のように第１の実施の形態においては、ユーザによって設定された顔検出モードの種類に応じて、異なる顔検出処理を行うようにしたので、被写体の種類や撮影状況に適した顔検出処理を行うことができ、顔認識の精度を向上させることができる。
さらに、第１の実施の形態においては、ユーザによって設定された顔検出モードの種類に応じて、２次元の顔検出処理を行ったり、３次元の顔検出処理を行うようにしたので、処理負担の大きい３次元の顔検出処理を不必要に行うことがなく、また顔検出の精度を向上させることができる。たとえば、人の顔は平面的なので２次元の顔検出処理を行い、犬等の動物は一般的に立体的な顔なので２次元の顔検出処理では顔を検出することは難しいが、３次元の顔検出を行うことにより、動物の顔も検出することができる。 As described above, in the first embodiment, different face detection processing is performed according to the type of face detection mode set by the user, so that face detection processing suitable for the type of subject and the shooting situation is performed. And the accuracy of face recognition can be improved.
Furthermore, in the first embodiment, the two-dimensional face detection process or the three-dimensional face detection process is performed according to the type of the face detection mode set by the user. The three-dimensional face detection process having a large size is not performed unnecessarily, and the accuracy of face detection can be improved. For example, since a human face is flat, a two-dimensional face detection process is performed, and since an animal such as a dog is generally a three-dimensional face, it is difficult to detect a face by a two-dimensional face detection process. By performing face detection, an animal's face can also be detected.

また、２次元の顔検出処理により人の顔が検出されない場合は、３次元の顔検出処理を行うようにしたので、たとえば、人が横を向いていたり、マスクやサングラス等を装着している場合であっても人の顔を検出することができる。
また、ユーザが顔検出モードを設定するので、検出したい種類の被写体の顔のみを検出することができる。 In addition, when a human face is not detected by the two-dimensional face detection process, the three-dimensional face detection process is performed. For example, the person is facing sideways or wearing a mask or sunglasses. Even in such a case, a human face can be detected.
In addition, since the user sets the face detection mode, only the face of the type of subject to be detected can be detected.

また、被写体追従モードがＯｎの場合は、設定された顔検出モードの種類に応じて、２次元の顔検出処理や３次元の顔検出処理を行い、該顔が検出されると、２次元的な顔検出処理により該検出された顔を検出していき、追従している顔を見失うと再び、設定された顔検出モードの種類に応じて、２次元の顔検出処理や３次元の顔検出処理を行うので、追従処理の負担を軽減することができると共に、追従処理の精度を向上させることができる。 When the subject tracking mode is On, two-dimensional face detection processing or three-dimensional face detection processing is performed according to the set face detection mode type. If the detected face is detected by a simple face detection process and the face being followed is lost, a two-dimensional face detection process or a three-dimensional face detection is performed again according to the type of the set face detection mode. Since the processing is performed, it is possible to reduce the load of the tracking process and improve the accuracy of the tracking process.

［第２の実施の形態］
次に第２の実施の形態について説明する。
第１の実施の形態においては、ユーザによって選択された顔検出モードの種類に設定するようにしたが、第２の実施の形態においては、自動的に顔検出モードの種類を設定するというものである。 [Second Embodiment]
Next, a second embodiment will be described.
In the first embodiment, the type of the face detection mode selected by the user is set. However, in the second embodiment, the type of the face detection mode is automatically set. is there.

Ｅ.デジタルカメラ１の動作
第２の実施の形態も、図１に示したものと同様の構成を有するデジタルカメラ１を用いることにより本発明の撮像装置を実現する。 E. Operation of Digital Camera 1 In the second embodiment, the image pickup apparatus of the present invention is realized by using the digital camera 1 having the same configuration as that shown in FIG.

以下、第２の実施の形態のデジタルカメラ１の動作を図９及び図１０のフローチャートにしたがって説明する。
ユーザのモード切替キーの操作により、顔検出静止画撮影モードに設定されると、ステップＳ１０１で、ＣＰＵ１０の顔検出モード設定部１０１は、人用の顔検出モードに設定する。
次いで、ステップＳ１０２で、ＣＰＵ１０は、ＣＣＤ５による撮像を開始して、スルー画像表示を開始させる。
次いで、ステップＳ１０３で、ＣＰＵ１０は、被写体追従Ｏｎ／Ｏｆｆキーの操作が行なわれたか否かを判断する。 Hereinafter, the operation of the digital camera 1 according to the second embodiment will be described with reference to the flowcharts of FIGS. 9 and 10.
When the face detection still image shooting mode is set by the user's operation of the mode switching key, the face detection mode setting unit 101 of the CPU 10 sets the human face detection mode in step S101.
Next, in step S102, the CPU 10 starts imaging by the CCD 5 and starts through image display.
Next, in step S103, the CPU 10 determines whether or not the subject tracking On / Off key has been operated.

ステップＳ１０３で、被写体追従キーの操作が行われたと判断すると、ステップＳ１０４に進み、ＣＰＵ１０は、現在の被写体追従モードがＯｎであるか否かを判断する。
ステップＳ１０４で、現在の被写体追従モードがＯｎでない、つまり、Ｏｆｆであると判断すると、ステップＳ１０５に進み、ＣＰＵ１０は、被写体追従モードをＯｆｆからＯｎに切り替えて、ステップＳ１０７に進む。 If it is determined in step S103 that the subject tracking key has been operated, the process proceeds to step S104, and the CPU 10 determines whether or not the current subject tracking mode is On.
If it is determined in step S104 that the current subject tracking mode is not On, that is, it is Off, the process proceeds to step S105, and the CPU 10 switches the subject tracking mode from Off to On and proceeds to step S107.

一方、ステップＳ１０４で、現在の被写体追従モードがＯｎであると判断すると、ステップＳ１０６に進み、ＣＰＵ１０は、被写体追従モードをＯｎからＯｆｆに切り替えて、ステップＳ１０７に進む。
また、ステップＳ１０３で、被写体追従Ｏｎ／Ｏｆｆキーの操作が行なわれていないと判断すると、そのままステップＳ１０７に進む。 On the other hand, if it is determined in step S104 that the current subject tracking mode is On, the process proceeds to step S106, and the CPU 10 switches the subject tracking mode from On to Off and proceeds to step S107.
If it is determined in step S103 that the subject tracking On / Off key is not operated, the process directly proceeds to step S107.

ステップＳ１０７に進むと、ＣＰＵ１０は、ユーザによってシャッタボタンが半押しされたか否かを判断する。
ステップＳ１０７で、シャッタボタンが半押しされていないと判断するとステップＳ１０３に戻る。 In step S107, the CPU 10 determines whether or not the shutter button is half-pressed by the user.
If it is determined in step S107 that the shutter button is not half-pressed, the process returns to step S103.

一方、ステップＳ１０７で、シャッタボタンが半押しされたと判断すると、ステップＳ１０８に進み、ＣＰＵ１０の顔認識部１０２は、現在設定されている顔検出モードの種類に応じた顔検出処理を行う。
このステップＳ１０８の動作は、上記第１の実施の形態で説明した、図３のステップＳ１０の動作、つまり、図５に示す動作と同様の動作を行なう。この顔検出処理により設定された顔検出モードの種類の顔が検出されることになる。ここで、ステップＳ１０１で人用の顔検出モードが設定されているので、顔検出処理により人の顔が検出されることになる。 On the other hand, if it is determined in step S107 that the shutter button has been half-pressed, the process proceeds to step S108, where the face recognition unit 102 of the CPU 10 performs face detection processing according to the type of the currently set face detection mode.
The operation in step S108 is the same as the operation in step S10 in FIG. 3 described in the first embodiment, that is, the operation shown in FIG. The face of the face detection mode type set by this face detection process is detected. Here, since the human face detection mode is set in step S101, a human face is detected by the face detection process.

顔検出処理を行うと、ステップＳ１０９に進み、ＣＰＵ１０は、ステップＳ１０８の顔検出処理より顔を検出することができたか否かを判断する。
ステップＳ１０９で、顔を検出することができなかったと判断すると、ステップＳ１１０に進み、ＣＰＵ１０は、顔検出モード設定部１０１により未だ設定されていない顔検出モードがあるか否かを判断する。 When face detection processing is performed, the process proceeds to step S109, and the CPU 10 determines whether or not a face has been detected by the face detection processing in step S108.
If it is determined in step S109 that a face could not be detected, the process proceeds to step S110, and the CPU 10 determines whether there is a face detection mode that has not yet been set by the face detection mode setting unit 101.

ステップＳ１１０で、設定されていない顔検出モードがあると判断すると、ステップＳ１１１に進み、ＣＰＵ１０の顔検出モード設定部１０１は、未だ設定されていない種類の顔検出モードに設定して、ステップＳ１０８に戻る。
ここでは、まだ人用の顔検出モードしか設定されていないので、動物用の顔検出モード（犬用の顔検出モード、猫用の顔検出モード等のうち何れかの動物用の顔検出モード）が設定されることになる。 If it is determined in step S110 that there is a face detection mode that has not been set, the process proceeds to step S111, where the face detection mode setting unit 101 of the CPU 10 sets a face detection mode of a type that has not yet been set, and the process proceeds to step S108. Return.
Here, since only the face detection mode for humans is set, the face detection mode for animals (the face detection mode for any one of the face detection mode for dogs, the face detection mode for cats, etc.) Will be set.

一方、ステップＳ１０９で、顔を検出することができたと判断すると、ステップＳ１１２に進み、ＣＰＵ１０は、現在の被写体追従モードがＯｎであるか否かを判断する。
ステップＳ１１２で、被写体追従モードがＯｎでない、つまり、Ｏｆｆであると判断すると、ステップＳ１１３に進み、ＣＰＵ１０は、該検出した顔に対してコントラスト検出方式によるＡＦ処理を行って、ステップＳ１１５に進む。
一方、ステップＳ１１０で、設定されていない顔検出モードがないと判断すると、ステップＳ１１４に進み、ＣＰＵ１０は、所定の領域に対してコントラスト検出方式によるＡＦ処理を行って、ステップＳ１１５に進む。 On the other hand, if it is determined in step S109 that the face has been detected, the process proceeds to step S112, and the CPU 10 determines whether or not the current subject tracking mode is On.
If it is determined in step S112 that the subject tracking mode is not On, that is, it is Off, the process proceeds to step S113, and the CPU 10 performs an AF process using the contrast detection method on the detected face, and then proceeds to step S115.
On the other hand, if it is determined in step S110 that there is no face detection mode that has not been set, the process proceeds to step S114, and the CPU 10 performs an AF process using a contrast detection method on a predetermined area, and then proceeds to step S115.

ステップＳ１１５に進むと、ＣＰＵ１０は、ユーザによってシャッタボタンが全押しされたか否かを判断する。
ステップＳ１１５で、シャッタボタンが全押しされていないと判断すると、全押しされるまでステップＳ１１５に留まり、シャッタボタンが全押しされたと判断すると、ステップＳ１１６に進み、ＣＰＵ１０は、静止画撮影処理を行い、該撮影処理により得られた静止画像データを圧縮してフラッシュメモリ１４に記録する処理を行う。 In step S115, the CPU 10 determines whether or not the shutter button has been fully pressed by the user.
If it is determined in step S115 that the shutter button is not fully pressed, the process stays in step S115 until the shutter button is fully pressed. If it is determined that the shutter button is fully pressed, the process proceeds to step S116, and the CPU 10 performs still image shooting processing. Then, a process of compressing the still image data obtained by the photographing process and recording it in the flash memory 14 is performed.

また、ステップＳ１１２で、被写体追従モードがＯｎであると判断すると、図１０のステップＳ１２１に進み、ＣＰＵ１０の顔認識部１０２は、該検出された顔を２次元的に検出していくことにより、該検出された顔に対して追従していく追従処理を開始する。こで、２次元的に顔を検出するとは、２次元のデータのみを用いて顔検出することを指し、このステップＳ１２１での、２次元的に顔を検出する処理は、ブロックマッチング法などを用いて該検出された顔を検出していく。 If it is determined in step S112 that the subject tracking mode is On, the process proceeds to step S121 in FIG. 10, and the face recognition unit 102 of the CPU 10 detects the detected face two-dimensionally. A follow-up process for following the detected face is started. Here, detecting a face two-dimensionally means detecting a face using only two-dimensional data, and the process of detecting a face two-dimensionally in this step S121 uses a block matching method or the like. The detected face is used for detection.

次いで、ステップＳ１２２で、ＣＰＵ１０は、ユーザによってシャッタボタンが全押しされたか否かを判断する。
ステップＳ１２２で、シャッタボタンが全押しされていないと判断すると、ステップＳ１２３に進み、ＣＰＵ１０は、該追従している顔を見失ったか否かを判断する。つまり、ステップＳ１２１の２次元的に顔を検出する処理により顔を検出することができなくなったか否かを判断する。 Next, in step S122, the CPU 10 determines whether or not the shutter button has been fully pressed by the user.
If it is determined in step S122 that the shutter button has not been fully pressed, the process proceeds to step S123, and the CPU 10 determines whether or not the face being followed has been lost. That is, it is determined whether the face cannot be detected by the two-dimensional face detection process in step S121.

ステップＳ１２３で、顔を見失っていないと判断するとステップＳ１２２に戻り、ステップＳ１２３で、顔を見失ったと判断すると、図９のステップＳ１０８に戻る。これにより、顔を見失った場合は再び現在設定されている顔検出モードの種類に応じた顔検出処理を行うことになる。 If it is determined in step S123 that the face is not lost, the process returns to step S122. If it is determined in step S123 that the face is lost, the process returns to step S108 in FIG. As a result, when the face is lost, face detection processing corresponding to the type of the currently set face detection mode is performed again.

一方、ステップＳ１２２で、シャッタボタンが全押しされたと判断すると、ステップＳ１２４に進み、ＣＰＵ１０は、該追従している顔に対してコントラスト検出方式によるＡＦ処理を行う。つまり、シャッタボタンが全押しされた直前に２次元的に検出された顔に対してＡＦ処理を行う。
次いで、ステップＳ１２５で、ＣＰＵ１０は、静止画撮影処理を行い、該撮影処理により得られた静止画像データを圧縮してフラッシュメモリ１４に記録する処理を行う。 On the other hand, if it is determined in step S122 that the shutter button has been fully pressed, the process proceeds to step S124, and the CPU 10 performs an AF process using the contrast detection method on the following face. That is, AF processing is performed on a face that is two-dimensionally detected immediately before the shutter button is fully pressed.
Next, in step S 125, the CPU 10 performs still image shooting processing, and performs processing for compressing still image data obtained by the shooting processing and recording it in the flash memory 14.

以上のように、第２の実施の形態においては、自動的に顔検出モードを設定するので、ユーザの手間が省ける。また、撮影したい被写体の種類が具体的にわからない場合（例えば、猫なのか狐なのかわからない場合）であっても、該被写体の顔を検出することができ、顔検出の精度を向上させることができる。
また、設定された顔検出モードの種類に応じて、２次元の顔検出処理を行ったり、３次元の顔検出処理を行うようにしたので、処理負担の大きい３次元の顔検出処理を不必要に行うことがなく、また顔検出の精度を向上させることができる。たとえば、人の顔は平面的なので２次元の顔検出処理を行い、犬等の動物は一般的に立体的な顔なので２次元の顔検出処理では顔を検出することは難しいが、３次元の顔検出を行うことにより、動物の顔も検出することができる。 As described above, in the second embodiment, since the face detection mode is automatically set, the user's trouble can be saved. Further, even when the type of subject to be photographed is not specifically known (for example, when it is not known whether the subject is a cat or not), the face of the subject can be detected, and the accuracy of face detection can be improved. it can.
Further, since the two-dimensional face detection process or the three-dimensional face detection process is performed according to the set face detection mode type, the three-dimensional face detection process with a large processing load is unnecessary. Therefore, the accuracy of face detection can be improved. For example, since a human face is flat, a two-dimensional face detection process is performed, and since an animal such as a dog is generally a three-dimensional face, it is difficult to detect a face by a two-dimensional face detection process. By performing face detection, an animal's face can also be detected.

また、２次元の顔検出処理により人の顔が検出されない場合は、３次元の顔検出処理を行うようにしたので、たとえば、人が横を向いていたり、マスクやサングラス等を装着している場合であっても人の顔を検出することができる。 In addition, when a human face is not detected by the two-dimensional face detection process, the three-dimensional face detection process is performed. For example, the person is facing sideways or wearing a mask or sunglasses. Even in such a case, a human face can be detected.

また、被写体追従モードがＯｎの場合は、設定された検出モードの種類に応じて、２次元の顔検出処理や３次元の顔検出処理を行い、該顔が検出されると、２次元的な顔検出処理により該検出された顔を検出していき、追従している顔を見失うと再び、設定された検出モードの種類に応じて、２次元の顔検出処理や３次元の顔検出処理を行うので、追従処理の負担を軽減することができると共に、追従処理の精度を向上させることができる。 Further, when the subject tracking mode is On, a two-dimensional face detection process or a three-dimensional face detection process is performed according to the set detection mode type. If the detected face is detected by the face detection process and the face being followed is lost, a two-dimensional face detection process or a three-dimensional face detection process is performed again according to the set detection mode type. As a result, the burden of the tracking process can be reduced, and the accuracy of the tracking process can be improved.

なお、顔検出静止画撮影モードに設定すると、最初に人用の顔検出モードを設定するようにしたが（ステップＳ１０１）、他の種類（動物の種類）の顔検出モードに設定するようにしてもよい。
また、最初はユーザが任意の種類の顔検出モードを設定し、該設定した顔検出モードの種類の顔が検出されない場合に、自動的に設定されていない種類の顔検出モードに設定するようにしてもよい。 When the face detection still image shooting mode is set, the human face detection mode is set first (step S101), but the face detection mode is set to another type (animal type). Also good.
In addition, when a user sets an arbitrary type of face detection mode at first and a face of the set face detection mode is not detected, the face detection mode is not automatically set. May be.

［第３の実施の形態］
次に第３の実施の形態について説明する。
上記第１、２の実施の形態においては、単に顔を検出するというものであったが、第３の実施の形態においては、顔検出を、撮影された顔が誰であるか？何であるか具体的に識別する顔識別の場合に適用するというものである。
顔検出処理では、たとえば、人の顔であっても具体的に誰の顔であるかを識別することはできず、また、動物、たとえば、猫の顔であっても、ペルシャ猫、三毛猫等、猫の種類は様々であり、また、同じ種類の猫であっても猫毎に微妙に顔が異なる場合もある。したがって、顔検出では、自分の友達や子供のみ、自分が飼っているペットのみを検出して欲しいのに（たとえば、フォーカス対象としたいのに）、他の人や動物の顔まで検出されてしまうからである。 [Third Embodiment]
Next, a third embodiment will be described.
In the first and second embodiments, the face is simply detected. In the third embodiment, who is the photographed face in the face detection? This method is applied to the case of face identification that specifically identifies what is.
In the face detection process, for example, it is not possible to identify the face of a person even if it is a person's face, and even if it is an animal, for example, the face of a cat, a Persian cat, a calico There are various types of cats such as cats, and even a cat of the same type may have a slightly different face. Therefore, in face detection, I want to detect only my friends and children, and only my pets (for example, I want to focus), but it also detects faces of other people and animals Because.

この顔識別は、予め記録されている人物の顔、動物の顔が撮像されたフレーム画像データの中にあるかを識別することになる。
この人物や動物の顔は、登録モード等においてユーザが任意の人物、任意の種類の動物の顔を登録することができる。立体顔データ（３次元の顔データ）を登録する場合は、登録したい顔を異なる角度から複数撮影し、該撮影された複数のフレーム画像データから３次元モデルを生成して登録する。また、２次元の顔データを登録する場合は、登録したい顔を撮影し、該撮影されたフレーム画像データから該顔の特徴データ（顔特徴データ）を算出して登録する。 In this face identification, it is identified whether the face of a person and the face of an animal recorded in advance are in the captured frame image data.
As for the face of the person or animal, the user can register the face of any person or any kind of animal in the registration mode or the like. When registering three-dimensional face data (three-dimensional face data), a plurality of faces to be registered are photographed from different angles, and a three-dimensional model is generated and registered from the plurality of photographed frame image data. When registering two-dimensional face data, the face to be registered is photographed, and the feature data (face feature data) of the face is calculated and registered from the photographed frame image data.

ここで、登録される立体顔データや２次元の顔データは、人や犬、猫等の種類の顔であることが大まかに識別できる程度の情報量ではなく、誰であるか等具体的に識別できる程度の情報量が必要である。この登録により、被写体の種類毎に図２に示すような顔識別用の顔データテーブルが作成される。ここでも、図２と同様に、人の顔データは２次元と３次元の両方が登録されており、動物の顔データは３次元のみが登録されているものとする。なお、顔識別用の顔データテーブルには、被写体の種類毎に複数の顔データが登録可能である。たとえば、複数人の２次元の顔データ、立体顔データを記録することができるし、複数匹の猫の立体顔データを記録することも可能である。 Here, the registered three-dimensional face data and two-dimensional face data are not the amount of information that can roughly identify the type of face such as a person, a dog, or a cat. An amount of information that can be identified is necessary. By this registration, a face data table for face identification as shown in FIG. 2 is created for each type of subject. Here, as in FIG. 2, it is assumed that both two-dimensional and three-dimensional human face data are registered, and only three-dimensional animal face data is registered. In the face data table for face identification, a plurality of face data can be registered for each type of subject. For example, two-dimensional face data and three-dimensional face data of a plurality of persons can be recorded, and three-dimensional face data of a plurality of cats can be recorded.

この顔検出処理の動作を顔識別処理に適用した場合の動作は、上記各実施の形態で説明した動作とほぼ同じであるが異なる点だけを説明する。
まず、ここでは、顔検出モードに代えて顔識別モードを備え、この顔識別モードも顔検出モードと同様に、被写体の種類毎にある。たとえば、人用の顔識別モード、犬用の顔識別モード、猫用の顔識別モードといった具合である。 The operation when this face detection processing operation is applied to the face identification processing is substantially the same as the operation described in each of the above embodiments, but only the differences will be described.
First, here, a face identification mode is provided instead of the face detection mode, and this face identification mode is provided for each type of subject as in the face detection mode. For example, the face identification mode for humans, the face identification mode for dogs, and the face identification mode for cats.

Ｆ．顔識別静止画撮影モードの動作について
Ｆ−１．図３及び図４に示す顔検出静止画撮影モードの動作を、顔識別静止画撮影モードに適用したときの動作について
このときの顔識別静止画撮影モードの動作を、図３及び図４を引用し、異なる部分だけを説明する。 F. Operation in face identification still image shooting mode F-1. Operation when the face detection still image shooting mode operation shown in FIGS. 3 and 4 is applied to the face identification still image shooting mode The operation of the face identification still image shooting mode at this time is cited with reference to FIGS. Only the different parts will be explained.

まず、図３のステップＳ１は複数種類の顔識別モードを一覧表示させ、ステップＳ２で、顔識別モードが選択されたと判断すると、ステップＳ３で、該選択された顔識別モードに設定する。また、ステップＳ１０は、設定されている顔識別モードに応じた顔識別処理を行い、ステップＳ１２は、顔が識別できたか否かを判断し、顔が識別できたと判断すると、ステップＳ１３で、該識別した顔に対してＡＦ処理を行い、顔が識別できていないと判断すると、ステップＳ１４で所定の領域に対してＡＦ処理を行う。この設定されている顔識別モードに応じた顔識別処理は後で説明する。 First, step S1 in FIG. 3 displays a list of a plurality of types of face identification modes. If it is determined in step S2 that the face identification mode is selected, the selected face identification mode is set in step S3. Step S10 performs face identification processing according to the set face identification mode. Step S12 determines whether or not the face has been identified. If it is determined that the face has been identified, step S13 AF processing is performed on the identified face, and if it is determined that the face cannot be identified, AF processing is performed on a predetermined area in step S14. The face identification process according to the set face identification mode will be described later.

なお、顔識別処理により顔も検出することできるようにする場合は、ステップＳ１２で、顔が識別できないと判断した場合であって、顔が検出されたと判断した場合は、該検出された顔に対してＡＦ処理を行い、顔も検出できていないと判断された場合は、ステップＳ１４に進み、所定の領域に対してＡＦ処理を行うようにしてもよい。この顔識別処理による顔検出は後で説明する。 If the face can be detected by the face identification process, it is determined in step S12 that the face cannot be identified, and if it is determined that the face has been detected, the detected face is changed to the detected face. On the other hand, if it is determined that the AF process is performed and the face cannot be detected, the process may proceed to step S14 to perform the AF process on a predetermined area. Face detection by this face identification process will be described later.

また、ステップＳ２１は、顔が識別できたか否かを判断し、顔が識別できない場合はステップＳ２３に進み、顔が識別できた場合は、ステップＳ２２に進み、該識別された顔に対して追従処理を開始して、ステップＳ２３に進む。この追従処理は上記第１の実施の形態で説明したように、該識別された顔を２次元的に検出していくことにより、該検出された顔に対して追従していく。 In step S21, it is determined whether or not the face can be identified. If the face cannot be identified, the process proceeds to step S23. If the face can be identified, the process proceeds to step S22 and follows the identified face. The process is started and the process proceeds to step S23. As described in the first embodiment, this follow-up process follows the detected face by detecting the identified face two-dimensionally.

Ｆ−２．図９及び図１０に示す顔検出静止画撮影モードの動作を、顔識別静止画撮影モードに適用したときの動作について
このときの顔識別静止画撮影モードの動作を、図９及び図１０を引用し、異なる部分だけを説明する。 F-2. Operation when the face detection still image shooting mode operation shown in FIGS. 9 and 10 is applied to the face identification still image shooting mode The operation of the face identification still image shooting mode at this time is cited with reference to FIGS. Only the different parts will be explained.

図９のステップＳ１０１は、人用の顔識別モードに設定する。また、ステップＳ１０８は、設定されている顔識別モードに応じた顔識別処理を行い、ステップＳ１０９は、顔が識別できたか否かを判断する。ステップＳ１０９で、顔が識別できていないと判断すると、ステップＳ１１０で、設定されていない顔識別モードがあるか否かを判断する。ステップＳ１１０で、設定されていない顔識別モードがあると判断すると、ステップＳ１１１で、設定されていない種類の顔識別モードに設定してステップＳ１０８に戻る。また、ステップＳ１０９で、顔が識別できたと判断し、ステップＳ１１２で被写体追従モードがＯｎでないと判断すると、ステップＳ１１３で、該顔識別した顔に対してＡＦ処理を行い、一方、ステップＳ１１０で、設定されていない顔識別モードがないと判断すると、ステップＳ１１４で所定の領域に対してＡＦ処理を行う。この設定されている顔識別モードに応じた顔識別処理は後で説明する。 Step S101 in FIG. 9 sets the human face identification mode. Further, step S108 performs face identification processing according to the set face identification mode, and step S109 determines whether or not a face has been identified. If it is determined in step S109 that the face cannot be identified, it is determined in step S110 whether there is a face identification mode that is not set. If it is determined in step S110 that there is a face identification mode that is not set, in step S111, the face identification mode is set to a type that is not set, and the process returns to step S108. If it is determined in step S109 that the face has been identified and it is determined in step S112 that the subject tracking mode is not On, AF processing is performed on the face identified in step S113, while in step S110, If it is determined that there is no face identification mode that has not been set, AF processing is performed on a predetermined area in step S114. The face identification process according to the set face identification mode will be described later.

なお、顔識別処理により顔も検出することできるようにする場合は、ステップＳ１１０で、設定されていない顔識別モードがないと判断した場合は、顔が検出されたか否かを判断し、顔が検出された場合は該検出された顔に対してＡＦ処理を行い、顔も検出できていないと判断された場合は、ステップＳ１１４に進み、所定の領域に対してＡＦ処理を行うようにしてもよい。この顔識別処理による顔検出は後で説明する。 If it is determined that there is no face identification mode that is not set in step S110, it is determined whether or not a face has been detected. If it is detected, AF processing is performed on the detected face. If it is determined that no face is detected, the process proceeds to step S114, and AF processing is performed on a predetermined area. Good. Face detection by this face identification process will be described later.

また、ステップＳ１１２で被写体追従がＯｎであると判断して、図１０のステップＳ１２１に進むと、該識別された顔に対して追従処理を開始する。この追従処理は上記第１の実施の形態で説明したように、該識別された顔を２次元的に検出していくことにより、該検出された顔に対して追従していく。 If it is determined in step S112 that the subject tracking is On and the process proceeds to step S121 in FIG. 10, a tracking process is started for the identified face. As described in the first embodiment, this follow-up process follows the detected face by detecting the identified face two-dimensionally.

Ｇ．設定されている顔識別モードに応じた顔識別処理の動作について
Ｇ−１．図５に示す設定されている顔検出モードに応じた顔検出処理の動作を、設定されている顔識別モードに応じた顔識別処理に適用したときの動作について
このときの顔識別処理の動作を、図５を引用し、異なる部分だけを説明する。
まず、図５のステップＳ３１は、人用の顔識別モードが設定されているか否かを判断し、人用の顔識別モードが設定されていると判断すると、ステップＳ３２に進み、２次元の顔識別処理により人の顔を検出する。この２次元の顔識別処理は、撮像されたフレーム画像データから２次元の全被写体の特徴データを算出し、該算出した被写体の特徴データと、顔識別用の顔データテーブルに登録されている人の２次元の顔データとを比較照合することにより、撮像されたフレーム画像データ内に登録されている人物の顔があるか顔識別する。 G. Regarding the operation of face identification processing according to the set face identification mode G-1. About the operation when the face detection processing operation according to the set face detection mode shown in FIG. 5 is applied to the face identification processing according to the set face identification mode The operation of the face identification processing at this time is Referring to FIG. 5, only the different parts will be described.
First, in step S31 in FIG. 5, it is determined whether or not the human face identification mode is set. If it is determined that the human face identification mode is set, the process proceeds to step S32, and the two-dimensional face is determined. A human face is detected by the identification process. In this two-dimensional face identification process, feature data of all two-dimensional subjects is calculated from the captured frame image data, and the subject feature data calculated and the person registered in the face data table for face identification are calculated. The two-dimensional face data is compared and collated to identify whether there is a person's face registered in the captured frame image data.

そして、ステップＳ３３で、２次元の顔識別処理により登録されている人物の顔が識別されたか否かを判断する。ステップＳ３３で顔を識別することができたと判断すると、ステップＳ３８に進み、顔が識別できたと判断する。一方、ステップＳ３３で顔が識別できないと判断した場合、ステップＳ３１で人用の顔識別モードが設定されていないと判断した場合は、ステップＳ３６に進み、全領域に基づいて３次元の顔識別処理により現在設定されている顔識別モードの種類の顔を顔識別する処理を行って、ステップＳ３７に進む。 In step S33, it is determined whether or not the registered human face has been identified by the two-dimensional face identification process. If it is determined in step S33 that the face has been identified, the process proceeds to step S38, and it is determined that the face has been identified. On the other hand, if it is determined in step S33 that the face cannot be identified, or if it is determined in step S31 that the human face identification mode is not set, the process proceeds to step S36, and a three-dimensional face identification process is performed based on the entire region. Thus, the face identification mode currently set is performed, and the process proceeds to step S37.

そして、ステップＳ３７では３次元の顔識別処理により登録された顔があるか、つまり、３次元の顔識別により顔識別できたか否かを判断する。ステップＳ３７で登録された顔があると判断するとステップＳ３８で顔識別することができたと判断し、ステップＳ３７で登録された顔がないと判断するとステップＳ３９で顔識別することができないと判断する。
このステップＳ３８で顔識別できたと判断すると、図３のステップＳ１２、図４のステップＳ２１、図９のステップＳ１０９で顔識別できたと判断し、ステップＳ３９で顔識別することができないと判断するとステップＳ１２、ステップＳ２１、ステップＳ１０９で顔識別することができないと判断する。
このときは、図５のステップＳ３４、ステップＳ３５は不要ということになる。 In step S37, it is determined whether there is a face registered by the three-dimensional face identification process, that is, whether the face can be identified by the three-dimensional face identification. If it is determined in step S37 that there is a registered face, it is determined in step S38 that the face can be identified. If it is determined in step S37 that there is no registered face, it is determined in step S39 that the face cannot be identified.
If it is determined that the face can be identified in step S38, it is determined that the face can be identified in step S12 in FIG. 3, step S21 in FIG. 4, and step S109 in FIG. 9, and if it is determined in step S39 that the face cannot be identified, step S12. In step S21 and step S109, it is determined that the face cannot be identified.
In this case, step S34 and step S35 in FIG. 5 are not necessary.

Ｇ−２．他の方法による設定されている顔識別モードに応じた顔識別処理の動作について
このときの顔識別処理の動作を、図１１のフローチャートにしたがって説明する。なお、この場合は、顔識別用の顔データテーブルの他に、顔検出用の人の２次元の顔データがメモリ１２に記録されている。
図３のステップＳ１０、図９のステップＳ１０８に進むと、図１１のステップＳ１５１に進み、ＣＰＵ１０の顔認識部１０２は、現在人用の顔識別モードが設定されている否かを判断する。 G-2. Operation of face identification processing according to the face identification mode set by another method The operation of face identification processing at this time will be described with reference to the flowchart of FIG. In this case, in addition to the face data table for face identification, two-dimensional face data of a person for face detection is recorded in the memory 12.
When the process proceeds to step S10 in FIG. 3 and step S108 in FIG. 9, the process proceeds to step S151 in FIG. 11, and the face recognition unit 102 of the CPU 10 determines whether or not the face identification mode for humans is currently set.

ステップＳ１５１で、人用の顔識別モードが設定されている判断すると、ステップＳ１５２に進み、ＣＰＵ１０の顔認識部１０２は、撮像されたフレーム画像データの全領域に基づいて２次元の顔検出処理により人の顔を検出する。
ここでの、２次元の顔検出処理は、撮像されたフレーム画像データから被写体の２次元の特徴データ（全被写体の２次元の特徴データ）を算出し、該算出した被写体の特徴データと、メモリ１２に記録されている顔検出用の人の２次元の顔データとを比較照合することにより、画像内にある人の顔を検出する。ここで、記録されている顔検出用の顔特徴データは、人の顔であることが検出できれば十分なので、具体的に誰の顔であるかを識別できる程度の情報量は必要ない。また、算出される全被写体の特徴データも同様に、人の顔を検出するのに必要な情報量でよい。 If it is determined in step S151 that the human face identification mode is set, the process proceeds to step S152, where the face recognition unit 102 of the CPU 10 performs two-dimensional face detection processing based on the entire area of the captured frame image data. Detect human faces.
In this two-dimensional face detection process, two-dimensional feature data of the subject (two-dimensional feature data of all subjects) is calculated from the captured frame image data, and the calculated feature data of the subject and the memory The face of the person in the image is detected by comparing and collating the two-dimensional face data of the person for face detection recorded in FIG. Here, it is sufficient that the recorded face feature data for face detection can be detected as a human face, and therefore it is not necessary to have an amount of information that can specifically identify who the face is. Similarly, the calculated feature data of all subjects may be the amount of information necessary to detect a human face.

ステップＳ１５３で、ＣＰＵ１０の顔認識部１０２は、画像内に人の顔があるか否かを判断する。つまり、２次元の顔検出処理により人の顔を検出することができたか否かを判断する。即ち、算出した全被写体の特徴データの中に、顔データと所定値以上で（所定の範囲内で）一致する部分があるか否かを判断する。 In step S153, the face recognition unit 102 of the CPU 10 determines whether there is a human face in the image. That is, it is determined whether or not a human face can be detected by the two-dimensional face detection process. That is, it is determined whether or not the calculated feature data of all subjects includes a portion that matches the face data at a predetermined value or more (within a predetermined range).

ステップＳ１５３で、人の顔があると判断すると、ステップＳ１５４に進み、ＣＰＵ１０の顔認識部１０２は、該検出された顔領域に基づいて２次元の顔識別処理を行う。つまり、撮像されたフレーム画像データのうち、該検出された顔領域の画像データから被写体の２次元の特徴データ（顔特徴データ）を算出し、該算出した顔特徴データと、顔識別用の顔データテーブルに登録されている人の２次元の顔データとを比較照合することにより、該登録されている人物であるかを顔識別する。なお、この登録されている人物の顔データは、人の顔であることが大まかに識別できる程度の情報量ではなく、具体的に誰の顔であるかを識別できる程度の情報量であることはもちろんのこと、算出される被写体の特徴データ（顔特徴データ）も誰の顔であるか具体的に識別できる程度の情報量である。
この検出された顔領域に基いて２次元の顔識別処理を行うので、処理負担を軽減させることができるとともに、顔識別の精度を向上させることができる。 If it is determined in step S153 that there is a human face, the process proceeds to step S154, and the face recognition unit 102 of the CPU 10 performs a two-dimensional face identification process based on the detected face area. That is, of the captured frame image data, two-dimensional feature data (face feature data) of the subject is calculated from the image data of the detected face region, and the calculated face feature data and the face for face identification are calculated. By comparing and collating with the two-dimensional face data of a person registered in the data table, it is identified whether the person is a registered person. The registered face data of a person is not the amount of information that can roughly identify a person's face, but the amount of information that can be used to specifically identify who the face is. Needless to say, the calculated feature data (face feature data) of the subject is an amount of information that can specifically identify the face of the subject.
Since the two-dimensional face identification process is performed based on the detected face area, the processing burden can be reduced and the accuracy of the face identification can be improved.

次いで、ステップＳ１５５で、ＣＰＵ１０の顔認識部１０２は、登録された顔があるか否かを判断する。つまり、２次元の顔識別処理により登録されている人物の顔が識別できたか否かを判断する。
ステップＳ１５５で、登録された顔があると判断すると、ステップＳ１６１に進み、ＣＰＵ１０の顔認識部１０２は、顔を識別することができたと判断する。
一方、ステップＳ１５５で、登録された顔がないと判断すると、ステップＳ１５６に進み、顔認識部１０２は、該検出された顔領域に基づいて、３次元の顔識別処理により人の顔を識別する処理を行って、ステップＳ１６０に進む。この３次元の顔識別処理は後で説明する。
この検出された顔領域に基いて３次元の顔識別処理を行うので、処理負担を軽減させることができるとともに、顔識別の精度を向上させることができる。 Next, in step S155, the face recognition unit 102 of the CPU 10 determines whether there is a registered face. That is, it is determined whether or not the registered person's face can be identified by the two-dimensional face identification process.
If it is determined in step S155 that there is a registered face, the process proceeds to step S161, and the face recognition unit 102 of the CPU 10 determines that the face has been identified.
On the other hand, if it is determined in step S155 that there is no registered face, the process proceeds to step S156, where the face recognition unit 102 identifies a human face by three-dimensional face identification processing based on the detected face area. Processing is performed, and the process proceeds to step S160. This three-dimensional face identification process will be described later.
Since the three-dimensional face identification process is performed based on the detected face area, the processing burden can be reduced and the accuracy of the face identification can be improved.

一方、ステップＳ１５３で、人の顔がないと判断すると、ステップＳ１５７に進み、顔らしきものがあるか否かを判断する。この顔らしきものとは、上記第１の実施の形態で説明したように、ステップＳ１５２の２次元の顔検出処理による全被写体の特徴データと、人の２次元の顔データとの比較照合により、口と鼻はあるが片目、若しくは両目が検出されなかったり（人が横を向いている場合、サングラスをしている場合）、目があるが鼻と口が検出されなかったり場合（マスクをしている場合）のことである。つまり、顔データに基づく顔のパーツが一部検出されなかった場合のことである。 On the other hand, if it is determined in step S153 that there is no human face, the process proceeds to step S157, and it is determined whether or not there is something that looks like a face. As described in the first embodiment, this face-like object is obtained by comparing and comparing the feature data of all subjects by the two-dimensional face detection processing in step S152 and the two-dimensional face data of a person. If there is a mouth and nose but one or both eyes are not detected (if a person is looking sideways or wearing sunglasses), or if there is an eye but the nose and mouth are not detected (masked) If you are). In other words, this is a case where some face parts based on the face data are not detected.

ステップＳ１５７で、顔らしきものがあると判断すると、ステップＳ１５８に進み、ＣＰＵ１０の顔認識部１０２は、該検出された顔らしきものがある領域に基づいて３次元の顔識別処理により人の顔を識別する処理を行って、ステップＳ１６０に進む。顔らしき領域に基いて３次元の顔識別処理を行うので処理負担を軽減することができ、顔識別の精度を向上させることができる。この３次元の顔識別処理については後で説明する。
一方、ステップＳ１５７で、顔らしきものがないと判断した場合、ステップＳ１５１で人用の顔識別モードが設定されていないと判断した場合は、ステップＳ１５９に進み、フレーム画像データの全領域に基づいて３次元の顔識別処理により現在設定されている顔識別モードの種類の顔を識別する処理を行って、ステップＳ１６０に進む。この３次元の顔識別処理については後で説明する。 If it is determined in step S157 that there is something that looks like a face, the process proceeds to step S158, where the face recognition unit 102 of the CPU 10 identifies a human face by three-dimensional face identification processing based on the detected area that seems to be a face. The identification process is performed, and the process proceeds to step S160. Since the three-dimensional face identification process is performed based on the face-like area, the processing load can be reduced and the accuracy of the face identification can be improved. This three-dimensional face identification process will be described later.
On the other hand, if it is determined in step S157 that there is no face-like object, or if it is determined in step S151 that the human face identification mode is not set, the process proceeds to step S159, and based on the entire area of the frame image data. The process of identifying the face of the currently set face identification mode type is performed by the three-dimensional face identification process, and the process proceeds to step S160. This three-dimensional face identification process will be described later.

ステップＳ１６０に進むと、ＣＰＵ１０の顔認識部１０２は、登録された顔があるか否かを判断する。つまり、３次元の顔識別処理により登録されている人物の顔が識別できたか否かを判断する。
ステップＳ１６０で、登録された顔があると判断すると、ステップＳ１６１で、顔を識別することができたと判断し、ステップＳ１６０で、登録された顔がないと判断すると、ステップＳ１６２で、顔を識別することができないと判断する。
このステップＳ１６１で顔識別できたと判断すると図３のステップＳ１２、図４のステップＳ２１、図９のステップＳ１０９で顔識別できたと判断し、ステップＳ１６２で顔識別することができないと判断するとステップＳ１２、ステップＳ２１、ステップＳ１０９で顔識別することができないと判断する。 In step S160, the face recognition unit 102 of the CPU 10 determines whether there is a registered face. That is, it is determined whether or not the registered person's face can be identified by the three-dimensional face identification process.
If it is determined in step S160 that there is a registered face, it is determined in step S161 that the face can be identified. If it is determined in step S160 that there is no registered face, the face is identified in step S162. Judge that you can not.
If it is determined in step S161 that the face is identified, step S12 in FIG. 3, step S21 in FIG. 4, step S109 in FIG. In step S21 and step S109, it is determined that the face cannot be identified.

なお、図１１では、人用の顔識別モードが設定されている場合は、２次元の顔検出処理と２次元の顔識別処理との両方を行うようにしたが、顔識別処理により人の顔を検出することができるようにする場合は、２次元の顔検出処理を行わずに、２次元の顔識別処理のみを行うようにしてもよい。この場合は、ステップＳ１５１で人用の顔識別モードが設定されていると判断すると、ステップＳ１５４に進み、全領域に基づいて２次元の顔識別処理を行って、ステップＳ１５５に進む。ステップＳ１５５で、登録された顔がないと判断すると、顔を検出することができたか否かを判断する。２次元の顔識別処理により顔を検出することができたと判断すると、ステップＳ１５６に進み、顔を検出することができないと判断すると、ステップＳ１５７に進む。この場合は、ステップＳ１５２、ステップＳ１５３の動作は不要ということになる。 In FIG. 11, when the human face identification mode is set, both the two-dimensional face detection process and the two-dimensional face identification process are performed. May be detected, only the two-dimensional face identification process may be performed without performing the two-dimensional face detection process. In this case, if it is determined in step S151 that the human face identification mode is set, the process proceeds to step S154, a two-dimensional face identification process is performed based on the entire area, and the process proceeds to step S155. If it is determined in step S155 that there is no registered face, it is determined whether or not a face has been detected. If it is determined that the face can be detected by the two-dimensional face identification process, the process proceeds to step S156. If it is determined that the face cannot be detected, the process proceeds to step S157. In this case, the operations in steps S152 and S153 are not necessary.

また、この２次元の顔識別処理による顔検出は、登録されている顔がフレーム画像データ内にあると識別された場合は、当然に顔も検出されていることになり、また、たとえ登録されている顔がフレーム画像データ内にないと判断された場合であっても、全被写体の２次元の特徴データと登録されている人の２次元の顔データとの比較照合の結果により、登録されている人物の顔ではないが人の顔であることを検出することは可能であるからである。例えば、人の２次元の顔データと第１の所定値以上で一致する部分がある場合は、該顔データに対応する人物の顔であると識別し、第１の所定値以上で一致しないが、第２の所定値（第１の所定値より小さい値）以上で一致する場合は人の顔であると検出する。
なお、３次元の顔識別処理による顔検出は後で説明する。 Further, in the face detection by the two-dimensional face identification process, when the registered face is identified as being in the frame image data, the face is naturally detected, and even if the registered face is registered. Even if it is determined that the face is not included in the frame image data, the registered face is registered based on the result of comparison and matching between the two-dimensional feature data of all subjects and the registered two-dimensional face data of the person. This is because it is possible to detect that the face of the person is not the face of the person. For example, if there is a portion that matches the human two-dimensional face data at a first predetermined value or more, the face is identified as a person corresponding to the face data, and does not match at the first predetermined value or more. If the values coincide with each other at a second predetermined value (a value smaller than the first predetermined value) or more, the face is detected.
Note that face detection by three-dimensional face identification processing will be described later.

Ｈ．３次元の顔識別処理の動作について
Ｈ−１．図６に示す３次元の顔検出処理の動作を、３次元の顔識別処理に適用したときの動作について
このときの３次元の顔識別処理の動作を、図６を引用し、異なる部分だけを説明する。
まず、ステップＳ５３では、撮像された複数枚のフレーム画像データから被写体の特徴データをそれぞれ算出する。このときに算出される被写体の特徴データは、人や犬、猫等の種類の顔であることが大まかに識別できる程度の情報量ではなく、誰であるか等具体的に識別できる程度の情報量を有している。 H. Operation of 3D face identification processing H-1. About the operation when the operation of the three-dimensional face detection process shown in FIG. 6 is applied to the three-dimensional face identification process The operation of the three-dimensional face identification process at this time is shown in FIG. explain.
First, in step S53, subject feature data is calculated from a plurality of captured frame image data. The feature data of the subject calculated at this time is not the amount of information that can roughly identify whether it is a face of a person, a dog, or a cat, but information that can be specifically identified such as who it is Have quantity.

このとき、図５を顔識別処理に適用したときの図５のステップＳ３６、図１１のステップＳ１５９での３次元の顔識別処理の場合は、撮像されたフレーム画像データの全領域に基づいて被写体の２次元の特徴データ（全被写体の特徴データ）を算出する。
また、図１１のステップＳ１５６の３次元の顔識別処理の場合は、撮像された各フレーム画像データの検出された顔領域の画像データから被写体の２次元の特徴データ（２次元の顔特徴データ）を算出する。ここで顔特徴データとは、検出された顔領域内で撮像されている全ての被写体の特徴データに基づいて数値化されたデータである。
また、図１１のステップＳ１５８の３次元の顔識別処理の場合は、撮像された各フレーム画像データの顔らしきものがあると判断された領域の画像データから被写体の２次元の特徴データ（顔らしき被写体の２次元の特徴データ）を算出する。 At this time, in the case of the three-dimensional face identification process in step S36 in FIG. 5 and step S159 in FIG. 11 when FIG. 5 is applied to the face identification process, the subject is based on the entire area of the captured frame image data. 2D feature data (feature data of all subjects) is calculated.
In the case of the three-dimensional face identification process in step S156 of FIG. 11, the two-dimensional feature data (two-dimensional face feature data) of the subject is obtained from the image data of the detected face area of each captured frame image data. Is calculated. Here, the face feature data is data digitized based on the feature data of all the subjects imaged in the detected face area.
Further, in the case of the three-dimensional face identification process in step S158 in FIG. 11, the two-dimensional feature data of the subject (like the face) is obtained from the image data of the area where it is determined that there is a face-like thing in each captured frame image data. 2D feature data of the subject) is calculated.

そして、ステップＳ５５では、ステップＳ５４で各フレーム画像データの被写体特徴データから生成された被写体の３次元モデルと、顔識別用の顔データテーブルに登録されている、現在設定されている顔識別モードの種類の立体顔データとを比較照合する。
そして、ステップＳ５６で、所定値以上で一致する部分があるか否かを判断し、所定値以上で一致する部分がある場合は、ステップＳ５７で、該所定値以上で一致する部分に登録された顔があると判断し、顔領域を検出する。一方、ステップＳ５６で、所定値以上で一致する部分がないと判断すると、ステップＳ５８に進み、登録されている顔がないと判断する。 In step S55, the three-dimensional model of the subject generated from the subject feature data of each frame image data in step S54 and the currently set face identification mode registered in the face data table for face identification. Compare and collate with 3D face data.
Then, in step S56, it is determined whether or not there is a matching part with a predetermined value or more. If there is a matching part with a predetermined value or more, it is registered as a matching part with the predetermined value or more in step S57. It is determined that there is a face, and a face area is detected. On the other hand, if it is determined in step S56 that there is no matching part with a predetermined value or more, the process proceeds to step S58, and it is determined that there is no registered face.

このステップＳ５７で顔領域を検出すると、図５のステップＳ３８、図１１のステップＳ１６１で顔を識別することができたと判断し、図３のステップＳ１２、図４のステップＳ２１、図９のステップＳ１０９で顔が識別できたと判断する。また、ステップＳ５８で顔がないと判断すると、図５のステップＳ３９、図１１のステップＳ１６２で顔を識別することができないと判断し、図３のステップＳ１２、図４のステップＳ２１、図９のステップＳ１０９で顔が識別できないと判断する。 When the face area is detected in step S57, it is determined that the face can be identified in step S38 in FIG. 5 and step S161 in FIG. 11, and step S12 in FIG. 3, step S21 in FIG. 4, and step S109 in FIG. It is determined that the face can be identified. If it is determined in step S58 that there is no face, it is determined that the face cannot be identified in step S39 in FIG. 5 and step S162 in FIG. 11, and step S12 in FIG. 3, step S21 in FIG. In step S109, it is determined that the face cannot be identified.

なお、この３次元の顔識別処理においても顔を検出することできるようにすることは可能である。この３次元の顔識別処理は、登録されている顔がフレーム画像データ内にあると識別された場合は、当然に顔も検出されていることになり、また、たとえ登録されている顔がフレーム画像データ内にないと判断された場合であっても、被写体の３次元モデルと登録されている立体顔データとの比較照合の結果により、登録されている人物の顔ではないが人の顔であることを検出することは可能であるからである。例えば、立体顔データと第１の所定値以上で一致する部分がある場合は、該立体顔データに対応する顔であると識別し、第１の所定値以上で一致しないが、第２の所定値（第１の所定値より小さい値）以上で一致する場合は単に顔であると検出する。 Note that it is possible to detect a face also in this three-dimensional face identification process. In this three-dimensional face identification process, when a registered face is identified as being in the frame image data, the face is naturally detected, and even if the registered face is a frame Even if it is determined not to be in the image data, the result of the comparison of the 3D model of the subject and the registered 3D face data indicates that the face of the person is not a registered person's face. This is because it can be detected. For example, if there is a portion that matches the 3D face data with a first predetermined value or more, the face is identified as a face corresponding to the 3D face data. If the values match (a value smaller than the first predetermined value) or more, the face is simply detected.

Ｈ−２．図８に示す３次元の顔検出処理の動作を、３次元の顔識別処理に適用したときの動作について
このときの３次元の顔識別処理の動作を、図８を引用し、異なる部分だけを説明する。 H-2. About the operation when the operation of the three-dimensional face detection process shown in FIG. 8 is applied to the three-dimensional face identification process The operation of the three-dimensional face identification process at this time is shown in FIG. explain.

ステップＳ６２では、顔識別用の顔データテーブルに設定されている被写体の種類毎の立体顔データのうち、現在設定されている顔識別モードの種類の立体顔データに基づいて、現在設定されている向きから見た顔の平面画像データを生成する。そして、ステップＳ６３では、該生成した顔の平面画像データから２次元の顔特徴データを算出する。このとき、算出される顔特徴データは、人や犬、猫等の種類の顔であることが大まかに識別できる程度の情報量ではなく、誰であるか等具体的に識別できる程度の情報量を有している。
そして、ステップＳ６４では、直近に撮像されたフレーム画像データに基づいて被写体の２次元の特徴データを算出する。このとき、算出される被写体の特徴データは、人や犬、猫等の種類の顔であることが大まかに識別できる程度の情報量ではなく、誰であるか等具体的に識別できる程度の情報量を有している。 In step S62, among the 3D face data for each type of subject set in the face data table for face identification, currently set based on the 3D face data of the currently set face identification mode. Planar image data of the face viewed from the direction is generated. In step S63, two-dimensional face feature data is calculated from the generated plane image data of the face. At this time, the calculated face feature data is not the amount of information that can roughly identify the type of face such as a person, dog, or cat, but the amount of information that can be specifically identified such as who the person is have.
In step S64, two-dimensional feature data of the subject is calculated based on the most recently captured frame image data. At this time, the calculated feature data of the subject is not an amount of information that can be roughly identified as a face of a kind such as a person, dog, or cat, but information that can be specifically identified such as who the person is. Have quantity.

また、ここで、図５を顔識別処理に適用したときの図５のステップＳ３６、図１１のステップＳ１５９での３次元の顔識別処理の場合は、撮像されたフレーム画像データの全領域に基づいて被写体の２次元の特徴データ（全被写体の特徴データ）を算出する。
また、図１１のステップＳ１５６の３次元の顔識別処理の場合は、撮像された各フレーム画像データの検出された顔領域の画像データから２次元の被写体の特徴データ（２次元の顔特徴データ）を算出する。
また、図１１のステップＳ１５８の３次元の顔識別処理の場合は、撮像された各フレーム画像データの顔らしきものがあると判断された領域の画像データから２次元の被写体の特徴データ（２次元の顔らしき被写体の特徴データ）を算出する。 Here, in the case of the three-dimensional face identification process in step S36 in FIG. 5 and step S159 in FIG. 11 when FIG. 5 is applied to the face identification process, it is based on the entire region of the captured frame image data. Then, two-dimensional feature data of the subject (feature data of all subjects) is calculated.
In the case of the three-dimensional face identification process in step S156 of FIG. 11, the feature data of the two-dimensional subject (two-dimensional face feature data) is obtained from the detected face area image data of each captured frame image data. Is calculated.
In the case of the three-dimensional face identification process in step S158 in FIG. 11, the feature data of the two-dimensional subject (two-dimensional) is obtained from the image data of the area where it is determined that there is a face-like image of each captured frame image data. (Feature data of a subject that looks like a face) is calculated.

そして、ステップＳ６６で、所定値以上で一致する部分があると判断された場合は、ステップＳ６７で、該所定値以上で一致する部分に登録された顔があると判断し、顔領域を検出する。また、ステップＳ６８で、設定されている向きが最後の向きと判断された場合は、ステップＳ７０に進み、登録されている顔がないと判断する。 If it is determined in step S66 that there is a matching part with a predetermined value or more, it is determined in step S67 that there is a registered face in the matching part with the predetermined value or more, and a face area is detected. . If it is determined in step S68 that the set orientation is the last orientation, the process proceeds to step S70, where it is determined that there is no registered face.

このステップＳ６７で顔領域を検出すると、図５のステップＳ３８、図１１のステップＳ１６１で顔を識別することができたと判断し、図３のステップＳ１２、図４のステップＳ２１、図９のステップＳ１０９で顔が識別できたと判断する。また、ステップＳ７０で顔がないと判断すると、図５のステップＳ３９、図１１のステップＳ１６２で顔を識別することができないと判断し、図３のステップＳ１２、図４のステップＳ２１、図９のステップＳ１０９で顔が識別できないと判断する。 When the face area is detected in step S67, it is determined that the face can be identified in step S38 in FIG. 5 and step S161 in FIG. 11, and step S12 in FIG. 3, step S21 in FIG. 4, and step S109 in FIG. It is determined that the face can be identified. If it is determined in step S70 that there is no face, it is determined that the face cannot be identified in step S39 in FIG. 5 and step S162 in FIG. 11, and step S12 in FIG. 3, step S21 in FIG. In step S109, it is determined that the face cannot be identified.

なお、この３次元の顔識別処理においても顔を検出することできるようにすることは可能である。この３次元の顔識別処理は、登録されている顔がフレーム画像データ内にあると識別された場合は、当然に顔も検出されていることになり、また、たとえ登録されている顔がフレーム画像データ内にないと判断された場合であっても、ステップＳ６４で算出された２次元の被写体の特徴データと、ステップＳ６３で算出された顔特徴データとの比較照合の結果により、登録されている人物の顔ではないが人の顔であることを検出することは可能であるからである。例えば、ステップＳ６３で算出された顔特徴データと第１の所定値以上で一致する部分がある場合は、該顔特徴データの生成元となった立体顔データに対応する顔であると識別し、第１の所定値以上で一致しないが、第２の所定値（第１の所定値より小さい値）以上で一致する場合は単に顔であると検出する。 Note that it is possible to detect a face also in this three-dimensional face identification process. In this three-dimensional face identification process, when a registered face is identified as being in the frame image data, the face is naturally detected, and even if the registered face is a frame Even if it is determined that it is not in the image data, it is registered based on the result of comparison and matching between the feature data of the two-dimensional subject calculated in step S64 and the face feature data calculated in step S63. This is because it is possible to detect that the face is not the face of a certain person but the face of a person. For example, if there is a part that matches the face feature data calculated in step S63 at a first predetermined value or more, the face feature data is identified as a face corresponding to the stereoscopic face data from which the face feature data was generated, If the values do not match at the first predetermined value or more but match at the second predetermined value (a value smaller than the first predetermined value) or more, the face is simply detected.

Ｈ−３．他の方法による３次元の顔識別処理の動作について
まず、１つ目の他の方法による３次元の顔識別処理の動作は、上記「Ｈ−２」で説明した動作とほぼ同様であるが、上記「Ｈ−２」では、立体顔データから異なる向きの平面画像データを生成するようにしたが、ここでは、立体顔データから異なる表情（笑い、怒り、嫌悪等の異なる表情）の平面画像データを生成するというものである。このときは、立体顔データを正面から見たときの平面画像データを生成する。 H-3. Regarding the operation of the three-dimensional face identification process by another method First, the operation of the three-dimensional face identification process by the first other method is substantially the same as the operation described in the above “H-2”. In the above “H-2”, plane image data in different directions is generated from the stereoscopic face data, but here, the planar image data of different facial expressions (different facial expressions such as laughter, anger, and disgust) from the stereoscopic face data. Is generated. At this time, plane image data when the stereoscopic face data is viewed from the front is generated.

このときの動作を、図８を引用して説明する。
まず、ステップＳ６１では、顔の表情を無表情に設定し、ステップＳ６２では、現在設定されている顔識別モードの種類の立体顔データの顔の表情を認識し、該立体顔データから該設定された表情の立体顔データを生成する。そして、該生成した立体顔データに基づいて正面から見た顔の平面画像データを生成する。 The operation at this time will be described with reference to FIG.
First, in step S61, the facial expression is set to no expression, and in step S62, the facial expression of the 3D face data of the currently set face identification mode is recognized and set from the 3D face data. 3D face data of a certain expression is generated. Then, plane image data of the face viewed from the front is generated based on the generated stereoscopic face data.

そして、ステップＳ６６で、所定値以上で一致する部分があると判断すると、ステップＳ６７で、該所定値以上で一致する部分に登録されている顔があると判断し、顔領域を検出する。一方、ステップＳ６６で、所定値以上で一致する部分がないと、ステップＳ６８で、設定されている表情が最後の表情か否かを判断する。つまり、予め決められているすべての表情を設定したか否かを判断する。ステップＳ６８で、設定されている表情が最後の表情でないと判断すると、ステップＳ６９で次の表情に設定してステップＳ６２に戻る。一方、ステップＳ６８で設定されている表情が最後の表情であると判断すると、ステップＳ７０に進み、登録されている顔がないと判断する。
これにより、顔識別の精度を向上させることができる。例えば、被写体が笑っていたり、怒っている場合等であっても、顔を識別することができる。 If it is determined in step S66 that there is a matching part with a predetermined value or more, it is determined in step S67 that there is a face registered in the matching part with the predetermined value or more, and a face area is detected. On the other hand, if it is determined in step S66 that there is no matching portion equal to or greater than the predetermined value, it is determined in step S68 whether the set facial expression is the last facial expression. That is, it is determined whether all predetermined facial expressions have been set. If it is determined in step S68 that the set facial expression is not the last facial expression, the next facial expression is set in step S69, and the process returns to step S62. On the other hand, if it is determined that the facial expression set in step S68 is the last facial expression, the process proceeds to step S70, and it is determined that there is no registered face.
Thereby, the accuracy of face identification can be improved. For example, even when the subject is laughing or angry, the face can be identified.

なお、立体顔データから異なる向きの平面画像データの生成に代えて、異なる表情の平面画像データを生成するようにしたが、各表情毎に異なる向きの平面画像データを生成するようにしてもよい。つまり、向きと表情とを変えて平面画像データを生成する。
また、３次元の顔識別処理について説明したが、３次元の顔検出処理にも適用することも可能である。また、この３次元の顔識別処理により顔を検出することができるようにすることも可能である。 In addition, instead of generating plane image data with different orientations from the three-dimensional face data, plane image data with different facial expressions are generated, but plane image data with different orientations may be generated for each facial expression. . That is, plane image data is generated by changing the direction and the expression.
Further, although the three-dimensional face identification process has been described, the present invention can also be applied to a three-dimensional face detection process. It is also possible to detect a face by this three-dimensional face identification process.

次に、２つ目の他の方法による３次元の顔識別処理の動作は、上記「Ｈ−１」と「Ｈ−２」で説明した動作を組み合わせたものであり、上記「Ｈ−２」では、登録されている立体顔データから異なる向きから見た平面画像データを生成するようにしたが、撮像された複数のフレーム画像データから被写体の３次元モデルを生成し、該生成した被写体の３次元モデルから異なる向きから見た平面画像データを生成するというものである。このときは、顔識別用の顔データテーブルには、被写体の種類にかかわらず２次元の顔データのみが登録されている。 Next, the operation of the three-dimensional face identification processing by the second other method is a combination of the operations described in the above “H-1” and “H-2”. In this case, planar image data viewed from different directions is generated from the registered three-dimensional face data. However, a three-dimensional model of a subject is generated from a plurality of captured frame image data, and 3 of the generated subject is generated. Planar image data viewed from different directions is generated from a dimensional model. At this time, only two-dimensional face data is registered in the face data table for face identification regardless of the type of subject.

このときの動作を、図６及び図８を引用して説明する。
まず、図６のステップＳ５１、ステップＳ５２の動作後、撮像された撮影角度の異なる複数毎のフレーム画像データ毎に被写体の特徴データを算出し（ステップＳ５３）、該算出された各フレーム画像データ毎の被写体の特徴データから被写体の３次元モデルを生成する（ステップＳ５４）。 The operation at this time will be described with reference to FIGS.
First, after the operations of step S51 and step S52 in FIG. 6, the feature data of the subject is calculated for each of the plurality of frame image data captured at different shooting angles (step S53), and for each calculated frame image data A three-dimensional model of the subject is generated from the feature data of the subject (step S54).

そして、図８のステップＳ６１に進み、顔の向きを正面と設定し、ステップＳ６２では、該生成された被写体の３次元モデルに基づいて、該設定された向きから見た被写体の平面画像データを生成する。次いで、ステップＳ６３で、該生成した平面画像データから被写体の特徴データを算出して、ステップＳ６５に進み、該算出した被写体の特徴データ（設定されている向きから見た被写体の特徴データ）と、顔識別用の顔データテーブルに登録されている現在設定されている顔識別モードの種類の２次元の顔データとを比較照合する。 Then, the process proceeds to step S61 in FIG. 8, where the face orientation is set to the front, and in step S62, the plane image data of the subject viewed from the set orientation is obtained based on the generated three-dimensional model of the subject. Generate. Next, in step S63, subject feature data is calculated from the generated plane image data, and the process proceeds to step S65, where the calculated subject feature data (subject feature data viewed from the set orientation), and The two-dimensional face data of the currently set face identification mode type registered in the face data table for face identification is compared and collated.

そして、ステップＳ６６で、所定値以上で一致する部分があると判断すると、ステップＳ６７進み、該所定値以上で一致する部分に登録されている顔があると判断し、顔領域を検出する。一方、ステップＳ６６で、所定値以上で一致する部分がないと判断すると、ステップＳ６８で、設定されている向きが最後の向きであるか否かを判断する。ステップＳ６８で、設定されている向きが最後の向きでないと判断すると、ステップＳ６９で次の向きに設定してステップＳ６２に戻る。一方、ステップＳ６８で設定されている向きが最後の向きであると判断すると、ステップＳ７０に進み、登録されている顔がないと判断する。 If it is determined in step S66 that there is a matching part with a predetermined value or more, the process proceeds to step S67, where it is determined that there is a face registered in the matching part with the predetermined value or more, and a face area is detected. On the other hand, if it is determined in step S66 that there is no matching portion equal to or greater than the predetermined value, it is determined in step S68 whether the set direction is the last direction. If it is determined in step S68 that the set direction is not the last direction, the next direction is set in step S69, and the process returns to step S62. On the other hand, if it is determined that the orientation set in step S68 is the last orientation, the process proceeds to step S70, and it is determined that there is no registered face.

これにより、顔識別の精度を向上させることができる。例えば、人物が横を向いている場合やサングラスをしている場合であっても、該人物が登録されている顔であるかを識別することでできる。また、立体顔データではなく、２次元の顔データを登録しておけばよいので、記録容量を少なくすることができる。
なお、３次元の顔識別処理について説明したが、３次元の顔検出処理にも適用することも可能である。また、この３次元の顔識別処理により顔を検出することができるようにすることも可能である。 Thereby, the accuracy of face identification can be improved. For example, even when a person is facing sideways or wearing sunglasses, it is possible to identify whether the person is a registered face. Further, since it is only necessary to register two-dimensional face data instead of three-dimensional face data, the recording capacity can be reduced.
Although the three-dimensional face identification process has been described, the present invention can also be applied to a three-dimensional face detection process. It is also possible to detect a face by this three-dimensional face identification process.

また、図１１のステップＳ１５６による３次元の顔識別処理では、顔領域の画像データに基づいて被写体の３次元モデルを生成するので、顔の３次元モデルが生成されるということになる。このように顔の３次元モデルが生成される場合は、生成された顔の３次元モデルの表情を認識して、上述したように顔の３次元モデルの表情を変えるようにしてもよい。また、この動作は、３次元の顔識別処理について説明したが、３次元の顔検出処理にも適用することも可能である。また、この３次元の顔識別処理により顔を検出することができるようにすることも可能である。 Further, in the three-dimensional face identification process in step S156 of FIG. 11, a three-dimensional model of the subject is generated based on the image data of the face area, and thus a three-dimensional model of the face is generated. When the three-dimensional model of the face is generated in this way, the expression of the generated three-dimensional model of the face may be recognized and the expression of the three-dimensional model of the face may be changed as described above. Moreover, although this operation | movement demonstrated 3D face identification processing, it is also possible to apply to 3D face detection processing. It is also possible to detect a face by this three-dimensional face identification process.

以上のように第３の実施の形態においては、設定された顔識別モードの種類に応じて、異なる顔識別処理を行うようにしたので、被写体の種類や撮影状況に適した顔識別処理を行うことができ、顔識別の精度を向上させることができる。
さらに、第３の実施の形態においては、設定された顔識別モードの種類に応じて、２次元の顔識別処理を行ったり、３次元の顔識別処理を行うようにしたので、処理負担の大きい３次元の顔識別処理を不必要に行うことがなく、また顔識別の精度を向上させることができる。たとえば、人の顔は平面的なので２次元の顔識別処理を行い、犬等の動物は一般的に立体的な顔なので２次元の顔識別処理では顔を識別することは難しいが、３次元の顔識別を行うことにより、動物の顔も識別することができる。 As described above, in the third embodiment, different face identification processing is performed according to the type of the set face identification mode, so that face identification processing suitable for the type of subject and the shooting situation is performed. And the accuracy of face identification can be improved.
Furthermore, in the third embodiment, since the two-dimensional face identification process or the three-dimensional face identification process is performed according to the set face identification mode, the processing load is large. The three-dimensional face identification process is not performed unnecessarily, and the accuracy of face identification can be improved. For example, since a human face is flat, a two-dimensional face identification process is performed. An animal such as a dog is generally a three-dimensional face, so it is difficult to identify a face by a two-dimensional face identification process. By performing face identification, an animal's face can also be identified.

また、２次元の顔識別処理により人の顔が識別されない場合は、３次元の顔識別処理を行うようにしたので、たとえば、人が横を向いていたり、マスクやサングラス等を装着している場合であっても人の顔を識別することができる。
また、ユーザが顔識別モードを設定する場合は、識別したい種類の被写体の顔のみを識別することができる。
自動的に顔識別モードを設定する場合は、ユーザの手間が省け、また、撮影したい被写体の種類が具体的にわからない場合（例えば、猫なのか狐なのかわからない場合）であっても、該被写体の顔を識別することができ、顔識別の精度を向上させることができる。 In addition, when the human face is not identified by the two-dimensional face identification process, the three-dimensional face identification process is performed. For example, the person is facing sideways or wearing a mask or sunglasses. Even in this case, a person's face can be identified.
Further, when the user sets the face identification mode, only the face of the type of subject to be identified can be identified.
When the face identification mode is automatically set, the user can save time and the subject even if the type of subject to be photographed is not specifically known (for example, whether the subject is a cat or not). Can be identified, and the accuracy of face identification can be improved.

また、被写体追従モードがＯｎの場合は、設定された顔識別モードの種類に応じて、２次元の顔識別処理や３次元の顔識別処理を行い、該顔が識別されると、２次元的な顔検出処理により該検出された顔を検出していき、追従している顔を見失うと再び、設定された顔識別モードの種類に応じて、２次元の顔識別処理や３次元の顔識別処理を行うので、追従処理の負担を軽減することができると共に、追従処理の精度を向上させることができる。 When the subject tracking mode is On, a two-dimensional face identification process or a three-dimensional face identification process is performed according to the type of the set face identification mode. If the detected face is detected by a simple face detection process, and the face being followed is lost, a two-dimensional face identification process or a three-dimensional face identification is performed again according to the type of the set face identification mode. Since the processing is performed, it is possible to reduce the load of the tracking process and improve the accuracy of the tracking process.

［変形例］
Ｉ．上記実施の形態は以下のような変形例も可能である。 [Modification]
I. The above-described embodiment can be modified as follows.

（０１）上記各実施の形態において、被写体追従モードがＯｆｆの場合もＯｎの場合も、設定されている顔検出モードに応じた顔検出処理、顔識別処理を行うようにしたが、被写体追従モードがＯｎの場合は、設定されている顔検出モードの種類にかかわらず、３次元の顔検出処理、３次元の顔識別処理を行うようにしてもよい。この３次元の顔検出処理により検出された顔、３次元の顔識別処理により識別された顔を２次元的に顔を検出してくことにより、該検出された顔の追従処理を行い、シャッタボタンが全押しされるまでに該顔を見失った場合は、再び３次元の顔検出処理、３次元の顔識別処理を行うようにする。 (01) In each of the above embodiments, the face detection process and the face identification process corresponding to the set face detection mode are performed regardless of whether the subject tracking mode is Off or On. If is On, a three-dimensional face detection process and a three-dimensional face identification process may be performed regardless of the type of face detection mode that is set. The face detected by the three-dimensional face detection process is detected two-dimensionally from the face identified by the three-dimensional face identification process, thereby performing a tracking process of the detected face, and a shutter button If the face is lost before the button is fully pressed, the three-dimensional face detection process and the three-dimensional face identification process are performed again.

（０２）また、上記各実施の形態においては、図３のステップＳ１０、図９のステップＳ１０８の設定されている顔検出モードに応じた顔検出処理は、人用の顔検出モードが設定されている場合は、２次元の顔検出処理、２次元の顔識別処理を行い（図５のステップＳ３２、図１１のステップＳ１５２、ステップＳ１５４）、２次元の顔検出処理、２次元の顔識別処理により顔が検出されない場合に（図５のステップＳ３３でＮｏ、図１１のステップＳ１５３、ステップＳ１５５でＮｏ）、初めて３次元の顔検出処理を行うようにしたが（図５のステップＳ３５、ステップＳ３６、図１１のステップＳ１５６、ステップＳ１５８、ステップＳ１５９）、３次元の顔検出処理、３次元の顔識別処理を行わないようにしてもよい。この場合は、図５のステップＳ３３、図１１のステップＳ１５３、ステップＳ１５５で、人の顔がない、登録された人物の顔がないと判断すると、図５のステップＳ３９、図１１のステップＳ１６２に進む。この場合は、図５のステップＳ３４及びステップＳ３５、図１１のステップＳ１５６〜ステップＳ１５８の動作は不要ということになる。 (02) In the above-described embodiments, the face detection process corresponding to the face detection mode set in step S10 of FIG. 3 and step S108 of FIG. 9 is set to the human face detection mode. If so, a two-dimensional face detection process and a two-dimensional face identification process are performed (step S32 in FIG. 5, step S152 and step S154 in FIG. 11), a two-dimensional face detection process, and a two-dimensional face identification process. When a face is not detected (No in step S33 in FIG. 5, No in step S153 in FIG. 11, and No in step S155), the three-dimensional face detection process is performed for the first time (steps S35 and S36 in FIG. 5). The three-dimensional face detection process and the three-dimensional face identification process may not be performed in steps S156, S158, and S159 in FIG. In this case, if it is determined in step S33 in FIG. 5, step S153 in FIG. 11, or step S155 that there is no human face and no registered human face, the process proceeds to step S39 in FIG. 5 and step S162 in FIG. move on. In this case, the operations in steps S34 and S35 in FIG. 5 and steps S156 to S158 in FIG. 11 are unnecessary.

（０３）また、上記各実施の形態において、図３のステップＳ１０、図９のステップＳ１０８の設定されている顔検出モードに応じた顔検出処理、顔識別処理は、動物用の顔検出モード（たとえば、犬用の顔検出モード、猫用の顔検出モード等）、動物用の顔識別モードが設定されている場合は、一律に３次元の顔検出処理、３次元の顔識別処理より設定されている顔検出モードの動物の種類の顔を検出、識別するようにしたが（図５のステップＳ３６、図１１のステップＳ１５９）、ある種類の動物用の顔検出モード、顔識別モードが設定された場合は、人用の顔検出モード、人用の顔識別モードが設定されたときと同様の動作を行うようにしてもよい。つまり、該設定された種類の動物用の顔検出モード、顔識別モードに応じて２次元の顔検出処理を実行したり、３次元の顔検出処理のみを実行したりするようにしてもよい。たとえば、猫用の顔検出モードが設定された場合は人用の顔検出モードが設定されたときと同様の処理を行い、犬用の顔検出モードが設定された場合は３次元の顔検出処理のみを行う。すべての種類の動物の顔が立体的ではないからであり、平面的な顔の動物もあるからである。なお、このとき、２次元の顔検出処理を行った場合に、該２次元の顔検出処理により設定された顔検出モードの動物の種類の顔が検出できなかった場合は、３次元の顔検出処理を実行させるようにしてもよいし、実行させないようにしてもよい。
この場合は、図２に示すような顔データテーブルには、動物の種類毎に、２次元の顔データや３次元の顔データを設定しておく。
これにより、顔認識の精度を向上させることができる。 (03) In each of the above embodiments, the face detection process and face identification process corresponding to the set face detection mode in step S10 of FIG. 3 and step S108 of FIG. For example, when a face detection mode for dogs, a face detection mode for cats, etc., and a face identification mode for animals are set, they are uniformly set from a three-dimensional face detection process and a three-dimensional face identification process. The face of the animal type in the face detection mode is detected and identified (step S36 in FIG. 5, step S159 in FIG. 11), but the face detection mode and face identification mode for a certain type of animal are set. In such a case, the same operation as when the human face detection mode and the human face identification mode are set may be performed. That is, the two-dimensional face detection process may be executed or only the three-dimensional face detection process may be executed in accordance with the set face detection mode and face identification mode for animals. For example, when the face detection mode for cats is set, the same processing as when the face detection mode for humans is set is performed, and when the face detection mode for dogs is set, the three-dimensional face detection processing is performed. Only do. This is because the faces of all kinds of animals are not three-dimensional, and some animals have a flat face. At this time, if the face of the animal type in the face detection mode set by the two-dimensional face detection process is not detected when the two-dimensional face detection process is performed, the three-dimensional face detection is performed. The process may be executed or may not be executed.
In this case, two-dimensional face data and three-dimensional face data are set for each type of animal in the face data table as shown in FIG.
Thereby, the accuracy of face recognition can be improved.

（０４）また、上記各実施の形態においては、図５のステップＳ３２で、画像データの全領域に基づいて２次元の顔検出処理、２次元の顔識別処理を、図５のステップＳ３６で、画像データの全領域に基づいて３次元の顔検出処理、３次元の顔識別処理を、図１１のステップＳ１５２で、画像データの全領域に基づいて２次元の顔検出処理を、図１１のステップＳ１５９で、画像データの全領域に基づいて３次元の顔識別処理を行うようにしたが、画像データの全領域ではなく、画角の中央領域、または、ユーザによって任意に指定された領域に基づいて行うようにしてもよい。これにより、顔検出処理、顔識別処理の処理負担を軽減させることができる。 (04) In each of the above embodiments, in step S32 in FIG. 5, two-dimensional face detection processing and two-dimensional face identification processing are performed in step S36 in FIG. The three-dimensional face detection process based on the entire area of the image data is performed in step S152 of FIG. 11, and the two-dimensional face detection process is performed on the basis of the entire area of the image data. In S159, the three-dimensional face identification process is performed based on the entire area of the image data. However, based on the central area of the angle of view or the area arbitrarily designated by the user, not the entire area of the image data. May be performed. Thereby, the processing load of face detection processing and face identification processing can be reduced.

また、顔検出モードの種類、顔識別モードの種類に対応して色成分を関連付けて記録しておき、図５のステップＳ３２、図５のステップＳ３６、図１１のステップＳ１５２、ステップＳ１５９では、画像データの全領域ではなく、現在設定されている顔検出モード、顔識別モードの種類に対応する色成分を有する領域のみに基づいて行うようにしてもよい。この顔検出モード、顔識別モードの種類に対応した色成分とは、顔検出モード、顔識別モードの種類の被写体の顔の色成分である。たとえば、人用の顔識別モードの場合に対応する色成分は肌色ということになる。これにより、設定された顔検出モード、顔識別モードの種類の被写体の顔があると思われる領域のみに基づいて顔検出処理、顔識別処理を行うことができ、処理負担を軽減させることができる。 Further, color components are associated and recorded corresponding to the types of face detection mode and face identification mode, and in step S32 in FIG. 5, step S36 in FIG. 5, step S152 in FIG. You may make it carry out based not only on the whole area | region of data but only the area | region which has a color component corresponding to the kind of face detection mode currently set and face identification mode. The color components corresponding to the types of the face detection mode and the face identification mode are the color components of the face of the subject in the types of the face detection mode and the face identification mode. For example, the color component corresponding to the human face identification mode is skin color. As a result, the face detection process and the face identification process can be performed based only on the area where the face of the subject of the type of the face detection mode and the face identification mode set is considered to be present, and the processing burden can be reduced. .

（０５）また、上記各実施の形態においては、被写体の特徴点や被写体の種類毎の特徴点を抽出して特徴データを抽出し、該抽出した特徴データに基づいて３次元モデルを生成したり、該抽出した特徴データを比較照合することにより顔を検出、識別するようにしたが、他の方法によって３次元モデルを生成したり、顔を検出、識別したりするようにしてもよい。
また、複数枚の画像データに基づいて被写体の３次元モデルを生成するようにしたが、１枚の画像データから被写体の３次元モデルを生成するようにしてもよい。 (05) In each of the above embodiments, feature data is extracted by extracting feature points of a subject and feature types for each subject type, and a three-dimensional model is generated based on the extracted feature data. Although the face is detected and identified by comparing and comparing the extracted feature data, a three-dimensional model may be generated or the face may be detected and identified by other methods.
Further, the three-dimensional model of the subject is generated based on a plurality of pieces of image data, but the three-dimensional model of the subject may be generated from one piece of image data.

（０６）また、上記各実施の形態においては、静止画撮影処理により撮影された静止画像データのみを記録するようにしたが、顔検出処理により検出された顔の位置や顔の種類（設定された顔検出モードの種類）、顔識別処理により識別された顔の位置や名称（たとえば、人の場合は人物名、動物の場合は動物の名称若しくはペットの名前等）を静止画像データに関連付けて記録するようにしてもよい。 (06) In the above embodiments, only the still image data captured by the still image capturing process is recorded. However, the face position and the face type (set by the face detection process) are set. Type of face detection mode), the position and name of the face identified by the face identification process (for example, the name of a person in the case of a person, the name of an animal or the name of a pet in the case of an animal), and the like. It may be recorded.

（０７）また、上記各実施の形態においては、検出した顔、識別した顔に基いてＡＦ処理を行うようにしたが、検出された顔、識別した顔に基づいて露出制御を行なったり、トリミング処理を行うというように、検出された顔、識別された顔に基づいて所定の処理を行うようにしてもよい。 (07) In each of the above embodiments, the AF processing is performed based on the detected face and the identified face. However, exposure control is performed based on the detected face and the identified face, and trimming is performed. As in the case of performing the process, a predetermined process may be performed based on the detected face and the identified face.

（０８）また、上記各実施の形態においては、人、犬、猫等大まかな被写体の種類毎の立体顔データや２次元の顔データを記録するようにしたが、被写体の種類さらに詳しく分類して立体顔データや２次元の顔データを記録するようにしてもよい。例えば、犬なら柴犬、秋田犬、・・・、猫なら、ペルシャ猫、三毛猫、・・というように詳しく分類するようにする。これにより、顔検出、顔識別の精度を向上させることができる。 (08) Further, in each of the above embodiments, the three-dimensional face data and the two-dimensional face data for each type of rough subject such as a person, dog, cat, etc. are recorded. Thus, three-dimensional face data and two-dimensional face data may be recorded. For example, Shiba-inu, Akita-inu, etc. for dogs, Persian cats, calico, etc. for cats. Thereby, the precision of face detection and face identification can be improved.

（０９）また、図２に示す顔検出用の顔データテーブルは、被写体の種類毎に同じ次元の顔データは１つしか記録していなかったが、被写体の種類毎に同じ次元の顔データを複数記録しておくようにしてもよい。たとえば、同じ猫であってもその種類によっては極端に顔が変わる可能性もあり、すべての猫の顔が検出することができるように、複数の立体顔データを記録する。 (09) Although the face detection face data table shown in FIG. 2 records only one face data of the same dimension for each type of subject, face data of the same dimension is recorded for each type of subject. A plurality of records may be recorded. For example, even if the cat is the same, the face may change extremely depending on the type of the cat, and a plurality of three-dimensional face data is recorded so that all cat faces can be detected.

（１０）また、上記各実施の形態においては、顔検出モード、顔識別モード（これらを総称して顔認識モードという）の種類を１つ設定するようにしたが、顔検出モード、顔識別モードの種類を複数同時に設定するができるようにし、該同時に設定されたすべての顔検出モード、顔識別モードの種類の顔を検出するようにしてもよい。なお、このとき、顔検出モード、顔識別モードの種類によっては顔検出処理、顔識別処理の動作が変わってしまうので（図５、図１１参照）、動作が変わらない範囲内で複数の顔検出モード、顔識別モードを同時に設定するようにする。 (10) In the above embodiments, one type of face detection mode and face identification mode (collectively referred to as face recognition mode) is set, but the face detection mode and face identification mode are set. A plurality of types may be set at the same time, and all face detection mode and face identification mode types set at the same time may be detected. At this time, the operations of the face detection process and the face identification process change depending on the types of the face detection mode and the face identification mode (see FIGS. 5 and 11). Set the mode and face identification mode at the same time.

（１１）また、上記各実施の形態においては、人用の顔認識モードと、動物用の顔認識モードを備えるようにしたが、これに限らず、他の種類の被写体、たとえば、昆虫の顔を検出する昆虫用の顔認識モードを備える（この場合は、カブトムシ用の顔認識モード、クワガタ用の顔認識モードというように昆虫の種類毎に顔認識モードを備える）ようにしてもよい。 (11) In the above embodiments, the human face recognition mode and the animal face recognition mode are provided. However, the present invention is not limited to this, and other types of subjects, for example, insect faces, are provided. May be provided with a face recognition mode for insects (in this case, a face recognition mode is provided for each type of insect such as a face recognition mode for beetles and a face recognition mode for stag beetles).

（１２）また、上記第１、２の実施の形態における図５の動作においては、２次元の顔検出処理により人の顔が検出されない場合は（ステップＳ３３でＮｏ）、３次元の顔検出処理を行うようにしたが（ステップＳ３５、ステップＳ３６）、ステップＳ３４で顔らしきものがあると判断された場合のみ３次元の顔検出処理を行うようにしてもよい。つまり、ステップＳ３４で顔らしきものがないと判断するとそのままステップＳ３９に進む。この場合は、ステップＳ３６の動作は不要ということになる。 (12) In the operation of FIG. 5 in the first and second embodiments, when a human face is not detected by the two-dimensional face detection process (No in step S33), the three-dimensional face detection process (Step S35, step S36), but the three-dimensional face detection process may be performed only when it is determined in step S34 that there is something that looks like a face. That is, if it is determined in step S34 that there is no face-like object, the process proceeds to step S39 as it is. In this case, the operation in step S36 is unnecessary.

（１３）また、上記第３の実施の形態における図１１の動作においては、２次元の顔検出処理により人の顔が検出されないと（ステップＳ１５３でＮｏ）、顔らしきものがあるか否かを判断し（ステップＳ１５７）、顔らしきものがある場合には、顔らしきものがある領域に基づいて３次元の顔識別処理を行い（ステップＳ１５８）、顔らしきものがない場合には全領域に基づいて３次元の顔識別処理を行うようにしたが（ステップＳ１５９）、ステップＳ１５７で顔らしきものがないと判断した場合は、そのままステップＳ１６２に進むようにしてもよい。 (13) Further, in the operation of FIG. 11 in the third embodiment, if a human face is not detected by the two-dimensional face detection process (No in step S153), it is determined whether there is something that looks like a face. Judgment is made (step S157), and if there is something that looks like a face, three-dimensional face identification processing is performed based on the area that looks like a face (step S158). The three-dimensional face identification process is performed (step S159), but if it is determined in step S157 that there is no face-like object, the process may proceed to step S162 as it is.

（１４）また、本発明の特徴となる部分は、顔認識モード（顔検出モードと顔識別モードの総称）に応じて顔認識処理の動作を変えるという点にあるので、上記各実施の形態では、被写体の種類毎に顔認識モードというものを設けたが、被写体の種類毎でなくてもよい。
たとえば、顔認識モードを、２次元の顔認識処理により認識する２次元顔認識モード、３次元の顔認識処理により認識する３次元顔認識モードというように、顔認識処理の次元毎に設けてもよい。この場合は、人の顔を認識する場合は、まず、２次元顔認識モードに設定して顔認識処理を行い、顔が認識されない場合は３次元顔認識モードに設定するようにしてもよい。
また、動物の顔を認識する場合は、最初から３次元顔認識モードに設定するようにしてもよいし、動物の種類に応じて２次元顔認識モードに設定したり、３次元顔認識モードに設定したりするようにしてもよい。この場合、２次元顔認識モードに設定した場合であって、該２次元顔認識モードにより顔が認識できなかった場合は３次元顔認識モードに設定するようにしてもよい。 (14) The feature of the present invention is that the operation of the face recognition process is changed according to the face recognition mode (generic name for the face detection mode and the face identification mode). Although a face recognition mode is provided for each type of subject, it need not be for each type of subject.
For example, a face recognition mode may be provided for each dimension of the face recognition process, such as a two-dimensional face recognition mode that is recognized by a two-dimensional face recognition process, and a three-dimensional face recognition mode that is recognized by a three-dimensional face recognition process. Good. In this case, when recognizing a human face, first, the face recognition processing may be performed by setting the two-dimensional face recognition mode, and when the face is not recognized, the three-dimensional face recognition mode may be set.
When recognizing the face of an animal, the 3D face recognition mode may be set from the beginning, or the 2D face recognition mode may be set according to the type of animal, or the 3D face recognition mode may be set. You may make it set. In this case, when the 2D face recognition mode is set, and the face cannot be recognized by the 2D face recognition mode, the 3D face recognition mode may be set.

さらに、被写体の種類と顔認識の次元とを考慮して顔認識モードを設けるようにしてもよい。
たとえば、上記各実施の形態においては、人用の顔認識モードの中に、２次元の顔認識処理、３次元の顔認識処理が含まれるが、２次元の顔認識により人の顔を認識する人用の２次元顔認識モード、３次元の顔認識処理により人の顔を認識する人用の３次元顔認識モードというように、被写体の種類と顔認識の次元との組み合わせ毎に顔認識モードを備えるようにしてもよい。
たとえば、人用の２次元顔認識モードが設定された場合は、該設定された人用の２次元顔認識モードに対応する顔認識処理を行い、顔が検出されない場合は、人用の３次元顔認識モードに自動的に切り換え設定して顔認識処理を行うようにしてもよい。また、人の顔を認識する場合は、まず、人用の２次元顔認識モードを設定するようにしてもよい。
また、同じ種類の動物（たとえば、猫）の顔を認識するモードが、２次元顔認識モードと、３次元顔認識モードと２つある場合は、最初に２次元顔認識モードに設定して顔認識処理を行い、顔が認識されない場合に初めて、３次元顔認識モードに設定して顔認識処理を行うようにしてもよい。 Furthermore, a face recognition mode may be provided in consideration of the type of subject and the dimension of face recognition.
For example, in each of the above embodiments, the human face recognition mode includes a two-dimensional face recognition process and a three-dimensional face recognition process. The human face is recognized by the two-dimensional face recognition. Face recognition mode for each combination of the type of subject and the face recognition dimension, such as a human two-dimensional face recognition mode and a three-dimensional face recognition mode for humans that recognizes human faces through three-dimensional face recognition processing. You may make it provide.
For example, when the human two-dimensional face recognition mode is set, face recognition processing corresponding to the set human two-dimensional face recognition mode is performed. When no face is detected, the human three-dimensional face recognition mode is set. The face recognition processing may be performed by automatically switching to the face recognition mode. When recognizing a human face, first, a human two-dimensional face recognition mode may be set.
If there are two modes for recognizing the face of the same type of animal (for example, a cat), the two-dimensional face recognition mode and the three-dimensional face recognition mode, the two-dimensional face recognition mode is set first. When the recognition process is performed and the face is not recognized, the face recognition process may be performed by setting the three-dimensional face recognition mode for the first time.

また、上記各実施の形態で説明した被写体追従モードも顔認識モードの一種（追従顔認識モード）と考えることも可能であり、たとえば、図４のステップＳ２２、図１０のステップＳ１２１に進むと、追従顔認識モードに切り換えて設定し、該設定された追従顔認識モードに応じた顔認識処理を、つまり、図３のステップＳ１０、図９のステップＳ１０８で直近に認識された顔を２次元的に検出することにより該顔を追従していく動作を行うことになる。また、図４のステップＳ２４、図１０のステップＳ１２３で該追従している顔を見失ったと判断すると、追従顔認識モードの直前に設定されていた顔認識モードに再設定して、図３のステップＳステップＳ１０、図９のステップＳ１０８に戻るということになる。 The subject tracking mode described in each of the above embodiments can also be considered as a kind of face recognition mode (following face recognition mode). For example, when the process proceeds to step S22 in FIG. 4 and step S121 in FIG. The face recognition processing is set by switching to the following face recognition mode, that is, the face most recently recognized in step S10 in FIG. 3 and step S108 in FIG. Thus, the movement of following the face is performed. If it is determined in step S24 in FIG. 4 or step S123 in FIG. 10 that the following face has been lost, the face recognition mode set immediately before the following face recognition mode is reset, and the step in FIG. That is, the process returns to step S10 and step S108 in FIG.

（１５）また、上記変形例（０１）乃至（１４）を任意に組み合わせた態様であってもよい。 (15) Moreover, the aspect which combined the said modification (01) thru | or (14) arbitrarily may be sufficient.

（１６）また、本発明の上記実施形態は、何れも最良の実施形態としての単なる例に過ぎず、本発明の原理や構造、動作等をより良く理解することができるようにするために述べられたものであって、添付の特許請求の範囲を限定する趣旨のものでない。
したがって、本発明の上記実施形態に対してなされ得る多種多様な変形ないし修正はすべて本発明の範囲内に含まれるものであり、添付の特許請求の範囲によって保護されるものと解さなければならない。 (16) The above embodiments of the present invention are merely examples as the best embodiments, and are described in order to better understand the principle, structure, operation, and the like of the present invention. And is not intended to limit the scope of the appended claims.
Therefore, it should be understood that all the various variations and modifications that can be made to the above-described embodiments of the present invention are included in the scope of the present invention and protected by the appended claims.

最後に、上記各実施の形態においては、本発明の撮像装置をデジタルカメラ１に適用した場合について説明したが、上記の実施の形態に限定されるものではなく、要は、撮像された画像データに基づいて顔を認識することができる機器であれば適用可能である。 Finally, in each of the above embodiments, the case where the imaging device of the present invention is applied to the digital camera 1 has been described. However, the present invention is not limited to the above embodiment, and the main point is that the captured image data Any device capable of recognizing a face based on the above can be applied.

１デジタルカメラ
２撮影レンズ
３レンズ駆動ブロック
４絞り
５ＣＣＤ
６ドライバ
７ＴＧ
８ユニット回路
９画像生成部
１０ＣＰＵ
１１キー入力部
１２メモリ
１３ＤＲＡＭ
１４フラッシュメモリ
１５画像表示部
１６バス 1 Digital Camera 2 Shooting Lens 3 Lens Drive Block 4 Aperture 5 CCD
6 Driver 7 TG
8 Unit circuit 9 Image generator 10 CPU
11 Key input section 12 Memory 13 DRAM
14 Flash memory 15 Image display unit 16 Bus

Claims

Imaging means for imaging a subject;
Comprising a plurality of face recognition mode the contents of the face recognition processing to be executed is different each time to recognize the face of the subject, the face recognition mode setting means for setting one of the face recognition mode from a said plurality of face recognition mode ,
The image data captured by the imaging unit, by executing the face recognition mode setting means face recognition processing according to the set facial recognition mode, the face recognition in the imaging has been the image data Facial recognition means to
With
The face recognition mode setting means includes a plurality of face recognition modes corresponding to combinations of differences in the types of subjects to be recognized and differences in conditions other than the types of subjects as settable face recognition modes,
The face recognition unit automatically selects a face recognition process to be executed by identifying a difference in the type of the subject and a difference in conditions other than the type of the subject based on image data captured by the imaging unit. An imaging apparatus characterized by that.

The difference in conditions other than the type of subject is a difference in face recognition processing conditions,
The face recognition means
The image data captured by the imaging unit, by executing the face under the condition corresponding to the set face recognition mode by the recognition mode setting means face recognition process, the face in the image pickup has been the image data The imaging apparatus according to claim 1, wherein a face of a subject corresponding to the face recognition mode is recognized.

The condition of the face recognition process is
The imaging apparatus according to claim 1, comprising: a two-dimensional face recognition process for recognizing a face in two dimensions and a three-dimensional face recognition process for recognizing a face in three dimensions.

The face recognition mode setting means includes
The imaging apparatus according to claim 1, wherein a face recognition mode selected by a user is set.

The face recognition mode setting means includes
4. The imaging apparatus according to claim 1 , wherein the face recognition mode is automatically selected and set.

The difference in the type of subject is a human face and a non-human face,
The face recognition means
When the first face recognition mode for the human face is set by the face recognition mode setting means, the face recognition process for recognizing the human face is executed, so that it is in the captured image data. A human face is recognized, and when a second face recognition mode for a face other than a person is set by the face recognition mode setting means, a face other than a person is recognized. Item 6. The imaging device according to any one of Items 1 to 5.

The face recognition processing conditions include a condition for recognizing a face in two dimensions and a condition for recognizing a face in three dimensions,
The face recognition means
When the third face recognition mode corresponding to the condition for recognizing a face in two dimensions is set by the face recognition mode setting means, a two-dimensional face recognition process for recognizing the face in two dimensions is executed. If a fourth face recognition mode corresponding to a condition for recognizing a face in three dimensions is set by the face recognition mode setting means when a face in the imaged image data is recognized, three-dimensional The imaging apparatus according to claim 1 , wherein a face in the imaged image data is recognized by executing a three-dimensional face recognition process for recognizing the face .

The plurality of face recognition modes include a fifth face recognition mode for switching and executing face recognition processing,
The face recognition means
When the fifth face recognition mode is set by the face recognition mode setting means, a face recognition process for recognizing a person's face is executed to recognize a person's face in the captured image data. When a face of a person is not recognized by this face recognition process, it is switched to a face recognition process for recognizing a face of a type other than a person, so that a person other than the person in the captured image data is executed. The imaging apparatus according to claim 1 , wherein a type of face is recognized .

The plurality of face recognition modes include a sixth face recognition mode for switching and executing face recognition processing,
The face recognition means
When the sixth face recognition mode is set by the face recognition mode setting means, a face of a person in the captured image data is recognized by executing a two-dimensional face recognition process. When a person's face is not recognized by the recognition process, a face other than a person in the captured image data is recognized by switching to and executing the three-dimensional face recognition process. The imaging device according to claim 1 .

The face recognition means
When the first face recognition mode is set by the face recognition mode setting means , a two-dimensional face recognition process is performed on the image data picked up by the image pickup means, and the picked-up image data When the second face recognition mode is set by the face recognition mode setting means, a three-dimensional face recognition process is performed based on the image data picked up by the image pickup means. 7. The imaging apparatus according to claim 6 , wherein the imaging apparatus executes and recognizes a face of the type of the face recognition mode set by the face recognition mode setting means in the captured image data.

The second face recognition mode is further divided into a plurality of face recognition modes for recognizing each of a plurality of types of subjects other than humans,
The face recognition means
If the first face recognition mode by the face recognition mode setting means is set on the image data captured by the imaging means performs a 2-dimensional face recognition processing, image data which is image pickup When the second face recognition mode is set by the face recognition mode setting means, a face recognition process to be executed is performed according to the set type of face detection mode. By switching between the two-dimensional face recognition process and the three-dimensional face recognition process, the face of the type of face recognition mode set by the face recognition mode setting means in the image data picked up by the image pickup means is recognized. The imaging apparatus according to claim 10 .

The face recognition means
If the face cannot be recognized by the execution of the two-dimensional face recognition process, the three-dimensional face recognition process is executed based on the image data picked up by the image pickup means, and the inside of the picked-up image data The imaging apparatus according to claim 10, wherein a face in the area is recognized.

The face recognition means
Based on the image data of the entire area of the image data imaged by the imaging means, the central area of the angle of view of the image data, or the area specified by the user, the face recognition process is executed and the image is captured. 13. The imaging apparatus according to claim 1, wherein a face in the image data is recognized.

The face recognition means
Although the face could not be recognized by the execution of the two-dimensional face recognition process, but the face-like object could be recognized, it was recognized that there was the face-like image data captured by the imaging means. 13. The imaging apparatus according to claim 12, wherein a three-dimensional face recognition process is executed based on the image data of the region to recognize a face in the captured image data.

The face recognition means
The image pickup apparatus according to claim 1, wherein a face in the image data picked up by the image pickup means is detected.

The face recognition means
The imaging apparatus according to claim 1, wherein the imaging apparatus specifically identifies what face is in the image data captured by the imaging unit.

Two-dimensional face detection means for detecting a face in the image data picked up by the image pickup means;
The face recognition means
When a face is detected by the two-dimensional face detection means, a three-dimensional face recognition process is performed based on the image data of the face area detected by the two-dimensional face detection means of the image data picked up by the image pickup means. The imaging apparatus according to claim 16, wherein a face in the captured image data is identified by executing.

The plurality of face recognition modes include a seventh face recognition mode for switching and executing face recognition processing,
The face recognition means
When the seventh face recognition mode is set by the face recognition mode setting means, the face recognition processing is executed on the image data picked up by the image pickup means, and the face is recognized by this face recognition processing. The imaging apparatus according to claim 1, wherein after the recognition is performed, the detected face is switched to a subject tracking process that follows the movement of the recognized face in the image data.

The face recognition means
19. The image pickup apparatus according to claim 18, wherein when a human face cannot be tracked by the subject tracking process, the face recognition process set before switching to the subject tracking process is re-executed.

On the computer,
A face recognition mode setting process in which a plurality of face recognition modes each having different contents of face recognition processing executed when recognizing the face of the subject are provided, and any one of the plurality of face recognition modes is set. ,
A face recognition process for recognizing a face in the captured image data by executing a face recognition process corresponding to the face recognition mode set by the face recognition mode setting process on the image data;
A face recognition method for executing
The face recognition mode setting process includes a plurality of face recognition modes corresponding to combinations of differences in types of subjects to be recognized and differences in conditions other than the types of subjects as settable face recognition modes,
The face recognition process selects a face recognition process to be executed by automatically discriminating between a difference in type of the subject and a difference in conditions other than the type of subject based on the image data. Recognition method.

A computer equipped with an imaging means,
Face recognition mode setting means for providing a plurality of face recognition modes with different contents of face recognition processing executed when recognizing the face of the subject, and setting any one of the plurality of face recognition modes;
A face in the captured image data is recognized by executing face recognition processing corresponding to the face recognition mode set by the face recognition mode setting unit on the image data captured by the imaging unit. Facial recognition means,
Function as
The face recognition mode setting means includes a plurality of face recognition modes corresponding to combinations of differences in the types of subjects to be recognized and differences in conditions other than the types of subjects as settable face recognition modes,
The face recognition unit automatically selects a face recognition process to be executed by identifying a difference in the type of the subject and a difference in conditions other than the type of the subject based on image data captured by the imaging unit. A program characterized by that.