JP5016999B2

JP5016999B2 - Imaging apparatus, imaging method, and program

Info

Publication number: JP5016999B2
Application number: JP2007175461A
Authority: JP
Inventors: 博之星加
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2007-07-03
Filing date: 2007-07-03
Publication date: 2012-09-05
Anticipated expiration: 2027-07-03
Also published as: JP2009017135A

Description

The present invention relates to an imaging apparatus, an imaging method, and a program that detect and recognize human faces.

Techniques for recognizing a specific individual among the people in a moving image are currently known (see Patent Document 1 and Patent Document 2). According to Patent Document 1, installing a security camera at an automatic ticket vending machine in a station and performing subject recognition (recognition of a specific individual) makes it possible to prevent persons who are undesirable from a security standpoint from boarding. According to Patent Document 2, mounting a subject recognition function on a robot or the like enhances its entertainment value.

Subject recognition generally requires comparing a large number of pre-registered subjects (human faces) with the subjects (human faces) detected in a captured image, and therefore requires relatively high processing capability. Consequently, to perform subject recognition on a moving image in real time, as disclosed in Patent Document 1 and Patent Document 2, an imaging apparatus such as a security camera must have abundant computing resources.
JP 2003-187352 A
JP 2003-271958 A

However, equipping an imaging apparatus with abundant computing resources causes various problems, such as higher cost, higher power consumption, and a larger apparatus. These problems are particularly serious for imaging apparatuses that are expected to be compact, such as digital cameras. If an imaging apparatus with only limited computing resources is made to perform subject recognition, a relatively long time elapses between image acquisition and completion of the recognition processing. A specific individual therefore cannot be tracked in real time, which is not practical.

The present invention has been made in view of this situation, and its object is to provide a technique that enables subject recognition processing to be executed at high speed while suppressing an increase in the processing load on the imaging apparatus.

To solve the above problem, a first aspect of the present invention provides an imaging apparatus comprising: imaging means for capturing an optical image of a subject to acquire a captured image; detection means for detecting human faces in a captured image; main recognition means for recognizing an individual for each human face detected by the detection means, using feature information on the facial features of individuals acquired from a storage unit and taking a longer processing time than the face detection by the detection means; first assigning means for assigning identification information to each human face detected in a first captured image; second assigning means for assigning, for two captured images acquired one after the other in time, to a human face that is detected in the later captured image and that satisfies at least one of (i) the difference in position from any one of the faces assigned identification information in the earlier captured image being equal to or less than a threshold and (ii) the difference in size from that face being equal to or less than a threshold, the identification information assigned to that face; and auxiliary recognition means for, while the main recognition means is processing the human faces detected in a second captured image acquired after the main recognition means completed its processing of the human faces detected in the first captured image, causing the second assigning means to repeatedly assign identification information to captured images acquired successively in time, and thereby recognizing, among the human faces detected in a third captured image acquired after the second captured image, a face assigned the same identification information as an individual recognized by the main recognition means in the first captured image as that individual.
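The first and second assigning means described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Face` record, the function names, the use of the face centre and width as "position" and "size", and the threshold values are not taken from the patent. The match condition uses `or` because the text requires only at least one of the two differences to be within its threshold; a practical implementation might well require both.

```python
import math
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Face:
    x: float                        # face centre, pixels (assumed representation)
    y: float
    size: float                     # face width, pixels (assumed representation)
    track_id: Optional[int] = None  # identification information, not a person's identity


def assign_new_ids(faces: List[Face], start: int = 1) -> None:
    """First assigning means: give each detected face its own identifier."""
    for offset, face in enumerate(faces):
        face.track_id = start + offset


def propagate_ids(earlier: List[Face], later: List[Face],
                  pos_thresh: float = 20.0, size_thresh: float = 10.0) -> None:
    """Second assigning means: copy an identifier from the earlier frame to a
    face in the later frame whose position or size is close enough."""
    for new in later:
        for old in earlier:
            if old.track_id is None:
                continue
            pos_diff = math.hypot(new.x - old.x, new.y - old.y)
            size_diff = abs(new.size - old.size)
            if pos_diff <= pos_thresh or size_diff <= size_thresh:
                new.track_id = old.track_id
                break


frame1 = [Face(100, 100, 50), Face(300, 120, 80)]
assign_new_ids(frame1)
frame2 = [Face(305, 118, 82), Face(104, 103, 51), Face(600, 400, 200)]
propagate_ids(frame1, frame2)
print([f.track_id for f in frame2])
```

A face in the later frame that matches nothing keeps `track_id = None` in this sketch; how such a face is handled is left open here.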

A second aspect of the present invention provides an imaging method comprising: an imaging step in which imaging means captures an optical image of a subject to acquire a captured image; a detection step in which detection means detects human faces in a captured image; a main recognition step in which main recognition means recognizes an individual for each human face detected in the detection step, using feature information on the facial features of individuals acquired from a storage unit and taking a longer processing time than the detection step; a first assigning step in which first assigning means assigns identification information to each human face detected in a first captured image; a second assigning step in which second assigning means assigns, for two captured images acquired one after the other in time, to a human face that is detected in the later captured image and that satisfies at least one of (i) the difference in position from any one of the faces assigned identification information in the earlier captured image being equal to or less than a threshold and (ii) the difference in size from that face being equal to or less than a threshold, the identification information assigned to that face; and an auxiliary recognition step in which auxiliary recognition means, while the main recognition step is being performed on the human faces detected in a second captured image acquired after completion of the main recognition step for the human faces detected in the first captured image, causes the second assigning means to repeatedly assign identification information to captured images acquired successively in time, and thereby recognizes, among the human faces detected in a third captured image acquired after the second captured image, a face assigned the same identification information as an individual recognized in the main recognition step for the first captured image as that individual.

A third aspect of the present invention provides a program for causing a computer to function as: imaging means for capturing an optical image of a subject to acquire a captured image; detection means for detecting human faces in a captured image; main recognition means for recognizing an individual for each human face detected by the detection means, using feature information on the facial features of individuals acquired from a storage unit and taking a longer processing time than the face detection by the detection means; first assigning means for assigning identification information to each human face detected in a first captured image; second assigning means for assigning, for two captured images acquired one after the other in time, to a human face that is detected in the later captured image and that satisfies at least one of (i) the difference in position from any one of the faces assigned identification information in the earlier captured image being equal to or less than a threshold and (ii) the difference in size from that face being equal to or less than a threshold, the identification information assigned to that face; and auxiliary recognition means for, while the main recognition means is processing the human faces detected in a second captured image acquired after the main recognition means completed its processing of the human faces detected in the first captured image, causing the second assigning means to repeatedly assign identification information to captured images acquired successively in time, and thereby recognizing, among the human faces detected in a third captured image acquired after the second captured image, a face assigned the same identification information as an individual recognized by the main recognition means in the first captured image as that individual.

Other features of the present invention will become more apparent from the accompanying drawings and the following description of the best mode for carrying out the invention.

With the above configuration, the present invention makes it possible to execute subject recognition processing at high speed while suppressing an increase in the processing load on the imaging apparatus.

Embodiments of the present invention will be described below with reference to the accompanying drawings. The individual embodiments described below should help in understanding various concepts of the present invention, from generic concepts to more specific ones.

The technical scope of the present invention is defined by the claims and is not limited by the individual embodiments below. Moreover, not every combination of the features described in the embodiments is essential to the present invention.

[First Embodiment]
<Configuration of imaging device>
An embodiment in which the imaging apparatus of the present invention is applied to a digital camera will be described.

FIG. 1 is a block diagram illustrating the functional configuration of the digital camera 100 according to the first embodiment.

Reference numeral 101 denotes an optical system including a zoom lens, a focus lens, and an aperture; 102 denotes a mechanical shutter; 103 denotes an image sensor; and 104 denotes a CDS circuit that performs analog signal processing. Reference numeral 105 denotes an A/D converter that converts an analog signal into a digital signal, and 106 denotes a timing signal generation circuit that generates the signals for operating the image sensor 103, the CDS circuit 104, and the A/D converter 105.

Reference numeral 107 denotes a drive circuit that drives the optical system 101, the mechanical shutter 102, and the image sensor 103, and 108 denotes a signal processing circuit that applies the necessary signal processing to captured image data. The signal processing circuit 108 includes a face recognition circuit 122 and a face detection circuit 126.

The face detection circuit 126 detects human faces in the captured image represented by the digital signal (that is, the image data) output from the A/D converter 105. The face recognition circuit 122 performs recognition processing (processing for recognizing an individual) on the human faces detected by the face detection circuit 126. To shorten the time required for face detection, the face detection circuit 126 may instead detect human faces in the captured image represented by image data that the signal processing circuit 108 has resized to a smaller size.
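When detection runs on a downsized image, the resulting face positions and sizes are expressed in the small image's coordinates. The following Python sketch shows one way such a result could be mapped back to the full-resolution image. The function name, the `(x, y, width, height)` box layout, and the image sizes are assumptions for illustration only; the patent does not describe this step.

```python
def scale_box_to_full_res(box, small_size, full_size):
    """Map a face box found in a downsized image back to full resolution.

    box        : (x, y, width, height) in the downsized image, in pixels
    small_size : (width, height) of the downsized image
    full_size  : (width, height) of the original image
    """
    x, y, w, h = box
    sx = full_size[0] / small_size[0]   # horizontal scale factor
    sy = full_size[1] / small_size[1]   # vertical scale factor
    return (x * sx, y * sy, w * sx, h * sy)


# A face found at (40, 30) with size 20x20 in a 320x240 preview,
# mapped into a 3200x2400 full-resolution frame.
print(scale_box_to_full_res((40, 30, 20, 20), (320, 240), (3200, 2400)))
```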

Known face detection techniques can be used for face detection in this embodiment.

Known face detection techniques include learning-based methods that use neural networks and the like, and methods that use template matching to search an image for parts with characteristic shapes, such as the eyes, nose, and mouth, and regard a region as a face if the similarity is high. Many other methods have been proposed, such as methods that detect image feature quantities such as skin color and eye shape and apply statistical analysis. In general, several of these methods are combined to improve the accuracy of face detection.

A specific example is the face detection method using wavelet transform and image feature quantities described in JP 2002-251380 A.

Known face recognition techniques can likewise be used for face recognition in this embodiment. For example, the face recognition circuit 122 can perform face recognition processing by acquiring feature information on the facial features of individuals from a storage unit such as the ROM 115 and comparing it with the feature information of a human face detected by the face detection circuit 126.
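The comparison of a detected face's feature information against registered feature information can be sketched as a nearest-neighbour search. This is only an illustration under stated assumptions: the patent does not specify how features are represented or compared, so the feature vectors, the Euclidean distance, the `max_distance` threshold, and the names `recognize`, `alice`, and `bob` are all hypothetical.

```python
import math


def recognize(feature, registered, max_distance=0.6):
    """Return the registered name whose feature vector is closest to
    `feature`, or None if even the best match is farther than `max_distance`."""
    best_name, best_dist = None, float("inf")
    for name, reference in registered.items():
        dist = math.dist(feature, reference)   # Euclidean distance (assumed metric)
        if dist < best_dist:
            best_name, best_dist = name, dist
    return best_name if best_dist <= max_distance else None


# Hypothetical registered feature information, standing in for data held in the ROM.
registered = {
    "alice": [0.1, 0.9, 0.3],
    "bob": [0.8, 0.2, 0.5],
}
print(recognize([0.12, 0.88, 0.31], registered))
print(recognize([5.0, 5.0, 5.0], registered))
```

The cost of this loop grows with the number of registered individuals, which is consistent with the description's point that recognition takes longer than detection.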

Reference numeral 109 denotes an image memory that stores image data processed by the signal processing circuit 108, the results of the face recognition and face detection processing by the face recognition circuit 122 and the face detection circuit 126, and the like. Reference numeral 110 denotes a recording medium, such as a memory card, that is removable from the digital camera 100, and 111 denotes a recording circuit that records the image data processed by the signal processing circuit 108 on the recording medium 110.

Reference numeral 112 denotes an image display device, such as a liquid crystal display, that displays image data, and 113 denotes a display circuit that displays the image data processed by the signal processing circuit 108 on the image display device 112. Reference numeral 114 denotes a system control unit that controls the entire digital camera 100. Reference numeral 115 denotes a nonvolatile memory (ROM) that stores the program executed by the system control unit 114, control data such as parameters and tables used when the program is executed, and correction data such as defect addresses.

Reference numeral 116 denotes a volatile memory (RAM) that temporarily stores the program, control data, and correction data stored in the ROM 115 and that the system control unit 114 uses as a work area when executing the program. Reference numeral 117 denotes a strobe.

The system control unit 114 includes a face recognition operation control unit 123 that controls the operation of the face recognition circuit 122. The optical system 101 includes an IS (Image Stabilizer) lens for camera shake correction. Reference numeral 124 denotes a shake detection unit that includes a vibration gyro or the like and can detect shake of the digital camera 100. The drive circuit 107 performs camera shake correction by driving the IS lens based on information supplied by the shake detection unit 124.

Reference numeral 118 denotes an operation unit that the user uses to set shooting conditions, select a shooting mode, and so on. Reference numeral 121 denotes a main switch (main SW) for turning on the power of the digital camera 100. Reference numeral 119 denotes a switch (SW1) that turns ON when a shutter button (not shown) is half-pressed; when SW1 119 turns ON, the system control unit 114 performs a shooting standby operation. Reference numeral 120 denotes a switch (SW2) that turns ON when the shutter button (not shown) is fully pressed. When SW2 120 turns ON, the system control unit 114 performs a shooting operation in which the image data output from the A/D converter 105 is processed by the signal processing circuit 108 and other components and then recorded on the recording medium 110.

Reference numeral 125 denotes a zoom key that the user uses to control the zoom of the optical system 101.

A shooting operation that uses the mechanical shutter 102 in the digital camera 100 configured as described above is explained below. Prior to the shooting operation, when the system control unit 114 starts operating (for example, when the digital camera 100 is powered on), the system control unit 114 transfers the necessary program, control data, and correction data from the ROM 115 to the RAM 116 and stores them there. In addition to these programs and data, the system control unit 114 can transfer further programs and data from the ROM 115 to the RAM 116 as needed. The system control unit 114 may also read and use data in the ROM 115 directly.

First, in response to a control signal from the system control unit 114, the optical system 101 drives the aperture and lenses to form an optical image of the subject, set to an appropriate brightness, on the image sensor 103. Next, in response to a control signal from the system control unit 114, the mechanical shutter 102 is driven to shield the image sensor 103 from light in step with the operation of the image sensor 103 so that the exposure time is appropriate. If the image sensor 103 has an electronic shutter function, it may be used together with the mechanical shutter 102 to adjust the exposure time.

The image sensor 103 is driven by drive pulses based on the operation pulses generated by the timing signal generation circuit 106, which is controlled by the system control unit 114. The image sensor 103 converts the optical image of the subject into an electrical signal by photoelectric conversion and outputs it as an analog image signal. Using the operation pulses generated by the timing signal generation circuit 106, the CDS circuit 104 removes clock-synchronous noise from the analog image signal output from the image sensor 103. The A/D converter 105 then converts the signal into a digital image signal (that is, image data).
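The analog front end described here (CDS followed by A/D conversion) can be illustrated numerically. The sketch below assumes the usual meaning of CDS, correlated double sampling, in which each pixel's reset level is subtracted from its signal level so that noise common to both samples cancels; the patent itself only states that the CDS circuit removes clock-synchronous noise. The function names, millivolt values, full-scale range, and 10-bit depth are assumptions for illustration.

```python
def correlated_double_sample(reset_levels, signal_levels):
    """Subtract each pixel's reset level from its signal level."""
    return [signal - reset for reset, signal in zip(reset_levels, signal_levels)]


def quantize(value_mv, full_scale_mv=1000, bits=10):
    """Convert an analog level (in millivolts) to an unsigned digital code."""
    max_code = (1 << bits) - 1
    code = int(value_mv / full_scale_mv * (1 << bits))
    return max(0, min(max_code, code))   # clip to the converter's range


reset = [100, 120, 110]      # per-pixel reset levels, mV (made-up values)
signal = [600, 120, 1110]    # per-pixel signal levels, mV (made-up values)
clean = correlated_double_sample(reset, signal)
print(clean)
print([quantize(v) for v in clean])
```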

Next, the signal processing circuit 108, controlled by the system control unit 114, performs color conversion, white balance processing, gamma correction, resolution conversion, image compression, and similar processing on the image data. The image memory 109 is used to temporarily store image data being processed by the signal processing circuit 108, to store processed image data, and to temporarily store the results of face recognition processing.
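Two of the processing stages named above, white balance and gamma correction, can be sketched on 8-bit pixel values as follows. This is a generic textbook formulation, not the patent's implementation: per-channel gains for white balance and a simple power-law curve with an assumed gamma of 2.2. The function names and sample values are hypothetical.

```python
def white_balance(rgb, gains):
    """Scale each colour channel by its gain and clip to the 8-bit range."""
    return tuple(min(255, round(channel * gain)) for channel, gain in zip(rgb, gains))


def gamma_correct(value, gamma=2.2):
    """Apply a power-law tone curve to one 8-bit value (0..255)."""
    return round(255 * (value / 255) ** (1 / gamma))


pixel = (100, 100, 100)
balanced = white_balance(pixel, (2.0, 1.0, 1.5))
print(balanced)
print(tuple(gamma_correct(c) for c in balanced))
```

With a gamma above 1, mid-tones are brightened while 0 and 255 map to themselves.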

The image data processed by the signal processing circuit 108 and the image data stored in the image memory 109 are converted by the recording circuit 111 into data suited to the recording medium 110 (for example, file system data with a hierarchical structure) and recorded on the recording medium 110. The image data is also subjected to resolution conversion by the signal processing circuit 108, converted by the display circuit 113 into a signal suited to the image display device 112 (for example, an NTSC or PAL analog signal), and displayed on the image display device 112.

Here, the system control unit 114 can also use a control signal to instruct the signal processing circuit 108 to pass the image data output from the A/D converter 105 directly to the image memory 109 or the recording circuit 111 without applying signal processing. When requested by the system control unit 114, the signal processing circuit 108 outputs to the system control unit 114 information on the image data generated in the course of signal processing (for example, the spatial frequency of the image, the average value of a designated area, or the data amount of the compressed image), or information extracted from such information. Likewise, when requested by the system control unit 114, the recording circuit 111 outputs information such as the type and free capacity of the recording medium 110 to the system control unit 114.

Next, the operation of reproducing image data recorded on the recording medium 110 is described. In response to a control signal from the system control unit 114, the recording circuit 111 reads image data from the recording medium 110. If the image data is compressed, the signal processing circuit 108 decompresses it in response to a control signal from the system control unit 114 and stores it in the image memory 109. The image data stored in the image memory 109 is subjected to resolution conversion by the signal processing circuit 108, converted by the display circuit 113 into a signal suited to the image display device 112, and displayed on the image display device 112.

<Face detection processing and face recognition processing>
FIG. 2 is a flowchart showing the flow of shooting processing using the digital camera 100. The processing of this flowchart is executed when the digital camera 100 is powered on and has been set, via the operation unit 118 or the like, to perform face recognition processing. Although not shown, while the processing of the flowchart of FIG. 2 is being executed, the components from the optical system 101 through the A/D converter 105, together with the signal processing circuit 108, continuously capture the optical image of the subject and generate image data. The image data used in each of the following steps is image data generated in this way.

In S201, the face detection circuit 126 detects human faces in the captured image represented by the image data output from the A/D converter 105 (face detection processing). The face detection circuit 126 temporarily stores face information for each detected face in the image memory 109. The face information includes at least one of the position and the size of the face in the captured image. If no face is detected, information indicating this is stored in the image memory 109.

In S202, the system control unit 114 determines whether the face recognition circuit 122 is currently executing processing for recognizing an individual (face recognition processing) on human faces detected by the face detection circuit 126. The system control unit 114 makes this determination by querying the face recognition operation control unit 123 about the state of the face recognition circuit 122.

If the face recognition circuit 122 is executing face recognition processing, a previously started face recognition process has not yet completed. In an apparatus with relatively limited computing resources, face recognition takes longer than face detection, so face recognition cannot be performed every time face detection is performed. The determination in S202 is therefore necessary.

If the face recognition circuit 122 is found in S202 to be executing face recognition processing, the process proceeds to S204; otherwise, it proceeds to S203.

In S203, the face recognition operation control unit 123 causes the face recognition circuit 122 to start face recognition processing on the faces detected in S201. In the face recognition processing, for example, it is determined whether the faces detected in S201 include a person registered in advance in the ROM 115. Also in S203, under the control of the face recognition operation control unit 123, the face detection circuit 126 assigns identification information to each of the faces detected in S201 (the faces subject to face recognition processing) (first assigning means). This identification information does not identify an individual; it indicates whether a face detected at one timing is the same as a face detected at another timing. The process then proceeds to S204.
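The effect of the S202 check can be seen in a small simulation of the per-frame loop. The sketch below is a simplification under stated assumptions: time is counted in frames, face recognition is assumed to occupy the recognizer for a fixed number of frames (`recog_frames`), and detection and comparison are represented only by comments. The function name and parameters are hypothetical.

```python
def recognition_start_frames(num_frames, recog_frames=3):
    """Return the indices of the frames on which face recognition is started."""
    started = []
    busy_until = 0   # first frame index at which the recognizer is free again
    for frame in range(num_frames):
        # S201: face detection runs on every frame (not modelled here).
        recognizing = frame < busy_until          # S202: recognizer still busy?
        if not recognizing:
            started.append(frame)                 # S203: start recognition, assign IDs
            busy_until = frame + recog_frames
        # S204: comparison with earlier detections runs on every frame.
    return started


print(recognition_start_frames(10, recog_frames=3))
```

In this model, recognition starts only on a subset of frames, while the lightweight steps run on all of them.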

In S204, the face detection circuit 126 performs comparison processing. The details are described later with reference to FIG. 3; in brief, the face detection circuit 126 compares the faces detected in S201 with previously detected faces and determines whether faces detected at the two timings belong to the same person.

More specifically, suppose that face detection processing, together with face recognition processing, has been performed on a first captured image acquired at a certain timing. The face detection circuit 126 then searches the faces detected in the first captured image for a face that satisfies a predetermined first relationship with any of the faces detected in a second captured image acquired after the face recognition circuit 122 completed its processing (first search means). The face detection circuit 126 recognizes the face in the second captured image that corresponds to a face found by this search as the individual that the face recognition circuit 122 (main recognition means) recognized for the found face in the first captured image (auxiliary recognition means).

That is, for a face that satisfies the predetermined first relationship between the first captured image and the second captured image, the face detection circuit 126 can recognize the individual based on the result of the earlier face recognition processing, without the face recognition circuit 122, whose processing time is long, performing face recognition processing again.

In S205, the system control unit 114 controls the display circuit 113 to display, on the image display device 112, information about the individual recognized by the face recognition circuit 122 or about the individual recognized as a result of the comparison processing in S204. The information about an individual is, for example, the name of a specific individual, and is stored in the ROM 115 together with feature information indicating facial features.

In S206, the system control unit 114 determines whether the SW1 119 is ON. If it is ON, the process proceeds to S207; otherwise the process returns to S201.

In S207, the system control unit 114 performs shooting standby operations such as AF processing and AE processing using the drive circuit 107, the signal processing circuit 108, and the like. Specifically, the signal processing circuit 108 calculates the subject luminance from the image data output by the A/D converter 105. Based on the calculation result, the drive circuit 107 then adjusts the exposure time by controlling the aperture of the optical system 101, the mechanical shutter 102, and the like. The signal processing circuit 108 also detects the focus of the subject from the image data output by the A/D converter 105, and the drive circuit 107 drives the zoom lens of the optical system 101 based on the detection result.

Here, the AF processing and AE processing may be performed based on information about the individual recognized by the face recognition processing or the comparison processing. For example, when the photographer's family members are among the subjects, the drive circuit 107 drives the optical system 101 and the like so that focus and exposure are appropriate for the family members in preference to other persons. The system control unit 114 can therefore set the shooting conditions used at the time of imaging (when a captured image is acquired) based on the individual recognized by the face detection circuit 126 acting as the auxiliary recognition means.
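The preference for recognized individuals when choosing the AF/AE target can be sketched as follows. This is only an illustration under stated assumptions: the `RecognizedFace` type, the `pick_metering_target` function, the `priority_people` set, and the fallback to the largest face are hypothetical and are not taken from the embodiment.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Set


@dataclass
class RecognizedFace:
    x: int                  # face center, horizontal (pixels)
    y: int                  # face center, vertical (pixels)
    size: int               # face size (pixels)
    person: Optional[str]   # recognized individual, or None if unknown


def pick_metering_target(
    faces: Sequence[RecognizedFace], priority_people: Set[str]
) -> Optional[RecognizedFace]:
    """Choose the face that focus and exposure should favor.

    Faces recognized as priority individuals (e.g. family members) win
    over all other faces. The tie-break by largest size is an assumption.
    """
    if not faces:
        return None
    prioritized = [f for f in faces if f.person in priority_people]
    candidates = prioritized or list(faces)
    return max(candidates, key=lambda f: f.size)
```

With one unrecognized large face and one smaller face recognized as a family member, the sketch selects the family member; with no priority individuals registered, it falls back to the largest face.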

In S208, the system control unit 114 determines whether the SW2 120 is ON. If it is ON, the process proceeds to S210; otherwise the process proceeds to S209.

In S209, the system control unit 114 determines whether the SW1 119 remains ON. If it remains ON, the process returns to S208; if it is no longer ON, the process returns to S201.

In S210, the system control unit 114 performs the shooting operation and returns to S209.
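The branching among S206 to S210 can be summarized as a small transition function. This is a minimal sketch: the function name `next_step` and the string step labels are hypothetical, and the transition from S207 to S208 is assumed from the order in which the steps are described.

```python
def next_step(step: str, sw1_on: bool, sw2_on: bool) -> str:
    """Return the step that follows `step` for the given switch states."""
    if step == "S206":                        # is SW1 ON?
        return "S207" if sw1_on else "S201"
    if step == "S207":                        # AF/AE standby, then check SW2
        return "S208"
    if step == "S208":                        # is SW2 ON?
        return "S210" if sw2_on else "S209"
    if step == "S209":                        # is SW1 still ON?
        return "S208" if sw1_on else "S201"
    if step == "S210":                        # shooting operation
        return "S209"
    raise ValueError(f"unknown step: {step}")
```

As long as SW1 stays ON, the flow cycles between S208 and S209 (shooting via S210 whenever SW2 is ON); releasing SW1 returns the flow to face detection in S201.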

As described in the steps above, face detection processing, face recognition processing, and comparison processing are performed, and display processing on the image display device 112, the shooting operation, and the like are performed based on the results of these processes.

<Details of the comparison processing>
Next, the detailed flow of the comparison processing in S204 of FIG. 2 is described with reference to FIG. 3. In this flowchart, the timing at which the face detection circuit 126 performed the face detection processing in S201 is taken as time t = n. The face detection circuit 126 compares each detected face with the faces detected at the previous timing (t = n−k) and determines whether they are the same. Here, k is the interval at which the face detection processing is performed.

The timing chart of FIG. 4 shows the temporal relationship among A. the face detection processing in S201, B. the face recognition processing in S203, and C. the comparison processing in S204.

FIG. 5 is a diagram showing the faces detected in the captured images at times t = n−k and t = n. The three faces detected at time t = n−k have been assigned "A", "B", and "C" as identification information.

First, in S301 of FIG. 3, the face detection circuit 126 selects one face Cn from among the faces detected at t = n.

In S302, the face detection circuit 126 selects one face Cn−k from among the faces detected at t = n−k.

In S303, the face detection circuit 126 compares the position (face center coordinates) of the face Cn with the position of the face Cn−k. Assume that the position of the face Cn is (xd, yd) and the position of the face Cn−k is (xa, ya). If the differences ΔX = |xd − xa| and ΔY = |yd − ya| are each equal to or less than the predetermined thresholds ThX and ThY (ΔX ≦ ThX and ΔY ≦ ThY), the process proceeds to S304. Otherwise, the process proceeds to S306.

In S304, the face detection circuit 126 compares the size of the face Cn with the size of the face Cn−k. Assume that the size of the face Cn is Fd and the size of the face Cn−k is Fa. If the difference between the two, ΔF = |Fd − Fa|, is equal to or less than a predetermined threshold (ΔF ≦ ThF), the process proceeds to S305. Otherwise, the process proceeds to S306.

In S305, because the face Cn and the face Cn−k are close in position and size, the face detection circuit 126 determines that they are the same person and assigns to the face Cn the same identification information as the face Cn−k (second assignment control means). The process then proceeds to S308.

In S306, on the other hand, the face detection circuit 126 determines that the face Cn and the face Cn−k are different persons, and the process proceeds to S307.

In S307, the face detection circuit 126 determines whether every face detected at t = n−k has been selected as the face Cn−k. If all have been selected, the process proceeds to S308; otherwise it returns to S302. Reaching S308 from S307 rather than from S305 means that no face judged to be the same person as the face Cn was found at t = n−k. In this case, no identification information is assigned to the face Cn. The face detection circuit 126 may also be configured not to select, as the comparison target Cn−k, any face to which no identification information was assigned at t = n−k.

In S308, the face detection circuit 126 determines whether every face detected at t = n has been selected as the face Cn. If all have been selected, the process proceeds to S205. Otherwise, the process returns to S301 and the same processing is repeated for the other faces detected at t = n. In the example of FIG. 5, each of the three faces detected at t = n is selected as Cn in turn and compared with the three faces detected at t = n−k.
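The loop of S301 to S308 can be sketched in software as follows. The embodiment describes a circuit, so this is only an illustration: the `Face` type, the function names, and the threshold values used in the usage note are hypothetical. Faces without identification information at t = n−k are skipped, which corresponds to the optional configuration mentioned for S307.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Face:
    x: float                     # face center, horizontal
    y: float                     # face center, vertical
    size: float                  # face size
    ident: Optional[str] = None  # identification information, if assigned


def is_same_person(cur: Face, prev: Face,
                   th_x: float, th_y: float, th_f: float) -> bool:
    # S303: position check, dX <= ThX and dY <= ThY
    if abs(cur.x - prev.x) > th_x or abs(cur.y - prev.y) > th_y:
        return False
    # S304: size check, dF <= ThF
    return abs(cur.size - prev.size) <= th_f


def carry_over_ids(current: List[Face], previous: List[Face],
                   th_x: float, th_y: float, th_f: float) -> List[Face]:
    for cur in current:              # S301 / S308: every face at t = n
        for prev in previous:        # S302 / S307: every face at t = n-k
            if prev.ident is None:   # optional: skip faces without an ID
                continue
            if is_same_person(cur, prev, th_x, th_y, th_f):
                cur.ident = prev.ident   # S305: same identification info
                break
        # If no match was found, cur.ident stays None (no ID assigned).
    return current
```

For example, with previous faces labeled "A", "B", and "C" at (100, 100), (300, 120), and (500, 90), and thresholds of 20 for position and 10 for size, current faces near the first two positions inherit "A" and "B", while a face that appears at (700, 90) receives no identification information.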

By repeating these processes at a predetermined timing (time interval k), the same person can be tracked at the time interval k.

FIG. 4 shows this series of operations in time sequence. Face recognition processing A(n) is performed based on the face detection result F(n−k) obtained in S201. Face detection processing continues at the time interval k while the face recognition processing is in progress, and the comparison processing is performed as well. A face that the comparison processing determines to be the same person as a face detected in F(n−k) is assigned the result of the previous face recognition processing A(n−9k) until t = n+9k, when the result of A(n) becomes available.

As a result, even while the face recognition circuit 122 is performing face recognition, the face detection circuit 126 performs face recognition (auxiliary recognition) and the individual of each detected face is identified. Likewise, for faces detected after the result of A(n) is obtained, the individual can be recognized based on the result of A(n) even while A(n+9k) is being processed.

The processing described above can be summarized as follows. The scope of the present invention is not, however, limited to the following description.

Consider two captured images acquired successively in time, for example a captured image acquired at time t = n−k (the earlier-acquired captured image) and a captured image generated at the next timing, time t = n (the later-acquired captured image). The face detection circuit 126 searches the faces of persons detected in the earlier-acquired captured image for a face that satisfies a predetermined second relationship with any of the faces of persons detected in the later-acquired captured image (second search means). For example, the face detection circuit 126 searches the faces detected in the earlier-acquired captured image for a face whose face information differs by no more than a threshold from the face information of any of the faces detected in the later-acquired captured image, and treats such a face as satisfying the predetermined second relationship. The face information can include at least one of the position and the size of the face in the captured image.

The face detection circuit 126 assigns, to the face of the person in the later-acquired captured image that corresponds to the face found in the earlier-acquired captured image, the identification information assigned to the found face (second assigning means).

The face detection circuit 126 searches the faces of persons detected in the first captured image for a face that satisfies a predetermined first relationship with any of the faces detected in a second captured image acquired after the face recognition circuit 122 has completed its processing of the first captured image (first search means). The face detection circuit 126 recognizes the face in the second captured image that corresponds to the face found in the first captured image as the individual that the face recognition circuit 122 (main recognition means) recognized for the found face in the first captured image (auxiliary recognition means).

Here, a face satisfying the predetermined first relationship is found as follows. After the first captured image is acquired and until the second captured image is reached, the search by the second search means and the assignment by the second assigning means described above are repeated for each pair of captured images acquired successively in time. As a result, a face in the first captured image that has the same identification information as the identification information assigned to a face in the second captured image is a face that satisfies the predetermined first relationship.
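The way identification information links the first captured image to a later one can be sketched as below. This is an illustrative sketch under stated assumptions: faces are plain `(x, y, size)` tuples, a single position threshold `th_pos` is used for both axes, the function names are hypothetical, and `recognized` stands in for the result of the main recognition on the first captured image.

```python
from typing import Dict, List, Optional, Tuple

FaceInfo = Tuple[float, float, float]  # (x, y, size)


def match_ids(prev: List[FaceInfo], prev_ids: List[Optional[str]],
              cur: List[FaceInfo],
              th_pos: float, th_size: float) -> List[Optional[str]]:
    """Second relationship: copy an ID to each later face that is close
    in position and size to an earlier face holding that ID."""
    ids: List[Optional[str]] = []
    for x, y, s in cur:
        found = None
        for (px, py, ps), pid in zip(prev, prev_ids):
            if (pid is not None and abs(x - px) <= th_pos
                    and abs(y - py) <= th_pos and abs(s - ps) <= th_size):
                found = pid
                break
        ids.append(found)
    return ids


def recognize_by_chain(frames: List[List[FaceInfo]],
                       first_ids: List[Optional[str]],
                       recognized: Dict[str, str],
                       th_pos: float, th_size: float) -> List[Optional[str]]:
    """First relationship: repeat the pairwise ID assignment frame by
    frame, then look up the individual recognized for each surviving ID."""
    ids = list(first_ids)
    for prev, cur in zip(frames, frames[1:]):
        ids = match_ids(prev, ids, cur, th_pos, th_size)
    return [recognized.get(i) if i is not None else None for i in ids]
```

A face in the last frame is recognized only if its identification information traces back, through every intermediate frame, to a face for which the main recognition produced an individual.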

The face detection circuit 126 may also make the predetermined threshold described above (ThF) smaller as the number of faces detected in the earlier-acquired captured image or in the later-acquired captured image increases. Because the threshold becomes smaller the more densely faces are crowded together, the possibility that another person's face is mistakenly judged to be the face of the same person is reduced.
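One possible way to shrink the threshold with the number of detected faces is sketched below. The text only states that the threshold decreases as the face count grows; the inverse-proportional rule, the lower bound, and the name `adaptive_threshold` are assumptions made for illustration.

```python
def adaptive_threshold(base: float, face_count: int,
                       min_threshold: float = 1.0) -> float:
    """Return a threshold that never grows as face_count increases.

    Assumed rule: base / face_count, clamped below at min_threshold.
    """
    return max(min_threshold, base / max(1, face_count))
```

Any rule that is monotonically non-increasing in the face count would serve the same purpose; the clamp simply keeps the threshold from collapsing to zero in crowded scenes.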

In the example of FIG. 4, the next face recognition processing starts as soon as the face recognition processing by the face recognition circuit 122 is completed. However, when the face detection circuit 126 has been able to recognize an individual for every detected face as a result of the comparison processing, the system control unit 114 may prohibit the face recognition circuit 122 from executing the recognition processing. This is expected to reduce power consumption.
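The prohibition condition can be expressed as a simple check. In this sketch, `persons` is a hypothetical list holding, for each detected face, the individual obtained by the comparison processing, or `None` if none was obtained; treating an empty list as "nothing to recognize" is an assumption.

```python
from typing import List, Optional


def should_run_main_recognition(persons: List[Optional[str]]) -> bool:
    """Run the slow main recognition only if some detected face has not
    been resolved to an individual by the comparison processing."""
    return any(p is None for p in persons)
```

When every detected face already has an individual, the function returns `False` and the main recognition can be skipped for that cycle.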

As described above, according to the present embodiment, while the face recognition circuit 122 is performing face recognition processing, the face detection circuit 126 determines whether a detected face satisfies the predetermined first relationship with a face that was the target of the previous recognition by the face recognition circuit 122. If the predetermined first relationship is satisfied, the face detection circuit 126 recognizes the detected face as the individual recognized by the previous face recognition processing of the face recognition circuit 122.

This makes it possible to execute subject recognition processing at high speed while suppressing an increase in the processing load of the imaging apparatus. That is, even a small apparatus with relatively limited computing resources, such as the digital camera 100, can recognize the individuals of detected faces at a practical frequency.

[Other Embodiments]
To realize the functions of the embodiments described above, a recording medium on which the program code of software embodying those functions is recorded may be supplied to a system or an apparatus. The functions of the embodiments are then realized when the computer (or CPU or MPU) of the system or apparatus reads and executes the program code stored on the recording medium. In this case, the program code read from the recording medium itself realizes the functions of the embodiments, and the recording medium on which the program code is recorded constitutes the present invention. As a recording medium for supplying such program code, a floppy (registered trademark) disk, a hard disk, an optical disk, a magneto-optical disk, or the like can be used, for example. Alternatively, a CD-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

The configuration for realizing the functions of the embodiments described above is not limited to a computer executing the program code it has read. It also includes the case where an OS (operating system) or the like running on the computer performs part or all of the actual processing based on the instructions of the program code, and the functions of the embodiments are realized by that processing.

Furthermore, the program code read from the recording medium may be written into a memory provided on a function expansion board inserted into the computer or in a function expansion unit connected to the computer. The configuration also includes the case where a CPU or the like provided on the function expansion board or in the function expansion unit then performs part or all of the actual processing based on the instructions of the program code, and the functions of the embodiments are realized by that processing.

FIG. 1 is a block diagram showing the functional configuration of the digital camera according to the first embodiment.
FIG. 2 is a flowchart showing the flow of shooting processing using the digital camera according to the first embodiment.
FIG. 3 is a flowchart showing the detailed flow of the comparison processing in S204 of FIG. 2.
FIG. 4 is a timing chart showing the temporal relationship among A. the face detection processing in S201, B. the face recognition processing in S203, and C. the comparison processing in S204.
FIG. 5 is a diagram showing the faces detected in the captured images at times t = n−k and t = n.

Claims

An imaging apparatus comprising:
imaging means for capturing an optical image of a subject to acquire a captured image;
detecting means for detecting a face of a person in the captured image;
main recognition means for recognizing an individual for the face of the person detected by the detecting means, using feature information on facial features of persons acquired from a storage unit, over a longer processing time than the detection of the face of the person by the detecting means;
first assigning means for assigning identification information to each face of a person detected in a first captured image;
second assigning means for, in two captured images acquired one after the other in time, assigning, to a face of a person that is among the faces of persons detected in the later-acquired captured image and that satisfies at least one of a difference in position being equal to or less than a threshold and a difference in size being equal to or less than a threshold with respect to any face of a person to which identification information has been assigned in the earlier-acquired captured image, the identification information assigned to that face; and
auxiliary recognition means for, while the main recognition means is processing the faces of persons detected in a second captured image acquired after the main recognition means has completed processing of the faces of persons detected in the first captured image, recognizing a face of a person that is detected in a third captured image acquired after the second captured image and that, as a result of the second assigning means repeatedly assigning identification information over captured images acquired successively in time, has been assigned the same identification information as an individual recognized by the main recognition means in the first captured image, as the individual recognized by the main recognition means in the first captured image.

The imaging apparatus according to claim 1, wherein the second assigning means decreases the threshold as the number of faces of persons detected in the earlier-acquired captured image increases.

The imaging apparatus according to claim 1, wherein the second assigning means decreases the threshold as the number of faces of persons detected in the later-acquired captured image increases.

The imaging apparatus according to any one of claims 1 to 3, further comprising prohibition means for prohibiting execution of recognition processing by the main recognition means when the auxiliary recognition means has recognized an individual for every face of a person detected in the third captured image.

The imaging apparatus according to any one of claims 1 to 4, further comprising setting means for setting, based on the individual recognized by the auxiliary recognition means in the third captured image, a shooting condition used when the imaging means acquires a captured image.

An imaging method comprising:
an imaging step in which imaging means captures an optical image of a subject to acquire a captured image;
a detecting step in which detecting means detects a face of a person in the captured image;
a main recognition step in which main recognition means recognizes an individual for the face of the person detected in the detecting step, using feature information on facial features of persons acquired from a storage unit, over a longer processing time than the detecting step;
a first assigning step in which first assigning means assigns identification information to each face of a person detected in a first captured image;
a second assigning step in which second assigning means, in two captured images acquired one after the other in time, assigns, to a face of a person that is among the faces of persons detected in the later-acquired captured image and that satisfies at least one of a difference in position being equal to or less than a threshold and a difference in size being equal to or less than a threshold with respect to any face of a person to which identification information has been assigned in the earlier-acquired captured image, the identification information assigned to that face; and
an auxiliary recognition step in which auxiliary recognition means, while the main recognition step is being performed on the faces of persons detected in a second captured image acquired after completion of the main recognition step on the faces of persons detected in the first captured image, recognizes a face of a person that is detected in a third captured image acquired after the second captured image and that, as a result of the second assigning means repeatedly assigning identification information over captured images acquired successively in time, has been assigned the same identification information as an individual recognized in the main recognition step in the first captured image, as the individual recognized in the main recognition step in the first captured image.

A program for causing a computer to function as:
imaging means for capturing an optical image of a subject to acquire a captured image;
detecting means for detecting a face of a person in the captured image;
main recognition means for recognizing an individual for the face of the person detected by the detecting means, using feature information on facial features of persons acquired from a storage unit, over a longer processing time than the detection of the face of the person by the detecting means;
first assigning means for assigning identification information to each face of a person detected in a first captured image;
second assigning means for, in two captured images acquired one after the other in time, assigning, to a face of a person that is among the faces of persons detected in the later-acquired captured image and that satisfies at least one of a difference in position being equal to or less than a threshold and a difference in size being equal to or less than a threshold with respect to any face of a person to which identification information has been assigned in the earlier-acquired captured image, the identification information assigned to that face; and
auxiliary recognition means for, while the main recognition means is processing the faces of persons detected in a second captured image acquired after the main recognition means has completed processing of the faces of persons detected in the first captured image, recognizing a face of a person that is detected in a third captured image acquired after the second captured image and that, as a result of the second assigning means repeatedly assigning identification information over captured images acquired successively in time, has been assigned the same identification information as an individual recognized by the main recognition means in the first captured image, as the individual recognized by the main recognition means in the first captured image.