JP5004876B2

JP5004876B2 - Imaging device

Info

Publication number: JP5004876B2
Application number: JP2008145845A
Authority: JP
Inventors: 匠上原; 収一加藤; 啓太園田; 雄一中瀬
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2008-06-03
Filing date: 2008-06-03
Publication date: 2012-08-22
Anticipated expiration: 2028-06-03
Also published as: JP2009296142A

Description

本発明は、入射した光を電気信号に変換する撮像素子の出力をデジタル値に変換して画像データを得る撮像装置に関する。特に、被写体の顔を検出する機能を備える撮像装置に関する。 The present invention relates to an imaging apparatus that obtains image data by converting an output of an imaging device that converts incident light into an electrical signal into a digital value. In particular, the present invention relates to an imaging apparatus having a function of detecting the face of a subject.

従来、人物撮影を行う場合において、主被写体である人物とその背景のコントラストの関係から焦点が人物に合わずに、背景に合ってしまうという問題があった。このような問題を解決するために、画面内の顔を検出し、検出した顔の位置に合焦させることで、人物に焦点を合わせる撮像装置が開発されている（特許文献１参照）。 Conventionally, when taking a picture of a person, there is a problem that the focus is not on the person but on the background because of the contrast between the person who is the main subject and the background. In order to solve such a problem, an imaging apparatus that focuses on a person by detecting a face in the screen and focusing on the position of the detected face has been developed (see Patent Document 1).

しかし特許文献１によると、撮影画面内に人物がいるかいないかに関わらず顔検出処理を実行するため、撮影に時間がかかるという問題があった。 However, according to Patent Document 1, there is a problem that it takes time to shoot because face detection processing is executed regardless of whether or not a person is present in the shooting screen.

この問題を解決するために、撮像装置に設けられているマイクで撮影者の音声を検出すると、顔検出を実行する撮像装置が提案されている（特許文献２参照）。 In order to solve this problem, there has been proposed an imaging device that performs face detection when a photographer's voice is detected by a microphone provided in the imaging device (see Patent Document 2).

特許文献２によると、撮像装置が風景撮影モードのときは顔検出処理を実行せず、人物撮影モードのときは顔検出処理を実行し、どちらのモードでもないときは撮影者の音声を検出したときだけ顔検出処理を実行する。 According to Patent Document 2, face detection processing is not executed when the imaging apparatus is in landscape shooting mode, face detection processing is executed when in the person shooting mode, and the voice of the photographer is detected when the mode is not in either mode. Only when the face detection process is executed.

尚、画像データからの顔検出については、非特許文献１、２に記載されたものが知られている。更に、特許文献３〜６に記載されている手法で目を検出することにより、顔の位置や大きさを推定することもできる。
特開２００１−２１５４０３号公報特開２００５−３１８０８４号公報特開平３−１７６９６号公報特開平４−２５５０１５号公報特開平５−３００６０１号公報特開平９−２５１３４２号公報テレビジョン学会誌Ｖｏｌ．４９，Ｎｏ．６，ｐｐ．７８７−７９７（１９９５）、「顔領域抽出に有効な修正ＨＳＶ表色系の提案」電子情報通信学会誌Ｖｏｌ．７４−Ｄ−ＩＩ，Ｎｏ．１１，ｐｐ．１６２５−１６２７（１９９１）、「静止濃淡情景画像から顔領域を抽出する手法」 As for face detection from image data, those described in Non-Patent Documents 1 and 2 are known. Further, the position and size of the face can be estimated by detecting eyes by the methods described in Patent Documents 3 to 6.
JP 2001-215403 A JP 2005-318084 A Japanese Patent Laid-Open No. 3-17696 JP-A-4-255015 JP-A-5-300601 Japanese Patent Laid-Open No. 9-251342 Television Society Journal Vol. 49, no. 6, pp. 787-797 (1995), “Proposal of Modified HSV Color System Effective for Face Area Extraction” The Institute of Electronics, Information and Communication Engineers Vol. 74-D-II, no. 11, pp. 1625-1627 (1991), “Method for extracting a face region from a still gray scene image”

しかしながら、上記従来の撮像装置では、確実に人物撮影である場合の顔検出処理と、人物撮影であるかどうか撮像装置が判定できない場合の顔検出処理に違いが無く、状況に応じて、顔検出処理が最適化されていなかった。 However, in the above conventional imaging device, there is no difference between the face detection processing in the case of reliably taking a person and the face detection processing in the case where the imaging device cannot determine whether or not the person has been taken. Processing was not optimized.

本発明の目的は、顔検出処理の精度を向上させることができる撮像装置を提供することにある。 The objective of this invention is providing the imaging device which can improve the precision of a face detection process.

上記目的を達成するために、本発明による撮像装置は、被写体像を光電変換することにより画像データを取得する撮像手段と、前記撮像手段によって得られた撮影画像から、顔を検出するための評価値を演算し、前記評価値をしきい値と比較して顔判定を行い、評価値がしきい値より大きければ顔であると判定する顔検出手段と、音声を検出する音声検出手段とを備え、前記顔検出手段は、前記音声検出手段の検出結果に応じて、前記しきい値を変更することを特徴とする。 In order to achieve the above object, an imaging apparatus according to the present invention includes an imaging unit that acquires image data by photoelectrically converting a subject image, and an evaluation for detecting a face from the captured image obtained by the imaging unit. A face detection unit that calculates a value, compares the evaluation value with a threshold value to perform face determination, and determines that the face is a face if the evaluation value is greater than the threshold value; and a voice detection unit that detects voice The face detection means changes the threshold value according to a detection result of the voice detection means.

本発明の撮像装置によれば、顔検出処理の精度を向上させることができる。 According to the imaging apparatus of the present invention, it is possible to improve the accuracy of face detection processing.

以下、本発明の実施の形態を図面を参照しながら詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明の実施の形態に係る撮像装置としてのデジタルカメラの外観斜視図である。 FIG. 1 is an external perspective view of a digital camera as an imaging device according to an embodiment of the present invention.

装置本体１は、光学ファインダ２、電源スイッチ（ボタン）３、静止画または動画を撮影する際に押下するレリーズスイッチ４、撮影の画角を変更するためのズームレバー５、モード切替スイッチ６を備える。 The apparatus main body 1 includes an optical viewfinder 2, a power switch (button) 3, a release switch 4 that is pressed when shooting a still image or a moving image, a zoom lever 5 for changing a shooting angle of view, and a mode switch 6. .

モード切替スイッチ６は、装置本体１における各種モードを切り替える。より具体的には、装置本体１の背面に印刷されたアイコンマーク１ａにモード切替スイッチ６を合わせると、静止画記録モードへの切り替えが可能である。また、アイコンマーク１ｂにモード切替スイッチ６を合わせると、動画記録モードへの切り替えが可能である。また、アイコンマーク１ｃにモード切替スイッチ６を合わせると、再生モードにモードの切り替えが可能である。 The mode switch 6 switches various modes in the apparatus main body 1. More specifically, when the mode switch 6 is set to the icon mark 1a printed on the back surface of the apparatus main body 1, it is possible to switch to the still image recording mode. Further, when the mode switch 6 is set to the icon mark 1b, it is possible to switch to the moving image recording mode. In addition, when the mode switch 6 is set to the icon mark 1c, the mode can be switched to the reproduction mode.

液晶パネル７は、装置本体１の背面に備えられた表示手段であり、撮影レンズを介して撮像素子の受光面に結像した撮影前の被写体像をスルー画像として表示し、あるいは、撮影後記録された画像を再生して表示する。 The liquid crystal panel 7 is a display unit provided on the back surface of the apparatus main body 1 and displays a subject image before photographing formed on the light receiving surface of the image sensor through a photographing lens as a through image or recording after photographing. Play and display the recorded image.

操作部８は、操作者が各種操作を行う操作スイッチであり、具体的には、液晶パネル７上の表示を切り替える表示スイッチや、メニュースイッチ、印刷スイッチ、ＳＥＴスイッチである。 The operation unit 8 is an operation switch for an operator to perform various operations. Specifically, the operation unit 8 is a display switch for switching a display on the liquid crystal panel 7, a menu switch, a print switch, or a SET switch.

十字スイッチ９は、十字に配置された４方向スイッチ（上スイッチ、下スイッチ、右スイッチ、左スイッチ）である。 The cross switch 9 is a four-way switch (upper switch, lower switch, right switch, left switch) arranged in a cross.

図２は、図１のデジタルカメラのブロック図である。 FIG. 2 is a block diagram of the digital camera of FIG.

以下、その構成を動作（機能）と併せて説明する。 Hereinafter, the configuration will be described together with the operation (function).

図２において、バリア１０１は、装置本体１の、撮影レンズ１０２を含む撮像系を覆うことにより、撮像系の汚れや破損を防止する。撮影レンズ１０２、絞り機能を備えるシャッター１０３、光学像を電気信号に変換（光電変換）するＣＣＤやＣＭＯＳ素子等で構成される撮像部（撮像素子）１０４がある。 In FIG. 2, the barrier 101 covers the imaging system including the photographing lens 102 of the apparatus main body 1, thereby preventing the imaging system from becoming dirty or damaged. There is a photographing lens 102, a shutter 103 having a diaphragm function, and an imaging unit (imaging device) 104 including a CCD, a CMOS device, or the like that converts an optical image into an electrical signal (photoelectric conversion).

Ａ／Ｄ変換器１０５は、アナログ信号をデジタル信号に変換する。Ａ／Ｄ変換器１０５は、撮像部１０４から出力されるアナログ信号をデジタル信号に変換する場合や、音声制御部１０６から出力されるアナログ信号をデジタル信号に変換する場合に用いられる。 The A / D converter 105 converts an analog signal into a digital signal. The A / D converter 105 is used when an analog signal output from the imaging unit 104 is converted into a digital signal, or when an analog signal output from the audio control unit 106 is converted into a digital signal.

タイミング発生部１０７は、撮像部１０４、Ａ／Ｄ変換器１０５、音声制御部１０６、Ｄ／Ａ変換器１０８にクロック信号や制御信号を供給する。タイミング発生部１０７は、メモリ制御部１０９及びシステム制御部１１０により制御される。 The timing generation unit 107 supplies a clock signal and a control signal to the imaging unit 104, the A / D converter 105, the audio control unit 106, and the D / A converter 108. The timing generation unit 107 is controlled by the memory control unit 109 and the system control unit 110.

画像処理部１１１は、Ａ／Ｄ変換器１０５からのデータ、または、メモリ制御部１０９からのデータに対し所定の画素補間、縮小といったリサイズ処理や色変換処理を行う。 The image processing unit 111 performs resizing processing and color conversion processing such as predetermined pixel interpolation and reduction on the data from the A / D converter 105 or the data from the memory control unit 109.

また、画像処理部１１１では、撮影した画像データを用いて所定の演算処理が行われ、得られた演算結果に基づいてシステム制御部１１０が露光制御、測距制御を行う。これにより、ＴＴＬ（スルー・ザ・レンズ）方式のＡＦ（オートフォーカス）処理、ＡＥ（自動露出）処理、ＥＦ（フラッシュプリ発光）処理が行われる。 The image processing unit 111 performs predetermined calculation processing using the captured image data, and the system control unit 110 performs exposure control and distance measurement control based on the obtained calculation result. Thereby, AF (autofocus) processing, AE (automatic exposure) processing, and EF (flash pre-emission) processing of the TTL (through-the-lens) method are performed.

システム制御部１１０は、撮像手段としての撮像部１０４によって得られた撮影画像から、顔を検出するための評価値を演算し、評価値をしきい値と比較して顔判定を行い、評価値がしきい値より大きければ顔であると判定する顔検出手段として機能する。その詳細については、後述する図３のステップＳ３０１で説明する。 The system control unit 110 calculates an evaluation value for detecting a face from the captured image obtained by the imaging unit 104 as an imaging unit, compares the evaluation value with a threshold value, performs face determination, and evaluates the evaluation value. Functions as face detection means for determining that the face is larger than the threshold. Details thereof will be described in step S301 of FIG.

画像処理部１１１では更に、撮影した画像データを用いて所定の演算処理を行い、得られた演算結果に基づいてＴＴＬ方式のＡＷＢ（オートホワイトバランス）処理も行っている。 The image processing unit 111 further performs predetermined calculation processing using the captured image data, and also performs TTL AWB (auto white balance) processing based on the obtained calculation result.

Ａ／Ｄ変換器１０５からの出力データは、画像処理部１１１及びメモリ制御部１０９を介して、あるいは、直接メモリ制御部１０９を介して、メモリ１１２に書き込まれる。メモリ１１２は、撮像部１０４によって得られ、Ａ／Ｄ変換器１０５によりデジタルデータに変換された画像データや、液晶パネル７を含む画像表示部２３に表示するための画像データを格納する。 Output data from the A / D converter 105 is written into the memory 112 via the image processing unit 111 and the memory control unit 109 or directly via the memory control unit 109. The memory 112 stores image data obtained by the imaging unit 104 and converted into digital data by the A / D converter 105 and image data to be displayed on the image display unit 23 including the liquid crystal panel 7.

尚、メモリ１１２は、マイク２１（２１ａ、２１ｂ）において録音された音声データ、静止画像、動画像及び画像ファイルを構成する場合のファイルヘッダを格納するのにも用いられる。従って、メモリ１１２は、所定枚数の静止画像や所定時間の動画像及び音声を格納するのに十分な記憶容量を備えている。 Note that the memory 112 is also used for storing audio data recorded in the microphone 21 (21a, 21b), a still image, a moving image, and a file header when configuring an image file. Therefore, the memory 112 has a storage capacity sufficient to store a predetermined number of still images, a moving image and sound for a predetermined time.

圧縮／伸張部１１３は、適応離散コサイン変換（ＡＤＣＴ）等により画像データを圧縮、伸張する。圧縮／伸張部１１３は、シャッター１０３をトリガにしてメモリ１１２に格納された撮影画像を読み込んで圧縮処理を行い、処理を終えたデータをメモリ１１２に書き込む。 The compression / decompression unit 113 compresses and decompresses image data by adaptive discrete cosine transform (ADCT) or the like. The compression / decompression unit 113 reads a captured image stored in the memory 112 using the shutter 103 as a trigger, performs compression processing, and writes the processed data in the memory 112.

また、圧縮／伸張部１１３は、記録媒体２００の記録部２０１等からメモリ１１２に読み込まれた圧縮画像に対して伸張処理を行い、処理を終えたデータをメモリ１１２に書き込む。 The compression / decompression unit 113 performs decompression processing on the compressed image read into the memory 112 from the recording unit 201 of the recording medium 200 and writes the processed data to the memory 112.

圧縮／伸張部１１３によりメモリ１１２に書き込まれた画像データは、システム制御部１１０のファイル部においてファイル化される。そして、インターフェース（Ｉ／Ｆ）１１４、コネクタ１１５、記録媒体２００側のコネクタ２０３、インターフェース（Ｉ／Ｆ）２０２を介して、記録部２０１に記録される。また、メモリ１１２は、画像表示用のメモリ（ビデオメモリ）を兼ねている。 The image data written to the memory 112 by the compression / decompression unit 113 is filed in the file unit of the system control unit 110. Then, the data is recorded in the recording unit 201 via the interface (I / F) 114, the connector 115, the connector 203 on the recording medium 200 side, and the interface (I / F) 202. The memory 112 also serves as an image display memory (video memory).

Ｄ／Ａ変換器１０８は、メモリ１１２に格納されている画像表示用のデータをアナログ信号に変換して画像表示部２３に供給する。画像表示部２３は、液晶パネル７等の表示器上に、メモリ１１２に書き込まれた表示用の画像データをＤ／Ａ変換器１０８を介してアナログ信号に変換して表示を行う。 The D / A converter 108 converts the image display data stored in the memory 112 into an analog signal and supplies the analog signal to the image display unit 23. The image display unit 23 converts the display image data written in the memory 112 into an analog signal via the D / A converter 108 on the display device such as the liquid crystal panel 7 and displays it.

マイク２１から出力された音声信号は、アンプ等で構成される音声制御部１０６を介してＡ／Ｄ変換器１０５に供給され、Ａ／Ｄ変換器１０５においてデジタル信号に変換された後、メモリ制御部１０９によってメモリ１１２に格納される。 The audio signal output from the microphone 21 is supplied to the A / D converter 105 via the audio control unit 106 configured by an amplifier or the like, converted into a digital signal by the A / D converter 105, and then subjected to memory control. The data is stored in the memory 112 by the unit 109.

一方、記録媒体２００に記録されている音声データは、メモリ１１２に読み込まれた後、Ｄ／Ａ変換器１０８によりアナログ信号に変換される。音声制御部１０６は、このアナログ信号によりスピーカ２２を駆動し、音声出力する。 On the other hand, the audio data recorded on the recording medium 200 is read into the memory 112 and then converted into an analog signal by the D / A converter 108. The voice control unit 106 drives the speaker 22 with this analog signal and outputs a voice.

不揮発性メモリ１１６は、電気的に消去・記録可能なメモリであり、例えばＥＥＰＲＯＭ等が用いられる。不揮発性メモリ１１６には、システム制御部１１０の動作用の定数、プログラム等が記憶（記録）される。ここでいう、プログラムとは、本実施の形態にて後述する各種フローチャートを実行するためのプログラムのことである。 The nonvolatile memory 116 is an electrically erasable / recordable memory, and for example, an EEPROM or the like is used. The nonvolatile memory 116 stores (records) constants, programs, and the like for operation of the system control unit 110. Here, the program is a program for executing various flowcharts described later in the present embodiment.

システム制御部１１０は、不揮発性メモリ１１６に記憶されたプログラムを実行することで、後述する本実施の形態の各処理を実現する。システムメモリ１１７は、ＲＡＭが用いられる。システムメモリ１１７には、システム制御部１１０の動作用の定数、変数、不揮発性メモリ１１６から読み出したプログラム等を展開（記憶）する。 The system control unit 110 executes programs stored in the nonvolatile memory 116, thereby realizing each process of the present embodiment described later. The system memory 117 is a RAM. In the system memory 117, constants and variables for operation of the system control unit 110, programs read from the nonvolatile memory 116, and the like are expanded (stored).

ズームレバー５、モード切替スイッチ６、第１シャッタースイッチ５１、第２シャッタースイッチ５２、操作部８及び十字スイッチ９はシステム制御部１１０に各種の動作指示を入力するための操作手段である。 The zoom lever 5, the mode switch 6, the first shutter switch 51, the second shutter switch 52, the operation unit 8 and the cross switch 9 are operation means for inputting various operation instructions to the system control unit 110.

モード切替スイッチ６は、システム制御部１１０の動作モードを静止画記録モード、動画記録モード、再生モード等のいずれかに切り替えることができる。第１シャッタースイッチ５１は、装置本体１に設けられたレリーズスイッチ４の操作途中（半押し）でオンとなり第１シャッタースイッチ信号ＳＷ１を発生する。 The mode switch 6 can switch the operation mode of the system control unit 110 to any one of a still image recording mode, a moving image recording mode, a reproduction mode, and the like. The first shutter switch 51 is turned on during the halfway operation of the release switch 4 provided in the apparatus main body 1 and generates a first shutter switch signal SW1.

システム制御部１１０は、第１シャッタースイッチ信号ＳＷ１により、ＡＦ処理、ＡＥ処理、ＡＷＢ処理、ＥＦ処理等の動作を開始する。 The system control unit 110 starts operations such as AF processing, AE processing, AWB processing, and EF processing in response to the first shutter switch signal SW1.

第２シャッタースイッチ５２は、レリーズスイッチ４の操作完了（全押し）でオンとなり、第２シャッタースイッチ信号ＳＷ２を発生する。システム制御部１１０は、第２シャッタースイッチ信号ＳＷ２により、撮像部１０４からの信号読み出しから記録媒体２００に画像データを書き込むまでの一連の撮影処理の動作を開始する。 The second shutter switch 52 is turned on when the operation of the release switch 4 is completed (fully pressed), and generates a second shutter switch signal SW2. In response to the second shutter switch signal SW2, the system control unit 110 starts a series of shooting processing operations from reading a signal from the imaging unit 104 to writing image data on the recording medium 200.

操作部８の各操作部材は、画像表示部２３に表示される種々の機能アイコンを選択操作すること等により、場面毎に適宜機能が割り当てられ、各種機能スイッチとして作用する。機能スイッチとしては、例えば、終了スイッチ、戻るスイッチ、画像送りスイッチ、ジャンプスイッチ、絞込みスイッチ、属性変更スイッチ等がある。 Each operation member of the operation unit 8 is appropriately assigned a function for each scene by selecting and operating various function icons displayed on the image display unit 23, and functions as various function switches. Examples of the function switch include an end switch, a return switch, an image feed switch, a jump switch, a narrowing switch, and an attribute change switch.

例えば、メニュースイッチが押されると各種設定が可能なメニュー画面が画像表示部２３に表示される。操作者は、画像表示部２３に表示されたメニュー画面と、十字スイッチ９やＳＥＴスイッチとを用いて直感的に各種設定を行うことができる。電源スイッチ３は、電源オン、電源オフを切り替える。 For example, when the menu switch is pressed, a menu screen on which various settings can be made is displayed on the image display unit 23. The operator can make various settings intuitively using the menu screen displayed on the image display unit 23 and the cross switch 9 or the SET switch. The power switch 3 switches between power on and power off.

電源制御部１１８は、電池検出回路、ＤＣ−ＤＣコンバータ、通電するブロックを切り替えるスイッチ回路等により構成され、電池の装着の有無、電池の種類、電池残量の検出を行う。また、電源制御部１１８は、その検出結果及びシステム制御部１１０の指示に基づいてＤＣ−ＤＣコンバータを制御し、必要な電圧を必要な期間、記録媒体２００を含む各部へ供給する。 The power supply control unit 118 includes a battery detection circuit, a DC-DC converter, a switch circuit that switches a block to be energized, and the like, and detects whether or not a battery is attached, the type of battery, and the remaining battery level. Further, the power supply control unit 118 controls the DC-DC converter based on the detection result and an instruction from the system control unit 110, and supplies a necessary voltage to each unit including the recording medium 200 for a necessary period.

電源部１１９は、アルカリ電池やリチウム電池等の一次電池やＮｉＣｄ電池やＮｉＭＨ電池、Ｌｉ電池等の二次電池、ＡＣアダプター等からなる。コネクタ５４及び５５は電源部１１９と電源制御部１１８とを接続する。 The power supply unit 119 includes a primary battery such as an alkaline battery or a lithium battery, a secondary battery such as a NiCd battery, a NiMH battery, or a Li battery, an AC adapter, or the like. Connectors 54 and 55 connect the power supply unit 119 and the power supply control unit 118.

ＲＴＣ（ＲｅａｌＴｉｍｅＣｌｏｃｋ）１２０は、日付及び時刻を計時する。ＲＴＣ１２０は、電源制御部１１８とは別に内部に電源部を保持しており、電源部１１９が落ちた状態であっても、計時状態を続ける。システム制御部１１０は、起動時にＲＴＣ１２０より取得した日時を用いてシステムタイマを設定し、タイマ制御を実行する。 An RTC (Real Time Clock) 120 measures the date and time. The RTC 120 holds a power supply unit therein separately from the power supply control unit 118 and keeps counting time even when the power supply unit 119 is turned off. The system control unit 110 sets a system timer using the date and time acquired from the RTC 120 at the time of activation, and executes timer control.

インターフェース１１４は、メモリカードやハードディスク等の記録媒体２００またはチューナーカードと、装置本体１とのインターフェースを司る。コネクタ１１５は、記録媒体２００やチューナーカードとインターフェース１１４との接続を行う。記録媒体着脱検出部１２１は、コネクタ１１５に記録媒体２００やチューナーカードが装着されているか否かを検出する。 The interface 114 serves as an interface between the recording medium 200 such as a memory card or a hard disk or a tuner card and the apparatus main body 1. The connector 115 connects the recording medium 200 and the tuner card to the interface 114. The recording medium attachment / detachment detection unit 121 detects whether the recording medium 200 or a tuner card is attached to the connector 115.

記録媒体２００は、図２においてはメモリカードやハードディスク等である。記録媒体２００は、半導体メモリや磁気ディスク等から構成される記録部２０１、装置本体１とのインターフェース２０２、及び、記録媒体２００と装置本体１とを接続するためのコネクタ２０３を備えている。 The recording medium 200 is a memory card or a hard disk in FIG. The recording medium 200 includes a recording unit 201 composed of a semiconductor memory, a magnetic disk, or the like, an interface 202 with the apparatus main body 1, and a connector 203 for connecting the recording medium 200 and the apparatus main body 1.

また、コネクタ１１５、２０３はＳＤＩ／Ｏカードの拡張規格に準拠しており、先述の記録媒体の他、ＳＤＩ／Ｏカードの拡張規格に準拠したチューナーカードが着脱可能となっている。 The connectors 115 and 203 conform to the SDI / O card expansion standard, and a tuner card conforming to the SDI / O card expansion standard can be attached and detached in addition to the recording medium described above.

通信部１２２は、ＲＳ２３２ＣやＵＳＢ、ＩＥＥＥ１３９４、Ｐ１２８４、ＳＣＳＩ、モデム、ＬＡＮ、無線通信等の各種通信処理を行う。コネクタ（無線通信の場合はアンテナ）１２３は、通信部１２２を介して装置本体１を他の機器と接続する。 The communication unit 122 performs various communication processes such as RS232C, USB, IEEE1394, P1284, SCSI, modem, LAN, and wireless communication. A connector (antenna in the case of wireless communication) 123 connects the apparatus main body 1 to another device via the communication unit 122.

図３は、図２のデジタルカメラによって実行される撮影（撮像）処理の手順を示すフローチャートである。 FIG. 3 is a flowchart showing a procedure of photographing (imaging) processing executed by the digital camera of FIG.

図３に示される処理は、システム制御部１１０により実行される。例えば、システム制御部１１０は不図示のＣＰＵを備え、例えば、システムメモリ１１７に格納された制御プログラムを実行することにより図３に示される処理を実現する。 The process shown in FIG. 3 is executed by the system control unit 110. For example, the system control unit 110 includes a CPU (not shown), and implements the processing shown in FIG. 3 by executing a control program stored in the system memory 117, for example.

図３において撮影動作が開始されると、ステップＳ３０１において、システム制御部１１０は、スルー表示される画像信号中に人の顔が存在するか否かを検出する顔検出処理を行う。この顔検出処理については図４を用いて後述する。 When the shooting operation is started in FIG. 3, in step S <b> 301, the system control unit 110 performs face detection processing for detecting whether or not a human face exists in the through-displayed image signal. This face detection process will be described later with reference to FIG.

システム制御部１１０は、顔検出処理において人の顔が検出された場合、画像信号中において検出した顔の位置座標、サイズ（幅、高さ）、検出個数、信頼性係数等を顔情報としてシステムメモリ１１７に記憶する。顔検出処理において顔が検出されなかった場合は、システムメモリ１１７内の位置座標、サイズ（幅、高さ）、検出個数、信頼性係数等の領域に０を設定する。 When a human face is detected in the face detection process, the system control unit 110 uses the face position coordinates, size (width, height), number of detections, reliability coefficient, and the like detected in the image signal as face information. Store in the memory 117. If no face is detected in the face detection process, 0 is set in the area such as position coordinates, size (width, height), number of detections, reliability coefficient, etc. in the system memory 117.

続いてステップＳ３０２において、第１シャッタースイッチ信号ＳＷ１がＯＮされたか否か判定される。第１シャッタースイッチ信号ＳＷ１がＯＦＦであれば、再度ステップＳ３０１の顔検出処理が実行され、ＯＮであれば、次のステップＳ３０３に進む。 Subsequently, in step S302, it is determined whether or not the first shutter switch signal SW1 is turned on. If the first shutter switch signal SW1 is OFF, the face detection process in step S301 is executed again, and if it is ON, the process proceeds to the next step S303.

ステップＳ３０３において、システム制御部１１０は、測距処理を行って撮影レンズ１０２の焦点を被写体に合わせるとともに、測光処理を行って絞り値及びシャッター時間（シャッタースピード）を決定する。 In step S303, the system control unit 110 performs a distance measurement process to focus the photographing lens 102 on the subject, and performs a photometry process to determine an aperture value and a shutter time (shutter speed).

尚、測光処理において、必要であればフラッシュの設定も行われる。このとき、ステップＳ３０１において顔が検出されていれば、検出した顔の範囲で測距を行うようにすることも可能である。 In the photometric process, a flash is set if necessary. At this time, if a face is detected in step S301, it is possible to perform distance measurement within the detected face range.

次に、ステップＳ３０４では、第２シャッタースイッチ信号ＳＷ２のＯＮ／ＯＦＦ状態を判定する。第１シャッタースイッチ信号ＳＷ１がＯＮした状態で、第２シャッタースイッチ信号ＳＷ２がＯＮになると、処理はステップＳ３０４からステップＳ３０６へ進む。 Next, in step S304, the ON / OFF state of the second shutter switch signal SW2 is determined. If the second shutter switch signal SW2 is turned on while the first shutter switch signal SW1 is turned on, the process proceeds from step S304 to step S306.

第２シャッタースイッチ信号ＳＷ２がＯＮせずに、更に第１シャッタースイッチ信号ＳＷ１も解除された場合（ステップＳ３０５）、処理はステップＳ３０５からステップＳ３０１へ戻る。 When the second shutter switch signal SW2 is not turned on and the first shutter switch signal SW1 is also canceled (step S305), the process returns from step S305 to step S301.

また、第１シャッタースイッチ信号ＳＷ１がＯＮ、第２シャッタースイッチ信号ＳＷ２がＯＦＦの間は、ステップＳ３０３〜Ｓ３０５の処理が繰り返される。 Further, while the first shutter switch signal SW1 is ON and the second shutter switch signal SW2 is OFF, the processes in steps S303 to S305 are repeated.

第２シャッタースイッチＳＷ２が押されると（第２シャッタースイッチ信号ＳＷ２がＯＮされると）、ステップＳ３０６において、システム制御部１１０は、露光処理や現像処理を含む撮影処理（露光処理）を実行する。 When the second shutter switch SW2 is pressed (when the second shutter switch signal SW2 is turned on), in step S306, the system control unit 110 executes photographing processing (exposure processing) including exposure processing and development processing.

尚、露光処理では、撮像部１０４、Ａ／Ｄ変換器１０５を経て得られた画像データが、画像処理部１１１及びメモリ制御部１０９を介して、或いはＡ／Ｄ変換器１０５から直接メモリ制御部１０９を介して、メモリ１１２に書き込まれる。 In the exposure process, the image data obtained through the image pickup unit 104 and the A / D converter 105 is sent via the image processing unit 111 and the memory control unit 109 or directly from the A / D converter 105. The data is written into the memory 112 via 109.

また、現像処理では、システム制御部１１０が、メモリ制御部１０９そして必要に応じて画像処理部１１１を用いて、メモリ１１２に書き込まれた画像データを読み出して各種処理を行う。 In the development process, the system control unit 110 reads out the image data written in the memory 112 using the memory control unit 109 and, if necessary, the image processing unit 111, and performs various processes.

撮影後、ステップＳ３０７において、システム制御部１１０は、撮影処理で得られた画像データを画像ファイルとして記録媒体２００に対して書き込む記録処理を実行する。 After shooting, in step S307, the system control unit 110 executes a recording process in which the image data obtained by the shooting process is written to the recording medium 200 as an image file.

（第１の実施の形態）
図４は、図３のステップＳ３０１で実行される顔検出処理の第１の実施の形態の手順を示すフローチャートである。 (First embodiment)
FIG. 4 is a flowchart showing the procedure of the first embodiment of the face detection process executed in step S301 of FIG.

顔検出処理がスタートすると、ステップＳ４０１で、画像の顔評価値を算出する。顔評価値とは、画像に含まれる領域の顔らしさを表す数値であり、例えば、パターンマッチング法における顔テンプレートとのマッチング度であり、目・鼻・口等の特徴点のレイアウトから演算される特徴量である。 When the face detection process starts, the face evaluation value of the image is calculated in step S401. The face evaluation value is a numerical value representing the face-likeness of the area included in the image, for example, the degree of matching with the face template in the pattern matching method, and is calculated from the layout of feature points such as eyes, nose, and mouth. It is a feature quantity.

顔テンプレートとのマッチング度を求める場合、画像の内、顔検出処理を行う領域内でエッジ抽出を行い、予め決められた顔テンプレートを、抽出したエッジと比較し、顔テンプレートとの類似度を算出する。 When finding the degree of matching with a face template, edge extraction is performed within the area of the image where face detection processing is performed, a predetermined face template is compared with the extracted edge, and the degree of similarity with the face template is calculated. To do.

顔テンプレートは、顔検出処理を行う領域内で走査され、顔テンプレートとの類似度が対象画素ごとに順次算出される。これらの顔評価値を求める技術は公知であり、例えば、特開平８−６３５９７号公報等に開示されている。 The face template is scanned within a region where face detection processing is performed, and the similarity with the face template is sequentially calculated for each target pixel. Techniques for obtaining these face evaluation values are known and disclosed in, for example, Japanese Patent Laid-Open No. 8-63597.

次に、ステップＳ４０２で、ステップＳ４０１で算出した顔評価値を、予め決められていたしきい値と比較する。 Next, in step S402, the face evaluation value calculated in step S401 is compared with a predetermined threshold value.

顔検出処理を行う画像内に、顔評価値がしきい値以上となる領域が含まれていれば、ステップＳ４０３に進み、対象領域は顔であると判定し、顔検出処理を終了する。ステップＳ４０２で、顔評価値がしきい値以上となる領域が存在しなければ、ステップＳ４０４に進む。 If the image to be subjected to the face detection process includes an area where the face evaluation value is equal to or greater than the threshold value, the process proceeds to step S403, where the target area is determined to be a face, and the face detection process is terminated. If there is no region where the face evaluation value is equal to or greater than the threshold value in step S402, the process proceeds to step S404.

ステップＳ４０４では、装置本体１に備えられたマイク２１を用いて、音声検出が行われる。 In step S <b> 404, voice detection is performed using the microphone 21 provided in the apparatus main body 1.

ステップＳ４０５では、音声の有無が判定され、音声が検出されれば、ステップＳ４０６に進む。 In step S405, the presence / absence of sound is determined. If sound is detected, the process proceeds to step S406.

ステップＳ４０６では、顔評価値と比較するしきい値を下げる。これは音声が検出されていることから、撮影範囲内に人物が存在すると想定されるためである。 In step S406, the threshold value to be compared with the face evaluation value is lowered. This is because it is assumed that a person is present within the shooting range since the sound is detected.

しきい値を変更した後、ステップＳ４０７において、改めてステップＳ４０１で求めた顔評価値としきい値を比較する。ここで顔評価値がしきい値以上であれば、ステップＳ４０３に進み、対象領域は顔であると判定される。 After changing the threshold value, in step S407, the face evaluation value obtained in step S401 is compared with the threshold value. If the face evaluation value is greater than or equal to the threshold value, the process proceeds to step S403, where it is determined that the target area is a face.

一方、ステップＳ４０５で音声が検出されなかった場合、またはステップＳ４０７で顔評価値がしきい値以上となる領域が存在しなかった場合は、顔を検出することなく顔検出処理を終了する。 On the other hand, if no sound is detected in step S405, or if there is no region where the face evaluation value is equal to or greater than the threshold value in step S407, the face detection process is terminated without detecting a face.

以上説明したように、第１の実施の形態では、音声を検出した場合に顔判定の基準となるしきい値を下げる。そのため、例えば、暗がりでの撮影や、横向き、目瞑り等の、通常の顔検出処理では検出できない対象物も検出可能となり、顔検出率の向上に寄与する。 As described above, in the first embodiment, the threshold value used as a criterion for face determination is lowered when voice is detected. For this reason, for example, it is possible to detect an object that cannot be detected by normal face detection processing, such as shooting in the dark, sideways, and eye meditation, which contributes to an improvement in the face detection rate.

尚、第１の実施の形態では、顔評価値がしきい値以上か否かで顔判定の基準としたが、例えば、顔評価値が上限と下限の間の一定範囲内に入っていれば顔であると判定する技術も公知である。その場合、音声を検出した際に顔であると判定する範囲の上限と下限をそれぞれ変更して、範囲を広げることで、顔検出率の向上を図ることができる。 In the first embodiment, the face determination criterion is based on whether or not the face evaluation value is greater than or equal to the threshold value. For example, if the face evaluation value falls within a certain range between the upper limit and the lower limit. A technique for determining a face is also known. In that case, it is possible to improve the face detection rate by changing the upper and lower limits of the range that is determined to be a face when speech is detected to widen the range.

（第２の実施の形態）
第２の実施の形態では、人物の発声音を検出することで、しきい値を下げるものとする。また音源位置を検出して、音源位置を含む一部の領域のみのしきい値を下げるものとする。 (Second Embodiment)
In the second embodiment, it is assumed that the threshold value is lowered by detecting the voice of a person. Further, the sound source position is detected, and the threshold value of only a part of the region including the sound source position is lowered.

図５は、図３のステップＳ３０１で実行される顔検出処理の第２の実施の形態の手順を示すフローチャートである。 FIG. 5 is a flowchart showing the procedure of the second embodiment of the face detection process executed in step S301 of FIG.

顔検出処理がスタートすると、ステップＳ５０１で、音声検出処理を実行し、ステップＳ５０２で音声の有無を判定する。音声ありと判定されれば、次にステップＳ５０３で、音声が人物の発声音であるか否かを判定する。人物の発声音であると判定された場合には、ステップＳ５０４で、撮影画角内であるかどうかを判定する。 When the face detection process starts, a voice detection process is executed in step S501, and the presence or absence of voice is determined in step S502. If it is determined that there is sound, it is then determined in step S503 whether the sound is a person's voice. If it is determined that the sound is a person's voice, it is determined in step S504 whether the sound is within the shooting angle of view.

これら一連の音声・音源判定技術は公知であり、例えば特開平０５−２１５８３３号公報にて開示されている。 A series of these voice / sound source determination techniques are known and disclosed in, for example, Japanese Patent Application Laid-Open No. 05-215833.

図６は、図５のステップＳ５０１で実行される音源方向検出処理に用いられる音源方向検出手段の構成例を示す図である。 FIG. 6 is a diagram illustrating a configuration example of a sound source direction detecting unit used in the sound source direction detecting process executed in step S501 of FIG.

図６において、指向性の高いマイク２１（２１ａ、２１ｂ）の出力信号は、バンドパスフィルタ６０２（６０２ａ、６０２ｂ）によって、特定周波数のみ減衰無く通過する。音圧差検出回路６０３では、各マイク２１ａ、２１ｂが出力した音圧レベルを比較し、音圧レベルの差値がシステム制御部１１０へ出力される。 In FIG. 6, the output signal of the microphone 21 (21a, 21b) having high directivity passes only a specific frequency without attenuation by the band-pass filter 602 (602a, 602b). The sound pressure difference detection circuit 603 compares the sound pressure levels output from the microphones 21 a and 21 b and outputs a difference value between the sound pressure levels to the system control unit 110.

音源６０１がマイク２１の指向特性パターンから離れる程、マイク２１の出力する音圧レベルは下がる。このため、各マイク２１ａ、２１ｂの出力する音圧レベルに差があれば、高いレベルを出力するマイク側に音声信号を発する音源６０１があることが検出でき、両出力レベルの差が小さい程音源が真正面にあることが検出できる。 The sound pressure level output by the microphone 21 decreases as the sound source 601 moves away from the directional characteristic pattern of the microphone 21. For this reason, if there is a difference in the sound pressure levels output from the microphones 21a and 21b, it can be detected that there is a sound source 601 that emits an audio signal on the side of the microphone that outputs a high level. Can be detected to be in front of.

ここで、バンドパスフィルタ６０２が減衰無く通過させる特定帯域の周波数を、例えば、人の発声する周波数帯域である２ＫＨｚ前後とすることで、人の発声音の音源位置を検出することが可能となる。 Here, it is possible to detect the sound source position of a person's uttered sound by setting the frequency of the specific band that the bandpass filter 602 passes without attenuation to, for example, around 2 KHz that is a frequency band uttered by a person. .

また、指向性のマイク２１を、装置本体１の左右に１個ずつ計２個を用いることにより、装置本体１に対して左右方向の音源位置を検出することができる。更に、上下方向において異なる位置に更にもう１つ、マイク２１を備えることにより、装置本体１に対して上下方向についても音源位置を検出することができる。マイク２１の数を増やせば音源位置の検出精度は更に高まる。 Further, by using two directional microphones 21, one for each of the left and right sides of the apparatus main body 1, it is possible to detect the sound source position in the left-right direction relative to the apparatus main body 1. Furthermore, by providing another microphone 21 at a different position in the vertical direction, the position of the sound source can be detected in the vertical direction with respect to the apparatus main body 1. Increasing the number of microphones 21 further increases the accuracy of detecting the sound source position.

音源位置を検出する他の技術として、装置本体１に設けられた複数のマイク２１ａ、２１ｂの出力する音声信号の位相差を利用する技術も知られている。これは、音源６０１から各マイク２１ａ、２１ｂまでの距離に差があると、マイク２１の出力信号に位相差が生じるため、既知のマイク間距離と音速から、音源の方向を演算によって特定する技術である。詳しくは、特開平０７−１４０５２７号公報に開示されている。 As another technique for detecting the position of the sound source, a technique using a phase difference between audio signals output from a plurality of microphones 21a and 21b provided in the apparatus main body 1 is also known. This is because, if there is a difference in the distance from the sound source 601 to each of the microphones 21a and 21b, a phase difference occurs in the output signal of the microphone 21, so that the direction of the sound source is specified by calculation from the known inter-microphone distance and sound speed. It is. Details are disclosed in Japanese Patent Application Laid-Open No. 07-140527.

更に、撮影レンズ１０２の焦点距離によって上記の音源位置検出方法を使い分けても良い。 Furthermore, the above-described sound source position detection method may be properly used depending on the focal length of the photographing lens 102.

音源６０１が装置本体１の正面から横に寄るほど、音源６０１から各マイク２１ａ、２１ｂまでの距離に差が生じるため、位相差が大きくなる。そのため、広角寄りでは位相差方式での音源位置検出が好適である。 The closer the sound source 601 is to the side from the front of the apparatus body 1, the greater the difference in distance from the sound source 601 to each of the microphones 21a and 21b, and the greater the phase difference. Therefore, sound source position detection by the phase difference method is suitable near the wide angle.

一方、音源６０１が装置本体１の正面付近の場合、音声検出範囲の狭い指向性のマイク２１であれば、音源位置のわずかな違いでも音圧差が生じる。そのため、望遠寄りでは指向性のマイク２１による音圧差方式での音源位置検出が好適である。 On the other hand, when the sound source 601 is in the vicinity of the front of the apparatus main body 1, if the microphone 21 has a narrow sound detection range and has a narrow sound detection range, a sound pressure difference occurs even if the sound source position is slightly different. Therefore, sound source position detection by the sound pressure difference method using the directional microphone 21 is suitable near the telephoto position.

図５に戻り、ステップＳ５０４で、音源６０１が画角内であると判定されたら、ステップＳ５０５に進む。ステップＳ５０５では、音源方向を含む撮影範囲の一部領域を設定し、次のステップＳ５０６では、ステップＳ５０５で設定した音源位置を含む領域内のみ顔判定の基準となるしきい値を下げる処理を行う。 Returning to FIG. 5, if it is determined in step S504 that the sound source 601 is within the angle of view, the process proceeds to step S505. In step S505, a partial region of the shooting range including the sound source direction is set, and in the next step S506, processing for lowering a threshold value used as a face determination reference is performed only in the region including the sound source position set in step S505. .

顔検出の基準となるしきい値を下げる処理を行うか、あるいはステップＳ５０２からステップＳ５０４において、音源６０１を検出できない、音声は人物の発声音ではない、音源６０１は撮影画各内ではない等の判定がなされると、ステップＳ５０７に進む。 Processing to lower the threshold value used as a reference for face detection is performed, or in step S502 to step S504, the sound source 601 cannot be detected, the sound is not a person's uttered sound, the sound source 601 is not in each captured image, etc. When the determination is made, the process proceeds to step S507.

ステップＳ５０７では、第１の実施の形態のステップＳ４０１と同様に、撮影領域の顔評価値の算出が行われる。 In step S507, as in step S401 of the first embodiment, the face evaluation value of the shooting area is calculated.

ステップＳ５０８では、算出された顔評価値としきい値の比較が行われる。顔評価値がしきい値以上となる領域があれば、ステップＳ５０９において、顔であると判定された上で顔検出処理は終了し、顔評価値がしきい値以上となる領域が無ければ、顔を検出することなく顔検出処理は終了する。 In step S508, the calculated face evaluation value is compared with a threshold value. If there is an area where the face evaluation value is equal to or greater than the threshold value, the face detection process ends after it is determined in step S509 that the face evaluation value is equal to or greater than the threshold value. The face detection process ends without detecting a face.

以上説明した通り、第２の実施の形態では、音声の種類を人物の発声音か否かで区別する。そのため、撮影画角内に人物が存在する場合のみ、しきい値を下げて顔検出率を向上することが可能となり、人物が存在しない場合には、不必要にしきい値を下げることが無いため、誤検出の増加を抑えることができる。 As described above, in the second embodiment, the type of voice is distinguished based on whether it is a person's voice. Therefore, it is possible to improve the face detection rate by lowering the threshold only when there is a person within the shooting angle of view. If there is no person, the threshold will not be lowered unnecessarily. , Increase in false detection can be suppressed.

また、音源方向を検出し、音源を含む一部領域のみしきい値を下げるため、人物が存在しない領域では誤検出の増加を抑えることができる。 Further, since the sound source direction is detected and the threshold value is lowered only in a partial region including the sound source, it is possible to suppress an increase in false detection in a region where no person exists.

更に、撮影レンズ１０２の焦点距離によって音源方向検出方法を変えるため、音源方向の検出精度をより高めることができる。 Furthermore, since the sound source direction detection method is changed depending on the focal length of the photographing lens 102, the detection accuracy of the sound source direction can be further improved.

本発明の実施の形態に係る撮像装置としてのデジタルカメラの外観斜視図である。1 is an external perspective view of a digital camera as an imaging apparatus according to an embodiment of the present invention. 図１のデジタルカメラのブロック図である。It is a block diagram of the digital camera of FIG. 図２のデジタルカメラによって実行される撮影（撮像）処理の手順を示すフローチャートである。3 is a flowchart illustrating a procedure of photographing (imaging) processing executed by the digital camera of FIG. 2. 図３のステップＳ３０１で実行される顔検出処理の第１の実施の形態の手順を示すフローチャートである。It is a flowchart which shows the procedure of 1st Embodiment of the face detection process performed by FIG.3 S301. 図３のステップＳ３０１で実行される顔検出処理の第２の実施の形態の手順を示すフローチャートである。It is a flowchart which shows the procedure of 2nd Embodiment of the face detection process performed by step S301 of FIG. 図５のステップＳ５０１で実行される音源方向検出処理に用いられる音源方向検出手段の構成例を示す図である。It is a figure which shows the structural example of the sound source direction detection means used for the sound source direction detection process performed by step S501 of FIG.

Explanation of symbols

１装置本体
２１マイク
１０４撮像部
１０６音声制御部
１１０システム制御部
１１１画像処理部
６０３音圧差検出回路 1 Device body 21 Microphone
104 imaging unit 106 audio control unit 110 system control unit 111 image processing unit 603 sound pressure difference detection circuit

Claims

Imaging means for acquiring image data by photoelectrically converting a subject image;
An evaluation value for detecting a face is calculated from the photographed image obtained by the imaging means, and the face is determined by comparing the evaluation value with a threshold value. Face detection means for determining that there is,
Voice detection means for detecting voice,
The imaging apparatus according to claim 1, wherein the face detection unit changes the threshold value in accordance with a detection result of the voice detection unit.

Imaging means for acquiring image data by photoelectrically converting a subject image;
A face detection unit that calculates an evaluation value for detecting a face from the captured image obtained by the imaging unit, and determines that the face is a face if the evaluation value is within a predetermined range;
Voice detection means for detecting voice,
The imaging apparatus according to claim 1, wherein the face detection unit changes the predetermined range according to a detection result of the voice detection unit.

It said speech detection means, the imaging apparatus according to claim 1 or 2, wherein the detecting the human vocal sound as the sound.

It said speech detection means, by detecting the frequency of the frequency band of the utterance of a person, the image pickup apparatus according to claim 3, wherein the detecting the human vocal sound as the sound.

The imaging apparatus according to claim 1, wherein the sound detection unit detects a direction of the sound source from the detection result of the sound.

The imaging apparatus according to claim 5, wherein the sound detection unit detects the direction of the sound source using a phase difference between output signals of a plurality of microphones.

The imaging apparatus according to claim 5, wherein the sound detection unit detects a direction of the sound source using a sound pressure difference between output signals of a plurality of microphones.

The sound detection means detects a direction of the sound source using a phase difference of output signals of a plurality of microphones when a focal length of a photographing lens provided in the apparatus main body is close to a wide angle. 5. The imaging device according to 5 .

The sound detection means detects a direction of the sound source by using a sound pressure difference between output signals of a plurality of microphones when a focal length of a photographing lens provided in the apparatus main body is near telephoto. 5. The imaging device according to 5 .

Said face detecting means, said selected regions comprising a sound source direction detected voice detection means of claim 5 to 9, characterized in that changing the threshold value or the predetermined range in the selected region The imaging device according to any one of the above.

Evaluation value for detecting the face image pickup device according to any one of claims 1 to 10, characterized in that a similarity between the template in the pattern matching method.