JP4894278B2

JP4894278B2 - Camera apparatus and camera control program

Info

Publication number: JP4894278B2
Application number: JP2006026481A
Authority: JP
Inventors: 一記喜多
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2006-02-03
Filing date: 2006-02-03
Publication date: 2012-03-14
Anticipated expiration: 2026-02-03
Also published as: JP2007208757A

Description

本発明は、被写体の計数機能を備えたカメラ装置、及びカメラ装置に被写体の計数機能を付与するカメラ制御プログラムに関する。 The present invention relates to a camera device having a subject counting function, and a camera control program for giving a subject counting function to the camera device.

従来、被写体を認識する機能を備えたカメラ装置が提案されている。この機能を備えたカメラ装置は、主被写体が人物か否かを判定し、人物であると判定されたときと、人物以外であると判定されたときとで、異なる露出設定を行う（特許文献１参照）。
特開平０５−０４１８３０号公報 Conventionally, a camera apparatus having a function of recognizing a subject has been proposed. The camera device having this function determines whether or not the main subject is a person, and performs different exposure settings when it is determined that the main subject is a person and when it is determined that the main subject is not a person (Patent Literature). 1).
JP 05-041830 A

ところで、集合写真を撮影する場合において、全員が集合しているか否かを確認するために、撮影者等が目視により集合者の人数を確認することはよく見られる光景である。このとき、人物であると判定した被写体に基づき人数をカウントする機能をカメラ装置に付加すれば、撮影者等が目視により人数を確認する煩雑な撮影前動作が不要となり、利便性が得られる。しかし、集合写真の撮影場所は一般に観光地等であって、集合者の周囲には無関係が人々が不可避的に存在する。したがって、単に結像した被写体像に基づき人数をカウントする機能を付加しても、集合すべき者の人数等の所望の被写体の数を適正にカウントすることはできない。 By the way, when taking a group photo, it is a common scene that a photographer or the like visually confirms the number of gatherers in order to confirm whether or not everyone is gathering. At this time, if a function for counting the number of persons based on the subject determined to be a person is added to the camera device, a complicated pre-shooting operation in which a photographer or the like visually confirms the number of persons becomes unnecessary, and convenience is obtained. However, a group photo is generally taken at a sightseeing spot, and there are unavoidable people around the gathering. Therefore, the number of desired subjects such as the number of persons to be gathered cannot be properly counted even if a function of counting the number of people based on the formed subject image is added.

本発明は、かかる従来の課題に鑑みてなされたものであり、所望の被写体の数を適正にカウントすることのできるカメラ装置及びカメラ制御プログラムを提供することを目的とする。 The present invention has been made in view of such conventional problems, and an object of the present invention is to provide a camera device and a camera control program capable of appropriately counting the number of desired subjects.

請求項１記載の発明は、  The invention described in claim 1
被写体を結像させる結像手段と、  Imaging means for imaging a subject;
この結像手段により結像される画像を表示する表示手段と、  Display means for displaying an image formed by the imaging means;
前記表示手段に表示された画像内から計数の対照となる複数の被写体を任意に選択する選択手段と、  A selection means for arbitrarily selecting a plurality of subjects to be counted from the image displayed on the display means;
前記表示手段に表示された画像内において、各被写体から抽出された特徴データとの類似度が所定以上である画像部分の個数を被写体毎に計数する計数手段と、  Counting means for counting, for each subject, the number of image portions whose similarity with feature data extracted from each subject is greater than or equal to a predetermined value in the image displayed on the display means;
前記計数手段により各被写体毎に計数された個数を、各被写体毎に区別して前記表示手段に同時に表示させる表示制御手段と、  Display control means for simultaneously displaying the number counted for each subject by the counting means on the display means while distinguishing for each subject;
を備えることを特徴とする。  It is characterized by providing.
請求項２記載の発明は、更に、  The invention according to claim 2 further includes
前記表示制御手段は、前記選択手段により選択された複数の被写体の各々について、各被写体のサンプル画像と各被写体の個数とを対応付けて前記表示手段に同時に表示することを特徴とする。  The display control means displays a sample image of each subject and the number of each subject in association with each other on the display means for each of the plurality of subjects selected by the selection means.
請求項３記載の発明は、更に、  The invention according to claim 3 further includes
前記表示制御手段は、前記表示手段に表示した画像内において、前記計数手段により計数された各被写体に対応する画像部分を識別表示した状態で、各被写体のサンプル画像と各被写体の個数とを対応付けて同時に表示することを特徴とする。  The display control means associates the sample image of each subject with the number of each subject in a state where the image portion corresponding to each subject counted by the counting means is identified and displayed in the image displayed on the display means. It is characterized by being displayed at the same time.
請求項４記載の発明は、更に、  The invention according to claim 4 further includes
前記表示制御手段は、複数の被写体の各々に対応した異なる形状で塗りつぶしを行って、各被写体に対応する画像部分を識別表示することを特徴とする。  The display control means performs painting with different shapes corresponding to each of a plurality of subjects to identify and display an image portion corresponding to each subject.
請求項５記載の発明は、更に、  The invention according to claim 5 further includes:
操作に応じて位置が任意に変位するカーソルを前記表示手段に表示し、このカーソルで指定される２点を対角とする任意の領域を指定する領域指定手段を更に備え、  A cursor whose position is arbitrarily displaced in accordance with the operation is displayed on the display means, and further includes an area designating means for designating an arbitrary area whose diagonals are two points designated by the cursor,
前記計数手段は、前記領域指定手段により指定された領域内において、各被写体から抽出された特徴データとの類似度が所定以上である画像部分の個数を計数することを特徴とする。  The counting means counts the number of image portions whose similarity with the feature data extracted from each subject is greater than or equal to a predetermined value in the area specified by the area specifying means.
請求項６記載の発明は、更に、  The invention described in claim 6 further includes:
操作に応じて位置が任意に変位するカーソルを前記表示手段に表示し、このカーソルで示される任意の位置を指定する位置指定手段を更に備え、  A cursor whose position is arbitrarily displaced according to the operation is displayed on the display means, and further includes a position designation means for designating an arbitrary position indicated by the cursor,
前記選択手段は、前記位置指定手段により指定された位置にある被写体を選択することを特徴とする。  The selection means selects a subject at a position designated by the position designation means.
請求項７記載の発明は、更に、  The invention according to claim 7 further includes
前記位置指定手段は、操作に応じて位置と大きさが任意に変位するカーソルを前記表示手段に表示し、このカーソルで示される任意の位置と大きさを指定し、  The position specifying means displays a cursor whose position and size are arbitrarily displaced according to an operation on the display means, and specifies an arbitrary position and size indicated by the cursor,
前記選択手段は、前記位置指定手段により指定された位置と大きさに対応する領域内から被写体を選択することを特徴とする。  The selection unit selects a subject from an area corresponding to the position and size designated by the position designation unit.
請求項８記載の発明は、更に、  The invention described in claim 8 further includes:
前記計数手段により計数された前記画像部分の個数に基づき、当該カメラ装置の撮影動作を制御する撮影制御手段を更に備えることを特徴とする。  The image pickup apparatus further includes shooting control means for controlling a shooting operation of the camera device based on the number of the image portions counted by the counting means.
請求項９記載の発明は、更に、  The invention according to claim 9 further includes
複数の撮影シーンの各々対応して撮影条件を記憶した記憶手段と、  Storage means for storing shooting conditions corresponding to each of a plurality of shooting scenes;
前記計数手段により計数された前記画像部分の個数に基づき、前記複数の撮影シーンのいずれかを選択する選択手段と、  Selection means for selecting one of the plurality of shooting scenes based on the number of the image portions counted by the counting means;
この選択手段により選択された撮影シーンに対応して前記記憶手段に記憶されている前記撮影条件に基づき、当該カメラ装置の撮影動作を制御する撮影制御手段を更に備えることを特徴とする。  The image processing apparatus further includes shooting control means for controlling the shooting operation of the camera device based on the shooting conditions stored in the storage means corresponding to the shooting scene selected by the selection means.
請求項１０記載の発明は、更に、  The invention described in claim 10 further includes:
撮影者の指示操作に基づき、前記複数の撮影シーンのいずれかを選択する指示選択手段を備え、  An instruction selecting means for selecting any of the plurality of shooting scenes based on a photographer's instruction operation;
前記撮影制御手段は、前記指示選択手段により撮影シーンが選択された場合には、該撮影シーンに対応して前記記憶手段に記憶されている前記撮影条件に基づき、当該カメラ装置の撮影動作を制御することを特徴とする。  When the shooting scene is selected by the instruction selection unit, the shooting control unit controls the shooting operation of the camera device based on the shooting condition stored in the storage unit corresponding to the shooting scene. It is characterized by doing.
請求項１１記載の発明は、更に、  The invention described in claim 11 further includes:
前記撮影制御手段は、当該カメラ装置の合焦動作、被写界深度、露出条件、フィルタ処理の少なくとも一つを制御することを特徴とする。  The imaging control unit controls at least one of a focusing operation, a depth of field, an exposure condition, and a filtering process of the camera device.
請求項１２記載の発明は、  The invention according to claim 12
被写体を結像させる結像手段と、この結像手段により結像される画像を表示する表示手段とを備えるカメラ装置が有するコンピュータを、  A computer having a camera device including an image forming means for forming an image of a subject and a display means for displaying an image formed by the image forming means;
前記表示手段に表示された画像内から計数の対照となる複数の被写体を任意に選択する選択手段と、  A selection means for arbitrarily selecting a plurality of subjects to be counted from the image displayed on the display means;
前記表示手段に表示された画像内において、各被写体から抽出された特徴データとの類似度が所定以上である画像部分の個数を被写体毎に計数する計数手段と、  Counting means for counting, for each subject, the number of image portions whose similarity with feature data extracted from each subject is greater than or equal to a predetermined value in the image displayed on the display means;
前記計数手段により各被写体毎に計数された個数を、各被写体毎に区別して前記表示手段に表示させる表示制御手段と、  Display control means for causing the display means to display the number counted for each subject by the counting means for each subject;
して機能させることを特徴とする。  It is characterized by functioning.

以上説明したように本発明によれば、撮像して表示された画像内から計数の対照となる複数の被写体を任意に選択し、この選択された各被写体毎に計数された個数を、各被写体毎に区別して表示手段に同時に表示させることで、任意に選択した複数の被写体の各々の個数を同時に表示して比較確認することが可能となる。 As described above, according to the present invention, a plurality of subjects to be counted are arbitrarily selected from within an image captured and displayed, and the number counted for each selected subject is determined as each subject. By distinguishing them and displaying them simultaneously on the display means, it is possible to simultaneously display the number of each of a plurality of arbitrarily selected subjects for comparison and confirmation .

以下、本発明の一実施の形態を図に従って説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

（第１の実施の形態）
図１（Ａ）は各実施の形態に共通するデジタルカメラ１の正面図、（Ｂ）は背面図、（Ｃ）は側面透視図である。このデジタルカメラ１の本体２には、その上面部に半押し機能を備えたレリーズ釦（シャッタースイッチ）３と電源スイッチ４とが配置されており、正面部にはグリップ部５、ストロボ６及び撮像レンズ部の受光窓７が配置されている。また、背面部には、モード切替スイッチ８、ズーム操作キー９、左右上下操作部を有するカーソルキー１０、決定／ＯＫキー１１、ＤＩＳＰキー１２、メニューキー１３及び電子ファインダとしても機能するＬＣＤからなる表示部１４が配置されているとともに、メモリ媒体と電池とを収納するメモリ媒体／電池収納部１５が設けられている。また、回動式ミラー１８、レンズ群１９及びＣＣＤ等で構成される撮像素子２０等が配置されている。 (First embodiment)
1A is a front view of a digital camera 1 common to each embodiment, FIG. 1B is a rear view, and FIG. 1C is a side perspective view. The main body 2 of the digital camera 1 is provided with a release button (shutter switch) 3 and a power switch 4 having a half-press function on the upper surface portion, and a grip portion 5, a strobe 6 and an image pickup device on the front portion. A light receiving window 7 of the lens unit is arranged. Further, on the back side, a mode changeover switch 8, a zoom operation key 9, a cursor key 10 having a left / right / up / down operation unit, an enter / OK key 11, a DISP key 12, a menu key 13 and an LCD functioning also as an electronic viewfinder are provided. A display unit 14 is disposed, and a memory medium / battery storage unit 15 for storing a memory medium and a battery is provided. In addition, an image pickup device 20 including a rotary mirror 18, a lens group 19, a CCD, and the like are disposed.

図２は、デジタルカメラ１の概略的回路構成を示すブロック図である。このデジタルカメラ１は、主制御手段１００、撮影制御手段１０１及び画像処理手段１０２を備えている。主制御手段１００は、抽出領域の選択手段１０３、テンプレート設定手段１０４、特徴データのテンプレートメモリ１０５及び撮影シーン設定手段１０６を有し、抽出領域の選択手段１０３にはカーソル操作手段１０７からのカーソル操作情報が入力され、撮影シーン設定手段１０６にはモード選択操作手段１０８からのモード選択操作情報が入力される。撮影制御手段１０１には、撮影シーン選択手段１０３からの設定情報、後述する認識処理ブロックからの認識情報、測光手段１０９からの測光情報、測距手段１１０からの測距情報が入力される。撮影制御手段１０１は、これら入力情報に基づき、照明手段１１１、光学系駆動部１１２、絞り／シャッター駆動部１１３、撮像駆動部１１４及び信号処理手段１１５を制御する。 FIG. 2 is a block diagram illustrating a schematic circuit configuration of the digital camera 1. The digital camera 1 includes a main control unit 100, an imaging control unit 101, and an image processing unit 102. The main control unit 100 includes an extraction region selection unit 103, a template setting unit 104, a feature data template memory 105, and a shooting scene setting unit 106. The extraction region selection unit 103 includes a cursor operation from the cursor operation unit 107. The information is input, and the mode selection operation information from the mode selection operation unit 108 is input to the shooting scene setting unit 106. The shooting control unit 101 receives setting information from the shooting scene selection unit 103, recognition information from a recognition processing block described later, photometry information from the photometry unit 109, and distance measurement information from the range finding unit 110. The imaging control unit 101 controls the illumination unit 111, the optical system driving unit 112, the aperture / shutter driving unit 113, the imaging driving unit 114, and the signal processing unit 115 based on the input information.

他方、撮影光学系１１６の光軸上には、絞り１１７、シャッター１１８及び撮像手段１１９が配置されている。撮影光学系１１６は、光学系駆動部１１２により、絞り１１７とシャッター１１８は絞り／シャッター駆動部１１３により、撮像手段１１９は撮像駆動部１１４により各々駆動される。信号処理手段１１５は、撮像手段１１９からのアナログ信号を処理するとともにデジタル信号に変換し、このデジタル画像信号は画像バッファメモリ１２０を介して、前記画像信号処理手段１０２に入力される。 On the other hand, an aperture 117, a shutter 118, and an imaging unit 119 are disposed on the optical axis of the photographing optical system 116. The photographing optical system 116 is driven by the optical system driving unit 112, the diaphragm 117 and the shutter 118 are driven by the diaphragm / shutter driving unit 113, and the imaging unit 119 is driven by the imaging driving unit 114, respectively. The signal processing unit 115 processes the analog signal from the imaging unit 119 and converts it into a digital signal, and this digital image signal is input to the image signal processing unit 102 via the image buffer memory 120.

画像処理手段１０２は、前処理ブロック１２１、特徴抽出ブロック１２２及び認識処理ブロック１２３を有している。この画像処理手段１０２において、各ブロック１２１〜１２３を経由しない画像バッファメモリ１２０からの画像データは、画像圧縮／符号化手段１２４に入力される一方、各ブロック１２１〜１２３を経由した画像データは、画像圧縮／符号化手段１２４と前記撮影制御手段１０１とに入力される。前記画像圧縮／符号化手段１２４は、これら画像データを圧縮及び符号化処理し、この処理された画像データは画像記録手段１２５に記録される。また、特徴抽出ブロック１２２からの情報は、前記特徴データのテンプレートメモリ１０５に入力されて記憶されるように構成されている。 The image processing unit 102 includes a preprocessing block 121, a feature extraction block 122, and a recognition processing block 123. In this image processing means 102, image data from the image buffer memory 120 that does not pass through the blocks 121 to 123 is input to the image compression / encoding means 124, while image data that passes through the blocks 121 to 123 is The data is input to the image compression / encoding unit 124 and the photographing control unit 101. The image compression / encoding unit 124 compresses and encodes these image data, and the processed image data is recorded in the image recording unit 125. The information from the feature extraction block 122 is configured to be input and stored in the template memory 105 of the feature data.

図３は、デジタルカメラ１の具体的回路構成を示すブロック図である。図において、操作部２３は、前記レリーズ釦３や電源スイッチ４等の図１に示したスイッチやキー群等で構成され、このスイッチ及びキー群の操作情報は、入力回路２４を介して、制御部２５に入力される。制御部２５は、ＣＰＵ及びその周辺回路と、ＣＰＵの作業用メモリであるＲＡＭ等から構成されるマイクロコンピュータであり、各部を制御する。 FIG. 3 is a block diagram showing a specific circuit configuration of the digital camera 1. In the figure, the operation unit 23 is composed of the switches and key groups shown in FIG. 1 such as the release button 3 and the power switch 4, and the operation information of the switches and key groups is controlled via the input circuit 24. Input to the unit 25. The control unit 25 is a microcomputer including a CPU and its peripheral circuits, a RAM that is a working memory of the CPU, and the like, and controls each unit.

この制御部２５には、表示メモリ２６、表示駆動ブロック２７、画像バッファメモリ２８、画像信号処理部２９、圧縮符号化／伸長復号化部３０、静止画／動画画像メモリ３１、プログラムメモリ３２、データメモリ３３、メモリＩＦ３４、外部Ｉ／Ｏインターフェース３５、通信制御ブロック３６、電源制御ブロック３７及び撮影制御部３８が接続されている。表示メモリ２６には、表示部１４に表示される各種表示データが一時的に記憶される。表示駆動ブロック２７は、前記表示部１４を駆動し、画像バッファメモリ２８は、画像データを処理する際等において一時的に格納する。 The control unit 25 includes a display memory 26, a display drive block 27, an image buffer memory 28, an image signal processing unit 29, a compression encoding / decompression decoding unit 30, a still image / moving image memory 31, a program memory 32, data A memory 33, a memory IF 34, an external I / O interface 35, a communication control block 36, a power supply control block 37, and a photographing control unit 38 are connected. The display memory 26 temporarily stores various display data displayed on the display unit 14. The display drive block 27 drives the display unit 14, and the image buffer memory 28 temporarily stores the image data when processing it.

画像信号処理部２９は、後述する撮像素子から制御部２５が取り込んだ画像信号に対する各種処理を実行するＤＳＰからなる。圧縮符号化／伸長復号化部３０は、この画像信号処理部で処理された画像データを記録時には伸長処理し、記録した画像データを再生する際には伸長復号化する。静止画／動画画像メモリ３１は、レリーズ釦３の操作により撮像された画像データ（静止画像データ）を記録保存する。プログラムメモリ３２は、後述するフローチャートに示す制御部２５の制御プログラム、及び「人物撮影モード」「夜景撮影モード」等の撮影シーン毎の撮影制御プログラム、撮影シーン毎の対象被写体の種別情報、該対象被写体の種別に対応する特徴量データ等が記憶されている。 The image signal processing unit 29 is a DSP that executes various processes on an image signal taken in by the control unit 25 from an image sensor described later. The compression encoding / decompression decoding unit 30 decompresses the image data processed by the image signal processing unit at the time of recording, and decompresses and decodes the recorded image data at the time of reproduction. The still image / moving image memory 31 records and saves image data (still image data) captured by operating the release button 3. The program memory 32 includes a control program of the control unit 25 shown in a flowchart to be described later, a shooting control program for each shooting scene such as “person shooting mode” and “night scene shooting mode”, target subject type information for each shooting scene, the target Feature amount data corresponding to the type of subject is stored.

データメモリ３３は各種データが予め格納されているとともに画像データ以外の他のデータを格納し、また、後述するフローチャートに示すテンプレートメモリとしても機能する。メモリＩＦ３４は、着脱自在な外部メモリ媒体３９に接続されている。外部Ｉ／Ｏインターフェース３５は、ＵＳＢコネクタ４０に接続され、通信制御ブロック３６は無線ＬＡＮ等送受信部４１を介してアンテナ４２に接続され、電源制御ブロック３７には、電池４３が接続されている。電池４３からの電力は電源制御ブロック３７及び制御部２５を介して各部に供給される。 The data memory 33 stores various data in advance and stores data other than the image data, and also functions as a template memory shown in a flowchart described later. The memory IF 34 is connected to a removable external memory medium 39. The external I / O interface 35 is connected to the USB connector 40, the communication control block 36 is connected to the antenna 42 via the wireless LAN transmission / reception unit 41, and the battery 43 is connected to the power control block 37. The electric power from the battery 43 is supplied to each unit via the power control block 37 and the control unit 25.

前記撮影制御部３８には、前記ストロボ６の照射角を駆動する照射各駆動部４４、照射を駆動するストロボ照明駆動部４５とが接続されているとともに、測光、測距センサ４６の受光角を駆動する受光角駆動部４７、測光、測距センサ４６から色温度を検出して出力する色温度検出部４８、測光データを検出して出力する測光部４９及び測距データを検出して出力する測距部５０が接続されている。さらに前記撮影制御部３８には、前記第１及び第２角速度センサ１７、１７が各々角速度検出部５１、５２、積分器５３、５４を介して接続されている。 The photographing control unit 38 is connected to each irradiation drive unit 44 for driving the irradiation angle of the strobe 6 and a strobe illumination driving unit 45 for driving the irradiation. The light receiving angle driving unit 47 to be driven, the color temperature detecting unit 48 for detecting and outputting the color temperature from the photometric / ranging sensor 46, the photometric unit 49 for detecting and outputting the photometric data, and the distance measuring data are detected and output. A distance measuring unit 50 is connected. Furthermore, the first and second angular velocity sensors 17 and 17 are connected to the photographing control unit 38 via angular velocity detectors 51 and 52 and integrators 53 and 54, respectively.

一方、ズームレンズユニット５５には、前記回動式ミラー１８、レンズ群１９及び撮像素子２０が配置されているとともに、この回動式ミラー１８を回転駆動する駆動機構５６、前記レンズ群１９中に介挿された絞り５７が設けられており、また、撮像素子２０の前面にはシャッター５８が配置されている。 On the other hand, the zoom lens unit 55 is provided with the rotary mirror 18, the lens group 19, and the imaging device 20, and a drive mechanism 56 that rotationally drives the rotary mirror 18 and the lens group 19. An inserted diaphragm 57 is provided, and a shutter 58 is disposed in front of the image sensor 20.

さらに、前記撮影制御部３８には、電動ミラーＹ方向駆動部５９、電動ミラーＸ方向駆動部６０、フォーカスレンズ駆動部６１、ズームレンズ駆動部６２、絞り駆動部６３、シャッター駆動部６４、映像信号処理部６５及びタイミング制御＆ドライバ６６が接続されている。電動ミラーＹ方向駆動部５９は、駆動機構５６を駆動して回動式ミラー１８を上下方向に動作させるものであり、電動ミラーＸ方向駆動部６０は左右方向に動作させるものである。フォーカスレンズ駆動部６１は、レンズ群１９中のフォーカスレンズを駆動するものであり、ズームレンズ駆動部６２は、ズーム操作キー９の操作に応じて被写体像を拡大または縮小すべくレンズ群１９中ズームレンズを駆動するものである。また、絞り駆動部６３は前記絞り５７を駆動するものであり、シャッター駆動部６４は前記シャッター５８を駆動するものである。前記映像信号処理部６５は、撮像素子２０からのアナログ信号をデジタル信号に変換するＡ／Ｄ回路及びこのＡ／Ｄ回路からのデジタル撮像信号を保持するＣＤＳと、ＣＤＳから撮像信号を供給されるアナログアンプであるゲイン調整アンプ（ＡＧＣ）等からなる。 Further, the photographing control unit 38 includes an electric mirror Y direction driving unit 59, an electric mirror X direction driving unit 60, a focus lens driving unit 61, a zoom lens driving unit 62, an aperture driving unit 63, a shutter driving unit 64, and a video signal. A processing unit 65 and a timing control & driver 66 are connected. The electric mirror Y direction drive unit 59 drives the drive mechanism 56 to operate the rotary mirror 18 in the vertical direction, and the electric mirror X direction drive unit 60 operates in the left and right direction. The focus lens driving unit 61 drives the focus lens in the lens group 19, and the zoom lens driving unit 62 zooms in the lens group 19 to enlarge or reduce the subject image in accordance with the operation of the zoom operation key 9. The lens is driven. The aperture driving unit 63 drives the aperture 57, and the shutter driving unit 64 drives the shutter 58. The video signal processor 65 is supplied with an A / D circuit that converts an analog signal from the image sensor 20 into a digital signal, a CDS that holds the digital image signal from the A / D circuit, and an image signal from the CDS. It consists of a gain adjustment amplifier (AGC), which is an analog amplifier.

図４は本実施の形態の処理手順を示すフローチャートであり、図５はこのフローチャートに対応する説明図である。制御部２５は、画像認識撮影モードが設定されている状態において、プログラムメモリ３２に格納されているプログラムに基づき、図４に示すフローチャートに従って処理を実行する。先ず、ユーザーによる操作部２３での操作により、シーン別撮影モードが設定されているか否かを判断する（ステップＳ１０１）。シーン別撮影モードが設定されていない場合には、その他のモード処理を実行する（ステップＳ１０２）。シーン別撮影モードが設定されている場合には、選択撮影シーンに応じて、予め設定された対象被写体の種別情報をメモリ（プログラムメモリ３２）から読み出す（ステップＳ１０２）。さらに、当該対象被写体の種別に対応する特徴データを前記メモリ（プログラムメモリ３２）から読み出し、テンプレートメモリ（データメモリ３３）に記憶する（ステップＳ１０３）。 FIG. 4 is a flowchart showing the processing procedure of the present embodiment, and FIG. 5 is an explanatory diagram corresponding to this flowchart. The control unit 25 executes processing according to the flowchart shown in FIG. 4 based on the program stored in the program memory 32 in the state where the image recognition photographing mode is set. First, it is determined whether or not a scene-specific shooting mode is set by an operation of the operation unit 23 by the user (step S101). If the scene-specific shooting mode is not set, other mode processing is executed (step S102). If the scene-by-scene shooting mode is set, preset target subject type information is read from the memory (program memory 32) in accordance with the selected shooting scene (step S102). Further, feature data corresponding to the type of the subject is read from the memory (program memory 32) and stored in the template memory (data memory 33) (step S103).

一方、シーン別撮影モードが設定されていない場合には、カスタム設定シーンの撮影モードが設定されているか否かを判断し（ステップＳ１０４）、設定されていない場合にはその他の処理に移行する。また、カスタム設定シーンの撮影モードが設定されている場合には、撮影スルー画像視野内からカーソル等でサンプル抽出領域を選択する（ステップＳ１０５）。すなわち、カスタム設定シーンの撮影モードの状態においては、レンズ群１９により撮像素子２０上に結像された被写体像が、図５（Ａ）に示すように、スルー画像Ｐ１として表示部１４に表示されている。また、このカスタム設定シーンの撮影モードにおいては、前記カーソルキー１０の操作により、位置及び大きさを変更可能な囲繞形状のカーソルＣ１をスルー画像Ｐ１上に表示させる。そして、カーソルキー１０の操作によりカーソルＣ１の位置及び大きさを調整して、決定／ＯＫキー１１の操作により決定すると、カーソルＣ１により囲繞された領域がサンプル抽出領域として選択される。したがって、図５（Ａ）の例においては、カーソルＣ１により囲繞された果物（蜜柑）の領域がサンプル抽出領域として選択されることとなる。 On the other hand, if the shooting mode for each scene is not set, it is determined whether or not the shooting mode for the custom setting scene is set (step S104). If it is not set, the process proceeds to other processing. If the shooting mode of the custom setting scene is set, a sample extraction region is selected with a cursor or the like from within the shooting through image field of view (step S105). That is, in the custom setting scene shooting mode state, the subject image formed on the image sensor 20 by the lens group 19 is displayed on the display unit 14 as a through image P1, as shown in FIG. ing. In the shooting mode of the custom setting scene, the cursor-shaped cursor C1 whose position and size can be changed is displayed on the through image P1 by operating the cursor key 10. When the position and size of the cursor C1 are adjusted by operating the cursor key 10 and determined by operating the enter / OK key 11, the area surrounded by the cursor C1 is selected as the sample extraction area. Therefore, in the example of FIG. 5A, the fruit (mikan) region surrounded by the cursor C1 is selected as the sample extraction region.

次に、この選択された領域内のブロック画像から特徴抽出処理により特徴を抽出し（ステップＳ１０６）、この抽出した特徴量データを、前記テンプレートメモリに記憶する（ステップＳ１０７）。したがって、このステップＳ１０７での処理により、図５（Ｂ）に示すように、テンプレートメモリにはサンプル画像Ｐ２の特徴量データＤとして、色相＝ＨＨＨ、彩度＝ＳＳＳ、明度＝ＶＶＶ、輪郭形状＝ｆｆｆ、大きさ＝ＬＬＬが記憶されることとなる。 Next, features are extracted from the block image in the selected region by feature extraction processing (step S106), and the extracted feature data is stored in the template memory (step S107). Therefore, as a result of the processing in step S107, as shown in FIG. 5B, the template memory includes the feature amount data D of the sample image P2, as hue = HHH, saturation = SSS, brightness = VVV, contour shape = fff, size = LLL will be stored.

そして、ステップＳ１０３またはステップＳ１０７に続くステップＳ１０８では、この時点で表示部１４に表示されているスルー画像を読み込む。次に、カーソル操作等により、取り込んだ被写体像から対角の２点で識別したい領域を選択する（ステップＳ１０９）。すなわち、識別すべき領域を選択する処理を実行する際には、図５（Ｃ）に示すように、表示部１４において前記カーソルＣ１とは異なる十字状の第１カーソルＣ２をスルー画像Ｐ１上に表示させる。そして、カーソルキー１０の操作により第１カーソルＣ２の位置を調整して、決定／ＯＫキー１１の操作により決定すると、第１カーソルＣ２の位置が決定される。次に、カーソルキー１０が操作されると、図５（Ｄ）に示すように、第１カーソルＣ２から分離した第２カーソルＣ３を表示させ、カーソルキー１０の操作により第２カーソルＣ３の位置を調整して、決定／ＯＫキー１１の操作により決定すると、第２カーソルＣ３の位置が決定される。そして、この第１カーソルＣ２と第２カーソルＣ３を対角とする矩形の領域を決定し、この矩形の領域を識別すべき領域である識別領域Ｆ１とする。なお、このとき、表示部１４の隅部に、前記サンプル画像Ｐ２を表示させておく。 Then, in step S108 following step S103 or step S107, the through image currently displayed on the display unit 14 is read. Next, a region to be identified by two diagonal points is selected from the captured subject image by a cursor operation or the like (step S109). That is, when executing the process of selecting the region to be identified, as shown in FIG. 5C, the cross-shaped first cursor C2 different from the cursor C1 is displayed on the through image P1 on the display unit 14. Display. Then, when the position of the first cursor C2 is adjusted by operating the cursor key 10 and determined by operating the enter / OK key 11, the position of the first cursor C2 is determined. Next, when the cursor key 10 is operated, as shown in FIG. 5D, the second cursor C3 separated from the first cursor C2 is displayed, and the position of the second cursor C3 is moved by operating the cursor key 10. When the adjustment is performed and the determination / OK key 11 is operated, the position of the second cursor C3 is determined. Then, a rectangular area whose diagonal is the first cursor C2 and the second cursor C3 is determined, and this rectangular area is set as an identification area F1 that is an area to be identified. At this time, the sample image P2 is displayed at the corner of the display unit 14.

次に、識別領域Ｆ１内の被写体像から抽出領域を検出する（ステップＳ１１０）。この抽出領域の検出は、後述するようにスルー画像の画像データの輝度信号及び色差信号から、近い輝度または色差信号別に、例えば同系色の色相別等に領域を分割し、さらに、領域の境界線となる輪郭線を抽出し、この輪郭線で囲まれた部分を一つの抽出領域として検出する。したがって、図５（Ｄ）に示すように、識別領域Ｆ１内に４個の蜜柑ｍと１個の林檎ａとが存在すると、黄色系の色相領域であって輪郭線で囲まれた４個の蜜柑ｍ領域と、黄赤系の色相領域であって輪郭線で囲まれた１個の林檎ａ領域との、計５個の抽出領域が検出されることとなる。引き続き、この検出した抽出領域を順次選択し（ステップＳ１１１）、この選択した抽出領域におけるスルー画像の特徴抽出処理を実行する（ステップＳ１１２）。つまり、選択した抽出領域において、前記特徴量データＤが有する特徴種別の特徴量を抽出する。したがって、本例においては、特徴量データＤは、色相、彩度、明度、輪郭形状、大きさであったことから、抽出領域にこれら色相、彩度、明度、輪郭形状、大きさの特徴量を抽出する。 Next, an extraction area is detected from the subject image in the identification area F1 (step S110). As will be described later, this extraction area is detected by dividing the area into brightness or color difference signals from the brightness signal and color difference signal of the image data of the through image, for example, by hue of similar colors, and the boundary lines of the areas. A contour line is extracted, and a portion surrounded by the contour line is detected as one extraction region. Therefore, as shown in FIG. 5 (D), if there are four mandarin oranges m and one apple a in the identification area F1, the four shaded yellow areas surrounded by the outlines. A total of five extraction regions, i.e., the mandarin orange m region and one apple a region that is a yellow-red hue region and surrounded by a contour line, are detected. Subsequently, the detected extraction areas are sequentially selected (step S111), and the feature extraction process of the through image in the selected extraction area is executed (step S112). That is, the feature amount of the feature type included in the feature amount data D is extracted in the selected extraction region. Therefore, in the present example, the feature amount data D is hue, saturation, brightness, contour shape, and size. Therefore, the feature amount of these hue, saturation, brightness, contour shape, and size is included in the extraction region. To extract.

そして、このステップＳ１１２で抽出した特徴量と、テンプレートメモリに記憶されている前記特徴量データＤの色相＝ＨＨＨ、彩度＝ＳＳＳ、明度＝ＶＶＶ、輪郭形状＝ｆｆｆ、大きさ＝ＬＬＬと比較し類似度を算出する（ステップＳ１１３）。つまり、特徴量データＤの各値と抽出した特徴量の各値との比率を算出する。次に、この算出した比率である類似度が所定値以上であるか否かを判断し（ステップＳ１１４）、所定値未満である場合にはステップＳ１１５及びステップＳ１１６の処理を行うことなくステップＳ１１７に進む。また、類似度が所定値以上である場合には、今回の選択領域を類似被写体の存在領域としてデータメモリ３３に記憶する（ステップＳ１１５）。また、類似被写体の識別数を計数しているカウンタの値をカウントアップさせる（ステップＳ１１６）。このとき、選択されている抽出領域が蜜柑ｍであれば、類似度は所定以上となることから、ステップＳ１１６の処理が実行されてカウンタの値がカウントアップされる。しかし、選択されている抽出領域が林檎ａであれば、類似度は所定未満となることから、ステップＳ１１６の処理が実行されずカウンタの値がカウントアップされることもない。 Then, the feature amount extracted in step S112 is compared with the hue = HHH, saturation = SSS, brightness = VVV, contour shape = fff, size = LLL of the feature amount data D stored in the template memory. The similarity is calculated (step S113). That is, the ratio between each value of the feature value data D and each value of the extracted feature value is calculated. Next, it is determined whether or not the degree of similarity, which is the calculated ratio, is greater than or equal to a predetermined value (step S114). If it is less than the predetermined value, the process proceeds to step S117 without performing steps S115 and S116. move on. If the similarity is greater than or equal to a predetermined value, the current selected area is stored in the data memory 33 as an existing area of a similar subject (step S115). In addition, the value of the counter that counts the identification number of similar subjects is counted up (step S116). At this time, if the selected extraction region is mandarin orange m, the similarity is greater than or equal to a predetermined value, so the process of step S116 is executed and the counter value is counted up. However, if the selected extraction region is apple a, the degree of similarity is less than a predetermined value, so that the process of step S116 is not executed and the value of the counter is not counted up.

そして、最後の抽出領域まで以上のステップＳ１１１〜ステップＳ１１６の処理を実行したか否かを判断し（ステップＳ１１７）、最後の抽出領域となるまでステップＳ１１１からの処理を繰り返す。したがって、本例においては、ステップＳ１１１からの処理が５回繰り返されることとなり、５回繰り返されることにより、最後の抽出領域となったならば、識別された類似被写体の領域とカウント数を表示させる（ステップＳ１１７）。このとき、本例においては前述のように、識別領域Ｆ１内に４個の蜜柑ｍと１個の林檎ａとが存在することから、このステップＳ１１７での処理より、図５（Ｅ）に示すように、表示部１４には類似被写体の領域Ｆ２とカウント数Ｎ（Ｃｏｕｎｔ＝４）とが表示されることとなる。 Then, it is determined whether or not the processing in steps S111 to S116 has been executed up to the last extraction region (step S117), and the processing from step S111 is repeated until the last extraction region is reached. Therefore, in this example, the process from step S111 is repeated five times, and if the last extracted area is obtained by repeating the process five times, the identified similar subject area and the count number are displayed. (Step S117). At this time, in the present example, as described above, there are four mandarin oranges m and one apple a in the identification area F1, so that the processing in this step S117 is shown in FIG. As described above, the display unit 14 displays the similar subject area F2 and the count number N (Count = 4).

次に、類似被写体の領域またはカウント数に応じて撮影処理を実行する（ステップＳ１１８）。つまり、識別された被写体に焦点が合うように自動焦点（ＡＦ）処理する等の焦点制御を行い、あるいは、識別された被写体が画角一杯になるようにズーム倍率やレンズ焦点距離を調整するなどの画角制御を行い、もしくは、識別された被写体に応じて撮影シーン別プログラムを自動選択する等のシーン自動選択処理等を行う。したがって、このステップＳ１１８の処理が実行された後、ユーザがレリーズ釦３を操作することにより、ステップＳ１１８での処理に応じた画像が撮像されることとなる。 Next, a photographing process is executed according to the similar subject area or the number of counts (step S118). That is, focus control such as automatic focusing (AF) processing is performed so that the identified subject is in focus, or the zoom magnification and the lens focal length are adjusted so that the identified subject has a full angle of view. Angle of view control is performed, or automatic scene selection processing such as automatic selection of a shooting scene-specific program according to the identified subject is performed. Therefore, after the process of step S118 is executed, the user operates the release button 3 to capture an image according to the process of step S118.

なお、本実施の形態においては、表示部１４にカウント数Ｎを表示させるようにしたが、表示させることなく、音声によりカウント数Ｎを報知するようにしてもよい。 In the present embodiment, the count number N is displayed on the display unit 14, but the count number N may be notified by voice without being displayed.

図６は、具体例として人物の顔を抽出して撮影する場合の動作例を示す図である。本例では、
１）予め、人間の顔の特徴データをテンプレートしてメモリに記憶しておく。
１′）あるいは、図示に示すように、撮影時に、識別したい被写体である人間の顔の見本となる被写体画像を、スルー画像内から領域を選択して指定し、選択された領域の被写体像を、識別対象の被写体のサンプルとして登録する。 FIG. 6 is a diagram illustrating an operation example when a human face is extracted and photographed as a specific example. In this example,
1) Human face feature data is previously stored as a template in a memory.
1 ′) Alternatively, as shown in the figure, a subject image that is a sample of a human face that is a subject to be identified is selected by selecting a region from the through image, and a subject image in the selected region is selected. And register as a sample of the subject to be identified.

そして、「人物モード」が選択された場合には、この記憶された人間の顔の特徴データを呼び出して、テンプレートデータとして、指定された画像領域の被写体像に対して、テンプレートの特徴量に合致する被写体があるか否かを検索するテンプレートマッチング動作を行う。すなわち、
２）まず、被写体画像のスルー画像から識別したい領域をカーソル操作などで選択する。
３）次に、撮像画像データの輝度信号及び色差信号から、近い輝度値または色差信号別に、例えば、同系の色相別等に領域を分割し、さらに、領域の境界線となる輪郭線を抽出する。
４）各分割領域（抽出領域）の内、例えばテンプレート画像の色相に近い色相の領域のみを抽出する。
５）さらに、領域の画像を２値化して、また、必要ならば、拡張、縮小処理などを行って細かい凹凸等を削減し、得られた輪郭の形状や大きさをテンプレートの特徴量と比較して、類似度が所定値以上高ければ、その領域を当該認識対象の人間の顔と認識して、その数をカウントする。
６）識別された被写体の計数値を表示するとともに、識別された領域の輪郭線、またはその中を塗りつぶした被写体像をスルー画像に重ねて表示して、識別領域を区別表示できるようにする。 When the “person mode” is selected, the stored human face feature data is recalled, and the template feature data matches the feature amount of the subject in the designated image area. A template matching operation for searching whether or not there is a subject to be performed is performed. That is,
2) First, a region to be identified from the through image of the subject image is selected by a cursor operation or the like.
3) Next, from the luminance signal and color difference signal of the captured image data, the region is divided into similar luminance values or color difference signals, for example, according to similar hues, and a contour line that is a boundary line of the region is extracted. .
4) For example, only a region having a hue close to the hue of the template image is extracted from each divided region (extraction region).
5) Furthermore, binarize the image of the area, and if necessary, perform expansion and reduction processing to reduce fine irregularities, etc., and compare the shape and size of the obtained contour with the feature amount of the template If the similarity is higher than a predetermined value, the area is recognized as a human face to be recognized and the number is counted.
6) The count value of the identified subject is displayed, and the contour line of the identified area or the subject image filled in the superimposed area is displayed on the through image so that the identified area can be distinguished and displayed.

図７〜図９は、その他の被写体の認識と計数処理の動作例を示す図である。図７は野鳥を数える例であり、（Ａ）に示すように、カーソルＣ１をスルー画像Ｐ１上に表示させ、前記カーソルキー１０の操作によりカーソルＣ１の位置及び大きさを調整して、決定／ＯＫキー１１の操作により決定すると、カーソルＣ１により囲繞された領域がサンプル抽出領域として選択される。この選択された領域内のブロック画像から特徴抽出処理により特徴を抽出し、この抽出した特徴量データを、前記テンプレートメモリに記憶する。 FIG. 7 to FIG. 9 are diagrams showing examples of other subject recognition and counting processing operations. FIG. 7 shows an example of counting wild birds. As shown in FIG. 7A, the cursor C1 is displayed on the through image P1, and the position / size of the cursor C1 is adjusted by the operation of the cursor key 10. When determined by operating the OK key 11, the area surrounded by the cursor C1 is selected as the sample extraction area. Features are extracted from the block image in the selected region by feature extraction processing, and the extracted feature data is stored in the template memory.

また、（Ｂ）に示すように、表示部１４において前記カーソルＣ１とは異なるカーソルＣ２をスルー画像Ｐ２上に表示させる。そして、前記カーソルキー１０の操作によりカーソルＣ２の位置を調整して、決定／ＯＫキー１１の操作により決定すると、カーソルＣ２の位置が決定される。カーソルＣ２で囲まれた矩形の領域を識別すべき領域である識別領域Ｆ１とする。なお、このとき、表示部１４の隅部に、前記サンプル画像Ｐ２を表示させておく。そして、識別領域Ｆ１内の被写体像から抽出領域を検出する等により、前述しように、サンプル画像Ｐ２と類似する類似被写体の識別数を計数して、野鳥の数であるカウント数Ｎ（Ｃｏｕｎｔ＝１０）を表示させる。 Further, as shown in (B), a cursor C2 different from the cursor C1 is displayed on the through image P2 on the display unit 14. Then, the position of the cursor C2 is determined by adjusting the position of the cursor C2 by operating the cursor key 10 and determining by the operation of the OK / OK key 11. The rectangular area surrounded by the cursor C2 is defined as an identification area F1 that is an area to be identified. At this time, the sample image P2 is displayed at the corner of the display unit 14. Then, as described above, the number of identifications of similar subjects similar to the sample image P2 is counted by detecting an extraction region from the subject image in the identification region F1, and the count number N (Count = 10) which is the number of wild birds. ) Is displayed.

図８は集客数を数える例であり、（Ａ）に示すように、予めテンプレートメモリに記憶したサンプル画像Ｐ２を表示部１４に表示させ、（Ｂ）に示すように、カーソルＣ２をスルー画像Ｐ１上に表示させる。そして、前記カーソルキー１０の操作によりカーソルＣ２の位置を調整して、決定／ＯＫキー１１の操作により決定すると、カーソルＣ２の位置が決定される。カーソルＣ２で囲まれた矩形の領域を識別すべき領域である識別領域Ｆ１とする。そして、識別領域Ｆ１内の被写体像から抽出領域を検出する等により、前述しように、サンプル画像Ｐ２と類似する類似被写体の識別数を計数して、観客の数であるカウント数Ｎ（Ｃｏｕｎｔ＝２５７０）を表示させる。 FIG. 8 shows an example in which the number of customers is counted. As shown in FIG. 8A, a sample image P2 stored in advance in the template memory is displayed on the display unit 14, and as shown in FIG. 8B, the cursor C2 is moved to the through image P1. Display above. Then, the position of the cursor C2 is determined by adjusting the position of the cursor C2 by operating the cursor key 10 and determining by the operation of the OK / OK key 11. The rectangular area surrounded by the cursor C2 is defined as an identification area F1 that is an area to be identified. Then, as described above, the number of identifications of similar subjects similar to the sample image P2 is counted by detecting an extraction region from the subject image in the identification region F1, and the count number N (Count = 2570) which is the number of spectators. ) Is displayed.

図９は陳列商品を種別毎に数える例であり、（Ａ）に示すように、表示部１４にスルー画像Ｐ１を表示させた後、（Ｂ）に示すように、予めテンプレートメモリに記憶したサンプル画像Ａ，Ｂを表示させ、カーソルＣ２をスルー画像Ｐ１上に表示させる。そして、前記カーソルキー１０の操作によりカーソルＣ２の位置を調整して、決定／ＯＫキー１１の操作により決定すると、カーソルＣ２の位置が決定される。カーソルＣ２で囲まれた矩形の領域を識別すべき領域である識別領域Ｆ１とする。そして、識別領域Ｆ１内の被写体像から抽出領域を検出する等により、サンプル画像Ａ，Ｂと類似する類似被写体の識別数を計数して、陳列商品の種別毎のカウント数Ｎ（ＣｏｕｎｔＡ＝０３Ｃｏｕｎｔｂ＝０４）を表示させる。無論、（Ａ）に示した陳列商品のみならず、（Ｃ）に示した陳列商品であっても種別毎にカウント数を表示することが可能である。 FIG. 9 is an example of counting displayed products for each type. As shown in FIG. 9A, after the through image P1 is displayed on the display unit 14, a sample stored in the template memory in advance as shown in FIG. 9B. The images A and B are displayed, and the cursor C2 is displayed on the through image P1. Then, the position of the cursor C2 is determined by adjusting the position of the cursor C2 by operating the cursor key 10 and determining by the operation of the OK / OK key 11. The rectangular area surrounded by the cursor C2 is defined as an identification area F1 that is an area to be identified. Then, the number of identifications of similar subjects similar to the sample images A and B is counted by detecting an extraction region from the subject image in the identification region F1, and the count number N (Count A = 03) for each type of display product. (Count b = 04) is displayed. Of course, it is possible to display the number of counts for each type not only for the display product shown in (A) but also for the display product shown in (C).

図１０は、本実施の形態における認識処理、認識被写体の設定メニューの表示例を示す図である。メニューキー１３（ＭＥＮＵ）の操作に応じて、表示部１４に（ａ）に示す画像認識処理の設定メニューを表示する。この状態でカーソルキー１０の上下操作部の操作により「画像認識」が選択されて、決定／ＯＫキー１１の操作されると、画像認識撮影モードが設定されることとなる。また、（ａ）の表示状態で、カーソルキー１０の右操作部が操作されると、（ａ′）の表示状態に移行し、「ＯＦＦ（認識しない）」「オート（撮影シーン別）」・・・等の画像認識モードにおける選択メニューを表示する。また、（ｂ）に示すメニュー画面（（ａ）と同様の画面）において、「認識する被写体の設定」が選択されると、（ｂ′）の認識する被写体の設定メニュー画面に移行する。 FIG. 10 is a diagram showing a display example of the recognition processing and recognition subject setting menu in the present embodiment. In response to the operation of the menu key 13 (MENU), the setting menu for the image recognition process shown in FIG. In this state, when “image recognition” is selected by operating the up / down operation unit of the cursor key 10 and the enter / OK key 11 is operated, the image recognition photographing mode is set. Further, when the right operation portion of the cursor key 10 is operated in the display state of (a), the state shifts to the display state of (a ′), “OFF (not recognized)”, “Auto (by shooting scene)”.・・ Displays the selection menu in the image recognition mode. In addition, when “recognized subject setting” is selected on the menu screen shown in FIG. 5B (the same screen as FIG. 1A), the screen shifts to the recognized subject setting menu screen shown in FIG.

図１１は、図４のフローチャートにおけるステップＳ１０６で実行される特徴抽出処理の詳細を示すフローチャートである。先ず、前記ステップＳ１０５において、カーソル等で選択されたサンプル抽出領域内の対象画像データを取り込む（ステップＳ２０１）。次に、前処理（１）を実行して、画像強調処理、または、鮮鋭化処理、雑音除去処理などを行い（ステップＳ２０２）、前処理（２）を実行して、２値化処理、または、正規化処理、回転処理、座標変換処理などを行う（ステップＳ２０３）。さらに、特徴抽出処理（１）を実行して、フィルタ処理、または、輪郭抽出、領域抽出、細線化、拡張収縮処理などを行い（ステップＳ２０４）、特徴抽出処理（２）を実行して、大きさ、周囲長、面積等の算出、または、円らしさ、フーリエ記述子の算出、輪郭形状の評価値算出などを行う。
（先鋭化、輪郭抽出）
図１２及び図１３は、前記ステップＳ２０２の前処理（１）における鮮鋭化処理、ステップＳ２０４の特徴抽出処理（１）における輪郭抽出処理の例として、１次微分フィルタまたは２次微分フィルタ処理による画像の先鋭化処理、エッジ（輪郭）抽出処理の例を示す図である。図１２に示すように、階調が変化する部分のエッジがボケた画像ｆ（ｉ，ｊ）を、
Δｘ_ｆ＝ｆ（ｉ＋ｊ，ｊ）−ｊ（ｉ−１，ｊ）、
Δｙ_ｆ＝ｆ（ｉ＋ｊ，ｊ）−ｊ（ｉ−１，ｊ）、
ｇ（ｉ，ｊ）＝√｛（Δｘ_ｆ）^２＋（Δｙｆ）^２｝、
または、ｇ（ｉ，ｊ）＝｜Δｘｆ｜＋｜Δｙｆ｜、
等の演算により、１次部分や勾配（Gradient）を求めると、階調が変化する勾配部分や輪郭を抽出できる。 FIG. 11 is a flowchart showing details of the feature extraction process executed in step S106 in the flowchart of FIG. First, in step S105, target image data in the sample extraction area selected by the cursor or the like is captured (step S201). Next, pre-processing (1) is executed, image enhancement processing, sharpening processing, noise removal processing, etc. are performed (step S202), pre-processing (2) is executed, binarization processing, or Normalization processing, rotation processing, coordinate conversion processing, and the like are performed (step S203). Further, the feature extraction process (1) is executed to perform filter processing, contour extraction, region extraction, thinning, expansion / contraction processing, etc. (step S204), and the feature extraction process (2) is executed to increase the size. Then, calculation of circumference, area, etc., or circularity, calculation of Fourier descriptor, calculation of contour shape evaluation value, etc. are performed.
(Sharpening, contour extraction)
FIGS. 12 and 13 show an image obtained by a primary differential filter process or a secondary differential filter process as an example of the sharpening process in the pre-process (1) in step S202 and the contour extraction process in the feature extraction process (1) in step S204. It is a figure which shows the example of this sharpening process and edge (contour) extraction process. As shown in FIG. 12, an image f (i, j) in which the edge of the portion where the gradation changes is blurred
Δx _f = f (i + j, j) −j (i−1, j),
Δy _f = f (i + j, j) −j (i−1, j),
g (i, j) = √ {(Δx _f ) ² + (Δyf) ² },
Or g (i, j) = | Δxf | + | Δyf |
When the primary part and the gradient (Gradient) are obtained by the calculation such as the above, it is possible to extract the gradient part and the contour whose gradation changes.

あるいは、
∇^２ｆ（ｉ，ｊ）＝∂^２ｆ／∂ｘ^２＋∂^２ｆ／∂ｙ^２、または、
∇^２ｆ（ｉ，ｊ）＝ｆ（ｉ＋１，ｊ）＋ｆ（ｉ−１，ｊ）＋ｆ（ｉ，ｊ−１）＋ｆ（ｉ，ｊ −１）−４ｆ（ｉ，ｊ）
等の演算により、さらに微分する２次微分（Laplacian）処理を施し、この結果を原画像データから差し引くと、エッジ部分の高周波成分を強調した画像を合成でき、ボケたエッジや輪郭を強調することができる。 Or
∇ ² f (i, j) = ∂ ² f / ∂x ² + ∂ ² f / ∂y ² , or
∇ ² f (i, j) = f (i + 1, j) + f (i−1, j) + f (i, j−1) + f (i, j−1) −4f (i, j)
By performing a second-order differential (Laplacian) process that is further differentiated by operations such as the above, and subtracting this result from the original image data, an image that emphasizes the high-frequency component of the edge portion can be synthesized, and blurred edges and contours can be emphasized Can do.

エッジや輪郭の強調処理をソフトウエア処理で行うには、図１３に示した「Ｐｒｅｗｉｔｔフィルタ」「Ｓｏｂｅｉフィルタ」や「Ｋｉｒｓｃｈフィルタ」「Ｒｏｂｅｒｔｓフィルタ」等の１次微分の空間フィルタ演算子（オペレータ）、または、「Ｌａｐｌａｃｉａｎｔフィルタ」等の２次微分の空間フィルタ演算子等を用いることができる。図示のように、入力画像２２０は、演算２２１により出力画像２２２に変換される。また、Ａ−Ａ′線上の入力画像２２３は、Ｐｒｅｗｉｔｔフィルタ（１次微分フィルタ）２２４により、Ａ−Ａ′線上の出力画像２２４に変換され、Ｓｏｂｅｉフィルタ（１次微分フィルタ）２２６により、Ａ−Ａ′線上の出力画像２２７に変換され、Ｌａｐｌａｃｉａｎｔフィルタ（２次微分フィルタ）２２８により、Ａ−Ａ′線上の出力画像２２９に変換される。 In order to perform edge and contour enhancement processing by software processing, first-order differential spatial filter operators (operators) such as “Prewitt filter”, “Sobei filter”, “Kirsch filter”, and “Roberts filter” shown in FIG. Alternatively, a second-order differential spatial filter operator such as “Laplacant filter” or the like can be used. As illustrated, the input image 220 is converted into an output image 222 by an operation 221. The input image 223 on the A-A ′ line is converted into an output image 224 on the A-A ′ line by the Prewitt filter (first-order differential filter) 224, and the A- It is converted into an output image 227 on the A ′ line, and is converted into an output image 229 on the AA ′ line by a Laplacent filter (secondary differential filter) 228.

このようなフィルタ処理を、前述の前処理における画像の先鋭化処理や、特徴抽出処理における輪郭抽出やエッジ抽出などに利用できる。被写体像の輪郭形状や外形パターン、面積などを特徴データに利用するには、特徴抽出したい画像データの輝度置等を、画素毎に輪郭強調やエッジ検出用のフィルタ演算等を行って、輪郭強調や外形抽出した画像に変換してから、特徴データ等を抽出する。 Such filter processing can be used for image sharpening processing in the above-described preprocessing, contour extraction or edge extraction in feature extraction processing, and the like. To use the contour shape, outline pattern, area, etc. of the subject image as feature data, perform brightness enhancement of the image data that you want to extract the features for each pixel by performing contour enhancement, edge detection filter operation, etc. Then, after converting the image into an extracted image, feature data and the like are extracted.

（２値化、平均化、輝度変換）
図１４及び図１５は、前記ステップＳ２０２の前処理（１）における画像強調処理、ステップＳ２０３の前処理（２）における２値化処理、ステップＳ２０４の特徴抽出処理（１）の処理例としての輝度の抽出処理の例を示す図である。図１４（ａ）の線形の輝度変換（中間階調の改善）に示すように、入力画像２３０は、演算（変換式）２３１により出力画像２３２に変換される。また、入力画像の輝度ヒストグラム分布Ｐ（ｘ）２３３は、輝度変換式２３４により、出力画像の輝度ヒストグラム分布Ｐ（ｘ）２３５に変換される。また、（ｂ）の２値化処理に示すように、入力画像の輝度分布Ｐ（ｘ）２３６は、変換式２３７により、出力画像の輝度分布Ｐ（ｘ）２３８に変換される。また、図１５の所定の輝度の抽出処理に示すように、入力画像の輝度分布Ｐ（ｘ）２３９は、変換式２４０により、出力画像の輝度分布Ｐ（ｘ）２４１に変換される。 (Binarization, averaging, luminance conversion)
14 and 15 show the luminance as an example of the image enhancement processing in the preprocessing (1) in step S202, the binarization processing in the preprocessing (2) in step S203, and the feature extraction processing (1) in step S204. It is a figure which shows the example of this extraction process. As shown in the linear luminance conversion (improvement of intermediate gradation) in FIG. 14A, the input image 230 is converted into an output image 232 by an operation (conversion formula) 231. Further, the luminance histogram distribution P (x) 233 of the input image is converted into the luminance histogram distribution P (x) 235 of the output image by the luminance conversion equation 234. Further, as shown in the binarization process of (b), the luminance distribution P (x) 236 of the input image is converted into the luminance distribution P (x) 238 of the output image by the conversion equation 237. Further, as shown in the predetermined luminance extraction process of FIG. 15, the luminance distribution P (x) 239 of the input image is converted into the luminance distribution P (x) 241 of the output image by the conversion equation 240.

このように、被写体像の画像データ（輝度値や色差値）の分布パターンなどを特徴データとして利用する場合に、画像データの輝度分布（ヒストグラム）を求め、輝度変換処理を行うことにより中間階調などの強調や圧縮ができる。また、所定の閾値との大小で２値化したり、所定の輝度の領域だけを抽出することができる。あるいは、画素数毎に輝度値や色差値を平均化（モザイク化）してまとめて、パターン単純化や情報量の圧縮ができる。 As described above, when the distribution pattern of the image data (luminance value and color difference value) of the subject image is used as feature data, the luminance distribution (histogram) of the image data is obtained, and the luminance conversion process is performed to obtain the intermediate gradation. Can be emphasized and compressed. Further, binarization can be performed based on a predetermined threshold value, or only a region having a predetermined luminance can be extracted. Alternatively, luminance values and color difference values are averaged (mosaicized) for each number of pixels, and the patterns can be simplified and the information amount can be compressed.

（画像データの変換）
なお、画像データのＲＧＢ信号や輝度信号Ｙ、色差信号Ｃｂ，Ｃｒ、あるいは、色相／彩度／明度を表すＨＳＶ（またはＨＳＢ）データ等は、以下の変換式で相互に容易に変換できる。
例えば、ＲＧＢデータをＹＣｂＣｒデータに変換するには、
Ｙ＝０．２９９＊Ｒ＋０．５８７＊Ｇ＋０．１１４＊Ｂ、
Ｃｂ＝０．１７２＊Ｒ−０．３３９＊Ｇ＋０．５１１＊Ｂ＋ＣＥＮＴＥＲ、
Ｃｒ＝０．５１１＊Ｒ−０．４２８＊Ｇ−０．０８３＊Ｂ＋ＣＥＮＴＥＲ、
ＹＣｂＣｒデータをＲＧＢデータに変換するには、
Ｒ＝Ｙ＋０．０００＊（Ｃｂ−ＣＥＮＴＥＲ）＋１．３７１＊（Ｃｒ−ＣＥＮＴＥＲ）、
Ｇ＝Ｙ−０．３３６＊（Ｃｂ−ＣＥＮＴＥＲ）−０．６９８＊（Ｃｒ−ＣＥＮＴＥＲ）、
Ｂ＝Ｙ＋１．７３２＊（Ｃｂ−ＣＥＮＴＥＲ）＋０．０００＊（Ｃｒ−ＣＥＮＴＥＲ）、 (Conversion of image data)
Note that RGB signals, luminance signals Y, color difference signals Cb and Cr, or HSV (or HSB) data representing hue / saturation / lightness, etc. of image data can be easily converted to each other by the following conversion formula.
For example, to convert RGB data to YCbCr data,
Y = 0.299 * R + 0.587 * G + 0.114 * B,
Cb = 0.172 * R−0.339 * G + 0.511 * B + CENTER
Cr = 0.511 * R−0.428 * G−0.083 * B + CENTER
To convert YCbCr data to RGB data,
R = Y + 0.000 * (Cb-CENTER) + 1.371 * (Cr-CENTER)
G = Y−0.336 * (Cb−CENTER) −0.698 * (Cr−CENTER)
B = Y + 1.732 * (Cb-CENTER) + 0.000 * (Cr-CENTER)

ＲＧＢデータ（各０〜１）をＨＳＶ（またはＨＳＢ）データに変換するには、
ｃｍａｘ＝ｍａｘｉｍｕｍ（Ｒ，Ｇ．Ｂ）、ｃｍｉｎ＝ｍｉｎｉｍｕｍ（Ｒ，Ｇ．Ｂ）とすると、
明度Ｖ＝ｃｍａｘ、彩度Ｓ＝（ｃｍａｘ−ｃｍｉｎ）／ｃｍａｘ（ただし、ｃｍａｘ＝０のときは、Ｓ＝０）、
Ｒ＝ｃｍａｘのときは、色相Ｈ＝６０°＊｛（Ｇ−Ｂ）／（ｃｍａｘ−ｃｍｉｎ）｝、
Ｇ＝ｃｍａｘのときは、色相Ｈ＝６０°＊｛２＋（Ｂ−Ｒ）／（ｃｍａｘ−ｃｍｉｎ）｝、
Ｂ＝ｃｍａｘのときは、色相Ｈ＝６０°＊｛４＋（Ｒ−Ｇ）／（ｃｍａｘ−ｃｍｉｎ）｝、
なお、Ｈ＜０のときはＨに３６０°を加える。また、Ｓ＝０のときはＨ＝０とする。 To convert RGB data (each 0 to 1) to HSV (or HSB) data,
If cmax = maximum (R, GB) and cmin = minimum (R, GB),
Lightness V = cmax, saturation S = (cmax−cmin) / cmax (where c = 0, S = 0),
When R = cmax, hue H = 60 ° * {(GB) / (cmax-cmin)},
When G = cmax, hue H = 60 ° * {2+ (BR) / (cmax-cmin)},
When B = cmax, hue H = 60 ° * {4+ (RG) / (cmax−cmin)},
When H <0, 360 ° is added to H. When S = 0, H = 0.

（色相の抽出、肌色の抽出）
図１６は、前記ステップＳ２０４の特徴抽出処理（１）の領域抽出処理例であって、所定の色の領域を抽出する例として、人間の肌色領域の抽出例を示す図である。（ａ）は、人間の肌の分光反射率特性の例であり、（ｂ）は、撮影画像サンプル中の肌色領域のＲＧＢ値、及びＨＳＶ値の例（色相：Ｈｕｅを０〜３６０°、彩度：Ｓａｔｕｒａｔｉｏｎを０〜２５５、明度：ＶａｌｕｅｏｆＢｒｉｇｈｔｎｅｓｓを０〜２５５として場合）である。また、（ｃ）は、肌色の色相：Ｈｕｅ：０〜３６０°、彩度：Ｓａｔｕｒａｔｉｏｎ：０〜１分布の例で、肌色の画像データの多くは、色相環で６°〜３８°の範囲に多く分布することが知られている（Skin Colour Analysis(by J.Sherrah and S.Gong)http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPLES/GONGI/cvOnline-skinColourAnalysis.html）。これらを利用すれば、ＨＳＶ値から、色相で約６°〜３８°の範囲の人間の顔の肌色とするなど、特定色の被写体の領域を適宜抽出することができる。 (Hue extraction, skin color extraction)
FIG. 16 is an example of region extraction processing in the feature extraction processing (1) of step S204, and shows an example of human skin color region extraction as an example of extracting a predetermined color region. (A) is an example of spectral reflectance characteristics of human skin, and (b) is an example of RGB values and HSV values of the skin color region in the photographed image sample (hue: Hue of 0 to 360 °, chromatic Degree: Saturation is 0 to 255, Brightness: Value of Brightness is 0 to 255). (C) is an example of skin color hue: Hue: 0 to 360 °, saturation: Saturation: 0 to 1 distribution, and most of the skin color image data is in the range of 6 ° to 38 ° in the hue circle. Many distributions are known (Skin Color Analysis (by J. Sherrah and S. Gong) http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPLES/GONGI/cvOnline-skinColourAnalysis.html ). By using these, it is possible to appropriately extract a subject area of a specific color, such as a skin color of a human face having a hue in a range of about 6 ° to 38 ° from the HSV value.

（膨脹、収縮、細線化、線図形化）
図１７は、前記ステップＳ２０４の特徴抽出処理（１）における膨脹収縮処理の例を示す図である。輪郭形状などから被写体の形状や特徴を判別するには、先ず、形状を単純化して情報量を圧縮や線図形化してから判別することが好ましい。例えば、図（ａ）のように、２値化された画像の輪郭や、１−画素と０−画素の境界領域で、１−画素を（８近傍の画素の）一層分だけ外側に太くする、所謂「膨脹」（Expansion）処理を行うと、輪郭などの境界部分の小さな孔や溝が取り除かれ、また、同図（ｂ）のように、逆に、一層分だけ細くする「収縮」（Contraction）処理により境界部分の突起や孤立点などが取り除かれるので、膨脹処理と収縮処理を組み合わせることで、形状の単純化ができる。 (Expansion, contraction, thinning, line drawing)
FIG. 17 is a diagram showing an example of the expansion / contraction process in the feature extraction process (1) in step S204. In order to determine the shape and characteristics of the subject from the contour shape or the like, it is preferable to first determine after simplifying the shape and compressing the information amount into a line figure. For example, as shown in FIG. 1A, the 1-pixel is thickened to the outside by one layer (of 8 neighboring pixels) in the binarized image outline or the boundary area between 1-pixel and 0-pixel. When the so-called “expansion” process is performed, small holes and grooves in the boundary such as the contour are removed, and conversely, as shown in FIG. The contraction process removes protrusions and isolated points at the boundary, so the shape can be simplified by combining the expansion process and the contraction process.

また、２値化された画像から、線幅が１の中心を抽出する、所謂「細線化」（Thinning）処理により、骨状（スケルトン）の概略形状を求めることができる。また、２値化された輪郭パターンの周囲を外側と内側とから境界に沿ってたどり、境界部画素を１とし、残りの画素を０とする「境界線追跡」処理により、輪郭の境界線のみの画像に変換できる。あるいは、輪郭形状の数画素毎に選択した画素のみを折れ線で連結する「折れ線近似」処理などを用いてもよい。 Further, a rough shape of a skeleton can be obtained by so-called “thinning” (Thinning) processing in which a center having a line width of 1 is extracted from a binarized image. Further, only the boundary line of the contour is obtained by “boundary line tracking” processing in which the periphery of the binarized contour pattern is traced along the boundary from the outside and the inside, the boundary pixel is set to 1, and the remaining pixels are set to 0. Can be converted to. Alternatively, a “polyline approximation” process or the like in which only pixels selected for every several pixels of the contour shape are connected by a polyline may be used.

（形状の識別１）
形状の識別は、２値化や線図形化したテンプレート画像やその特徴量を記憶しておき、それらと入力画像の２値化や線図形化した画像や特徴データとの相関度や類似度などを計算して、形状の識別や類似度の判別ができる。あるいは、領域内の画像からテンプレート画像と類似する画像の位置を順次検索するテンプレートマッチング等の手法により、類似する画像や図形の検索ができる。また、簡略には、被写体像の２値化画像や輪郭図形、境界線図形等の縦横の大きさ、半径、周囲長、画素数、面積、幾何学的な寸法の比などから、簡易に類似度判断や被写体の識別を行ってもよい。 (Shape identification 1)
Shape identification is performed by storing binarized or line figured template images and their feature values, and the degree of correlation and similarity between the binarized and line figured image and feature data of the input image. Can be calculated to identify the shape and determine the similarity. Alternatively, it is possible to search for similar images and graphics by a method such as template matching for sequentially searching the positions of images similar to the template image from the images in the region. Also, for simplicity, similarities are easily obtained from the binarized image of the subject image, the vertical and horizontal sizes of the contour figure, boundary line figure, etc., radius, perimeter length, number of pixels, area, ratio of geometric dimensions, etc. Degree determination and subject identification may be performed.

例えば、図１８に示すように、円では周囲長Ｌ＝２πｒ、面積Ｓ＝πｒ^２なので、（周囲長Ｌ）^２／（面積Ａ）＝４πとなることから、図形の輪郭線の円らしさを、
円らしさ＝（周囲長Ｌ）^２／（面積Ｓ）、または、
円形度ｅ＝４π（円らしさ）＝４πＳ／Ｌ^２として計算して、
円らしさが４π（＝12.57）に近い値かどうか、または、円形度が１．０に近いかどうかで、丸い形状の被写体か、凹凸が多い尖った被写体か等を計算し、形状の識別に利用できる。同様に、図１９に示すように、輪郭の縦横比＝長さｈ／幅ｗなどから、形状の「細長さ」等の評価値を求めてもよい。 For example, as shown in FIG. 18, since the circumference of a circle is L = 2πr and the area S = πr ² , (circumference length L) ² / (area A) = 4π, so the circularity of the contour of the figure is reduced. ,
Circularity = (peripheral length L) ² / (area S), or
Calculate with circularity e = 4π (circularness) = 4πS / L ² ,
Whether the object has a round shape or a sharp object with many irregularities is calculated according to whether the circularity is close to 4π (= 12.57) or the circularity is close to 1.0, and the shape is identified. Available. Similarly, as shown in FIG. 19, an evaluation value such as “strip length” of a shape may be obtained from the aspect ratio of the contour = length h / width w.

（形状の識別２．偏角関数、位置座標関数）
また、図２０に示すように、図形の輪郭線に沿って、始点から順次、偏角θ（ｓ）を求めて、１次元関数（偏角関数）に変換して、輪郭形状の特徴量として利用できる。あるいは、図２１に示すように、同様に、輪郭線に沿って順次、位置座標ｘ（ｓ）、ｙ（ｓ）、または、ｚ（ｓ）＝ｘ（ｓ）＋ｊ・ｙ（ｓ）を求めて、１次元の位置座標関数に変換して、輪郭形状の特徴量として利用できる。 (Shape identification 2. Declination function, Position coordinate function)
Further, as shown in FIG. 20, along the contour line of the figure, the deflection angle θ (s) is obtained sequentially from the start point, converted into a one-dimensional function (deflection angle function), and used as the feature amount of the contour shape. Available. Alternatively, as shown in FIG. 21, similarly, position coordinates x (s), y (s) or z (s) = x (s) + j · y (s) are sequentially obtained along the contour line. Thus, it can be converted into a one-dimensional position coordinate function and used as a feature amount of the contour shape.

（形状の識別３．フーリエ記述子）
（Ｚ形記述子）
さらに、例えば、図２２に示すように、前述の偏角関数θ（ｓ）を正規化して、正規化偏角関数：θ_Ｎ（ｓ）＝θ（ｓ）−θ（０）−２πｓ／Ｌ、
を求め、これの（ｉ＝０，１，２・・・，Ｎ−１）の離散化データφ［ｉ］をフーリエ変換して、次のようなＺ（Zahn）形フーリエ記述子を求め、輪郭形状の識別に利用できる。
θ_Ｎ（ｓ）の離散化データ：φ［ｉ］＝θ［ｉ］−θ［０］−２πｉ／Ｎ
（i=0,1,2・・・,N-1）
θ_Ｎ（ｓ）の離散フーリエ変換（＝Ｚ形記述子）：Ｃｚ［ｋ］＝（１／Ｎ）Σφ［ｉ］EXP（−ｊ２πｋｉ／Ｎ）。 (Shape identification 3. Fourier descriptor)
(Z descriptor)
Further, for example, as shown in FIG. 22, the above-mentioned declination function θ (s) is normalized, and the normalized declination function: θ _N (s) = θ (s) −θ (0) −2πs / L ,
And Fourier transform of the discretized data φ [i] of (i = 0, 1, 2,..., N−1) to obtain the following Z (Zahn) type Fourier descriptor, It can be used to identify contour shapes.
Discretized data of θ _N (s): φ [i] = θ [i] −θ [0] −2πi / N
(I = 0,1,2 ..., N-1)
Discrete Fourier transform of θ _N (s) (= Z-type descriptor): Cz [k] = (1 / N) Σφ [i] EXP (−j2πki / N).

（Ｇ形記述子）
同様に、図２３に示すように、前述の位置座標の複素平面座標ｚ（ｓ）を離散フーリエ変換して、Ｇ（Grundlund）形フーリエ記述子を求め、輪郭形状の識別に利用してもよい。
位置座標の複素平面座標ｚ（ｓ）＝ｘ（ｓ）＋ｊ・ｙ［ｓ］、
ｚ（ｓ）の離散化データｚ（ｉ）＝ｘ［ｉ］＋ｊ・ｙ［０］（i=0,1,2・・・,N-1）、
ｚ（ｓ）の離散フーリエ変換（＝Ｇ記述子）：Ｃｇ［ｋ］＝（１／Ｎ）Σｚ［ｉ］EXP（−ｊ２πｋｉ／Ｎ）。 (G type descriptor)
Similarly, as shown in FIG. 23, the above-described complex plane coordinate z (s) of the position coordinate may be subjected to discrete Fourier transform to obtain a G (Grundlund) type Fourier descriptor, which may be used for identifying the contour shape. .
Complex plane coordinates z (s) = x (s) + j · y [s] of position coordinates,
discretized data of z (s) z (i) = x [i] + j · y [0] (i = 0, 1, 2..., N−1),
Discrete Fourier transform of z (s) (= G descriptor): Cg [k] = (1 / N) Σz [i] EXP (−j2πki / N).

（Ｐ形記述子）
また、図２４に示すように、折れ線近似した偏角θ［ｉ］の指数関数ｗ［ｉ］を求め、ｗ［ｉ］をフーリエ変換した、Ｐ（Phase）形記述子を求め、輪郭形状の識別に利用してもよい。
ｗ［ｉ］＝exp（ｊθ［ｉ］）＝cosθ［ｉ］＋sinθ［ｉ］
＝（ｚ［ｉ＋１］−ｚ［ｉ］）／δ、
ただし、線分δ＝｜ｚ［ｉ＋１］−ｚ［ｉ］｜
ｗ［ｉ］の離散フーリエ変換（＝ｐ形記述子）：Ｃｐ［ｋ］＝（１／Ｎ）Σｗ［ｉ］exp（−ｊ２πｋｉ／Ｎ）。 (P type descriptor)
Also, as shown in FIG. 24, an exponential function w [i] of the deflection angle θ [i] approximated by a polygonal line is obtained, a P (Phase) shape descriptor obtained by Fourier transforming w [i] is obtained, and the contour shape is obtained. It may be used for identification.
w [i] = exp (jθ [i]) = cosθ [i] + sinθ [i]
= (Z [i + 1] -z [i]) / δ,
However, the line segment δ = | z [i + 1] −z [i] |
Discrete Fourier transform of w [i] (= p-type descriptor): Cp [k] = (1 / N) Σw [i] exp (−j2πki / N).

（特徴量の比較、類似図形の検索）
また、記憶されたテンプレート画像と被写体画像の類似度の判別や検索には、テンプレートマッチングなどのパターンマッチング法や、動きベクトル検出におけるブロックマッチング法などが利用できる。テンプレートマッチングにより、特徴抽出領域の入力画像ｆ［ｉ，ｊ］の特徴データの中から、例えば、（ｍ×ｎ）の記録された参照画像（または特徴データ）ｔ［ｋ，ｌ］に一致する画像の位置を検出する。参照画像の中心（または端点）が入力画像のある点（ｉ，ｊ）に重なるように置いて、点（ｉ，ｊ）を順に縦横にラスター走査しながら、重なる部分の画像データの類似度を順次計算して、類似度が最も高い位置点（ｉ，ｊ）を、類似する被写体がある位置として求めることができる。 (Comparison of features, search for similar figures)
In addition, a pattern matching method such as template matching or a block matching method in motion vector detection can be used to determine and search the similarity between the stored template image and the subject image. By template matching, for example, (m × n) recorded reference images (or feature data) t [k, l] are matched from the feature data of the input image f [i, j] in the feature extraction region. Detect the position of the image. Place the center (or end point) of the reference image so that it overlaps a certain point (i, j) of the input image, and perform raster scanning of the point (i, j) in order vertically and horizontally, and the similarity of the image data of the overlapping part By calculating sequentially, the position point (i, j) having the highest degree of similarity can be obtained as a position where there is a similar subject.

（相関度（相関係数））
図２５に示すように、入力画像ｆ［ｋ，ｌ］とテンプレート画像（参照画像）ｔ［ｋ，ｌ］との相関度は、次式のピアソンの相関係数（積率相関係数）Ｒなどで算出でき、最も相関係数Ｒが大きくなる位置が、検索する類似被写体がある位置として求められる。
Ｒ＝（画像ｆと画像ｔの共分散）／ｆ（画像ｆの標準偏差）・（画像ｔの標準偏差）
＝［Σ_Ｌ＝０ ^ｎ−１Σ_Ｋ＝０ ^ｍ−１｛ｆ［ｋ，ｌ］−ｆ_ＡＶ｝｛ｔ［ｋ，ｌ］−ｔ_ＡＶ｝］
／√［Σ_Ｌ＝０ ^ｎ−１Σ_Ｋ＝０ ^ｍ−１｛ｆ［ｋ，ｌ］−ｆ_ＡＶ｝^２］・√［Σ_Ｌ＝０ ^ｎ−１Σ_Ｋ＝０ ^ｍ−１｛ｔ［ｋ，ｌ］−ｔ_ＡＶ｝^２］
ただし、ｆ_ＡＶ：参照画像ｆ［ｋ，ｌ］の画像データ（輝度値、色差値、特徴量など）の平均値、
ｔ_ＡＶ：参照画像ｔ［ｋ，ｌ］の画像データ（輝度値、色差値、特徴量など）の平均値。 (Correlation degree (correlation coefficient))
As shown in FIG. 25, the correlation between the input image f [k, l] and the template image (reference image) t [k, l] is Pearson's correlation coefficient (product moment correlation coefficient) R The position where the correlation coefficient R is largest can be obtained as the position where there is a similar subject to be searched.
R = (covariance between image f and image t) / f (standard deviation of image f) · (standard deviation of image t)
= [ΣL _{= 0} ⁿ⁻¹ ΣK _{= 0} ^m−1 {f [k, l] −f _AV } {t [k, l] −t _AV }]
/ √ [Σ _{L = 0} ⁿ⁻¹ Σ _{K = 0} ^m−1 {f [k, l] −f _AV } ² ] · √ [Σ _{L = 0} ⁿ⁻¹ Σ _{K = 0} ^m−1 {t [ k, l] −t _AV } ² ]
However, f _AV : the average value of the image data (luminance value, color difference value, feature amount, etc.) of the reference image f [k, l],
t _AV : average value of image data (luminance value, color difference value, feature amount, etc.) of the reference image t [k, l].

（テンプレートマッチングの類似度）
テンプレートマッチングなど、画像ｆ［ｉ，ｊ］の中から、画像サイズ（ｍ×ｎ）の画像ｔ［ｋ，ｌ］を走査して検索する場合、類似度は、次式で計算でき、類似度ｒ（ｉ，ｊ）が最も大きくなる走査位置の点（ｉ，ｊ）が類似する被写体の位置として求まる。
Ｒ（ｉ，ｊ）＝Σ_Ｌ＝０ ^ｎ−１Σ_Ｋ＝０ ^ｍ−１ｆ［ｉ−（ｍ／２）＋ｋ，ｊ−（ｎ／２）＋１］・ｔ［ｋ，ｌ］
ただし、（ｉ，ｊ）：点の位置座標、ｆ［ｉ，ｊ］：入力画像データ、ｔ［ｋ，ｌ］：テンプレート画像のデータ、（ｍ×ｎ）：テンプレート画像のサイズ。
これを、前述の相関係数Ｒと同様に、平均値を差し引くなど正規化してもちいてもよい。 (Similarity of template matching)
When searching by scanning an image t [k, l] having an image size (m × n) from the image f [i, j], such as template matching, the similarity can be calculated by the following equation. A point (i, j) at the scanning position where r (i, j) is the largest is obtained as a similar subject position.
R (i, j) = Σ _{L = 0} ⁿ⁻¹ ΣK _{= 0} ^m−1 f [i− (m / 2) + k, j− (n / 2) +1] · t [k, l]
However, (i, j): position coordinates of points, f [i, j]: input image data, t [k, l]: template image data, (m × n): size of template image.
Similar to the correlation coefficient R described above, this may be normalized by subtracting the average value.

（絶対値差分和、距離）
前記の類似度、Ｒ（ｉ，ｊ）では乗算のための計算が増えるため、２値化画像など、平均値を差し引いたり、正規化を省略したりできる場合には、類似度の代わりに、次式のような画像間の差分和により、相違の程度（「距離」）を表すＤ（ｉ，ｊ）を求め、これを評価関数として利用できる。この場合には、加減算だけで計算できるので演算を高速化できる。この場合は、距離Ｄ（ｉ，ｊ）が最も小さい点（ｉ，ｊ）がマッチング位置を表す。
Ｄ（ｉ，ｊ）＝Σ_Ｌ＝０ ^ｎ−１Σ_Ｋ＝０ ^ｍ−１｜ｆ［ｉ−（ｍ／２）＋ｋ，ｊ−（ｎ／２）＋１］−ｔ［ｋ，ｌ］｜
ただし、（ｉ，ｊ）：点の位置座標、ｆ［ｉ，ｊ］：入力画像データ、ｔ［ｋ，ｌ］：テンプレート画像のデータ、（ｍ×ｎ）：テンプレート画像のサイズ。 (Absolute difference sum, distance)
Since the calculation for multiplication increases with the similarity, R (i, j), when the average value can be subtracted or normalization can be omitted, such as a binary image, instead of the similarity, D (i, j) representing the degree of difference (“distance”) is obtained from the difference sum between images as in the following equation, and this can be used as an evaluation function. In this case, the calculation can be speeded up because it can be calculated only by addition and subtraction. In this case, the point (i, j) having the smallest distance D (i, j) represents the matching position.
D (i, j) = Σ _{L = 0} ⁿ⁻¹ Σ _{K = 0} ^m−1 | f [i− (m / 2) + k, j− (n / 2) +1] −t [k, l] |
However, (i, j): position coordinates of points, f [i, j]: input image data, t [k, l]: template image data, (m × n): size of template image.

（特定被写体の認識処理、顔の認識処理）
前記の認識処理において、特定の被写体やシーン別撮影モードなどで注目する被写体別に専用の識別データや認識処理を必要とする場合がある。例えば「人物」の撮影シーンや「人物と風景」の撮影シーンなどにおいて、人間の顔を認識する例を説明する。 (Specific subject recognition processing, face recognition processing)
In the above recognition processing, there are cases where dedicated identification data or recognition processing is required for each subject to be noticed in a specific subject or scene-specific shooting mode. For example, an example of recognizing a human face in a “person” shooting scene, a “person and landscape” shooting scene, or the like will be described.

図２６（ａ）は、顔の眼の領域を抽出するためのマスクパターンの例で、これを参照パターンとして、テンプレートマッチング等を用いて検索して、入力画像から眼や顔のある画像領域を検索できる。（ｂ）は、眼を認識するデータの例で、例えば、眼の細長さ＝ｂ／ａとして、α_１≦ｂ／ａ≦α_２の条件に合致する、または、（眼の面積）Ｓ_１≒π×ａ×ｂ、（黒眼（瞳）の面積）Ｓ_２＝π×ｒ^２、黒眼（瞳）の比率Ｓ_２／Ｓ_１＝ｒ^２／ａｂとして、β_１≦ｒ^２／ａｂ≦β_２などの条件に合致する被写体画像の領域を「眼の領域」と識別することができる。また、（ｃ）は、人間の顔と認識するための条件データの設定で、例えば、（眉下〜鼻下までの長さ）ｈ１≒（鼻下〜あごまでの長さ）ｈ２、または、（右眼の幅）Ｗ_１≒（両眼の間）Ｗ_２≒（左眼の幅）Ｗ_３、などの条件を満たす被写体画像の領域を「顔の領域」であると識別できる。 FIG. 26A shows an example of a mask pattern for extracting an eye area of a face. Using this as a reference pattern, search is performed using template matching or the like, and an image area having an eye or a face is searched from an input image. Searchable. (B) is an example of data for recognizing the eye. For example, when the eye length = b / a, the condition of α ₁ ≦ b / a ≦ α ₂ is satisfied, or (eye area) S ₁ ≈π × a × b, (area of black eye (pupil)) S ₂ = π × r ² , and ratio of black eyes (pupil) S ₂ / S ₁ = r ² / ab, β ₁ ≦ r ² / ab ≦ beta ₂ the condition region of a subject image matching the like can be identified as a "region of the eye." (C) is a setting of condition data for recognizing a human face. For example, (length from below the eyebrows to below the nose) h1≈ (length from below the nose to the chin) h2, or An area of the subject image that satisfies the conditions such as (right eye width) W ₁ ≈ (between eyes) W ₂ ≈ (left eye width) W ₃ can be identified as a “face area”.

図２７は、人間の認識処理１（中距離、頭の認識）の処理手順を示すフローチャートである。先ず、画像を取り込み（ステップＳ３０１）、輝度もしくは色差データに基づいて輪郭を抽出する（ステップＳ３０２）。次に、前記画像をこの抽出した輪郭を境界とする領域に分割し（ステップＳ３０３）、分割領域の中からいずれかの対象領域を選択する（ステップＳ３０４）。そして、輪郭線の周囲長Ｌ、面積Ｓを算出し、ｅ＝４πＳ／Ｌ^２により、円形度（円らしさ）を算出する（ステップＳ３０５）。この算出したｅが、「ｅ１≦ｅ≦ｅ２」であるか否かを判断する（ステップＳ３０６）。この判断がＮＯであれば人間の頭の領域でないと認識し（ステップＳ３０７）、後述するステップＳ３１２に進む。 FIG. 27 is a flowchart illustrating a processing procedure of human recognition processing 1 (medium distance, head recognition). First, an image is captured (step S301), and a contour is extracted based on luminance or color difference data (step S302). Next, the image is divided into regions having the extracted contour as a boundary (step S303), and any target region is selected from the divided regions (step S304). Then, the peripheral length L and area S of the contour line are calculated, and the circularity (circularness) is calculated by e = 4πS / L ² (step S305). It is determined whether the calculated e is “e1 ≦ e ≦ e2” (step S306). If this determination is NO, it is recognized that the region is not a human head region (step S307), and the process proceeds to step S312 described later.

また、前記判断がＹＥＳであれば、比較するテンプレートとして顔のテンプレートを設定し、この設定した顔のテンプレートと選択画像（対象領域）との類似度を算出する（ステップＳ３０８）。次に、この算出した類似度が所定値以上であるか否かを判断し（ステップＳ３０９）、所定値未満である場合には、前述と同様に、人間の頭の領域でないと認識する（ステップＳ３０７）。また、所定値以上である場合には、人間の顔の領域として認識し（ステップＳ３１０）、この頭と認識された位置座標を記憶する（ステップＳ３１１）。次に、今回選択した領域が、当該画像において最後の領域であるか否かを判断し（ステップＳ３１２）、最後の領域でない場合には、ステップＳ３０４に戻って次の領域を選択し、前述した処理を繰り返す。そして、最後の領域まで以上の処理が実行されると、ステップＳ３１２の判断がＹＥＳとなり、記憶した位置座標を出力する（ステップＳ３１３）。 If the determination is YES, a face template is set as a template to be compared, and the similarity between the set face template and the selected image (target region) is calculated (step S308). Next, it is determined whether or not the calculated similarity is greater than or equal to a predetermined value (step S309). If it is less than the predetermined value, it is recognized that it is not a human head region (step S309). S307). If it is equal to or greater than the predetermined value, it is recognized as a human face region (step S310), and the position coordinates recognized as the head are stored (step S311). Next, it is determined whether or not the region selected this time is the last region in the image (step S312). If it is not the last region, the process returns to step S304 to select the next region and Repeat the process. When the above processing is executed up to the last area, the determination in step S312 is YES, and the stored position coordinates are output (step S313).

図２８は、人間の識別処理２（近距離、顔の認識）の処理手順を示すフローチャートである。先ず、画像を取り込み（ステップＳ４０１）、輝度もしくは色差データに基づいて輪郭を抽出する（ステップＳ４０２）。次に、前記画像をこの抽出した輪郭を境界とする領域に分割し（ステップＳ４０３）、分割領域の中からいずれかの対象領域を選択する（ステップＳ４０４）。そして、ＲＧＢまたは色差データに基づいて、選択した領域の平均ＲＧＢまたは平均色差データを算出し（ステップＳ４０５）、この算出したＲＧＢまたは色差値をＨＳＶに変換する（ステップＳ４０６）。 FIG. 28 is a flowchart showing a processing procedure of human identification processing 2 (short distance, face recognition). First, an image is captured (step S401), and a contour is extracted based on luminance or color difference data (step S402). Next, the image is divided into regions having the extracted contour as a boundary (step S403), and any target region is selected from the divided regions (step S404). Then, based on the RGB or color difference data, the average RGB or average color difference data of the selected region is calculated (step S405), and the calculated RGB or color difference value is converted into HSV (step S406).

引き続き、この変換したＨＳＶが肌色の領域は否か、つまり、色相（Ｈｕｅ）が６〜３８°か否かを判断し（ステップＳ４０７）、この判断がＮＯである場合には、顔の領域でないと判断する（ステップＳ４１５）。ステップＳ４０７での判断がＹＥＳである場合には、顔のマスクパターン（図２６参照）を設定し、前記対象領域において眼と瞳の領域を検索する（ステップＳ４０８）。次に、眼の領域は検出できたか否かを判断し（ステップＳ４０９）、検出できない場合には、顔の領域でないと判断する（ステップＳ４１５）。検出できた場合には、眼の縦横比（ｂ／ａ）、眼と瞳（黒眼）の面積比（ｒ^２／ａｂ）を算出し（ステップＳ４１０）、眼と瞳の比率は所定範囲内か否か、すなわち前記α_１≦ｂ／ａ≦α_２の条件に合致するか否かを判断する（ステップＳ４１１）。この判断がＮＯである場合には、顔の領域でないと判断する（ステップＳ４１５）。 Subsequently, it is determined whether or not the converted HSV is a skin color region, that is, whether or not the hue (Hue) is 6 to 38 ° (step S407). If this determination is NO, the region is not a face region. Is determined (step S415). If the determination in step S407 is yes, a face mask pattern (see FIG. 26) is set, and eye and pupil regions are searched in the target region (step S408). Next, it is determined whether or not the eye region has been detected (step S409). If it cannot be detected, it is determined that the eye region is not a face region (step S415). If detected, the aspect ratio (b / a) of the eye and the area ratio (r ² / ab) of the eyes and pupils (black eyes) are calculated (step S410), and the ratio of the eyes and pupils is within a predetermined range. Whether or not the condition of α ₁ ≦ b / a ≦ α ₂ is satisfied (step S411). If this determination is NO, it is determined that the region is not a face region (step S415).

ステップＳ４１１の判断がＹＥＳである場合には、眼と瞳の比率は所定範囲内であるか否か、すなわち前記β_１≦ｒ^２／ａｂ≦β_２の条件に合致するか否かを判断する（ステップＳ４１２）。この判断がＮＯである場合には、顔の領域でないと判断する（ステップＳ４１５）。ステップＳ４１２の判断がＹＥＳである場合には、右眼の幅Ｗ_１、右眼と左眼の間隔Ｗ_２、左眼の幅Ｗ_３を算出し（ステップＳ４１３）、Ｗ_１とＷ_２、Ｗ_３は等しいか否か、すなわちＷ_１−δ≦Ｗ_２≦Ｗ_１＋δ、Ｗ_１−δ≦Ｗ_３≦Ｗ_１＋δであるか否かを判断する（ステップＳ４１４）。この判断がＮＯである場合には、顔の領域でないと判断する（ステップＳ４１５）。そして、この判断がＹＥＳである場合、つまり、ステップＳ４０７、Ｓ４０９、Ｓ４１１、Ｓ４１２、Ｓ４１４の判断が全てＹＥＳである場合には、前記ステップＳ４０４で選択した領域を、顔の領域として認識する（ステップＳ４１６）。さらに、この認識された顔の領域の位置座標を記憶する（ステップＳ４１７）。次に、今回選択した領域が、当該画像において最後の領域であるか否かを判断し（ステップＳ４１８）、最後の領域でない場合には、ステップＳ４０４に戻って次の領域を選択し、前述した処理を繰り返す。そして、最後の領域まで以上の処理が実行されると、ステップＳ４１８の判断がＹＥＳとなり、認識結果を出力する（ステップＳ４１９）。 If the determination in step S411 is YES, it is determined whether or not the ratio between the eyes and the pupil is within a predetermined range, that is, whether or not the condition of β ₁ ≦ r ² / ab ≦ β ₂ is satisfied. (Step S412). If this determination is NO, it is determined that the region is not a face region (step S415). If the determination in step S412 is YES, the right eye width W ₁ , the right eye-left eye interval W ₂ , and the left eye width W ₃ are calculated (step S413), and W ₁ and W ₂ , W _It is determined whether ₃ is equal, that is, whether W ₁ −δ ≦ W ₂ ≦ W ₁ + δ and W ₁ −δ ≦ W ₃ ≦ W ₁ + δ (step S414). If this determination is NO, it is determined that the region is not a face region (step S415). If this determination is YES, that is, if all the determinations in steps S407, S409, S411, S412, and S414 are YES, the region selected in step S404 is recognized as a face region (step S416). Further, the position coordinates of the recognized face area are stored (step S417). Next, it is determined whether or not the region selected this time is the last region in the image (step S418). If it is not the last region, the process returns to step S404 to select the next region and Repeat the process. When the above processing is executed up to the last area, the determination in step S418 is YES, and the recognition result is output (step S419).

（第２の実施の形態）
図２９、３０は、本発明の第２の実施の形態を示すものである。本実施の形態は、撮影シーン別に対応して、特定の被写体の参照画像データや特徴量データを設定しておき、これらシーン別の参照画像と特徴データとを順にテンプレートメモリに呼び出して設定し、フォーカスされた被写体像のスルー画像を、設定された特定画像の画像や特徴データと順次比較して、合致する被写体があると認識されれば、当該シーンの被写体、及びその認識された被写体の数に応じて、該当するシーン別撮影プログラムを自動的に選択するようにしたものである。 (Second Embodiment)
29 and 30 show a second embodiment of the present invention. In the present embodiment, reference image data and feature amount data of a specific subject are set corresponding to each shooting scene, and these scene-specific reference images and feature data are sequentially set in the template memory. The through image of the focused subject image is sequentially compared with the image and feature data of the set specific image, and if it is recognized that there is a matching subject, the subject of the scene and the number of recognized subjects The corresponding scene-specific shooting program is automatically selected according to the above.

すなわち、この実施の形態において前記プログラムメモリ３２には、図２９に示すように、人物を写す場合（１）、複数の人物を風景を写す場合（２）、花を写す場合（３）・・・等の撮影シーン別に、当該シーンを撮影する場合に好適なシーン別撮影制御プログラムが記憶されているとともに、撮影シーンに対応してそのサンプル画像３００と、「色強調が肌色に設定されます。」等の当該シーン別撮影制御プログラムに関する説明文３０１等が記憶されている。さらに、各撮影シーンに対応して参照画像（サンプル画像３００とは異なる比較用の画像）またはこの参照画像の特徴量データが記憶されている。 That is, in this embodiment, as shown in FIG. 29, in the program memory 32, when a person is photographed (1), when a plurality of persons are photographed as a landscape (2), when a flower is photographed (3)・ For each shooting scene, etc., a scene-specific shooting control program suitable for shooting the scene is stored, and the sample image 300 corresponding to the shooting scene and “color enhancement is set to skin color. ”And the like related to the scene-specific shooting control program. Further, a reference image (an image for comparison different from the sample image 300) or feature data of the reference image is stored corresponding to each shooting scene.

すなわち、例えば、撮影シーン（２）の複数の人物を写す撮影シーンに対応して、１つの人物の顔の画像が参照画像として記憶され、またはこの参照画像の特徴量データが記憶されている。また、撮影シーン（３）の複数の花びらからなる花を写す撮影シーンに対応して、１枚の花びらの画像が参照画像として記憶され、またはこの参照画像の特徴量データが記憶されている。なお、各特徴量データには、「Ｎｏ．」が付されている。 That is, for example, an image of one person's face is stored as a reference image or feature amount data of the reference image is stored in correspondence with a shooting scene in which a plurality of persons in the shooting scene (2) are captured. Corresponding to a shooting scene in which a flower composed of a plurality of petals in the shooting scene (3) is captured, an image of one petal is stored as a reference image, or feature amount data of this reference image is stored. In addition, “No.” is attached to each feature amount data.

図３０は、本実施の形態の処理手順を示す一連のフローチャートであり、制御部２５はプログラムメモリ３２に格納されているプログラムに基づき、同図に示すフローチャートに従って処理を実行する。先ず、ユーザーによる操作入力部３５での操作によって、撮影シーンの自動選択機能がＯＮとなっているか否かを判断し（ステップＳ５０１）、ＯＮとなっていない場合には、その他の撮影モード処理に移行する（ステップＳ５０２）。またＯＮとなっている場合には、フォーカス枠を選択する（ステップＳ５０３）。つまり、この実施の形態においては、表示部１４に被写体スルー画像とともに複数のフォーカス枠を表示し、ユーザによる操作部２３での操作により、この複数のフォーカス枠のいずれかを選択する。したがって、この実施の形態においては、フォーカス枠の選択により、スルー画像において識別したい領域が指定されることとなる。 FIG. 30 is a series of flowcharts showing the processing procedure of the present embodiment, and the control unit 25 executes processing according to the flowchart shown in FIG. First, it is determined whether or not the automatic selection function of the shooting scene is turned on by the operation of the operation input unit 35 by the user (Step S501). The process proceeds (step S502). If it is ON, a focus frame is selected (step S503). That is, in this embodiment, a plurality of focus frames are displayed on the display unit 14 together with the subject through image, and one of the plurality of focus frames is selected by an operation on the operation unit 23 by the user. Therefore, in this embodiment, the region to be identified in the through image is designated by selecting the focus frame.

次に、表示部１４に表示されている被写体画像（被写体スルー画像）を読み込み（ステップＳ５０４）、フォーカス枠周辺の被写体に対して、ＡＦ処理を実行して合焦させ（ステップＳ５０５）。さらに、この合焦させたフォーカス枠周辺領域内の被写体像を輝度もしくは色差データに基づいて輪郭抽出し（ステップＳ５０６）、この抽出した輪郭を境界とする複数の領域に分割する（ステップＳ５０７）。 Next, the subject image (subject through image) displayed on the display unit 14 is read (step S504), and AF processing is executed on the subject around the focus frame to focus (step S505). Further, contour extraction is performed on the subject image in the focused frame peripheral region based on the luminance or color difference data (step S506), and the image is divided into a plurality of regions with the extracted contour as a boundary (step S507).

また、第１の撮影シーン（撮影シーン（１））に対応して記憶された前記参照画像もしくは特徴量データを読み出し、テンプレートメモリ（データメモリ３８）に設定する（ステップＳ５０８）。引き続き、フォーカス領域内（フォーカス枠内）から、最も大きい、もしくは、中央の輪郭領域を選択し（ステップＳ５０９）、この選択された領域のスルー画像に対して前述した特徴抽出処理を実行する（ステップＳ５１０）。また、このステップＳ５１０で抽出した特徴量をテンプレートの特徴量（テンプレートメモリに記憶された参照画像もしくは特徴量データ）と比較し類似度を算出する（ステップＳ５１１）。そして、この算出した類似度が所定値以上であるか否かを判断し（ステップＳ５１２）、所定値以上でない場合には、ステップＳ５１３及びＳ５１４の処理を行うことなく、ステップＳ５１５に進む。しかし、類似度が所定値以上であった場合には、テンプレートに設定された参照特徴量に該当する被写体と認識し、特徴量Ｎｏ．と被写体の位置座標を記憶するとともに（ステップＳ５１３）、被写体の認識数をカウントする（ステップＳ５１４）。 Further, the reference image or feature data stored corresponding to the first shooting scene (shooting scene (1)) is read and set in the template memory (data memory 38) (step S508). Subsequently, the largest or central contour region is selected from within the focus region (within the focus frame) (step S509), and the above-described feature extraction processing is executed on the through image of the selected region (step S509). S510). Further, the feature amount extracted in step S510 is compared with the feature amount of the template (reference image or feature amount data stored in the template memory) to calculate the similarity (step S511). Then, it is determined whether or not the calculated similarity is greater than or equal to a predetermined value (step S512). If not, the process proceeds to step S515 without performing steps S513 and S514. However, if the similarity is equal to or greater than a predetermined value, the subject is recognized as a subject corresponding to the reference feature amount set in the template, and the feature amount No. And the position coordinates of the subject are stored (step S513), and the number of recognized subjects is counted (step S514).

また、前記ステップＳ５０７で分割した領域おける次の分割領域を選択して（ステップＳ５１５）、全ての領域と対比済となったか否かを判断し（ステップＳ５１６）、全ての領域と対比済となるまでステップＳ５１０からの処理を繰り返す。全ての領域と対比済となったならば、認識された被写体、あるいは領域があるか否か、または前記ステップＳ５１４でカウントした認識数のカウント値が１以上であるか否かを判断する（ステップＳ５１７）。この判断の結果、認識された被写体、あるいは領域がなく、またはカウントした認識数のカウント値が０であった場合には、次の撮影シーンに対応して記憶された前記参照画像もしくは特徴量データを読み出し、テンプレートメモリ（データメモリ３８）に設定する（ステップＳ５１８）。そして、全ての撮影シーンと比較済みか否かを判断し（ステップＳ５１９）、比較済みでない場合には前述したステップＳ５０９からの処理を繰り返す。また、全ての撮影シーンと比較済みとなった場合には、撮影シーンの自動選択に失敗した旨を表示部１４に表示し、または、通常の自動撮影モードを選択して（ステップＳ５２０）、後述するステップＳ５２３に進む。 Further, the next divided area in the area divided in step S507 is selected (step S515), and it is determined whether or not all areas have been compared (step S516), and all areas have been compared. Until the process from step S510 is repeated. If all the areas have been compared, it is determined whether there is a recognized object or area, or whether the count value of the number of recognitions counted in step S514 is 1 or more (step S514). S517). If the result of this determination is that there is no recognized subject or area, or the count value of the recognized number of counts is 0, the reference image or feature quantity data stored corresponding to the next shooting scene Is set in the template memory (data memory 38) (step S518). Then, it is determined whether or not all the shooting scenes have been compared (step S519), and if not compared, the processing from step S509 described above is repeated. If all the shooting scenes have been compared, the fact that the automatic selection of the shooting scene has failed is displayed on the display unit 14 or the normal automatic shooting mode is selected (step S520), which will be described later. The process proceeds to step S523.

他方、ステップＳ５１７での判断の結果、認識された被写体、あるいは領域があり、またはカウントした認識数のカウント値が１以上であった場合には、合致すると認識されたテンプレート及び認識された被写体、あるいは領域の数に応じて撮影シーンを自動的に選択する（ステップＳ５２１）。 On the other hand, as a result of the determination in step S517, if there is a recognized subject or area, or the count value of the counted number of recognition is 1 or more, the template recognized as matching and the recognized subject, Alternatively, a shooting scene is automatically selected according to the number of areas (step S521).

すなわち、例えば、前述のように、撮影シーン（２）の複数の人物を写す撮影シーンに対応して、１つの人物の顔の画像が参照画像の特徴量データが記憶されており、ステップＳ５０８でこの特徴量データがテンプレートメモリに設定されたとする。この状態でユーザが人物３人を被写体として、フォーカス枠内に位置させてカメラを構えると、Ｓ５１０〜Ｓ５１６の処理が繰り返されることにより、ステップＳ５１４で認識数「３」がカウントされることとなる。また、人物が「３」に対応する撮影シーンは、撮影シーン（２）であることから、複数の人物を写す場合（２）を自動的に選択する。そして、この選択された撮影シーンを表示部１４の一部に表示するとともに、選択された撮影シーンに応じて撮影条件等を自動的に設定する。つまり、前述のように、プログラムメモリ３２には、複数の人物を写す場合（２）に好適な撮影シーン別撮影制御プログラムが記憶されていることから、当該撮影制御プログラムを起動させることにより、撮影条件等を自動的に設定する。これにより、認識した被写体数等に応じて、適切な撮影条件で撮影を行うことが可能となる。 That is, for example, as described above, the feature amount data of the reference image is stored as the face image of one person corresponding to the shooting scene in which a plurality of persons in the shooting scene (2) are shot, and in step S508, It is assumed that this feature amount data is set in the template memory. In this state, when the user holds the camera with three persons as subjects and is positioned within the focus frame, the number of recognitions “3” is counted in step S514 by repeating the processing of S510 to S516. . Also, since the photographic scene corresponding to the person “3” is the photographic scene (2), the case (2) where a plurality of persons are photographed is automatically selected. Then, the selected shooting scene is displayed on a part of the display unit 14, and shooting conditions and the like are automatically set according to the selected shooting scene. That is, as described above, the program memory 32 stores a shooting control program for each shooting scene that is suitable for shooting a plurality of persons (2). Set conditions automatically. Thereby, it is possible to perform shooting under appropriate shooting conditions according to the number of recognized subjects and the like.

（第３の実施の形態）
図３１、３２は、本発明の第３の実施の形態を示すものであり、人物と認識された被写体の数に応じて撮影条件を自動設定するようにしたものである。図３１は、本実施の形態の処理手順を示す一連のフローチャートであり、制御部２５はプログラムメモリ３２に格納されているプログラムに基づき、同図に示すフローチャートに従って処理を実行する。先ず、ユーザによる操作部２３の操作によって人物撮影モードが設定されているか否かを判断し（ステップＳ６０１）、設定されていない場合にはその他の撮影モード処理に移行する（ステップＳ６０２）。また、人物撮影モードが設定されている場合には、この時点で表示部１４にスルー画像として表示されている被写体画像を取り込む（ステップＳ６０３）。そして、前述した図４のステップＳ１０９と同様に、取り込んだ被写体像から対角の２点で識別したい領域を選択する（ステップＳ６０４）。 (Third embodiment)
FIGS. 31 and 32 show a third embodiment of the present invention, in which shooting conditions are automatically set according to the number of subjects recognized as a person. FIG. 31 is a series of flowcharts showing the processing procedure of the present embodiment, and the control unit 25 executes processing according to the flowchart shown in FIG. 31 based on the program stored in the program memory 32. First, it is determined whether or not the person photographing mode is set by the operation of the operation unit 23 by the user (step S601). If not set, the process proceeds to other photographing mode processing (step S602). If the person shooting mode is set, the subject image displayed as a through image on the display unit 14 at this time is captured (step S603). Then, similarly to step S109 of FIG. 4 described above, a region to be identified by two diagonal points is selected from the captured subject image (step S604).

次に、前記取り込んだ被写体画像に対し輪郭抽出処理を行って、複数領域に分割する（ステップＳ６０５）。つまり前述したように、被写体画像の画像データの輝度信号及び色差信号から、近い輝度または色差信号別に、例えば同系色の色相別等に領域を分割し、さらに、領域の境界線となる輪郭線を抽出し、この輪郭線で囲まれた部分を一つの輪郭領域（抽出領域）とすることにより、被写体画像を複数領域に分割する。引き続き、プログラムメモリ３２に予め記憶されている人物に対応する参照画像、特徴量データを読み出してテンプレートメモリ（データメモリ３３）に設定する（ステップＳ６０６）。 Next, the extracted subject image is subjected to contour extraction processing and divided into a plurality of regions (step S605). That is, as described above, the region is divided into luminance or color difference signals from the luminance signal and color difference signal of the subject image according to similar luminance or color difference signals, for example, by hues of similar colors, and the contour line that becomes the boundary line of the region is further divided. The subject image is divided into a plurality of regions by extracting and defining a portion surrounded by the contour line as one contour region (extraction region). Subsequently, the reference image and feature data corresponding to the person stored in advance in the program memory 32 are read out and set in the template memory (data memory 33) (step S606).

また、前記ステップＳ６０３で分割した複数の輪郭領域のうち、最も大きい領域、最も被写体までの距離が近い領域、中央に近い領域のいずれか先ず選択する（ステップＳ６０７）。さらに、この選択領域の被写体を測距するとともに、ＡＦ処理を行って当該被写体に合焦させて（ステップＳ６０８）、この選択領域の被写体に対して前述した特徴抽出処理を実行する（ステップＳ６０９）。このステップＳ６０９で抽出した特徴量をテンプレートの特徴量（テンプレートメモリに記憶された参照画像もしくは特徴量データ）と比較し類似度を算出する（ステップＳ６１０）。そして、この算出した類似度が所定値以上であるか否かを判断し（ステップＳ６１１）、所定値以上でない場合には、ステップＳ６１２及びＳ６１３の処理を行うことなく、ステップＳ６１４に進む。類似度が所定値以上であった場合には、人物と認識し、当該被写体のスルー画像における位置座標、及び前記ステップＳ６０８で測距した当該被写体の距離情報を記憶する（ステップＳ６１２）。さらに、人数をカウントしているカウンタの値をカウントアップさせるともに、前記ステップＳ６０５で分割した輪郭領域おける次の領域を選択する（ステップＳ６１３）。そして、全ての輪郭領域と対比したか否かを判断し（ステップＳ６１４）、全ての領域と対比済となるまでステップＳ６０８からの処理を繰り返す。 Of the plurality of contour regions divided in step S603, the largest region, the region closest to the subject, or the region near the center is first selected (step S607). Further, the subject in the selected area is measured, and AF processing is performed to focus on the subject (step S608), and the above-described feature extraction process is executed on the subject in the selected area (step S609). . The feature amount extracted in step S609 is compared with the feature amount of the template (reference image or feature amount data stored in the template memory) to calculate the similarity (step S610). Then, it is determined whether or not the calculated similarity is greater than or equal to a predetermined value (step S611). If not, the process proceeds to step S614 without performing the processes of steps S612 and S613. If the similarity is equal to or greater than a predetermined value, the person is recognized as a person, and the position coordinates of the subject in the through image and the distance information of the subject measured in step S608 are stored (step S612). Further, the value of the counter that counts the number of people is counted up, and the next area in the contour area divided in step S605 is selected (step S613). Then, it is determined whether or not all contour regions have been compared (step S614), and the processing from step S608 is repeated until all the regions have been compared.

全ての領域と対比済となったならば、前記ステップＳ６１３でカウントした認識カウント数（人数）が１以上であるか否かを判断し（ステップＳ６１５）、認識カウント数が０であった場合には、人物が認識できない旨を表示部１４に表示し、または、その他の撮影モードを選択する（ステップＳ６１６）。 If all the areas have been compared, it is determined whether or not the recognition count (number of people) counted in step S613 is 1 or more (step S615), and if the recognition count is 0. Displays that the person cannot be recognized on the display unit 14 or selects another shooting mode (step S616).

他方、ステップＳ６１５での判断の結果、認識カウント数（人数）が１以上であった場合には、この人物と認識された被写体の数（人数）に応じて、撮影条件等を設定する（ステップＳ６１７）。具体的には、以下のステップＳ６１８〜Ｓ６２８に示す処理を実行する。先ず、認識カウント数が１であるか否かを判断し（ステップＳ６１８）、認識カウント数＝１である場合には、当該被写体の距離Ｌの前方ａ［ｍｍ］、後方ｂ［ｍｍ］の範囲に被写界深度を設定する（ステップＳ６１９）。例えば、過焦点距離（ＨＦＤ）を
ＨＦＤ＝（Ｌ^２／ａ）−Ｌ、または
ＨＦＤ＝（Ｌ^２／ｂ）＋Ｌ、に設定する。 On the other hand, if the result of determination in step S615 is that the recognition count number (number of people) is 1 or more, shooting conditions and the like are set according to the number of subjects recognized as this person (number of people) (step 615). S617). Specifically, the processes shown in the following steps S618 to S628 are executed. First, it is determined whether or not the recognition count number is 1 (step S618). If the recognition count number is 1, the range of the distance a to the front a [mm] and the rear b [mm] of the subject. The depth of field is set to (step S619). For example, the hyperfocal distance (HFD) is set to HFD = (L ² / a) −L or HFD = (L ² / b) + L.

次に、焦点距離（ｆ）と、前記設定過焦点距離（ＨＦＤ）に相当する下記絞り値Ｆに設定する（ステップＳ６２０）。
Ｆ＝ｆ^２／（δ・ＨＦＤ）
（δ：許容錯乱円径）
さらに、色強調を肌色に設定し、シャープネスをややソフトに設定する（ステップＳ６２１）。しかる後に、撮影処理を実行し、ユーザによるレリーズ釦３の操作に応答して、撮像素子２０から画像データを取り込み、圧縮符号化／伸長復号化部３０で圧縮符号化して、静止画／動画画像メモリ３１に記録する（ステップＳ６２９）。 Next, the following aperture value F corresponding to the focal length (f) and the set hyperfocal length (HFD) is set (step S620).
F = f ² / (δ · HFD)
(Δ: Allowable circle of confusion)
Further, the color enhancement is set to the skin color, and the sharpness is set to be slightly soft (step S621). Thereafter, shooting processing is executed, and in response to the user operating the release button 3, image data is captured from the image sensor 20, compressed and encoded by the compression encoding / decompression decoding unit 30, and a still image / moving image It records in the memory 31 (step S629).

したがって、認識カウント数＝１であった場合には、図２３（Ａ）に示すように、人物が１人であって、前記ステップＳ６１９〜Ｓ６２１で設定された撮影条件で撮影された静止画が静止画／動画画像メモリ３１に記録される。つまり、認識された被写体の数（人数）が１人であれば、当該人物のポートレート撮影と判断して、当該被写体にのみピントを合わせ、背景は少しボケるように、当該被写体の距離に応じて、人物の前後１０ｃｍ程度の範囲のみ被写体深度Ｚ（または、それに対応する過焦点距離ＨＦＤ）を設定し、過焦点距離ＨＦＤとレンズ焦点距離（ｆ）に応じて絞りの値（Ｆ値）を設定し、設定された絞り値のＡｐｅｘ値（Ａｖ値）に連動して、測光値に基づく適正露出値（Ｅｖ）となるＴｖ値（＝Ｅｖ−Ａｖ）に相当する露出時間（シャッター速度）を設定する。また、人物のポートレート撮影として、肌色を強調する色補正フィルタ処理を加え、輪郭強調処理フィルタによるシャープネスは、ややソフト（ソフトフォーカス気味）に設定して撮影する。 Therefore, when the recognition count number = 1, as shown in FIG. 23A, there is one person, and a still image shot under the shooting conditions set in steps S619 to S621 is taken. It is recorded in the still image / moving image memory 31. In other words, if the number (number of subjects) of the recognized subjects is one, it is determined that the person has taken a portrait, and only the subject is focused, and the background is slightly blurred so that the background is slightly blurred. Accordingly, the subject depth Z (or the hyperfocal distance HFD corresponding thereto) is set only in the range of about 10 cm before and after the person, and the aperture value (F value) is set according to the hyperfocal distance HFD and the lens focal length (f). And an exposure time (shutter speed) corresponding to a Tv value (= Ev−Av) that becomes an appropriate exposure value (Ev) based on the photometric value in conjunction with the Apex value (Av value) of the set aperture value. Set. For portrait photography of a person, color correction filter processing for enhancing skin color is added, and sharpness by the contour enhancement processing filter is set slightly soft (soft focus).

また、ステップＳ６１８での判断の結果、認識カウント数≠１であった場合には、認識カウント数が２〜５であるか否かを判断する（ステップＳ６２２）。認識カウント数＝２〜５である場合には、最も近い人物の距離Ｌ１から最も遠い人物の距離Ｌ２までピントが合うよう、被写界深度を設定する（ステップＳ６２３）。例えば、過焦点距離（ＨＦＤ）を
ＨＦＤ＝（Ｌ・Ｌ１）／（Ｌ−Ｌ１）、または
ＨＦＤ＝（Ｌ・Ｌ２）／（Ｌ−Ｌ２）、に設定する。 If the recognition count number ≠ 1 as a result of the determination in step S618, it is determined whether the recognition count number is 2 to 5 (step S622). When the recognition count number is 2 to 5, the depth of field is set so that the distance from the nearest person distance L1 to the farthest person distance L2 is in focus (step S623). For example, the hyperfocal distance (HFD) is set to HFD = (L·L1) / (L−L1) or HFD = (L·L2) / (L−L2).

次に、設定した過焦点距離（ＨＦＤ）に相当する下記絞り値Ｆに設定する（ステップＳ６２４）。
Ｆ＝ｆ^２／（δ・ＨＦＤ）
（δ：許容錯乱円径）
さらに、色強調を肌色に設定し、シャープネスをノーマルに設定する（ステップＳ６２５）。しかる後に、前記撮影処理を実行する（ステップＳ６２８）。したがって、認識カウント数＝３であった場合には図３２（Ｂ）に示すように、人物が３人であって、前記ステップＳ６２３〜Ｓ６２５で設定された撮影条件で撮影された静止画が静止画／動画画像メモリ３１に記録される。つまり、認識された被写体の数（人数）が２〜５人程度の場合には、複数人のスナップショットと判断して、最も近い人物の距離Ｌ１から最も遠い人物の距離Ｌ２までにピントが合うように、被写界深度Ｚ（または、それに対応する過焦点距離ＨＦＤ）と絞りの値（Ｆ値）に設定して、前述と同様に連動して適正露出となる露出時間を設定し、また、肌色を強調する色補正フィルタ処理を加え、輪郭強調処理フィルタによるシャープネスは、ノーマルに設定して撮影する。 Next, the following aperture value F corresponding to the set hyperfocal distance (HFD) is set (step S624).
F = f ² / (δ · HFD)
(Δ: Allowable circle of confusion)
Further, the color enhancement is set to skin color, and the sharpness is set to normal (step S625). Thereafter, the photographing process is executed (step S628). Therefore, when the recognition count number = 3, as shown in FIG. 32 (B), there are three persons and still images shot under the shooting conditions set in steps S623 to S625 are still images. Recorded in the image / moving image memory 31. In other words, when the number of recognized subjects (number of persons) is about 2 to 5, it is determined as a snapshot of a plurality of persons, and the focus is adjusted from the distance L1 of the nearest person to the distance L2 of the farthest person. As described above, the depth of field Z (or the corresponding hyperfocal distance HFD) and the aperture value (F value) are set, and the exposure time for proper exposure is set in conjunction with the above, and Then, color correction filter processing for emphasizing the skin color is added, and the sharpness by the contour enhancement processing filter is set to normal, and shooting is performed.

また、ステップＳ６２２での判断の結果、認識カウント数≠２〜５であった場合には、認識カウント数は６以上である。そして、認識カウント数は６以上である場合には、最も近い人物の距離Ｌ１から無限遠点までピントが合うよう、被写界深度を設定する（ステップＳ６２６）。例えば、過焦点距離（ＨＦＤ）を
ＨＦＤ＝Ｌ１に設定する。 If the recognition count number is not 2 to 5 as a result of the determination in step S622, the recognition count number is 6 or more. If the recognition count is 6 or more, the depth of field is set so that the closest person is focused from the distance L1 to the point at infinity (step S626). For example, the hyperfocal distance (HFD) is set to HFD = L1.

次に、設定した過焦点距離（ＨＦＤ）に相当する下記絞り値Ｆに設定する（ステップＳ６２７）。
Ｆ＝ｆ^２／（δ・ＨＦＤ）
（δ：許容錯乱円径）
さらに、色強調を肌色に設定し、シャープネス（輪郭強調）をややハードにに設定する（ステップＳ６２８）。しかる後に、前記撮影処理を実行する（ステップＳ６２９）。したがって、認識カウント数＝６以上であった場合には図３２（Ｃ）に示すように、人物が例えば１６人であって、前記ステップＳ６２６〜Ｓ６８で設定された撮影条件で撮影された静止画が静止画／動画画像メモリ３１に記録されることとなる。つまり、認識された被写体の数が６人以上である場合には、複数人の記念写真撮影と判断して、人物だけでなく、遠方の背景や景色にも焦点が合うように、最も近い人物の距離Ｌ１から無限遠（∞）までにピントが合うように、被写界深度Ｚ（または、それに対応する過焦点距離ＨＦＤ）と絞りの値（Ｆ値）に設定して、前述と同様に連動して適正露出となる露出時間を設定する。また、人物が小さくなり、遠方の景色もくっきり写るように、輪郭強調処理フィルタによるシャープネスは、ややハードに設定して撮影する。 Next, the following aperture value F corresponding to the set hyperfocal distance (HFD) is set (step S627).
F = f ² / (δ · HFD)
(Δ: Allowable circle of confusion)
Further, the color enhancement is set to the skin color, and the sharpness (outline enhancement) is set to be slightly hard (step S628). Thereafter, the photographing process is executed (step S629). Therefore, when the recognition count number is 6 or more, as shown in FIG. 32C, there are 16 people, for example, and still images shot under the shooting conditions set in steps S626 to S68. Is recorded in the still image / moving image memory 31. In other words, if the number of recognized subjects is 6 or more, it is determined that a commemorative photo is taken by a plurality of people, and the closest person is focused not only on the person but also on the background and scenery in the distance. In the same manner as described above, the depth of field Z (or the corresponding hyperfocal distance HFD) and the aperture value (F value) are set so that the distance from the distance L1 to infinity (∞) is in focus. Set the exposure time for proper exposure. Further, the sharpness by the edge enhancement processing filter is set to be slightly hard so that a person becomes smaller and a distant scenery can be clearly seen.

このように、認識された被写体の距離、人数に応じて、被写界深度や絞りの設定、各種フィルタの処理の選択などの撮影処理を制御して、同じ人物撮影においても、当該被写体の状況に応じて、ユーザの意図により近い自動撮影が行える。同様に、他の撮影シーンにおいても、撮影距離や認識した被写体の数に応じて、当該撮影シーンの中でも、より詳細な撮影条件の設定や画像処理の設定が可能となる。 In this way, depending on the distance and the number of recognized subjects, the shooting process such as the depth of field and aperture setting, and the selection of various filter processes is controlled, so that the situation of the subject can be obtained even in the same person shooting. Accordingly, automatic shooting closer to the user's intention can be performed. Similarly, in other shooting scenes, more detailed shooting conditions and image processing can be set in the shooting scene according to the shooting distance and the number of recognized subjects.

なお、本実施の形態においては、ステップＳ６０４の処理を実行し、前述した図４のステップＳ１０９と同様に、取り込んだ被写体像から対角の２点で識別したい領域を選択するようにした。しかし、このステップＳ６０４を行うことなく、被写体像の全域を識別したい領域とするようにしてもよい。 In the present embodiment, the process of step S604 is executed, and the region to be identified at two diagonal points is selected from the captured subject image, as in step S109 of FIG. 4 described above. However, the entire region of the subject image may be set to be identified without performing step S604.

ここで、図３１のフローチャートにおいて用いた式について説明すると、被写界深度は、撮影レンズの焦点距離（ｆ）と絞り値（Ｆ）と撮影距離（Ｆ）等から、次のような式で算出することができる。
前方被写界深度Ｔｆ＝δＦＬ^２／（ｆ^２＋δＦＬ）
後方被写界深度Ｔｒ＝δＦＬ^２／（ｆ^２−δＦＬ）
被写界深度Ｚ＝Ｔｆ＋Ｔｒ＝δＦＬ^２ｆ^２／（ｆ^２−δＦＬ） Here, the equation used in the flowchart of FIG. 31 will be described. The depth of field is expressed by the following equation from the focal length (f), aperture value (F), photographing distance (F), and the like of the photographing lens. Can be calculated.
Forward depth of field Tf = δFL ² / (f ² + δFL)
Back depth of field Tr = δFL ² / (f ² −δFL)
Depth of field Z = Tf + Tr = δFL ² f ² / (f ² −δFL)

また、撮像面から被写体との距離に換算すると、被写界深度限界近点、遠点が求まる。
被写界深度限界近点Ｌｍｉｎ＝Ｌ−Ｔｆ＝ｆ^２Ｌ／（ｆ^２＋δＦＬ）
被写界深度限界遠点Ｌｍａｘ＝Ｌ＋Ｔｆ＝ｆ^２Ｌ／（ｆ^２−δＦＬ）
同様に、被写界深度Ｚ＝Ｔｆ＋Ｔｒ＝Ｌｍａｘ−Ｌｍｉｎ
但し、ｆ；焦点距離、Ｆ：絞り値（Ｆ値）、Ｌ；被写体との撮影距離、δ；許容錯乱円の直径を表す。 Further, when converted into the distance from the imaging surface to the subject, the depth-of-field limit near point and far point are obtained.
Depth of field limit near point Lmin = L−Tf = f ² L / (f ² + δFL)
Depth of field limit far point Lmax = L + Tf = f ² L / (f ² −δFL)
Similarly, depth of field Z = Tf + Tr = Lmax−Lmin
Where f: focal length, F: aperture value (F value), L: photographing distance to the subject, δ: diameter of allowable circle of confusion.

また、被写界深度遠点（Ｌｍａｘ）が無限遠（∞）となる撮影距離（Ｌ）である「過焦点距離」（ＨｙｐｅｒＦｏｃａｌＤｉｓｔａｎｃｅ、ＨＦＤ）では、ＨＦＤの１／２の距離から無限遠（∞）までにピントが合って見える。この過焦点距離（ＨＦＤ）と撮影距離（Ｌ）の関係からも、同様に、次式のように被写界深度を求めることができる。
過焦点距離ＨＦＤ＝ｆ^２＋δＦ、
被写界深度限界近点Ｌｍｉｎ＝（過焦点距離ＨＦＤ×撮影距離）÷（過焦点距離ＨＦＤ＋撮影距離Ｌ）＝ｆ^２Ｌ（ｆ^２＋δＦＬ）
被写界深度限界遠点Ｌｍａｘ＝（過焦点距離ＨＦＤ×撮影距離）÷（過焦点距離ＨＦＤ−撮影距離Ｌ）＝ｆ^２Ｌ（ｆ^２−δＦＬ）
被写界深度Ｚ＝Ｌｍａｘ−Ｌｍｉｎ＝２δＦＬ^２ｆ^２／（ｆ^４−δ^２Ｆ^２Ｌ^２） Further, in the “hyperfocal distance” (HFD), which is an imaging distance (L) at which the depth of field far point (Lmax) becomes infinity (∞), the distance from ½ of HFD to infinity. It appears in focus by (∞). Similarly, from the relationship between the hyperfocal distance (HFD) and the shooting distance (L), the depth of field can be obtained as in the following equation.
Hyperfocal distance HFD = f ² + δF,
Depth of field limit near point Lmin = (hyperfocal distance HFD × shooting distance) ÷ (hyperfocal distance HFD + shooting distance L) = f ² L (f ² + δFL)
Depth of field limit far point Lmax = (hyperfocal distance HFD × shooting distance) ÷ (hyperfocal distance HFD−shooting distance L) = f ² L (f ² −δFL)
Depth of field Z = Lmax−Lmin = 2δFL ² f ² / (f ⁴ −δ ² F ² L ² )

したがって、前記ステップＳ６１８のように、撮影距離Ｌを被写体の前ａ［ｍｍ］〜被写体の後ｂ［ｍｍ］の範囲にピントが合うように設定するには、
前方被写界深度Ｔｆ＝ａ，後方被写界深度Ｔｒ＝ｂ、被写界深度Ｚ＝Ｔｆ＋Ｔｒ＝ａ＋ｂ、
もしくは、ＨＦＤ＝（Ｌ^２／ａ）−Ｌ、または、ＨＦＤ＝（Ｌ^２／ｂ）＋Ｌと設定すればよい。 Therefore, as in step S618, in order to set the shooting distance L to be in the range from the front a [mm] of the subject to the rear b [mm] of the subject,
Forward depth of field Tf = a, backward depth of field Tr = b, depth of field Z = Tf + Tr = a + b,
Alternatively, HFD = (L ² / a) −L or HFD = (L ² / b) + L may be set.

また、前記ステップＳ６２２のように、撮影距離Ｌ１〜撮影距離Ｌ２までピントが合うように設定するには、
被写界深度限界近点Ｌｍｉｎ＝Ｌ１、被写界深度限界遠点Ｌｍａｘ＝Ｌ２、被写界深度Ｚ＝Ｌｍａｘ−Ｌｍｉｎ＝Ｌ２−Ｌ１、
もしくは、ＨＦＤ＝（Ｌ・Ｌ１）／（Ｌ−Ｌ１）、または、ＨＦＤ＝（Ｌ・Ｌ２）／（２−Ｌ）と設定すればよい。 Further, as in step S622, in order to set the focus from the shooting distance L1 to the shooting distance L2,
Depth of field limit near point Lmin = L1, Depth of field limit far point Lmax = L2, Depth of field Z = Lmax−Lmin = L2−L1,
Alternatively, HFD = (L·L1) / (L−L1) or HFD = (L·L2) / (2−L) may be set.

また、前記ステップＳ６２５のように、撮影距離Ｌ１〜無限遠（∞）までピントがあうように設定するには、
過焦点距離ＨＦＤ＝ｆ^２／δＦ＝Ｌ１、
と設定すればよい。 Also, as in step S625, in order to set the focus from the shooting distance L1 to infinity (∞),
Hyperfocal distance HFD = f ² / δF = L1,
Should be set.

したがって、各設定された被写界深度、もしくは、それに相当する過焦点距離ＨＦＤと焦点距離（ｆ）に応じて、絞り値Ｆ＝ｆ^２／（δ・ＨＦＤ）に設定し、
また、露出値（Ａｖ）＝被写体輝度値（Ｅｖ）＋感度値（Ｓｖ）＝開口値（Ａｖ）＋シャッター速度値（Ｔｖ）、
開口値（Ａｖ）＝ｌｏｇ_２（Ｆ）^２、シャッター速度値（Ｔｖ）ｌｏｇ_２（１／Ｔ）の関係より、適正露出値Ｅｖを満たすシャッター速度値（Ｔｖ）＝Ｅｖ−Ａｖ＝Ｅｖ−ｌｏｇ_２（Ｆ）^２、露出時間（Ｔ）＝１／（２のＴｖ乗）に設定すればよい。 Therefore, the aperture value F = f ² / (δ · HFD) is set according to each set depth of field, or the corresponding hyperfocal length HFD and focal length (f),
Further, exposure value (Av) = subject luminance value (Ev) + sensitivity value (Sv) = aperture value (Av) + shutter speed value (Tv),
From the relationship of the aperture value (Av) = log ₂ (F) ² and the shutter speed value (Tv) log ₂ (1 / T), the shutter speed value (Tv) satisfying the appropriate exposure value Ev = Ev−Av = Ev−log. ₂ (F) ² , exposure time (T) = 1 / (2 to the power of Tv) may be set.

なお、実施の形態においては、本発明をデジタルカメラに適用するようにしたが、カメラに限らず撮像機能を備えた携帯電話等の各種機器にも本発明を適用することができる。 In the embodiment, the present invention is applied to a digital camera. However, the present invention can be applied not only to a camera but also to various devices such as a mobile phone having an imaging function.

（Ａ）は本発明の各実施の形態に共通するデジタルカメラの正面図、（Ｂ）は背面図、（Ｃ）は側面透視図である。(A) is a front view of a digital camera common to each embodiment of the present invention, (B) is a rear view, and (C) is a side perspective view. 同デジタルカメラの概略的回路構成を示すブロック図である。FIG. 2 is a block diagram showing a schematic circuit configuration of the digital camera. 同デジタルカメラの具体的回路構成を示すブロック図である。2 is a block diagram showing a specific circuit configuration of the digital camera. FIG. 第１の実施の形態における処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in 1st Embodiment. 同実施の形態の表示画面例を示す図である。It is a figure which shows the example of a display screen of the embodiment. 人物の顔を抽出して撮影する場合の動作例を示す図である。It is a figure which shows the operation example in the case of extracting and photographing a person's face. 野鳥を数える例を示す図である。It is a figure which shows the example which counts a wild bird. 集客数を数える例を示す図である。It is a figure which shows the example which counts the number of customers. 陳列商品を種別毎に数える例示す図である。It is a figure which shows the example which counts display goods for every classification. 本実施の形態における認識処理、認識被写体の設定メニューの表示例を示す図である。It is a figure which shows the example of a display of the recognition processing in this Embodiment, and the setting menu of a recognition subject. 図４のフローチャートにおけるステップＳ１０６で実行される特徴抽出処理の詳細を示すフローチャートである。It is a flowchart which shows the detail of the feature extraction process performed by step S106 in the flowchart of FIG. １次微分フィルタまたは２次微分フィルタ処理による画像の先鋭化処理、エッジ（輪郭）抽出処理の例を示す図である。It is a figure which shows the example of the sharpening process of the image by a primary differential filter or a secondary differential filter process, and an edge (contour) extraction process. １次微分フィルタまたは２次微分フィルタ処理による画像の先鋭化処理、エッジ（輪郭）抽出処理の例を示す図である。It is a figure which shows the example of the sharpening process of the image by a primary differential filter or a secondary differential filter process, and an edge (contour) extraction process. 膨脹、収縮処理の例を示す図である。It is a figure which shows the example of an expansion and contraction process. 所定の輝度の抽出処理を示す図である。It is a figure which shows the extraction process of predetermined brightness | luminance. 所定の色の領域を抽出する例として、人間の肌色領域の抽出例を示す図である。It is a figure which shows the example of extraction of a human skin color area | region as an example which extracts the area | region of a predetermined color. ステップＳ２０４の特徴抽出処理（１）における膨脹収縮処理の例を示す図である。It is a figure which shows the example of the expansion / contraction process in the characteristic extraction process (1) of step S204. 輪郭形状の特徴を抽出する例を示す図である。It is a figure which shows the example which extracts the feature of an outline shape. 輪郭の縦横比＝長さｈ／幅ｗなどから、形状の「細長さ」等の評価値を求める説明図である。It is explanatory drawing which calculates | requires evaluation values, such as a shape "thin length", from the aspect ratio = length h / width w of an outline. 輪郭形状の偏角関数、位置座標関数の例を示す図である。It is a figure which shows the example of the declination function of a contour shape, and a position coordinate function. 輪郭形状の偏角関数、位置座標関数の例を示す図である。It is a figure which shows the example of the declination function of a contour shape, and a position coordinate function. 輪郭形状のフーリエ記述子の例を示す図である。It is a figure which shows the example of the Fourier descriptor of an outline shape. Ｇ形記述子の例を示す図である。It is a figure which shows the example of a G-type descriptor. Ｐ形記述子の例を示す図である。It is a figure which shows the example of a P-type descriptor. 入力画像とテンプレート画像との相関度、類似度を算出する説明図である。It is explanatory drawing which calculates the correlation degree and similarity degree of an input image and a template image. 顔の眼の領域を抽出するためのマスクパターンの例を示す図である。It is a figure which shows the example of the mask pattern for extracting the area | region of the eye of the face. 人間の認識処理１（中距離、頭の認識）の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the human recognition process 1 (medium distance, head recognition). 人間の顔の簡易な識別処理手順を示すフローチャートである。It is a flowchart which shows the simple identification process procedure of a human face. 本発明の第２の実施の形態における撮影シーンのサンプル画像等を示す図である。It is a figure which shows the sample image etc. of the imaging scene in the 2nd Embodiment of this invention. 同実施の形態における処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in the embodiment. 本発明の第２の実施の形態における処理手順を示すフローチャートである。It is a flowchart which shows the process sequence in the 2nd Embodiment of this invention. 同実施の形態における撮影画像例を示す図である。It is a figure which shows the example of the picked-up image in the same embodiment.

Explanation of symbols

１デジタルカメラ
２本体
３レリーズ釦
８モード切替スイッチ
９ズーム操作キー
１０カーソルキー
１３メニューキー
１４表示部
２０撮像素子
２３操作部
２５制御部
３０圧縮符号化／伸長復号化部
３１静止画／動画画像メモリ
３２プログラムメモリ
３３データメモリ
３５操作入力部
３８撮影制御部
３９外部メモリ媒体
４６測距センサ
５６駆動機構
５８シャッター
６４シャッター駆動部
DESCRIPTION OF SYMBOLS 1 Digital camera 2 Main body 3 Release button 8 Mode change switch 9 Zoom operation key 10 Cursor key 13 Menu key 14 Display part 20 Image sensor 23 Operation part 25 Control part 30 Compression encoding / decompression decoding part 31 Still image / moving image memory 32 Program memory 33 Data memory 35 Operation input unit 38 Shooting control unit 39 External memory medium 46 Distance sensor 56 Drive mechanism 58 Shutter 64 Shutter drive unit

Claims

Imaging means for imaging a subject;
Display means for displaying an image formed by the imaging means;
A selection means for arbitrarily selecting a plurality of subjects to be counted from the image displayed on the display means;
Counting means for counting, for each subject, the number of image portions whose similarity with feature data extracted from each subject is greater than or equal to a predetermined value in the image displayed on the display means ;
Display control means for simultaneously displaying the number counted for each subject by the counting means on the display means while distinguishing for each subject;
A camera apparatus comprising:

2. The display control means, for each of a plurality of subjects selected by the selection means, displays a sample image of each subject and the number of each subject in association with each other on the display means. The camera device described.

The display control means associates the sample image of each subject with the number of each subject in a state where the image portion corresponding to each subject counted by the counting means is identified and displayed in the image displayed on the display means. The camera device according to claim 2, wherein the camera device displays the images simultaneously.

4. The camera apparatus according to claim 3, wherein the display control means performs painting with different shapes corresponding to each of a plurality of subjects to identify and display an image portion corresponding to each subject.

A cursor whose position is arbitrarily displaced in accordance with the operation is displayed on the display means, and further includes an area designating means for designating an arbitrary area whose diagonals are two points designated by the cursor ,
2. The counting unit according to claim 1, wherein the counting unit counts the number of image portions whose similarity with the feature data extracted from each subject is greater than or equal to a predetermined value in the region specified by the region specifying unit. 5. The camera device according to any one of 4.

A cursor whose position is arbitrarily displaced according to the operation is displayed on the display means, and further includes a position designation means for designating an arbitrary position indicated by the cursor,
The camera apparatus according to claim 1, wherein the selection unit selects a subject at a position designated by the position designation unit.

The position specifying means displays a cursor whose position and size are arbitrarily displaced according to an operation on the display means, and specifies an arbitrary position and size indicated by the cursor,
The camera device according to claim 6, wherein the selection unit selects a subject from an area corresponding to the position and size designated by the position designation unit.

Wherein based on the number of counted the image portion by the counting means, a camera apparatus according to any one of claims 1 to 7, characterized by further comprising a photographing control means for controlling the photographing operation of the camera device.

Storage means for storing shooting conditions corresponding to each of a plurality of shooting scenes;
Selection means for selecting one of the plurality of shooting scenes based on the number of the image portions counted by the counting means;
Claim 8, characterized in that the selection means in response to the selected photographing scene by on the basis of the imaging conditions stored in said storage means, further comprising a photographing control means for controlling the photographing operation of the camera device The camera device described.

An instruction selecting means for selecting any of the plurality of shooting scenes based on a photographer's instruction operation;
When the shooting scene is selected by the instruction selection unit, the shooting control unit controls the shooting operation of the camera device based on the shooting condition stored in the storage unit corresponding to the shooting scene. The camera device according to claim 9.

The camera apparatus according to claim 8 , wherein the photographing control unit controls at least one of a focusing operation, a depth of field, an exposure condition, and a filtering process of the camera apparatus.

A computer having a camera device including an image forming means for forming an image of a subject and a display means for displaying an image formed by the image forming means;
A selection means for arbitrarily selecting a plurality of subjects to be counted from the image displayed on the display means;
Counting means for counting, for each subject, the number of image portions whose similarity with feature data extracted from each subject is greater than or equal to a predetermined value in the image displayed on the display means ;
Display control means for causing the display means to display the number counted for each subject by the counting means for each subject;
A camera control program characterized by being made to function.