JP2010244251A

JP2010244251A - Image processor for detecting coordinate position for characteristic site of face

Info

Publication number: JP2010244251A
Application number: JP2009091296A
Authority: JP
Inventors: Masaya Usui; 雅也碓井; Kenji Matsuzaka; 健治松坂
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2009-04-03
Filing date: 2009-04-03
Publication date: 2010-10-28

Abstract

<P>PROBLEM TO BE SOLVED: To achieve the efficiency and high speed operation of processing for detecting the position of the characteristic site of the face included in an image. <P>SOLUTION: An image processor for detecting the coordinate position of the characteristic site of the face included in an image under consideration includes: a face region detection part for detecting an image region including at least a portion of the face image from the image under consideration as a face region; an initial position setting part for setting the initial position of a characteristic point set in the image under consideration for detecting the coordinate position of the characteristic site from the candidates of a plurality of initial positions to be set based on the face region detection information as information related with the detection of the face image; and a characteristic position detection part for correcting the setting position of the characteristic point set at the initial position to be close to the position of the characteristic site, and for detecting the corrected setting position as the coordinate position of the characteristic site. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、注目画像に含まれる顔の特徴部位の座標位置を検出する画像処理装置に関する。 The present invention relates to an image processing apparatus that detects a coordinate position of a feature portion of a face included in an attention image.

視覚的事象のモデル化手法として、アクティブアピアランスモデル（ＡｃｔｉｖｅＡｐｐｅａｒａｎｃｅＭｏｄｅｌ、略して「ＡＡＭ」とも呼ばれる）が知られている。ＡＡＭでは、例えば、複数のサンプル画像に含まれる顔の特徴部位（例えば目尻や鼻頭やフェイスライン）の位置（座標）や画素値（例えば輝度値）の統計的分析を通じて、上記特徴部位の位置により特定される顔の形状を表す形状モデルや、平均的な形状における「見え（Ａｐｐｅａｒａｎｃｅ）」を表すテクスチャーモデルが設定され、これらのモデルを用いて顔画像がモデル化される。ＡＡＭによれば、任意の顔画像のモデル化（合成）が可能であり、また、画像に含まれる顔の特徴部位の位置の検出が可能である（特許文献１）。 As a visual event modeling method, an active appearance model (Active Appearance Model, also referred to as “AAM” for short) is known. In AAM, for example, through statistical analysis of the positions (coordinates) and pixel values (for example, luminance values) of facial features (for example, the corners of the eyes, the nose and the face lines) included in a plurality of sample images, A shape model representing the shape of the identified face and a texture model representing “appearance” in the average shape are set, and the face image is modeled using these models. According to AAM, it is possible to model (synthesize) an arbitrary face image and to detect the position of a facial feature part included in the image (Patent Document 1).

特開２００７−１４１１０７号公報JP 2007-141107 A

しかし、上記従来の技術には、画像に含まれる顔の特徴部位の位置の検出に関して、さらなる効率化・高速化の余地があった。 However, the above-described conventional technology has room for further efficiency and speed-up regarding the detection of the position of the facial feature part included in the image.

なお、このような問題は、ＡＡＭを利用する場合に限らず、画像に含まれる顔の特徴部位の位置を検出する画像処理に共通の問題であった。 Such a problem is not limited to the case of using AAM, but is a problem common to image processing for detecting the position of a facial feature part included in an image.

本発明は、上記の課題を解決するためになされたものであり、画像に含まれる顔の特徴部位の位置を検出する処理の効率化・高速化を図ることを目的とする。 The present invention has been made to solve the above-described problems, and an object of the present invention is to improve the efficiency and speed of processing for detecting the position of a facial feature part included in an image.

上記課題の少なくとも一部を解決するために本願発明は以下の態様を採る。 In order to solve at least a part of the above problems, the present invention employs the following aspects.

第１の態様は、注目画像に含まれる顔の特徴部位の座標位置を検出する画像処理装置を提供する。本発明の第１の態様に係る画像処理装置は、前記注目画像から顔画像の少なくとも一部を含む画像領域を顔領域として検出する顔領域検出部と、前記特徴部位の座標位置を検出するために前記注目画像に設定される特徴点の初期位置を、前記顔領域の検出に関連する情報である顔領域検出情報に基づいて設定される複数の前記初期位置の候補から設定する初期位置設定部と、前記初期位置に設定された前記特徴点の設定位置を前記特徴部位の位置に近づけるように補正し、補正された前記設定位置を前記特徴部位の座標位置として検出する特徴位置検出部と、を備える。 A first aspect provides an image processing apparatus that detects a coordinate position of a feature portion of a face included in an attention image. An image processing apparatus according to the first aspect of the present invention detects a coordinate area of a feature area and a face area detection unit that detects an image area including at least a part of a face image from the target image as a face area. An initial position setting unit that sets an initial position of a feature point set in the target image from a plurality of candidates for the initial position set based on face area detection information that is information related to detection of the face area And a feature position detector that corrects the set position of the feature point set to the initial position so as to approach the position of the feature part, and detects the corrected set position as a coordinate position of the feature part; Is provided.

第１の態様に係る画像処理装置によれば、特徴点の初期位置を、顔領域検出情報に基づいて設定される複数の前記初期位置の候補から設定するため、初期位置を良好な位置に設定することができる。これにより、注目画像に含まれる顔の特徴部位の位置を検出する処理の効率化・高速化を図ることができる。 According to the image processing apparatus according to the first aspect, the initial position of the feature point is set from a plurality of candidates for the initial position set based on the face area detection information, so the initial position is set to a good position. can do. Thereby, it is possible to increase the efficiency and speed of the process of detecting the position of the facial feature part included in the target image.

第１の態様に係る画像処理装置において、前記顔領域検出情報は、前記顔領域検出部による前記検出に伴い特定される情報であり、前記初期位置設定部は、特定された前記顔領域検出情報を取得する取得部と、取得された前記顔領域検出情報に基づいて前記初期位置の候補を設定する初期位置候補設定部と、を備えていてもよい。この場合、初期位置候補設定部により設定された初期位置の候補の１つから特徴点の初期位置を設定するため、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In the image processing device according to the first aspect, the face area detection information is information specified along with the detection by the face area detection unit, and the initial position setting unit is the specified face area detection information. And an initial position candidate setting unit that sets the initial position candidate based on the acquired face area detection information. In this case, since the initial position of the feature point is set from one of the initial position candidates set by the initial position candidate setting unit, the position of the feature part of the face included in the target image is detected efficiently and at high speed. Can do.

第１の態様に係る画像処理装置において、前記顔領域検出情報は、前記顔領域検出部により検出された前記顔領域に含まれる顔画像が真の顔画像であることの確からしさを表す顔領域信頼度を含み、前記初期位置候補設定部は、前記顔領域信頼度が低い場合には、前記顔領域信頼度が高い場合に比べて、設定する前記初期位置の候補の数を増やしてもよい。この場合、顔領域信頼度が低い場合に、初期位置の候補の数を増やすことにより、注目画像に含まれる顔の特徴部位の位置を効率的に検出することができる。 In the image processing apparatus according to the first aspect, the face area detection information includes a face area that represents a probability that the face image included in the face area detected by the face area detection unit is a true face image. Including the reliability, the initial position candidate setting unit may increase the number of candidates for the initial position to be set when the face area reliability is low compared to when the face area reliability is high. . In this case, when the face area reliability is low, by increasing the number of candidates for the initial position, it is possible to efficiently detect the position of the facial feature part included in the target image.

第１の態様に係る画像処理装置において、前記顔領域検出情報は、前記顔領域検出部により検出された前記顔領域に含まれる顔画像の画像面内における回転角度に関する角度情報を含み、前記初期位置候補設定部は、前記角度情報に基づいて、予め規定されている前記初期位置の候補を、前記回転角度に応じて回転させて設定してもよい。この場合、初期位置の候補を、回転角度に応じて回転させて設定することにより、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In the image processing device according to the first aspect, the face area detection information includes angle information related to a rotation angle in the image plane of the face image included in the face area detected by the face area detection unit, and The position candidate setting unit may rotate and set the predetermined initial position candidates according to the rotation angle based on the angle information. In this case, the position of the facial feature part included in the image of interest can be detected efficiently and at high speed by setting the initial position candidates by rotating them according to the rotation angle.

第１の態様に係る画像処理装置において、前記顔領域検出情報は、前記顔領域検出部が特定可能な前記顔画像の画像面内における回転角度に関する情報を含み、前記複数の初期位置の候補は、前記顔領域検出部が特定可能な前記回転角ごとに、前記回転角と値が隣接する一方の前記特定可能な回転角度との中間値から、値が隣接する他方の前記特定可能な回転角度との中間値までの範囲にそれぞれ設定されてもよい。この場合、顔領域に含まれる顔画像の回転角度を用いて、特定された回転角度を中心として所定の角度の範囲内に複数の初期位置の候補を設定することで、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In the image processing device according to the first aspect, the face area detection information includes information on a rotation angle in the image plane of the face image that can be specified by the face area detection unit, and the plurality of initial position candidates are For each of the rotation angles that can be specified by the face area detection unit, from the intermediate value between the rotation angle and the one of the specified rotation angles that are adjacent to each other, the other specified rotation angle that has the adjacent value May be set in a range up to an intermediate value. In this case, by using the rotation angle of the face image included in the face area, a plurality of initial position candidates are set within a predetermined angle range centered on the specified rotation angle, whereby the face included in the target image It is possible to efficiently and rapidly detect the position of the characteristic part.

第１の態様に係る画像処理装置において、前記顔領域検出情報は、前記顔領域検出部により検出された前記顔領域に対する顔画像の相対的な位置の傾向に関する情報を含み、前記複数の初期位置の候補は、前記顔領域に対する相対的な位置が前記傾向に応じて決定されていてもよい。この場合、顔領域に対する顔画像の相対的な位置の傾向に応じて初期位置の候補の顔領域に対する相対的な位置が決定されているため、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In the image processing device according to the first aspect, the face area detection information includes information on a tendency of a relative position of a face image with respect to the face area detected by the face area detection unit, and the plurality of initial positions The relative position with respect to the face area may be determined according to the tendency. In this case, since the relative position with respect to the candidate face area of the initial position is determined according to the tendency of the relative position of the face image with respect to the face area, the position of the facial feature portion included in the target image is efficiently determined. And can be detected at high speed.

第１の態様に係る画像処理装置において、前記初期位置設定部は、前記初期位置の候補となる位置に設定された前記特徴点に基づいて、前記注目画像の一部を変換した画像である平均形状画像を生成する生成部と、前記平均形状画像と、前記特徴部位の座標位置が既知の顔画像を含む複数のサンプル画像に基づいて生成された画像である平均顔画像と、の差分値を算出する算出部と、を備えるとともに、前記複数の初期位置の候補のうち、前記差分値が最小となる初期位置の候補を前記初期位置として設定してもよい。この場合、差分値が最小となる初期位置の候補を初期位置とすることにより、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In the image processing apparatus according to the first aspect, the initial position setting unit is an average that is an image obtained by converting a part of the target image based on the feature points set at positions that are candidates for the initial position. A difference value between a generation unit that generates a shape image, the average shape image, and an average face image that is an image generated based on a plurality of sample images including a face image in which the coordinate position of the characteristic part is known. A calculation unit for calculating, and among the plurality of initial position candidates, an initial position candidate having a minimum difference value may be set as the initial position. In this case, by setting the initial position candidate having the smallest difference value as the initial position, the position of the facial feature part included in the target image can be detected efficiently and at high speed.

第１の態様に係る画像処理装置において、前記特徴位置検出部は、前記初期位置に対応する平均形状画像と、前記平均顔画像と、の差分値に基づいて、前記差分値が小さくなるように前記設定位置を補正する補正部を備えるとともに、前記差分値が所定となる前記設定位置を前記座標位置として検出してもよい。この場合、初期位置に対応する平均形状画像と、平均顔画像と、の差分値に基づいて、特徴部位の座標位置を検出するため、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In the image processing device according to the first aspect, the feature position detection unit is configured to reduce the difference value based on a difference value between an average shape image corresponding to the initial position and the average face image. While providing the correction | amendment part which correct | amends the said setting position, you may detect the said setting position where the said difference value becomes predetermined as said coordinate position. In this case, since the coordinate position of the feature part is detected based on the difference value between the average shape image corresponding to the initial position and the average face image, the position of the feature part of the face included in the target image can be efficiently and It can be detected at high speed.

第１の態様に係る画像処理装置において、前記特徴部位は、眉毛と目と鼻と口とフェイスラインとの一部であってもよい。この場合、眉毛と目と鼻と口とフェイスラインと一部について良好に座標位置を検出することができる。 In the image processing apparatus according to the first aspect, the characteristic part may be a part of eyebrows, eyes, nose, mouth, and face line. In this case, the coordinate positions can be satisfactorily detected for the eyebrows, eyes, nose, mouth, face line, and part.

なお、本発明は、種々の態様で実現することが可能であり、例えば、プリンター、デジタルスチルカメラ、パーソナルコンピューター、デジタルビデオカメラ等で実現することができる。また、画像処理方法および装置、特徴部位の位置検出方法および装置、表情判定方法および装置、これらの方法または装置の機能を実現するためのコンピュータープログラム、そのコンピュータープログラムを記録した記録媒体、そのコンピュータープログラムを含み搬送波内に具現化されたデータ信号、等の形態で実現することができる。 Note that the present invention can be realized in various modes, for example, a printer, a digital still camera, a personal computer, a digital video camera, and the like. Also, an image processing method and apparatus, a position detection method and apparatus for a characteristic part, a facial expression determination method and apparatus, a computer program for realizing the functions of these methods or apparatuses, a recording medium recording the computer program, and the computer program Including a data signal embodied in a carrier wave.

本発明の第１実施例における画像処理装置としてのプリンター１００の構成を概略的に示す説明図である。1 is an explanatory diagram schematically showing a configuration of a printer 100 as an image processing apparatus in a first embodiment of the present invention. FIG. 第１実施例におけるＡＡＭ設定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the AAM setting process in 1st Example. サンプル画像ＳＩの一例を示す説明図である。It is explanatory drawing which shows an example of sample image SI. サンプル画像ＳＩにおける特徴点ＣＰの設定方法の一例を示す説明図である。It is explanatory drawing which shows an example of the setting method of the feature point CP in the sample image SI. サンプル画像ＳＩに設定された特徴点ＣＰの座標の一例を示す説明図である。It is explanatory drawing which shows an example of the coordinate of the feature point CP set to sample image SI. 平均形状ｓ０の一例を示す説明図である。It is explanatory drawing which shows an example of average shape s0. 形状ベクトルｓｉおよび形状パラメーターｐｉと顔の形状ｓとの関係を例示した説明図である。It is explanatory drawing which illustrated the relationship between shape vector si and shape parameter pi, and face shape s. サンプル画像ＳＩのワープＷの方法の一例を示す説明図である。It is explanatory drawing which shows an example of the method of the warp W of the sample image SI. 平均顔画像Ａ０（ｘ）の一例を示す説明図である。It is explanatory drawing which shows an example of average face image A0 (x). 第１実施例における顔特徴位置検出処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the face feature position detection process in 1st Example. 顔領域ＦＡの検出処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the detection process of face area FA. 注目画像ＯＩにおける顔領域ＦＡの検出を説明するための説明図である。It is explanatory drawing for demonstrating the detection of the face area FA in the attention image OI. 評価値Ｔｖの算出に用いられるフィルタを説明するための説明図である。It is explanatory drawing for demonstrating the filter used for calculation of evaluation value Tv. ウィンドウＳＷを移動させた状態を例示した説明図である。It is explanatory drawing which illustrated the state which moved window SW. 顔の画像に対応する画像領域であると判定された複数のウィンドウＳＷを例示した説明図である。It is explanatory drawing which illustrated several window SW determined to be an image area | region corresponding to the image of a face. 学習に用いられるサンプル画像の一例を示す説明図である。It is explanatory drawing which shows an example of the sample image used for learning. 第１実施例における特徴点ＣＰの初期位置設定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the initial position setting process of the feature point CP in 1st Example. グローバルパラメーターの値を変更することによる特徴点ＣＰの仮設定位置を例示した説明図である。It is explanatory drawing which illustrated the temporary setting position of the feature point CP by changing the value of a global parameter. 顔領域ＦＡの特定顔傾きが３０度の場合における特徴点ＣＰの仮設定位置を例示した説明図である。It is explanatory drawing which illustrated the temporary setting position of the feature point CP in case the specific face inclination of the face area FA is 30 degree | times. 仮設定位置の変化の段階数を説明するための説明図である。It is explanatory drawing for demonstrating the step number of the change of a temporary setting position. 平均形状画像Ｉ（Ｗ（ｘ；ｐ））の一例を示す説明図である。It is explanatory drawing which shows an example of the average shape image I (W (x; p)). 第１実施例における特徴点ＣＰ設定位置補正処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the feature point CP setting position correction process in 1st Example. 顔特徴位置検出処理の結果の一例を示す説明図である。It is explanatory drawing which shows an example of the result of a face feature position detection process. 顔領域検出情報と特徴点ＣＰの仮設定位置について第２の例を示した説明図である。It is explanatory drawing which showed the 2nd example about the temporary setting position of face area | region detection information and the feature point CP. 顔領域検出情報と特徴点ＣＰの仮設定位置についての第３の例を示した説明図である。It is explanatory drawing which showed the 3rd example about the temporary setting position of face area | region detection information and the feature point CP.

以下、本発明に係る画像処理装置の一態様であるプリンターについて、図面を参照しつつ、実施例に基づいて説明する。 Hereinafter, a printer which is an aspect of an image processing apparatus according to the present invention will be described based on examples with reference to the drawings.

Ａ．第１実施例：
Ａ１．画像処理装置の構成：
図１は、本発明の第１実施例における画像処理装置としてのプリンター１００の構成を概略的に示す説明図である。本実施例のプリンター１００は、メモリーカードＭＣ等から取得した画像データに基づき画像を印刷する、いわゆるダイレクトプリントに対応したインクジェット式カラープリンターである。プリンター１００は、プリンター１００の各部を制御するＣＰＵ１１０と、ＲＯＭやＲＡＭによって構成された内部メモリー１２０と、ボタンやタッチパネルにより構成された操作部１４０と、液晶ディスプレイにより構成された表示部１５０と、印刷機構１６０と、カードインターフェース（カードＩ／Ｆ）１７０と、を備えている。プリンター１００は、さらに、他の機器（例えばデジタルスチルカメラやパーソナルコンピューター）とのデータ通信を行うためのインターフェースを備えていてもよい。プリンター１００の各構成要素は、バスを介して双方向通信可能に接続されている。 A. First embodiment:
A1. Configuration of image processing device:
FIG. 1 is an explanatory diagram schematically showing the configuration of a printer 100 as an image processing apparatus according to the first embodiment of the present invention. The printer 100 of this embodiment is an ink jet color printer that supports so-called direct printing, in which an image is printed based on image data acquired from a memory card MC or the like. The printer 100 includes a CPU 110 that controls each unit of the printer 100, an internal memory 120 configured by a ROM and a RAM, an operation unit 140 configured by buttons and a touch panel, a display unit 150 configured by a liquid crystal display, and printing. A mechanism 160 and a card interface (card I / F) 170 are provided. The printer 100 may further include an interface for performing data communication with other devices (for example, a digital still camera or a personal computer). Each component of the printer 100 is connected via a bus so that bidirectional communication is possible.

印刷機構１６０は、印刷データに基づき印刷を行う。カードインターフェース１７０は、カードスロット１７２に挿入されたメモリーカードＭＣとの間でデータのやり取りを行うためのインターフェースである。なお、本実施例では、メモリーカードＭＣに画像データを含む画像ファイルが格納されている。 The printing mechanism 160 performs printing based on the print data. The card interface 170 is an interface for exchanging data with the memory card MC inserted into the card slot 172. In this embodiment, an image file including image data is stored in the memory card MC.

内部メモリー１２０には、画像処理部２００と、表示処理部３１０と、印刷処理部３２０と、が格納されている。画像処理部２００は、コンピュータープログラムであり、所定のオペレーティングシステムの下で、ＣＰＵ１１０により実行されることで顔特徴位置検出処理をおこなう。顔特徴位置検出処理は、顔画像における所定の特徴部位（例えば目尻や鼻頭やフェイスライン）の位置を検出する処理である。顔特徴位置検出処理については、後に詳述する。表示処理部３１０、および、印刷処理部３２０についてもＣＰＵ１１０により実行されることでぞれぞれの機能を実現する。 The internal memory 120 stores an image processing unit 200, a display processing unit 310, and a print processing unit 320. The image processing unit 200 is a computer program, and performs facial feature position detection processing by being executed by the CPU 110 under a predetermined operating system. The face feature position detection process is a process for detecting the position of a predetermined feature part (for example, the corner of the eye, the nose head, or the face line) in the face image. The face feature position detection process will be described in detail later. The display processing unit 310 and the print processing unit 320 are also executed by the CPU 110 to realize the respective functions.

画像処理部２００は、プログラムモジュールとして、顔領域検出部２１０と、特徴位置検出部２２０と、初期位置設定部２３０と、を含んでいる。初期位置設定部２３０は、取得部２３２と、初期位置候補設定部２３４と、生成部２３６と、算出部２３８と、を含んでいる。これら各部の機能については、後述の顔特徴位置検出処理の説明において詳述する。 The image processing unit 200 includes a face area detection unit 210, a feature position detection unit 220, and an initial position setting unit 230 as program modules. The initial position setting unit 230 includes an acquisition unit 232, an initial position candidate setting unit 234, a generation unit 236, and a calculation unit 238. The functions of these units will be described in detail in the description of the face feature position detection process described later.

表示処理部３１０は、表示部１５０を制御して、表示部１５０上に処理メニューやメッセージ、画像等を表示させるディスプレイドライバである。印刷処理部３２０は、画像データから印刷データを生成し、印刷機構１６０を制御して、印刷データに基づく画像の印刷を実行するためのコンピュータープログラムである。ＣＰＵ１１０は、内部メモリー１２０から、これらのプログラム（画像処理部２００、表示処理部３１０、印刷処理部３２０）を読み出して実行することにより、これら各部の機能を実現する。 The display processing unit 310 is a display driver that controls the display unit 150 to display processing menus, messages, images, and the like on the display unit 150. The print processing unit 320 is a computer program for generating print data from image data, controlling the printing mechanism 160, and printing an image based on the print data. The CPU 110 reads out and executes these programs (the image processing unit 200, the display processing unit 310, and the print processing unit 320) from the internal memory 120, thereby realizing the functions of these units.

内部メモリー１２０には、また、ＡＡＭ情報ＡＭＩおよび顔学習データＦＬＤが格納されている。ＡＡＭ情報ＡＭＩは、後述のＡＡＭ設定処理によって予め設定される情報であり、後述の顔特徴位置検出処理において参照される。ＡＡＭ情報ＡＭＩの内容については、後述のＡＡＭ設定処理の説明において詳述する。顔学習データＦＬＤは、顔領域検出部２1０による顔領域ＦＡの検出に用いられる。顔学習データＦＬＤの内容については、後述の顔領域ＦＡの検出処理の説明において詳述する。 The internal memory 120 also stores AAM information AMI and face learning data FLD. The AAM information AMI is information set in advance by an AAM setting process described later, and is referred to in a face feature position detection process described later. The contents of the AAM information AMI will be described in detail in the description of AAM setting processing described later. The face learning data FLD is used for the detection of the face area FA by the face area detection unit 2110. The contents of the face learning data FLD will be described in detail in the description of the face area FA detection process described later.

Ａ２．ＡＡＭ設定処理：
図２は、第１実施例におけるＡＡＭ設定処理の流れを示すフローチャートである。ＡＡＭ設定処理は、ＡＡＭ（アクティブアピアランスモデル（ＡｃｔｉｖｅＡｐｐｅａｒａｎｃｅＭｏｄｅｌ））と呼ばれる画像のモデル化に用いられる形状モデルおよびテクスチャーモデルを設定する処理である。本実施例において、ＡＡＭ設定処理は、ユーザによりおこなわれる。 A2. AAM setting process:
FIG. 2 is a flowchart showing the flow of AAM setting processing in the first embodiment. The AAM setting process is a process for setting a shape model and a texture model used for modeling an image called AAM (Active Appearance Model). In this embodiment, the AAM setting process is performed by the user.

はじめに、ユーザは、人物の顔を含んだ複数の画像をサンプル画像ＳＩとして用意する（ステップＳ１１０）。図３は、サンプル画像ＳＩの一例を示す説明図である。図３に示すように、サンプル画像ＳＩは、個性、人種・性別、表情（怒り、笑い、困り、驚き等）、向き（正面向き、上向き、下向き、右向き、左向き等）といった種々の属性に関して互いに相違する顔画像が含まれるように用意される。サンプル画像ＳＩがそのように用意されれば、ＡＡＭによってあらゆる顔画像を精度良くモデル化することが可能となり、あらゆる顔画像を対象とした精度の良い顔特徴位置検出処理（後述）の実行が可能となる。なお、サンプル画像ＳＩは、学習用画像とも呼ばれる。 First, the user prepares a plurality of images including a person's face as sample images SI (step S110). FIG. 3 is an explanatory diagram showing an example of the sample image SI. As shown in FIG. 3, the sample image SI is related to various attributes such as personality, race / gender, facial expression (anger, laughter, trouble, surprise, etc.), direction (front direction, upward, downward, rightward, leftward, etc.). It is prepared to include different face images. If the sample image SI is prepared in this way, any face image can be accurately modeled by AAM, and accurate face feature position detection processing (described later) for any face image can be executed. It becomes. The sample image SI is also called a learning image.

それぞれのサンプル画像ＳＩに含まれる顔画像に、特徴点ＣＰを設定する（ステップＳ１２０）。図４は、サンプル画像ＳＩにおける特徴点ＣＰの設定方法の一例を示す説明図である。特徴点ＣＰは、顔画像における所定の特徴部位の位置を示す点である。本実施例では、所定の特徴部位として、人物の顔における眉毛上の所定位置（例えば端点や４分割点等、以下同じ）、目の輪郭上の所定位置、鼻筋および小鼻の輪郭上の所定位置、上下唇の輪郭上の所定位置、顔の輪郭（フェイスライン）上の所定位置といった６８箇所の部位が設定されている。すなわち、本実施例では、人物の顔に共通して含まれる顔の器官（眉毛、目、鼻、口）および顔の輪郭における所定位置を、特徴部位として設定する。図４に示すように、特徴点ＣＰは、各サンプル画像ＳＩにおいてオペレーターにより指定された６８個の特徴部位を表す位置に設定（配置）される。このように設定された各特徴点ＣＰは各特徴部位に対応しているため、顔画像における特徴点ＣＰの配置は顔の形状を特定していると表現することができる。 A feature point CP is set to the face image included in each sample image SI (step S120). FIG. 4 is an explanatory diagram showing an example of a method for setting the feature point CP in the sample image SI. The feature point CP is a point indicating the position of a predetermined feature part in the face image. In this embodiment, as predetermined feature parts, a predetermined position on the eyebrows in a person's face (for example, an end point or a four-divided point, the same applies hereinafter), a predetermined position on the eye contour, and a predetermined position on the nose and nose contours 68 parts such as a predetermined position on the contour of the upper and lower lips and a predetermined position on the contour of the face (face line) are set. That is, in the present embodiment, a predetermined position in the facial organs (eyebrows, eyes, nose, mouth) and facial contour that are commonly included in the human face are set as the characteristic parts. As shown in FIG. 4, the feature points CP are set (arranged) at positions representing 68 feature parts designated by the operator in each sample image SI. Since each feature point CP set in this way corresponds to each feature part, the arrangement of the feature points CP in the face image can be expressed as specifying the shape of the face.

サンプル画像ＳＩにおける特徴点ＣＰの位置は、座標により特定される。図５は、サンプル画像ＳＩに設定された特徴点ＣＰの座標の一例を示す説明図である。図５において、ＳＩ（ｊ）（ｊ＝１，２，３・・・）は各サンプル画像ＳＩを示しており、ＣＰ（ｋ）（ｋ＝０，１，・・・，６７）は各特徴点ＣＰを示している。また、ＣＰ（ｋ）−Ｘは、特徴点ＣＰ（ｋ）のＸ座標を示しており、ＣＰ（ｋ）−Ｙは、特徴点ＣＰ（ｋ）のＹ座標を示している。特徴点ＣＰの座標としては、顔の大きさと顔の傾き（画像面内の傾き）と顔のＸ方向およびＹ方向の位置とのそれぞれについて正規化されたサンプル画像ＳＩにおける所定の基準点（例えば画像の左下の点）を原点とした座標が用いられる。また、本実施例では、１つのサンプル画像ＳＩに複数の人物の顔画像が含まれる場合が許容されており（例えばサンプル画像ＳＩ（２）には２人の顔画像が含まれている）、１つのサンプル画像ＳＩにおける各人物は人物ＩＤによって特定される。 The position of the feature point CP in the sample image SI is specified by coordinates. FIG. 5 is an explanatory diagram showing an example of the coordinates of the feature point CP set in the sample image SI. In FIG. 5, SI (j) (j = 1, 2, 3,...) Indicates each sample image SI, and CP (k) (k = 0, 1,..., 67) indicates each feature. Point CP is shown. CP (k) -X represents the X coordinate of the feature point CP (k), and CP (k) -Y represents the Y coordinate of the feature point CP (k). The coordinates of the feature point CP include predetermined reference points (for example, in the sample image SI normalized with respect to the size of the face, the inclination of the face (inclination in the image plane), and the position of the face in the X direction and the Y direction) The coordinates with the origin at the lower left point of the image are used. Further, in the present embodiment, a case where a plurality of human face images are included in one sample image SI is allowed (for example, two sample face images are included in the sample image SI (2)). Each person in one sample image SI is specified by a person ID.

つづいて、ユーザは、ＡＡＭの形状モデルの設定をおこなう（ステップＳ１３０）。具体的には、各サンプル画像ＳＩにおける６８個の特徴点ＣＰの座標（Ｘ座標およびＹ座標）により構成される座標ベクトル（図５参照）に対する主成分分析をおこない、特徴点ＣＰの位置により特定される顔の形状ｓが下記の式（１）によりモデル化する。なお、形状モデルは、特徴点ＣＰの配置モデルとも呼ぶ。 Subsequently, the user sets an AAM shape model (step S130). Specifically, a principal component analysis is performed on a coordinate vector (see FIG. 5) composed of the coordinates (X coordinate and Y coordinate) of 68 feature points CP in each sample image SI, and specified by the position of the feature point CP. The face shape s to be modeled is expressed by the following equation (1). The shape model is also referred to as a feature point CP arrangement model.

上記式（１）において、ｓ₀は平均形状である。図６は、平均形状ｓ₀の一例を示す説明図である。図６（ａ）および（ｂ）に示すように、平均形状ｓ₀は、サンプル画像ＳＩの各特徴点ＣＰについての平均位置（平均座標）により特定される平均的な顔の形状を表すモデルである。なお、本実施例では、平均形状ｓ₀において、外周に位置する特徴点ＣＰ（フェイスラインおよび眉毛、眉間に対応する特徴点ＣＰ、図４参照）を結ぶ直線により囲まれた領域（図６（ｂ）においてハッチングを付して示す）を「平均形状領域ＢＳＡ」と呼ぶ。平均形状ｓ₀においては、図６（ａ）に示すように、特徴点ＣＰを頂点とする複数の三角形領域ＴＡが、平均形状領域ＢＳＡをメッシュ状に分割するように設定される。 In the above formula (1), s ₀ is an average shape. FIG. 6 is an explanatory diagram showing an example of the average shape s ₀ . As shown in FIGS. 6A and 6B, the average shape s ₀ is a model representing the average face shape specified by the average position (average coordinate) for each feature point CP of the sample image SI. is there. In the present embodiment, in the average shape s ₀ , a region surrounded by a straight line connecting feature points CP (face lines, eyebrows, feature points CP corresponding to the eyebrows, see FIG. 4) located on the outer periphery (FIG. 6 ( b) is indicated by hatching) and is referred to as “average shape region BSA”. In the average shape s ₀ , as shown in FIG. 6A, a plurality of triangular areas TA having the feature points CP as vertices are set so as to divide the average shape area BSA into a mesh shape.

形状モデルを表す上記式（１）において、ｓ_iは形状ベクトルであり、ｐ_iは形状ベクトルｓ_iの重みを表す形状パラメーターである。形状ベクトルｓ_iは、顔の形状ｓの特性を表すベクトルであり、主成分分析により得られる第ｉ主成分に対応する固有ベクトルである。上記式（１）に示すように、本実施例における形状モデルでは、特徴点ＣＰの配置を表す顔形状ｓが、平均形状ｓ₀とｎ個の形状ベクトルｓ_iの線形結合との和としてモデル化される。形状モデルにおいて形状パラメーターｐ_iを適切に設定することにより、あらゆる画像における顔の形状ｓを再現することが可能である。 In the above equation (1) representing the shape model, s _i is a shape vector, and p _i is a shape parameter representing the weight of the shape vector s _i . The shape vector s _i is a vector representing the characteristics of the face shape s and is an eigenvector corresponding to the i-th principal component obtained by principal component analysis. As shown in the above equation (1), in the shape model in the present embodiment, the face shape s representing the arrangement of the feature points CP is modeled as the sum of the average shape s ₀ and the linear combination of the n shape vectors s _i. It becomes. By appropriately setting the shape parameter p _i in the shape model, the face shape s in any image can be reproduced.

図７は、形状ベクトルｓ_iおよび形状パラメーターｐ_iと、顔の形状ｓとの関係を例示した説明図である。図７（ａ）に示すように、顔の形状ｓを特定するために、寄与率のより大きい主成分に対応する固有ベクトルから順に、累積寄与率に基づき設定された個数ｎ（図７ではn＝４）の固有ベクトルが、形状ベクトルｓ_iとして採用される。形状ベクトルｓ_iのそれぞれは、図７（ａ）の矢印に示すように、各特徴点ＣＰの移動方向・移動量と対応している。本実施例では、最も寄与率の大きい第１主成分に対応する第１形状ベクトルｓ₁は顔の左右振りにほぼ相関するベクトルとなっており、形状パラメーターｐ₁を大小することにより、図７（ｂ）に示すように、顔の形状ｓの横方向の顔向きが変化する。２番目に寄与率の大きい第２主成分に対応する第２形状ベクトルｓ₂は顔の上下振りにほぼ相関するベクトルとなっており、形状パラメーターｐ₂を大小することにより、図７（ｃ）に示すように、顔の形状ｓの縦方向の顔向きが変化する。また、３番目に寄与率の大きい第３主成分に対応する第３形状ベクトルｓ₃は顔の形状の縦横比にほぼ相関するベクトルとなっており、４番目に寄与率の大きい第４主成分に対応する第４形状ベクトルｓ₄は口の開きの程度にほぼ相関するベクトルとなっている。このように、形状パラメーターの値は、顔の表情や、顔向きなど顔画像の特徴を表す。 FIG. 7 is an explanatory diagram illustrating the relationship between the shape vector s _{i, the} shape parameter p _i, and the face shape s. As shown in FIG. 7A, in order to specify the face shape s, the number n set based on the cumulative contribution rate in order from the eigenvector corresponding to the principal component having the larger contribution rate (in FIG. 7, n = The eigenvector of 4) is adopted as the shape vector s _i . Each of the shape vectors s _i corresponds to the movement direction / movement amount of each feature point CP, as indicated by the arrows in FIG. In this embodiment, most first shape vector s ₁ corresponding to the first principal component having a large contribution rate is a vector that is approximately correlated with the left and right appearance of a face, by the magnitude of the shape parameter p _1, 7 As shown in (b), the horizontal face direction of the face shape s changes. Second shape vector s ₂ corresponding to the large second principal component of the contribution rate to the second is a vector that is approximately correlated with the vertical appearance of a face, by the magnitude of the shape parameter p _2, FIG. 7 (c) As shown in FIG. 4, the vertical face direction of the face shape s changes. The third shape vector s ₃ corresponding to the third principal component having the third largest contribution ratio is a vector that is substantially correlated with the aspect ratio of the face shape, and the fourth principal component having the fourth largest contribution ratio. The fourth shape vector s ₄ corresponding to is a vector that is substantially correlated with the degree of mouth opening. As described above, the value of the shape parameter represents the feature of the face image such as facial expression and face orientation.

なお、形状モデル設定ステップ（ステップＳ１３０）において設定された平均形状ｓ₀および形状ベクトルｓ_iは、ＡＡＭ情報ＡＭＩ（図１）として内部メモリー１２０に格納される。 The average shape s ₀ and shape vector s _i set in the shape model setting step (step S130) are stored in the internal memory 120 as AAM information AMI (FIG. 1).

つづいて、ＡＡＭのテクスチャーモデルの設定をおこなう（ステップＳ１４０）。具体的には、まず、各サンプル画像ＳＩに対して、サンプル画像ＳＩにおける特徴点ＣＰの設定位置が平均形状ｓ₀における特徴点ＣＰの設定位置と等しくなるように、画像変換（以下、「ワープＷ」とも呼ぶ）を行う。 Subsequently, an AAM texture model is set (step S140). Specifically, first, for each sample image SI, image conversion (hereinafter referred to as “warp”) is performed so that the setting position of the feature point CP in the sample image SI is equal to the setting position of the feature point CP in the average shape s ₀ . W ”).

図８は、サンプル画像ＳＩのワープＷの方法の一例を示す説明図である。各サンプル画像ＳＩにおいては、平均形状ｓ₀と同様に、外周に位置する特徴点ＣＰにより囲まれた領域をメッシュ状に分割する複数の三角形領域ＴＡが設定される。ワープＷは、複数の三角形領域ＴＡのそれぞれについてのアフィン変換の集合である。すなわち、ワープＷにおいては、サンプル画像ＳＩにおけるある三角形領域ＴＡの画像は、平均形状ｓ₀における対応する三角形領域ＴＡの画像へとアフィン変換される。ワープＷにより、特徴点ＣＰの設定位置が平均形状ｓ₀における特徴点ＣＰの設定位置と等しいサンプル画像ＳＩ（以下「サンプル画像ＳＩｗ」と表す）が生成される。 FIG. 8 is an explanatory diagram showing an example of a method of warping W of the sample image SI. In each sample image SI, similarly to the average shape s ₀ , a plurality of triangular regions TA that divide the region surrounded by the feature points CP located on the outer periphery into mesh shapes are set. The warp W is a set of affine transformations for each of the plurality of triangular areas TA. That is, in the warp W, an image of a certain triangular area TA in the sample image SI is affine transformed into an image of the corresponding triangular area TA in the average shape s ₀ . With the warp W, a sample image SI (hereinafter referred to as “sample image SIw”) in which the set position of the feature point CP is equal to the set position of the feature point CP in the average shape s ₀ is generated.

なお、各サンプル画像ＳＩｗは、平均形状領域ＢＳＡ（図８においてハッチングを付して示す）を内包する矩形枠を外周とし、平均形状領域ＢＳＡ以外の領域（以下「マスク領域ＭＡ」とも呼ぶ）がマスクされた画像として生成される。平均形状領域ＢＳＡとマスク領域ＭＡとを併せた画像領域を基準領域ＢＡと呼ぶ。また、各サンプル画像ＳＩｗは、例えば５６画素×５６画素のサイズの画像として正規化される。 Each sample image SIw has a rectangular frame containing an average shape area BSA (shown with hatching in FIG. 8) as an outer periphery, and an area other than the average shape area BSA (hereinafter also referred to as “mask area MA”). Generated as a masked image. An image area in which the average shape area BSA and the mask area MA are combined is referred to as a reference area BA. Each sample image SIw is normalized as an image having a size of 56 pixels × 56 pixels, for example.

次に、各サンプル画像ＳＩｗの画素群ｘのそれぞれにおける輝度値により構成される輝度値ベクトルに対する主成分分析が行われ、顔のテクスチャー（「見え」とも呼ぶ）Ａ（ｘ）が下記の式（２）によりモデル化される。なお、画素群ｘは、平均形状領域ＢＳＡに位置する画素の集合である。 Next, a principal component analysis is performed on a luminance value vector composed of luminance values in each pixel group x of each sample image SIw, and a facial texture (also referred to as “appearance”) A (x) is expressed by the following formula ( 2). The pixel group x is a set of pixels located in the average shape area BSA.

上記式（２）において、Ａ₀（ｘ）は平均顔画像である。図９は、平均顔画像Ａ₀（ｘ）の一例を示す説明図である。平均顔画像Ａ₀（ｘ）は、ワープＷの後のサンプル画像ＳＩｗ（図８参照）の平均の顔が表された画像である。すなわち、平均顔画像Ａ₀（ｘ）は、サンプル画像ＳＩｗの平均形状領域ＢＳＡ内の画素群ｘの画素値（輝度値）の平均をとることにより算出される画像である。従って、平均顔画像Ａ₀（ｘ）は、平均的な顔の形状における平均的な顔のテクスチャー（見え）を表すモデルである。なお、平均顔画像Ａ₀（ｘ）は、サンプル画像ＳＩｗと同様に、平均形状領域ＢＳＡとマスク領域ＭＡとで構成され、例えば５６画素×５６画素のサイズの画像として算出される。 In the above equation (2), A ₀ (x) is an average face image. FIG. 9 is an explanatory diagram showing an example of the average face image A ₀ (x). The average face image A ₀ (x) is an image representing the average face of the sample image SIw (see FIG. 8) after the warp W. That is, the average face image A ₀ (x) is an image calculated by taking the average of the pixel values (luminance values) of the pixel group x in the average shape area BSA of the sample image SIw. Therefore, the average face image A ₀ (x) is a model representing the average face texture (appearance) in the average face shape. Note that the average face image A ₀ (x) is composed of the average shape area BSA and the mask area MA, similarly to the sample image SIw, and is calculated as an image having a size of 56 pixels × 56 pixels, for example.

テクスチャーモデルを表す上記式（２）において、Ａ_i（ｘ）はテクスチャーベクトルであり、λ_iはテクスチャーベクトルＡ_i（ｘ）の重みを表すテクスチャーパラメーターである。テクスチャーベクトルＡ_i（ｘ）は、顔のテクスチャーＡ（ｘ）の特性を表すベクトルであり、具体的には、主成分分析により得られる第ｉ主成分に対応する固有ベクトルである。すなわち、寄与率のより大きい主成分に対応する固有ベクトルから順に、累積寄与率に基づき設定された個数ｍの固有ベクトルが、テクスチャーベクトルＡ_i（ｘ）として採用される。本実施例では、最も寄与率の大きい第１主成分に対応する第１テクスチャーベクトルＡ₁（ｘ）は、顔色の変化（性別の差とも捉えられる）にほぼ相関するベクトルとなっている。 In the above equation (2) representing the texture model, A _i (x) is a texture vector, and λ _i is a texture parameter representing the weight of the texture vector A _i (x). The texture vector A _i (x) is a vector representing the characteristics of the facial texture A (x), and specifically, is an eigenvector corresponding to the i-th principal component obtained by principal component analysis. That is, the number m of eigenvectors set based on the cumulative contribution rate is adopted as the texture vector A _i (x) in order from the eigenvector corresponding to the principal component having the larger contribution rate. In the present embodiment, the first texture vector A ₁ (x) corresponding to the first principal component having the largest contribution rate is a vector that is substantially correlated with a change in face color (also regarded as a gender difference).

上記式（２）に示すように、本実施例におけるテクスチャーモデルでは、顔の見えを表す顔のテクスチャーＡ（ｘ）が、平均顔画像Ａ₀（ｘ）とｍ個のテクスチャーベクトルＡ_i（ｘ）の線形結合との和としてモデル化される。テクスチャーモデルにおいてテクスチャーパラメーターλ_iを適切に設定することにより、あらゆる画像における顔のテクスチャーＡ（ｘ）を再現することが可能である。なお、テクスチャーモデル設定ステップ（図２のステップＳ１４０）において設定された平均顔画像Ａ₀（ｘ）およびテクスチャーベクトルＡ_i（ｘ）は、ＡＡＭ情報ＡＭＩ（図１）として内部メモリー１２０に格納される。 As shown in the above equation (2), in the texture model in the present embodiment, the facial texture A (x) representing the appearance of the face is the average face image A ₀ (x) and m texture vectors A _i (x ) And the linear combination. By appropriately setting the texture parameter λ _i in the texture model, it is possible to reproduce the facial texture A (x) in any image. Note that the average face image A ₀ (x) and the texture vector A _i (x) set in the texture model setting step (step S140 in FIG. 2) are stored in the internal memory 120 as AAM information AMI (FIG. 1). .

以上説明したＡＡＭ設定処理（図２）により、顔の形状をモデル化する形状モデルと、顔のテクスチャーをモデル化するテクスチャーモデルが設定される。設定された形状モデルとテクスチャーモデルとを組み合わせることにより、すなわち合成されたテクスチャーＡ（ｘ）に対して平均形状ｓ₀から形状ｓへの変換（図８に示したワープＷの逆変換）を行うことにより、あらゆる顔画像の形状およびテクスチャーを再現することが可能である。 By the AAM setting process described above (FIG. 2), a shape model for modeling the face shape and a texture model for modeling the face texture are set. By combining the set shape model and the texture model, that is, the synthesized texture A (x) is converted from the average shape s ₀ to the shape s (inverse conversion of the warp W shown in FIG. 8). Thus, it is possible to reproduce the shape and texture of any face image.

Ａ３．顔特徴位置検出処理：
図１０は、第１実施例における顔特徴位置検出処理の流れを示すフローチャートである。本実施例における顔特徴位置検出処理は、ＡＡＭを利用して、注目画像に含まれる顔画像における特徴点ＣＰの配置を決定することにより、顔画像における特徴部位の位置を検出する処理である。上述したように、本実施例では、ＡＡＭ設定処理（図２）において、人物の顔の器官（眉毛、目、鼻、口）および顔の輪郭における計６８箇所の所定位置が、特徴部位として設定されている（図４参照）。そのため、本実施例の顔特徴位置検出処理では、人物の顔の器官および顔の輪郭における所定位置を表す６８個の特徴点ＣＰの位置を特定することで特徴部位の位置の検出をおこなう。 A3. Face feature position detection processing:
FIG. 10 is a flowchart showing the flow of face feature position detection processing in the first embodiment. The face feature position detection process in the present embodiment is a process for detecting the position of the feature part in the face image by determining the arrangement of the feature points CP in the face image included in the target image using AAM. As described above, in the present embodiment, in the AAM setting process (FIG. 2), a total of 68 predetermined positions in the human facial organs (eyebrows, eyes, nose, mouth) and facial contour are set as the characteristic parts. (See FIG. 4). Therefore, in the face feature position detection process of the present embodiment, the positions of the feature parts are detected by specifying the positions of 68 feature points CP representing predetermined positions in the organ and face contour of the person.

なお、顔特徴位置検出処理によって顔画像における特徴点ＣＰの配置が決定されると、顔画像についての形状パラメーターｐ_iや、テクスチャーパラメーターλ_iの値が特定される。従って、顔特徴位置検出処理の処理結果は、特定の表情（例えば笑顔や目を閉じた顔）の顔画像を検出するための表情判定や、特定の向き（例えば右向きや下向き）の顔画像を検出するための顔向き判定、顔の形状を変形する顔変形、顔の陰影補正等に利用可能である。 Note that when the arrangement of the feature points CP in the face image is determined by the face feature position detection process, the shape parameter p _i and the value of the texture parameter λ _i for the face image are specified. Therefore, the processing result of the face feature position detection process is a facial expression determination for detecting a facial image of a specific facial expression (for example, a smile or a face with closed eyes), or a facial image in a specific direction (for example, rightward or downward). It can be used for face orientation determination for detection, face deformation for deforming the face shape, face shadow correction, and the like.

はじめに、画像処理部２００（図１）は、顔特徴位置検出処理の対象となる注目画像を表す画像データを取得する（ステップＳ２１０）。本実施例のプリンター１００では、カードスロット１７２にメモリーカードＭＣが挿入されると、メモリーカードＭＣに格納された画像ファイルのサムネイル画像が表示部１５０に表示される。処理の対象となる１つまたは複数の画像は、操作部１４０を介してユーザにより選択される。画像処理部２００は、選択された１つまたは複数の画像に対応する画像データを含む画像ファイルをメモリーカードＭＣより取得して内部メモリー１２０の所定の領域に格納する。なお、取得された画像データを注目画像データと呼び、注目画像データにより表される画像を注目画像ＯＩと呼ぶものとする。 First, the image processing unit 200 (FIG. 1) acquires image data representing an attention image that is a target of face feature position detection processing (step S210). In the printer 100 of the present embodiment, when the memory card MC is inserted into the card slot 172, thumbnail images of image files stored in the memory card MC are displayed on the display unit 150. One or more images to be processed are selected by the user via the operation unit 140. The image processing unit 200 acquires an image file including image data corresponding to one or more selected images from the memory card MC and stores it in a predetermined area of the internal memory 120. The acquired image data is called attention image data, and an image represented by the attention image data is called attention image OI.

顔領域検出部２1０（図１）は、注目画像ＯＩに含まれる顔画像の少なくとも一部を含む画像領域を顔領域ＦＡとして検出する（ステップＳ２２０）。図１１は、顔領域ＦＡの検出処理の流れを示すフローチャートである。図１２は、注目画像ＯＩにおける顔領域ＦＡの検出を説明するための説明図である。図１２に示すように、顔領域検出部２1０は、予め規定された種々のサイズの正方形形状を有する複数のウィンドウＳＷのうちの１つを注目画像ＯＩに設定する（ステップＳ３００）。具体的には、顔領域検出部２１０は、まず、初期値として設定されているサイズのウィンドウＳＷを、初期位置として設定されている注目画像ＯＩ上の位置に設定する。 The face area detector 210 (FIG. 1) detects an image area including at least a part of the face image included in the target image OI as the face area FA (step S220). FIG. 11 is a flowchart showing a flow of processing for detecting the face area FA. FIG. 12 is an explanatory diagram for explaining the detection of the face area FA in the target image OI. As shown in FIG. 12, the face area detection unit 2 10 sets one of a plurality of windows SW having square shapes of various sizes defined in advance as the attention image OI (step S300). Specifically, the face area detection unit 210 first sets a window SW having a size set as an initial value at a position on the target image OI set as an initial position.

顔領域検出部２1０は、ウィンドウＳＷにより規定される画像領域から顔判定に用いるための評価値Ｔｖを算出する（ステップＳ３１０）。ここで、顔判定とは、ウィンドウＳＷにより規定される画像領域が顔の画像に対応する顔領域であるか否かの判定をいう。なお、本実施例では、顔判定は予め設定された特定顔傾き毎に実行される。すなわち、特定顔傾き毎に、ウィンドウＳＷにより規定される画像領域が当該特定顔傾き分だけ傾いた顔の画像に対応する画像領域であるか否かの判定が行われる。そのため、評価値Ｔｖも特定顔傾き毎に算出される。ここで、特定顔傾きとは、後述する図１６の下段に示すように、画像面内（インプレーン）における顔の画像の回転角度を意味している。本実施例では、特定顔傾きとして、画像の上下方向に沿って顔の画像が位置している状態（頭が上方向を向き顎が下方向を向いた状態）を基準（特定顔傾き＝０度）とし、顔の画像の傾きを時計回りに３０度ずつ増加させた計１２個の傾き（０度、３０度、６０度、・・・、３３０度）が設定されている。 The face area detecting unit 2110 calculates an evaluation value Tv to be used for face determination from the image area defined by the window SW (step S310). Here, the face determination means determination whether or not the image area defined by the window SW is a face area corresponding to the face image. In the present embodiment, face determination is executed for each specific face inclination set in advance. That is, for each specific face inclination, it is determined whether or not the image area defined by the window SW is an image area corresponding to a face image inclined by the specific face inclination. Therefore, the evaluation value Tv is also calculated for each specific face inclination. Here, the specific face inclination means the rotation angle of the face image in the image plane (in-plane) as shown in the lower part of FIG. In this embodiment, the specific face inclination is based on the state where the face image is positioned along the vertical direction of the image (the state where the head faces upward and the jaw faces downward) (specific face inclination = 0). 12 degrees (0 degrees, 30 degrees, 60 degrees,..., 330 degrees) are set by increasing the inclination of the face image by 30 degrees clockwise.

評価値Ｔｖの算出方法については特に限定はないが、本実施例では、評価値Ｔｖの算出にＮ個のフィルタ（フィルタ１〜フィルタＮ）が用いられる。図１３は、評価値Ｔｖの算出に用いられるフィルタを説明するための説明図である。各フィルタ（フィルタ１〜フィルタＮ）の外形はウィンドウＳＷと同じアスペクト比を有しており（すなわち正方形形状であり）、各フィルタにはプラス領域ｐａとマイナス領域ｍａとが設定されている。顔領域検出部２1０は、ウィンドウＳＷにより規定される画像領域にフィルタＸ（Ｘ＝１，２，・・・，Ｎ）を順に適用し、それぞれから評価値Ｔｖの基礎となる基礎評価値ｖＸ（ｖ１．ｖ２．・・・，ｖＮ）を算出する。具体的には、基礎評価値ｖＸは、フィルタＸのプラス領域ｐａに対応する画像領域に含まれる画素の輝度値の合計から、マイナス領域ｍａに対応する画像領域に含まれる画素の輝度値の合計を差し引いた値である。 The calculation method of the evaluation value Tv is not particularly limited, but in this embodiment, N filters (filter 1 to filter N) are used for calculating the evaluation value Tv. FIG. 13 is an explanatory diagram for explaining a filter used for calculating the evaluation value Tv. The external shape of each filter (filter 1 to filter N) has the same aspect ratio as that of the window SW (that is, has a square shape), and a positive region pa and a negative region ma are set for each filter. The face area detection unit 210 applies the filter X (X = 1, 2,..., N) in order to the image area defined by the window SW, and the basic evaluation value vX ( v1, v2,..., vN) are calculated. Specifically, the basic evaluation value vX is the sum of the luminance values of the pixels included in the image area corresponding to the minus area ma from the sum of the luminance values of the pixels included in the image area corresponding to the plus area pa of the filter X. Is the value obtained by subtracting.

顔領域検出部２1０は、算出した基礎評価値ｖＸと、各基礎評価値ｖＸ（ｖ１．ｖ２．・・・，ｖＮ）に対応して設定された閾値ｔｈＸ（ｔｈ１，ｔｈ２，・・・，ｔｈＮ）とをそれぞれ比較する。本実施例では、顔領域検出部２1０は、基礎評価値ｖＸが閾値ｔｈＸ以上となるフィルタＸでは、ウィンドウＳＷにより規定される画像領域が顔の画像に対応する画像領域であると判定し、フィルタＸの出力値として値「１」を設定する。一方、基礎評価値ｖＸが閾値ｔｈＸより小さいフィルタＸでは、ウィンドウＳＷにより規定される画像領域が顔の画像に対応するとは考えられない画像領域であると判定し、フィルタＸの出力値として値「０」を設定する。各フィルタＸには重み係数ＷｅＸ（Ｗｅ１，Ｗｅ２，・・・，ＷｅＮ）が設定されており、すべてのフィルタＸについての出力値と重み係数ＷｅＸとの積の合計を評価値Ｔｖとする。なお、顔判定に用いられるフィルタＸの態様や閾値ｔｈＸ、重み係数ＷｅＸ、後述の閾値ＴＨは、上記１２個の特定顔傾きのそれぞれについて予め設定されており、顔学習データＦＬＤ（図１）として内部メモリー１２０に格納されている。 The face area detection unit 2 10 calculates the calculated basic evaluation value vX and the threshold thX (th1, th2,..., ThN) set in correspondence with each basic evaluation value vX (v1.v2..., VN). ) And each. In the present embodiment, the face area detection unit 210 determines that the image area defined by the window SW is an image area corresponding to the face image in the filter X in which the basic evaluation value vX is greater than or equal to the threshold thX. The value “1” is set as the output value of X. On the other hand, in the filter X whose basic evaluation value vX is smaller than the threshold thX, it is determined that the image area defined by the window SW is an image area that is not considered to correspond to the face image, and the value “ Set to “0”. Weight coefficients WeX (We1, We2,..., WeN) are set for each filter X, and the sum of products of output values and weighting coefficients WeX for all filters X is set as an evaluation value Tv. Note that the aspect of the filter X used for face determination, the threshold thX, the weighting coefficient WeX, and the threshold TH described below are set in advance for each of the 12 specific face inclinations, and are used as face learning data FLD (FIG. 1). Stored in the internal memory 120.

顔領域検出部２1０は、算出された評価値Ｔｖと閾値ＴＨとを比較する（ステップＳ３２０）。顔領域検出部２1０は、ある特定顔傾きについて評価値Ｔｖが閾値ＴＨ以上である場合には（ステップＳ３２０：ＹＥＳ）、ウィンドウＳＷにより規定される画像領域は当該特定顔傾き分だけ傾いた顔の画像に対応する画像領域であるとして、ウィンドウＳＷにより規定される画像領域の位置、すなわち現在設定されているウィンドウＳＷの座標と、当該特定顔傾きと、を記憶する（ステップＳ３３０）。一方、いずれの特定顔傾きについても評価値Ｔｖが閾値ＴＨより小さい場合には、ステップＳ３３０の処理はスキップされる。 The face area detection unit 210 compares the calculated evaluation value Tv with the threshold value TH (step S320). When the evaluation value Tv is greater than or equal to the threshold value TH for a specific face inclination (step S320: YES), the face area detection unit 2110 determines that the image area defined by the window SW is a face inclined by the specific face inclination. As the image area corresponding to the image, the position of the image area defined by the window SW, that is, the coordinates of the currently set window SW and the specific face inclination are stored (step S330). On the other hand, if the evaluation value Tv is smaller than the threshold value TH for any specific face inclination, the process of step S330 is skipped.

図１４は、ウィンドウＳＷを移動させた状態を例示した説明図である。顔領域検出部２１０は、現在設定しているサイズのウィンドウＳＷにより注目画像ＯＩ全体をスキャンしたか否かを判定する（ステップＳ３４０）。まだ注目画像ＯＩの全体をスキャンしていない場合は（ステップＳ３４０：ＮＯ）、図１４に示すように、ウィンドウＳＷを所定の方向に所定の移動量だけ移動させる（ステップＳ３５０）。本実施例では、顔領域検出部２１０は、ウィンドウＳＷがウィンドウＳＷの水平方向の大きさの２割分の移動量で右方向に移動させるものとしている。また、ウィンドウＳＷをさらに右方向に移動させることができない位置に配置した場合には、ウィンドウＳＷを注目画像ＯＩの左端まで戻すと共に、ウィンドウＳＷの垂直方向の大きさの２割分の移動量で下方向に移動させるものとしている。ウィンドウＳＷをさらに下方向に移動させることができない位置に配置した場合には、注目画像ＯＩの全体をスキャンしたことになる。顔領域検出部２１０は、ウィンドウＳＷを移動させた後には、移動後のウィンドウＳＷについて、上述のステップＳ３１０以降の処理を実行する。 FIG. 14 is an explanatory diagram illustrating a state in which the window SW is moved. The face area detection unit 210 determines whether or not the entire image of interest OI has been scanned with the window SW having the currently set size (step S340). If the entire image of interest OI has not been scanned yet (step S340: NO), as shown in FIG. 14, the window SW is moved in a predetermined direction by a predetermined movement amount (step S350). In this embodiment, the face area detection unit 210 moves the window SW to the right by a movement amount corresponding to 20% of the horizontal size of the window SW. In addition, when the window SW is arranged at a position where it cannot be moved further to the right, the window SW is returned to the left end of the target image OI and is moved by 20% of the vertical size of the window SW. It is supposed to move downward. When the window SW is arranged at a position where it cannot be moved further downward, the entire target image OI is scanned. After moving the window SW, the face area detection unit 210 performs the processing after step S310 described above for the moved window SW.

顔領域検出部２１０は、現在設定しているサイズのウィンドウＳＷにより注目画像ＯＩの全体をスキャンしたと判定すると（ステップＳ３４０：ＹＥＳ）、設定（用意）されたすべてのサイズのウィンドウＳＷにより注目画像ＯＩをスキャンしたか否かを判定する（ステップＳ３６０）。顔領域検出部２１０は、使用していないウィンドウＳＷのサイズがある場合には（ステップＳ３６０：ＮＯ）、スキャンに用いるウィンドウＳＷのサイズを現在設定されているサイズの次に小さいサイズに変更する（ステップＳ３７０）。すなわち、本実施例では、顔領域検出部２１０は、最初に最大サイズのウィンドウＳＷによりスキャンをおこない、その後、順に小さいサイズのウィンドウＳＷを使用する。顔領域検出部２１０は、ウィンドウＳＷのサイズを変更した後には、変更後のサイズのウィンドウＳＷについて、上述のステップＳ３００以降の処理を実行する。 If the face area detection unit 210 determines that the entire attention image OI has been scanned by the window SW having the currently set size (step S340: YES), the attention image is displayed by the windows SW having all the sizes set (prepared). It is determined whether the OI has been scanned (step S360). When there is an unused window SW size (step S360: NO), the face area detection unit 210 changes the size of the window SW used for scanning to the next smaller size than the currently set size ( Step S370). In other words, in the present embodiment, the face area detection unit 210 first scans with the maximum size window SW, and then uses the smaller size windows SW in order. After the size of the window SW is changed, the face area detection unit 210 executes the processes after step S300 described above for the window SW having the changed size.

顔領域検出部２1０は、すべてのサイズのウィンドウＳＷによりスキャンを実施すると（ステップＳ３６０：ＹＥＳ）、顔領域設定処理を実行する（ステップＳ３８０）。顔領域検出部２１０は、評価値Ｔｖが閾値ＴＨ以上であると判定して記憶したウィンドウＳＷの座標と特定顔傾きとに基づき、顔の画像に対応する画像領域としての顔領域ＦＡを設定する。具体的には、特定顔傾きが０度である場合には、ウィンドウＳＷにより規定される画像領域が、そのまま顔領域ＦＡとして設定される。一方、特定顔傾きが０度以外である場合には、ウィンドウＳＷにより規定される画像領域を所定の点（例えばウィンドウＳＷの重心）を中心として時計回りに特定顔傾き分だけ回転させた画像領域が顔領域ＦＡとして設定される。 When the face area detection unit 2110 performs the scan using the windows SW of all sizes (step S360: YES), the face area detection unit 2110 executes a face area setting process (step S380). The face area detection unit 210 sets a face area FA as an image area corresponding to the face image based on the coordinates of the window SW and the specific face inclination which are determined and evaluated that the evaluation value Tv is equal to or greater than the threshold value TH. . Specifically, when the specific face inclination is 0 degree, the image area defined by the window SW is set as the face area FA as it is. On the other hand, when the specific face inclination is other than 0 degrees, the image area defined by the window SW is rotated clockwise by the specific face inclination around a predetermined point (for example, the center of gravity of the window SW). Is set as the face area FA.

顔領域検出部２１０は、ウィンドウＳＷにより規定される画像領域が顔の画像に対応する画像領域であると判定したウィンドウＳＷが複数存在する場合には、各ウィンドウＳＷにおける所定の点（例えばウィンドウＳＷの重心）の座標の平均の座標を重心とし、各ウィンドウＳＷのサイズの平均のサイズを有する１つの新たなウィンドウを顔領域ＦＡとして検出する。図１５は、顔の画像に対応する画像領域であると判定された複数のウィンドウＳＷを例示した説明図である。図１５（ａ）に示すように、例えば、互いに一部が重複する４つのウィンドウＳＷ（ＳＷ１〜ＳＷ４）により規定される画像領域が顔画像に対応する画像領域であると判定した場合には、図１５（ｂ）に示すように、４つのウィンドウＳＷのそれぞれの重心の座標の平均の座標を重心とし、４つのウィンドウＳＷのそれぞれのサイズの平均のサイズを有する１つのウィンドウを顔領域ＦＡとして設定する。 When there are a plurality of window SWs determined that the image area defined by the window SW is an image area corresponding to the face image, the face area detection unit 210 has a predetermined point (for example, the window SW) in each window SW. The average coordinate of the coordinates of the center of the window SW is used as the center of gravity, and one new window having the average size of the windows SW is detected as the face area FA. FIG. 15 is an explanatory diagram illustrating a plurality of windows SW that are determined to be image regions corresponding to a face image. As shown in FIG. 15A, for example, when it is determined that an image area defined by four windows SW (SW1 to SW4) partially overlapping each other is an image area corresponding to a face image, As shown in FIG. 15B, the average coordinate of the coordinates of the center of gravity of each of the four windows SW is taken as the center of gravity, and one window having the average size of the respective sizes of the four windows SW is taken as the face area FA. Set.

顔領域ＦＡを設定した後、顔領域検出部２１０は、顔領域信頼度を算出する（ステップＳ３９０）。顔領域信頼度は、顔領域ＦＡの検出過程に基づき算出される指標であって、検出された顔領域ＦＡが真に顔の画像に対応する画像領域であることの確からしさを表す指標である。顔領域ＦＡの検出処理では、顔の画像に対応しない画像領域、すなわち、顔の画像をまったく含まない画像領域や顔の画像の一部を含むが、顔の画像に真に対応する画像領域ではない画像領域が、誤って顔領域ＦＡとして検出される可能性がある。顔領域信頼度は、顔領域ＦＡの検出が、誤検出ではなく正しい検出であることの確からしさを表している。 After setting the face area FA, the face area detection unit 210 calculates the face area reliability (step S390). The face area reliability is an index that is calculated based on the detection process of the face area FA, and is an index that represents the probability that the detected face area FA is an image area that truly corresponds to a face image. . In the detection process of the face area FA, an image area that does not correspond to the face image, that is, an image area that does not include the face image at all or a part of the face image, but an image area that truly corresponds to the face image is included. There is a possibility that a missing image area is erroneously detected as the face area FA. The face area reliability represents the certainty that the detection of the face area FA is not a false detection but a correct detection.

本実施例では、重複ウィンドウ数を最大重複ウィンドウ数で除した値を顔領域信頼度として用いている。ここで、重複ウィンドウ数は、顔領域ＦＡを設定する際に参照したウィンドウＳＷの数、すなわち、ウィンドウＳＷにより規定される画像領域が顔の画像に対応する画像領域であると判定されたウィンドウＳＷの数である。例えば、図１５（ｂ）に示した顔領域ＦＡの設定の際には、図１５（ａ）に示した４つのウィンドウＳＷ（ＳＷ１〜ＳＷ４）が参照されているため、重複ウィンドウ数は４となる。また、最大重複ウィンドウ数は、顔領域ＦＡの検出の際に、注目画像ＯＩ上に配置されたすべてのウィンドウＳＷの内、少なくとも一部が顔領域ＦＡに重複するウィンドウＳＷの数である。最大重複ウィンドウ数は、ウィンドウＳＷの移動ピッチやサイズ変更のピッチにより一義的に定まる。重複ウィンドウ数と最大重複ウィンドウ数はいずれも顔領域ＦＡの検出過程において算出することができる。 In this embodiment, a value obtained by dividing the number of overlapping windows by the maximum number of overlapping windows is used as the face area reliability. Here, the number of overlapping windows is the number of windows SW referred to when setting the face area FA, that is, the window SW determined that the image area defined by the window SW is an image area corresponding to the face image. Is the number of For example, when the face area FA shown in FIG. 15B is set, the four windows SW (SW1 to SW4) shown in FIG. Become. The maximum number of overlapping windows is the number of windows SW that overlap at least a part of the face area FA among all the windows SW arranged on the target image OI when the face area FA is detected. The maximum number of overlapping windows is uniquely determined by the movement pitch of the window SW and the size change pitch. Both the number of overlapping windows and the maximum number of overlapping windows can be calculated in the process of detecting the face area FA.

検出された顔領域ＦＡが真に顔の画像に対応する画像領域である場合には、位置およびサイズが互いに近似する複数のウィンドウＳＷについて、ウィンドウＳＷにより規定される画像領域が顔の画像に対応する顔領域であると判定される可能性が高い。一方、検出された顔領域ＦＡが顔の画像に対応する画像領域ではなく誤検出である場合には、あるウィンドウＳＷについてはウィンドウＳＷにより規定される画像領域が顔の画像に対応する顔領域であると判定されたとしても、当該ウィンドウＳＷに位置およびサイズが近似する別のウィンドウＳＷについてはウィンドウＳＷにより規定される画像領域が顔の画像に対応する顔領域ではないと判定される可能性が高い。そのため、本実施例では、重複ウィンドウ数を最大重複ウィンドウ数で除した値を顔領域信頼度として用いている。上記により顔領域ＦＡの検出処理は終了する。 If the detected face area FA is an image area that truly corresponds to a face image, the image area defined by the window SW corresponds to the face image for a plurality of windows SW whose positions and sizes approximate each other. There is a high possibility that the face area is determined to be a face area. On the other hand, when the detected face area FA is not an image area corresponding to a face image but a false detection, for a certain window SW, the image area defined by the window SW is a face area corresponding to the face image. Even if it is determined that there is another window SW whose position and size approximate that window SW, there is a possibility that the image area defined by the window SW is not determined to be a face area corresponding to the face image. high. Therefore, in this embodiment, a value obtained by dividing the number of overlapping windows by the maximum number of overlapping windows is used as the face area reliability. The face area FA detection process is thus completed.

ここで、顔判定に用いられるフィルタＸの態様や閾値ｔｈＸ、重み係数ＷｅＸ、後述の閾値ＴＨを含む顔学習データＦＬＤについて説明する。顔学習データＦＬＤは、サンプル画像を用いた学習によって設定される。図１６は、学習に用いられるサンプル画像の一例を示す説明図である。学習には、顔の画像に対応した画像であることが予めわかっている複数の顔サンプル画像によって構成された顔サンプル画像群と、顔の画像に対応した画像ではないことが予めわかっている複数の非顔サンプル画像によって構成された非顔サンプル画像群と、が用いられる。学習による顔学習データＦＬＤの設定は特定顔傾き毎に実行されるため、図１６に示すように、顔サンプル画像群は、１２個の特定顔傾きのそれぞれに対応したものが準備される。各特定顔傾きに対応した顔サンプル画像群は、画像サイズに対する顔の画像の大きさの比が所定の値の範囲内であると共に顔の画像の傾きが特定顔傾きに等しい複数の基本顔サンプル画像と、基本顔サンプル画像を例えば１．２倍から０．８倍までの範囲の所定の倍率で拡大および縮小した画像（例えば図１６における画像ＦＩａおよびＦＩｂ）や、基本顔サンプル画像を時計回りおよび反時計回りに例えば１５度の範囲で所定の角度だけ回転させた画像（例えば図１６における画像ＦＩｃおよびＦＩｄ）を含む。サンプル画像を用いた学習は、例えばニューラルネットワークを用いた方法や、ブースティング（例えばアダブースティング）を用いた方法、サポートベクターマシーンを用いた方法等により実行される。例えば学習がニューラルネットワークを用いた方法により実行される場合には、各フィルタＸ（フィルタ１〜フィルタＮ）について、ある特定顔傾きに対応した顔サンプル画像群と非顔サンプル画像群とに含まれるすべてのサンプル画像を用いて基礎評価値ｖＸ（ｖ１〜ｖＮ）が算出され、所定の顔検出率を達成する閾値ｔｈＸ（ｔｈ１〜ｔｈＮ）が設定される。また、各フィルタＸに設定された重み係数ＷｅＸ（Ｗｅ１〜ＷｅＮ）の初期値が設定され、顔サンプル画像群および非顔サンプル画像群の中から選択された１つのサンプル画像についての評価値Ｔｖが算出される。学習においては、算出された評価値Ｔｖにより、後述する顔判定をおこなった場合の結果の正誤に基づき、各フィルタＸに設定された重み係数ＷｅＸの値が修正される。上記処理が特定顔傾き毎に実行されることにより、特定顔傾き毎の顔学習データＦＬＤが設定される。 Here, the aspect of the filter X used for face determination, the threshold value thX, the weighting coefficient WeX, and face learning data FLD including a threshold value TH described later will be described. The face learning data FLD is set by learning using a sample image. FIG. 16 is an explanatory diagram illustrating an example of a sample image used for learning. For learning, a face sample image group composed of a plurality of face sample images that are known in advance to be images corresponding to face images, and a plurality of information that is known in advance to be images that do not correspond to face images. A non-face sample image group composed of non-face sample images. Since the setting of the face learning data FLD by learning is executed for each specific face inclination, as shown in FIG. 16, a face sample image group corresponding to each of 12 specific face inclinations is prepared. The face sample image group corresponding to each specific face inclination includes a plurality of basic face samples in which the ratio of the face image size to the image size is within a predetermined value range and the face image inclination is equal to the specific face inclination. An image and an image obtained by enlarging and reducing the basic face sample image at a predetermined magnification ranging from 1.2 times to 0.8 times (for example, images FIa and FIb in FIG. 16), and the basic face sample image clockwise And images rotated counterclockwise by a predetermined angle within a range of, for example, 15 degrees (for example, images FIc and FId in FIG. 16). Learning using a sample image is executed by, for example, a method using a neural network, a method using boosting (for example, adaboost), a method using a support vector machine, or the like. For example, when learning is performed by a method using a neural network, each filter X (filter 1 to filter N) is included in a face sample image group and a non-face sample image group corresponding to a specific face inclination. A basic evaluation value vX (v1 to vN) is calculated using all the sample images, and a threshold thX (th1 to thN) for achieving a predetermined face detection rate is set. Also, initial values of the weighting factors WeX (We1 to WeN) set for each filter X are set, and an evaluation value Tv for one sample image selected from the face sample image group and the non-face sample image group is set. Calculated. In learning, the value of the weighting coefficient WeX set for each filter X is corrected based on the correctness / incorrectness of the result when face determination described later is performed based on the calculated evaluation value Tv. By executing the above process for each specific face inclination, face learning data FLD for each specific face inclination is set.

顔領域ＦＡの検出処理の後、初期位置設定部２３０（図１）は、注目画像ＯＩに対する特徴点ＣＰの初期位置を設定する（ステップＳ２３０）。図１７は、第１実施例における特徴点ＣＰの初期位置設定処理の流れを示すフローチャートである。はじめに、初期位置設定部２３０の取得部２３２は、顔領域検出情報を取得する（ステップＳ４００）。顔領域検出情報とは、顔領域検出部２1０による顔領域ＦＡの検出に関連する情報をいう。例えば、予め設定されている特定顔傾きの設定数（１２個）、設定角度（０度、３０度、６０度、・・・、３３０度）、設定間隔（３０度）や、顔領域ＦＡの検出処理の統計的なデータに基づく顔領域ＦＡにおける顔画像の位置の傾向などのほか、顔領域ＦＡの検出処理において、顔領域検出部２1０が注目画像ＯＩから顔領域ＦＡを検出した際に特定される情報も含まれる。具体的には、検出した顔領域ＦＡの特定顔傾きや、顔領域信頼度などである。本実施例では、顔領域検出情報として、特定顔傾きの設定間隔と、検出した顔領域ＦＡの特定顔傾きと、顔領域信頼度を用いた例について説明する。取得部２３２は、顔領域ＦＡの検出処理が実行されると、検出された顔領域ＦＡについての特定顔傾きと、顔領域信頼度を取得する。 After the face area FA detection process, the initial position setting unit 230 (FIG. 1) sets the initial position of the feature point CP with respect to the target image OI (step S230). FIG. 17 is a flowchart showing the flow of the initial position setting process of the feature point CP in the first embodiment. First, the acquisition unit 232 of the initial position setting unit 230 acquires face area detection information (step S400). The face area detection information refers to information related to the detection of the face area FA by the face area detection unit 210. For example, the number of preset specific face inclinations (12), the setting angle (0 degrees, 30 degrees, 60 degrees,..., 330 degrees), the setting interval (30 degrees), the face area FA In addition to the tendency of the position of the face image in the face area FA based on the statistical data of the detection process, it is specified when the face area detection unit 210 detects the face area FA from the target image OI in the face area FA detection process. Information is also included. Specifically, the specific face inclination of the detected face area FA, the face area reliability, and the like. In the present embodiment, an example will be described in which the specific face tilt setting interval, the specific face tilt of the detected face area FA, and the face area reliability are used as the face area detection information. When the detection process of the face area FA is executed, the acquisition unit 232 acquires the specific face inclination and the face area reliability for the detected face area FA.

初期位置候補設定部２３４は、顔領域検出情報に基づいて、特徴点ＣＰを注目画像ＯＩ上の仮設定位置に設定する（ステップＳ４１０）。ここでは、特定顔傾きの設定間隔が３０度、検出した顔領域ＦＡの特定顔傾きが０度、顔領域信頼度が所定値以上である場合を例にして具体的に説明する。初期位置候補設定部２３４は、顔領域ＦＡに対する顔画像の大きさ、傾き、位置（上下方向の位置および左右方向の位置）を表すグローバルパラメーターの値を種々変更することにより、特徴点ＣＰを注目画像ＯＩ上に設定する。仮設定位置は、特許請求の範囲における「初期位置の候補」に該当する。 The initial position candidate setting unit 234 sets the feature point CP as a temporary setting position on the target image OI based on the face area detection information (step S410). Here, the specific face tilt setting interval is 30 degrees, the specific face tilt of the detected face area FA is 0 degrees, and the face area reliability is more than a predetermined value, an example will be specifically described. The initial position candidate setting unit 234 pays attention to the feature point CP by changing various values of global parameters representing the size, inclination, and position (vertical position and horizontal position) of the face image with respect to the face area FA. Set on image OI. The temporarily set position corresponds to an “initial position candidate” in the claims.

図１８は、グローバルパラメーターの値を変更することによる特徴点ＣＰの仮設定位置を例示した説明図である。図１８（ａ）および図１８（ｂ）には、注目画像ＯＩにおける特徴点ＣＰと、特徴点ＣＰをつないで形成されるメッシュが示されている。初期位置候補設定部２３４は、図１８（ａ）および図１８（ｂ）の中央に示すように、顔領域ＦＡの中央部に平均形状ｓ₀が形成されるような特徴点ＣＰの仮設定位置（以下、「基準仮設定位置」とも呼ぶ）を設定する。 FIG. 18 is an explanatory view exemplifying a temporary setting position of the feature point CP by changing the value of the global parameter. 18A and 18B show a mesh formed by connecting feature points CP and feature points CP in the target image OI. The initial position candidate setting unit 234 temporarily sets the feature point CP such that the average shape s ₀ is formed at the center of the face area FA, as shown in the center of FIGS. 18 (a) and 18 (b). (Hereinafter also referred to as “reference temporary setting position”).

初期位置候補設定部２３４は、また、基準仮設定位置に対して、グローバルパラメーターの値を種々変更させた複数の仮設定位置を設定する。グローバルパラメーター（大きさ、傾き、上下方向の位置および左右方向の位置）を変更することは、注目画像ＯＩにおいて特徴点ＣＰにより形成されるメッシュが拡大・縮小、傾きを変更、並行移動することに相当する。従って、初期位置候補設定部２３４は、図１８（ａ）に示すように、基準仮設定位置のメッシュを所定倍率で拡大または縮小したメッシュを形成するような仮設定位置（基準仮設定位置の図の下および上に示す）や、所定角度だけ時計回りまたは半時計回りに傾きを変更したメッシュを形成するような仮設定位置（基準仮設定位置の図の右および左に示す）を設定する。また、初期位置候補設定部２３４は、基準仮設定位置のメッシュに対して、拡大・縮小および傾きの変更を組み合わせた変換を行ったメッシュを形成するような仮設定位置（基準仮設定位置の図の左上、左下、右上、右下に示す）も設定する。 The initial position candidate setting unit 234 also sets a plurality of temporary setting positions obtained by changing various global parameter values with respect to the reference temporary setting position. Changing the global parameters (size, inclination, vertical position and horizontal position) means that the mesh formed by the feature point CP in the target image OI is enlarged / reduced, the inclination is changed, and the parallel movement is performed. Equivalent to. Accordingly, as shown in FIG. 18A, the initial position candidate setting unit 234 forms a temporary setting position (reference temporary setting position diagram) that forms a mesh obtained by enlarging or reducing the reference temporary setting position mesh at a predetermined magnification. And a temporary setting position (shown on the right and left in the drawing of the reference temporary setting position) that forms a mesh whose inclination is changed clockwise or counterclockwise by a predetermined angle. In addition, the initial position candidate setting unit 234 forms a temporary setting position (a figure of the reference temporary setting position) that forms a mesh obtained by performing a combination of enlargement / reduction and inclination change on the reference temporary setting position mesh. (Shown in the upper left, lower left, upper right, and lower right).

また、図１８（ｂ）に示すように、初期位置候補設定部２３４は、基準仮設定位置のメッシュを所定量だけ上または下に並行移動したメッシュを形成するような仮設定位置（基準仮設定位置の図の上および下に示す）や、左または右に並行移動したメッシュを形成するような仮設定位置（基準仮設定位置の図の左および右に示す）を設定する。また、初期位置候補設定部２３４は、基準仮設定位置のメッシュに対して、上下および左右の並行移動を組み合わせた変換を行ったメッシュを形成するような仮設定位置（基準仮設定位置の図の左上、左下、右上、右下に示す）も設定する。 Further, as shown in FIG. 18B, the initial position candidate setting unit 234 forms a temporary setting position (reference temporary setting) that forms a mesh that is translated upward or downward by a predetermined amount from the mesh at the reference temporary setting position. A temporary setting position (shown on the left and right in the drawing of the reference temporary setting position) that forms a mesh moved in parallel to the left or right. In addition, the initial position candidate setting unit 234 forms a temporary setting position (a reference temporary setting position in the diagram of the reference temporary setting position) that forms a mesh obtained by performing a combination of vertical and left and right parallel movements on the mesh of the reference temporary setting position. Also set (upper left, lower left, upper right, lower right).

ここで、検出した顔領域ＦＡの特定顔傾きが０度、特定顔傾きの設定間隔が３０度である場合、メッシュの回転角度は、−１５度から１５度までの範囲に設定される。すなわち、初期位置候補設定部２３４は、特定顔傾きの設定角度（０度、３０度、６０度、・・・、３３０度）のうち、検出した顔領域ＦＡの特定顔傾き（０度）と隣接する設定角度（−３０度および３０度）との一方の中間値（−１５度）から他方の中間値（１５度）までの範囲を、メッシュの回転角度の範囲として設定する。 Here, when the specific face inclination of the detected face area FA is 0 degree and the setting interval of the specific face inclination is 30 degrees, the rotation angle of the mesh is set in a range from −15 degrees to 15 degrees. That is, the initial position candidate setting unit 234 determines the specific face inclination (0 degree) of the detected face area FA among the specific face inclination setting angles (0 degrees, 30 degrees, 60 degrees,..., 330 degrees). A range from one intermediate value (-15 degrees) to the adjacent setting angle (-30 degrees and 30 degrees) to the other intermediate value (15 degrees) is set as a range of mesh rotation angles.

図１９は、顔領域ＦＡの特定顔傾きが３０度の場合における特徴点ＣＰの仮設定位置を例示した説明図である。図１９に示すように、検出した顔領域ＦＡの特定顔傾きが３０度、特定顔傾きの設定間隔が３０度である場合、メッシュの回転角度は、１５度から４５度までの範囲に設定される。すなわち、初期位置候補設定部２３４は、特定顔傾きの設定角度（０度、３０度、６０度、・・・、３３０度）のうち、検出した顔領域ＦＡの特定顔傾き（３０度）と隣接する設定角度（０度および６０度）との一方の中間値（１５度）から他方の中間値（４５度）までの範囲を、メッシュの回転角度の範囲として設定する。いいかえれば、初期位置候補設定部２３４は、予め設定されている特定傾きが０度の場合における仮設定位置に対して、検出された顔領域ＦＡの特定傾きの角度分それぞれ傾けて設定する。 FIG. 19 is an explanatory diagram illustrating the temporarily set position of the feature point CP when the specific face inclination of the face area FA is 30 degrees. As shown in FIG. 19, when the specific face inclination of the detected face area FA is 30 degrees and the specific face inclination setting interval is 30 degrees, the mesh rotation angle is set in the range of 15 degrees to 45 degrees. The That is, the initial position candidate setting unit 234 determines the specific face inclination (30 degrees) of the detected face area FA from the specific face inclination setting angles (0 degrees, 30 degrees, 60 degrees,..., 330 degrees). A range from one intermediate value (15 degrees) to the adjacent setting angle (0 degrees and 60 degrees) to the other intermediate value (45 degrees) is set as a mesh rotation angle range. In other words, the initial position candidate setting unit 234 sets each of the detected specific inclination angles of the face area FA with respect to the provisional setting position when the specific inclination is 0 degree.

初期位置候補設定部２３４は、図１８（ａ）に示す基準仮設定位置以外の８つの仮設定位置のそれぞれにおけるメッシュに対して図１８（ｂ）に示す上下左右の並行移動が実行される仮設定位置も設定する。従って、本実施例では、４つのグローバルパラメーター（大きさ、傾き、上下方向の位置、左右方向の位置）をそれぞれ既知の３段階の値として組み合わせにより設定される８０通り（＝３×３×３×３−１）の仮設定位置と、基準仮設定位置の合計８１通りの仮設定位置が設定される。 The initial position candidate setting unit 234 temporarily performs vertical and horizontal parallel movements shown in FIG. 18B on the mesh at each of the eight temporary setting positions other than the reference temporary setting positions shown in FIG. The setting position is also set. Therefore, in the present embodiment, the four global parameters (size, inclination, vertical position, horizontal position) are set in 80 ways (= 3 × 3 × 3) that are set as known three-level values. A total of 81 temporary setting positions of (3-1) temporary setting positions and reference temporary setting positions are set.

初期位置候補設定部２３４は、顔領域信頼度が閾値以上の場合、上述のとおり８１通りの仮設定位置を設定するが、顔領域信頼度が閾値より小さい場合には、顔領域信頼度が閾値以上の場合より多くの仮設定位置を設定する。具体的には、上述では、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動、についてそれぞれ３段階に変化させて仮設定位置が設定されているが、初期位置候補設定部２３４は、顔領域信頼度と閾値との比較結果に基づいて、設定する仮設定位置のそれぞれの変化（回転、拡大・縮小、上下移動、左右移動）の段階数を決定する。図２０は、仮設定位置の変化の段階数を説明するための説明図である。初期位置候補設定部２３４は、顔領域信頼度が閾値以上である場合には、図１８に示すように、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動のそれぞれについて３段階に変化させて仮設定位置を設定する。一方、初期位置候補設定部２３４は、顔領域信頼度が閾値より小さい場合には、図２０に一例を示すように、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動のそれぞれについて５段階に変化させて仮設定位置を設定する。 The initial position candidate setting unit 234 sets 81 temporarily set positions as described above when the face area reliability is greater than or equal to the threshold, but when the face area reliability is smaller than the threshold, the face area reliability is the threshold. More temporary setting positions are set than in the above case. Specifically, in the above description, the temporary setting position is set by changing the rotation and enlargement / reduction of the mesh, the vertical movement and the horizontal movement of the mesh in three stages, but the initial position candidate setting unit 234 Based on the comparison result between the face area reliability and the threshold value, the number of stages of change (rotation, enlargement / reduction, vertical movement, horizontal movement) of each temporary setting position to be set is determined. FIG. 20 is an explanatory diagram for explaining the number of stages of change of the temporarily set position. When the face area reliability is greater than or equal to the threshold value, the initial position candidate setting unit 234 changes in three stages for mesh rotation and enlargement / reduction, mesh vertical movement, and horizontal movement as shown in FIG. To set the temporary setting position. On the other hand, when the face area reliability is smaller than the threshold value, the initial position candidate setting unit 234 sets 5 for each of mesh rotation and enlargement / reduction, mesh vertical movement, and horizontal movement, as shown in FIG. Change the stage to set the temporary setting position.

メッシュを設定する範囲は、３段階の場合と５段階の場合で同一の範囲であってもよいし異なる範囲であってもよい。すなわち、図２０に示すように、５段階の場合であっても図１８に示す３段階の場合と同一の範囲（−１５度〜１５度）であり、メッシュの回転角度が−１５度、０度、１５度、の３段階であったものを、−１５度、−７．５度、０度、７．５度、１５度の５段階に細分化して設定してもよいし、メッシュの回転角度を−３０度、−１５度、０度、１５度、３０度の５段階として３段階の場合に比べ広い範囲に設定してもよい。また、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動のすべてを５段階に設定する必要はなく、一部を５段階とし、ほかを３段階として設定してもよい。 The range for setting the mesh may be the same range or a different range in the case of three stages and in the case of five stages. That is, as shown in FIG. 20, even in the case of 5 steps, the same range (−15 to 15 degrees) as in the case of 3 steps shown in FIG. 18, and the rotation angle of the mesh is −15 degrees, 0 Degrees, 15 degrees, and 3 levels may be subdivided into 5 levels of -15 degrees, -7.5 degrees, 0 degrees, 7.5 degrees, and 15 degrees. The rotation angle may be set to a wide range as compared with the case of three stages, with five stages of −30 degrees, −15 degrees, 0 degrees, 15 degrees, and 30 degrees. Further, it is not necessary to set all of the mesh rotation, enlargement / reduction, mesh up / down movement, and left / right movement in five stages, some of which may be set in five stages, and others may be set in three stages.

生成部２３６は、設定された各仮設定位置に対応する平均形状画像Ｉ（Ｗ（ｘ；ｐ））を生成する（ステップＳ４２０）。図２１は、平均形状画像Ｉ（Ｗ（ｘ；ｐ））の一例を示す説明図である。平均形状画像Ｉ（Ｗ（ｘ；ｐ））は、入力画像における特徴点ＣＰの配置が平均形状ｓ₀における特徴点ＣＰの配置と等しくなるような変換によって算出される。 The generation unit 236 generates an average shape image I (W (x; p)) corresponding to each set temporary setting position (step S420). FIG. 21 is an explanatory diagram showing an example of the average shape image I (W (x; p)). The average shape image I (W (x; p)) is calculated by conversion such that the arrangement of the feature points CP in the input image is equal to the arrangement of the feature points CP in the average shape s ₀ .

平均形状画像Ｉ（Ｗ（ｘ；ｐ））を算出するための変換は、ＡＡＭ設定処理において、サンプル画像ＳＩｗ算出のための変換と同様に、三角形領域ＴＡ毎のアフィン変換の集合であるワープＷにより行われる。具体的には、注目画像ＯＩに配置された特徴点ＣＰによって、外周に位置する特徴点ＣＰ（フェイスラインおよび眉毛、眉間に対応する特徴点ＣＰ）を結ぶ直線により囲まれた領域である平均形状領域ＢＳＡが特定され、注目画像ＯＩにおける平均形状領域ＢＳＡに対して三角形領域ＴＡ毎のアフィン変換が行われることにより、平均形状画像Ｉ（Ｗ（ｘ；ｐ））が算出される。本実施例では、平均形状画像Ｉ（Ｗ（ｘ；ｐ））は、平均顔画像Ａ₀（ｘ）と同様に平均形状領域ＢＳＡおよびマスク領域ＭＡにより構成され、平均顔画像Ａ₀（ｘ）と同サイズの画像として算出される。 The transformation for calculating the average shape image I (W (x; p)) is a warp W that is a set of affine transformations for each triangular area TA in the AAM setting process, similarly to the transformation for calculating the sample image SIw. Is done. Specifically, an average shape that is an area surrounded by a straight line connecting feature points CP (feature points CP corresponding to face lines, eyebrows, and eyebrows) located on the outer periphery by feature points CP arranged in the target image OI The area BSA is specified, and the average shape image I (W (x; p)) is calculated by performing affine transformation for each triangular area TA on the average shape area BSA in the target image OI. In this embodiment, the average shape image I (W (x; p) ) is composed of an average face image A ₀ (x) and an average shape area BSA and a mask area MA, average face image A ₀ (x) And is calculated as an image of the same size.

ここで、平均形状ｓ₀における平均形状領域ＢＳＡに位置する画素の集合を画素群ｘと表す。また、ワープＷ実行後の画像（平均形状ｓ₀を有する顔画像）における画素群ｘに対応するワープＷ実行前の画像（注目画像ＯＩの平均形状領域ＢＳＡ）における画素群をＷ（ｘ；ｐ）と表す。平均形状画像は、注目画像ＯＩの平均形状領域ＢＳＡにおける画素群Ｗ（ｘ；ｐ）のそれぞれにおける輝度値により構成される画像であるため、Ｉ（Ｗ（ｘ；ｐ））と表される。図２１には、図１８（ａ）に示した９個の仮設定位置に対応する９個の平均形状画像Ｉ（Ｗ（ｘ；ｐ））を示している。 Here, a set of pixels located in the average shape area BSA in the average shape s ₀ is represented as a pixel group x. In addition, the pixel group in the image before the warp W execution (the average shape area BSA of the target image OI) corresponding to the pixel group x in the image after the warp W execution (face image having the average shape s ₀ ) is represented by W (x; p ). Since the average shape image is an image composed of luminance values in each of the pixel groups W (x; p) in the average shape region BSA of the target image OI, it is represented as I (W (x; p)). FIG. 21 shows nine average shape images I (W (x; p)) corresponding to the nine temporarily set positions shown in FIG.

算出部２３８は、各仮設定位置に対応する平均形状画像Ｉ（Ｗ（ｘ；ｐ））と平均顔画像Ａ₀（ｘ）との差分画像Ｉｅを算出する（ステップＳ４３０）。差分画像Ｉｅは、平均形状画像Ｉ（Ｗ（ｘ；ｐ））と平均顔画像Ａ₀（ｘ）の各画素値の差であり、本実施例では差分値とも呼ぶ。差分画像Ｉｅは、特徴点ＣＰの設定位置が、特徴部位の位置と一致している場合には表れないため、特徴点ＣＰの設定位置と、特徴部位の位置と差異を表している。本実施例では、特徴点ＣＰの仮設定位置は８１種類設定されているため、算出部２３８は、８１個の差分画像Ｉｅを算出することとなる。 The calculation unit 238 calculates a difference image Ie between the average shape image I (W (x; p)) corresponding to each temporarily set position and the average face image A ₀ (x) (step S430). The difference image Ie is a difference between pixel values of the average shape image I (W (x; p)) and the average face image A ₀ (x), and is also referred to as a difference value in this embodiment. Since the difference image Ie does not appear when the setting position of the feature point CP matches the position of the feature part, the difference image Ie represents the difference between the setting position of the feature point CP and the position of the feature part. In the present embodiment, since 81 types of temporarily set positions of feature points CP are set, the calculation unit 238 calculates 81 difference images Ie.

初期位置設定部２３０は、各差分画像Ｉｅの画素値からノルムを算出し、ノルムの値が最も小さい差分画像Ｉｅに対応する仮設置位置（以下「ノルム最小仮設定位置」とも呼ぶ）を、注目画像ＯＩにおける特徴点ＣＰの初期位置として設定する（ステップＳ４４０）。本実施例において、ノルムを算出するための画素値は輝度値であってもよいしＲＧＢ値であってもよい。以上により特徴点ＣＰ初期位置設定処理が完了する。 The initial position setting unit 230 calculates a norm from the pixel values of each difference image Ie, and pays attention to a temporary installation position corresponding to the difference image Ie having the smallest norm value (hereinafter also referred to as “norm minimum temporary setting position”). The initial position of the feature point CP in the image OI is set (step S440). In this embodiment, the pixel value for calculating the norm may be a luminance value or an RGB value. Thus, the feature point CP initial position setting process is completed.

特徴点ＣＰ初期位置設定処理が完了すると、特徴位置検出部２２０は、注目画像ＯＩにおける特徴点ＣＰの設定位置の補正を行う（ステップＳ２４０）。図２２は、第１実施例における特徴点ＣＰ設定位置補正処理の流れを示すフローチャートである。 When the feature point CP initial position setting process is completed, the feature position detection unit 220 corrects the setting position of the feature point CP in the target image OI (step S240). FIG. 22 is a flowchart showing the flow of the feature point CP setting position correction process in the first embodiment.

特徴位置検出部２２０は、注目画像ＯＩから平均形状画像Ｉ（Ｗ（ｘ；ｐ））を算出する（ステップＳ５１０）。平均形状画像Ｉ（Ｗ（ｘ；ｐ））の算出方法は、特徴点ＣＰ初期位置設定処理におけるステップＳ４２０と同様である。 The feature position detection unit 220 calculates the average shape image I (W (x; p)) from the attention image OI (step S510). The calculation method of the average shape image I (W (x; p)) is the same as that in step S420 in the feature point CP initial position setting process.

特徴位置検出部２２０は、平均形状画像Ｉ（Ｗ（ｘ；ｐ））と平均顔画像Ａ₀（ｘ）との差分画像Ｉｅを算出する（ステップＳ５２０）。特徴位置検出部２２０は、差分画像Ｉｅに基づき、特徴点ＣＰの設定位置補正処理が収束したか否かを判定する（ステップＳ５３０）。特徴位置検出部２２０は、差分画像Ｉｅのノルムを算出し、ノルムの値が予め設定された閾値より小さい場合には収束したと判定し、ノルムの値が閾値以上の場合には未だ収束していないと判定する。 The feature position detection unit 220 calculates a difference image Ie between the average shape image I (W (x; p)) and the average face image A ₀ (x) (step S520). The feature position detection unit 220 determines whether or not the feature point CP setting position correction process has converged based on the difference image Ie (step S530). The feature position detection unit 220 calculates the norm of the difference image Ie, determines that it has converged if the norm value is smaller than a preset threshold value, and still converges if the norm value is greater than or equal to the threshold value. Judge that there is no.

なお、特徴位置検出部２２０は、算出された差分画像Ｉｅのノルムの値が前回のステップＳ５２０において算出された値よりも小さい場合には収束したと判定し、前回値以上である場合には未だ収束していないと判定するものとしてもよい。あるいは、特徴位置検出部２２０は、閾値による判定と前回値との比較による判定とを組み合わせて収束判定を行うものとしてもよい。例えば、特徴位置検出部２２０は、算出されたノルムの値が、閾値より小さく、かつ、前回値より小さい場合にのみ収束したと判定し、それ以外の場合には未だ収束していないと判定するものとしてもよい。 The feature position detection unit 220 determines that the calculated norm value of the difference image Ie has converged when the value is smaller than the value calculated in the previous step S520, and still determines that the value is equal to or greater than the previous value. It is good also as what determines with not having converged. Alternatively, the feature position detection unit 220 may perform the convergence determination by combining the determination based on the threshold and the determination based on comparison with the previous value. For example, the feature position detection unit 220 determines that the calculated norm value has converged only when the calculated norm value is smaller than the threshold value and smaller than the previous value, and otherwise determines that it has not yet converged. It may be a thing.

上記のステップＳ５３０の収束判定において未だ収束していないと判定された場合には、補正部２２２は、パラメーター更新量ΔＰを算出する（ステップＳ５４０）。パラメーター更新量ΔＰは、４個のグローバルパラメーター（全体としての大きさ、傾き、Ｘ方向位置、Ｙ方向位置）、および、ＡＡＭ設定処理により算出されるｎ個の形状パラメーターｐ_i（式（１）参照）の値の変更量を意味している。なお、特徴点ＣＰを初期位置に設定した直後においては、グローバルパラメーターは、特徴点ＣＰ初期位置設定処理において決定された値が設定されている。また、このときの特徴点ＣＰの初期位置と平均形状ｓ₀の特徴点ＣＰの設定位置との相違は、全体としての大きさ、傾き、位置の相違に限られるため、形状モデルにおける形状パラメーターｐ_iの値はすべてゼロである。 When it is determined in the convergence determination in step S530 that the convergence has not yet occurred, the correction unit 222 calculates the parameter update amount ΔP (step S540). The parameter update amount ΔP includes four global parameters (total size, inclination, X direction position, Y direction position), and n shape parameters p _i calculated by the AAM setting process (formula (1)) This means the amount of change in the value of (see). Note that immediately after the feature point CP is set to the initial position, the value determined in the feature point CP initial position setting process is set as the global parameter. Further, difference between the initial position of the characteristic point CP and sets the position of the characteristic points CP of the average shape s ₀ in this case, the overall size, the tilt, because it is limited to the difference in position, shape parameters in the shape model p The values of _i are all zero.

パラメーター更新量ΔＰは、下記の式（３）により算出される。すなわち、パラメーター更新量ΔＰは、アップデートマトリックスＲと差分画像Ｉｅとの積である。 The parameter update amount ΔP is calculated by the following equation (3). That is, the parameter update amount ΔP is a product of the update matrix R and the difference image Ie.

式（３）におけるアップデートマトリックスＲは、差分画像Ｉｅに基づきパラメーター更新量ΔＰを算出するために予め学習により設定されたＭ行Ｎ列のマトリックスであり、ＡＡＭ情報ＡＭＩ（図１）として内部メモリー１２０に格納されている。本実施例では、アップデートマトリックスＲの行数Ｍは、グローバルパラメーターの数（４個）と、形状パラメーターｐ_iの数（ｎ個）との和（（４＋ｎ）個）に等しく、列数Ｎは、平均顔画像Ａ₀（ｘ）の平均形状領域ＢＳＡ内の画素数（５６画素×５６画素−マスク領域ＭＡの画素数）に等しい。アップデートマトリックスＲは、下記の式（４）および（５）により算出される。 The update matrix R in the expression (3) is a matrix of M rows and N columns set in advance by learning in order to calculate the parameter update amount ΔP based on the difference image Ie, and the internal memory 120 as the AAM information AMI (FIG. 1). Stored in In this embodiment, the number of rows M in the update matrix R is equal to the sum ((4 + n)) of the number of global parameters (4) and the number of shape parameters p _i (n), and the number of columns N is , average face image a ₀ average number of pixels in the shape area BSA of (x) - is equal to (56 pixels × 56 pixels the number of pixels in the mask area MA). The update matrix R is calculated by the following formulas (4) and (5).

補正部２２２は、算出されたパラメーター更新量ΔＰに基づきパラメーター（４個のグローバルパラメーターおよびｎ個の形状パラメーターｐ_i）を更新する（ステップＳ５５０）。これにより、注目画像ＯＩにおける特徴点ＣＰの設定位置が更新される。補正部２２２は、差分画像Ｉｅのノルムが小さくなるように更新する。パラメーターの更新の後には、再度、特徴点ＣＰの設置位置が補正された注目画像ＯＩからの平均形状画像Ｉ（Ｗ（ｘ；ｐ））の算出（ステップＳ５１０）、差分画像Ｉｅの算出（ステップＳ５２０）、差分画像Ｉｅに基づく収束判定（ステップＳ５３０）が行われる。再度の収束判定においても収束していないと判定された場合には、さらに、差分画像Ｉｅに基づくパラメーター更新量ΔＰの算出（ステップＳ５４０）、パラメーターの更新による特徴点ＣＰの設定位置補正（ステップＳ５５０）が行われる。 The correction unit 222 updates the parameters (four global parameters and n shape parameters p _i ) based on the calculated parameter update amount ΔP (step S550). Thereby, the setting position of the feature point CP in the target image OI is updated. The correction unit 222 updates the difference image Ie so that the norm of the difference image Ie becomes small. After the parameter update, the average shape image I (W (x; p)) is calculated again from the target image OI in which the installation position of the feature point CP is corrected (step S510), and the difference image Ie is calculated (step S520), convergence determination based on the difference image Ie (step S530) is performed. If it is determined that the convergence has not occurred in the convergence determination again, the parameter update amount ΔP based on the difference image Ie is calculated (step S540), and the setting position correction of the feature point CP by the parameter update (step S550). ) Is performed.

図２２のステップＳ５１０からＳ５５０までの処理が繰り返し実行されると、注目画像ＯＩにおける各特徴部位に対応する特徴点ＣＰの位置は実際の特徴部位の位置に全体として近づいていき、ある時点で収束判定（ステップＳ５３０）において収束したと判定される。収束判定において収束したと判定されると、顔特徴位置検出処理が完了する（ステップＳ５６０）。このとき設定されているグローバルパラメーターおよび形状パラメーターの値により特定される特徴点ＣＰの設定位置が、最終的な注目画像ＯＩにおける特徴点ＣＰの設定位置として特定される。ステップＳ５１０からＳ５５０までの処理の繰り返しにより、注目画像ＯＩにおける各特徴部位に対応する特徴点ＣＰの位置と実際の特徴部位の位置とが一致する場合もある。 When the processing from step S510 to step S550 in FIG. 22 is repeatedly executed, the position of the feature point CP corresponding to each feature part in the target image OI approaches the position of the actual feature part as a whole, and converges at a certain time. In the determination (step S530), it is determined that convergence has occurred. If it is determined in the convergence determination that convergence has been achieved, the face feature position detection process is completed (step S560). The setting position of the feature point CP specified by the values of the global parameter and the shape parameter set at this time is specified as the setting position of the feature point CP in the final target image OI. By repeating the processing from step S510 to S550, the position of the feature point CP corresponding to each feature part in the target image OI may coincide with the position of the actual feature part.

図２３は、顔特徴位置検出処理の結果の一例を示す説明図である。図２３には、注目画像ＯＩにおいて最終的に特定された特徴点ＣＰの設定位置が示されている。特徴点ＣＰの設定位置により、注目画像ＯＩに含まれる顔の特徴部位（人物の顔の器官（眉毛、目、鼻、口）および顔の輪郭における所定位置）の位置が特定されるため、注目画像ＯＩにおける人物の顔の器官の形状・位置や顔の輪郭形状の検出が可能となる。以上により、顔特徴位置検出処理が完了する。 FIG. 23 is an explanatory diagram illustrating an example of a result of the face feature position detection process. FIG. 23 shows the setting positions of the feature points CP finally specified in the target image OI. Since the position of the feature point CP specifies the position of the facial feature part (predetermined position in the facial organs (eyebrows, eyes, nose, mouth) and facial contour) included in the attention image OI. It is possible to detect the shape and position of the human face organ and the face contour shape in the image OI. Thus, the face feature position detection process is completed.

印刷処理部３２０は、顔の器官の形状・位置や顔の輪郭形状の検出がなされた注目画像ＯＩについての印刷データを生成する。具体的には、印刷処理部３２０は、注目画像ＯＩについて、各画素の画素値をプリンター１００が用いるインクに合わせるための色変換処理や、色変換処理後の画素の階調をドットの分布によって表すためのハーフトーン処理や、ハーフトーン処理された画像データのデータ並びをプリンター１００に転送すべき順序に並び替えるためのラスタライズ処理等を実施して印刷データを生成する。印刷機構１６０は、印刷処理部３２０により生成された印刷データに基づいて、顔の器官の形状・位置や顔の輪郭形状の検出がなされた注目画像ＯＩの印刷をおこなう。なお、印刷処理部３２０は、注目画像ＯＩについての印刷データに限らず、検出された顔の器官の形状・位置や顔の輪郭形状に基づいて、顔変形や、顔の陰影補正など所定の処理が施された画像の印刷データについても生成することができる。また、印刷機構１６０は、印刷処理部３２０により生成された印刷データに基づいて、顔変形や、顔の陰影補正などの処理が施された画像の印刷をおこなうこともできる。 The print processing unit 320 generates print data for the attention image OI in which the shape / position of the facial organ and the face contour shape are detected. Specifically, the print processing unit 320 performs color conversion processing for matching the pixel value of each pixel with the ink used by the printer 100 for the target image OI, and the gradation of the pixel after color conversion processing according to the distribution of dots. Print data is generated by performing halftone processing for representing, rasterizing processing for rearranging the data arrangement of the image data subjected to the halftone processing in an order to be transferred to the printer 100, and the like. Based on the print data generated by the print processing unit 320, the printing mechanism 160 prints the attention image OI in which the shape and position of the facial organ and the contour shape of the face are detected. Note that the print processing unit 320 is not limited to print data for the target image OI, and predetermined processing such as face deformation and face shading correction based on the detected shape and position of the facial organ and the face contour shape. It is also possible to generate print data of images subjected to. The printing mechanism 160 can also print an image that has undergone processing such as face deformation and face shading correction based on the print data generated by the print processing unit 320.

以上説明したように、第１の実施例に係る画像処理装置によれば、特徴点ＣＰの初期位置を、顔領域検出情報に基づいて設定される複数の初期位置の候補から設定するため、初期位置を特徴部位により近い位置に設定することができる。これにより、注目画像に含まれる顔の特徴部位の位置を検出する処理の効率化・高速化を図ることができる。すなわち、顔領域ＦＡにおける顔画像の位置や範囲等は、つねに一定かつ顔領域ＦＡの中心と顔画像の中心が一致するわけではなく、注目画像ＯＩに含まれる顔画像の向きや鮮明さのほか、顔領域検出部２１０の検出特性などによって顔画像の位置や範囲が異なる。そのため、特徴点ＣＰを初期位置の候補となる注目画像ＯＩ上の仮設定位置に設定する際に、顔領域検出部２1０による顔領域ＦＡの検出に関連する情報を用いることで、特徴点ＣＰを検出対象の特徴部位に対してより適当な位置に設定することができる。 As described above, according to the image processing apparatus of the first embodiment, the initial position of the feature point CP is set from a plurality of initial position candidates set based on the face area detection information. The position can be set closer to the characteristic part. Thereby, it is possible to increase the efficiency and speed of the process of detecting the position of the facial feature part included in the target image. That is, the position and range of the face image in the face area FA are not always constant, and the center of the face area FA does not always coincide with the center of the face image. In addition to the orientation and clarity of the face image included in the target image OI, The position and range of the face image vary depending on the detection characteristics of the face area detection unit 210 and the like. Therefore, when the feature point CP is set to a temporary setting position on the target image OI that is a candidate for the initial position, information related to the detection of the face area FA by the face area detection unit 2110 is used, so that the feature point CP is determined. It can be set at a more appropriate position with respect to the characteristic part to be detected.

具体的には、初期位置候補設定部２３４は、顔領域信頼度が低い場合、すなわち、顔領域信頼度が閾値より小さい場合には、顔領域ＦＡに顔画像が含まれている可能性が低いことから、顔領域信頼度が高い場合に比べて、特徴点ＣＰの仮設定位置の数を増やすことにより、特徴部位に対応した適当な位置に特徴点ＣＰを配置できる可能性が高くすることができる。反対に、顔領域信頼度が高い場合には、顔領域ＦＡに顔画像が含まれている可能性が高いため、顔領域信頼度が低い場合に比べて、特徴点ＣＰの仮設定位置の数を減らすことにより効率的かつ高速に顔の特徴部位の位置の検出をおこなうことができる。 Specifically, the initial position candidate setting unit 234 has a low possibility that a face image is included in the face area FA when the face area reliability is low, that is, when the face area reliability is smaller than a threshold value. Therefore, the possibility that the feature point CP can be arranged at an appropriate position corresponding to the feature part is increased by increasing the number of the temporarily set positions of the feature point CP, compared with the case where the face area reliability is high. it can. On the other hand, when the face area reliability is high, there is a high possibility that a face image is included in the face area FA. Therefore, the number of temporarily set positions of feature points CP is higher than when the face area reliability is low. Therefore, the position of the facial feature portion can be detected efficiently and at high speed.

また、初期位置候補設定部２３４は、予め設定されている特定傾きが０度の場合における仮設定位置に対して、検出された顔領域ＦＡの特定傾きの角度分それぞれ傾けて設定するため、特徴部位に対応したより適当な位置に特徴点ＣＰを配置することができる。よって、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 Further, the initial position candidate setting unit 234 is set so as to be inclined by the angle of the specific inclination of the detected face area FA with respect to the provisional setting position when the specific inclination is set to 0 degree. The feature point CP can be arranged at a more appropriate position corresponding to the part. Therefore, the position of the facial feature part included in the target image can be detected efficiently and at high speed.

また、初期位置候補設定部２３４は、特定顔傾きの設定角度（例えば、０度、３０度、６０度、・・・、３３０度）のうち、検出した顔領域ＦＡの特定顔傾き（例えば、０度）と隣接する設定角度（例えば、−３０度および３０度）との一方の中間値（例えば、−１５度）から他方の中間値（例えば、１５度）までの範囲を、メッシュの回転角度の範囲として設定するため、顔領域ＦＡの検出処理の結果、実際の顔画像の顔の傾きが含まれている可能性の低い角度の範囲にメッシュを回転させて仮設定位置を設定する処理の無駄を抑制し、実際の顔画像の顔の傾きが含まれている可能性の高い範囲にメッシュを回転させて仮設置位置を設定することができる。よって、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。 In addition, the initial position candidate setting unit 234 sets the specific face inclination (for example, the detected face area FA) (for example, 0 degrees, 30 degrees, 60 degrees,..., 330 degrees) of the specific face inclination setting angles (for example, 0 degrees, 30 degrees, 60 degrees,. Rotate the mesh within a range from one intermediate value (for example, -15 degrees) between the adjacent setting angle (for example, -30 degrees and 30 degrees) to the other intermediate value (for example, 15 degrees). Processing to set a temporary setting position by rotating the mesh to an angle range where it is unlikely that the face inclination of the actual face image is included as a result of the detection processing of the face area FA to set as an angle range The temporary installation position can be set by rotating the mesh within a range where the face inclination of the actual face image is likely to be included. Therefore, the position of the facial feature part included in the target image can be detected efficiently and at high speed.

第１の実施例に係る画像処理装置によれば、特徴点ＣＰ初期位置設定処理において、グローバルパラメーターを用いて特徴点ＣＰの初期位置を設定するため、注目画像に含まれる顔の特徴部位の位置を検出する処理の効率化・高速化を図ることができる。具体的には、４つのグローバルパラメーター（大きさ、傾き、上下方向の位置、左右方向の位置）の値をそれぞれ変更させて、種々のメッシュを形成する特徴点ＣＰの仮設定位置を予め複数用意し、ノルムの値が最も小さい差分画像Ｉｅに対応する仮設定位置を初期位置としている。これにより、注目画像ＯＩにおける特徴点ＣＰの初期位置を顔の特徴部位の位置のより近くに設定することができる。よって、特徴点ＣＰ設定位置補正処理において、補正部２２２による補正が容易となるため、顔の特徴部位の位置を検出する処理の効率化・高速化を図ることができる。 According to the image processing apparatus according to the first embodiment, in the feature point CP initial position setting process, the initial position of the feature point CP is set using a global parameter. It is possible to increase the efficiency and speed of the process for detecting the. Specifically, multiple global parameters (size, inclination, vertical position, horizontal position) are changed to prepare multiple temporary setting positions for feature points CP that form various meshes. The temporary setting position corresponding to the difference image Ie having the smallest norm value is set as the initial position. Thereby, the initial position of the feature point CP in the target image OI can be set closer to the position of the facial feature part. Accordingly, in the feature point CP setting position correction process, correction by the correction unit 222 is facilitated, so that the process of detecting the position of the facial feature part can be made more efficient and faster.

第１の実施例に係るプリンター１００によれば、顔の器官の形状・位置や顔の輪郭形状の検出がなされた注目画像ＯＩについての印刷をおこなうことができる。これにより、特定の表情（例えば笑顔や目を閉じた顔）の顔画像を検出するための表情判定や、特定の向き（例えば右向きや下向き）の顔画像を検出するための顔向き判定をおこなった後に、判定結果に基づいて任意の画像を選択して印刷をおこなうことができる。また、検出された顔の器官の形状・位置や顔の輪郭形状に基づいて、顔変形や、顔の陰影補正など所定の処理が施された画像の印刷をおこなうことができる。これにより、特定の顔画像について、顔変形や、顔の陰影補正等をおこなった後に印刷をおこなうことができる。 According to the printer 100 according to the first embodiment, it is possible to print the attention image OI in which the shape / position of the facial organ and the contour shape of the face are detected. Thus, facial expression determination for detecting a facial image with a specific facial expression (for example, a face with a smile or eyes closed) and facial orientation determination for detecting a facial image with a specific orientation (for example, rightward or downward) are performed. After that, any image can be selected and printed based on the determination result. Further, based on the detected shape and position of the organ of the face and the contour shape of the face, it is possible to print an image subjected to predetermined processing such as face deformation and face shading correction. Thereby, it is possible to print a specific face image after performing face deformation, face shading correction, and the like.

Ｂ．第２実施例：
第１実施例では、取得部２３２は、顔領域検出情報を取得し（ステップＳ４００）、初期位置候補設定部２３４は、取得部２３２が取得した顔領域検出情報に基づいて、特徴点ＣＰを注目画像ＯＩ上の仮設定位置に設定していたが（ステップＳ４１０）、顔特徴位置検出処理の度に顔領域検出情報を取得する態様ではなく、初期位置候補設定部２３４は、予め顔領域検出情報に基づいて設定された仮設定位置に特徴点ＣＰを設定してもよい。 B. Second embodiment:
In the first example, the acquisition unit 232 acquires face area detection information (step S400), and the initial position candidate setting unit 234 focuses on the feature point CP based on the face area detection information acquired by the acquisition unit 232. Although the temporary setting position on the image OI has been set (step S410), the initial position candidate setting unit 234 does not acquire the face area detection information every time the facial feature position detection processing is performed, but the initial position candidate setting unit 234 previously stores the face area detection information. The feature point CP may be set at the temporary setting position set based on the above.

図２４は、顔領域検出情報と特徴点ＣＰの仮設定位置について第２の例を示した説明図である。図２５は、顔領域検出情報と特徴点ＣＰの仮設定位置についての第３の例を示した説明図である。第２実施例における初期位置候補設定部２３４は、特徴点ＣＰの初期位置の設定に、顔領域ＦＡにおける顔画像の位置の傾向に基づいて予め設定された特徴点ＣＰの仮設定位置を用いる。具体的には、図２４（ａ）に示すように、顔領域ＦＡの検出処理の統計的なデータにより、顔領域ＦＡに対して顔画像が下側に位置することが特定された場合には、図２４（ｂ）に示すように、顔領域ＦＡに対してメッシュが下側に位置するように予めグローバルパラメーターが設定される。また、顔領域ＦＡに対して顔画像が上側に位置する場合や、左右のいずれかに偏って位置する場合についても同様に予めグローバルパラメーターを設定することにより、顔画像のより近くにメッシュを配置することができる。グローバルパラメーターの設定は、ユーザによりおこなわれてもよいし、画像処理部２００によりおこなわれてもよい。 FIG. 24 is an explanatory diagram showing a second example of the temporary setting positions of the face area detection information and the feature points CP. FIG. 25 is an explanatory diagram showing a third example of the temporary setting positions of the face area detection information and the feature points CP. The initial position candidate setting unit 234 in the second embodiment uses a temporary setting position of the feature point CP set in advance based on the tendency of the position of the face image in the face area FA for setting the initial position of the feature point CP. Specifically, as shown in FIG. 24A, when the statistical data of the face area FA detection process determines that the face image is located below the face area FA. As shown in FIG. 24B, global parameters are set in advance so that the mesh is positioned below the face area FA. In addition, when the face image is located above the face area FA, or when the face image is biased to the left or right, a global parameter is set in advance to place a mesh closer to the face image. can do. The global parameter setting may be performed by the user or the image processing unit 200.

また、初期位置候補設定部２３４は、図２５（ａ）に示すように、顔領域ＦＡの検出処理の統計的なデータにより、顔領域ＦＡが顔画像に対して小さいことが特定された場合には、図２５（ｂ）に示すように、顔領域ＦＡに対してメッシュが大きくなるように予めグローバルパラメーターが設定される。また、顔領域ＦＡが顔画像に対して大きい場合や、顔領域ＦＡに対して顔画像が上下左右に偏って位置するとともに、顔領域ＦＡが顔画像に対して大小する場合についても同様に予めグローバルパラメーターを設定することにより、顔画像のより近くにメッシュを配置することができる。 In addition, as illustrated in FIG. 25A, the initial position candidate setting unit 234 determines that the face area FA is smaller than the face image by statistical data of the face area FA detection process. As shown in FIG. 25B, global parameters are set in advance so that the mesh becomes larger with respect to the face area FA. Similarly, when the face area FA is larger than the face image, or when the face image is biased vertically and horizontally with respect to the face area FA, and the face area FA is larger or smaller than the face image, the same applies in advance. By setting global parameters, the mesh can be placed closer to the face image.

第２の実施例に係る画像処理装置によれば、画像処理部２００は、取得部２３２を備えていなくても、注目画像に含まれる顔の特徴部位の位置を検出する処理の効率化・高速化を図ることができる。具体的には、取得部２３２により、顔特徴位置検出処理の度に顔領域検出情報を取得しなくても、例えば、顔領域ＦＡにおける顔画像の位置の傾向など、顔領域ＦＡの検出処理により特定される情報以外の顔領域検出情報であれば、顔領域検出情報に基づいて特徴点ＣＰの仮設定位置を予め設定することができる。これにより、特徴点ＣＰを特徴部位に対してより適当な位置に設定することができる。また、初期位置候補設定部２３４は、取得部２３２が取得した顔領域検出情報に基づいて、特徴点ＣＰを注目画像ＯＩ上の仮設定位置に設定する必要はなく、顔領域検出情報に基づいて予め設定された仮設定位置に特徴点ＣＰを設定してもよい。 According to the image processing apparatus according to the second embodiment, the image processing unit 200 does not include the acquisition unit 232, and improves the efficiency and speed of the process of detecting the position of the facial feature part included in the target image. Can be achieved. Specifically, without acquiring the face area detection information every time the facial feature position detection process is performed by the acquisition unit 232, for example, by the face area FA detection process such as the tendency of the position of the face image in the face area FA. If the face area detection information is other than the specified information, the temporary setting position of the feature point CP can be set in advance based on the face area detection information. Thereby, the feature point CP can be set at a more appropriate position with respect to the feature part. Further, the initial position candidate setting unit 234 does not need to set the feature point CP as a temporary setting position on the attention image OI based on the face area detection information acquired by the acquisition unit 232, but based on the face area detection information. The feature point CP may be set at a preset temporary setting position.

Ｃ．変形例：
なお、この発明は上記の実施例や実施形態に限られるものではなく、その要旨を逸脱しない範囲において種々の態様において実施することが可能であり、例えば次のような変形も可能である。 C. Variation:
The present invention is not limited to the above-described examples and embodiments, and can be implemented in various modes without departing from the gist thereof. For example, the following modifications are possible.

Ｃ１．変形例１：
第１実施例では、初期位置候補設定部２３４は、顔領域信頼度が閾値以上である場合には、図１８に示すように、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動のそれぞれについて３段階に変化させて仮設定位置を設定し、顔領域信頼度が閾値より小さい場合には、図２０に一例を示すように、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動のそれぞれについて５段階に変化させて仮設定位置を設定するとして説明したが、上記の３段階および５段階は例示にすぎず、例えば４段階や６段階のようにこれ以外の段階数に設定されていてもよい。 C1. Modification 1:
In the first example, the initial position candidate setting unit 234 performs mesh rotation and enlargement / reduction, mesh vertical movement, and horizontal movement as shown in FIG. If the temporary setting position is set in three stages for each, and the face area reliability is smaller than the threshold, the mesh is rotated and enlarged / reduced, the mesh is moved up and down, and left and right as shown in FIG. Although it has been described that the temporary setting position is set by changing the movement to 5 stages, the above-described 3 stages and 5 stages are merely examples. For example, the number of stages is set to other stages such as 4 stages and 6 stages. May be.

Ｃ２．変形例２：
第１実施例では、顔領域検出情報として、特定顔傾きの設定数、設定角度、設定間隔や、顔領域ＦＡの検出処理の統計的なデータに基づく顔領域ＦＡにおける顔画像の位置の傾向、顔領域ＦＡの検出処理において、顔領域検出部２1０が注目画像ＯＩから顔領域ＦＡを検出した際に特定される情報などを示したが、顔領域検出情報は、顔領域検出部２1０による顔領域ＦＡの検出に関連する情報であれば上記に限定されず、これら以外の情報も含まれる。例えば、前回の顔領域ＦＡの検出処理の結果なども含まれる。また、顔領域検出部２1０による顔領域ＦＡの検出に関連する情報についても、検出した顔領域ＦＡの特定顔傾きや、顔領域信頼度に限定されず、これら以外の情報も含まれる。例えば、ウィンドウＳＷの水平方向および垂直方向の移動量や、重複ウィンドウ数なども含まれる。 C2. Modification 2:
In the first embodiment, as the face area detection information, the number of specific face inclinations, the setting angle, the setting interval, and the tendency of the position of the face image in the face area FA based on statistical data of the face area FA detection process, In the detection process of the face area FA, the information specified when the face area detection unit 2110 detects the face area FA from the target image OI is shown. The face area detection information is the face area detection unit 2110. The information is not limited to the above as long as it is information related to FA detection, and other information is also included. For example, the result of detection processing of the previous face area FA is also included. Also, the information related to the detection of the face area FA by the face area detection unit 2110 is not limited to the specific face inclination and the face area reliability of the detected face area FA, but includes information other than these. For example, the amount of movement of the window SW in the horizontal and vertical directions, the number of overlapping windows, and the like are also included.

Ｃ３．変形例３：
第１実施例では、顔領域検出情報として、特定顔傾きの設定間隔と、検出した顔領域ＦＡの特定顔傾きと、顔領域信頼度を用いているが、必ずしもこれらすべてを用いる必要ななく、これら一部に基づいて特徴点ＣＰの仮設定位置を設定した場合であっても、特徴部位に対応したより適当な位置に特徴点ＣＰを配置することができ、注目画像に含まれる顔の特徴部位の位置を効率的かつ高速に検出することができる。また、第１実施例と第２実施例は適宜組み合わせて実現することができる。 C3. Modification 3:
In the first embodiment, as the face area detection information, the specific face inclination setting interval, the specific face inclination of the detected face area FA, and the face area reliability are used, but it is not always necessary to use all of them. Even when the temporary setting position of the feature point CP is set based on these parts, the feature point CP can be arranged at a more appropriate position corresponding to the feature part, and the feature of the face included in the attention image The position of the part can be detected efficiently and at high speed. Further, the first embodiment and the second embodiment can be realized by appropriately combining them.

Ｃ４．変形例４：
本実施例では、顔領域信頼度を用いて、仮設定位置の設定数を変化させているが、仮設定位置の設定数は顔領域信頼度以外の顔領域検出情報に基づいて決定されてもよい。例えば、顔領域ＦＡの検出処理の統計的なデータを用いて算出される正規分布の分散の程度により仮設定位置の設定数を決定してもよい。具体的は、顔領域ＦＡの検出処理の統計的なデータにより、顔領域ＦＡに対する顔画像の位置や回転、顔領域ＦＡに占める顔画像の範囲などのばらつきが大きい場合には、メッシュの回転および拡大・縮小、メッシュの上下移動および左右移動についての変化の段階数が多くなるように設定されていてもよい。例えば、平均μからのずれが±３σとなる範囲において、仮設定位置の間隔が所定の範囲内となるように仮設定位置の数が決定されてもよい。 C4. Modification 4:
In this embodiment, the number of temporary setting positions set is changed using the face area reliability. However, the number of temporary setting positions may be determined based on face area detection information other than the face area reliability. Good. For example, the set number of temporarily set positions may be determined based on the degree of dispersion of the normal distribution calculated using statistical data of the face area FA detection process. Specifically, if the statistical data of the face area FA detection process causes a large variation in the position and rotation of the face image with respect to the face area FA, the range of the face image in the face area FA, etc. It may be set so that the number of stages of change for enlargement / reduction, vertical movement of the mesh, and horizontal movement increases. For example, the number of temporarily set positions may be determined so that the interval between the temporarily set positions is within a predetermined range in a range where the deviation from the average μ is ± 3σ.

Ｃ５．変形例５：
第１実施例で示した、特定顔傾きの設定数（１２個）、設定角度（０度、３０度、６０度、・・・、３３０度）、設定間隔（３０度）は例示であり、これ以外の設定数、設定角度、設定間隔であってもよい。顔学習データＦＬＤを設定するために用いたサンプル画像とＡＡＭ設定処理に用いるサンプル画像ＳＩは一部が重複する画像であってもよいし、すべて異なる画像であってもよい。 C5. Modification 5:
The specific face tilt setting number (12), setting angles (0 degrees, 30 degrees, 60 degrees,..., 330 degrees), and setting intervals (30 degrees) shown in the first embodiment are examples. Other set numbers, set angles, and set intervals may be used. The sample image used for setting the face learning data FLD and the sample image SI used for the AAM setting process may be partially overlapping images or may be different images.

Ｃ６．変形例６：
本実施例におけるサンプル画像ＳＩはあくまで一例であり、サンプル画像ＳＩとして採用する画像の数、種類は任意に設定可能である。また、本実施例において、特徴点ＣＰの位置で示される顔の所定の特徴部位はあくまで一例であり、実施例において設定されている特徴部位の一部を省略したり、特徴部位として他の部位を採用したりしてもよい。 C6. Modification 6:
The sample image SI in this embodiment is merely an example, and the number and type of images employed as the sample image SI can be arbitrarily set. In the present embodiment, the predetermined feature portion of the face indicated by the position of the feature point CP is merely an example, and a part of the feature portion set in the embodiment may be omitted or another portion may be used as the feature portion. May be adopted.

また、本実施例では、サンプル画像ＳＩｗの画素群ｘのそれぞれにおける輝度値により構成される輝度値ベクトルに対する主成分分析によってテクスチャーモデルが設定されているが、顔画像のテクスチャー（見え）を表す輝度値以外の指標値（例えばＲＧＢ値）に対する主成分分析によってテクスチャーモデルが設定されるものとしてもよい。 In this embodiment, the texture model is set by principal component analysis with respect to the luminance value vector formed by the luminance values in each of the pixel groups x of the sample image SIw, but the luminance representing the texture (appearance) of the face image. The texture model may be set by principal component analysis for index values other than the values (for example, RGB values).

また、本実施例において、平均顔画像Ａ₀（ｘ）のサイズは５６画素×５６画素に限られず他のサイズであってもよい。また、平均顔画像Ａ₀（ｘ）は、マスク領域ＭＡ（図８）を含む必要はなく、平均形状領域ＢＳＡのみによって構成されるとしてもよい。また、平均顔画像Ａ₀（ｘ）の代わりに、サンプル画像ＳＩの統計的分析に基づき設定される他の基準顔画像が用いられるとしてもよい。 In this embodiment, the size of the average face image A ₀ (x) is not limited to 56 pixels × 56 pixels, and may be other sizes. Further, the average face image A ₀ (x) does not need to include the mask area MA (FIG. 8), and may be configured only by the average shape area BSA. Further, instead of the average face image A ₀ (x), another reference face image set based on the statistical analysis of the sample image SI may be used.

また、本実施例では、ＡＡＭを用いた形状モデルおよびテクスチャーモデルの設定が行われているが、他のモデル化手法（例えばＭｏｒｐｈａｂｌｅＭｏｄｅｌと呼ばれる手法やＡｃｔｉｖｅＢｌｏｂと呼ばれる手法）を用いて形状モデルおよびテクスチャーモデルの設定が行われるとしてもよい。 In this embodiment, the shape model and the texture model are set using AAM. However, the shape model and the texture model using other modeling methods (for example, a method called Morphable Model or a method called Active Blob) are used. A texture model may be set.

また、本実施例では、メモリーカードＭＣに格納された画像が注目画像ＯＩに設定されているが、注目画像ＯＩは例えばネットワークを介して取得された画像であってもよい。また、検出モード情報についても、ネットワークを介して取得されてもよい。 In this embodiment, the image stored in the memory card MC is set as the attention image OI. However, the attention image OI may be an image acquired via a network, for example. Also, the detection mode information may be acquired via a network.

また、本実施例では、画像処理装置としてのプリンター１００による画像処理を説明したが、処理の一部または全部がパーソナルコンピューターやデジタルスチルカメラ、デジタルビデオカメラ等の他の種類の画像処理装置により実行されるものとしてもよい。また、プリンター１００はインクジェットプリンターに限らず、他の方式のプリンター、例えばレーザプリンターや昇華型プリンターであるとしてもよい。 In this embodiment, image processing by the printer 100 as the image processing apparatus has been described. However, part or all of the processing is executed by another type of image processing apparatus such as a personal computer, a digital still camera, or a digital video camera. It is good also as what is done. The printer 100 is not limited to an ink jet printer, and may be another type of printer such as a laser printer or a sublimation printer.

本実施例において、ハードウェアによって実現されていた構成の一部をソフトウェアに置き換えるようにしてもよく、逆に、ソフトウェアによって実現されていた構成の一部をハードウェアに置き換えるようにしてもよい。 In this embodiment, a part of the configuration realized by hardware may be replaced with software, and conversely, a part of the configuration realized by software may be replaced by hardware.

また、本発明の機能の一部または全部がソフトウェアで実現される場合には、そのソフトウェア（コンピュータープログラム）は、コンピューター読み取り可能な記録媒体に格納された形で提供することができる。この発明において、「コンピューター読み取り可能な記録媒体」とは、フレキシブルディスクやＣＤ−ＲＯＭのような携帯型の記録媒体に限らず、各種のＲＡＭやＲＯＭ等のコンピューター内の内部記憶装置や、ハードディスク等のコンピューターに固定されている外部記憶装置も含んでいる。 In addition, when part or all of the functions of the present invention are realized by software, the software (computer program) can be provided in a form stored in a computer-readable recording medium. In the present invention, the “computer-readable recording medium” is not limited to a portable recording medium such as a flexible disk or a CD-ROM, but an internal storage device in a computer such as various RAMs and ROMs, a hard disk, etc. It also includes an external storage device fixed to the computer.

１００…プリンター
１１０…ＣＰＵ
１２０…内部メモリー
１４０…操作部
１５０…表示部
１６０…印刷機構
１７０…カードインターフェース
１７２…カードスロット
２００…画像処理部
２１０…顔領域検出部
２２０…特徴位置検出部
２２２…補正部
２３０…初期位置設定部
２３２…取得部
２３４…初期位置候補設定部
２３６…生成部
２３８…算出部
３１０…表示処理部
３２０…印刷処理部 100 ... Printer 110 ... CPU
DESCRIPTION OF SYMBOLS 120 ... Internal memory 140 ... Operation part 150 ... Display part 160 ... Printing mechanism 170 ... Card interface 172 ... Card slot 200 ... Image processing part 210 ... Face area detection part 220 ... Feature position detection part 222 ... Correction part 230 ... Initial position setting Unit 232 ... acquisition unit 234 ... initial position candidate setting unit 236 ... generation unit 238 ... calculation unit 310 ... display processing unit 320 ... print processing unit

Claims

An image processing apparatus for detecting a coordinate position of a feature part of a face included in an attention image,
A face area detection unit that detects an image area including at least a part of a face image from the attention image as a face area;
A plurality of the initial positions set based on face area detection information, which is information related to the detection of the face area, is an initial position of a feature point set in the image of interest for detecting the coordinate position of the feature part An initial position setting unit set from position candidates;
A feature position detector configured to correct the set position of the feature point set to the initial position so as to approach the position of the feature part and detect the corrected set position as the coordinate position of the feature part; Image processing device.

The image processing apparatus according to claim 1.
The face area detection information is information specified along with the detection by the face area detection unit,
The initial position setting unit includes:
An acquisition unit for acquiring the identified face area detection information;
An initial position candidate setting unit that sets the initial position candidate based on the acquired face area detection information.

The image processing apparatus according to claim 2,
The face area detection information includes a face area reliability indicating the certainty that the face image included in the face area detected by the face area detection unit is a true face image,
The initial position candidate setting unit increases the number of candidates for the initial position to be set when the face area reliability is low compared to when the face area reliability is high.

The image processing apparatus according to claim 2,
The face area detection information includes angle information related to a rotation angle in an image plane of a face image included in the face area detected by the face area detection unit,
The initial position candidate setting unit is an image processing apparatus configured to rotate and set the predetermined initial position candidates according to the rotation angle based on the angle information.

The image processing apparatus according to claim 1.
The face area detection information includes information related to a rotation angle in the image plane of the face image that can be specified by the face area detection unit,
The candidates for the plurality of initial positions are adjacent to each other for each rotation angle that can be specified by the face area detection unit, based on an intermediate value between the rotation angle and one of the rotation angles that can be specified. The image processing apparatus respectively set to the range to the intermediate value with the other said identifiable rotation angle.

The image processing apparatus according to claim 1.
The face area detection information includes information regarding a tendency of a relative position of a face image with respect to the face area detected by the face area detection unit,
The plurality of initial position candidates are image processing devices in which relative positions with respect to the face region are determined according to the tendency.

The image processing apparatus according to any one of claims 1 to 6,
The initial position setting unit includes:
A generating unit that generates an average shape image that is an image obtained by converting a part of the image of interest based on the feature point set at a position that is a candidate for the initial position;
A calculation unit that calculates a difference value between the average shape image and an average face image that is an image generated based on a plurality of sample images including a face image in which the coordinate position of the feature part is known. ,
An image processing apparatus configured to set, as the initial position, an initial position candidate having a minimum difference value among the plurality of initial position candidates.

The image processing apparatus according to any one of claims 1 to 7,
The feature position detector
Based on the difference value between the average shape image corresponding to the initial position and the average face image, a correction unit that corrects the set position so that the difference value is small,
An image processing apparatus that detects the set position where the difference value is predetermined as the coordinate position.

The image processing apparatus according to any one of claims 1 to 8,
The image processing apparatus, wherein the characteristic part is a part of eyebrows, eyes, nose, mouth, and face line.

A printer that detects a coordinate position of a feature part of a face included in an attention image,
A face area detection unit that detects an image area including at least a part of a face image from the attention image as a face area;
A plurality of the initial positions set based on face area detection information, which is information related to the detection of the face area, is an initial position of a feature point set in the image of interest for detecting the coordinate position of the feature part An initial position setting unit set from position candidates;
A feature position detector that corrects the setting position of the feature point set to the initial position so as to approach the position of the feature part, and detects the corrected setting position as a coordinate position of the feature part;
And a printing unit for printing the target image from which the coordinate position is detected.

An image processing method for detecting a coordinate position of a feature part of a face included in an image of interest,
Detecting an image area including at least a part of a face image from the noted image as a face area;
A plurality of the initial positions set based on face area detection information, which is information related to the detection of the face area, is an initial position of a feature point set in the image of interest for detecting the coordinate position of the feature part A step of setting from position candidates;
Correcting the setting position of the feature point set to the initial position so as to approach the position of the feature part, and detecting the corrected setting position as the coordinate position of the feature part. .

A computer program for image processing that detects a coordinate position of a feature part of a face included in an image of interest,
A face area detection function for detecting, as a face area, an image area including at least a part of a face image from the attention image;
A plurality of the initial positions set based on face area detection information, which is information related to the detection of the face area, is an initial position of a feature point set in the image of interest for detecting the coordinate position of the feature part Initial position setting function to set from position candidates,
A feature position detection function for correcting the set position of the feature point set to the initial position so as to approach the position of the feature part, and detecting the corrected set position as the coordinate position of the feature part; Computer program to be realized.