JP2010271956A

JP2010271956A - Image processing apparatus, image processing method, image processing program, and printer

Info

Publication number: JP2010271956A
Application number: JP2009123611A
Authority: JP
Inventors: Masaya Usui; 雅也碓井; Kenji Matsuzaka; 健治松坂
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2009-05-21
Filing date: 2009-05-21
Publication date: 2010-12-02

Abstract

<P>PROBLEM TO BE SOLVED: To determine a plurality of expressions with high accuracy. <P>SOLUTION: An image processing apparatus is configured to detect an image region including at least part of a face image from an image under consideration as a face region in the case of determining the expression of the face included in the image under consideration, to set a face model for reproducing at least the position of the feature site of the face on the basis of a plurality of feature values, to correct the feature values so that the position of the feature site reproduced by the face model can approximate the face region, to calculate a decision value for each of the plurality of expressions on the basis of the corrected feature values, and to determine the expression of the face by integrating the decision values of the respective expressions. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、画像に含まれる顔の表情を判定する画像処理装置、画像処理方法、画像処理プログラム、および、印刷装置に関する。 The present invention relates to an image processing apparatus, an image processing method, an image processing program, and a printing apparatus that determine facial expressions included in an image.

顔画像の表情を目や鼻や口や髪型や色彩などについての位置や大きさや比率などの顔特徴情報に基づいて推定する印刷装置が開示されている（特許文献１、段落００３１、参照。）。 A printing apparatus that estimates facial expression based on facial feature information such as position, size, and ratio of eyes, nose, mouth, hairstyle, color, etc. is disclosed (see Patent Document 1, paragraph 0031). .

特開２００７−２０６９２１号公報JP 2007-206921 A

しかし、単一の顔画像に複数の表情が混在するような場合（例えば、驚いている笑顔等。）には、いずれの表情であるかを判定するのが困難であった。
本発明は、前記の課題を解決するためになされたものであり、複数の表情を精度よく判別することが可能な画像処理装置、画像処理方法、画像処理プログラム、および、印刷装置を提供することを目的とする。 However, when a plurality of facial expressions are mixed in a single face image (for example, a surprised smile), it is difficult to determine which facial expression.
The present invention has been made to solve the above problems, and provides an image processing apparatus, an image processing method, an image processing program, and a printing apparatus capable of accurately discriminating a plurality of facial expressions. With the goal.

前記課題の少なくとも一部を解決するために本願発明は以下の態様を採る。すなわち、顔領域検出部を備え、該顔領域検出部は前記注目画像から顔画像の少なくとも一部を含む画像領域を顔領域として検出する。設定部は、複数の特徴量に基づいて顔の特徴部位の位置を再現する顔モデルを設定する。さらに、補正部は、前記顔モデルによって再現される前記特徴部位の位置が、前記顔領域に近似するように前記特徴量を補正する。そして、判定部は、前記補正された前記特徴量に基づいて複数の表情ごとにその表情らしさを示す判定値を算出し、各表情についての判定値を総合して顔の表情を判定する。 In order to solve at least a part of the problems, the present invention adopts the following aspects. That is, a face area detecting unit is provided, and the face area detecting unit detects an image area including at least a part of the face image as the face area from the attention image. The setting unit sets a face model that reproduces the position of the feature portion of the face based on a plurality of feature amounts. Further, the correction unit corrects the feature amount so that the position of the feature part reproduced by the face model approximates the face region. Then, the determination unit calculates a determination value indicating the likelihood of the expression for each of a plurality of expressions based on the corrected feature amount, and determines a facial expression by combining the determination values for each expression.

このように、複数の表情のそれぞれについて、表情らしさを示す判定値を算出することにより、各表情の表情らしさを精度よく評価することができる。また、各表情についての判定値を総合的に判断することにより、いずれの表情を有するかを判定するため、複数の表情が混在するような場合でも、最も適切な表情を判定することができる。例えば、表情らしさを最も強く示す前記判定値に対応する表情を前記顔領域が有すると判定することにより、複数の表情が混在する場合でも、もっともそれらしい表情だけを判定することができる。なお、本発明は、単一の表情を有すると判定するものに限られず、例えば複数の表情の判定値が所定の閾値を超えた場合に、複数の表情を有する旨を判定するようにしてもよい。また、前記補正された前記特徴量の少なくとも２つを線形結合した前記判定値と閾値との比較することより表情の判定をしてもよい。このようにすることにより、複数の前記特徴量を総合的に考慮して判定値を算出することができる。 Thus, by calculating the determination value indicating the expression-likeness for each of a plurality of facial expressions, the expression-likeness of each expression can be accurately evaluated. Further, since the determination value for each expression is comprehensively determined to determine which expression is present, the most appropriate expression can be determined even when a plurality of expressions are mixed. For example, by determining that the face area has an expression corresponding to the determination value that most strongly indicates the likelihood of expression, only the most appropriate expression can be determined even when a plurality of expressions are mixed. Note that the present invention is not limited to determining that a single facial expression is present. For example, when a determination value of a plurality of facial expressions exceeds a predetermined threshold, it is determined that the facial expression has a plurality of facial expressions. Good. The facial expression may be determined by comparing the determination value obtained by linearly combining at least two of the corrected feature quantities with a threshold value. By doing so, it is possible to calculate the determination value in consideration of a plurality of the feature amounts.

別の態様に係る画像処理装置では、前記特徴量の少なくとも一つを、複数のサンプル画像の特徴部位の位置を主成分分析して得られる各主成分の固有ベクトルの係数とする。すなわち、顔モデルを主成分分析を利用して用意するとともに、前記特徴量を各主成分の固有ベクトルの係数とする。さらに、前記特徴量の少なくとも一つを、複数のサンプル画像の画素の色彩値を主成分分析して得られる各主成分の固有ベクトルの係数とすることにより、顔の見えをモデル化することができる。 In an image processing apparatus according to another aspect, at least one of the feature amounts is a coefficient of an eigenvector of each principal component obtained by performing principal component analysis on the positions of feature portions of a plurality of sample images. That is, a face model is prepared using principal component analysis, and the feature amount is set as a coefficient of an eigenvector of each principal component. Furthermore, the appearance of the face can be modeled by using at least one of the feature values as a coefficient of an eigenvector of each principal component obtained by principal component analysis of the color values of pixels of a plurality of sample images. .

別の態様に係る画像処理装置では、異なる時刻に対応付けられた複数の前記注目画像に含まれる顔の複数の表情を順次判定していくにあたり、前記注目画像について、ある表情を有すると判定された場合、それより後の時刻に対応付けられた前記注目画素に含まれる当該顔について、当該表情であると判定されなくなるまで、他の表情についての前記判定値の算出および該判定値に基づく判定を行わないこととする。このようにすることにより、異なる時刻に対応付けられた動画等の各フレームに対する表情判定を効率よく実行させることができる。 In the image processing device according to another aspect, when sequentially determining a plurality of facial expressions included in the plurality of attention images associated with different times, the attention image is determined to have a certain expression. If the face included in the pixel of interest associated with a later time is not determined to be the facial expression, the calculation of the determination value for the other facial expression and the determination based on the determination value are performed. Will not be performed. By doing in this way, the facial expression determination with respect to each frame, such as a moving image matched with different time, can be performed efficiently.

なお、本発明は、種々の態様で実現することが可能であり、例えば、画像処理方法および装置、特徴部位の位置検出方法および装置、表情判定方法および装置、これらの方法または装置の機能を実現するための画像処理プログラム、その画像処理プログラムを記録した記録媒体、そのコンピュータープログラムを含み搬送波内に具現化されたデータ信号、等の形態で実現することができる。 Note that the present invention can be realized in various modes, for example, an image processing method and apparatus, a feature position detection method and apparatus, a facial expression determination method and apparatus, and the functions of these methods or apparatuses. For example, a recording medium storing the image processing program, a data signal including the computer program and embodied in a carrier wave, and the like.

本発明の前記実施例における画像処理装置としてのプリンター１００の構成を概略的に示す説明図である。FIG. 2 is an explanatory diagram schematically showing a configuration of a printer 100 as an image processing apparatus in the embodiment of the present invention. 前記実施例におけるＡＡＭ設定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the AAM setting process in the said Example. サンプル画像ＳＩの一例を示す説明図である。It is explanatory drawing which shows an example of sample image SI. サンプル画像ＳＩにおける特徴点ＣＰの設定方法の一例を示す説明図である。It is explanatory drawing which shows an example of the setting method of the feature point CP in the sample image SI. サンプル画像ＳＩに設定された特徴点ＣＰの座標の一例を示す説明図である。It is explanatory drawing which shows an example of the coordinate of the feature point CP set to sample image SI. 平均形状ｓ０の一例を示す説明図である。It is explanatory drawing which shows an example of average shape s0. 形状ベクトルｓ_iおよび形状パラメーターｐiと顔の形状ｓとの関係を例示した説明図である。It is explanatory drawing which illustrated the relationship between shape vector s _i and shape parameter pi, and face shape s. サンプル画像ＳＩのワープＷの方法の一例を示す説明図である。It is explanatory drawing which shows an example of the method of the warp W of the sample image SI. 平均顔画像Ａ₀（ｘ）の一例を示す説明図である。It is explanatory drawing which shows an example of average face image _A0 (x). 前記実施例における顔特徴検出処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the face feature detection process in the said Example. 注目画像ＯＩにおける顔領域ＦＡの検出結果の一例を示す説明図である。It is explanatory drawing which shows an example of the detection result of the face area FA in the attention image OI. 前記実施例における特徴点ＣＰの初期位置設定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the initial position setting process of the feature point CP in the said Example. グローバルパラメーターの値を変更することによる特徴点ＣＰの仮設定位置の一例を示す説明図である。It is explanatory drawing which shows an example of the temporary setting position of the feature point CP by changing the value of a global parameter. 平均形状画像Ｉ（Ｗ（ｘ；ｐ））の一例を示す説明図である。It is explanatory drawing which shows an example of the average shape image I (W (x; p)). 前記実施例における特徴点ＣＰ設定位置補正処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the feature point CP setting position correction process in the said Example. ＣＰ設定位置補正処理の結果の一例を示す説明図である。It is explanatory drawing which shows an example of the result of CP setting position correction processing. テクスチャー検出処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a texture detection process. 表情判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a facial expression determination process. 判定値を示すグラフである。It is a graph which shows a judgment value. 判定値設定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a judgment value setting process. 特徴量と喜顔であるか否かの相関を説明するグラフである。It is a graph explaining the correlation of whether it is a feature amount and a happy face. 変形例にかかる表情判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the facial expression determination process concerning a modification. 変形例にかかる表情判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the facial expression determination process concerning a modification. 変形例において表情パラメーターを設定する処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process which sets a facial expression parameter in a modification.

以下、本発明に係る画像処理装置の一態様であるプリンターについて、図面を参照しつつ、実施例に基づいて説明する。 Hereinafter, a printer which is an aspect of an image processing apparatus according to the present invention will be described based on examples with reference to the drawings.

Ａ．本実施例：
Ａ１．画像処理装置の構成：
図１は、本発明の本実施例における画像処理装置としてのプリンター１００の構成を概略的に示す説明図である。本実施例のプリンター１００は、メモリーカードＭＣ等から取得した画像データに基づき画像を印刷する、いわゆるダイレクトプリントに対応したインクジェット式カラープリンターである。プリンター１００は、プリンター１００の各部を制御するＣＰＵ１１０と、ＲＯＭやＲＡＭによって構成された内部メモリー１２０と、ボタンやタッチパネルにより構成された操作部１４０と、液晶ディスプレイにより構成された表示部１５０と、印刷機構１６０と、カードインターフェース（カードＩ／Ｆ）１７０と、を備えている。プリンター１００は、さらに、他の機器（例えばデジタルスチルカメラやパーソナルコンピューター）とのデータ通信をおこなうためのインターフェースを備えていてもよい。プリンター１００の各構成要素は、バスを介して双方向通信可能に接続されている。 A. Example:
A1. Configuration of image processing device:
FIG. 1 is an explanatory diagram schematically showing a configuration of a printer 100 as an image processing apparatus in the present embodiment of the present invention. The printer 100 of this embodiment is an ink jet color printer that supports so-called direct printing, in which an image is printed based on image data acquired from a memory card MC or the like. The printer 100 includes a CPU 110 that controls each unit of the printer 100, an internal memory 120 configured by a ROM and a RAM, an operation unit 140 configured by buttons and a touch panel, a display unit 150 configured by a liquid crystal display, and printing. A mechanism 160 and a card interface (card I / F) 170 are provided. The printer 100 may further include an interface for performing data communication with other devices (for example, a digital still camera or a personal computer). Each component of the printer 100 is connected via a bus so that bidirectional communication is possible.

印刷機構１６０は、印刷データに基づき印刷をおこなう。カードインターフェース１７０は、カードスロット１７２に挿入されたメモリーカードＭＣとの間でデータのやり取りをおこなうためのインターフェースである。なお、本実施例では、メモリーカードＭＣに画像データを含む画像ファイルが格納されている。 The printing mechanism 160 performs printing based on the print data. The card interface 170 is an interface for exchanging data with the memory card MC inserted into the card slot 172. In this embodiment, an image file including image data is stored in the memory card MC.

内部メモリー１２０には、画像処理部２００と、表示処理部３１０と、印刷処理部３２０と、が格納されている。画像処理部２００は、コンピュータープログラムであり、所定のオペレーティングシステムの下で、ＣＰＵ１１０により実行されることで顔特徴検出処理や表情判定処理をおこなう。顔特徴検出処理は、顔画像における所定の特徴部位（例えば目尻や鼻頭やフェイスライン）の位置を検出する処理である。顔特徴検出処理については、後に詳述する。表示処理部３１０、および、印刷処理部３２０についてもＣＰＵ１１０により実行されることでぞれぞれの機能を実現する。 The internal memory 120 stores an image processing unit 200, a display processing unit 310, and a print processing unit 320. The image processing unit 200 is a computer program, and performs facial feature detection processing and facial expression determination processing by being executed by the CPU 110 under a predetermined operating system. The face feature detection process is a process of detecting the position of a predetermined feature part (for example, the corner of the eye, the nose head, or the face line) in the face image. The face feature detection process will be described in detail later. The display processing unit 310 and the print processing unit 320 are also executed by the CPU 110 to realize the respective functions.

画像処理部２００は、プログラムモジュールとして、設定部２１０と、特徴位置検出部２２０と、顔領域検出部２３０と、テクスチャー検出部２４０と、表情判定部２５０と、を含んでいる。特徴位置検出部２２０とテクスチャー検出部２４０は、それぞれ生成部２２２，２４２と、算出部２２４，２４４と、補正部２２６，２４６と、を含んでいる。表情判定部２５０は、選択部２５２と、判定値算出部２５４と、判定部２５６と、を含んでいる。これら各部の機能については、後述の顔特徴検出処理の説明において詳述する。 The image processing unit 200 includes a setting unit 210, a feature position detection unit 220, a face area detection unit 230, a texture detection unit 240, and an expression determination unit 250 as program modules. The feature position detection unit 220 and the texture detection unit 240 include generation units 222 and 242, calculation units 224 and 244, and correction units 226 and 246, respectively. The facial expression determination unit 250 includes a selection unit 252, a determination value calculation unit 254, and a determination unit 256. The functions of these units will be described in detail in the description of the face feature detection process described later.

表示処理部３１０は、表示部１５０を制御して、表示部１５０上に処理メニューやメッセージ、画像等を表示させるディスプレイドライバーである。印刷処理部３２０は、画像データから印刷データを生成し、印刷機構１６０を制御して、印刷データに基づく画像の印刷を実行するためのコンピュータープログラムである。ＣＰＵ１１０は、内部メモリー１２０から、これらのプログラム（画像処理部２００、表示処理部３１０、印刷処理部３２０）を読み出して実行することにより、これら各部の機能を実現する。 The display processing unit 310 is a display driver that controls the display unit 150 to display processing menus, messages, images, and the like on the display unit 150. The print processing unit 320 is a computer program for generating print data from image data, controlling the printing mechanism 160, and printing an image based on the print data. The CPU 110 reads out and executes these programs (the image processing unit 200, the display processing unit 310, and the print processing unit 320) from the internal memory 120, thereby realizing the functions of these units.

内部メモリー１２０には、また、ＡＡＭ情報ＡＭＩが格納されている。ＡＡＭ情報ＡＭＩは、後述のＡＡＭ設定処理によって予め設定される情報であり、後述の顔特徴検出処理において参照される。ＡＡＭ情報ＡＭＩの内容については、後述のＡＡＭ設定処理の説明において詳述する。 The internal memory 120 also stores AAM information AMI. The AAM information AMI is information set in advance by an AAM setting process described later, and is referred to in a face feature detection process described later. The contents of the AAM information AMI will be described in detail in the description of AAM setting processing described later.

Ａ２．ＡＡＭ設定処理：
図２は、本実施例におけるＡＡＭ設定処理の流れを示すフローチャートである。ＡＡＭ設定処理は、ＡＡＭ（アクティブアピアランスモデル（ＡｃｔｉｖｅＡｐｐｅａｒａｎｃｅＭｏｄｅｌ））と呼ばれる画像のモデル化に用いられる形状モデルおよびテクスチャーモデルを設定する処理である。形状モデルおよびテクスチャーモデルは、特許請求の範囲の「顔モデル」に相当する。本実施例において、ＡＡＭ設定処理は、オペレーターがコンピューターを使用することによりおこなわれる。コンピューターは、ＣＰＵとＲＡＭとＲＯＭとＨＤＤとディスプレイと入力装置等を有しており、これらがバスによって接続されている。ＣＰＵはＨＤＤに記録されたプログラムを読み出し、該プログラムにしたがった演算処理を実行することにより、コンピューターが後述する各処理を実行させる。なお、形状モデルおよびテクスチャーモデルを設定するコンピューターは特許請求の範囲の「設定部」に相当する。 A2. AAM setting process:
FIG. 2 is a flowchart showing the flow of AAM setting processing in this embodiment. The AAM setting process is a process for setting a shape model and a texture model used for modeling an image called AAM (Active Appearance Model). The shape model and the texture model correspond to a “face model” in the claims. In the present embodiment, the AAM setting process is performed by an operator using a computer. The computer has a CPU, a RAM, a ROM, an HDD, a display, an input device, and the like, which are connected by a bus. The CPU reads out a program recorded in the HDD and executes arithmetic processing according to the program, thereby causing the computer to execute each process described later. The computer that sets the shape model and the texture model corresponds to a “setting unit” in the claims.

はじめに、オペレーターは、人物の顔を含んだ複数の画像をサンプル画像ＳＩとして用意する（ステップＳ１１０）。図３は、サンプル画像ＳＩの一例を示す説明図である。図３に示すように、サンプル画像ＳＩは、個性・人種・性別、表情（喜顔、悲顔、驚顔、怒顔）、向き（正面向き、上向き、下向き、右向き、左向き等）といった種々の属性に関して互いに相違する顔画像が含まれるように用意される。サンプル画像ＳＩがそのように用意されれば、ＡＡＭによってあらゆる顔画像を精度良くモデル化することが可能となり、あらゆる顔画像を対象とした精度の良い顔特徴検出処理（後述）の実行が可能となる。なお、サンプル画像ＳＩは、学習用画像とも呼ばれる。 First, the operator prepares a plurality of images including a person's face as a sample image SI (step S110). FIG. 3 is an explanatory diagram showing an example of the sample image SI. As shown in FIG. 3, the sample image SI has various characteristics such as personality / race / gender, facial expression (happy face, sad face, surprised face, angry face), direction (front direction, upward direction, downward direction, right direction, left direction, etc.). Are prepared so as to include face images that are different from each other with respect to the attributes of. If the sample image SI is prepared in such a manner, any face image can be accurately modeled by AAM, and accurate face feature detection processing (described later) for any face image can be executed. Become. The sample image SI is also called a learning image.

それぞれのサンプル画像ＳＩに含まれる顔画像に、特徴点ＣＰを設定する（ステップＳ１２０）。図４は、サンプル画像ＳＩにおける特徴点ＣＰの設定方法の一例を示す説明図である。特徴点ＣＰは、顔画像における所定の特徴部位の位置を示す点である。本実施例では、所定の特徴部位として、人物の顔における眉毛上の所定位置（例えば端点や４分割点等、以下同じ）、目の輪郭上の所定位置、鼻筋および小鼻の輪郭上の所定位置、上下唇の輪郭上の所定位置、顔の輪郭（フェイスライン）上の所定位置といった６８箇所の部位が設定されている。すなわち、本実施例では、人物の顔に共通して含まれる顔の器官（眉毛、目、鼻、口）および顔の輪郭における所定位置を、特徴部位として設定する。図４に示すように、特徴点ＣＰは、各サンプル画像ＳＩにおいてオペレーターにより指定された６８個の特徴部位を表す位置に設定（配置）される。このように設定された各特徴点ＣＰは各特徴部位に対応しているため、顔画像における特徴点ＣＰの配置は顔の形状を特定していると表現することができる。 A feature point CP is set to the face image included in each sample image SI (step S120). FIG. 4 is an explanatory diagram showing an example of a method for setting the feature point CP in the sample image SI. The feature point CP is a point indicating the position of a predetermined feature part in the face image. In this embodiment, as predetermined feature parts, a predetermined position on the eyebrows in a person's face (for example, an end point or a four-divided point, the same applies hereinafter), a predetermined position on the eye contour, and a predetermined position on the nose and nose contours 68 parts such as a predetermined position on the contour of the upper and lower lips and a predetermined position on the contour of the face (face line) are set. That is, in the present embodiment, a predetermined position in the facial organs (eyebrows, eyes, nose, mouth) and facial contour that are commonly included in the human face are set as the characteristic parts. As shown in FIG. 4, the feature points CP are set (arranged) at positions representing 68 feature parts designated by the operator in each sample image SI. Since each feature point CP set in this way corresponds to each feature part, the arrangement of the feature points CP in the face image can be expressed as specifying the shape of the face.

サンプル画像ＳＩにおける特徴点ＣＰの位置は、座標により特定される。図５は、サンプル画像ＳＩに設定された特徴点ＣＰの座標の一例を示す説明図である。図５において、ＳＩ（ｇ）（ｇ＝１，２，３・・・）は各サンプル画像ＳＩを示しており、ＣＰ（ｋ）（ｋ＝０，１，・・・，６７）は各特徴点ＣＰを示している。また、ＣＰ（ｋ）−Ｘは、特徴点ＣＰ（ｋ）のＸ座標を示しており、ＣＰ（ｋ）−Ｙは、特徴点ＣＰ（ｋ）のＹ座標を示している。特徴点ＣＰの座標としては、顔の大きさと顔の傾き（画像面内の傾き）と顔のＸ方向およびＹ方向の位置とのそれぞれについて正規化されたサンプル画像ＳＩにおける所定の基準点（例えば画像の左下の点）を原点とした座標が用いられる。また、本実施例では、１つのサンプル画像ＳＩに複数の人物の顔画像が含まれる場合が許容されており（例えばサンプル画像ＳＩ（２）には２人の顔画像が含まれている）、１つのサンプル画像ＳＩにおける各人物は人物ＩＤによって特定される。各サンプル画像ＳＩと、各特徴点ＣＰの座標と、人物ＩＤとは、コンピューターに入力される。 The position of the feature point CP in the sample image SI is specified by coordinates. FIG. 5 is an explanatory diagram showing an example of the coordinates of the feature point CP set in the sample image SI. In FIG. 5, SI (g) (g = 1, 2, 3...) Indicates each sample image SI, and CP (k) (k = 0, 1,..., 67) indicates each feature. Point CP is shown. CP (k) -X represents the X coordinate of the feature point CP (k), and CP (k) -Y represents the Y coordinate of the feature point CP (k). The coordinates of the feature point CP include predetermined reference points (for example, in the sample image SI normalized with respect to the size of the face, the inclination of the face (inclination in the image plane), and the position of the face in the X direction and the Y direction) The coordinates with the origin at the lower left point of the image are used. Further, in the present embodiment, a case where a plurality of human face images are included in one sample image SI is allowed (for example, two sample face images are included in the sample image SI (2)). Each person in one sample image SI is specified by a person ID. Each sample image SI, the coordinates of each feature point CP, and the person ID are input to the computer.

つづいて、オペレーターは、ＡＡＭの形状モデルの設定をおこなう（ステップＳ１３０）。具体的には、各サンプル画像ＳＩにおける６８個の特徴点ＣＰの座標（Ｘ座標およびＹ座標）により構成される座標ベクトル（図５参照）に対する主成分分析をコンピューターがおこない、さらにコンピューターが特徴点ＣＰの位置により特定される顔の形状ｓを下記の式（１）によりモデル化する。なお、形状モデルは、特徴点ＣＰの配置モデルとも呼ぶ。 Subsequently, the operator sets the AAM shape model (step S130). Specifically, the computer performs principal component analysis on a coordinate vector (see FIG. 5) composed of the coordinates (X coordinate and Y coordinate) of 68 feature points CP in each sample image SI, and the computer further uses the feature points. The face shape s specified by the position of the CP is modeled by the following equation (1). The shape model is also referred to as a feature point CP arrangement model.

前記式（１）において、ｓ₀は平均形状である。図６は、平均形状ｓ₀の一例を示す説明図である。図６（ａ）および（ｂ）に示すように、平均形状ｓ₀は、サンプル画像ＳＩの各特徴点ＣＰについての平均位置（平均座標）により特定される平均的な顔の形状を表すモデルである。なお、本実施例では、平均形状ｓ₀において、外周に位置する特徴点ＣＰ（フェイスラインおよび眉毛、眉間に対応する特徴点ＣＰ、図４参照）を結ぶ直線により囲まれた領域（図６（ｂ）においてハッチングを付して示す）を「平均形状領域ＢＳＡ」と呼ぶ。コンピューターは、平均形状ｓ₀に対して、図６（ａ）に示すように、特徴点ＣＰを頂点とする複数の三角形領域ＴＡを、平均形状領域ＢＳＡをメッシュ状に分割するように設定する。 In the formula (1), s ₀ is an average shape. FIG. 6 is an explanatory diagram showing an example of the average shape s ₀ . As shown in FIGS. 6A and 6B, the average shape s ₀ is a model representing the average face shape specified by the average position (average coordinate) for each feature point CP of the sample image SI. is there. In the present embodiment, in the average shape s ₀ , a region surrounded by a straight line connecting feature points CP (face lines, eyebrows, feature points CP corresponding to the eyebrows, see FIG. 4) located on the outer periphery (FIG. 6 ( b) is indicated by hatching) and is referred to as “average shape region BSA”. As shown in FIG. 6A, the computer sets a plurality of triangular areas TA having the feature points CP as vertices so as to divide the average shape area BSA into a mesh shape with respect to the average shape s ₀ .

形状モデルを表す前記式（１）において、ｓ_iは形状ベクトルであり、ｐ_iは形状ベクトルｓ_iの重みを表す形状パラメーターである。形状ベクトルｓ_iは、顔の形状ｓの特性を表すベクトルであり、主成分分析により得られる第ｉ主成分に対応する固有ベクトルである。前記式（１）に示すように、本実施例における形状モデルでは、特徴点ＣＰの配置を表す顔形状ｓが、平均形状ｓ₀とｎ個の形状ベクトルｓ_iの線形結合との和としてモデル化される。前記式（１ａ）が示すように、形状モデルにおいて形状パラメーターｐ_iを適切に設定することにより、あらゆる画像における顔の形状ｓを再現することが可能である。また、前記式（１ｂ）が示すように、形状モデルにおいて形状パラメーターｐ_iを適切に設定することにより、あらゆる形状ｓから平均形状ｓ₀へと逆変換することができる。 In the equation (1) representing the shape model, s _i is a shape vector, and p _i is a shape parameter representing the weight of the shape vector s _i . The shape vector s _i is a vector representing the characteristics of the face shape s and is an eigenvector corresponding to the i-th principal component obtained by principal component analysis. As shown in the equation (1), in the shape model in the present embodiment, the face shape s representing the arrangement of the feature points CP is the model as the sum of the average shape s ₀ and the linear combination of the n shape vectors s _i. It becomes. As shown in the equation (1a), it is possible to reproduce the face shape s in any image by appropriately setting the shape parameter p _i in the shape model. Further, as shown in the equation (1b), by appropriately setting the shape parameter p _i in the shape model, it is possible to reversely convert from any shape s to the average shape s ₀ .

図７は、形状ベクトルｓ_iおよび形状パラメーターｐ_iと、顔の形状ｓとの関係を例示した説明図である。図７（ａ）に示すように、コンピューターは、顔の形状ｓを特定するために、寄与率のより大きい主成分に対応する固有ベクトルから順に、累積寄与率の高い順に選択された個数ｎ（図７ではｎ個の主成分のうち累積寄与率が上位４個を図示。）の固有ベクトルを、形状ベクトルｓ_iとして採用する。形状ベクトルｓ_iのそれぞれは、図７（ａ）の矢印に示すように、各特徴点ＣＰの移動方向・移動量と対応している。本実施例では、最も寄与率の大きい第１主成分に対応する第１形状ベクトルｓ₁は顔の左右振りにほぼ相関するベクトルとなっており、形状パラメーターｐ₁を大小することにより、図７（ｂ）に示すように、顔の形状ｓの横方向の顔向きが変化する。２番目に寄与率の大きい第２主成分に対応する第２形状ベクトルｓ₂は顔の上下振りにほぼ相関するベクトルとなっており、形状パラメーターｐ₂を大小することにより、図７（ｃ）に示すように、顔の形状ｓの縦方向の顔向きが変化する。また、３番目に寄与率の大きい第３主成分に対応する第３形状ベクトルｓ₃は顔の形状の縦横比にほぼ相関するベクトルとなっており、４番目に寄与率の大きい第４主成分に対応する第４形状ベクトルｓ₄は口の開きの程度にほぼ相関するベクトルとなっている。このように、形状パラメーターの値は、顔の表情や、顔向きなど顔画像の特徴を表す。本実施例における「形状パラメーター」は特許請求の範囲における「特徴量」に該当する。 FIG. 7 is an explanatory diagram illustrating the relationship between the shape vector s _{i, the} shape parameter p _i, and the face shape s. As shown in FIG. 7 (a), in order to identify the face shape s, the computer selects the number n (shown in the order from the highest cumulative contribution ratio in order from the eigenvector corresponding to the principal component having the larger contribution ratio. 7, the eigenvector of the top four of the n principal components is shown as the shape vector s _i . Each of the shape vectors s _i corresponds to the movement direction / movement amount of each feature point CP, as indicated by the arrows in FIG. In this embodiment, most first shape vector s ₁ corresponding to the first principal component having a large contribution rate is a vector that is approximately correlated with the left and right appearance of a face, by the magnitude of the shape parameter p _1, 7 As shown in (b), the horizontal face direction of the face shape s changes. Second shape vector s ₂ corresponding to the large second principal component of the contribution rate to the second is a vector that is approximately correlated with the vertical appearance of a face, by the magnitude of the shape parameter p _2, FIG. 7 (c) As shown in FIG. 4, the vertical face direction of the face shape s changes. The third shape vector s ₃ corresponding to the third principal component having the third largest contribution ratio is a vector that is substantially correlated with the aspect ratio of the face shape, and the fourth principal component having the fourth largest contribution ratio. The fourth shape vector s ₄ corresponding to is a vector that is substantially correlated with the degree of mouth opening. As described above, the value of the shape parameter represents the feature of the face image such as facial expression and face orientation. The “shape parameter” in this embodiment corresponds to the “feature amount” in the claims.

なお、形状モデル設定ステップ（ステップＳ１３０）においてコンピューターが設定した平均形状ｓ₀および形状ベクトルｓ_iは、ＡＡＭ情報ＡＭＩ（図１）としてプリンター１００の内部メモリー１２０に格納される。つづいて、コンピューターはＡＡＭのテクスチャーモデルの設定をおこなう（ステップＳ１４０）。具体的には、まず、各サンプル画像ＳＩに対して、サンプル画像ＳＩにおける特徴点ＣＰの設定位置が平均形状ｓ₀における特徴点ＣＰの設定位置と等しくなるように、画像変換（以下、「ワープＷ」とも呼ぶ）をおこなう。 The average shape s ₀ and the shape vector s _i set by the computer in the shape model setting step (step S130) are stored in the internal memory 120 of the printer 100 as AAM information AMI (FIG. 1). Subsequently, the computer sets an AAM texture model (step S140). Specifically, first, for each sample image SI, image conversion (hereinafter referred to as “warp”) is performed so that the setting position of the feature point CP in the sample image SI is equal to the setting position of the feature point CP in the average shape s ₀ . Also called “W”).

図８は、サンプル画像ＳＩのワープＷの方法の一例を示す説明図である。コンピューターは、各サンプル画像ＳＩにおいて、平均形状ｓ₀と同様に、外周に位置する特徴点ＣＰにより囲まれた領域をメッシュ状に分割する複数の三角形領域ＴＡを設定する。ワープＷは、複数の三角形領域ＴＡのそれぞれについてのアフィン変換の集合である。すなわち、ワープＷにおいては、サンプル画像ＳＩにおけるある三角形領域ＴＡの画像は、平均形状ｓ₀における対応する三角形領域ＴＡの画像へとアフィン変換される。コンピューターは、ワープＷにより、特徴点ＣＰの設定位置が平均形状ｓ₀における特徴点ＣＰの設定位置と等しいサンプル画像ＳＩ（以下「サンプル画像ＳＩｗ」と表す）を生成する。 FIG. 8 is an explanatory diagram showing an example of a method of warping W of the sample image SI. In each sample image SI, similarly to the average shape s ₀ , the computer sets a plurality of triangular regions TA that divide the region surrounded by the feature points CP located on the outer periphery into a mesh shape. The warp W is a set of affine transformations for each of the plurality of triangular areas TA. That is, in the warp W, an image of a certain triangular area TA in the sample image SI is affine transformed into an image of the corresponding triangular area TA in the average shape s ₀ . The computer generates a sample image SI (hereinafter referred to as “sample image SIw”) in which the setting position of the feature point CP is equal to the setting position of the feature point CP in the average shape s ₀ by the warp W.

なお、各サンプル画像ＳＩｗは、平均形状領域ＢＳＡ（図８においてハッチングを付して示す）を内包する矩形枠を外周とし、平均形状領域ＢＳＡ以外の領域（以下「マスク領域ＭＡ」とも呼ぶ）がマスクされた画像として生成される。平均形状領域ＢＳＡとマスク領域ＭＡとを併せた画像領域を基準領域ＢＡと呼ぶ。また、コンピューターは、各サンプル画像ＳＩｗを、例えば５６画素×５６画素のサイズの画像として正規化する。 Each sample image SIw has a rectangular frame containing an average shape area BSA (shown with hatching in FIG. 8) as an outer periphery, and an area other than the average shape area BSA (hereinafter also referred to as “mask area MA”). Generated as a masked image. An image area in which the average shape area BSA and the mask area MA are combined is referred to as a reference area BA. In addition, the computer normalizes each sample image SIw as an image having a size of, for example, 56 pixels × 56 pixels.

次に、コンピューターは、各サンプル画像ＳＩｗの画素群ｘのそれぞれにおける輝度値により構成される輝度値ベクトルに対する主成分分析を行い、顔のテクスチャー（「見え」とも呼ぶ）Ａ（ｘ）を下記の式（２）によりモデル化する。なお、画素群ｘは、平均形状領域ＢＳＡに位置する画素の集合である。

Next, the computer performs a principal component analysis on a luminance value vector composed of luminance values in each pixel group x of each sample image SIw, and a facial texture (also referred to as “appearance”) A (x) is expressed as follows: Modeled by equation (2). The pixel group x is a set of pixels located in the average shape area BSA.

前記式（２）において、Ａ₀（ｘ）は平均顔画像である。図９は、平均顔画像Ａ₀（ｘ）の一例を示す説明図である。平均顔画像Ａ₀（ｘ）は、ワープＷの後のサンプル画像ＳＩｗ（図８参照）の平均の顔が表された画像である。すなわち、平均顔画像Ａ₀（ｘ）は、サンプル画像ＳＩｗの平均形状領域ＢＳＡ内の画素群ｘの画素値（輝度値）の平均をとることにより算出される画像である。従って、平均顔画像Ａ₀（ｘ）は、平均的な顔の形状における平均的な顔のテクスチャー（見え）を表すモデルである。なお、平均顔画像Ａ₀（ｘ）は、サンプル画像ＳＩｗと同様に、平均形状領域ＢＳＡとマスク領域ＭＡとで構成され、例えば５６画素×５６画素のサイズの画像として算出される。 In the formula (2), A ₀ (x) is an average face image. FIG. 9 is an explanatory diagram showing an example of the average face image A ₀ (x). The average face image A ₀ (x) is an image representing the average face of the sample image SIw (see FIG. 8) after the warp W. That is, the average face image A ₀ (x) is an image calculated by taking the average of the pixel values (luminance values) of the pixel group x in the average shape area BSA of the sample image SIw. Therefore, the average face image A ₀ (x) is a model representing the average face texture (appearance) in the average face shape. Note that the average face image A ₀ (x) is composed of the average shape area BSA and the mask area MA, similarly to the sample image SIw, and is calculated as an image having a size of 56 pixels × 56 pixels, for example.

テクスチャーモデルを表す前記式（２）において、ａ_k（ｘ）はテクスチャーベクトルであり、λ_kはテクスチャーベクトルａ_k（ｘ）の重みを表すテクスチャーパラメーターである。テクスチャーベクトルａ_k（ｘ）は、顔のテクスチャーＡ（ｘ）の特性を表すベクトルであり、具体的には、主成分分析により得られる第ｊ主成分に対応する固有ベクトルである。すなわち、寄与率のより大きい主成分に対応する固有ベクトルから順に、累積寄与率に基づき設定された個数ｍの固有ベクトルが、テクスチャーベクトルａ_k（ｘ）として採用される。本実施例では、最も寄与率の大きい第１主成分に対応する第１テクスチャーベクトルＡ₁（ｘ）は、顔色の変化（性別の差とも捉えられる）にほぼ相関するベクトルとなっている。 In the equation (2) representing the texture model, a _k (x) is a texture vector, and λ _k is a texture parameter representing the weight of the texture vector a _k (x). The texture vector a _k (x) is a vector representing the characteristics of the facial texture A (x), and specifically, is an eigenvector corresponding to the j-th principal component obtained by principal component analysis. That is, the number m of eigenvectors set based on the cumulative contribution rate is adopted as the texture vector a _k (x) in order from the eigenvector corresponding to the principal component having the larger contribution rate. In the present embodiment, the first texture vector A ₁ (x) corresponding to the first principal component having the largest contribution rate is a vector that is substantially correlated with a change in face color (also regarded as a gender difference).

前記式（２）に示すように、本実施例におけるテクスチャーモデルでは、顔の見えを表す顔のテクスチャーＡ（ｘ）が、平均顔画像Ａ₀（ｘ）とｍ個のテクスチャーベクトルａ_k（ｘ）の線形結合との和としてモデル化される。前記式（２ａ）に示すように、テクスチャーモデルにおいてテクスチャーパラメーターλ_kを適切に設定することにより、あらゆる画像における顔のテクスチャーＡ（ｘ）を再現することが可能である。逆に、前記式（２ｂ）に示すように、テクスチャーモデルにおいてテクスチャーパラメーターλ_kを適切に設定することにより、あらゆる画像における顔のテクスチャーＡ（ｘ）から平均顔画像Ａ₀（ｘ）へと逆変換することができる。 As shown in the equation (2), in the texture model in the present embodiment, the facial texture A (x) representing the appearance of the face is the average face image A ₀ (x) and m texture vectors a _k (x ) And the linear combination. As shown in the equation (2a), it is possible to reproduce the facial texture A (x) in any image by appropriately setting the texture parameter λ _k in the texture model. On the contrary, as shown in the equation (2b), by appropriately setting the texture parameter λ _k in the texture model, the face texture A (x) in every image is reversed to the average face image A ₀ (x). Can be converted.

なお、テクスチャーモデル設定ステップ（図２のステップＳ１４０）において設定された平均顔画像Ａ₀（ｘ）およびテクスチャーベクトルａ_k（ｘ）は、ＡＡＭ情報ＡＭＩ（図１）としてプリンター１００の内部メモリー１２０に格納される。コンピューターはＡＡＭ情報ＡＭＩをいずれかの記録媒体に記録しておき、プリンター１００の製造時において該記録媒体から読み出したＡＡＭ情報ＡＭＩを記録した内部メモリー１２０をプリンター１００に組み込むか、プリンター１００に組み込むまれた内部メモリー１２０に対して該記録媒体から読み出したＡＡＭ情報ＡＭＩを記録させる。 The average face image A ₀ (x) and the texture vector a _k (x) set in the texture model setting step (step S140 in FIG. 2) are stored in the internal memory 120 of the printer 100 as AAM information AMI (FIG. 1). Stored. The computer records the AAM information AMI on any recording medium, and when the printer 100 is manufactured, the internal memory 120 that records the AAM information AMI read from the recording medium is incorporated in the printer 100 or rarely incorporated in the printer 100. The AAM information AMI read from the recording medium is recorded in the internal memory 120.

ここまでの処理により、顔の形状をモデル化する形状モデルと、顔のテクスチャーをモデル化するテクスチャーモデルが設定される。設定された形状モデルとテクスチャーモデルとを組み合わせることにより、すなわち合成されたテクスチャーＡ（ｘ）に対して平均形状ｓ₀から形状ｓへの変換（図８に示したワープＷの逆変換）をおこなうことにより、あらゆる顔画像の形状およびテクスチャーを再現することが可能である。 Through the processing so far, a shape model for modeling the shape of the face and a texture model for modeling the texture of the face are set. By combining the set shape model and the texture model, that is, the synthesized texture A (x) is converted from the average shape s ₀ to the shape s (inverse conversion of the warp W shown in FIG. 8). Thus, it is possible to reproduce the shape and texture of any face image.

Ａ３．顔特徴検出処理：
図１０は、実施例における顔特徴検出処理の流れを示すフローチャートである。本実施例における顔特徴検出処理は、ＡＡＭを利用して注目画像に含まれる顔画像における特徴点ＣＰの配置を決定する特徴点ＣＰ設定位置補正処理（ステップＳ２５０）により、顔画像における特徴部位の位置を検出し、さらにテクスチャー検出処理（ステップＳ２６０）により顔のテクスチャーを検出する処理である。上述したように、本実施例では、ＡＡＭ設定処理（図２）において、人物の顔の器官（眉毛、目、鼻、口）および顔の輪郭における計６８箇所の所定位置が、特徴部位として設定されている（図４参照）。そのため、本実施例の顔特徴検出処理では、人物の顔の器官および顔の輪郭における所定位置を示す６８個の特徴点ＣＰの配置が決定される。 A3. Face feature detection processing:
FIG. 10 is a flowchart illustrating a flow of face feature detection processing in the embodiment. In the face feature detection processing in this embodiment, the feature point CP setting position correction processing (step S250) for determining the arrangement of the feature points CP in the face image included in the target image using AAM is performed. In this process, the position is detected, and the texture of the face is further detected by the texture detection process (step S260). As described above, in the present embodiment, in the AAM setting process (FIG. 2), a total of 68 predetermined positions in the human facial organs (eyebrows, eyes, nose, mouth) and facial contour are set as the characteristic parts. (See FIG. 4). Therefore, in the face feature detection process of the present embodiment, the arrangement of 68 feature points CP indicating predetermined positions in the human face organ and face contour is determined.

なお、顔特徴検出処理によって顔画像における特徴点ＣＰの配置が決定されると、顔画像についての形状パラメーターｐ_iの値が特定される。従って、顔特徴検出処理の処理結果は特定の向き（例えば右向きや下向き）の顔画像を検出するための顔向き判定、顔の形状を変形する顔変形、顔の陰影補正等に利用可能である。 When the arrangement of the feature points CP in the face image is determined by the face feature detection process, the value of the shape parameter p _i for the face image is specified. Accordingly, the processing result of the face feature detection process can be used for face orientation determination for detecting a face image in a specific direction (for example, rightward or downward), face deformation for deforming the face shape, face shading correction, and the like. .

はじめに、画像処理部２００（図１）は、顔特徴検出処理の対象となる注目画像を表す画像データを取得する（ステップＳ２１０）。本実施例のプリンター１００では、カードスロット１７２にメモリーカードＭＣが挿入されると、メモリーカードＭＣに格納された画像ファイルのサムネイル画像が表示部１５０に表示される。処理の対象となる１つまたは複数の画像は、操作部１４０を介してユーザーにより選択される。画像処理部２００は、選択された１つまたは複数の画像に対応する画像データを含む画像ファイルをメモリーカードＭＣより取得して内部メモリー１２０の所定の領域に格納する。なお、取得された画像データを注目画像データと呼び、注目画像データの表す画像を注目画像ＯＩと呼ぶものとする。 First, the image processing unit 200 (FIG. 1) acquires image data representing an attention image that is a target of face feature detection processing (step S210). In the printer 100 of the present embodiment, when the memory card MC is inserted into the card slot 172, thumbnail images of image files stored in the memory card MC are displayed on the display unit 150. One or more images to be processed are selected by the user via the operation unit 140. The image processing unit 200 acquires an image file including image data corresponding to one or more selected images from the memory card MC and stores it in a predetermined area of the internal memory 120. Note that the acquired image data is referred to as attention image data, and an image represented by the attention image data is referred to as attention image OI.

顔領域検出部２３０（図１）は、注目画像ＯＩに含まれる顔画像の少なくとも一部を含む画像領域を顔領域ＦＡとして検出する（ステップＳ２３０）。顔領域ＦＡの検出は、公知の顔検出手法を用いておこなうことができる。公知の顔検出手法としては、例えば、パターンマッチングによる手法や肌色領域抽出による手法、サンプル画像を用いた学習（例えばニューラルネットワークを用いた学習や、ブースティングを用いた学習、サポートベクターマシーンを用いた学習等）により設定される学習データを用いる手法等がある。 The face area detection unit 230 (FIG. 1) detects an image area including at least a part of the face image included in the target image OI as the face area FA (step S230). The detection of the face area FA can be performed using a known face detection method. Known face detection methods include, for example, pattern matching methods, skin color region extraction methods, learning using sample images (for example, learning using neural networks, learning using boosting, and support vector machines). For example, a method using learning data set by learning).

図１１は、注目画像ＯＩにおける顔領域ＦＡの検出結果の一例を示す説明図である。図１１には、注目画像ＯＩにおいて検出された顔領域ＦＡが示されている。本実施例では、おおよそ、顔の上下方向は額から顎まで、左右方向は両耳の外側まで含む矩形の領域が顔領域ＦＡとして検出されるような顔検出手法が用いられている。 FIG. 11 is an explanatory diagram illustrating an example of a detection result of the face area FA in the target image OI. FIG. 11 shows the face area FA detected in the target image OI. In the present embodiment, a face detection method is used in which a rectangular area including the vertical direction of the face from the forehead to the chin and the horizontal direction from the outside of both ears is detected as the face area FA.

設定部２１０（図１）は、注目画像ＯＩにおける特徴点ＣＰの初期位置を設定する（ステップＳ２４０）。図１２は、本実施例における特徴点ＣＰの初期位置設定処理の流れを示すフローチャートである。本実施例では、設定部２１０は、顔領域ＦＡに対する顔画像の大きさ、傾き、位置（上下方向の位置および左右方向の位置）を表すグローバルパラメーターの値を種々変更して、特徴点ＣＰを注目画像ＯＩ上の仮設定位置に設定する（ステップＳ３１０）。 The setting unit 210 (FIG. 1) sets the initial position of the feature point CP in the target image OI (Step S240). FIG. 12 is a flowchart showing a flow of initial position setting processing of the feature point CP in the present embodiment. In the present embodiment, the setting unit 210 changes the values of global parameters representing the size, inclination, and position (vertical position and horizontal position) of the face image with respect to the face area FA to change the feature point CP. The temporary setting position on the attention image OI is set (step S310).

図１３は、グローバルパラメーターの値を変更することによる特徴点ＣＰの仮設定位置の一例を示す説明図である。図１３（ａ）および図１３（ｂ）には、注目画像ＯＩにおける特徴点ＣＰおよび特徴点ＣＰをつないで形成されるメッシュが示されている。設定部２１０は、図１３（ａ）および図１３（ｂ）の中央に示すように、顔領域ＦＡの中央部に平均形状ｓ₀が形成されるような特徴点ＣＰの仮設定位置（以下、「基準仮設定位置」とも呼ぶ）を設定する。 FIG. 13 is an explanatory diagram showing an example of a temporary setting position of the feature point CP by changing the value of the global parameter. 13A and 13B show a mesh formed by connecting feature points CP and feature points CP in the target image OI. As shown in the center of FIGS. 13A and 13B, the setting unit 210 temporarily sets the feature point CP (hereinafter, referred to as an average shape s _{0) at} which the average shape s ₀ is formed at the center of the face area FA. (Also referred to as “reference temporary setting position”).

設定部２１０は、また、基準仮設定位置に対して、グローバルパラメーターの値を種々変更させた複数の仮設定位置を設定する。グローバルパラメーター（大きさ、傾き、上下方向の位置および左右方向の位置）を変更することは、注目画像ＯＩにおいて特徴点ＣＰにより形成されるメッシュが拡大・縮小、傾きを変更、並行移動することに相当する。従って、設定部２１０は、図１３（ａ）に示すように、基準仮設定位置のメッシュを所定倍率で拡大または縮小したメッシュを形成するような仮設定位置（基準仮設定位置の図の下および上に示す）や、所定角度だけ時計回りまたは半時計回りに傾きを変更したメッシュを形成するような仮設定位置（基準仮設定位置の図の右および左に示す）を設定する。また、設定部２１０は、基準仮設定位置のメッシュに対して、拡大・縮小および傾きの変更を組み合わせた変換を行ったメッシュを形成するような仮設定位置（基準仮設定位置の図の左上、左下、右上、右下に示す）も設定する。 The setting unit 210 also sets a plurality of temporary setting positions obtained by changing various global parameter values with respect to the reference temporary setting position. Changing the global parameters (size, inclination, vertical position and horizontal position) means that the mesh formed by the feature point CP in the target image OI is enlarged / reduced, the inclination is changed, and the parallel movement is performed. Equivalent to. Accordingly, as shown in FIG. 13 (a), the setting unit 210 forms a temporary setting position that forms a mesh obtained by enlarging or reducing the reference temporary setting position mesh at a predetermined magnification (below the reference temporary setting position diagram and Or a temporary setting position (shown on the right and left in the drawing of the reference temporary setting position) that forms a mesh whose inclination is changed clockwise or counterclockwise by a predetermined angle. Further, the setting unit 210 forms a temporary setting position (upper left of the figure of the reference temporary setting position, which forms a mesh obtained by performing a combination of enlargement / reduction and change in inclination with respect to the mesh of the reference temporary setting position. Also set in the lower left, upper right, and lower right).

また、図１３（ｂ）に示すように、設定部２１０は、基準仮設定位置のメッシュを所定量だけ上または下に並行移動したメッシュを形成するような仮設定位置（基準仮設定位置の図の上および下に示す）や、左または右に並行移動したメッシュを形成するような仮設定位置（基準仮設定位置の図の左および右に示す）を設定する。また、設定部２１０は、基準仮設定位置のメッシュに対して、上下および左右の並行移動を組み合わせた変換を行ったメッシュを形成するような仮設定位置（基準仮設定位置の図の左上、左下、右上、右下に示す）も設定する。 Also, as shown in FIG. 13B, the setting unit 210 forms a temporary setting position (a reference temporary setting position diagram) that forms a mesh that has been translated upward or downward by a predetermined amount from the reference temporary setting position mesh. And a temporary setting position (shown on the left and right in the drawing of the reference temporary setting position) that forms a mesh moved in parallel to the left or right. Further, the setting unit 210 forms a temporary setting position (upper left and lower left in the figure of the reference temporary setting position) that forms a mesh obtained by performing a combination of vertical and horizontal parallel movements on the mesh of the reference temporary setting position. , Shown in the upper right and lower right).

設定部２１０は、図１３（ａ）に示す基準仮設定位置以外の８つの仮設定位置のそれぞれにおけるメッシュに対して図１３（ｂ）に示す上下左右の並行移動が実行される仮設定位置も設定する。従って、本実施例では、４つのグローバルパラメーター（大きさ、傾き、上下方向の位置、左右方向の位置）をそれぞれ既知の３段階の値として組み合わせにより設定される８０通り（＝３×３×３×３−１）の仮設定位置と、基準仮設定位置の合計８１通りの仮設定位置が設定される。 The setting unit 210 also has temporary setting positions at which parallel movement in the vertical and horizontal directions shown in FIG. 13B is executed with respect to the mesh at each of the eight temporary setting positions other than the reference temporary setting position shown in FIG. Set. Therefore, in the present embodiment, the four global parameters (size, inclination, vertical position, horizontal position) are set in 80 ways (= 3 × 3 × 3) that are set as known three-level values. A total of 81 temporary setting positions of (3-1) temporary setting positions and reference temporary setting positions are set.

生成部２２２（図１）は、設定された各仮設定位置に対応する平均形状画像Ｉ（Ｗ（ｘ；ｐ））を生成する（ステップＳ３２０）。図１４は、平均形状画像Ｉ（Ｗ（ｘ；ｐ））の一例を示す説明図である。平均形状画像Ｉ（Ｗ（ｘ；ｐ））は、入力画像における特徴点ＣＰの配置が平均形状ｓ₀における特徴点ＣＰの配置と等しくなるような変換によって算出される。 The generation unit 222 (FIG. 1) generates an average shape image I (W (x; p)) corresponding to each set temporary setting position (step S320). FIG. 14 is an explanatory diagram showing an example of the average shape image I (W (x; p)). The average shape image I (W (x; p)) is calculated by conversion such that the arrangement of the feature points CP in the input image is equal to the arrangement of the feature points CP in the average shape s ₀ .

平均形状画像Ｉ（Ｗ（ｘ；ｐ））を算出するための変換は、サンプル画像ＳＩｗ算出のための変換（図８参照）と同様に、三角形領域ＴＡ毎のアフィン変換の集合であるワープＷにより行われる。具体的には、注目画像ＯＩに配置された特徴点ＣＰ（図１３参照）によって平均形状領域ＢＳＡ（外周に位置する特徴点ＣＰにより囲まれた領域）が特定され、注目画像ＯＩにおける平均形状領域ＢＳＡに対して三角形領域ＴＡ毎のアフィン変換が行われることにより、平均形状画像Ｉ（Ｗ（ｘ；ｐ））が算出される。本実施例では、平均形状画像Ｉ（Ｗ（ｘ；ｐ））は、平均顔画像Ａ₀（ｘ）と同様に平均形状領域ＢＳＡおよびマスク領域ＭＡにより構成され、平均顔画像Ａ₀（ｘ）と同サイズの画像として算出される。 The transformation for calculating the average shape image I (W (x; p)) is a warp W that is a set of affine transformations for each triangular area TA, similarly to the transformation for calculating the sample image SIw (see FIG. 8). Is done. Specifically, the average shape region BSA (region surrounded by the feature points CP located on the outer periphery) is specified by the feature points CP (see FIG. 13) arranged in the attention image OI, and the average shape region in the attention image OI. An average shape image I (W (x; p)) is calculated by performing affine transformation for each triangular area TA on the BSA. In this embodiment, the average shape image I (W (x; p) ) is composed of an average face image A ₀ (x) and an average shape area BSA and a mask area MA, average face image A ₀ (x) And is calculated as an image of the same size.

なお、上述したように、画素群ｘは、平均形状ｓ₀における平均形状領域ＢＳＡに位置する画素の集合である。ワープＷ実行後の画像（平均形状ｓ₀を有する顔画像）における画素群ｘに対応するワープＷ実行前の画像（注目画像ＯＩの平均形状領域ＢＳＡ）における画素群をＷ（ｘ；ｐ）と表す。平均形状画像は、注目画像ＯＩの平均形状領域ＢＳＡにおける画素群Ｗ（ｘ；ｐ）のそれぞれにおける輝度値により構成される画像であるため、Ｉ（Ｗ（ｘ；ｐ））と表される。図１４には、図１３（ａ）に示した９個の仮設定位置に対応する９個の平均形状画像Ｉ（Ｗ（ｘ；ｐ））を示している。 As described above, the pixel group x is a set of pixels located in the average shape region BSA in the average shape s ₀ . The pixel group in the image (average shape area BSA of the target image OI) before execution of the warp W corresponding to the pixel group x in the image after execution of the warp W (face image having the average shape s ₀ ) is expressed as W (x; p). To express. Since the average shape image is an image composed of luminance values in each of the pixel groups W (x; p) in the average shape region BSA of the target image OI, it is represented as I (W (x; p)). FIG. 14 shows nine average shape images I (W (x; p)) corresponding to the nine temporarily set positions shown in FIG.

算出部２２４（図１）は、各仮設定位置に対応する平均形状画像Ｉ（Ｗ（ｘ；ｐ））と、特許請求の範囲の「比較顔画像」としての平均顔画像Ａ₀（ｘ）との差分画像Ｉｅを算出する（ステップＳ３３０）。特徴点ＣＰの仮設定位置は８１種類設定されているため、算出部２２４（図１）は、８１個の差分画像Ｉｅを算出することとなる。 The calculation unit 224 (FIG. 1) calculates an average shape image I (W (x; p)) corresponding to each temporarily set position and an average face image A ₀ (x) as a “comparison face image” in the claims. The difference image Ie is calculated (step S330). Since 81 types of temporary setting positions of the feature points CP are set, the calculation unit 224 (FIG. 1) calculates 81 difference images Ie.

設定部２１０は、各差分画像Ｉｅの画素値からノルムを算出し、ノルムの値が最も小さい差分画像Ｉｅに対応する仮設置位置（以下「ノルム最小仮設定位置」とも呼ぶ）を、注目画像ＯＩにおける特徴点ＣＰの初期位置として設定する（ステップＳ３４０）。ノルムを算出するための画素値は輝度値であってもよいしＲＧＢ値であってもよい。以上により特徴点ＣＰ初期位置設定処理が完了する。なお、ノルムの値が小さいほど平均形状画像Ｉ（Ｗ（ｘ；ｐ））と、平均顔画像Ａ₀（ｘ）との差が小さくなる（両者が近似する）。 The setting unit 210 calculates a norm from the pixel values of each difference image Ie, and sets a temporary installation position corresponding to the difference image Ie having the smallest norm value (hereinafter also referred to as “norm minimum temporary setting position”) as the target image OI. Is set as the initial position of the feature point CP at (step S340). The pixel value for calculating the norm may be a luminance value or an RGB value. Thus, the feature point CP initial position setting process is completed. Note that the smaller the norm value, the smaller the difference between the average shape image I (W (x; p)) and the average face image A ₀ (x) (both approximate).

Ａ３−１．特徴点ＣＰ設定位置補正処理：
特徴点ＣＰ初期位置設定処理が完了すると、特徴位置検出部２２０（図１）は、注目画像ＯＩにおける特徴点ＣＰの設定位置の補正をおこなう（ステップＳ２５０）。図１５は、本実施例における特徴点ＣＰ設定位置補正処理の流れを示すフローチャートである。 A3-1. Feature point CP setting position correction processing:
When the feature point CP initial position setting process is completed, the feature position detector 220 (FIG. 1) corrects the set position of the feature point CP in the target image OI (step S250). FIG. 15 is a flowchart showing the flow of the feature point CP setting position correction process in the present embodiment.

生成部２２２（図１）は、注目画像ＯＩから平均形状画像Ｉ（Ｗ（ｘ；ｐ））を算出する（ステップＳ４１０）。平均形状画像Ｉ（Ｗ（ｘ；ｐ））の算出方法は、特徴点ＣＰ初期位置設定処理におけるステップＳ３２０と同様である。 The generation unit 222 (FIG. 1) calculates an average shape image I (W (x; p)) from the attention image OI (step S410). The calculation method of the average shape image I (W (x; p)) is the same as that in step S320 in the feature point CP initial position setting process.

特徴位置検出部２２０は、平均形状画像Ｉ（Ｗ（ｘ；ｐ））と平均顔画像Ａ₀（ｘ）との差分画像Ｉｅを算出する（ステップＳ４２０）。特徴位置検出部２２０は、差分画像Ｉｅに基づき、特徴点ＣＰの設定位置補正処理が収束したか否かを判定する（ステップＳ４３０）。特徴位置検出部２２０は、差分画像Ｉｅのノルムを算出し、ノルムの値が予め設定された閾値より小さい場合には収束したと判定し、ノルムの値が閾値以上の場合には未だ収束していないと判定する。なお、特徴位置検出部２２０は、算出された差分画像Ｉｅのノルムの値が前回のステップＳ４３０において算出された値よりも小さい場合には収束したと判定し、前回値以上である場合には未だ収束していないと判定するものとしてもよい。あるいは、特徴位置検出部２２０は、閾値による判定と前回値との比較による判定とを組み合わせて収束判定をおこなうものとしてもよい。例えば、特徴位置検出部２２０は、算出されたノルムの値が、閾値より小さく、かつ、前回値より小さい場合にのみ収束したと判定し、それ以外の場合には未だ収束していないと判定するものとしてもよい。 The feature position detection unit 220 calculates a difference image Ie between the average shape image I (W (x; p)) and the average face image A ₀ (x) (step S420). The feature position detection unit 220 determines whether or not the feature point CP setting position correction process has converged based on the difference image Ie (step S430). The feature position detection unit 220 calculates the norm of the difference image Ie, determines that it has converged if the norm value is smaller than a preset threshold value, and still converges if the norm value is greater than or equal to the threshold value. Judge that there is no. The feature position detection unit 220 determines that the calculated norm value of the difference image Ie has converged when the value is smaller than the value calculated in the previous step S430, and still determines that the value is equal to or greater than the previous value. It is good also as what determines with not having converged. Alternatively, the feature position detection unit 220 may perform the convergence determination by combining the determination based on the threshold and the determination based on comparison with the previous value. For example, the feature position detection unit 220 determines that the calculated norm value has converged only when the calculated norm value is smaller than the threshold value and smaller than the previous value, and otherwise determines that it has not yet converged. It may be a thing.

前記のステップＳ４３０の収束判定において未だ収束していないと判定された場合には、補正部２２６（図１）がパラメーター更新量ΔＰを算出する（ステップＳ４５０）。パラメーター更新量ΔＰは、４個のグローバルパラメーター（全体としての大きさ、傾き、Ｘ方向位置、Ｙ方向位置）、および、特徴量であるｎ個の形状パラメーターｐ_i（ｉ＝１〜ｎの整数。）の値の変更量を意味している。なお、特徴点ＣＰを初期位置に設定した直後においては、グローバルパラメーターは、特徴点ＣＰ初期位置設定処理（図１２）において決定された値が設定されている。また、このときの特徴点ＣＰの初期位置と平均形状ｓ₀の特徴点ＣＰの設定位置との相違は、全体としての大きさ、傾き、位置の相違に限られるため、形状モデルにおける形状パラメーターｐ_iの値およびテクスチャーモデルにおけるテクスチャーパラメーターλ_kはすべてゼロである。 If it is determined in step S430 that the convergence is not yet completed, the correction unit 226 (FIG. 1) calculates the parameter update amount ΔP (step S450). The parameter update amount ΔP includes four global parameters (total size, inclination, X direction position, Y direction position), and n shape parameters p _i (i = 1 to n) as feature amounts. .) Means the change amount of the value. Immediately after setting the feature point CP to the initial position, the global parameter is set to the value determined in the feature point CP initial position setting process (FIG. 12). Further, difference between the initial position of the characteristic point CP and sets the position of the characteristic points CP of the average shape s ₀ in this case, the overall size, the tilt, because it is limited to the difference in position, shape parameters in the shape model p The value of _i and the texture parameter λ _k in the texture model are all zero.

パラメーター更新量ΔＰは、下記の式（３）により算出される。すなわち、パラメーター更新量ΔＰは、アップデートマトリックスＲと差分画像Ｉｅとの積である。 The parameter update amount ΔP is calculated by the following equation (3). That is, the parameter update amount ΔP is a product of the update matrix R and the difference image Ie.

式（３）におけるアップデートマトリックスＲは、差分画像Ｉｅに基づきパラメーター更新量ΔＰを算出するために予め学習により設定されたＭ行Ｎ列のマトリックスであり、
ＡＡＭ情報ＡＭＩ（図１）として内部メモリー１２０に格納されている。本実施例では、アップデートマトリックスＲの行数Ｍは、グローバルパラメーターの数（４個）と、形状パラメーターｐ_iの数（ｎ個）との和（（４＋ｎ）個）に等しく、列数Ｎは、平均顔画像Ａ₀（ｘ）の平均形状領域ＢＳＡ内の画素数（５６画素×５６画素−マスク領域ＭＡの画素数）に等しい。アップデートマトリックスＲは、下記の式（４）および（５）により算出される。 The update matrix R in the equation (3) is a matrix of M rows and N columns set in advance by learning in order to calculate the parameter update amount ΔP based on the difference image Ie.
It is stored in the internal memory 120 as AAM information AMI (FIG. 1). In this embodiment, the number of rows M in the update matrix R is equal to the sum ((4 + n)) of the number of global parameters (4) and the number of shape parameters p _i (n), and the number of columns N is , average face image a ₀ average number of pixels in the shape area BSA of (x) - is equal to (56 pixels × 56 pixels the number of pixels in the mask area MA). The update matrix R is calculated by the following formulas (4) and (5).

補正部２２６（図１）は、算出されたパラメーター更新量ΔＰに基づきパラメーター（４個のグローバルパラメーターとｎ個の形状パラメーターｐ_i）を更新する（ステップＳ４６０）。これにより、注目画像ＯＩにおける特徴点ＣＰの設定位置が補正される。補正部２２６は、差分画像Ｉｅのノルムが小さくなるように補正する。パラメーターの更新の後には、再度、特徴点ＣＰの設置位置が補正された注目画像ＯＩからの平均形状画像Ｉ（Ｗ（ｘ；ｐ））の算出（ステップＳ４１０）、差分画像Ｉｅの算出（ステップＳ４２０）、差分画像Ｉｅに基づく収束判定（ステップＳ４３０）が行われる。なお、平均形状画像Ｉ（Ｗ（ｘ；ｐ））は、形状パラメーターｐ_iの更新（変化）に応じて変化するものであり、特許請求の範囲の「顔領域」に相当する。再度の収束判定においても収束していないと判定された場合には、さらに、差分画像Ｉｅに基づくパラメーター更新量ΔＰの算出（ステップＳ４５０）、パラメーターの更新による特徴点ＣＰの設定位置補正（ステップＳ４６０）が行われる。 The correction unit 226 (FIG. 1) updates the parameters (four global parameters and n shape parameters p _i ) based on the calculated parameter update amount ΔP (step S460). Thereby, the setting position of the feature point CP in the target image OI is corrected. The correcting unit 226 corrects so that the norm of the difference image Ie is reduced. After the parameter update, the average shape image I (W (x; p)) is calculated again from the target image OI in which the installation position of the feature point CP is corrected (step S410), and the difference image Ie is calculated (step S420), a convergence determination based on the difference image Ie (step S430) is performed. The average shape image I (W (x; p)) changes according to the update (change) of the shape parameter p _i and corresponds to a “face region” in the claims. If it is determined that the convergence has not occurred in the convergence determination again, the parameter update amount ΔP based on the difference image Ie is calculated (step S450), and the setting position correction of the feature point CP by the parameter update (step S460). ) Is performed.

図１５のステップＳ４１０からＳ４６０までの処理が繰り返し実行されると、注目画像ＯＩにおける各特徴部位に対応する特徴点ＣＰの位置、実際の特徴部位の位置に全体として近づいていき、ある時点で収束判定（ステップＳ４３０）において収束したと判定される。収束判定において収束したと判定されると、顔特徴検出処理が完了する（ステップＳ４７０）。このとき設定されているグローバルパラメーターおよび形状パラメーターｐ_iの値により特定される特徴点ＣＰの設定位置が、最終的な注目画像ＯＩにおける特徴点ＣＰの設定位置として特定される。 When the processing from step S410 to step S460 in FIG. 15 is repeatedly executed, the position of the feature point CP corresponding to each feature part in the target image OI and the position of the actual feature part approach as a whole and converge at a certain time. In the determination (step S430), it is determined that convergence has occurred. If it is determined in the convergence determination that convergence has been achieved, the face feature detection process is completed (step S470). The setting position of the feature point CP specified by the global parameter and the shape parameter p _i set at this time is specified as the setting position of the feature point CP in the final target image OI.

図１６は、特徴点ＣＰ設定位置補正処理の結果の一例を示す説明図である。図１６には、注目画像ＯＩにおいて最終的に特定された特徴点ＣＰの設定位置が示されている。特徴点ＣＰの設定位置により、注目画像ＯＩに含まれる顔の特徴部位（人物の顔の器官（眉毛、目、鼻、口）および顔の輪郭における所定位置）の位置が特定されるため、注目画像ＯＩにおける人物の顔の器官の形状・位置や顔の輪郭形状の検出が可能となる。 FIG. 16 is an explanatory diagram illustrating an example of a result of the feature point CP setting position correction process. FIG. 16 shows the setting positions of the feature points CP finally specified in the target image OI. Since the position of the feature point CP specifies the position of the facial feature part (predetermined position in the facial organs (eyebrows, eyes, nose, mouth) and facial contour) included in the attention image OI. It is possible to detect the shape and position of the human face organ and the face contour shape in the image OI.

特徴点ＣＰ設定位置補正処理が完了すると、テクスチャー検出部２４０（図１）は、注目画像ＯＩにおいて設定される平均形状画像Ｉ（Ｗ（ｘ；ｐ））のテクスチャー（見え）の補正（検出）をおこなう（ステップＳ２６０）。 When the feature point CP setting position correction processing is completed, the texture detection unit 240 (FIG. 1) corrects (detects) the texture (appearance) of the average shape image I (W (x; p)) set in the target image OI. Is performed (step S260).

Ａ３−２．テクスチャー検出処理：
図１７は、テクスチャー検出処理の流れを示すフローチャートである。生成部２４２（図１）は、注目画像ＯＩから平均形状画像Ｉ（Ｗ（ｘ；ｐ））を算出する（ステップＳ５１０）。平均形状画像Ｉ（Ｗ（ｘ；ｐ））の算出方法は、特徴点ＣＰ初期位置設定処理におけるステップＳ３２０、および、特徴点ＣＰ設定位置補正処理におけるステップＳ４１０と同様であるが、特徴点ＣＰ設定位置補正処理完了時のグローバルパラメーターおよび形状パラメーターｐ_iの値を使用して平均形状画像Ｉ（Ｗ（ｘ；ｐ））が算出される。 A3-2. Texture detection processing:
FIG. 17 is a flowchart showing the flow of texture detection processing. The generation unit 242 (FIG. 1) calculates an average shape image I (W (x; p)) from the attention image OI (step S510). The calculation method of the average shape image I (W (x; p)) is the same as that in step S320 in the feature point CP initial position setting process and step S410 in the feature point CP setting position correction process. position correction processing completion time using the values of the global parameters and the shape parameters p _i of the average shape image I (W (x; p) ) is calculated.

さらに、テクスチャー検出部２４０の生成部２４２は、平均形状画像Ｉ（Ｗ（ｘ；ｐ））に基づく平均テクスチャー画像Ｊ（Ｔ（ｘ；λ））を算出する（ステップＳ５１５）。平均テクスチャー画像Ｊ（Ｔ（ｘ；λ））は、以下の（６）式によって算出される。

前記式（６）は、前記式（２ｂ）における顔のテクスチャーＡ（ｘ）にＷ（ｘ；ｐ）を代入したものであり、テクスチャーパラメーターλ_kが適切に設定されれば、Ｔ（ｘ；λ）が平均顔画像Ａ₀（ｘ）と等しくなる。 Further, the generation unit 242 of the texture detection unit 240 calculates an average texture image J (T (x; λ)) based on the average shape image I (W (x; p)) (step S515). The average texture image J (T (x; λ)) is calculated by the following equation (6).

The equation (6) is obtained by substituting W (x; p) into the facial texture A (x) in the equation (2b). If the texture parameter λ _k is appropriately set, T (x; λ) is equal to the average face image A ₀ (x).

そして、算出部２４４は、平均テクスチャー画像Ｊ（Ｔ（ｘ；λ））と平均顔画像Ａ₀（ｘ）との差分画像Ｉｅを算出する（ステップＳ５２０）。テクスチャー検出部２４０は、差分画像Ｉｅに基づき、テクスチャー検出処理が収束したか否かを判定する（ステップＳ５３０）。テクスチャー検出部２４０は、差分画像Ｉｅのノルムを算出し、ノルムの値が予め設定された閾値より小さい場合には収束したと判定し、ノルムの値が閾値以上の場合には未だ収束していないと判定する。なお、テクスチャー検出部２４０は、算出された差分画像Ｉｅのノルムの値が前回のステップＳ５３０において算出された値よりも小さい場合には収束したと判定し、前回値以上である場合には未だ収束していないと判定するものとしてもよい。ここにおける収束の判定は、特徴点ＣＰ設定位置補正処理におけるステップＳ４３０と同様の手法によりおこなうことができる。 Then, the calculation unit 244 calculates a difference image Ie between the average texture image J (T (x; λ)) and the average face image A ₀ (x) (step S520). The texture detection unit 240 determines whether the texture detection process has converged based on the difference image Ie (step S530). The texture detection unit 240 calculates the norm of the difference image Ie, determines that it has converged if the norm value is smaller than a preset threshold value, and has not yet converged if the norm value is greater than or equal to the threshold value. Is determined. Note that the texture detection unit 240 determines that the calculated norm value of the difference image Ie has converged if it is smaller than the value calculated in the previous step S530, and still converges if it is greater than or equal to the previous value. It is good also as what determines with not doing. The convergence determination can be performed by the same method as in step S430 in the feature point CP setting position correction process.

前記のステップＳ５３０の収束判定において未だ収束していないと判定された場合には、補正部２４６（図１）がパラメーター更新量Δλを算出する（ステップＳ５５０）。パラメーター更新量ΔΛは、特徴量であるｍ個のテクスチャーパラメーターλ_k（ｋ＝１〜ｍの整数。）の値の変更量を意味している。パラメーター更新量ΔΛは、下記の式（７）により算出される。すなわち、パラメーター更新量ΔΛは、アップデートマトリックスＵと差分画像Ｉｅとの積である。 If it is determined in step S530 that the convergence has not yet been completed, the correction unit 246 (FIG. 1) calculates the parameter update amount Δλ (step S550). The parameter update amount ΔΛ means a change amount of the value of m texture parameters λ _k (k = _{1 to} m), which is a feature amount. The parameter update amount ΔΛ is calculated by the following equation (7). That is, the parameter update amount ΔΛ is a product of the update matrix U and the difference image Ie.

式（７）におけるアップデートマトリックスＵは、差分画像Ｉｅに基づきパラメーター更新量ΔΛを算出するために予め学習により設定されたＭ行Ｎ列のマトリックスであり、
ＡＡＭ情報ＡＭＩ（図１）として内部メモリー１２０に格納されている。本実施例では、アップデートマトリックスＵの行数Ｍは、テクスチャーパラメーターλ_k（ｍ個）に等しく、列数Ｎは、平均形状領域ＢＳＡ内の画素数（５６画素×５６画素−マスク領域ＭＡの画素数）に等しい。アップデートマトリックスＵは、下記の式（８），（９）により算出される。

The update matrix U in Equation (7) is a matrix of M rows and N columns set in advance by learning in order to calculate the parameter update amount ΔΛ based on the difference image Ie.
It is stored in the internal memory 120 as AAM information AMI (FIG. 1). In this embodiment, the number M of rows in the update matrix U is equal to the texture parameter λ _k (m), and the number N of columns is the number of pixels in the average shape area BSA (56 pixels × 56 pixels−pixels in the mask area MA). Number). The update matrix U is calculated by the following formulas (8) and (9).

補正部２４６（図１）は、算出されたパラメーター更新量ΔΛに基づきテクスチャーパラメーターλ_kを更新する（ステップＳ５６０）。これにより、顔のテクスチャーＡ（ｘ）が補正される。すなわち、補正部２４６は、差分画像Ｉｅのノルムが小さくなるように補正する。パラメーターの更新の後には、再度、補正されたテクスチャーパラメーターλ_kに基づいて平均テクスチャー画像Ｊ（Ｔ（ｘ；λ））の算出（ステップＳ５１５）、差分画像Ｉｅの算出（ステップＳ５２０）、差分画像Ｉｅに基づく収束判定（ステップＳ５３０）が行われる。再度の収束判定においても収束していないと判定された場合には、さらに、差分画像Ｉｅに基づくパラメーター更新量ΔΛの算出（ステップＳ５５０）、パラメーターの更新によるテクスチャーの補正（ステップＳ５６０）が行われる。 The correction unit 246 (FIG. 1) updates the texture parameter λ _k based on the calculated parameter update amount ΔΛ (step S560). Thereby, the facial texture A (x) is corrected. That is, the correction unit 246 corrects so that the norm of the difference image Ie is reduced. After the parameter update, the average texture image J (T (x; λ)) is calculated again based on the corrected texture parameter λ _k (step S515), the difference image Ie is calculated (step S520), and the difference image. A convergence determination (step S530) based on Ie is performed. If it is determined that the convergence has not been made in the convergence determination again, the parameter update amount ΔΛ based on the difference image Ie is calculated (step S550), and the texture is corrected by updating the parameter (step S560). .

図１７のステップＳ５１０からＳ５６０までの処理が繰り返し実行されると、顔のテクスチャーＡ（ｘ）の各画素の輝度値（色彩値）が、実際の顔画像の輝度値に全体として近づいていき、ある時点で収束判定（ステップＳ５３０）において収束したと判定される。収束判定において収束したと判定されると、テクスチャー検出処理が完了する（ステップＳ５７０）。このとき設定されているテクスチャーパラメーターλ_kの値により特定される顔のテクスチャーＡ（ｘ）が最終的な顔の見えとして特定される。顔のテクスチャーＡ（ｘ）に基づいて、顔色補正等をおこなうことができる。以上のようにして、注目画像ＯＩにおける顔領域ＦＡについての形状パラメーターｐ_iとテクスチャーパラメーターλ_kが特定できると、次に表情判定処理を実行する。 When the processing from step S510 to S560 in FIG. 17 is repeatedly executed, the luminance value (color value) of each pixel of the facial texture A (x) approaches the luminance value of the actual face image as a whole, At a certain point in time, it is determined that convergence has been made in the convergence determination (step S530). If it is determined in the convergence determination that the image has converged, the texture detection process is completed (step S570). The facial texture A (x) specified by the value of the texture parameter λ _k set at this time is specified as the final facial appearance. Based on the facial texture A (x), facial color correction and the like can be performed. When the shape parameter p _i and the texture parameter λ _k for the face area FA in the target image OI can be specified as described above, the facial expression determination process is executed next.

Ａ４．表情判定処理：
図１８は、表情判定処理の流れを示すフローチャートである。表情判定部２５０は、判定対象の表情を選択する（ステップＳ６１０）。本実施例では、４種類の表情（喜顔、悲顔、驚顔、怒顔）を判定対象とし、喜顔→悲顔→驚顔→怒顔の順で判定対象を選択することとする。選択部２５２は、ｎ個の形状パラメーターｐ_iとｍ個のテクスチャーパラメーターλ_kのなかから、選択した表情の表情らしさを示す判定値Ｅを算出するために使用するｈ個の表情パラメーターｑ_l（ｌ＝１〜ｈの整数。）を選択する（ステップＳ６２０）。判定値算出部２５４は、表情パラメーターｑ_lを下記式（１０）に代入することにより、判定値Ｅを算出する（ステップＳ６３０）。

A4. Facial expression determination processing:
FIG. 18 is a flowchart showing the flow of facial expression determination processing. The facial expression determination unit 250 selects a facial expression to be determined (step S610). In this embodiment, four types of facial expressions (a happy face, a sad face, a surprised face, and an angry face) are set as determination targets, and a determination target is selected in the order of a happy face → a sad face → a surprise face → an angry face. The selection unit 252 selects the h facial expression parameters q _l (used to calculate a judgment value E indicating the facial expression of the selected facial expression from the n shape parameters p _i and the m texture parameters λ _k. 1 is an integer from 1 to h.) (Step S620). Determination value calculation unit 254 calculates determination value E by substituting facial expression parameter q _l into the following equation (10) (step S630).

前記式（１０）において、α_lは各表情パラメーターｑ_lについて設定された重み係数を示し、ｋｎは正規化のための係数を示している。すなわち、判定値Ｅは、表情パラメーターｑ_lを重み係数α_lに基づく重みによって線形結合した値である。判定値Ｅが算出されると、表情判定部２５０はすべての表情の選択を完了したか否かを判定し（ステップＳ６４０）、完了していない場合にはステップＳ６１０に戻り次の表情を選択する。これにより、喜顔の判定値Ｅと、悲顔の判定値Ｅと、驚顔の判定値Ｅと、怒顔の判定値Ｅを順に算出していくことができる。なお、各表情にごとに、ｎ個の形状パラメーターｐ_iとｍ個のテクスチャーパラメーターλ_kのなかから選択されるｈ個の表情パラメーターｑ_lは異なっており、表情パラメーターｑ_lに対応する重み係数α_lも各表情に対応して用意されている。例えば、口角が上がる傾向にある喜顔と、口角が下がる傾向にある悲顔とでは、口角に関する表情パラメーターｑ_lに対する重み係数α_lの符号が反転することとなる。従って、各表情ごとに異なる計算式に基づいて判定値Ｅが算出されることとなる。このように、形状パラメーターｐ_iとテクスチャーパラメーターλ_kのうち各表情と相関の高いｈ個の表情パラメーターｑ_lを使用して判定値Ｅを算出するため、形状的および色彩（輝度）的な観点に基づいて表情を精度よく判定することができる。 In the equation (10), α _l represents a weighting coefficient set for each facial expression parameter q _l , and kn represents a coefficient for normalization. That is, the determination value E is a value obtained by linearly combining the facial expression parameter q _l with a weight based on the weight coefficient α _l . When the determination value E is calculated, the facial expression determination unit 250 determines whether or not selection of all facial expressions has been completed (step S640). If not, the process returns to step S610 to select the next facial expression. . As a result, the determination value E for the happy face, the determination value E for the sad face, the determination value E for the surprised face, and the determination value E for the angry face can be calculated in order. For each facial expression, the h facial expression parameters q _l selected from the n shape parameters p _i and the m texture parameters λ _k are different, and the weighting coefficient corresponding to the facial expression parameter q _l α _{l is} also prepared for each facial expression. For example, the sign of the weight coefficient α _l for the facial expression parameter q _l related to the mouth angle is inverted between a happy face that tends to increase the mouth angle and a sad face that tends to decrease the mouth angle. Therefore, the determination value E is calculated based on a different calculation formula for each facial expression. As described above, since the judgment value E is calculated using the h facial expression parameters q _l having a high correlation with each facial expression among the geometric parameters _pi and the texture parameters λ _k , the geometrical and color (luminance) viewpoints are obtained. The facial expression can be accurately determined based on the above.

各表情に対応して選択される表情パラメーターｑ_lは、形状パラメーターｐ_iとテクスチャーパラメーターλ_kのうち各表情と相関の高いベクトル要素であり、後述する判定値設定処理によって設定され、予め表情判定情報ＥＭＩ（図１）として内部メモリー１２０に格納されている。各表情について判定値Ｅが算出できると、各表情について判定値Ｅを総合して判定部２５６が表情の判定を行う。具体的には、判定部２５６が最も判定値Ｅの大きい表情を検出する（ステップＳ６５０）。次に、判定部２５６は、最も大きい判定値Ｅが所定の閾値よりも大きいか否かを判定し（ステップＳ６６０）、閾値よりも大きければ当該判定値Ｅの表情を有していると判定する（ステップＳ６７０）。一方、最も大きい判定値Ｅ以下であれば、いずれの表情も有していないと判定する（ステップＳ６８０）。本実施例では、注目画像ＯＩに各表情を識別するタグ情報を添付した後、注目画像ＯＩに喜顔が含まれる場合には、印刷機構１６０に注目画像ＯＩを出力し、印刷を実行させる（ステップＳ６９０）。 The facial expression parameter q _l selected corresponding to each facial expression is a vector element having a high correlation with each facial expression out of the shape parameter p _i and the texture parameter λ _k and is set by a judgment value setting process to be described later. It is stored in the internal memory 120 as information EMI (FIG. 1). When the determination value E can be calculated for each facial expression, the determination unit 256 determines the facial expression by combining the determination values E for each facial expression. Specifically, the determination unit 256 detects a facial expression with the largest determination value E (step S650). Next, the determination unit 256 determines whether or not the largest determination value E is greater than a predetermined threshold (step S660), and determines that the expression has the expression of the determination value E if greater than the threshold. (Step S670). On the other hand, if it is equal to or less than the largest determination value E, it is determined that no facial expression is present (step S680). In this embodiment, after tag information for identifying each expression is attached to the attention image OI, if the attention image OI includes a happy face, the attention image OI is output to the printing mechanism 160 and printing is performed ( Step S690).

図１９（ａ）〜（ｄ）は、各表情の判定値Ｅを比較するグラフである。図１９（ａ）のようにいずれの表情の判定値Ｅも閾値を超えていない場合には、いずれの表情も有していないと判定する（ステップＳ６８０）。図１９（ｂ）のように、いずれか１つの表情の判定値Ｅが閾値を超えている場合には、判定値Ｅが閾値を超えている表情であると判定する。図１９（ｃ），（ｄ）のように、いずれか２以上の表情の判定値Ｅが閾値を超えている場合には、判定値Ｅが閾値を超えている表情のうち判定値Ｅが最大の表情であると判定する。このように、各表情ごとに判定値Ｅを算出することにより、各表情であるか否かを精度よく判定することができる。また、もっとも大きい判定値Ｅに対応する表情を有していると判定するため、もっともそれらしい表情を特定することができる。 FIGS. 19A to 19D are graphs for comparing the determination values E of the facial expressions. If the determination value E for any facial expression does not exceed the threshold value as shown in FIG. 19A, it is determined that no facial expression is present (step S680). As shown in FIG. 19B, when the determination value E of any one facial expression exceeds the threshold value, it is determined that the facial expression value exceeds the threshold value. As shown in FIGS. 19C and 19D, when the determination value E of any two or more facial expressions exceeds the threshold value, the determination value E is the maximum among the facial expressions whose determination value E exceeds the threshold value. It is determined that it is a facial expression. Thus, by calculating the determination value E for each facial expression, it is possible to accurately determine whether each facial expression is present. Further, since it is determined that the expression corresponding to the largest determination value E is present, the most appropriate expression can be specified.

Ａ５．判定値設定処理：
図２０は、判定値設定処理の流れを示すフローチャートである。判定値設定処理もＡＡＭ設定処理と同様にプリンター１００の製造に先立って、コンピューターによって実行される処理である。まず、喜顔であるか否か、悲顔であるか否か、驚顔であるか否か、怒顔であるか否かが既知のサンプル画像ＳＩを用意する（ステップＳ７１０）。例えば、各サンプル画像ＳＩがどの表情を有するかをオペレーターが判断し、その結果を各サンプル画像ＳＩに対応付け、コンピューターに入力させておく。次に、コンピューターは、各サンプル画像ＳＩを注目画像ＯＩとして、上述した特徴点ＣＰ初期位置設定処理と特徴点ＣＰ設定位置補正処理とテクスチャー検出処理と同様の処理を順次実行することにより、各サンプル画像ＳＩについて形状パラメーターｐ_iとテクスチャーパラメーターλ_kを算出する（ステップＳ７２０）。形状パラメーターｐ_iとテクスチャーパラメーターλ_kとが得られたら、コンピューターは、表情パラメーターｑ_lの設定対象とする表情を選択する（ステップＳ７３０）。ここでも、喜顔→悲顔→驚顔→怒顔の順で判定対象を選択することとする。表情が選択できたら、コンピューターは、形状パラメーターｐ_iとテクスチャーパラメーターλ_kのそれぞれの要素について、選択した表情であるか否かとの相関係数を算出する（ステップＳ７４０）。 A5. Judgment value setting processing:
FIG. 20 is a flowchart showing the flow of the determination value setting process. Similar to the AAM setting process, the determination value setting process is a process executed by the computer prior to manufacturing the printer 100. First, a known sample image SI is prepared for whether it is a happy face, a sad face, a surprised face, or an angry face (step S710). For example, the operator determines which facial expression each sample image SI has, and the result is associated with each sample image SI and input to the computer. Next, the computer uses each sample image SI as the target image OI, and sequentially executes the same processing as the above-described feature point CP initial position setting processing, feature point CP setting position correction processing, and texture detection processing, thereby each sample image SI. the image SI and calculates a shape parameter p _i and texture parameter lambda _k (step S720). When the shape parameter p _i and the texture parameter λ _k are obtained, the computer selects a facial expression for which the facial expression parameter q _l is to be set (step S730). Again, the determination target is selected in the order of happy face → sad face → surprise face → angry face. If the facial expression can be selected, the computer calculates a correlation coefficient between each element of the shape parameter p _i and the texture parameter λ _k with respect to whether or not it is the selected facial expression (step S740).

図２１は、喜顔であるか否かの相関を説明するグラフである。同図の横軸と縦軸は、形状パラメーターｐ_iの例として、形状パラメーターｐ₈，ｐ₉の値（０〜１００％）を示している。グラフにおいて、各サンプル画像ＳＩの形状パラメーターｐ₈，ｐ₉をプロットしており、喜顔であるものは○で示し、喜顔でないものは●で示している。図に示すように、形状パラメーターｐ₈の値が小さくなるほど、喜顔であるものの多く分布している。従って、喜顔であるか否かは形状パラメーターｐ₈に対して負の相関を有することとなる。一方、形状パラメーターｐ₉に関しては、喜顔であるものと、喜顔でないものがほぼ均等に分布しており、相関を有していない。なお、一例として、形状パラメーターｐ₈，ｐ₉を挙げているに過ぎず、他のすべての形状パラメーターｐ_iとテクスチャーパラメーターλ_kについても同様に表情との相関性の調査を行う。 FIG. 21 is a graph for explaining the correlation of whether or not the face is a happy face. The horizontal axis and the vertical axis in the figure indicate the values (0 to 100%) of the shape parameters p ₈ and p ₉ as examples of the shape parameter p _i . In the graph, which plots the shape parameter p _8, p ₉ of each sample image SI, what is Hee face indicated by ○, those that are not Hee face is indicated by ●. As shown in the figure, the smaller the value of the shape parameter p ₈ , the greater the distribution of the happy faces. Therefore, whether or not it is a happy face has a negative correlation with the shape parameter p ₈ . On the other hand, with respect to the shape parameters p _9, as is Hee face, has been substantially uniformly distributed not comply with Hee face, does not have a correlation. As an example, only the shape parameters p ₈ and p ₉ are listed, and all other shape parameters p _i and texture parameters λ _k are similarly investigated for correlation with facial expressions.

ここでは、コンピューターは、公知の計算式により、相関係数を算出し、その絶対値が所定の閾値（例えば、０．５等。）を超えている形状パラメーターｐ_iとテクスチャーパラメーターλ_kを表情パラメーターｑ_lとする（ステップＳ７５０）。従って、表情パラメーターｑ_lの要素の個数（ｈ個）は、各表情によって同じとなるとは限らない。むろん、表情パラメーターｑ_lの要素の個数（ｈ個）を予め設定しておき、相関係数の絶対値が上位ｈ個の形状パラメーターｐ_iとテクスチャーパラメーターλ_kを表情パラメーターｑ_lとして設定するようにしてもよい。さらに、表情パラメーターｑ_lの要素の個数（ｈ個）のうち、形状パラメーターｐ_iとテクスチャーパラメーターλ_kの要素の個数の比率が規定されてもよい。例えば、形状パラメーターｐ_iとテクスチャーパラメーターλ_kの要素の個数が１：１となるように制限を加えれば、形状的および色彩（輝度）的な観点をバランスよく評価することができる。ここでは、相関係数が高い形状パラメーターｐ_iとテクスチャーパラメーターλ_kが表情パラメーターｑ_lとして設定されるが、例えば喜顔の場合には目尻の角度や口角に関する形状パラメーターｐ_iや笑いじわに関するテクスチャーパラメーターλ_kが表情パラメーターｑ_lとして設定されことが予測される。 Here, the computer calculates the correlation coefficient by a known calculation formula, and expresses the shape parameter p _i and the texture parameter λ _k whose absolute value exceeds a predetermined threshold (for example, 0.5, etc.) The parameter is q _l (step S750). Therefore, the number (h) of elements of the facial expression parameter q _l is not necessarily the same for each facial expression. Of course, the number (h) of elements of the facial expression parameter q _l is set in advance, and the shape parameter p _i and the texture parameter λ _k having the highest absolute value of the correlation coefficient are set as the facial expression parameter q _l. It may be. Furthermore, the ratio of the number of elements of the shape parameter p _i and the texture parameter λ _k among the number (h) of elements of the facial expression parameter q _l may be defined. For example, the number of elements of the shape parameter p _i and texture parameter lambda _k is 1: be added to 1 and so as to limit, it is possible to evaluate a balanced shape and color (luminance) point of view. Here, the shape parameter p _i and the texture parameter λ _k having a high correlation coefficient are set as the expression parameter q _l . For example, in the case of a happy face, the shape parameter p _i related to the angle of the corner of the eye or the mouth angle is related to laughter. It is predicted that the texture parameter λ _k is set as the facial expression parameter q _l .

次に、コンピューターは、算出された相関係数に比例する値を重み係数α_lとして設定する（ステップＳ７６０）。従って、正の相関を有する表情パラメーターｑ_lについては正の重み係数α_lが設定され、負の相関を有する表情パラメーターｑ_lについては負の重み係数α_lが設定されることとなる。選択した表情についての重み係数α_lが設定できると、コンピューターは、すべての表情の選択が完了したか否かを判定し（ステップＳ７７０）、完了していない場合にはステップＳ７３０に戻る。以上のようにして、すべての表情について、表情パラメーターｑ_lと重み係数α_lが設定できると、コンピューターは、前記式（１０）によって算出される判定値Ｅの値の最大値が各表情間で一致するように、各表情についての係数ｋｎを設定する（ステップＳ７８０）。以上により前記式（１０）で使用される情報はすべて設定されたこととなり、これらの情報を表情判定情報ＥＭＩとしてプリンター１００の製造時において内部メモリー１２０に格納する（ステップＳ７９０）。 Next, the computer sets a value proportional to the calculated correlation coefficient as the weighting coefficient α ₁ (step S760). Accordingly, the facial expression parameters q _l having a positive correlation is set to a positive weight coefficient alpha _l, negative weight factor alpha _l is to be set for the expression parameter q _l having a negative correlation. If the weighting factor α _l for the selected facial expression can be set, the computer determines whether or not selection of all facial expressions has been completed (step S770), and returns to step S730 if not complete. As described above, when the facial expression parameter q _l and the weighting coefficient α _l can be set for all facial expressions, the computer determines that the maximum value of the judgment value E calculated by the equation (10) is between each facial expression. A coefficient kn for each facial expression is set so as to match (step S780). Thus, all the information used in the equation (10) has been set, and these pieces of information are stored in the internal memory 120 as the facial expression determination information EMI when the printer 100 is manufactured (step S790).

Ｂ．変形例：
なお、この発明は前記の実施例や実施形態に限られるものではなく、その要旨を逸脱しない範囲において種々の態様において実施することが可能であり、例えば次のような変形も可能である。 B. Variation:
The present invention is not limited to the above-described examples and embodiments, and can be implemented in various modes without departing from the gist thereof. For example, the following modifications are possible.

Ｂ１．変形例１：
前記実施例では、サンプル画像ＳＩを、個性・人種・性別、表情（喜顔、悲顔、驚顔、怒顔）、向きといった種々の属性に関して互いに相違する顔画像が含まれるように用意したが、表情（喜顔、悲顔、驚顔、怒顔）の属性が相違するサンプル画像ＳＩの構成比率を高めるようにしてもよい。表情の属性が相違するサンプル画像ＳＩの構成比率を高めることにより、喜顔、悲顔、驚顔、怒顔の属性の寄与率が、他の個性・人種・性別・向きといった他の属性に対して相対的に高くなるため、主成分分析において喜顔、悲顔、驚顔、怒顔である程度を示す主成分が表れることとなる。すなわち、形状パラメーターｐ_iまたはテクスチャーパラメーターλ_kのいずれかの要素が、そのまま喜顔であるか否か、悲顔であるか否か、驚顔であるか否か、怒顔であるか否かの程度を示すようにすることができる。 B1. Modification 1:
In the above-described embodiment, the sample images SI are prepared so as to include different face images with respect to various attributes such as personality / race / gender, facial expression (happy face, sad face, surprised face, angry face), and orientation. However, the composition ratio of the sample images SI having different attributes of expression (happy face, sad face, surprise face, angry face) may be increased. By increasing the composition ratio of the sample image SI with different facial expression attributes, the contribution ratio of the happy face, sad face, surprise face, and angry face attributes to other attributes such as personality, race, gender, and orientation On the other hand, since it becomes relatively high, in the principal component analysis, a principal component showing a certain degree appears as a happy face, a sad face, a surprised face, and an angry face. That is, any element of shape parameter p _i or texture parameter lambda _k is whether it Hee face, whether悲顔, whether Odorokikao, whether Okokao The degree can be shown.

前記実施例と同様に判定値設定処理を行った場合には、表情の属性が相違するサンプル画像ＳＩの構成比率を高めるほど、形状パラメーターｐ_iまたはテクスチャーパラメーターλ_kのいずれかの要素と、各表情との相関係数が高くなる。形状パラメーターｐ_iまたはテクスチャーパラメーターλ_kのいずれかの要素と、各表情との相関係数が１または−１に極めて近いとき（例えば、絶対値が０．９以上となるとき等。）には、当該要素の値を閾値と比較することにより、各表情の判定をおこなうことができる。例えば、第１主成分に対応する形状パラメーターｐ₁と閾値を比較することにより喜顔であるか否かを判定し、一方、第２主成分に対応する形状パラメーターｐ₂と閾値を比較することにより悲顔であるか否かを判定することができる。なお、種々の顔について特徴点の検出をする必要があるため、ある程度は個性・人種・性別・向きの属性が相違するサンプル画像ＳＩも含めておくことが望ましい。また、特徴点の検出の検出を別途おこなうのであれば、表情（喜顔、悲顔、驚顔、怒顔）の属性が相違するサンプル画像ＳＩのみを使用してＡＡＭ情報ＡＭＩを設定することも可能である。 When the determination value setting process is performed in the same manner as in the embodiment, as the component ratio of the sample image SI having different facial expression attributes is increased, either the shape parameter _pi or the texture parameter λ _k Correlation coefficient with facial expression increases. When the correlation coefficient between any element of the shape parameter p _i or the texture parameter λ _k and each facial expression is very close to 1 or −1 (for example, when the absolute value is 0.9 or more). Each facial expression can be determined by comparing the value of the element with a threshold value. For example, by comparing the shape parameter p ₁ corresponding to the first principal component with a threshold value, it is determined whether or not the face is a happy face, while comparing the shape parameter p ₂ corresponding to the second principal component with the threshold value. Thus, it can be determined whether or not it is a sad face. In addition, since it is necessary to detect feature points for various faces, it is desirable to include sample images SI having different personality, race, gender, and orientation attributes to some extent. In addition, if the detection of feature points is performed separately, the AAM information AMI may be set using only sample images SI having different facial expression attributes (happy face, sad face, surprise face, angry face). Is possible.

Ｂ２．変形例２：
前記実施例では、単一の静止画像を注目画像ＯＩとするものを例示したが、時系列的に連続して画像入力された動画やデジタルスチルカメラの連写機能によって撮影した一連の静止画像を、順次、注目画像ＯＩとするようにしてもよい。例えば、動画の各フレームのなかから各表情が顕著なフレームを検出し、該フレームを印刷機構１６０において印刷するようにしてもよい。各フレームについて、前記実施例と同様の顔特徴検出処理と表情判定処理を実行するようにしてもよいが、本変形例では以下に説明する表情判定処理を実行する。 B2. Modification 2:
In the above embodiment, a single still image is set as the target image OI. However, a series of still images taken by a continuous shooting function of a moving image or a digital still camera in which images are continuously input in time series are illustrated. The target image OI may be sequentially set. For example, a frame with a significant expression can be detected from each frame of the moving image and the frame can be printed by the printing mechanism 160. For each frame, the same facial feature detection processing and facial expression determination processing as in the above embodiment may be executed, but in this modification, facial expression determination processing described below is executed.

図２２は、本変形例にかかる表情判定処理の流れを示すフローチャートである。表情判定部２５０は、判定対象の表情を選択するにあたり、直前のフレームに存在する顔がいずれかの表情であると判定されたか否かを判定する（ステップＳ６０２）。なお、各フレームには単一の顔のみが存在するものとして説明する。直前のフレームに存在する顔がいずれの表情であるとも判定されていなかった場合には、前記実施例と同様に、喜顔→悲顔→驚顔→怒顔の順で選択し、判定値Ｅが最も大きく、かつ、閾値よりも大きい表情を判定する（前記実施例と同様のステップＳ６１０〜Ｓ６９０）。一方、直前のフレームに存在する顔がいずれかの表情であると判定されていた場合には、該表情を選択する（ステップＳ６０４）。 FIG. 22 is a flowchart showing the flow of facial expression determination processing according to the present modification. When selecting the facial expression to be determined, the facial expression determination unit 250 determines whether or not the face existing in the immediately preceding frame has been determined to be any facial expression (step S602). In the following description, it is assumed that each frame has only a single face. If it is not determined that the face existing in the immediately preceding frame is any facial expression, in the same manner as in the above embodiment, selection is made in the order of happy face → sad face → surprise face → angry face, and determination value E The facial expression having the largest value and larger than the threshold value is determined (steps S610 to S690 similar to the above embodiment). On the other hand, when it is determined that the face existing in the immediately preceding frame is any facial expression, the facial expression is selected (step S604).

そして、該表情について、ｈ個の表情パラメーターｑ_lを選択し（ステップＳ６２２）。表情パラメーターｑ_lを上記式（１０）に代入することにより、判定値Ｅを算出する（ステップＳ６３２）。該判定値Ｅが閾値（前記実施例と同じもの）より大きい場合には（ステップＳ６６２）、ステップＳ６０４にて選択した表情であると判定する（ステップＳ６７２）。一方、該判定値Ｅが閾値以下の場合には、ステップＳ６１０を実行し、前記実施例と同様のステップＳ６１０〜Ｓ６８０を順次実行する。ただし、直前のフレームの表情についてはすでに判定値Ｅの判定が完了しているため、ステップＳ６１０では直前のフレームの表情は選択しないこととする。 Then, h facial expression parameters q _l are selected for the facial expression (step S622). The determination value E is calculated by substituting the facial expression parameter q _l into the above equation (10) (step S632). If the determination value E is larger than the threshold value (the same as in the above embodiment) (step S662), it is determined that the facial expression is selected in step S604 (step S672). On the other hand, if the determination value E is less than or equal to the threshold value, step S610 is executed, and steps S610 to S680 similar to those in the above embodiment are sequentially executed. However, since the determination of the determination value E has already been completed for the expression in the immediately preceding frame, the expression in the immediately preceding frame is not selected in step S610.

極めて短時間の間に画像入力された動画の各フレームでは、直前のフレームと同じ表情の顔が存在する可能性が高いと考えることができる。本変形例のフローとすることにより、同じ表情が継続している限り、その表情についての判定値Ｅの算出と、閾値の判定のみが行われることとなる。直前の表情であると判定されなくなった時点で、すべての表情についての判定値Ｅの算出が行われるため、効率的な表情判定を動画に対しておこなうことができる。むろん、連写機能によって極めて短時間の間に撮影された静止画像であっても同様の効果を得ることができる。さらに、単一の注目画像ＯＩにおいて複数の顔が含まれる場合には、それぞれが同一の表情を有していると推定することができる。従って、先に表情判定をおこなった顔について判定された表情に関する判定を、次の顔に対して最初に行うようにしてもよい。 It can be considered that each frame of a moving image input in an extremely short time is highly likely to have a face with the same expression as the previous frame. By using the flow of this modification, as long as the same facial expression continues, only the calculation of the judgment value E and the threshold value are determined for that facial expression. Since the determination value E is calculated for all facial expressions when it is no longer determined that the previous facial expression is determined, efficient facial expression determination can be performed on the moving image. Of course, the same effect can be obtained even with a still image taken in a very short time by the continuous shooting function. Furthermore, when a plurality of faces are included in a single target image OI, it can be estimated that each has the same expression. Therefore, the determination regarding the facial expression determined for the face for which the facial expression determination has been performed first may be performed first for the next face.

Ｂ３．変形例３：
前記実施例においては、形状パラメーターｐ_iとテクスチャーパラメーターλ_kのいずれの要素も各表情との相関が高ければ判定値Ｅを算出に使用するようにしたが、例えば形状パラメーターｐ_iのみを使用して判定値Ｅを算出するようにしてもよい。さらに、状況に応じて、形状パラメーターｐ_iのみを使用して判定値Ｅを算出するのか、形状パラメーターｐ_iとテクスチャーパラメーターλ_kの双方を使用して判定値Ｅを算出するのか切り換えるようにしてもよい。 B3. Modification 3:
In the above embodiment, the determination value E is used for calculation if any element of the shape parameter p _i and the texture parameter λ _k has a high correlation with each facial expression. For example, only the shape parameter p _i is used. Thus, the determination value E may be calculated. Furthermore, depending on circumstances, whether to calculate the judgment value E using only the shape parameters p _i, and to switch whether to calculate a decision value E using both shape parameters p _i and texture parameter lambda _k Also good.

図２３は、本変形例にかかる表情判定処理の流れを示すフローチャートである。本変形例においては、肌色補正を実行する肌色補正モードがユーザーによって指定可能（不図示）である。肌色補正モードにおいて注目画像ＯＩに存在する顔の肌色画素の色彩値を予め指定された肌色の記憶色の色彩値に補正する処理が印刷の際に実行される。前記実施例と同様に判定対象の表情を選択すると（ステップＳ６１０）、肌色補正モードが選択されているか否かを判定する（ステップＳ６１５）。肌色補正モードが選択されていない場合には、前記実施例と同様にステップＳ６２０〜Ｓ６８０を順次実行する。すなわち、形状パラメーターｐ_iとテクスチャーパラメーターλ_kの双方を使用して判定値Ｅを算出し、各表情の判定を行っていく。 FIG. 23 is a flowchart showing the flow of facial expression determination processing according to this modification. In the present modification, a skin color correction mode for performing skin color correction can be specified by a user (not shown). In the skin color correction mode, processing for correcting the color value of the skin color pixel of the face existing in the target image OI to the color value of the memory color of the skin color specified in advance is executed at the time of printing. When a facial expression to be determined is selected as in the above embodiment (step S610), it is determined whether the skin color correction mode is selected (step S615). If the skin color correction mode is not selected, steps S620 to S680 are sequentially executed as in the above embodiment. That is, the determination value E is calculated using both the shape parameter p _i and the texture parameter λ _k , and each facial expression is determined.

一方、肌色補正モードが選択されている場合には、形状パラメーターｐ_iのみを選択する（ステップＳ６２５）。そして、前記実施例と同様にステップＳ６３０〜Ｓ６８０を順次実行する。すなわち、形状パラメーターｐ_iのみを使用して判定値Ｅを算出し、各表情の判定を行っていく。形状パラメーターｐ_iのみを使用して判定値Ｅすることができるように、判定値設定処理においては形状パラメーターｐ_iのみから表情パラメーターｑ_lを選択するようにしておけばよい。相関係数の絶対値が所定の閾値よりも大きい形状パラメーターｐ_iのみを選択する場合には、形状パラメーターｐ_iとテクスチャーパラメーターλ_kの双方から表情パラメーターｑ_lを選択する場合よりも表情パラメーターｑ_lの要素数は減少することとなる。そして、前記実施例と同様に形状パラメーターｐ_iとテクスチャーパラメーターλ_kの双方を使用して前記式（１０）の計算をおこなうために必要な情報と、形状パラメーターｐ_iのみを使用して前記式（１０）の計算をおこなうために必要な情報とを表情判定情報ＥＭＩとして内部メモリー１２０に格納しておけばよい。 On the other hand, when the skin color correction mode is selected, only the shape parameter p _i is selected (step S625). Then, Steps S630 to S680 are sequentially executed as in the above embodiment. That is, the determination value E is calculated using only the shape parameter p _i , and each facial expression is determined. As it can be determined value E using only the shape parameter p _i, in the determination value setting process it is sufficient to choose a facial expression parameters q _l only shape parameter p _i. If the absolute value of the correlation coefficient is to select only large shape parameter p _i than a predetermined threshold value, the facial expression parameters q than when selecting both from expression parameter q _l shape parameters p _i and texture parameter lambda _k The number of elements of _l will decrease. As in the above embodiment, the information necessary for calculating the equation (10) using both the shape parameter p _i and the texture parameter λ _{k and} the equation using only the shape parameter p _i are used. Information necessary for performing the calculation of (10) may be stored in the internal memory 120 as facial expression determination information EMI.

本変形例で肌色補正モードが指定されている場合には、テクスチャーパラメーターλ_kを使用することなく表情の判定が行われることとなる。肌色補正モードにおいては、補正前の顔の画素の色彩値が理想的な肌色に補正されるため、補正前の顔の画素の色彩値が表情に相関する色を有していても、印刷結果における表情の印象に与える影響は少ないと考えることができる。そのため、テクスチャーパラメーターλ_kを使用することなく表情判定を行っても、印刷結果における表情の印象と、判定された表情との相違が生じる可能性は低い。上述したように、テクスチャーパラメーターλ_kを使用することなく表情判定をおこなう場合、表情パラメーターｑ_lの要素数は減少するため前記式（１０）の計算を高速化することができる。 When the skin color correction mode is specified in this modification, the facial expression is determined without using the texture parameter λ _k . In the skin color correction mode, the color value of the face pixel before correction is corrected to the ideal skin color, so even if the color value of the face pixel before correction has a color that correlates with the facial expression, the print result It can be considered that there is little influence on the impression of facial expression. Therefore, even if facial expression determination is performed without using the texture parameter λ _k , there is a low possibility that a difference between the impression of the facial expression in the print result and the determined facial expression will occur. As described above, when facial expression determination is performed without using the texture parameter λ _k , the number of elements of the facial expression parameter q _l is reduced, so that the calculation of the equation (10) can be speeded up.

Ｂ４．変形例４：
前記実施例においては、判定値Ｅと比較する閾値を一定としたが、該閾値を状況に応じて変動させるようにしてもよい。例えば、ある形状パラメーターｐ_iやテクスチャーパラメーターλ_kの値が所定の基準値よりも大きいか否かによって、判定値Ｅと比較する閾値を切り換えるようにしてもよい。例えば、第１形状ベクトルｓ₁が顔を大きく左または右に振っていることを示している場合には、顔を大きく左または右に振っていない場合よりも判定値Ｅを下方修正するようにしてもよい。顔を大きく左または右に振っている場合には、各表情の特徴が通常よりも顕著でなくなる場合（例えば表情の変動による口角の変動が、顔振りによって曖昧になる等）には、判定値Ｅを緩和することが望ましい。 B4. Modification 4:
In the embodiment, the threshold value to be compared with the determination value E is constant, but the threshold value may be varied according to the situation. For example, the threshold value to be compared with the determination value E may be switched depending on whether the value of a certain shape parameter p _i or the texture parameter λ _k is larger than a predetermined reference value. For example, when the first shape vector s ₁ indicates that the face is greatly shaken to the left or right, the determination value E is corrected downward as compared with the case where the face is not greatly shaken to the left or right. May be. When the face is greatly shaken to the left or right, if the characteristics of each facial expression are less noticeable than usual (for example, the variation of the mouth angle due to the facial expression is ambiguous due to the facial swing), the judgment value It is desirable to reduce E.

Ｂ５．変形例５：
前記実施例の判定値設定処理においては、各表情との相関係数が上位の形状パラメーターｐ_iとテクスチャーパラメーターλ_kの要素を表情パラメーターｑ_lの要素として設定することとしたが、見た目の感覚に基づいて表情パラメーターｑ_lの要素を選択するようにしてもよい。 B5. Modification 5:
In the determination value setting process of the embodiment, the elements of the shape parameter p _i and the texture parameter λ _k having higher correlation coefficients with each expression are set as the elements of the expression parameter q _l , but The element of the expression parameter q _l may be selected based on the above.

図２４は、本変形例において表情パラメーターｑ_lを選択する処理の流れを示すフローチャートである。まず、コンピューターは、形状パラメーターｐ_iとテクスチャーパラメーターλ_kのうちいずれか１つを選択する（ステップＳ８１０）。ここでは、形状パラメーターｐ₁が選択されたものとして以下説明する。次に、コンピューターは、選択した形状パラメーターｐ₁をレンジにおける下限値に設定する（ステップＳ８２０）。次に、コンピューターは、選択されなかった形状パラメーターｐ_iとテクスチャーパラメーターλ_kの各値を０とする（ステップＳ８３０）。以上のように設定された形状パラメーターｐ_iとテクスチャーパラメーターλ_kを、前記式（１ａ），（２ａ）に適用しつつアフィン変換を行うことにより、コンピューターは、顔のテクスチャーＡ（ｘ）を算出する（ステップＳ８４０）。 FIG. 24 is a flowchart showing a flow of processing for selecting the facial expression parameter q _l in the present modification. First, the computer selects one of the shape parameter p _i and the texture parameter λ _k (step S810). Here, the following description will be made assuming that the shape parameter p ₁ is selected. Next, the computer sets the selected shape parameter p ₁ to the lower limit value in the range (step S820). Next, the computer sets each value of the shape parameter p _i and the texture parameter λ _k not selected to 0 (step S830). The computer calculates the facial texture A (x) by performing the affine transformation while applying the shape parameter p _i and the texture parameter λ _k set as described above to the equations (1a) and (2a). (Step S840).

そして、コンピューターは、テクスチャーＡ（ｘ）をディスプレイに表示させる（ステップＳ８５０）。次に、コンピューターは、選択した形状パラメーターｐ₁がレンジにおける上限値に達していているか否かを判定し（ステップＳ８６０）、達していない場合には選択した形状パラメーターｐ₁の現在の値にレンジの幅の５％を加算して（ステップＳ８６５）、ステップＳ８４０に戻る。このような処理を繰り返して実行することにより、選択した形状パラメーターｐ₁をレンジの下限値から５％ずつ増加させ、それに応じて変動する顔のテクスチャーＡ（ｘ）を順次ディスプレイに表示させていくことができる。選択した形状パラメーターｐ₁は顔の振りに対応する特徴量であるため、形状パラメーターｐ₁の変動とともに表示部１５０に表示される顔の振り角度が変動することとなる。 Then, the computer displays the texture A (x) on the display (step S850). Next, the computer determines whether the shape parameters p ₁ selected has been reached the upper limit value in the range (step S860), the current value of the shape parameter p ₁ selected. If it is judged range Is added (step S865), and the process returns to step S840. By repeatedly executing such processing, the selected shape parameter p ₁ is increased by 5% from the lower limit value of the range, and the facial texture A (x) that fluctuates accordingly is sequentially displayed on the display. be able to. Since the selected shape parameter p ₁ is a feature amount corresponding to the swing of the face, the swing angle of the face displayed on the display unit 150 varies with the variation of the shape parameter p ₁ .

選択した形状パラメーターｐ₁がレンジにおける上限値に達していている場合には、コンピューターは、選択した形状パラメーターｐ₁を表情パラメーターｑ_lとして採用するか否かを問い合わせる表示をディスプレイにて表示させ、入力装置にてオペレーターの操作を受け付ける（ステップＳ８７０）。ここで、採用する旨の操作が受け付けられた場合には、選択した形状パラメーターｐ₁が表情パラメーターｑ_lとして採用される（Ｓ８８０）。次に、すべての形状パラメーターｐ_iとテクスチャーパラメーターλ_kを選択したか否かを判定し（ステップＳ８９０）、すべて選択していない場合には、ステップＳ８１０にて次の形状パラメーターｐ_iまたはテクスチャーパラメーターλ_kを選択する。 If the selected shape parameters p ₁ has not reached the upper limit value in the range, the computer may display the display for inquiring whether or not to adopt the shape parameter p ₁ selected as the expression parameter q _l at the display, The operator's operation is accepted by the input device (step S870). Here, when the operation to adopt is accepted, the selected shape parameter p ₁ is adopted as the facial expression parameter q _l (S880). Next, it is determined whether or not all shape parameters p _i and texture parameters λ _k have been selected (step S890). If not all are selected, the next shape parameter p _i or texture parameter is determined in step S810. to select the λ _k.

以上の処理を実行することにより、オペレーターは形状パラメーターｐ_iとテクスチャーパラメーターλ_kのそれぞれを個別に変動させた場合の、顔のテクスチャーＡ（ｘ）の変動をディスプレイにて観察することができ、どれが表情と相関しているかを把握することができる。そして、相関していると感じれば、現在選択されている形状パラメーターｐ_iまたはテクスチャーパラメーターλ_kの一を表情パラメーターｑ_lとして採用することができる。さらに、オペレーターが相関すると感じた程度を指定する操作を入力装置にて受け付けるようにし、該程度に基づいて重み係数α_lを設定するようにしてもよい。 By executing the above processing, the operator can observe the change in the facial texture A (x) on the display when the shape parameter p _i and the texture parameter λ _k are individually changed. It is possible to grasp which correlates with facial expressions. If it is felt that there is a correlation, one of the currently selected shape parameter p _i or texture parameter λ _k can be adopted as the expression parameter q _l . Further, an operation for designating the degree that the operator feels to be correlated may be accepted by the input device, and the weighting coefficient α _l may be set based on the degree.

Ｂ６．変形例６：
前記実施例においては、もっとも大きい判定値Ｅに対応する表情を有していると判定するようにしたが、図１９（ｃ）のように喜顔と驚顔の判定値Ｅが双方とも閾値を超え、かつ、双方の判定値Ｅが近い値である場合には、双方の表情を有していると判定するようにしてもよい。すなわち、図１９（ｃ）のように、２番目に大きい判定値Ｅと、最大の判定値Ｅとの差が所定の基準範囲（例えば、最大の判定値Ｅの１０％等。）内であれば、２つの表情が混在していると判定するようにしてもよい。 B6. Modification 6:
In the above embodiment, it is determined that the expression corresponding to the largest determination value E is present. However, as shown in FIG. If both of the determination values E are close to each other, it may be determined that both expressions are present. That is, as shown in FIG. 19C, the difference between the second largest determination value E and the maximum determination value E is within a predetermined reference range (for example, 10% of the maximum determination value E). For example, it may be determined that two facial expressions are mixed.

Ｂ７．変形例７：
前記実施例では、特徴点ＣＰ初期位置設定処理において、４つのグローバルパラメーター（大きさ、傾き、上下方向の位置、左右方向の位置）のそれぞれについての３段階の値の組み合わせに対応する合計８０種類（＝３×３×３×３−１）の仮設定位置が予め設定されているが、仮設定位置の設定に用いるパラメーターの種類および数やパラメーター値の段階数は変更可能である。例えば、４つのグローバルパラメーターの内の一部のみが仮設定位置の設定に用いられるとしてもよいし、用いられるパラメーターのそれぞれについて、５段階の値の組み合わせにより仮設置位置が設定されるものとしてもよい。 B7. Modification 7:
In the embodiment, in the feature point CP initial position setting process, a total of 80 types corresponding to combinations of values in three stages for each of the four global parameters (size, inclination, vertical position, and horizontal position). A temporary setting position (= 3 × 3 × 3 × 3-1) is preset, but the type and number of parameters used for setting the temporary setting position and the number of parameter values can be changed. For example, only a part of the four global parameters may be used for setting the temporary setting position, or the temporary setting position may be set by combining five values for each parameter used. Good.

Ｂ８．変形例８：
前記実施例の特徴点ＣＰ設定位置補正処理では、注目画像ＯＩに基づき平均形状画像Ｉ（Ｗ（ｘ；ｐ））を算出することにより注目画像ＯＩの特徴点ＣＰの設置位置を平均顔画像Ａ₀（ｘ）の特徴点ＣＰの設置位置に整合させているが、平均顔画像Ａ₀（ｘ）に対して画像変換をおこなうことにより両者の特徴点ＣＰの配置を整合させるものとしてもよい。すなわち、前記実施例では、「顔領域」としての平均形状画像Ｉ（Ｗ（ｘ；ｐ））を変化させることとしたが、平均顔画像Ａ₀（ｘ）を形状パラメーターｐ_iやテクスチャーパラメーターλ_kの変化に応じて変化させてもよい。いずれにしても、注目画像ＯＩに含まれる「顔領域」と、顔モデルによって再現される顔の画像との差を小さくさせる形状パラメーターｐ_iやテクスチャーパラメーターλ_kに補正されることとなる。 B8. Modification 8:
In the feature point CP setting position correction process of the above embodiment, the average shape image I (W (x; p)) is calculated based on the target image OI to determine the installation position of the feature point CP of the target image OI as the average face image A. _Although it is matched with the installation position of the feature point CP of ₀ (x), the arrangement of both feature points CP may be matched by performing image conversion on the average face image A ₀ (x). That is, in the above embodiment, the average shape image I (W (x; p)) as the “face region” is changed, but the average face image A ₀ (x) is changed to the shape parameter p _i and the texture parameter λ. You may change according to the change of _k . In any case, the shape parameter p _i and the texture parameter λ _k that reduce the difference between the “face region” included in the target image OI and the face image reproduced by the face model are corrected.

Ｂ９．変形例９：
前記実施例におけるサンプル画像ＳＩ（図３）はあくまで一例であり、サンプル画像ＳＩとして採用する画像の数、種類は任意に設定可能である。また、前記実施例において、特徴点ＣＰの位置で示される顔の所定の特徴部位（図４参照）はあくまで一例であり、実施例において設定されている特徴部位の一部を省略したり、特徴部位として他の部位を採用したりしてもよい。 B9. Modification 9:
The sample image SI (FIG. 3) in the above embodiment is merely an example, and the number and type of images employed as the sample image SI can be arbitrarily set. In the embodiment, the predetermined feature portion (see FIG. 4) of the face indicated by the position of the feature point CP is merely an example, and a part of the feature portion set in the embodiment may be omitted, Other parts may be adopted as the part.

また、前記実施例では、サンプル画像ＳＩｗの画素群ｘのそれぞれにおける輝度値により構成される輝度値ベクトルに対する主成分分析によってテクスチャーモデルが設定されているが、顔画像のテクスチャー（見え）を表す輝度値以外の指標値（例えばＲＧＢ値）に対する主成分分析によってテクスチャーモデルが設定されるものとしてもよい。 In the embodiment, the texture model is set by the principal component analysis for the brightness value vector formed by the brightness value in each of the pixel groups x of the sample image SIw, but the brightness representing the texture (appearance) of the face image. The texture model may be set by principal component analysis for index values other than the values (for example, RGB values).

また、前記実施例において、平均顔画像Ａ₀（ｘ）のサイズは５６画素×５６画素に限られず他のサイズであってもよい。また、平均顔画像Ａ₀（ｘ）は、マスク領域ＭＡ（図８）を含む必要はなく、平均形状領域ＢＳＡのみによって構成されるとしてもよい。また、平均顔画像Ａ₀（ｘ）の代わりに、サンプル画像ＳＩの統計的分析に基づき設定される他の基準顔画像が用いられるとしてもよい。 In the embodiment, the size of the average face image A ₀ (x) is not limited to 56 pixels × 56 pixels, and may be other sizes. Further, the average face image A ₀ (x) does not need to include the mask area MA (FIG. 8), and may be configured only by the average shape area BSA. Further, instead of the average face image A ₀ (x), another reference face image set based on the statistical analysis of the sample image SI may be used.

また、判定対象の表情は、喜顔と悲顔と驚顔と怒顔に限られるものではなく、困顔や寝顔や焦顔等の表情を判定するようにしてもよい。むろん、前記実施例や変形例において、喜顔と悲顔と驚顔と怒顔のすべてを判定するのではなく、例えば喜顔であるか否かのみを判定するようにしてもよい。 Moreover, the facial expression to be determined is not limited to a happy face, a sad face, a surprised face, and an angry face, and expressions such as a troubled face, a sleeping face, and a dark face may be determined. Of course, in the embodiment and the modified example, it is possible to determine not only all of a happy face, a sad face, a surprised face, and an angry face, but, for example, whether or not it is a happy face.

また、前記実施例では、ＡＡＭを用いた形状モデルおよびテクスチャーモデルの設定が行われているが、他のモデル化手法（例えばＭｏｒｐｈａｂｌｅＭｏｄｅｌと呼ばれる手法やＡｃｔｉｖｅＢｌｏｂと呼ばれる手法）を用いて形状モデルおよびテクスチャーモデルの設定が行われるとしてもよい。 In the embodiment, the shape model and the texture model are set using AAM. However, the shape model and texture model using other modeling methods (for example, a method called Morphable Model or a method called Active Blob) are used. A texture model may be set.

また、前記実施例では、メモリーカードＭＣに格納された画像が注目画像ＯＩに設定されているが、注目画像ＯＩは例えばネットワークを介して取得された画像であってもよい。また、前記実施例では、画像処理装置としてのプリンター１００による画像処理を説明したが、処理の一部または全部がパーソナルコンピューターやデジタルスチルカメラ、デジタルビデオカメラ等の他の種類の画像処理装置により実行されるものとしてもよい。また、プリンター１００はインクジェットプリンターに限らず、他の方式のプリンター、例えばレーザプリンターや昇華型プリンターであるとしてもよい。 In the embodiment, the image stored in the memory card MC is set as the attention image OI. However, the attention image OI may be an image acquired via a network, for example. In the above embodiment, image processing by the printer 100 as an image processing apparatus has been described. However, part or all of the processing is executed by another type of image processing apparatus such as a personal computer, a digital still camera, or a digital video camera. It is good also as what is done. The printer 100 is not limited to an ink jet printer, and may be another type of printer such as a laser printer or a sublimation printer.

前記実施例において、ハードウェアによって実現されていた構成の一部をソフトウェアに置き換えるようにしてもよく、逆に、ソフトウェアによって実現されていた構成の一部をハードウェアに置き換えるようにしてもよい。 In the embodiment, a part of the configuration realized by hardware may be replaced by software, and conversely, a part of the configuration realized by software may be replaced by hardware.

また、本発明の機能の一部または全部がソフトウェアで実現される場合には、そのソフトウェア（コンピュータープログラム）は、コンピューター読み取り可能な記録媒体に格納された形で提供することができる。この発明において、「コンピューター読み取り可能な記録媒体」とは、フレキシブルディスクやＣＤ−ＲＯＭのような携帯型の記録媒体に限らず、各種のＲＡＭやＲＯＭ等のコンピューター内の内部記憶装置や、ハードディスク等のコンピューターに固定されている外部記憶装置も含んでいる。 In addition, when part or all of the functions of the present invention are realized by software, the software (computer program) can be provided in a form stored in a computer-readable recording medium. In the present invention, the “computer-readable recording medium” is not limited to a portable recording medium such as a flexible disk or a CD-ROM, but an internal storage device in a computer such as various RAMs and ROMs, a hard disk, etc. It also includes an external storage device fixed to the computer.

１００…プリンター、１１０…ＣＰＵ、１２０…内部メモリー、１４０…操作部、１５０…表示部、１６０…印刷機構、１７０…カードインターフェース、１７２…カードスロット、２００…画像処理部、２１０…設定部、２２０…特徴位置検出部、２２２，２４２…生成部、２２４，２４４…算出部、２２６，２４６…補正部、２３０…顔領域検出部、２４０…テクスチャー検出部、２５０…表情判定部、３１０…表示処理部、３２０…印刷処理部。 DESCRIPTION OF SYMBOLS 100 ... Printer, 110 ... CPU, 120 ... Internal memory, 140 ... Operation part, 150 ... Display part, 160 ... Printing mechanism, 170 ... Card interface, 172 ... Card slot, 200 ... Image processing part, 210 ... Setting part, 220 ... feature position detection unit, 222, 242 ... generation unit, 224, 244 ... calculation unit, 226, 246 ... correction unit, 230 ... face area detection unit, 240 ... texture detection unit, 250 ... facial expression determination unit, 310 ... display process Part, 320... Print processing part.

Claims

An image processing apparatus for determining facial expressions included in a noticed image,
A face area detection unit that detects an image area including at least a part of a face image from the attention image as a face area;
A setting unit that sets a face model that reproduces at least the position of a feature part of the face based on a plurality of feature amounts;
A correction unit that corrects the feature amount so that the position of the feature part reproduced by the face model approximates the face region;
A determination unit that calculates a determination value indicating the likelihood of the expression for each of a plurality of facial expressions based on the corrected feature amount, and determines a facial expression by combining the determination values for each expression; apparatus.

The image processing apparatus according to claim 1, wherein the determination unit determines that the face region has an expression corresponding to the determination value that most strongly indicates the likelihood of expression.

3. The image according to claim 1, wherein at least one of the feature amounts is a coefficient of an eigenvector of each principal component obtained by principal component analysis of the position of the feature portion of a plurality of sample images. Processing equipment.

The at least one of the feature amounts includes a coefficient of an eigenvector of each principal component obtained by principal component analysis of color values of pixels of a plurality of sample images. Image processing device.

In sequentially determining a plurality of facial expressions included in the plurality of images of interest associated with different times,
When it is determined that the attention image has a certain expression,
Until the face included in the pixel of interest associated with a later time is determined not to be the facial expression, calculation of the determination value for other facial expressions and determination based on the determination value are not performed. The image processing apparatus according to any one of claims 1 to 4.

6. The expression according to claim 1, wherein the determination unit determines a facial expression by comparing the determination value obtained by linearly combining at least two of the corrected feature quantities with a threshold value. Image processing apparatus.

An image processing method for determining a facial expression included in an image of interest,
An image area including at least a part of a face image is detected as a face area from the attention image,
Set a face model that reproduces at least the position of the facial feature based on multiple feature quantities,
Correcting the feature amount so that the position of the feature part reproduced by the face model approximates the face region;
An image processing method for calculating a determination value for each of a plurality of facial expressions based on the corrected feature amount, and determining a facial expression by combining the determination values for each facial expression.

An image processing program for image processing for determining a facial expression included in an image of interest,
A face area detection function for detecting, as a face area, an image area including at least a part of a face image from the attention image;
A setting function for setting a face model that reproduces at least the position of a facial feature part based on a plurality of feature amounts;
A correction function for correcting the feature amount so that the position of the feature part reproduced by the face model approximates the face region;
An image processing program for causing a computer to realize a determination function for calculating a determination value for each of a plurality of facial expressions based on the corrected feature amount and determining a facial expression by combining the determination values for each facial expression.

A face area detection unit that detects an image area including at least a part of a face image from the attention image as a face area;
A setting unit that sets a face model that reproduces at least the position of a feature part of the face based on a plurality of feature amounts;
A correction unit that corrects the feature amount so that the position of the feature part reproduced by the face model approximates the face region;
A determination unit that calculates a determination value for each of a plurality of facial expressions based on the corrected feature amount, and determines a facial expression by combining the determination values for each expression;
A printing apparatus comprising: a printing unit that prints the attention image including a face image determined to have a predetermined facial expression among the attention images.