JP2019062436A

JP2019062436A - Image processing apparatus, image processing method, and program

Info

Publication number: JP2019062436A
Application number: JP2017186335A
Authority: JP
Inventors: 有一中田; Yuichi Nakada
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2017-09-27
Filing date: 2017-09-27
Publication date: 2019-04-18
Anticipated expiration: 2037-09-27
Also published as: JP7057086B2

Abstract

To generate an image taken under a desired illumination condition from a captured image even if a subject distance changes.SOLUTION: An image processing apparatus includes image acquisition means 301 for acquiring image data obtained by imaging a scene including a subject, first acquisition means 105 for acquiring distance information from an imaging apparatus that captures the scene to the subject, storage means 202 for storing normal line information corresponding to a predetermined feature pattern associated with the subject, generation means 304 for generating normal line information to be applied to lighting processing corresponding to the image data on the basis of the distance information and the normal line information stored in the storage means, and lighting processing means 306 for executing the lighting processing for the image data using the normal line information generated by the generation means.SELECTED DRAWING: Figure 3

Description

本発明は、画像処理装置、画像処理方法、及びプログラムに関する。 The present invention relates to an image processing apparatus, an image processing method, and a program.

人物を対象とした写真撮影において、顔の向きや構図、ライティング等は、被写体の印象を決める大きな要因の一つである。そのため、これらの撮影条件を写真撮影後に調整することで、被写体を補正することが行われている。例えば、特許文献１には、顔の立体形状を表す顔立体形状テンプレートより生成される顔モデルを用いて、被写体を補正する画像処理装置が開示されている。 In photographing a subject, the orientation, composition, lighting, etc. of the face are one of the major factors that determine the impression of the subject. Therefore, the subject is corrected by adjusting these shooting conditions after taking a picture. For example, Patent Document 1 discloses an image processing apparatus that corrects an object using a face model generated from a face solid shape template representing a face solid shape.

この特許文献１の画像処理装置では、画像データから検出した顔情報（即ち、顔位置情報、顔の大きさ情報、顔部品情報、及び顔方向情報）に基づいて、顔立体形状テンプレートを変形させて被写体の顔モデルを生成している。また、特許文献１の画像処理装置では、検出された顔部品情報や顔の大きさ情報等に基づいて顔の立体形状を推定し、複数の顔立体形状テンプレートから被写体に適した顔立体形状テンプレートを選択している。これにより、顔の個人差や顔の向き等による見た目の違いに応じて、顔立体形状テンプレートを選択することを可能としている。 In the image processing apparatus of Patent Document 1, the three-dimensional face shape template is deformed based on face information (that is, face position information, face size information, face parts information, and face direction information) detected from image data. The face model of the subject is generated. Further, in the image processing apparatus of Patent Document 1, a three-dimensional shape of a face is estimated based on the detected face part information, face size information, etc., and a three-dimensional shape template suitable for a subject from a plurality of face three-dimensional shape templates. Is selected. In this way, it is possible to select a three-dimensional face shape template in accordance with the difference in appearance due to the individual difference of the face, the orientation of the face, and the like.

特開２０１４−４９８６６号公報JP, 2014-49866, A

しかしながら、特許文献１の画像処理装置のように、少なくとも被写体距離を考慮することなく、顔立体形状テンプレートを選択する場合、被写体距離が変化すると、被写体の見た目とは異なる顔立体形状テンプレートが選択される可能性がある。そして、これにより、顔モデルも被写体と似つかないように生成され、結果、その顔モデルを用いて補正された被写体は不自然なものとして生成される。 However, as in the image processing apparatus of Patent Document 1, when selecting the face solid shape template without considering at least the subject distance, when the subject distance changes, a face solid shape template different from the subject's appearance is selected. There is a possibility of As a result, the face model is also generated so as not to match the subject, and as a result, the subject corrected using the face model is generated as unnatural.

本発明は、前記従来の問題に鑑みてなされたものであって、その目的は、被写体距離が変化しても、撮像画像から所望の照明条件下で撮影したかのような画像を生成することである。 The present invention has been made in view of the above-described conventional problems, and an object thereof is to generate an image as if taken under a desired illumination condition from a captured image even if the subject distance changes. It is.

上記目的を達成するために、本発明の画像処理装置は、被写体を含むシーンを撮像することにより得られる画像データを取得する画像取得手段と、前記シーンを撮像した撮像装置から前記被写体までの距離情報を取得する第一の取得手段と、前記被写体に関連する所定の特徴パターンに対応する法線情報を記憶する記憶手段と、前記距離情報と前記記憶手段が記憶する前記法線情報に基づき、前記画像データに対するライティング処理に適用する法線情報を生成する生成手段と、前記生成手段が生成した前記法線情報を用いて、前記画像データに対して、ライティング処理を実行するライティング処理手段とを備える。 In order to achieve the above object, an image processing apparatus according to the present invention includes an image acquisition unit for acquiring image data obtained by imaging a scene including a subject, and a distance from the imaging apparatus which captured the scene to the subject A first acquisition unit for acquiring information, a storage unit for storing normal line information corresponding to a predetermined feature pattern related to the subject, the distance information, and the normal line information stored by the storage unit; A generation unit configured to generate normal information to be applied to the lighting processing on the image data; and a lighting processing unit configured to execute the lighting processing on the image data using the normal information generated by the generation unit. Prepare.

本発明によれば、被写体距離が変化しても、撮像画像から所望の照明条件下で撮影したかのような画像を生成することができる。 According to the present invention, even if the subject distance changes, it is possible to generate an image as if taken under a desired illumination condition from the captured image.

画像処理装置を備えた撮像装置の外観を示す図である。It is a figure showing the appearance of the imaging device provided with the image processing device. 画像処理装置を備えた撮像装置の内部構成を示すブロック図である。It is a block diagram showing an internal configuration of an imaging device provided with an image processing device. 画像処理装置の機能構成を示すブロック図である。It is a block diagram showing functional composition of an image processing device. 画像処理装置における処理の手順を示すフローチャートである。5 is a flowchart showing the procedure of processing in the image processing apparatus. カラー画像データと距離画像データを示す図である。It is a figure which shows color image data and distance image data. 顔領域と器官位置を示す図である。It is a figure which shows a face area | region and an organ position. 顔モデル法線情報取得処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a face model normal line information acquisition process. 顔モデル法線情報を示す図である。It is a figure which shows face model normal information. 法線画像データ生成処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of normal image data generation processing. 法線画像データ生成処理の概要を示す図である。It is a figure which shows the outline | summary of normal image data generation processing. ライティング処理の概要を示す図である。It is a figure which shows the outline | summary of a lighting process. ライティング処理の結果を示す図である。It is a figure which shows the result of a lighting process. 顔モデル法線情報取得処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a face model normal line information acquisition process. 領域別顔モデル法線情報を示す図である。It is a figure which shows area classified face model normal line information. 顔モデル法線情報を示す図である。It is a figure which shows face model normal information. 領域別顔モデル法線情報取得処理の手順を示すフローチャートである。It is a flow chart which shows a procedure of field model face information normal information acquisition processing according to field. 顔法線画像データ生成処理の概要を示す図である。It is a figure which shows the outline | summary of a face normal line image data generation process. 画像処理装置の機能構成を示すブロック図である。It is a block diagram showing functional composition of an image processing device. 画像処理装置における処理の手順を示すフローチャートである。5 is a flowchart showing the procedure of processing in the image processing apparatus. 被写体モデル法線情報を示す図である。It is a figure showing subject model normal line information.

（撮像装置の外観）
図１は、実施形態に係る画像処理装置を備えた撮像装置の外観を示す図であり、図１（ａ）は撮像装置の前面、図１（ｂ）は背面の外観を各々、示している。撮像装置１０１は、光学部１０２、撮像ボタン１０３、ストロボ１０４、距離画像取得部１０５、表示部１０６、及び操作ボタン１０７を備える。 (Appearance of the imaging device)
FIG. 1 is a view showing an appearance of an imaging apparatus provided with an image processing apparatus according to the embodiment, and FIG. 1A shows an appearance of a front surface of the imaging apparatus and FIG. . The imaging apparatus 101 includes an optical unit 102, an imaging button 103, a flash 104, a distance image acquisition unit 105, a display unit 106, and an operation button 107.

光学部１０２は、ズームレンズ、フォーカスレンズ、ブレ補正レンズ、絞り、及びシャッターによって構成される鏡筒であり、被写体の光情報を集光する。撮像ボタン１０３は、ユーザが撮像の開始を撮像装置１０１に指示するためのボタンである。ストロボ１０４は、ユーザの指示に従って、撮像の開始に合わせて発光させることが可能な照明である。 The optical unit 102 is a lens barrel configured of a zoom lens, a focus lens, a shake correction lens, an aperture, and a shutter, and collects light information of an object. The imaging button 103 is a button for the user to instruct the imaging apparatus 101 to start imaging. The strobe 104 is an illumination that can be emitted in response to the start of imaging according to a user's instruction.

距離画像取得部１０５は、距離情報として、撮像指示に応じて撮像したシーンにおける被写体の距離画像データを取得する。ここで、距離画像データとは、画像の各画素の画素値として、その画素に対応する被写体距離を格納した画像データのことである。距離画像取得部１０５は、赤外光を発光する赤外発光部と、被写体に反射した赤外光を受光する受光部とを備える。距離画像取得部１０５は、赤外発光部において発光した赤外光が被写体に反射し、受光部が受光するまでの時間に基づいて、撮像装置から被写体までの距離値を算出する。そして、算出した距離値と受光部のセンサ画素数や画角等を含む距離撮像情報に基づいて、被写体の位置情報を算出し、距離画像データを生成する。なお、距離画像データの取得方法は、必ずしもこれに限られない。したがって、例えば、距離画像取得部１０５の代わりに光学部１０２と同様の光学系を設け、異なる２つの視点から撮像された画像データ間の視差に基づいて、三角測量を行うことにより距離画像データを取得することもできる。 The distance image acquisition unit 105 acquires, as distance information, distance image data of an object in a scene captured according to an imaging instruction. Here, the distance image data is image data in which the subject distance corresponding to the pixel is stored as the pixel value of each pixel of the image. The distance image acquiring unit 105 includes an infrared light emitting unit that emits infrared light, and a light receiving unit that receives infrared light reflected by the subject. The distance image acquisition unit 105 calculates the distance value from the imaging device to the subject based on the time until the infrared light emitted from the infrared light emitting unit is reflected on the subject and the light receiving unit receives the light. Then, based on the calculated distance value and distance imaging information including the number of sensor pixels of the light receiving unit, the angle of view, etc., the position information of the object is calculated, and distance image data is generated. In addition, the acquisition method of distance image data is not necessarily restricted to this. Therefore, for example, an optical system similar to the optical unit 102 is provided instead of the distance image acquisition unit 105, and triangulation is performed based on the parallax between the image data captured from two different viewpoints. You can also get it.

表示部１０６は、撮像装置１０１にて処理された画像データや他の各種データを表示する、液晶ディスプレイ等の表示装置である。なお、図１に示すように、撮像装置１０１は光学ファインダを備えていないので、フレーミング操作（ピントや構図の確認）は表示部１０６を用いて行われる。即ち、撮像装置１０１に関して、表示部１０６においてライブビュー画像を確認しながら撮像が行われるので、フレーミングやフォーカシングの操作が行われるとき、表示部１０６は、電子ファインダとしても機能する。その他、表示部１０６では、カメラ設定メニュー等も表示する。 The display unit 106 is a display device such as a liquid crystal display that displays image data processed by the imaging device 101 and other various data. Note that as shown in FIG. 1, since the imaging device 101 is not provided with an optical finder, a framing operation (confirmation of focus and composition) is performed using the display unit 106. That is, since the imaging is performed while confirming the live view image in the display unit 106 regarding the imaging device 101, the display unit 106 also functions as an electronic finder when the framing and focusing operations are performed. In addition, the display unit 106 also displays a camera setting menu and the like.

操作ボタン１０７は、撮像装置１０１の動作モードの切り換え操作や、撮像時の各種パラメータ等をユーザが撮像装置１０１に指示するためのボタンである。なお、撮像装置１０１は、動作モードの一つとして、撮像された画像における照明の当たり具合を撮像後に補正するライティング処理モードを備える。そのため、ユーザは、操作ボタン１０７又は撮像ボタン１０３を用いて、ライティング処理モードへの切り替え、ライティング処理に用いる仮想照明の照明パラメータの設定、さらに照明の当たり具合を調整（補正）する被写体の選択等を行うことができる。また、ユーザは補正された画像データを出力する際に、距離画像データを出力するか否か等の指示をすることもできる。その他、表示部１０６は、タッチスクリーン機能を備えていてもよく、その場合、タッチスクリーンを用いたユーザ指示を操作ボタン１０７の入力として扱うことも可能である。 The operation button 107 is a button for the user to instruct the imaging device 101 to switch the operation mode of the imaging device 101, various parameters at the time of imaging, and the like. Note that the imaging apparatus 101 includes, as one of the operation modes, a lighting processing mode in which the degree of illumination of the captured image is corrected after imaging. Therefore, the user uses the operation button 107 or the imaging button 103 to switch to the lighting processing mode, sets the illumination parameters of the virtual illumination used for the lighting processing, and selects the subject whose illumination condition is adjusted (corrected), etc. It can be performed. Further, when outputting the corrected image data, the user can also instruct whether or not to output the distance image data. In addition, the display unit 106 may have a touch screen function. In that case, it is also possible to handle a user instruction using the touch screen as an input of the operation button 107.

（撮像装置の内部構成）
図２は、実施形態に係る画像処理装置を備えた撮像装置の内部構成を示すブロック図である。ＣＰＵ２０１は、各構成の処理の全てに関わり、ＲＯＭ（Read Only Memory）２０２や、ＲＡＭ（Random Access Memory）２０３に格納された命令を順に読み込み、解釈することで処理を実行する。 (Internal configuration of imaging device)
FIG. 2 is a block diagram showing an internal configuration of an imaging apparatus provided with the image processing apparatus according to the embodiment. The CPU 201 relates to all the processing of each configuration, and executes processing by sequentially reading and interpreting instructions stored in a ROM (Read Only Memory) 202 and a RAM (Random Access Memory) 203.

ＲＯＭ２０２は、ＣＰＵ２０１により実行されるプログラムを記憶する。なお、本実施形態において、ＲＯＭ２０２は、複数の顔モデル法線情報及び領域別顔モデル法線情報を格納するものとする。顔モデル法線情報は、所定の形状の顔（顔モデル）に対応する、顔表面の法線ベクトルを画素値に格納した顔モデル法線画像データと、顔モデル法線画像データにおける目・鼻・口等の器官位置を示す顔モデル法線器官位置とを含む。また、領域別顔モデル法線情報は、顔輪郭・目・鼻・口等の領域に対応する法線ベクトルを画素値に格納した領域別顔モデル法線画像データと、領域別顔モデル法線画像データにおける器官位置を示す領域別顔モデル法線器官位置とを含む。ＲＡＭ２０３は、ＣＰＵ２０１により実行されるプログラムを記憶する。 The ROM 202 stores a program executed by the CPU 201. In the present embodiment, the ROM 202 stores a plurality of face model normal line information and area-specific face model normal line information. The face model normal information corresponds to a face of a predetermined shape (face model), face model normal image data in which a normal vector of the face surface is stored in pixel values, and an eye and a nose in the face model normal image data -Including a face model normal organ position indicating the position of an organ such as the mouth. In addition, region-specific face model normal information includes region-specific face model normal image data in which normal vectors corresponding to regions such as face contour, eyes, nose, and mouth are stored in pixel values, and region-specific face model normals. And a region-specific face model normal organ position indicating an organ position in the image data. The RAM 203 stores a program to be executed by the CPU 201.

光学系制御部２０４は、光学部１０２に対して、フォーカスを合わせる、シャッターを開く、絞りを調整する等のＣＰＵ２０１から指示された制御を行う制御回路である。また、光学系制御部２０４は、光学系の状態等に基づいて、フォーカスを合わせた被写体までの距離を取得し、ＲＡＭ２０３等に記憶することができる。 The optical system control unit 204 is a control circuit that performs control instructed by the CPU 201 such as focusing on the optical unit 102, opening the shutter, adjusting the aperture, and the like. In addition, the optical system control unit 204 can acquire the distance to the focused object based on the state of the optical system and the like, and store the distance in the RAM 203 or the like.

制御部２０５は、撮像ボタン１０３や操作ボタン１０７からのユーザ指示を受け取り、撮像モード、ライティング処理モード等への切り換え、被写体領域の選択、照明パラメータの設定等の制御を行う制御回路である。カラー撮像素子部２０６は、光学部１０２にて集光された光情報を電流値に変換する撮像素子である。カラー撮像素子部２０６は、ベイヤ配列等の所定の配列を有するカラーフィルタを備え、光学部１０２において集光された光から被写体の色情報を取得する。 A control unit 205 is a control circuit that receives a user instruction from the imaging button 103 or the operation button 107, controls switching to an imaging mode, a lighting processing mode, etc., selects a subject region, sets illumination parameters, and the like. The color imaging element unit 206 is an imaging element that converts the light information collected by the optical unit 102 into a current value. The color imaging element unit 206 includes a color filter having a predetermined arrangement such as a Bayer arrangement, and acquires color information of an object from light collected by the optical unit 102.

Ａ／Ｄ変換部２０７は、カラー撮像素子部２０６において検知された被写体の色情報をデジタル信号値に変換し、ＲＡＷ画像データとする処理回路である。なお、本実施形態では、同時刻に撮像した距離画像データとＲＡＷ画像データを取得できるものとする。画像処理部２０８は、Ａ／Ｄ変換部２０７で取得されたＲＡＷ画像データに対して現像処理を行い、カラー画像データを生成する。また、画像処理部２０８は、カラー画像データや距離画像データを用いて、カラー画像データにライティング処理を施した補正画像データを生成する等の各種画像処理を実行する。なお、画像処理部２０８の内部構成及び機能は、後述の図３を用いて詳述する。 The A / D conversion unit 207 is a processing circuit that converts color information of the subject detected in the color imaging device unit 206 into digital signal values to obtain RAW image data. In the present embodiment, it is assumed that distance image data and RAW image data captured at the same time can be acquired. The image processing unit 208 performs development processing on the RAW image data acquired by the A / D conversion unit 207, and generates color image data. Further, the image processing unit 208 performs various image processing such as generation of corrected image data obtained by performing lighting processing on color image data using color image data and distance image data. The internal configuration and functions of the image processing unit 208 will be described in detail with reference to FIG. 3 described later.

キャラクタージェネレーション部２０９は、文字やグラフィック等を生成する処理回路である。キャラクタージェネレーション部２０９により生成された文字やグラフィックは、表示部１０６において、画像データや補正画像データ等に重畳して表示される。エンコーダ部２１０は、画像処理部２０８において処理されたカラー画像データやライティング処理によって生成される補正画像データを含む各種画像データをＪｐｅｇ等のファイルフォーマットに変換する。 The character generation unit 209 is a processing circuit that generates characters, graphics, and the like. The characters and graphics generated by the character generation unit 209 are displayed on the display unit 106 so as to be superimposed on image data, corrected image data, and the like. The encoder unit 210 converts various image data including color image data processed by the image processing unit 208 and corrected image data generated by lighting processing into a file format such as Jpeg.

メディアＩ／Ｆ２１１は、ＰＣ／メディア２１３（例えば、ハードディスク、メモリカード、ＣＦカード、ＳＤカード等）に、画像データを送受信するためのインタフェースである。メディアＩ／Ｆ２１１としては、例えば、ＵＳＢ（Universal Serial Bus）等が用いられる。システムバス２１２は、データを送受信するためのバスである。 The media I / F 211 is an interface for transmitting and receiving image data to and from the PC / media 213 (for example, a hard disk, a memory card, a CF card, an SD card, etc.). As the media I / F 211, for example, USB (Universal Serial Bus) or the like is used. The system bus 212 is a bus for transmitting and receiving data.

（画像処理部の内部構成）
図３は、実施形態に係る画像処理装置（画像処理部２０８）の機能構成を示すブロック図である。現像処理部３０１は、Ａ／Ｄ変換部２０７から取得したＲＡＷ画像データに対して、ホワイトバランス処理、デモザイク処理、ノイズリダクション処理、色変換処理等を施し、カラー画像データを生成する。なお、撮像装置１０１は、現像処理部３０１において生成されたカラー画像データを表示部１０６に出力して表示することや、ＲＡＭ２０３、ＰＣ／メディア２１３等の記憶装置に記憶することができる。 (Internal configuration of image processing unit)
FIG. 3 is a block diagram showing the functional configuration of the image processing apparatus (image processing unit 208) according to the embodiment. The development processing unit 301 performs white balance processing, demosaicing processing, noise reduction processing, color conversion processing, and the like on the RAW image data acquired from the A / D conversion unit 207, and generates color image data. The imaging apparatus 101 can output color image data generated by the development processing unit 301 to the display unit 106 for display, or can store the color image data in a storage device such as the RAM 203 or the PC / media 213.

顔検出部３０２は、現像処理部３０１より取得したカラー画像データから被写体の顔情報を取得する。なお、被写体の顔情報には、カラー画像データにおいて被写体の顔が占める領域を示す顔領域と、顔に含まれる目や口等のカラー画像データにおける位置を示す器官位置とに関する情報が少なくとも含まれる。 The face detection unit 302 acquires face information of a subject from color image data acquired from the development processing unit 301. The face information of the subject includes at least information on a face area indicating an area occupied by the face of the subject in the color image data and an organ position indicating a position in the color image data such as eyes and mouth included in the face. .

顔モデル法線情報取得部３０３は、被写体までの距離情報に基づいて、複数の顔モデル法線情報から被写体に合う法線情報を選択する。本実施形態では、顔検出部３０２から取得した顔情報と距離画像取得部１０５から取得した距離画像データに基づいて、ＲＯＭ２０２から顔モデル法線情報を選択する。 The face model normal information acquisition unit 303 selects normal line information matching the subject from the plurality of pieces of face model normal line information based on the distance information to the subject. In the present embodiment, face model normal line information is selected from the ROM 202 based on the face information acquired from the face detection unit 302 and the distance image data acquired from the distance image acquisition unit 105.

法線生成部３０４は、顔検出部３０２から顔情報、顔モデル法線情報取得部３０３から顔モデル法線情報、距離画像取得部１０５から距離画像データを各々、取得すると、それらの情報に基づいて、カラー画像データに対応する法線画像データを生成する。 The normal generation unit 304 acquires face information from the face detection unit 302, face model normal information from the face model normal information acquisition unit 303, and distance image data from the distance image acquisition unit 105, respectively, based on the information. And generates normal image data corresponding to the color image data.

照明パラメータ設定部３０５は、制御部２０５から受け渡されるユーザ操作（指示）に基づいて、照明パラメータを設定する。なお、本実施形態における照明パラメータは、照明の位置、姿勢、強度、光源色等を含む。ライティング処理部３０６は、距離画像取得部１０５から距離画像データ、法線生成部３０４から法線画像データ、照明パラメータ設定部３０５から照明パラメータを各々、取得し、それらのデータに基づいて、カラー画像データにライティング処理を実行する。なお、撮像装置１０１では、ライティング処理により補正された補正画像データを、ＲＡＭ２０３やＰＣ／メディア２１３等の記憶装置に出力し、記憶することや、表示部１０６に出力して表示することができる。 The illumination parameter setting unit 305 sets an illumination parameter based on a user operation (instruction) received from the control unit 205. The illumination parameters in the present embodiment include the position, attitude, intensity, light source color, and the like of the illumination. The lighting processing unit 306 acquires the distance image data from the distance image acquisition unit 105, the normal image data from the normal generation unit 304, and the illumination parameter from the illumination parameter setting unit 305, respectively, and a color image is obtained based on those data. Perform a writing process on the data. The imaging apparatus 101 can output the corrected image data corrected by the lighting processing to a storage device such as the RAM 203 or the PC / media 213 for storage or output to the display unit 106 for display.

（画像処理部の処理フロー）
図４は、実施形態に係る画像処理装置（画像処理部２０８）における処理の手順を示すフローチャートである。図４のフローチャートに示す処理において、画像処理部２０８は、先ず、カラー画像データから取得した顔情報と距離画像データに基づいて、顔モデル法線情報を選択する。画像処理部２０８は、次に、顔情報、顔モデル法線情報、及び距離画像データに基づいて、カラー画像データに対応する法線画像データを生成する。画像処理部２０８は、さらに、ユーザにより設定された照明パラメータ、距離画像データ及び法線画像データに基づいて、カラー画像データに仮想照明を付加するライティング処理を実行し、補正画像データを生成する。 (Processing flow of image processing unit)
FIG. 4 is a flowchart showing the procedure of processing in the image processing apparatus (image processing unit 208) according to the embodiment. In the process illustrated in the flowchart of FIG. 4, the image processing unit 208 first selects face model normal line information based on face information and distance image data acquired from color image data. Next, the image processing unit 208 generates normal image data corresponding to color image data based on the face information, the face model normal information, and the distance image data. The image processing unit 208 further executes lighting processing for adding virtual illumination to color image data based on the illumination parameters, distance image data and normal image data set by the user, and generates corrected image data.

以下、画像処理部２０８の処理手順に関して、詳述する。現像処理部３０１は、Ａ／Ｄ変換部２０７より取得したＲＡＷ画像データに基づいて、カラー画像データを生成する（Ｓ４０１）。カラー画像データは、例えば、図５（ａ）のように示され、また、カラー画像データ５０１の画素Ｉ（ｉ，ｊ）にはＲＧＢ値が画素値として格納されており、各々Ｉｒ（ｉ、ｊ）、Ｉｇ（ｉ、ｊ）、Ｉｂ（ｉ、ｊ）として示される。なお、カラー画像データの取得方法は、必ずしもこれに限られない。したがって、例えば、ＲＡＭ２０３やＰＣ／メディア２１３に記憶されているＲＡＷ画像データを取得し、現像処理部３０１により、その取得したＲＡＷ画像データからカラー画像データを生成することもできる。また、ＲＡＭ２０３やＰＣ／メディア２１３に記憶されているカラー画像データを直接、取得することもできる。 The processing procedure of the image processing unit 208 will be described in detail below. The development processing unit 301 generates color image data based on the RAW image data acquired from the A / D conversion unit 207 (S401). The color image data is shown, for example, as shown in FIG. 5A, and RGB values are stored as pixel values in the pixel I (i, j) of the color image data 501, and each of the Ir (i, j) j) shown as Ig (i, j), Ib (i, j). The method of acquiring color image data is not necessarily limited to this. Therefore, for example, RAW image data stored in the RAM 203 or PC / media 213 can be acquired, and the development processing unit 301 can generate color image data from the acquired RAW image data. Further, color image data stored in the RAM 203 or the PC / media 213 can also be obtained directly.

画像処理部２０８は、距離画像取得部１０５から距離画像データを取得する（Ｓ４０２）。距離画像データは、例えば、図５（ｂ）のように示され、また、距離画像データ５０２の画素Ｄ（ｉ，ｊ）には、撮像装置から被写体までの距離値が格納されている。なお、距離画像データの取得方法は、必ずしもこれに限られない。したがって、例えば、ＲＡＭ２０３やＰＣ／メディア２１３に記憶されている距離画像データを直接、取得することもできる。 The image processing unit 208 acquires distance image data from the distance image acquisition unit 105 (S402). The distance image data is shown, for example, as shown in FIG. 5B, and the pixel D (i, j) of the distance image data 502 stores the distance value from the imaging device to the subject. In addition, the acquisition method of distance image data is not necessarily restricted to this. Therefore, for example, the distance image data stored in the RAM 203 or the PC / media 213 can also be obtained directly.

画像処理部２０８は、次に、ＲＡＭ２０３からフラグＳＷ１の状態を取得し、フラグＳＷ１の状態に基づいて、ライティング処理を実行するか否かを判定する（Ｓ４０３）。画像処理部２０８は、フラグＳＷ１がＯＦＦの場合はライティング処理を実行しないと判定し（Ｓ４０３Ｎｏ）、ステップＳ４０４に処理を移行する。また、フラグＳＷ１がＯＮの場合はラインティング処理を実行すると判定し（Ｓ４０３Ｙｅｓ）、ステップＳ４０５に処理を移行する。 Next, the image processing unit 208 acquires the state of the flag SW1 from the RAM 203, and determines whether to execute the writing process based on the state of the flag SW1 (S403). When the flag SW1 is OFF, the image processing unit 208 determines that the writing process is not to be performed (S403 No), and the process proceeds to Step S404. Further, when the flag SW1 is ON, it is determined that the lining process is to be performed (S403: Yes), and the process proceeds to Step S405.

ステップＳ４０３において、ライティング処理を実行しないと判定されると、画像処理部２０８は、ＲＡＭ２０３からフラグＳＷ２の状態を取得する。そして、画像処理部２０８は、フラグＳＷ２がＯＦＦの場合はカラー画像データのみを、フラグＳＷ２がＯＮの場合はカラー画像データと距離画像データをＰＣ／メディア２１３に出力し、記憶する（Ｓ４０４）。或いは、画像処理部２０８は、カラー画像データを表示部１０６に出力し、表示させることで、ユーザが確認できるようにすることもできる。なお、画像処理部２０８は、ステップＳ４０４における処理を実行すると、図４に示す処理を終了する。 If it is determined in step S403 that the writing process is not to be performed, the image processing unit 208 acquires the state of the flag SW2 from the RAM 203. Then, the image processing unit 208 outputs only color image data when the flag SW2 is OFF, and outputs color image data and distance image data to the PC / media 213 when the flag SW2 is ON (S404). Alternatively, the image processing unit 208 can output color image data to the display unit 106 and can display the color image data so that the user can confirm. When the image processing unit 208 executes the processing in step S404, the processing illustrated in FIG. 4 ends.

また、ステップＳ４０３において、ライティング処理を実行すると判定されると、顔検出部３０２は、カラー画像データに基づいて、顔情報を検出（取得）する（Ｓ４０５）。ここで、顔情報に関して、図６を用いて説明する。図６に示されるように、顔情報には、顔領域６０１及び器官位置６０２が含まれる。顔領域は、画像データにおいて、顔が含まれる領域の画素の集合として示される。器官位置６０２は、顔領域内における目、鼻、口等に対応する座標として示される。 If it is determined in step S403 that the lighting process is to be performed, the face detection unit 302 detects (acquires) face information based on the color image data (S405). Here, the face information will be described with reference to FIG. As shown in FIG. 6, the face information includes a face area 601 and an organ position 602. The face area is indicated in the image data as a set of pixels of the area including the face. The organ position 602 is shown as coordinates corresponding to the eyes, nose, mouth, etc. in the face area.

顔領域、器官位置の検出方法については、既存のアルゴリズムを適用することができる。既存のアルゴリズムとしては、テンプレートマッチングを用いたアルゴリズムや、Ｈａａｒ−Ｌｉｋｅ特徴量を用いたアルゴリズム等があり、本実施形態では、テンプレートマッチングを適用することで、顔領域・器官位置を検出する。 An existing algorithm can be applied to a method of detecting a face area and an organ position. The existing algorithms include an algorithm using template matching, an algorithm using Haar-Like feature quantities, and the like. In this embodiment, the face region / organ position is detected by applying template matching.

テンプレートマッチングを適用した検出では、先ず、カラー画像データに対して、閾値処理を実行することで、肌色の領域を顔候補領域として抽出する。次に、様々な大きさの顔画像のテンプレートを用いて、顔候補領域に対してマッチング処理を実行し、尤度を算出する。そして、その算出した尤度に基づいて、顔領域か否かの判定処理を実行し、顔領域を抽出する。器官位置についても、抽出された顔領域に対して、目、鼻、口の画像のテンプレートを用いて、同様にマッチング処理を実行し、尤度を算出する。そして、この尤度の大きさに基づいて、顔領域・器官位置の信頼度を算出する。以上の処理を実行することで、顔領域６０１、器官位置６０２を取得する。 In the detection to which template matching is applied, first, a skin color area is extracted as a face candidate area by performing threshold processing on color image data. Next, matching processing is performed on the face candidate area using templates of face images of various sizes to calculate likelihood. Then, based on the calculated likelihood, a process of determining whether or not it is a face area is executed to extract a face area. Also for the organ position, the matching process is similarly performed on the extracted face area using the template of the eye, nose, and mouth image, and the likelihood is calculated. Then, based on the magnitude of the likelihood, the reliability of the face area / organ position is calculated. The face area 601 and the organ position 602 are acquired by executing the above processing.

次に、顔モデル法線情報取得部３０３は、被写体距離に基づいて、顔モデル法線情報を取得する（Ｓ４０６）。なお、顔モデル法線情報取得部３０３における顔モデル法線情報を取得する処理に関しては、後述の図７等を用いて詳述する。 Next, the face model normal information acquisition unit 303 acquires face model normal information based on the subject distance (S406). The process of acquiring face model normal information in the face model normal information acquisition unit 303 will be described in detail with reference to FIG.

法線生成部３０４は、ステップＳ４０２、Ｓ４０５、及びＳ４０６で各々、取得した、距離画像データ、顔情報、及び顔モデル法線情報に基づいて、ステップＳ４０１で生成されたカラー画像データに対応する法線画像データを生成する（Ｓ４０７）。なお、法線生成部３０４における法線画像データを生成する処理に関しては、後述の図９、図１０等を用いて詳述する。 The normal line generation unit 304 corresponds to the color image data generated in step S401 based on the distance image data, face information, and face model normal information acquired in steps S402, S405, and S406, respectively. Line image data is generated (S407). The process of generating the normal line image data in the normal line generation unit 304 will be described in detail with reference to FIGS. 9 and 10 described later.

照明パラメータ設定部３０５は、制御部２０５から受け渡されるユーザ操作に基づいて、照明パラメータを設定（決定）する（Ｓ４０８）。ライティング処理部３０６は、距離画像データ、法線画像データ、及び照明パラメータに基づいて、カラー画像データに対して、カラー画像データに仮想照明（仮想的な光源）を付加する等のライティング処理を実行し、補正画像データを生成する（Ｓ４０９）。なお、ライティング処理部３０６におけるライティング処理に関しては、後述の図１１等を用いて詳述する。 The illumination parameter setting unit 305 sets (determines) illumination parameters based on the user operation received from the control unit 205 (S408). The lighting processing unit 306 executes lighting processing such as adding virtual illumination (virtual light source) to color image data with respect to color image data based on distance image data, normal image data, and illumination parameters. The correction image data is generated (S409). The writing process in the writing process unit 306 will be described in detail with reference to FIG.

ライティング処理部３０６は、制御部２０５から受け渡されるユーザ操作（即ち、ユーザからの指示）に基づいて、ライティング処理を終了するか否かを判定する（Ｓ４１０）。画像処理部２０８は、ライティング処理部３０６によりライティング処理を終了すると判定されると処理をステップＳ４１１に移行し、また、ライティング処理を終了しないと判定されると処理をステップＳ４０８に返し、再び照明パラメータを設定する。 The lighting processing unit 306 determines whether to end the lighting processing based on the user operation (that is, an instruction from the user) received from the control unit 205 (S410). If it is determined that the lighting processing is to be ended by the lighting processing unit 306, the image processing unit 208 shifts the processing to step S411, and if it is determined that the lighting processing is not ended, the processing returns to step S408. Set

処理がステップＳ４１１に移行されると、画像処理部２０８は、ＲＡＭ２０３からフラグＳＷ２の状態を取得し、ライティング処理部３０６は、その取得したフラグＳＷ２の状態に基づいて、各種データを出力する。ライティング処理部３０６は、フラグＳＷ２がＯＦＦの場合はカラー画像データ及び補正画像データを、フラグＳＷ２がＯＮの場合はカラー画像データ、補正画像データ、及び距離画像データをＰＣ／メディア２１３に出力し、記憶する。その後、画像処理部２０８は、図４に示す処理を終了する。 When the process proceeds to step S411, the image processing unit 208 acquires the state of the flag SW2 from the RAM 203, and the lighting processing unit 306 outputs various data based on the acquired state of the flag SW2. The lighting processing unit 306 outputs the color image data and the correction image data when the flag SW2 is OFF, and outputs the color image data, the correction image data, and the distance image data to the PC / media 213 when the flag SW2 is ON. Remember. Thereafter, the image processing unit 208 ends the processing illustrated in FIG.

（顔モデル法線情報取得処理）
ここでは、ステップＳ４０６において、顔モデル法線情報取得部３０３により実行される顔モデル法線情報取得処理（即ち、距離画像データと顔情報に基づいて、顔モデル法線情報を取得する処理）について説明する。 (Face model normal information acquisition processing)
Here, in step S406, the process of acquiring face model normal information (that is, the process of acquiring face model normal information based on distance image data and face information) executed by the face model normal information acquisition unit 303. explain.

以下、顔モデル法線情報取得処理の手順に関して、図７を用いて詳述する。顔モデル法線情報取得部３０３は、先ず、顔検出部３０２より取得した顔情報と距離画像取得部１０５より取得した距離画像データに基づいて、被写体距離を算出する（Ｓ７０１）。なお、本実施形態では、顔領域（即ち、被写体の顔が占める領域）における距離値の平均を被写体距離とする。 Hereinafter, the procedure of the face model normal information acquisition process will be described in detail with reference to FIG. The face model normal information acquisition unit 303 first calculates the subject distance based on the face information acquired from the face detection unit 302 and the distance image data acquired from the distance image acquisition unit 105 (S701). In the present embodiment, the average of the distance values in the face area (that is, the area occupied by the face of the subject) is taken as the subject distance.

顔モデル法線情報取得部３０３は、ステップＳ７０１で算出した被写体距離に基づいて、顔モデル法線情報を取得する（Ｓ７０２）。本実施形態では、図８に示す被写体距離に応じた複数の顔モデル法線情報から、ステップＳ７０１で算出した被写体距離に最も近い顔モデル法線情報を選択する。 The face model normal information acquisition unit 303 acquires face model normal information based on the subject distance calculated in step S701 (S702). In the present embodiment, face model normal information closest to the subject distance calculated in step S701 is selected from a plurality of face model normal information corresponding to the subject distance shown in FIG.

図８は、被写体距離に応じた複数の顔モデル法線情報を例示したものである。図８において、顔モデル法線情報８０１、８０２、８０３は各々、０．５ｍ、１．５ｍ、５．０ｍの被写体距離に応じた顔モデル法線情報であり、これらの顔モデル法線情報はＲＯＭ２０２に格納されている。 FIG. 8 illustrates a plurality of face model normal line information according to the subject distance. In FIG. 8, face model normal information 801, 802, 803 are face model normal information according to subject distances of 0.5 m, 1.5 m, and 5.0 m, respectively, and these face model normal information is It is stored in the ROM 202.

顔モデル法線情報は、顔表面の法線ベクトル（即ち、被写体距離の位置から観察したときの顔表面の向き）を画素値に格納した顔モデル法線画像データと、顔モデル法線画像データにおける目・鼻・口等の器官位置を示す顔モデル法線器官位置とを含む。なお、顔モデル法線画像データにおける顔領域以外の領域における画素（即ち、背景画素）には、背景画素を示す画素値を格納するものとする。 The face model normal information includes face model normal image data storing face vector normal vectors (that is, the direction of the face surface when observed from the position of the object distance) in pixel values, and face model normal image data A face model normal organ position indicating an organ position such as eyes, nose and mouth in Note that pixel values indicating background pixels are stored in pixels (that is, background pixels) in areas other than the face area in the face model normal image data.

図８に示されるように、正面顔の場合、被写体距離が短いほど目・口等の器官位置が中心から離れ（即ち、顔輪郭に近付き）、被写体距離が長くなるほど目・口等の器官位置が顔の中央に寄ったような見た目となる。このように、被写体距離によって、顔輪郭と目、鼻、口等のバランスは変化し、また、器官位置の変化は、被写体距離が短いほど、大きくなる。このため、顔モデル法線情報に関して、被写体距離が短いほど、短い距離間隔で顔モデル法線情報を保持しておくことが好ましい。 As shown in FIG. 8, in the case of a frontal face, organ positions such as eyes and mouth move away from the center as the subject distance decreases (that is, approach to the face contour) and organ positions such as eyes and mouth as the subject distance increases. Looks like it was in the middle of the face. As described above, the balance between the face contour and the eyes, nose, mouth, and the like changes according to the subject distance, and the change in organ position increases as the subject distance decreases. Therefore, with regard to face model normal information, it is preferable to hold face model normal information at short distance intervals as the subject distance is shorter.

また、図８では、顔モデル法線情報が３つの場合を、その一例として説明したが、顔モデル法線情報の数は、これに限定されるものではない。したがって、例えば、ＲＯＭ２０２等の容量に応じて、顔モデル法線情報の数を２つに設定しても、また４つ以上に設定してもよい。なお、顔モデル法線情報の数を多く設定すれば、撮影時の条件により適合した顔モデル法線情報を選択することができる。 Further, in FIG. 8, the case where the face model normal line information is three has been described as an example, but the number of face model normal line information is not limited to this. Therefore, for example, depending on the capacity of the ROM 202 or the like, the number of pieces of face model normal line information may be set to two or four or more. If the number of pieces of face model normal information is set to a large number, it is possible to select face model normal information that conforms to the conditions at the time of shooting.

（法線画像データ生成処理）
ここでは、ステップＳ４０７において、法線生成部３０４により実行される法線画像データ生成処理（即ち、顔情報、顔モデル法線情報、及び距離画像データに基づいてカラー画像データに対応する法線画像データを生成する処理）について説明する。以下、法線画像データ生成処理の手順及び概要に関して、各々、図９及び図１０を用いて詳述する。 (Normal image data generation process)
Here, normal image data generation processing performed by the normal generation unit 304 in step S407 (ie, a normal image corresponding to color image data based on face information, face model normal information, and distance image data A process of generating data will be described. Hereinafter, the procedure and outline of the normal image data generation process will be described in detail with reference to FIGS. 9 and 10, respectively.

法線生成部３０４は、距離画像データに基づいて、被写体法線画像データを生成する（Ｓ９０１）。本実施形態の被写体法線画像データの生成に関して、法線生成部３０４は、先ず、距離画像データにおける画素の２次元的な座標値と、その画素の画素値から取得される距離値に基づいて、画素毎に３次元的な座標値を算出する。そして、法線生成部３０４は、次に、その算出した座標値に基づいて、各画素に対応する法線ベクトルを算出する。なお、着目画素に対応する法線ベクトルは、着目画素の近傍領域の３次元的な座標値から算出した勾配に基づいて、算出される。このようにして、法線生成部３０４は、被写体法線画像データ１００１を生成する。 The normal generation unit 304 generates subject normal image data based on the distance image data (S901). Regarding generation of subject normal line image data according to the present embodiment, the normal line generation unit 304 first generates a two-dimensional coordinate value of a pixel in the distance image data and a distance value acquired from the pixel value of the pixel. , Three-dimensional coordinate values are calculated for each pixel. Then, the normal-line generating unit 304 next calculates a normal vector corresponding to each pixel based on the calculated coordinate value. Note that the normal vector corresponding to the pixel of interest is calculated based on the gradient calculated from the three-dimensional coordinate values of the vicinity region of the pixel of interest. Thus, the normal line generation unit 304 generates the subject normal line image data 1001.

法線生成部３０４は、次に、顔情報と顔モデル法線情報に基づいて、顔法線画像データを生成する（Ｓ９０２）。本実施形態の顔法線画像データの生成に関して、法線生成部３０４は、顔情報に含まれる器官位置６０２と、顔モデル法線情報に含まれる顔モデル法線器官位置に基づいて、顔モデル法線画像データを変形することで、顔法線画像データ１００２を生成する。なお、本実施形態では、アフィン変換を用いて、カラー画像データ５０１内の被写体の顔に合うように、顔モデル法線画像データを変形する。具体的には、器官位置６０２と顔モデル法線器官位置から最小二乗法を用いてアフィンパラメータを算出し、その算出したアフィンパラメータに基づいて顔モデル法線画像パラメータを変形することで、顔法線画像データ１００２を生成する。なお、顔法線画像データにおいて顔以外の背景画素は、背景画素を示す画素値を格納するものとする。 Next, the normal generation unit 304 generates face normal image data based on the face information and the face model normal information (S902). Regarding generation of face normal image data according to the present embodiment, the normal generation unit 304 generates a face model based on the organ position 602 included in the face information and the face model normal organ position included in the face model normal information. The face normal image data 1002 is generated by deforming the normal image data. In the present embodiment, affine transformation is used to deform face model normal image data so as to fit the face of the subject in the color image data 501. Specifically, an affine parameter is calculated from the organ position 602 and the face model normal organ position using the least squares method, and the face model normal image parameter is deformed based on the calculated affine parameter. Line image data 1002 is generated. In the face normal image data, background pixels other than the face store pixel values indicating the background pixels.

法線生成部３０４は、被写体法線画像データ１００１と顔法線画像データ１００２を生成すると、それらを統合し、さらに平滑化することで、法線画像データを生成する（Ｓ９０３）。具体的には、先ず、顔法線画像データ１００２の顔領域以外の領域における画素（即ち、背景画素）は被写体法線画像データ１００１の画素値、それ以外の画素は顔法線画像データ１００２の画素値となるように、画像データを統合する。即ち、統合法線画像データ１００３を生成する。法線生成部３０４は、次に、生成した統合法線画像データ１００３に対して、平滑化処理を実行することで、法線画像データ１００４を生成する。なお、本実施形態では、カラー画像データ５０１を参照画像として、統合法線画像データ１００３に平滑化処理を適用するものとする。また、この場合、例えば、カラー画像データ５０１を参照画像とするジョイントバイラテラルフィルタを適用すればよい。 When the subject normal image data 1001 and the face normal image data 1002 are generated, the normal generation unit 304 integrates them and smoothes them to generate normal image data (S903). Specifically, first, the pixels in the area other than the face area of the face normal image data 1002 (that is, background pixels) are the pixel values of the subject normal image data 1001, and the other pixels are the face normal image data 1002. The image data is integrated so as to be a pixel value. That is, integrated normal image data 1003 is generated. Next, the normal line generation unit 304 generates normal line image data 1004 by performing smoothing processing on the generated integrated normal line image data 1003. In the present embodiment, smoothing processing is applied to the integrated normal image data 1003 using the color image data 501 as a reference image. In this case, for example, a joint bilateral filter using the color image data 501 as a reference image may be applied.

以上のように処理することで、法線画像データを生成することができる。なお、法線画像データを生成する処理は、必ずしも上述の処理に限定されることはない。したがって、例えば、被写体法線画像データ１００１の生成を行うことなく、顔法線画像データ１００２のみから法線画像データを生成することもできる。また、この場合、被写体法線画像データ１００１の代わりに、全画素に適当な法線ベクトルを格納した画像データを与えればよい。 By processing as described above, normal image data can be generated. In addition, the process which produces | generates normal line image data is not necessarily limited to the above-mentioned process. Therefore, for example, normal image data can be generated only from the face normal image data 1002 without generating the object normal image data 1001. Further, in this case, instead of the object normal image data 1001, image data in which appropriate normal vectors are stored in all pixels may be provided.

（ライティング処理）
ここでは、ステップＳ４０９において、ライティング処理部３０６により実行されるライティング処理について説明する。ステップＳ４０９のライティング処理は、上述のように、距離画像データ及び法線画像データに基づき、さらにユーザにより設定された照明パラメータに応じて、カラー画像データに対して仮想照明を付加し、補正画像を生成する処理である。本実施形態において、ライティング処理により生成される補正画像Ｉ’は、例えば、下式（１）に従って生成される。 (Lighting process)
Here, the lighting processing performed by the lighting processing unit 306 in step S409 will be described. As described above, the lighting process in step S409 adds virtual illumination to color image data based on distance image data and normal image data and further according to the illumination parameters set by the user, and corrects the corrected image. It is processing to generate. In the present embodiment, the corrected image I ′ generated by the lighting process is generated, for example, according to the following equation (1).

式（１）において、Ｉ’ｒ、Ｉ’ｇ、Ｉ’ｂは補正画像データＩ’の画素値、Ｌｒｍ、Ｌｇｍ、Ｌｂｍはｍ番目の照明の色、ｋｍはｍ番目の照明に対する補正度合いを表す。さらに、ｍ番目の照明に対する補正度合いｋｍは、仮想照明の位置Ｑ、強度（照明の明るさ）α、姿勢Ｕ及び画素（ｘ、ｙ）に対応する距離値、法線ベクトルＮに基づいて決定され、例えば、下式（２）によって算出される。なお、仮想照明の色Ｌ、位置Ｑ、強度α、及び姿勢Ｕは、ユーザにより設定される照明パラメータである。また、ｔは仮想照明による補正度合いを調整する補正係数であり、本実施形態では１に設定する。 In equation (1), I'r, I'g, I'b are the pixel values of the corrected image data I ', Lrm, Lgm, Lbm are the colors of the m-th illumination, and km is the degree of correction for the m-th illumination Represent. Furthermore, the correction degree km for the mth illumination is determined based on the position Q of the virtual illumination, the intensity (brightness of illumination) α, the posture U, the distance value corresponding to the pixel (x, y), and the normal vector N For example, it is calculated by the following equation (2). The color L of the virtual illumination, the position Q, the intensity α, and the attitude U are illumination parameters set by the user. Further, t is a correction coefficient for adjusting the degree of correction by the virtual illumination, and is set to 1 in the present embodiment.

以下、式（２）について、図１１を用いて説明を補足する。図１１において、Ｑは、上述のように、仮想照明の位置を示しており、Ｐは画素（ｉ、ｊ）の三次元的な位置を示すものである。なお、Ｐは、距離画像データ５０２の画素値に基づいて取得された、撮像装置１０１から各画素までの距離値（即ち、各画素の距離値）、撮像装置１０１の画角、カラー画像データ５０１の画像サイズ等に基づいて、算出される。また、Ｗは、画素（ｉ、ｊ）の位置Ｐから仮想照明の位置Ｑまでの距離が大きくなるに従って、大きな値を返す関数である。即ち、関数Ｗにより、画素（ｉ、ｊ）が仮想照明に近いほど、明るくなるように補正される。 Hereinafter, the description of the equation (2) will be supplemented with reference to FIG. In FIG. 11, Q indicates the position of the virtual illumination as described above, and P indicates the three-dimensional position of the pixel (i, j). Note that P is the distance value from the imaging device 101 to each pixel (that is, the distance value of each pixel) acquired based on the pixel value of the distance image data 502, the angle of view of the imaging device 101, color image data 501 It is calculated based on the image size of W is a function that returns a larger value as the distance from the position P of the pixel (i, j) to the position Q of the virtual illumination increases. That is, the function W corrects the pixel (i, j) to be brighter as it is closer to the virtual illumination.

式（２）において、Ｋは、ρが小さいほど、大きな値を返す関数であり、ρは、画素（ｉ，ｊ）の位置Ｐから仮想照明の位置Ｑに向かうベクトルＶ（ｉ，ｊ）と、照明の姿勢Ｕのなす角度で示される。また、Ｎ（ｉ，ｊ）は画素（ｉ、ｊ）に対応する法線ベクトルである。加えて、式（２）の分子においてベクトルＶ（ｉ，ｊ）と法線ベクトルＮの内積を含むことから、画素Ｐ（ｉ，ｊ）から仮想照明に向かうベクトルＶと法線ベクトルＮとのなす角が小さい画素ほど明るくなるように補正される。このように補正することで、例えば、図１２等に示されるように、カラー画像データ５０１を、仮想照明により被写体を照らしたかのような補正画像１２０１に補正することができる。即ち、上式に示すように、ライティング処理を実行することで（補正画像を生成することで）、照明の位置と被写体の形状に応じた明るさの補正が可能となる。 In equation (2), K is a function that returns a larger value as ρ is smaller, and ρ is a vector V (i, j) from the position P of the pixel (i, j) to the position Q of the virtual illumination , It is shown by the angle which the attitude | position U of illumination makes. Further, N (i, j) is a normal vector corresponding to the pixel (i, j). In addition, since the inner product of the vector V (i, j) and the normal vector N is included in the numerator of the equation (2), the vector V from the pixel P (i, j) toward the virtual illumination and the normal vector N It is corrected that the smaller the angle formed, the brighter the pixel. By performing this correction, for example, as shown in FIG. 12 and the like, the color image data 501 can be corrected to a corrected image 1201 as if the subject was illuminated by virtual illumination. That is, as shown in the above equation, by performing the lighting process (by generating a correction image), it is possible to correct the brightness according to the position of the illumination and the shape of the subject.

以上のように、本発明では、被写体距離が変化しても、撮像画像から所望の照明条件下で撮影したかのような別の画像を生成することができる。具体的には、撮影画像上で大きさが同程度の被写体であっても、焦点距離の長いレンズを利用して長い被写体距離で撮影した場合と、焦点距離の短いレンズを利用して短い被写体距離で撮影した場合とで、ライティング処理により、異なる効果をもたらす。 As described above, according to the present invention, even if the subject distance changes, it is possible to generate another image as if taken under a desired illumination condition from the captured image. Specifically, even when the subject is of the same size on the photographed image, the subject is photographed at a long subject distance using a lens having a long focal length, and the subject is short using a lens having a short focal length The lighting process produces different effects depending on the distance taken.

なお、本実施形態では、顔法線情報に関して、顔検出部３０２から取得した顔情報及び距離画像取得部１０５から取得した距離画像データに基づいて、ＲＯＭ２０２から取得する仕様として説明したが、距離画像データの取得方法は必ずしもこれに限られない。したがって、例えば、ステップＳ７０１において、光学系制御部２０４が被写体にフォーカスを合わせたときに取得した被写体距離に基づいて、ステップＳ７０２において顔法線情報を選択することもできる。また、ＲＡＭ２０３やＰＣ／メディア２１３に記憶されているＲＡＷ画像データにタグ情報として被写体距離が含まれている場合は、ステップＳ７０１において、その被写体距離を取得して、ステップＳ７０２において顔法線情報を選択することもできる。 In the present embodiment, the face normal information is described as a specification to be acquired from the ROM 202 based on the face information acquired from the face detection unit 302 and the distance image data acquired from the distance image acquisition unit 105. The data acquisition method is not necessarily limited to this. Therefore, for example, face normal information can be selected in step S702 based on the subject distance acquired when the optical system control unit 204 brings the subject into focus in step S701. If the RAW image data stored in the RAM 203 or the PC / media 213 includes the subject distance as tag information, the subject distance is acquired in step S701, and the face normal information is input in step S702. It can also be selected.

（第２実施形態）
第１実施形態では、被写体距離に応じて、複数の顔モデル法線情報から被写体に適した顔モデル法線情報を選択したが、本実施形態（第２実施形態）では、被写体距離に応じて領域別顔モデル法線情報から顔モデル法線情報を生成する。これにより、様々な被写体距離に対応する顔モデル法線情報を取得することができる。 Second Embodiment
In the first embodiment, the face model normal information suitable for the subject is selected from the plurality of face model normal information in accordance with the subject distance, but in the present embodiment (the second embodiment), in accordance with the subject distance. Face model normal information is generated from region-specific face model normal information. Thereby, face model normal line information corresponding to various subject distances can be acquired.

図１４は、領域別顔モデル法線情報をその一例として示すものである。領域別顔モデル法線情報は、図１４（ａ）に示す領域別顔モデル法線画像データ１４０１〜１４０５と図１４（ｂ）に示す領域別顔モデル法線器官位置を含む。 FIG. 14 shows region-specific face model normal line information as an example. The area-specific face model normal line information includes the area-specific face model normal image data 1401 to 1405 shown in FIG. 14A and the area-specific face model normal organ position shown in FIG.

図１４（ａ）に示す領域別顔モデル法線画像データ１４０１〜１４０５は、各々、顔概略形状、右目周辺領域、左目周辺領域、鼻周辺領域、口周辺領域における顔表面の法線ベクトルを、画素値に格納した画像データである。 Region-specific face model normal image data 1401 to 1405 shown in FIG. 14A respectively represent normal vectors of the face surface in the face outline shape, the right eye peripheral region, the left eye peripheral region, the nose peripheral region, and the mouth peripheral region. Image data stored in pixel values.

また、図１４（ｂ）に示されるように、右目周辺領域、左目周辺領域、鼻周辺領域、口周辺領域に対応する領域別顔モデル法線画像データ１４０２〜１４０５には、各々、器官の基準とする位置を示す器官基準位置が設定されている。即ち、領域別顔モデル法線器官位置は、顔概略形状に対応する領域別顔モデル法線画像データ１４０１における被写体距離に応じた器官の位置を示している。なお、図１４に示す例では、０．５ｍ、１．５ｍ、５．０ｍの被写体距離に対応した領域別顔モデル法線器官位置を保持しているものとする（即ち、被写体距離毎に領域別顔モデル法線器官位置が格納される）。 In addition, as shown in FIG. 14B, in the region-specific face model normal image data 1402-1405 corresponding to the right eye peripheral region, left eye peripheral region, nose peripheral region, and mouth peripheral region, organ reference An organ reference position indicating the position to be set is set. That is, the region-specific face model normal organ position indicates the position of the organ corresponding to the subject distance in the region-specific face model normal image data 1401 corresponding to the face outline shape. In the example shown in FIG. 14, it is assumed that the region-specific face model normal organ position corresponding to the subject distance of 0.5 m, 1.5 m, and 5.0 m is held (ie, the area for each subject distance). Another face model normal organ position is stored).

第２実施形態では、ステップＳ４０６の顔モデル法線情報の取得処理が第１実施形態と異なる。図１３は、本発明の第２実施形態に係る画像処理装置（画像処理部２０８）における顔モデル法線情報の取得処理の手順を示すフローチャートである。 In the second embodiment, the process of acquiring face model normal information in step S406 is different from that of the first embodiment. FIG. 13 is a flowchart showing a procedure of acquisition processing of face model normal information in the image processing apparatus (image processing unit 208) according to the second embodiment of the present invention.

以下、顔モデル法線情報取得処理の手順に関して、図１３を用いて詳述する。顔モデル法線情報取得部３０３は、第１実施形態のステップＳ７０１と同様に、顔検出部３０２より取得した顔情報と距離画像取得部１０５より取得した距離画像データに基づいて、被写体距離を算出する（Ｓ１３０１）。 Hereinafter, the procedure of face model normal information acquisition processing will be described in detail with reference to FIG. The face model normal information acquisition unit 303 calculates the subject distance based on the face information acquired from the face detection unit 302 and the distance image data acquired from the distance image acquisition unit 105, as in step S701 of the first embodiment. (S1301).

顔モデル法線情報取得部３０３は、被写体距離を算出すると、図１４に示す領域別顔モデル法線器官位置に基づいて、ステップＳ１３０１で取得した被写体距離に対応する顔モデル器官位置を、所定の補間処理により算出する（Ｓ１３０２）。 When the face model normal information acquisition unit 303 calculates the subject distance, the face model organ position corresponding to the subject distance acquired in step S1301 is determined based on the region model face model normal organ position shown in FIG. It is calculated by interpolation processing (S1302).

ここで、顔モデル器官位置の算出に関して、説明を補足する。取得した被写体距離をｌとした場合、領域別顔モデル法線器官位置が設定された被写体距離のうち、取得した被写体距離であるｌよりも小さく、またｌに最も近い被写体距離ｌＡに対応した領域別顔モデル法線器官位置の座標（ｘＡ、ｙＡ）を選択する。同様に、領域別顔モデル法線器官位置が設定された被写体距離のうち、取得した被写体距離であるｌよりも大きく、またｌに最も近い被写体距離ｌＢに対応した領域別顔モデル法線器官位置の座標（ｘＢ、ｙＢ）を選択する。そして、この場合、顔モデル器官位置（ｘ’、ｙ’）は、下式（３）のように求めることができる。 Here, the description will be supplemented regarding the calculation of the face model organ position. Assuming that the acquired subject distance is l, an area corresponding to the subject distance lA which is smaller than l which is the acquired subject distance and is closest to l among the subject distances for which the region-specific face model normal organ position is set. The coordinates (xA, yA) of another face model normal organ position are selected. Similarly, an area-specific face model normal organ position corresponding to an object distance lB that is larger than the acquired object distance l and is closest to l among the object distances for which the area-specific face model normal organ position is set. Select the coordinates (xB, yB) of. And in this case, face model organ position (x ', y') can be calculated | required like the following Formula (3).

なお、式（３）において、ｗはｌ、ｌＡ、ｌＢから算出される重みであり、例えば、下式（４）のように求めることができる。 In Equation (3), w is a weight calculated from l, 1A, and 1B, and can be obtained, for example, as the following Equation (4).

式（４）において、γは、重みｗを調整するためのパラメータであり、領域別顔モデル法線器官位置が設定された被写体距離の間隔（例えば、０．１ｍ、０．５ｍ、１．０ｍ等）に応じて調整することができる。γに関して、領域別顔モデル法線器官位置が設定された被写体距離の間隔が、所定の値よりも短い間隔である場合には１に設定され、また、所定の値よりも長い間隔である場合には１より大きい値に設定される。このように設定することで、被写体距離が短いほど器官の移動量を大きく、被写体距離が長いほど器官の移動量を小さくすることができる。 In Equation (4), γ is a parameter for adjusting the weight w, and is an interval (for example, 0.1 m, 0.5 m, 1.0 m) of the subject distance in which the position of the region-specific face model normal organ is set. Etc.) can be adjusted. Regarding γ, the subject distance interval for which the region-specific face model normal organ position is set is set to 1 if the interval is shorter than a predetermined value, and if the interval is longer than the predetermined value Is set to a value greater than one. By setting in this way, it is possible to increase the movement amount of the organ as the subject distance is shorter and to decrease the movement amount of the organ as the subject distance is longer.

顔モデル法線情報取得部３０３は、領域別顔モデル法線画像データと算出した顔モデル器官位置に基づいて、顔モデル法線画像データを生成する（Ｓ１３０３）。具体的には、顔概略形状に対応する領域別顔モデル法線画像データ１４０１に対して、顔モデル器官位置と器官基準位置が一致するように、その他の領域別顔モデル法線画像データ１４０２〜１４０５を位置合わせする。そして、領域別顔モデル法線画像データ１４０１に対して、領域別顔モデル法線画像データ１４０２〜１４０５の法線情報のある画素の画素値を上書きすることで、顔モデル法線画像データを生成する。図１５に、図１３に示す処理によって生成された顔モデル法線画像データ１５０１を、その一例として示す。 The face model normal information acquisition unit 303 generates face model normal image data based on the region-specific face model normal image data and the calculated face model organ position (S1303). Specifically, with respect to region-specific face model normal image data 1401 corresponding to the face outline shape, other region-specific face model normal image data 1402-1402 are matched so that the face model organ position and the organ reference position coincide. Align 1405. Then, the face model normal image data is generated by overwriting the pixel value of the pixel having normal information of the region-specific face model normal image data 1402 to 1405 with respect to the region-specific face model normal image data 1401. Do. FIG. 15 shows face model normal image data 1501 generated by the processing shown in FIG. 13 as an example thereof.

以上のように、本発明では、被写体距離が変化しても、撮像画像から所望の照明条件下で撮影したかのような別の画像を生成することができる。なお、本実施形態では、領域別顔モデル法線情報として、被写体距離に関係なく、領域毎に１つの領域別顔モデル法線画像データを保持するものとして説明したが、異なる被写体距離に対応する領域別顔モデル法線画像データを各々、複数保持させてもよい。したがって、例えば、０．５ｍ、１．５ｍ、５．０ｍの被写体距離に対応する領域別顔モデル法線画像データを各々、保持し、ステップＳ１３０１で算出した被写体距離と最も近い被写体距離に対応する領域別顔モデル法線画像データを取得することもできる。また、領域（器官）毎に保持する、領域別顔モデル法線画像データの枚数を変更することもできる。例えば、顔概略形状や鼻周辺領域に対応する領域別顔モデル法線画像データは複数枚保持させ、右目周辺領域、左目周辺領域、口周辺領域に対応する領域別顔モデル法線画像データは一枚のみ保持させてもよい。 As described above, according to the present invention, even if the subject distance changes, it is possible to generate another image as if taken under a desired illumination condition from the captured image. In the present embodiment, although it has been described that one region-by-region face model normal image data is held for each region as region-by-region face model normal information, it corresponds to different object distances. A plurality of region-specific face model normal image data may be held. Therefore, for example, region-specific face model normal image data corresponding to object distances of 0.5 m, 1.5 m, and 5.0 m are held, and the object distance closest to the object distance calculated in step S1301 is stored. Area-specific face model normal image data can also be acquired. In addition, it is possible to change the number of face model normal image data classified by area, which is held for each area (organ). For example, a plurality of face model normal image data corresponding to the face outline shape and the nose peripheral area are held, and the face model normal image data corresponding to the right eye peripheral area, the left eye peripheral area and the mouth peripheral area is one. Only the sheet may be held.

（第３実施形態）
第２実施形態では、領域別顔モデル法線情報に基づいて、顔モデル法線情報を生成したが、本実施形態（第３実施形態）では、被写体距離に応じて、領域別顔モデル法線情報を選択し、顔法線画像データを生成する。 Third Embodiment
In the second embodiment, the face model normal information is generated based on the region-specific face model normal information. However, in the present embodiment (third embodiment), the region-specific face model normal is generated according to the subject distance. Information is selected to generate face normal image data.

第３実施形態では、第１実施形態との関係（比較）でステップＳ４０６が異なり、ステップＳ４０６の顔モデル法線情報の取得処理の代わりに、領域別顔モデル法線情報の取得処理を実行する。また、ステップＳ９０２の顔法線画像データ生成処理の処理内容が第１実施形態と異なる。以下、これらの処理（即ち、領域別顔モデル法線情報の取得処理及び顔法線画像データ生成処理）に関して、詳述する。 In the third embodiment, step S406 is different in the relationship (comparison) with the first embodiment, and instead of the processing for acquiring face model normal information in step S406, processing for acquiring area model face normal information is executed. . Further, the processing content of the face normal image data generation processing of step S902 is different from that of the first embodiment. Hereinafter, these processes (that is, the process of acquiring face model normal information by region and the process of generating face normal image data) will be described in detail.

（領域別顔モデル法線情報取得処理）
ここでは、顔モデル法線情報取得部３０３において実行される領域別顔モデル法線情報取得処理について説明する。本実施形態では、領域別顔モデル法線情報の取得処理に関して、距離画像データと顔情報に基づいて、領域別顔モデル法線情報を取得する。図１６に、領域別顔モデル法線情報取得処理の手順を示す。 (Area-specific face model normal information acquisition processing)
Here, an area-specific face model normal information acquisition process executed by the face model normal information acquisition unit 303 will be described. In this embodiment, area-specific face model normal line information is acquired based on distance image data and face information in regard to the process of acquiring area-specific face model normal line information. FIG. 16 shows the procedure of region type face model normal information acquisition processing.

顔モデル法線情報取得部３０３は、第１実施形態のステップＳ７０１と同様に、顔検出部３０２より取得した顔情報と距離画像取得部１０５より取得した距離画像データに基づいて、被写体距離を算出する（Ｓ１６０１）。 The face model normal information acquisition unit 303 calculates the subject distance based on the face information acquired from the face detection unit 302 and the distance image data acquired from the distance image acquisition unit 105, as in step S701 of the first embodiment. (S1601).

顔モデル法線情報取得部３０３は、取得した被写体距離に対応する領域別顔モデル法線情報を選択する（Ｓ１６０２）。なお、ステップＳ１６０２の処理では、取得した被写体距離に最も近い領域別顔モデル法線情報を選択する。例えば、図１４に示す領域別顔モデル法線情報の場合には、領域別顔モデル法線画像データ１４０１〜１４０５と、領域別顔モデル法線器官位置のうち、取得した被写体距離に最も近いものを取得する。 The face model normal information acquisition unit 303 selects region model face normal information corresponding to the acquired subject distance (S1602). Note that, in the processing of step S1602, region-specific face model normal line information closest to the acquired subject distance is selected. For example, in the case of region-by-region face model normal information shown in FIG. 14, the region-by-region face model normal image data 1401 to 1405 and the region-by-region face model normal image position are the closest to the acquired subject distance To get

（顔法線画像データ生成処理）
ここでは、法線生成部３０４において実行される顔法線画像データ生成処理について説明する。本実施形態では、ステップＳ１６０２で取得した領域別顔モデル法線情報（即ち、領域別顔モデル法線画像データ１４０１〜１４０５及び領域別顔モデル法線器官位置）に基づいて、顔法線画像データを生成する。図１７に、顔法線画像データ生成処理の概要を示す。 (Face normal image data generation process)
Here, the face normal image data generation process executed by the normal line generation unit 304 will be described. In the present embodiment, face normal image data is obtained based on the region-specific face model normal information (that is, the region-specific face model normal image data 1401 to 1405 and the region-specific face model normal organ position) acquired in step S1602. Generate FIG. 17 shows an outline of face normal image data generation processing.

先ず、法線生成部３０４は、アフィン変換を用いて、カラー画像データ５０１における被写体の顔に合うように領域別顔モデル法線画像データを変形する。顔概略形状に対応する領域別顔モデル法線画像データ１４０１のアフィンパラメータは、器官位置６０２とステップＳ１６０２で選択された領域別顔モデル法線器官位置に基づいて、最小二乗法を用いて算出される。 First, using the affine transformation, the normal line generation unit 304 deforms the region-specific face model normal image data so as to fit the face of the subject in the color image data 501. The affine parameters of the region-specific face model normal image data 1401 corresponding to the face outline shape are calculated using the least squares method based on the organ position 602 and the region face model normal organ position selected in step S1602. Ru.

次に、その他の領域別顔モデル法線画像データ１４０２〜１４０５のアフィンパラメータを、器官位置６０２と領域別顔モデル法線画像データ１４０２〜１４０５に対応する器官基準位置が一致するように、平行移動成分を補正したものとして算出する。これにより、領域別顔モデル法線画像データ１４０２〜１４０５を、カラー画像データ５０１の被写体の器官位置６０２に合わせて、変形することができる。 Next, the affine parameters of other regional face model normal image data 1402-1405 are translated so that the organ reference position corresponding to the organ position 602 and the regional face model normal image data 1402-1405 match. Calculated as corrected components. Thereby, the region-specific face model normal image data 1402 to 1405 can be deformed in accordance with the organ position 602 of the object of the color image data 501.

そして、法線生成部３０４は、以上のように算出した変換パラメータを用いて領域別顔モデル法線画像データ１４０１〜１４０５を変形し、領域別顔法線画像データ１７０１〜１７０５を生成する。さらに、法線生成部３０４は、領域別顔法線画像データ１７０１〜１７０５を統合することにより、顔法線画像データ１７０６を生成する。 Then, the normal-line generating unit 304 deforms the region-by-region face model normal image data 1401 to 1405 using the conversion parameter calculated as described above, and generates region-by-region face normal image data 1701 to 1705. Furthermore, the normal line generation unit 304 generates face normal image data 1706 by integrating the region-by-area face normal image data 1701 to 1705.

即ち、本実施形態では、顔概略形状以外の領域別顔法線画像データ１７０２〜１７０５において法線情報を有する画素を、顔概略形状に対応する領域別顔法線画像データ１７０１に上書きすることで、顔法線画像データ１７０６を生成している。なお、領域別顔法線画像データ１７０２〜１７０５の全てにおいて背景となる画素（即ち、その領域に関連する法線情報を有さない画素）は、顔概略形状に対応する領域別顔法線画像データ１７０１の画素値で顔法線画像データ１７０６が生成される。 That is, in the present embodiment, the pixels having normal line information in the region-by-region face normal image data 1702 to 1705 other than the face general shape are overwritten on the region-by-region face normal image data 1701 corresponding to the face general shape. , Face normal image data 1706 is generated. Note that the pixels serving as the background in all of the region-by-region face normal image data 1702 to 1705 (that is, the pixels having no normal line information related to the region) are region-by-region face normal images corresponding to the face outline shape. Face normal image data 1706 is generated using the pixel values of the data 1701.

以上のように、本発明では、被写体距離が変化しても、撮像画像から所望の照明条件下で撮影したかのような別の画像を生成することができる。なお、本実施形態では、ステップＳ１６０２において、取得した被写体距離に最も近い領域別顔モデル法線器官位置を選択していたが、第２実施形態のステップＳ１３０２で上述したように、補間処理により領域別顔モデル法線器官位置を算出してもよい。 As described above, according to the present invention, even if the subject distance changes, it is possible to generate another image as if taken under a desired illumination condition from the captured image. In the present embodiment, the region model normal model position closest to the acquired subject distance is selected in step S1602, but as described above in step S1302 in the second embodiment, the region is obtained by interpolation processing. Another face model normal organ position may be calculated.

また、本実施形態では、領域別顔モデル法線情報として、領域毎に１つの領域別顔モデル法線画像データを保持するものとして説明したが、異なる被写体距離に対応する領域別顔モデル法線画像データを各々、複数保持させてもよい。したがって、例えば、０．５ｍ、１．５ｍ、５．０ｍの被写体距離に対応する領域別顔モデル法線画像データを各々、保持し、ステップＳ１６０１で算出した被写体距離と最も近い被写体距離に対応する領域別顔モデル法線画像データを取得することもできる。 Further, in the present embodiment, it has been described that one region-by-region face model normal image data is held for each region as region-by-region face model normal information, but region-by-region face model normals corresponding to different subject distances A plurality of image data may be held. Therefore, for example, region-specific face model normal image data corresponding to subject distances of 0.5 m, 1.5 m, and 5.0 m are held, and the subject distance closest to the subject distance calculated in step S 1601 is corresponded. Area-specific face model normal image data can also be acquired.

加えて、領域（器官）毎に保持する、領域別顔モデル法線画像データの枚数を変更することもできる。例えば、顔概略形状や鼻周辺領域に対応する領域別顔モデル法線画像データは複数枚保持させ、右目周辺領域、左目周辺領域、口周辺領域に対応する領域別顔モデル法線画像データは一枚のみ保持させてもよい。 In addition, it is also possible to change the number of region-specific face model normal image data held for each region (organ). For example, a plurality of face model normal image data corresponding to the face outline shape and the nose peripheral area are held, and the face model normal image data corresponding to the right eye peripheral area, the left eye peripheral area and the mouth peripheral area is one. Only the sheet may be held.

（第４実施形態）
第１実施形態から第３実施形態までは、主に処理対象を人物として説明したが、本実施形態では、人物（より詳細には、人物の顔）に加えて他の被写体も処理対象とする。これにより、様々な被写体に対して被写体距離に応じた法線情報を取得することができる。 Fourth Embodiment
In the first to third embodiments, the processing target is mainly described as a person, but in the present embodiment, in addition to the person (more specifically, the face of the person), other subjects are also processed. . This makes it possible to acquire normal line information according to the subject distance for various subjects.

（画像処理部の内部構成）
図１８は、本発明の実施形態（第４実施形態）に係る画像処理装置（画像処理部２０８）の機能構成を示すブロック図である。図１８に示す画像処理部２０８の機能構成では、第１実施形態に係る画像処理装置（画像処理部２０８）との関係で、顔モデル法線情報取得部３０３の代わりに、被写体モデル法線情報取得部１８０１が実装される。加えて、法線生成部３０４の処理内容も異なる。 (Internal configuration of image processing unit)
FIG. 18 is a block diagram showing a functional configuration of the image processing apparatus (image processing unit 208) according to the embodiment (the fourth embodiment) of the present invention. In the functional configuration of the image processing unit 208 shown in FIG. 18, subject model normal information instead of the face model normal information acquisition unit 303 in relation to the image processing apparatus (image processing unit 208) according to the first embodiment. An acquisition unit 1801 is implemented. In addition, the processing content of the normal generation unit 304 is also different.

被写体モデル法線情報取得部１８０１は、被写体までの距離情報と被写体情報に基づいて、複数の被写体モデル法線情報から被写体に合う法線情報を選択する。本実施形態では、ユーザ操作により選択された被写体情報を制御部２０５から取得する。被写体情報は、カラー画像データに含まれる被写体の種類を表し、例えば、人物の他にも、犬、猫等から選択することができるものとする。また、この場合、人物以外の被写体である、犬、猫等のみを被写体の対象とすることもできる。 The subject model normal line information acquisition unit 1801 selects normal line information matching the subject from among a plurality of subject model normal line information, based on the distance information to the subject and the subject information. In the present embodiment, subject information selected by a user operation is acquired from the control unit 205. The subject information indicates the type of the subject included in the color image data, and can be selected from, for example, dogs, cats and the like in addition to the person. Further, in this case, it is possible to set only a dog, a cat, etc., which is a subject other than a person, as the subject of the subject.

（画像処理部の処理フロー）
図１９は、本発明の実施形態に係る画像処理装置（画像処理部２０８）における処理の手順を示すフローチャートである。図１９に示すフローチャートは、図４に示すフローチャートとの関係（比較）で、ステップＳ１９０５〜ステップＳ１９０７の処理が異なる。 (Processing flow of image processing unit)
FIG. 19 is a flowchart showing the procedure of processing in the image processing apparatus (image processing unit 208) according to the embodiment of the present invention. The flowchart shown in FIG. 19 differs from the flowchart shown in FIG. 4 in the processes (steps S1905 to S1907).

以下、ステップＳ１９０５〜ステップＳ１９０７の各処理について、説明する。図１９のステップＳ１９０３において、ライティング処理を実行すると判定されると、被写体モデル法線情報取得部１８０１は、制御部２０５からユーザ操作に基づいた被写体情報を取得する（Ｓ１９０５）。被写体モデル法線情報取得部１８０１は、被写体距離とステップＳ１９０５で取得した被写体情報に基づいて、複数の被写体モデル法線情報の中からカラー画像データ５０１の被写体に対応する被写体モデル法線情報を取得する（Ｓ１９０６）。なお、本実施形態では、被写体情報の種類と被写体距離に応じた法線画像データとテンプレート画像データを保持しているものとする。 Hereinafter, each process of step S1905 to step S1907 will be described. If it is determined in step S1903 in FIG. 19 that the lighting process is to be performed, the subject model normal information acquisition unit 1801 acquires subject information based on a user operation from the control unit 205 (S1905). The subject model normal information acquisition unit 1801 acquires subject model normal line information corresponding to the subject in the color image data 501 from among a plurality of subject model normal line information based on the subject distance and the subject information acquired in step S1905. (S1906). In the present embodiment, it is assumed that normal image data and template image data are held according to the type of subject information and subject distance.

ここで、図２０に、本実施形態における被写体モデル法線情報をその一例として示す。被写体モデル法線情報は、図２０に示されるように、被写体モデル法線画像データと被写体モデルテンプレート画像データを含む。図２０（ａ）に示す被写体モデル法線画像データ２００１、２００２、２００３は各々、０．５ｍ、１．５ｍ、５．０ｍの被写体距離に応じた被写体モデル法線画像データである。 Here, FIG. 20 shows subject model normal line information in this embodiment as an example. The subject model normal information includes subject model normal image data and subject model template image data, as shown in FIG. Subject model normal image data 2001, 2002, and 2003 shown in FIG. 20A are subject model normal image data according to subject distances of 0.5 m, 1.5 m, and 5.0 m, respectively.

また、図２０（ｂ）に示す被写体モデルテンプレート画像データ２００４、２００５、２００６は、被写体モデル法線画像データ２００１、２００２、２００３をカラー画像データ５０１に位置合わせするために利用される。本実施形態では、被写体モデルテンプレート画像データ２００４、２００５、２００６として、被写体モデル法線画像データ２００１、２００２、２００３に対応する輪郭情報を、画像データとして保持しているものとする。 Further, subject model template image data 2004, 2005, 2006 shown in FIG. 20B is used to align subject model normal image data 2001, 2002, 2003 with color image data 501. In the present embodiment, it is assumed that contour information corresponding to subject model normal image data 2001, 2002, 2003 is held as image data as subject model template image data 2004, 2005, 2006.

なお、図２０では、被写体モデル法線情報として、人物の顔（特徴パターン）に対応する被写体モデル法線情報をその一例として説明したが、画像処理部２０８は、同様に、犬、猫等の様々な被写体モデル法線情報を被写体距離に応じて保持しているものとする。 In FIG. 20, subject model normal line information corresponding to a person's face (feature pattern) has been described as an example of the subject model normal line information. It is assumed that various subject model normal line information is held according to the subject distance.

法線生成部３０４は、被写体モデル法線情報に基づいて、法線画像データを生成する（Ｓ１９０７）。法線画像データの生成において、法線生成部３０４は、先ず、被写体モデルテンプレート画像データを用いて、カラー画像データ５０１の被写体に対して、被写体モデル法線画像データを位置合わせするためのパラメータを算出する。本実施形態では、カラー画像データ５０１における被写体の輪郭情報を抽出し、被写体モデルテンプレート画像データとマッチングすることで、位置合わせパラメータを算出する。 The normal line generation unit 304 generates normal line image data based on the subject model normal line information (S1907). In generating normal image data, the normal-line generating unit 304 first uses the subject model template image data to set parameters for aligning subject model normal image data with respect to the subject in the color image data 501. calculate. In the present embodiment, outline information of a subject in color image data 501 is extracted, and matching with subject model template image data is performed to calculate alignment parameters.

法線生成部３０４は、次に、算出した位置合わせパラメータに基づいて、被写体モデル法線画像データをカラー画像データ５０１に合わせて変換する。そして、法線生成部３０４は、第一実施形態と同様にカラー画像データを参照画像とする平滑化処理を施すことで、法線画像データを生成する。 Next, the normal-line generation unit 304 converts the subject model normal image data into the color image data 501 based on the calculated alignment parameter. Then, the normal line generation unit 304 generates normal line image data by performing smoothing processing using color image data as a reference image as in the first embodiment.

以上のように、本発明では、被写体距離が変化しても、撮像画像から所望の照明条件下で撮影したかのような別の画像を生成することができる。なお、本実施形態では、制御部２０５等から取得した撮影時のユーザ操作に基づいて被写体情報を取得したが（即ち、制御部２０５を被写体情報取得手段として機能させたが）、被写体情報の取得方法は、必ずしもこれに限定されない。したがって、例えば、カラー画像データ５０１のタグ情報として被写体情報をＲＡＭ２０３等に記憶しておき、その被写体情報をＲＡＭ２０３から取得することもできる。また、別途、被写体検出処理部を具備させ、顔、犬、猫等、複数種類の被写体を検出し、これに基づいて、被写体情報を取得することもできる。 As described above, according to the present invention, even if the subject distance changes, it is possible to generate another image as if taken under a desired illumination condition from the captured image. In the present embodiment, subject information is acquired based on the user operation at the time of shooting acquired from the control unit 205 or the like (that is, although the control unit 205 functions as a subject information acquisition unit), acquisition of subject information The method is not necessarily limited to this. Therefore, for example, subject information may be stored in the RAM 203 or the like as tag information of the color image data 501, and the subject information may be acquired from the RAM 203. Alternatively, a subject detection processing unit may be separately provided to detect a plurality of types of subjects such as a face, a dog, and a cat, and subject information may be acquired based on the detected subjects.

加えて、本実施形態では、被写体モデルテンプレート画像データとして輪郭情報を画像データとして保持させたが、被写体モデルのテクスチャや特徴点情報を保持させてもよい。なお、この場合、ステップＳ１９０７において、テクスチャや特徴点情報に基づいて、位置合わせパラメータを算出すればよい。 In addition, in the present embodiment, contour information is held as image data as subject model template image data, but textures and feature point information of the subject model may be held. In this case, in step S1907, alignment parameters may be calculated based on the texture and feature point information.

（その他の実施例）
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 (Other embodiments)
The present invention supplies a program that implements one or more functions of the above-described embodiments to a system or apparatus via a network or storage medium, and one or more processors in a computer of the system or apparatus read and execute the program. Can also be realized. It can also be implemented by a circuit (eg, an ASIC) that implements one or more functions.

その他、以上において、好ましい実施形態として、主に人物の顔等を例示し説明したが、被写体は、必ずしも人物の顔等に限定されない。したがって、例えば、所定の部分において法線にばらつきを有する物体であって、また、法線情報を格納する上でモデル化することが可能な物体であれば、その物体を本発明の被写体として適用することができる。 Besides, in the above, as a preferred embodiment, mainly the face and the like of a person are illustrated and described, but the subject is not necessarily limited to the face and the like of a person. Therefore, for example, if it is an object having a variation in normal in a predetermined part, and the object can be modeled in storing normal information, the object is applied as the subject of the present invention can do.

１０５距離画像取得部
２０８画像処理部
３０１現像処理部
３０３顔モデル法線情報取得部
３０４法線生成部 105 distance image acquisition unit 208 image processing unit 301 development processing unit 303 face model normal information acquisition unit 304 normal generation unit

Claims

An image acquisition unit that acquires image data obtained by imaging a scene including a subject;
A first acquisition unit that acquires distance information from the imaging device that has captured the scene to the subject;
Storage means for storing normal line information corresponding to a predetermined feature pattern associated with the subject;
Generation means for generating normal line information to be applied to lighting processing on the image data based on the distance information and the normal line information stored in the storage means;
An image processing apparatus comprising: lighting processing means for performing a lighting process on the image data using the normal line information generated by the generation means.

The storage means stores a plurality of normal line information corresponding to a predetermined feature pattern in accordance with the distance from the imaging device to the subject,
The image processing apparatus according to claim 1, wherein the generation unit selects normal line information from a plurality of normal line information stored in the storage unit based on the distance information.

3. The apparatus according to claim 2, wherein the generation means generates normal line information to be applied to the lighting process by deforming normal line information selected based on the distance information according to the distance information. Image processing apparatus as described.

4. The image processing apparatus according to claim 2, wherein the storage unit stores the plurality of pieces of normal line information at shorter distance intervals as the distance from the imaging device to the subject is shorter.

The normal line information stored in the storage means includes normal line information for each area set for each predetermined area in a predetermined feature pattern of the subject. An image processing apparatus according to claim 1.

6. The image according to claim 5, wherein the generation means generates normal line information to be applied to the lighting process by aligning the normal line information for each area based on the distance information. Processing unit.

The generation means generates normal information to be applied to the lighting process by transforming the normal information by region selected based on the distance information according to the distance information and further integrating the information. The image processing apparatus according to claim 5, wherein

It further comprises a second acquisition unit for acquiring subject information indicating the type of the subject,
The image processing apparatus according to any one of claims 1 to 7, wherein the generation unit generates normal line information corresponding to a predetermined feature pattern of the subject based on the subject information.

9. The image processing apparatus according to claim 8, further comprising detection means for detecting the subject from the image data.

The normal means applied to the lighting process by aligning the detected subject with normal information corresponding to a predetermined feature pattern of the subject selected based on the distance information. The image processing apparatus according to claim 9, generating information.

The image processing apparatus according to any one of claims 1 to 10, wherein the predetermined feature pattern of the subject is a face.

An image processing method in an image processing apparatus, comprising storage means for storing normal line information corresponding to a predetermined feature pattern related to a subject,
An image acquisition step of acquiring image data obtained by imaging a scene including the subject by the image acquisition means;
A first acquisition step of acquiring distance information from the imaging device that has captured the scene to the subject by the first acquisition unit;
A generation step of generating normal information to be applied to lighting processing on the image data based on the distance information and the normal information stored by the storage unit;
And a lighting processing step of performing lighting processing on the image data using the normal line information generated in the generation step by a lighting processing means.

A program for causing a computer to function as each means of the image processing apparatus according to any one of claims 1 to 11.