JP2021145385A

JP2021145385A - Image processing apparatus, image processing method, and program

Info

Publication number: JP2021145385A
Application number: JP2021097301A
Authority: JP
Inventors: 有一中田; Yuichi Nakada
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2019-08-21
Filing date: 2021-06-10
Publication date: 2021-09-24

Abstract

To obtain an image provided with a shadow according to a posture and facial expression of a subject.SOLUTION: The image processing apparatus includes: acquisition means for acquiring a color image obtained by imaging a subject; specifying means for specifying a face area of the subject as a face area in the color image; and generation means for generating a corrected image formed by modifying brightness of at least a part of the face area in the color image.SELECTED DRAWING: Figure 4

Description

本発明は、画像データに陰影を付与する技術に関する。 The present invention relates to a technique for imparting shading to image data.

撮像装置を用いて被写体を撮影する場合、被写体に対する光の当たり方によって得られる画像は大きく変わる。例えば逆光の状態で撮影した場合、被写体の全体あるいは一部が影になり暗く写ってしまう。また、ストロボを用いて被写体に光を照射して撮影を行う場合には、光の影響により被写体の陰影が飛ばされ、被写体が平坦に見えるような場合もある。これらの画像を補正する方法として、特定の方向から疑似的に光を当てたように被写体の暗部を補正する方法が知られている（特許文献１）。特許文献１に記載の技術では、仮想光源の影響をガウス分布で表現する。この際、別途設定したライティング方向に従って輝度補正のガウス分布を偏らせることにより、仮想的に所望の方向から照明を与えたような画像を得ることができる。 When a subject is photographed using an imaging device, the image obtained varies greatly depending on how the light hits the subject. For example, when shooting against the sun, all or part of the subject becomes a shadow and appears dark. Further, when the subject is irradiated with light using a strobe for shooting, the shadow of the subject may be skipped due to the influence of the light, and the subject may appear flat. As a method of correcting these images, there is known a method of correcting a dark part of a subject as if light is applied from a specific direction in a pseudo manner (Patent Document 1). In the technique described in Patent Document 1, the influence of a virtual light source is expressed by a Gaussian distribution. At this time, by biasing the Gaussian distribution of the brightness correction according to the lighting direction set separately, it is possible to obtain an image as if the illumination was applied from a virtually desired direction.

また、被写体に対応する３Ｄモデルを用いて所定の仮想照明条件下でＣＧのレンダリング処理を行い、レンダリングしたＣＧ画像を、撮影により得られた画像中の被写体像と置き換えることで被写体のライティングを変更する方法も知られている（特許文献２）。特許文献２に記載の技術では、あらかじめ用意しておいた３Ｄモデルの中から被写体と置き換える３Ｄモデルを決定する。 In addition, the lighting of the subject is changed by performing CG rendering processing under predetermined virtual lighting conditions using a 3D model corresponding to the subject and replacing the rendered CG image with the subject image in the image obtained by shooting. A method for rendering is also known (Patent Document 2). In the technique described in Patent Document 2, a 3D model to be replaced with the subject is determined from the 3D models prepared in advance.

特許第５２８１８７８号公報Japanese Patent No. 5281878 特許第５０８８２２０号公報Japanese Patent No. 5088220

撮影者が所望する照明条件下で撮影したかのような別の画像を撮影画像から生成する場合、被写体の姿勢や表情に合わせた陰影を付与した画像が得られることが好ましい。しかしながら特許文献１、２の技術では以下のような課題がある。 When another image is generated from the captured image as if it was captured under the lighting conditions desired by the photographer, it is preferable to obtain an image with shading according to the posture and facial expression of the subject. However, the techniques of Patent Documents 1 and 2 have the following problems.

特許文献１に記載の技術のように仮想光源の影響をガウス分布で表現する場合、人間の顔のような複雑な形状の被写体に対して自然な陰影を付与することは困難である。また、特許文献２に記載の技術のように、被写体の３Ｄモデルを用いてレンダリングした結果で画像中の被写体像を置きかえる場合、置き換えた後の被写体の姿勢や表情が３Ｄモデルの姿勢や表情に置き換わってしまい不自然な結果となってしまう。そこで、本発明は被写体の姿勢や表情に合わせた陰影を付与した画像を得ることを目的とする。 When the influence of a virtual light source is expressed by a Gaussian distribution as in the technique described in Patent Document 1, it is difficult to give a natural shadow to a subject having a complicated shape such as a human face. Further, when the subject image in the image is replaced with the result of rendering using the 3D model of the subject as in the technique described in Patent Document 2, the posture or facial expression of the subject after the replacement becomes the posture or facial expression of the 3D model. It will be replaced and the result will be unnatural. Therefore, an object of the present invention is to obtain an image in which a shadow is added according to the posture and facial expression of the subject.

上記課題を解決するために、本発明に係る画像処理装置は、被写体を撮像することにより得られたカラー画像を取得する取得手段と、前記カラー画像において前記被写体の顔の領域を顔領域として特定する特定手段と、前記カラー画像における前記顔領域の少なくとも一部の明るさが変更された補正画像を生成する生成手段を有することを特徴とする。 In order to solve the above problems, the image processing apparatus according to the present invention specifies an acquisition means for acquiring a color image obtained by photographing a subject and a region of the face of the subject as a face region in the color image. It is characterized by having a specific means for generating a corrected image in which the brightness of at least a part of the face region in the color image is changed.

本発明によれば、被写体の姿勢や表情に合わせた陰影を付与した画像を得ることができる。 According to the present invention, it is possible to obtain an image in which a shadow is added according to the posture and facial expression of the subject.

本発明の実施形態１に係る撮像装置の外観を示す図。The figure which shows the appearance of the image pickup apparatus which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る撮像装置の内部構成を示す図。The figure which shows the internal structure of the image pickup apparatus which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る画像処理部の構成を示すブロック図。The block diagram which shows the structure of the image processing part which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る処理の流れを示すフローチャート。The flowchart which shows the flow of the process which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る画像データの例を示す図。The figure which shows the example of the image data which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る顔情報を示す図。The figure which shows the face information which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る距離補正処理の流れを示すフローチャート。The flowchart which shows the flow of the distance correction processing which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る距離補正処理の概要を表す図。The figure which shows the outline of the distance correction processing which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る法線補正処理の流れを示すフローチャート。The flowchart which shows the flow of the normal correction processing which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る法線補正処理の概要を表す図。The figure which shows the outline of the normal correction processing which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る法線平滑化処理の概要を表す図。The figure which shows the outline of the normal smoothing process which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係るライティング処理の流れを示すフローチャート。The flowchart which shows the flow of the lighting process which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係るライティング処理の概要を示す図。The figure which shows the outline of the lighting process which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係るライティング処理の効果を表す図。The figure which shows the effect of the lighting process which concerns on Embodiment 1 of this invention. 本発明の実施形態２に係る補正係数の例を示す図。The figure which shows the example of the correction coefficient which concerns on Embodiment 2 of this invention. 本発明の実施形態３に係るライティング処理の効果を表わす図。The figure which shows the effect of the lighting process which concerns on Embodiment 3 of this invention. 本発明の実施形態４に係る処理の流れを示すフローチャート。The flowchart which shows the flow of the process which concerns on Embodiment 4 of this invention.

［実施形態１］
＜撮像装置の外観＞
図１は本実施形態に係る撮像装置の外観を示す図であり、図１（ａ）は撮像装置の前面、図１（ｂ）は背面の外観を示している。撮像装置１０１は、光学部１０２、撮像ボタン１０３、ストロボ１０４、距離取得部１０５、表示部１０６、および操作ボタン１０７を有している。 [Embodiment 1]
<Appearance of imaging device>
1A and 1B are views showing the appearance of the image pickup apparatus according to the present embodiment, FIG. 1A shows the appearance of the front surface of the image pickup apparatus, and FIG. 1B shows the appearance of the back surface of the image pickup apparatus. The image pickup apparatus 101 includes an optical unit 102, an image pickup button 103, a strobe 104, a distance acquisition unit 105, a display unit 106, and an operation button 107.

光学部１０２はズームレンズ、フォーカスレンズ、ブレ補正レンズ、絞り、およびシャッターによって構成される鏡筒であり、被写体の光情報を集光する。撮像ボタン１０３は、ユーザが撮像の開始を撮像装置１０１に指示するためのボタンである。ストロボ１０４は、ユーザ指示に従い撮像の開始に合わせて発光させることができる照明である。距離取得部１０５は、撮像指示に応じて被写体の距離画像データを取得する距離取得モジュールである。距離画像データとは、画像の各画素の画素値としてその画素に対応する被写体距離を格納した画像データのことを意味する。距離取得部１０５は、赤外光を発光する赤外発光部と、被写体に反射した赤外光を受光する受光部とを含み、発光した赤外光が被写体に反射し受光するまでの時間を基に撮像装置から被写体までの距離値を算出する。そして、算出した距離値と受光部のセンサ画素数や画角等を含む距離撮像情報に基づき被写体の位置情報を算出し距離画像データを生成する。なお、距離画像データの取得方法はこれに限られない。例えば距離取得部１０５の代わりに光学部１０２と同様の光学系を設け、異なる２つの視点から撮像された画像データの間の視差に基づいて、三角測量を行うことにより距離画像データを取得するようにしてもよい。 The optical unit 102 is a lens barrel composed of a zoom lens, a focus lens, a blur correction lens, an aperture, and a shutter, and collects light information of a subject. The image pickup button 103 is a button for the user instructing the image pickup apparatus 101 to start imaging. The strobe 104 is an illumination that can emit light at the start of imaging according to a user instruction. The distance acquisition unit 105 is a distance acquisition module that acquires distance image data of a subject in response to an imaging instruction. The distance image data means image data in which the subject distance corresponding to the pixel is stored as the pixel value of each pixel of the image. The distance acquisition unit 105 includes an infrared light emitting unit that emits infrared light and a light receiving unit that receives infrared light reflected by the subject, and takes a time until the emitted infrared light is reflected by the subject and received. Based on this, the distance value from the image pickup device to the subject is calculated. Then, the position information of the subject is calculated based on the calculated distance value and the distance imaging information including the number of sensor pixels of the light receiving unit, the angle of view, and the like, and the distance image data is generated. The method of acquiring the distance image data is not limited to this. For example, instead of the distance acquisition unit 105, an optical system similar to the optical unit 102 is provided, and distance image data is acquired by performing triangulation based on the parallax between the image data captured from two different viewpoints. It may be.

表示部１０６は、撮像装置１０１にて処理された画像データや他の各種データなどを表示する、液晶ディスプレイなどのディスプレイである。本実施形態では撮像装置１０１に光学ファインダを設けていないので、フレーミング操作（ピントや構図の確認）は、表示部１０６を用いて行われる。すなわち、表示部１０６でライブビュー画像を確認しながら撮像が行われるので、フレーミングやフォーカシングの操作を行っている間は、表示部１０６は電子ファインダとして機能すると言える。表示部１０６では、撮像範囲をリアルタイムに表示するライブビュー表示が行われる他、カメラ設定メニューが表示される。 The display unit 106 is a display such as a liquid crystal display that displays image data processed by the image pickup apparatus 101, various other data, and the like. Since the image pickup apparatus 101 is not provided with an optical viewfinder in the present embodiment, the framing operation (confirmation of focus and composition) is performed using the display unit 106. That is, since the image is taken while checking the live view image on the display unit 106, it can be said that the display unit 106 functions as an electronic viewfinder while performing framing and focusing operations. On the display unit 106, a live view display for displaying the imaging range in real time is performed, and a camera setting menu is displayed.

操作ボタン１０７は、撮像装置１０１の動作モードの切り換え操作や、撮像時の各種パラメータなどをユーザが撮像装置１０１に指示するためのボタンである。なお、本実施形態では動作モードの一つとして、撮像された画像における照明の当たり具合を撮像後に補正するライティング補正処理モードが含まれる。ユーザは操作ボタン１０７、あるいは撮像ボタン１０３を用いてライティング補正処理モードへの切り替えや、ライティング補正に用いる仮想照明の照明パラメータの設定や、照明の当たり具合を調整する被写体の選択などを行うことができる。また、ユーザは補正された画像データを出力する際に、距離画像データを出力するかどうか等の指示をすることもできるものとする。なお、表示部１０６はタッチスクリーン機能を有していても良く、その場合はタッチスクリーンを用いたユーザ指示を操作ボタン１０７の入力として扱うことも可能である。 The operation button 107 is a button for the user to instruct the image pickup apparatus 101 of an operation for switching the operation mode of the image pickup apparatus 101, various parameters at the time of imaging, and the like. In the present embodiment, as one of the operation modes, a lighting correction processing mode for correcting the lighting condition in the captured image after imaging is included. The user can switch to the lighting correction processing mode using the operation button 107 or the image pickup button 103, set the lighting parameters of the virtual lighting used for the lighting correction, select the subject for adjusting the lighting condition, and the like. can. Further, when outputting the corrected image data, the user can also give an instruction as to whether or not to output the distance image data. The display unit 106 may have a touch screen function, and in that case, a user instruction using the touch screen can be treated as an input of the operation button 107.

＜撮像装置の内部構成＞
図２は本実施形態における撮像装置１０１の内部構成を示すブロック図である。 <Internal configuration of imaging device>
FIG. 2 is a block diagram showing an internal configuration of the image pickup apparatus 101 according to the present embodiment.

ＣＰＵ２０２は、各構成の処理すべてに関わり、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２０３や、ＲＡＭ（ＲｏｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２０４に格納された命令を順に読み込み、解釈し、その結果に従って処理を実行する。システムバス２１２はデータを送受信するためのバスである。なお、本実施形態において、ＲＯＭ２０３には、人間の顔に対応する顔法線モデルが格納されているものとする。顔法線モデルは、所定の形状の顔に対応する、顔表面の法線ベクトルを画素値に格納した法線画像データと、法線画像データにおける人の、目や口などの器官位置を示す器官位置情報とを含む。 The CPU 202 is involved in all the processes of each configuration, reads and interprets the instructions stored in the ROM (Read Only Memory) 203 and the RAM (Random Access Memory) 204 in order, and executes the processes according to the result. The system bus 212 is a bus for transmitting and receiving data. In the present embodiment, it is assumed that the face normal model corresponding to the human face is stored in the ROM 203. The face normal model shows normal image data in which the normal vector of the face surface corresponding to a face having a predetermined shape is stored in pixel values, and the positions of human organs such as eyes and mouth in the normal image data. Includes organ location information.

制御部２０６は、撮像ボタン１０３や操作ボタン１０７からのユーザ指示を受取り、撮像、ライティング補正処理モードの切り換え、被写体領域の選択、照明パラメータの設定などの制御を行う制御回路である。光学系制御部２０５は光学部１０２に対して、フォーカスを合わせる、シャッターを開く、絞りを調整するなどのＣＰＵ２０２から指示された制御を行う制御回路である。 The control unit 206 is a control circuit that receives user instructions from the image pickup button 103 and the operation button 107 and controls imaging, switching of the lighting correction processing mode, selection of a subject area, setting of lighting parameters, and the like. The optical system control unit 205 is a control circuit that controls the optical unit 102 instructed by the CPU 202, such as focusing, opening a shutter, and adjusting the aperture.

カラー撮像素子部２０１は、光学部１０２にて集光された光情報を電流値へと変換する撮像素子である。カラー撮像素子部２０１にはベイヤ配列などの所定の配列を有するカラーフィルタが備えてあり、光学部１０２にて集光された光から被写体の色情報が取得される。 The color image sensor unit 201 is an image sensor that converts the light information collected by the optical unit 102 into a current value. The color image pickup element unit 201 is provided with a color filter having a predetermined arrangement such as a Bayer arrangement, and color information of a subject is acquired from the light focused by the optical unit 102.

Ａ／Ｄ変換部２０８は、カラー撮像素子部２０１にて検知された被写体の色情報をデジタル信号値に変換しＲＡＷ画像データとする処理回路である。なお、本実施形態では同時刻に撮像した距離画像データとＲＡＷ画像データが取得可能であるとする。 The A / D conversion unit 208 is a processing circuit that converts the color information of the subject detected by the color image sensor unit 201 into digital signal values and converts them into RAW image data. In this embodiment, it is assumed that the distance image data and the RAW image data captured at the same time can be acquired.

画像処理部２０９はＡ／Ｄ変換部２０８で取得されたＲＡＷ画像データに対して現像処理を行い、カラー画像データを生成する。また、画像処理部２０９はカラー画像データや距離画像データを用いて、カラー画像データにライティング補正を行った補正画像データを生成するなどの、各種画像処理を行う。画像処理部２０９の内部構造は後に詳述する。 The image processing unit 209 develops the RAW image data acquired by the A / D conversion unit 208 to generate color image data. Further, the image processing unit 209 performs various image processing such as generating corrected image data in which lighting correction is performed on the color image data by using the color image data and the distance image data. The internal structure of the image processing unit 209 will be described in detail later.

また、キャラクタージェネレーション部２０７は文字やグラフィックなどを生成する処理回路である。キャラクタージェネレーション部２０７により生成された文字やグラフィックは、画像データや補正画像データなどに重畳して表示部１０６に表示される。 Further, the character generation unit 207 is a processing circuit that generates characters, graphics, and the like. The characters and graphics generated by the character generation unit 207 are superimposed on the image data, the corrected image data, and the like and displayed on the display unit 106.

エンコーダ部２１０は、画像処理部２０９にて処理したカラー画像データやライティング補正処理によって生成される補正画像データを含む各種画像データをＪｐｅｇなどのファイルフォーマットに変換する処理を行う。 The encoder unit 210 performs a process of converting various image data including the color image data processed by the image processing unit 209 and the corrected image data generated by the lighting correction process into a file format such as JPEG.

メディアＩ／Ｆ２１１は、ＰＣ／メディア２１３（例えば、ハードディスク、メモリカード、ＣＦカード、ＳＤカードなど）に画像データを送受信するためのインタフェースである。メディアＩ／Ｆ２１１としては、例えばＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）などが用いられる。 The media I / F211 is an interface for transmitting and receiving image data to and from a PC / media 213 (for example, a hard disk, a memory card, a CF card, an SD card, etc.). As the media I / F211 for example, USB (Universal Serial Bus) or the like is used.

＜画像処理部の内部構成＞
図３は本実施形態における画像処理部２０９の機能構成を示すブロック図である。現像処理部３０１は、Ａ／Ｄ変換部２０８から取得したＲＡＷ画像データに対してホワイトバランス処理、デモザイク処理、ノイズリダクション処理、色変換処理、エッジ強調処理およびガンマ処理等を施し、カラー画像データを生成する。生成したカラー画像データは表示部１０６へ出力して表示したり、ＲＡＭ２０４、ＰＣ／メディア２１３などの記憶装置に記憶することができる。なお、本実施形態では、現像処理部３０１はガンマ処理を施さずにカラー画像データを生成し、ライティング部３０５に出力する。 <Internal configuration of image processing unit>
FIG. 3 is a block diagram showing a functional configuration of the image processing unit 209 in the present embodiment. The development processing unit 301 performs white balance processing, demosaic processing, noise reduction processing, color conversion processing, edge enhancement processing, gamma processing, and the like on the RAW image data acquired from the A / D conversion unit 208, and obtains the color image data. Generate. The generated color image data can be output to the display unit 106 for display or stored in a storage device such as a RAM 204 or a PC / media 213. In this embodiment, the development processing unit 301 generates color image data without performing gamma processing and outputs the color image data to the lighting unit 305.

距離補正部３０２は、カラー画像データ、顔情報、ユーザにより選択された被写体位置に基づいて、距離画像データから選択された被写体に対応する補正距離データを生成する。本実施形態では、補正距離データは、主に選択された被写体位置に対応した人物と、それ以外の背景に対応する距離値を格納しているものとする。 The distance correction unit 302 generates correction distance data corresponding to the subject selected from the distance image data based on the color image data, face information, and the subject position selected by the user. In the present embodiment, it is assumed that the correction distance data mainly stores a person corresponding to the selected subject position and a distance value corresponding to other backgrounds.

顔検出部３０３は、現像処理部３０１から取得したカラー画像データから被写体の顔情報を取得する。被写体の顔情報には、少なくともカラー画像データにおいて被写体の顔が占める領域を示す顔領域と、顔に含まれる目や口などのカラー画像データにおける位置を示す器官位置とに関する情報が含まれる。 The face detection unit 303 acquires the face information of the subject from the color image data acquired from the development processing unit 301. The face information of the subject includes at least information on a face region indicating an area occupied by the subject's face in the color image data and an organ position indicating a position in the color image data such as eyes and mouth included in the face.

法線補正部３０４は、顔検出部３０３から取得した顔情報と、現像部３０１から取得したカラー画像データとに基づいて、ＲＯＭ２０３に格納された顔法線モデルを補正する。 The normal correction unit 304 corrects the face normal model stored in the ROM 203 based on the face information acquired from the face detection unit 303 and the color image data acquired from the development unit 301.

ライティング処理部３０５は、距離補正部３０２から取得した補正距離データと、法線補正部３０４から取得した補正法線データと、制御部２０６から取得した照明パラメータとに基づいて、カラー画像データに対してライティング処理を行う。ライティング処理により生成された補正画像データはＲＡＭ２０４やＰＣ／メディア２１３などの記憶装置に出力して記憶したり、表示部１０６へ出力して表示したりすることができる。 The lighting processing unit 305 refers to the color image data based on the correction distance data acquired from the distance correction unit 302, the correction normal data acquired from the normal correction unit 304, and the illumination parameters acquired from the control unit 206. And perform the lighting process. The corrected image data generated by the lighting process can be output and stored in a storage device such as a RAM 204 or a PC / media 213, or can be output and displayed on the display unit 106.

＜画像処理部の処理フロー＞
図４は本実施形態の撮像装置における画像処理部２０９の動作手順を示すフローチャートである。本実施形態において、画像処理部２０９は、カラー画像データから取得した顔情報と、ユーザ指示に基づき取得した被写体位置Ｐ０とを用いて、選択された被写体に対応する補正距離データを距離画像データから生成する。そして、被写体の顔情報と、あらかじめ保持していた顔法線モデルとに基づき、被写体の顔に合わせた法線画像データを生成する。その後、ユーザ操作によって設定された照明パラメータ、補正距離データ及び生成された法線画像データに基づき、カラー画像データに仮想光源を追加するライティング処理を行い補正画像データを生成する。以下、画像処理部２０９の動作手順の詳細について述べる。 <Processing flow of image processing unit>
FIG. 4 is a flowchart showing an operation procedure of the image processing unit 209 in the image pickup apparatus of the present embodiment. In the present embodiment, the image processing unit 209 uses the face information acquired from the color image data and the subject position P0 acquired based on the user's instruction to obtain the correction distance data corresponding to the selected subject from the distance image data. Generate. Then, based on the face information of the subject and the face normal model held in advance, normal image data matching the face of the subject is generated. After that, based on the lighting parameters set by the user operation, the correction distance data, and the generated normal image data, a lighting process of adding a virtual light source to the color image data is performed to generate the correction image data. The details of the operation procedure of the image processing unit 209 will be described below.

ステップＳ４０１において、現像処理部３０１がＡ／Ｄ変換部２０８から取得したＲＡＷ画像データにデモザイク処理などの現像処理を施してカラー画像データを生成する。本実施形態におけるカラー画像データについて図５（ａ）を用いて説明する。カラー画像データＩ５０１の画素（ｉ，ｊ）にはＲＧＢ値が画素値として格納されているものとし、それぞれＩｒ（ｉ，ｊ）、Ｉｇ（ｉ，ｊ）、Ｉｂ（ｉ，ｊ）と表すものとする。なお、カラー画像データの取得方法はこれに限るものではない。例えば、ＲＡＭ２０４やＰＣ／メディア２１３に記憶されているＲＡＷ画像データを取得し現像処理部３０１がカラー画像データを生成してもよい。あるいは、既に現像処理が行われたカラー画像データをＲＡＭ２０４やＰＣ／メディア２１３から取得してもよい。そして、ステップＳ４０２において、現像部３０１はステップＳ４０１で取得したカラー画像データを表示部１０６に出力する。ユーザは、表示部１０６での表示に基づいて、ライティング補正処理を行うかどうかの判断を行う。 In step S401, the development processing unit 301 performs development processing such as demosaic processing on the RAW image data acquired from the A / D conversion unit 208 to generate color image data. The color image data in this embodiment will be described with reference to FIG. 5 (a). It is assumed that RGB values are stored as pixel values in the pixels (i, j) of the color image data I501, and they are represented as Ir (i, j), Ig (i, j), and Ib (i, j), respectively. And. The method for acquiring color image data is not limited to this. For example, the RAW image data stored in the RAM 204 or the PC / media 213 may be acquired and the development processing unit 301 may generate the color image data. Alternatively, the color image data that has already been developed may be acquired from the RAM 204 or the PC / media 213. Then, in step S402, the developing unit 301 outputs the color image data acquired in step S401 to the display unit 106. The user determines whether or not to perform the lighting correction process based on the display on the display unit 106.

ステップＳ４０３において、制御部２０６が操作部１０７からの入力に従い、ライティング補正処理を行う指示が入力されているか否かの判定を行う。ライティング補正処理を行う指示が入力されていない場合はステップＳ４０４に進む。ライティング補正処理を行う指示が入力されている場合は、制御部２０６はライティング補正を行うことを示す信号を現像部３０１と距離補正部３０２とに出力してステップＳ４０６に進む。 In step S403, the control unit 206 determines whether or not an instruction to perform the lighting correction process is input according to the input from the operation unit 107. If the instruction to perform the lighting correction process is not input, the process proceeds to step S404. When an instruction to perform the lighting correction process is input, the control unit 206 outputs a signal indicating that the lighting correction is performed to the developing unit 301 and the distance correction unit 302, and proceeds to step S406.

ステップＳ４０４において、制御部２０６は、ユーザにより画像の出力指示が入力されているかどうかを判定する。ユーザにより画像の出力指示が入力されていると判定された場合は、ステップＳ４０５に進む。ユーザにより画像の出力指示が入力されていると判定されない場合は、ステップＳ４０３に戻る。 In step S404, the control unit 206 determines whether or not an image output instruction has been input by the user. If it is determined that the image output instruction has been input by the user, the process proceeds to step S405. If it is not determined that the image output instruction has been input by the user, the process returns to step S403.

ステップＳ４０５において、制御部２０６は画像の出力指示を現像部３０１に出力し、現像部３０１は、カラー画像データをＰＣ／メディア２１３に出力して処理を終了する。 In step S405, the control unit 206 outputs an image output instruction to the developing unit 301, and the developing unit 301 outputs the color image data to the PC / media 213 to end the process.

ステップＳ４０６において、距離補正部３０２が距離取得部１０５から距離画像データを取得する。本実施形態における距離画像データについて図５（ｂ）を用いて説明する。距離画像データＤ５０２の画素（ｉ，ｊ）には画素値として撮像装置から被写体までの距離値Ｄ（ｉ，ｊ）を格納しているものとする。なお、距離画像データの取得方法はこれに限るものではない。例えば、ＲＡＭ２０４やＰＣ／メディア２１３に記憶されている距離画像データを取得してもよい。 In step S406, the distance correction unit 302 acquires the distance image data from the distance acquisition unit 105. The distance image data in this embodiment will be described with reference to FIG. 5 (b). It is assumed that the pixel (i, j) of the distance image data D502 stores the distance value D (i, j) from the image pickup apparatus to the subject as a pixel value. The method of acquiring the distance image data is not limited to this. For example, the distance image data stored in the RAM 204 or the PC / media 213 may be acquired.

ステップＳ４０７において、現像部３０１は顔検出部３０３にカラー画像データを出力し、顔検出部３０３は入力されたカラー画像データから被写体の顔情報を取得する。本実施形態における顔情報について図６を用いて説明する。本実施形態における顔情報は、顔領域６０１および器官位置６０２を示す情報を含む。顔領域は、カラー画像データ５０１において顔が含まれる領域の画素の集合を表す。器官位置６０２は、顔領域内における目や口に対応する座標を表す。顔領域、器官位置の検出方法については既存のアルゴリズムが適用可能である。例として、テンプレートマッチングを用いたアルゴリズムや、Ｈａａｒ−Ｌｉｋｅ特徴量を用いたアルゴリズムなどが挙げられる。本実施形態では、テンプレートマッチングによって顔領域・器官位置を検出する。まず、カラー画像データに対してしきい値処理を行うことで肌色の領域を顔候補領域として抽出する。すなわち、様々な肌色に基づいて決定された画素値の範囲の中に画素値が収まる画素を、顔候補領域として抽出する。そして、様々な大きさの顔画像テンプレートを用いて顔候補領域に対してマッチング処理を行い、顔領域としての尤度を算出する。最後に、算出された尤度が所定の閾値以上である領域を顔領域として抽出する。また、顔検出部３０３は、抽出された顔領域に対して目、口画像テンプレートを用いて同様のテンプレートマッチングを行い、目および口に対応する座標を抽出する。以上の処理により顔領域６０１、器官位置６０２が取得される。顔検出部３０３は、取得した顔情報を法線補正部３０４に出力する。なお、検出する器官としては、目や口以外にも鼻や耳など別の器官を抽出してもよい。 In step S407, the developing unit 301 outputs the color image data to the face detecting unit 303, and the face detecting unit 303 acquires the face information of the subject from the input color image data. The face information in this embodiment will be described with reference to FIG. The face information in the present embodiment includes information indicating the face area 601 and the organ position 602. The face region represents a set of pixels in a region including a face in the color image data 501. The organ position 602 represents the coordinates corresponding to the eyes and mouth in the facial area. Existing algorithms can be applied to the method of detecting the facial region and organ position. Examples include an algorithm using template matching and an algorithm using Haar-Like features. In this embodiment, the face region / organ position is detected by template matching. First, the skin color region is extracted as a face candidate region by performing threshold processing on the color image data. That is, pixels whose pixel values fall within the range of pixel values determined based on various skin colors are extracted as face candidate regions. Then, matching processing is performed on the face candidate region using face image templates of various sizes, and the likelihood as the face region is calculated. Finally, a region whose calculated likelihood is equal to or higher than a predetermined threshold value is extracted as a face region. Further, the face detection unit 303 performs the same template matching on the extracted face area using the eye and mouth image templates, and extracts the coordinates corresponding to the eyes and mouth. By the above processing, the face area 601 and the organ position 602 are acquired. The face detection unit 303 outputs the acquired face information to the normal correction unit 304. In addition to the eyes and mouth, other organs such as the nose and ears may be extracted as the organs to be detected.

ステップＳ４０８において、距離補正部３０２はユーザによって指定された被写体の位置を決定する。本実施形態において、ユーザは表示部１０６に設けられたタッチパネルや、操作ボタン１０７を用いて、ライティング補正処理を行いたい被写体の位置を指定する。距離補正部３０２は、ユーザ操作により入力された被写体選択位置Ｐ０’を制御部２０６から取得する。そして、取得した被写体選択位置Ｐ０’に基づいて、カラー画像データにおける指定された被写体位置Ｐ０を算出する。本実施形態では、タッチスクリーン機能を有した表示部１０６にカラー画像データを表示し、表示画面中の被写体をユーザがタッチする操作を受け付け、距離補正部３０２はユーザがタッチした位置を被写体選択位置Ｐ０’として制御部２０６から取得する。この際、被写体選択位置Ｐ０’は表示部１０６の画素位置に対応する。距離補正部３０２は、この表示部１０６上での画素位置を、カラー画像データの画素位置に変換することで被写体位置Ｐ０を算出する。 In step S408, the distance correction unit 302 determines the position of the subject designated by the user. In the present embodiment, the user uses the touch panel provided on the display unit 106 and the operation buttons 107 to specify the position of the subject to be subjected to the lighting correction processing. The distance correction unit 302 acquires the subject selection position P0'input by the user operation from the control unit 206. Then, the designated subject position P0 in the color image data is calculated based on the acquired subject selection position P0'. In the present embodiment, the color image data is displayed on the display unit 106 having the touch screen function, the operation of the user touching the subject on the display screen is accepted, and the distance correction unit 302 sets the position touched by the user as the subject selection position. Obtained from the control unit 206 as P0'. At this time, the subject selection position P0'corresponds to the pixel position of the display unit 106. The distance correction unit 302 calculates the subject position P0 by converting the pixel position on the display unit 106 into the pixel position of the color image data.

ステップＳ４０９において、距離補正部３０２が、ステップＳ４０８で取得した被写体位置Ｐ０と、現像部３０１から取得したカラー画像データとを用いて、ステップＳ４０６で取得した距離画像データから補正距離データを生成する。補正距離データ生成処理の詳細については後述する。距離補正部３０２は、生成した補正距離データをライティング部３０５に出力する。 In step S409, the distance correction unit 302 generates correction distance data from the distance image data acquired in step S406 by using the subject position P0 acquired in step S408 and the color image data acquired from the developing unit 301. The details of the correction distance data generation process will be described later. The distance correction unit 302 outputs the generated correction distance data to the lighting unit 305.

ステップＳ４１０において、法線補正部３０４が顔検出部３０３から取得した顔情報と、現像部３０１から入力されたカラー画像データとに基づいて、被写体の顔に合わせた法線画像データである補正法線データを生成する。補正法線データ生成処理の詳細については後述する。法線補正部３０４は、生成した補正法線データをライティング部３０５に出力する。 In step S410, a correction method that is normal image data that matches the face of the subject based on the face information acquired by the normal correction unit 304 from the face detection unit 303 and the color image data input from the development unit 301. Generate line data. The details of the correction normal data generation process will be described later. The normal correction unit 304 outputs the generated correction normal data to the lighting unit 305.

ステップＳ４１１において、ライティング部３０５が、入力された補正距離データと補正法線データとに基づいて、カラー画像データに対して仮想的な光源を加えるなどのライティング処理を行い補正画像データを生成する。ライティング処理の詳細については後述する。 In step S411, the lighting unit 305 performs lighting processing such as adding a virtual light source to the color image data based on the input correction distance data and the correction normal data to generate the correction image data. The details of the lighting process will be described later.

ステップＳ４１２において、ライティング部３０５が制御部２０６から、ライティング処理に用いる照明パラメータの設定の変更が入力されたかどうかを判定する。照明パラメータの設定が変更されたと判定された場合はステップＳ４１１に戻り再びライティング処理を行う。照明パラメータの設定が変更されていないと判定された場合はステップＳ４１３に進む。 In step S412, the lighting unit 305 determines whether or not a change in the setting of the lighting parameter used for the lighting process has been input from the control unit 206. If it is determined that the lighting parameter setting has been changed, the process returns to step S411 and the lighting process is performed again. If it is determined that the lighting parameter settings have not been changed, the process proceeds to step S413.

ステップＳ４１３では、ライティング部３０５が、制御部２０６から画像の出力指示が入力されたかどうかを判定する。画像の出力指示が入力されたと判定された場合は、ステップＳ４１４に進む。画像の出力指示が入力されていないと判定された場合はステップＳ４１２に戻る。ステップＳ４１４では、ライティング処理部３０５が、生成された補正画像データをＰＣ／メディア２１３に出力して処理を終了する。以上が本実施形態の画像処理部２０９で行われる処理の流れである。以上の処理によれば、被写体に合わせて変形した顔法線モデルを用いてライティング処理を行うことができるので、被写体の姿勢や表情に合わせた自然な陰影を付与した画像を得ることができる。以下、画像処理部２０９の各構成部で行われる処理の詳細について説明する。 In step S413, the lighting unit 305 determines whether or not an image output instruction has been input from the control unit 206. If it is determined that the image output instruction has been input, the process proceeds to step S414. If it is determined that the image output instruction has not been input, the process returns to step S412. In step S414, the lighting processing unit 305 outputs the generated corrected image data to the PC / media 213 and ends the processing. The above is the flow of processing performed by the image processing unit 209 of the present embodiment. According to the above processing, since the lighting processing can be performed using the face normal model deformed according to the subject, it is possible to obtain an image with natural shading according to the posture and facial expression of the subject. Hereinafter, the details of the processing performed in each component of the image processing unit 209 will be described.

＜補正距離データ生成処理＞
ここでは、ステップＳ４０９で距離補正部３０２が行う補正距離データ生成処理について、図７に示すフローチャートを参照して説明する。ステップＳ７０１において、距離補正部３０２は、顔情報と被写体位置Ｐ０と距離画像データとに基づき被写体候補領域の抽出を行う。図８（ａ）（ｂ）を用いて本ステップの処理を説明する。まず、距離補正部３０２は、顔情報が示す顔領域の中から被写体位置Ｐ０に最も近い顔領域６０１を選択する。そして、選択された顔領域中の各画素の距離値を距離画像データから取得し、それらの平均値を顔領域の距離値として算出する。その後、距離補正部３０２は、顔領域の距離値との距離値の差が所定の閾値以下となる画素とそれ以外の画素とに分けた二値画像８０１を生成する。すなわち、ここで行われる処理は、選択された被写体からの距離が所定の範囲に含まれる被写体とそれ以外の被写体とを判別する処理である。ここで、二値画像８０１において、顔領域の距離値との距離値の差が閾値以下である画素を被写体候補領域８０２とする。なお、ここで行われる被写体候補領域の判別は上記の方法に限られず、単に選択された被写体位置からの距離値の差が所定の閾値以内となる領域を被写体候補領域として決定してもよい。 <Correction distance data generation process>
Here, the correction distance data generation process performed by the distance correction unit 302 in step S409 will be described with reference to the flowchart shown in FIG. 7. In step S701, the distance correction unit 302 extracts the subject candidate region based on the face information, the subject position P0, and the distance image data. The process of this step will be described with reference to FIGS. 8A and 8B. First, the distance correction unit 302 selects the face area 601 closest to the subject position P0 from the face areas indicated by the face information. Then, the distance value of each pixel in the selected face region is acquired from the distance image data, and the average value thereof is calculated as the distance value of the face region. After that, the distance correction unit 302 generates a binary image 801 divided into pixels in which the difference between the distance value in the face region and the distance value is equal to or less than a predetermined threshold value and other pixels. That is, the process performed here is a process of discriminating between a subject whose distance from the selected subject is within a predetermined range and a subject other than that. Here, in the binary image 801, the pixel in which the difference between the distance value of the face region and the distance value is equal to or less than the threshold value is defined as the subject candidate region 802. The determination of the subject candidate area performed here is not limited to the above method, and an area in which the difference in the distance value from the selected subject position is within a predetermined threshold value may be determined as the subject candidate area.

ステップＳ７０２において、距離補正部３０２は、二値画像８０１に対して小成分除去処理や穴埋め処理を施すことにより被写体候補領域に含まれる小さな連結成分を除去したり、穴を埋める整形処理を行う。小成分除去処理・穴埋め処理としては、モルフォロジ演算を用いた方法やラベリング処理を利用した方法などが適用可能である。ここではモルフォロジ演算を用いた方法を利用する。距離補正部３０２は小成分除去処理として、二値画像８０１に含まれる被写体候補領域に対してオープニング処理を行う。そして、その後の穴埋め処理としては、被写体候補領域に対してクロージング処理を行う。図８（ｃ）に本ステップによって得られる二値画像８０３の例を示す。 In step S702, the distance correction unit 302 performs a small component removal process or a hole filling process on the binary image 801 to remove a small connected component included in the subject candidate region or perform a shaping process to fill the hole. As the small component removal process / fill-in-the-blank process, a method using a morphology calculation or a method using a labeling process can be applied. Here, a method using morphology calculation is used. The distance correction unit 302 performs an opening process on the subject candidate region included in the binary image 801 as a small component removal process. Then, as the subsequent fill-in-the-blank process, a closing process is performed on the subject candidate area. FIG. 8C shows an example of the binary image 803 obtained by this step.

ステップＳ７０３において、距離補正部３０２は、ステップＳ７０２で整形処理が行われた二値画像８０３に対して平滑化処理を施し、多値の補正距離データ８０４（図８（ｄ））を生成する。例えば、二値画像８０３のうち被写体候補領域８０２に含まれる画素の画素値を２５５、その他の画素の画素値を０とした画像に対して平滑化処理を行うこと、一画素あたり８ビットの距離情報を有する補正距離データ８０４を生成する。このとき、画素値が大きいほど被写体までの距離が小さいものとする。 In step S703, the distance correction unit 302 performs smoothing processing on the binary image 803 subjected to the shaping process in step S702 to generate multi-value correction distance data 804 (FIG. 8D). For example, in the binary image 803, a smoothing process is performed on an image in which the pixel value of the pixel included in the subject candidate area 802 is 255 and the pixel value of the other pixels is 0, and the distance of 8 bits per pixel. The correction distance data 804 having information is generated. At this time, it is assumed that the larger the pixel value, the smaller the distance to the subject.

なお、平滑化処理としては、ガウシアンフィルタやカラー画像データの画素値を参照しつつ平滑化を行うジョイントバイラテラルフィルタ等が適用可能である。本実施形態では以下の式（１）で表されるジョイントバイラテラルフィルタを利用するものとする。 As the smoothing process, a Gaussian filter, a joint bilateral filter that performs smoothing while referring to the pixel values of color image data, and the like can be applied. In this embodiment, the joint bilateral filter represented by the following equation (1) is used.

ｓは処理対象画素、Ωはｓの近傍領域、ｐはΩに含まれる画素、Ｉは平滑化を行う画像データ、Ｒは参照用画像データ、ｆはｐとｓと間の距離に基づく重み、ｇは画素値に基づく重みを表す。ｆはｓとｐとの距離が大きくなるほど重みが小さくなるように設定される。ｇは参照用画像の画素ｐと画素ｓの画素値の差が大きいほど重みが小さくなるように設定する。式（１）ではＹは画素ｐと画素ｓの画素値の輝度差を表すものとする。ステップＳ７０３では、Ｉとして補正距離データ８０３を、Ｒとしてカラー画像データを使用し平滑化処理を行う。カラー画像データを参照しつつ二値画像８０３に対してジョイントバイラテラルフィルタを利用することにより、カラー画像データ内の画素値の近い画素のみを利用して平滑化処理を施すことができる。これにより、被写体領域８０２の輪郭を、カラー画像データ中の被写体の輪郭に合わせつつ平滑化をおこなう事ができる。なお、平滑化処理の方法はこれに限るものではない。例えば、ｆの設定方法として近傍領域内で等しい重み与えても構わない。また、ｇの設定方法として輝度値の代わりに色差に基づいて重みを構わない。あるいは、画素値が一定値以内であれば重みを一定にするなどしても良い。 s is the pixel to be processed, Ω is the neighborhood region of s, p is the pixel included in Ω, I is the image data for smoothing, R is the reference image data, and f is the weight based on the distance between p and s. g represents a weight based on the pixel value. f is set so that the weight decreases as the distance between s and p increases. g is set so that the larger the difference between the pixel values of the pixel p and the pixel s of the reference image, the smaller the weight. In the formula (1), Y represents the brightness difference between the pixel values of the pixel p and the pixel s. In step S703, smoothing processing is performed using the correction distance data 803 as I and the color image data as R. By using a joint bilateral filter for the binary image 803 while referring to the color image data, it is possible to perform the smoothing process using only the pixels having similar pixel values in the color image data. As a result, the contour of the subject area 802 can be smoothed while matching the contour of the subject in the color image data. The method of smoothing processing is not limited to this. For example, as a method of setting f, equal weights may be given in the neighborhood region. Further, as a method of setting g, a weight may be used based on a color difference instead of a luminance value. Alternatively, if the pixel value is within a certain value, the weight may be made constant.

また、補正距離データ８０４を多値画像で取得することにより、ステップＳ４０９で行うライティング処理の際に被写体輪郭部の違和感を軽減することができる。以上の処理により、距離補正部３０２は、主に手前の被写体とそれ以外の背景に分割され、それぞれに対応する距離値が格納された補正距離データ８０４を取得することができる。なお、ここで行われるフィルタ処理はジョイントバイラテラルフィルタである必要はなく、カラー画像データの画素値を基準とするフィルタ処理であればどのようなものを利用してもよい。 Further, by acquiring the correction distance data 804 as a multi-valued image, it is possible to reduce the discomfort of the subject contour portion during the lighting process performed in step S409. By the above processing, the distance correction unit 302 is mainly divided into a subject in the foreground and a background other than the subject, and can acquire the correction distance data 804 in which the distance values corresponding to the respective subjects are stored. The filter processing performed here does not have to be a joint bilateral filter, and any filter processing based on the pixel value of the color image data may be used.

＜法線画像データ生成処理＞
ここでは、ステップＳ４１０で法線生成部３０４が行う補正法線データ生成処理について説明する。本実施形態における補正法線データ生成処理は、ＲＯＭ２０３やＰＣ／メディア２１３に格納された顔法線モデルを、カラー画像データに基づいて補正する処理である。以下、補正法線データ生成処理の詳細について図９に示すフローチャートを参照して述べる。 <Normal image data generation processing>
Here, the correction normal data generation process performed by the normal generation unit 304 in step S410 will be described. The correction normal data generation process in the present embodiment is a process for correcting the face normal model stored in the ROM 203 or the PC / media 213 based on the color image data. Hereinafter, the details of the correction normal data generation process will be described with reference to the flowchart shown in FIG.

ステップＳ９０１において、顔法線モデルをカラー画像データに合わせて変形する際の変形パラメータを算出する。本実施形態の顔法線情報の例を図１０（ａ）に示す。顔法線モデルには顔法線画像データ１００１と、それに対応する器官位置情報１００２が含まれている。顔法線画像データ１００１は、画素Ｎ（ｉ，ｊ）に画素値として顔の向きの法線ベクトル（Ｎｘ（ｉ，ｊ）、Ｎｙ（ｉ，ｊ）、Ｎｚ（ｉ，ｊ））を格納した画像データである。Ｎｘ（ｉ，ｊ）、Ｎｙ（ｉ，ｊ）、Ｎｚ（ｉ，ｊ）はそれぞれ画素（ｉ，ｊ）に格納された法線ベクトルの、互いに直交する３本の座標軸であるｘ軸、ｙ軸、ｚ軸方向の成分である。また、顔法線画像データ１００１に含まれる法線ベクトルは全て単位ベクトルとする。顔の領域に対応する画素は顔表面に垂直な方向のベクトルが法線ベクトルとして格納されており、顔以外の領域に対応する画素は撮像装置の光軸とは逆方向のベクトルが法線ベクトルとして格納されているものとする。本実施形態ではｚ軸を撮像装置の光軸と逆方向とし、顔以外の領域に対応する画素では法線ベクトルとして（０，０，１）が格納されるものとする。器官位置情報１００２は、顔法線画像データ１００１中の右目、左目、口の座標値を示す。 In step S901, the deformation parameter when the face normal model is deformed according to the color image data is calculated. An example of the face normal information of this embodiment is shown in FIG. 10 (a). The face normal model includes face normal image data 1001 and corresponding organ position information 1002. The face normal image data 1001 stores the normal vector of the face orientation (Nx (i, j), Ny (i, j), Nz (i, j)) as a pixel value in the pixel N (i, j). This is the image data. Nx (i, j), Ny (i, j), and Nz (i, j) are the x-axis and y, which are three coordinate axes of the normal vector stored in the pixel (i, j), which are orthogonal to each other. It is a component in the axial and z-axis directions. Further, all the normal vectors included in the face normal image data 1001 are unit vectors. For the pixels corresponding to the face area, the vector in the direction perpendicular to the face surface is stored as a normal vector, and for the pixels corresponding to the area other than the face, the vector in the direction opposite to the optical axis of the image pickup device is the normal vector. It is assumed that it is stored as. In the present embodiment, the z-axis is in the direction opposite to the optical axis of the image pickup apparatus, and (0, 0, 1) is stored as a normal vector in the pixels corresponding to the region other than the face. The organ position information 1002 indicates the coordinate values of the right eye, the left eye, and the mouth in the face normal image data 1001.

本ステップでは、法線補正部３０４が、顔法線モデルに対応する器官位置１００２と、カラー画像データの顔情報に含まれる器官位置６０２とから、カラー画像データ５０１と顔法線画像データ１００１との右目、左目、口の座標を対応づける。そして、法線補正部３０４は、顔法線画像データ１００１の器官位置１００２を器官位置６０２に合わせるための変形パラメータを算出する。変形パラメータとしては、アフィン変換に用いるためのアフィン変換係数を算出する。アフィン変換係数の算出法としては最小二乗法などが利用可能である。すなわち、器官位置１００２をアフィン変換した際の、器官位置６０２との誤差の事情輪が最小になるアフィン変換係数が、ここでの変換パラメータとして決定される。なお、本実施形態では顔法線画像データ１００１は画素値として法線ベクトルのｘ軸、ｙ軸、ｚ軸方向の成分を保有しているが、例えば３チャンネル８ｂｉｔカラー画像データの各チャンネルにこれらを割り当てても構わない。例えば、法線ベクトルの各軸方向の成分は−１．０から１．０の値をとるため、この間の値を０から２５５に割り当てることで法線ベクトルの情報を３チャンネル８ｂｉｔカラー画像データとして保有することができる。 In this step, the normal correction unit 304 uses the organ position 1002 corresponding to the face normal model and the organ position 602 included in the face information of the color image data to obtain the color image data 501 and the face normal image data 1001. Associate the coordinates of the right eye, left eye, and mouth of. Then, the normal correction unit 304 calculates a deformation parameter for adjusting the organ position 1002 of the face normal image data 1001 to the organ position 602. As the deformation parameter, the affine transformation coefficient for use in the affine transformation is calculated. As a method for calculating the affine transformation coefficient, the least squares method or the like can be used. That is, the affine transformation coefficient that minimizes the circumstance of the error from the organ position 602 when the organ position 1002 is affine-transformed is determined as the conversion parameter here. In the present embodiment, the face normal image data 1001 has components in the x-axis, y-axis, and z-axis directions of the normal vector as pixel values. For example, these are included in each channel of the 3-channel 8-bit color image data. May be assigned. For example, since each axial component of the normal vector takes a value of -1.0 to 1.0, by assigning a value between them from 0 to 255, the information of the normal vector can be used as 3-channel 8-bit color image data. Can be owned.

ステップＳ９０２において、法線補正部３０４は、ステップＳ９０１で算出したアフィン変換係数を用いて顔法線画像データ１００１を変換し法線画像データ１００３を生成する。これにより、カラー画像データ５０１に含まれる顔領域に顔法線画像データ１００１をフィッティングした法線画像データ１００３が生成される。法線画像データ１００３は、画素Ｎ’（ｉ，ｊ）には画素値として法線ベクトル（Ｎ’ｘ（ｉ，ｊ）、Ｎ’ｙ（ｉ，ｊ）、Ｎ’ｚ（ｉ，ｊ））を格納した画像データである。法線画像データ１００３の法線ベクトルは、顔法線画像１００１に対応する領域（図１０（ｂ））については、顔法線画像１００１の各画素に格納された法線ベクトル（Ｎｘ，Ｎｙ，Ｎｚ）に基づいて算出される。そして、顔法線画像１００１に対応しない領域については、撮像装置の光軸と逆方向の法線ベクトル（０，０，１）が格納されるものとする。本ステップにより顔法線画像１００１中の顔領域をカラー画像データ中の顔領域に概ね合わせる事ができる。しかし、顔の輪郭など器官以外の位置は正確に合わせられない場合があるため、以降のステップでこれを補正する。 In step S902, the normal correction unit 304 converts the face normal image data 1001 using the affine transformation coefficient calculated in step S901 to generate the normal image data 1003. As a result, the normal image data 1003 is generated by fitting the face normal image data 1001 to the face region included in the color image data 501. In the normal image data 1003, the normal vector (N'x (i, j), N'y (i, j), N'z (i, j)) is used as a pixel value for the pixel N'(i, j). ) Is stored in the image data. The normal vector of the normal image data 1003 is a normal vector (Nx, Ny, It is calculated based on Nz). Then, for the region that does not correspond to the face normal image 1001, it is assumed that the normal vector (0, 0, 1) in the direction opposite to the optical axis of the imaging device is stored. By this step, the face area in the face normal image 1001 can be roughly matched with the face area in the color image data. However, positions other than organs such as the contour of the face may not be accurately aligned, so this will be corrected in the subsequent steps.

ステップＳ９０３において、法線補正部３０４は、法線画像データ１００３をｘ軸、ｙ軸、ｚ軸方向の成分毎に分け、ｘ軸成分法線データ１１０１、ｙ軸成分法線データ１１０２、ｚ軸成分法線データ１１０３の３つの画像データに分解する（図１１（ａ））。これにより、二値画像８０３と同様の平滑化処理が適用可能となる。本実施形態では、法線補正部３０４は、ステップＳ７０３と同様にジョイントバイラテラルフィルタを作用させる。 In step S903, the normal correction unit 304 divides the normal image data 1003 into components in the x-axis, y-axis, and z-axis directions, and divides the normal image data 1003 into x-axis component normal data 1101, y-axis component normal data 1102, and z-axis. It is decomposed into three image data of the component normal data 1103 (FIG. 11 (a)). This makes it possible to apply the same smoothing process as the binary image 803. In the present embodiment, the normal correction unit 304 operates the joint bilateral filter in the same manner as in step S703.

ステップＳ９０４において、法線補正部３０４は、ｘ軸成分法線データ１１０１に対して平滑化処理を行い、平滑化ｘ軸成分法線データ１１０４を生成する。平滑化処理としては、カラー画像データ５０１を参照画像とするジョイントバイラテラルフィルタを適用する。本処理によって得られる平滑化ｘ軸成分法線データ１１０４は各画素に平滑化されたｘ軸成分の値Ｎ”ｘが格納されているものとする。 In step S904, the normal correction unit 304 performs a smoothing process on the x-axis component normal data 1101 to generate the smoothed x-axis component normal data 1104. As the smoothing process, a joint bilateral filter using the color image data 501 as a reference image is applied. It is assumed that the smoothed x-axis component normal data 1104 obtained by this process stores the smoothed x-axis component value N "x in each pixel.

ステップＳ９０５において、法線補正部３０４は、ｙ軸成分法線データ１１０２に対して平滑化処理を行い、平滑化ｙ軸成分法線データ１１０５を生成する。平滑化処理としては、カラー画像データ５０１を参照画像とするジョイントバイラテラルフィルタを適用する。本処理によって得られる平滑化ｙ軸成分法線データ１１０５は各画素に平滑化されたｙ軸成分の値Ｎ”ｙが格納されているものとする。 In step S905, the normal correction unit 304 performs a smoothing process on the y-axis component normal data 1102 to generate the smoothed y-axis component normal data 1105. As the smoothing process, a joint bilateral filter using the color image data 501 as a reference image is applied. It is assumed that the smoothed y-axis component normal data 1105 obtained by this processing stores the smoothed y-axis component value N "y in each pixel.

ステップＳ９０６において、法線補正部３０４は、ｚ軸成分法線データ１１０３に対して平滑化処理を行い、平滑化ｚ軸成分法線データ１１０６を生成する。平滑化処理としては、カラー画像データ５０１を参照画像とするジョイントバイラテラルフィルタを適用する。本処理によって得られる平滑化ｚ軸成分法線データ１１０６は各画素に平滑化されたｚ軸成分の値Ｎ”ｚが格納されているものとする。
上記ステップＳ９０４からステップＳ９０６により、法線画像データの１００３中の顔の輪郭を、カラー画像データ中の被写体の輪郭に合わせる事ができる。 In step S906, the normal correction unit 304 performs a smoothing process on the z-axis component normal data 1103 and generates the smoothed z-axis component normal data 1106. As the smoothing process, a joint bilateral filter using the color image data 501 as a reference image is applied. It is assumed that the smoothed z-axis component normal data 1106 obtained by this process stores the smoothed z-axis component value N "z in each pixel.
From step S904 to step S906, the contour of the face in the normal image data 1003 can be matched with the contour of the subject in the color image data.

ステップＳ９０７において、法線補正部３０４は平滑化ｘ軸成分法線データ１１０４、平滑化ｙ軸成分法線データ１１０５、平滑化ｚ軸成分法線データ１１０６を統合し、平滑化法線画像データ１１０７を生成する（図１１（ｂ））。平滑化法線画像データ１１０７は画素（ｉ，ｊ）に法線ベクトル（Ｎ”ｘ（ｉ，ｊ）、Ｎ”ｙ（ｉ，ｊ）、Ｎ”ｚ（ｉ，ｊ））を格納した画像データである。 In step S907, the normal correction unit 304 integrates the smoothed x-axis component normal data 1104, the smoothed y-axis component normal data 1105, and the smoothed z-axis component normal data 1106, and the smoothed normal image data 1107. Is generated (FIG. 11 (b)). The smoothed normal image data 1107 is an image in which normal vectors (N "x (i, j), N" y (i, j), N "z (i, j)) are stored in pixels (i, j). It is data.

ステップＳ９０８において、平滑化法線画像データ１１０７の各画素に格納された法線ベクトルを単位ベクトルになるように正規化する。ステップＳ９０４からステップＳ９０６では、各軸成分ごとに平滑化処理を行ったため、画素によって格納されている法線ベクトルの大きさが異なる。これを補正するため、本ステップでは式（２）のように法線ベクトルの大きさが１になるように正規化を行う。 In step S908, the normal vector stored in each pixel of the smoothed normal image data 1107 is normalized so as to be a unit vector. In steps S904 to S906, since the smoothing process is performed for each axis component, the magnitude of the normal vector stored in each pixel is different. In order to correct this, in this step, normalization is performed so that the magnitude of the normal vector becomes 1 as in Eq. (2).

これにより、画素（ｉ，ｊ）に大きさ１の法線ベクトル（Ｎ’’’ｘ（ｉ，ｊ）、Ｎ’’’ｙ（ｉ，ｊ）、Ｎ’’’ｚ（ｉ，ｊ））を格納した補正法線データが取得される。 As a result, the normal vector of size 1 (N ′ ″ x (i, j), N ″ ″ y (i, j), N ″ ′ z (i, j) for the pixel (i, j). ) Is stored in the correction normal data.

以上により、法線補正部３０４は補正法線データを取得する。以上の処理によれば、被写体の顔に合わせて顔法線モデルを補正することができるので、ライティング処理において被写体の顔に対して自然な陰影を付与することができる。また、上記のように各座標軸成分について独立に平滑化処理を行うことで、法線方向が平滑化処理により大きく変わることを防ぐことができる。 As described above, the normal correction unit 304 acquires the correction normal data. According to the above processing, since the face normal model can be corrected according to the face of the subject, it is possible to give a natural shadow to the face of the subject in the lighting process. Further, by performing the smoothing process independently for each coordinate axis component as described above, it is possible to prevent the normal direction from being significantly changed by the smoothing process.

＜ライティング処理＞
ここでは、ステップＳ４１１で行われるライティング処理について説明する。本実施形態におけるライティング処理は、補正距離データ、補正法線データに基づき、ユーザ操作によって設定された照明パラメータに応じてカラー画像データに対して仮想光源を加える処理を行って補正画像を生成する処理である。以下、ライティング処理の詳細について図１２に示すフローチャートを参照して説明する。 <Lighting process>
Here, the lighting process performed in step S411 will be described. The lighting process in the present embodiment is a process of generating a corrected image by adding a virtual light source to the color image data according to the lighting parameters set by the user operation based on the correction distance data and the correction normal data. Is. Hereinafter, the details of the lighting process will be described with reference to the flowchart shown in FIG.

ステップＳ１２０１において、ライティング部３０５が制御部２０６からユーザによって設定された、ライティング処理に用いる照明パラメータを取得する。本実施形態では、ユーザは操作部１０７の操作により照明パラメータとして仮想照明の位置Ｑ、姿勢Ｕ、強度α、光源色Ｌを設定する。 In step S1201, the lighting unit 305 acquires the lighting parameters used for the lighting process set by the user from the control unit 206. In the present embodiment, the user sets the position Q, the posture U, the intensity α, and the light source color L of the virtual lighting as lighting parameters by operating the operation unit 107.

ステップＳ１２０２において、ライティング部３０５が補正距離データ８０４、法法線画像データ１００３、ステップＳ１１０１で取得された照明パラメータに基づいて、カラー画像データ５０１の画素値の補正を行う。本実施形態では式（３）に従ってカラー画像データの画素値を補正し、補正画像データをＩ’を生成するものとする。 In step S1202, the lighting unit 305 corrects the pixel value of the color image data 501 based on the correction distance data 804, the normal image data 1003, and the illumination parameters acquired in step S1101. In the present embodiment, the pixel value of the color image data is corrected according to the equation (3), and the corrected image data is generated as I'.

ここで、Ｉ’ｒ、Ｉ’ｇ、Ｉ’ｂは補正画像データＩ’の画素値、Ｌｒｍ、Ｌｇｍ、Ｌｂｍはｍ番目の照明の色、ｋｍはｍ番目の照明に対する画素値の補正度合いを表す。ｋｍは照明の明るさα、位置Ｑ、姿勢Ｕおよび画素（ｘ、ｙ）に対応する距離値、法線ベクトルＶに基づいて決定する。例えば式（４）のように求めることができる。 Here, I'r, I'g, and I'b are the pixel values of the corrected image data I', Lrm, Lgm, and Lbm are the m-th illumination colors, and km is the correction degree of the pixel values for the m-th illumination. show. km is determined based on the brightness α of the illumination, the position Q, the attitude U, the distance value corresponding to the pixel (x, y), and the normal vector V. For example, it can be obtained by the equation (4).

式（４）について図１３を用いて説明する。ｔは仮想光源による補正度合いを調整する補正係数である。本実施形態ではｔ＝１とする。αは照明の明るさを表す変数である。Ｑは光源の位置を表すベクトルである。Ｐは画素（ｉ、ｊ）の三次元的な位置を表すベクトルであり、補正距離データ８０４から下記のように算出される。まず、補正距離データ８０４の画素値に基づき、撮像装置１０１から各画素に対応する被写体位置までの仮想的な距離値を算出する。この際、補正距離データ８０４において画素値の大きな画素ほど撮像装置１０１からの距離が小さいものとする。続いて、ライティング部３０５は各画素に対応する仮想的な距離値と、撮像装置１０１の画角とカラー画像データ５０１の画像サイズなどに基づき、画素（ｉ、ｊ）の三次元的な位置Ｐを算出する。Ｗは画素（ｉ、ｊ）の位置Ｐから光源の位置Ｑまでの距離が大きくなるに従い大きな値を返す関数である。ρはＱからＰ（ｉ，ｊ）に向かうベクトルと、照明の姿勢Ｕのなす角度を表す。Ｋはρが小さいほど大きな値となるような関数である。Ｎ（ｉ，ｊ）は画素（ｉ、ｊ）に対応する法線ベクトル、Ｖ（ｉ，ｊ）はＱからＰ（ｉ，ｊ）に向かう方向を表す単位ベクトルである。本実施形態のように補正画像を生成することにより、照明の位置と被写体の形状に応じた明るさの補正が可能である。以上のように、仮想光源からの距離に応じて画素値を加算するライティング処理が行われる。以上の処理により、仮想光源に近く、仮想光源から画素（ｉ，ｊ）に向かうベクトルと法線ベクトルとのなす角が小さい画素ほど明るくなるように補正することができる。これにより、図１４に示すように、仮想照明により被写体を照らしたかのような補正画像１４０１を得る事ができる。 Equation (4) will be described with reference to FIG. t is a correction coefficient for adjusting the degree of correction by the virtual light source. In this embodiment, t = 1. α is a variable that represents the brightness of the illumination. Q is a vector representing the position of the light source. P is a vector representing the three-dimensional position of the pixel (i, j), and is calculated from the correction distance data 804 as follows. First, based on the pixel value of the correction distance data 804, a virtual distance value from the image pickup apparatus 101 to the subject position corresponding to each pixel is calculated. At this time, it is assumed that the larger the pixel value in the correction distance data 804, the smaller the distance from the image pickup apparatus 101. Subsequently, the lighting unit 305 uses the virtual distance value corresponding to each pixel, the angle of view of the imaging device 101, the image size of the color image data 501, and the like, and the three-dimensional positions P of the pixels (i, j). Is calculated. W is a function that returns a larger value as the distance from the position P of the pixel (i, j) to the position Q of the light source increases. ρ represents the angle formed by the vector from Q to P (i, j) and the attitude U of the illumination. K is a function such that the smaller ρ is, the larger the value is. N (i, j) is a normal vector corresponding to the pixel (i, j), and V (i, j) is a unit vector representing the direction from Q to P (i, j). By generating the corrected image as in the present embodiment, it is possible to correct the brightness according to the position of the illumination and the shape of the subject. As described above, the lighting process of adding the pixel values according to the distance from the virtual light source is performed. By the above processing, it is possible to correct the pixel so that it is closer to the virtual light source and the angle between the vector from the virtual light source toward the pixel (i, j) and the normal vector is smaller, the brighter the pixel. As a result, as shown in FIG. 14, it is possible to obtain a corrected image 1401 as if the subject was illuminated by virtual illumination.

ステップＳ１２０３において、ライティング部３０５は、画素値の補正を行った補正画像データを表示部１０６に表示して処理を終了する。ユーザは、ここで表示部１０６に表示された補正画像データを見て、照明パラメータの変更指示や画像の出力指示を入力する。 In step S1203, the lighting unit 305 displays the corrected image data in which the pixel value has been corrected on the display unit 106, and ends the process. The user sees the corrected image data displayed on the display unit 106, and inputs an instruction to change the lighting parameter and an instruction to output the image.

以上の処理によれば、被写体に合わせて変形した顔法線モデルを用いてライティング処理を行うことができるので、被写体の姿勢や表情に合わせた自然な陰影を付与した画像を得ることができる。 According to the above processing, since the lighting processing can be performed using the face normal model deformed according to the subject, it is possible to obtain an image with natural shading according to the posture and facial expression of the subject.

［実施形態２］
実施形態１では被写体の画素値によらずにカラー画像データの画素値を補正する例について生成した。実施形態２では、被写体の輝度値に基づいて補正量を制御する方法について説明する。被写体の輝度値に基づいて補正量を制御することにより、あらかじめ輝度値の高い領域を補正した場合に発生する白とびや、暗部を補正した際に発生するノイズ増加を抑制することができる。 [Embodiment 2]
In the first embodiment, an example of correcting the pixel value of the color image data regardless of the pixel value of the subject is generated. In the second embodiment, a method of controlling the correction amount based on the brightness value of the subject will be described. By controlling the correction amount based on the brightness value of the subject, it is possible to suppress overexposure that occurs when a region having a high brightness value is corrected in advance and noise increase that occurs when a dark part is corrected.

本実施形態の撮像装置１０１の構成と、基本的な処理の流れは実施形態１と同様であるので説明を省略する。実施形態２において実施形態１と異なる点は、ライティング処理部３０５で行われるライティング処理において、式４に示す補正係数ｔが各画素の輝度値によって決定される点である。 Since the configuration of the image pickup apparatus 101 of the present embodiment and the basic processing flow are the same as those of the first embodiment, the description thereof will be omitted. The difference between the second embodiment and the first embodiment is that the correction coefficient t shown in the equation 4 is determined by the luminance value of each pixel in the lighting process performed by the lighting processing unit 305.

本実施形態における補正係数ｔの決定方法の例を図１５を参照して説明する。図１５（ａ）では、あらかじめ設定したしきい値ｔｈ１、ｔｈ２に基づいて補正係数ｔが決定される例を示している。この例では、画素の輝度値Ｙが０≦Ｙ＜ｔｈ１の区間ではｔ＝１、ｔｈ１≦Ｙ＜ｔｈ２の区間ではｔが単調に減少、ｔｈ２≦Ｙの区間ではｔ＝０となるように補正係数ｔが決定される。このように補正係数ｔを決定すると、輝度値の大きな画素ほど仮想光源による補正度合いを小さくすることができる。そのため、仮想光源の影響により輝度値の高い画素が白とびするのを抑制する効果が得られる。なお、図１５（ａ）では、ｔｈ１≦Ｙ＜ｔｈ２の区間でｔがＹの一次関数となるように直線的に減少させているが、減少のさせかたはこれに限らない。ｔをＹの一次関数とした場合、補正画像Ｉ’はカラー画像データＩの画素値の二次関数として表現される。この場合、ｔｈ１≦Ｙ＜ｔｈ２の区間に補正画像の画素値が極大となり、階調反転が発生してしまう場合がある。これを抑制する手段として、ｔｈ１≦Ｙ＜ｔｈ２の区間における減少を二次曲線や三角関数等を利用して表現してもよい。こうすることによって、補正画像における階調反転の発生を抑制することができる。あるいは、図１５（ｂ）に示すように補正係数ｔを決めることもできる。図１５（ｂ）では、あらかじめ設定したしきい値ｔｈ１、ｔｈ２、ｔｈ３によって決定される補正係数の例を示している。この例では、輝度値Ｙが０≦Ｙ＜ｔｈ３の区間ではｔが単調増加、ｔｈ３≦Ｙ＜ｔｈ１の区間ではｔ＝１、ｔｈ１≦Ｙ＜ｔｈ２の区間ではｔが単調減少、ｔｈ２≦Ｙの区間ではｔ＝０となるように補正係数ｔが決定される。図１５（ｂ）のように補正係数ｔを設定することにより、白とびに加え輝度値の小さい暗部のノイズが、ライティング処理により強調されることを抑制することができる。また、輝度値Ｙが０≦Ｙ＜ｔｈ３の区間における増加や、ｔｈ１≦Ｙ＜ｔｈ２の区間における減少を二次曲線や三角関数等を利用して表現する方法することで、図１５（ａ）の場合と同様に、階調反転の発生を抑制することができる。 An example of a method for determining the correction coefficient t in the present embodiment will be described with reference to FIG. FIG. 15A shows an example in which the correction coefficient t is determined based on the preset threshold values th1 and th2. In this example, the pixel brightness value Y is corrected so that t = 1 in the section where 0 ≦ Y <th1, t decreases monotonically in the section where th1 ≦ Y <th2, and t = 0 in the section where th2 ≦ Y. The coefficient t is determined. When the correction coefficient t is determined in this way, the degree of correction by the virtual light source can be reduced as the pixel has a larger luminance value. Therefore, it is possible to obtain the effect of suppressing overexposure of pixels having a high luminance value due to the influence of the virtual light source. In FIG. 15A, t is linearly reduced so that t becomes a linear function of Y in the interval of th1 ≦ Y <th2, but the method of reduction is not limited to this. When t is a linear function of Y, the corrected image I'is expressed as a quadratic function of the pixel value of the color image data I. In this case, the pixel value of the corrected image becomes maximum in the interval of th1 ≦ Y <th2, and gradation inversion may occur. As a means of suppressing this, the decrease in the interval of th1 ≦ Y <th2 may be expressed by using a quadratic curve, a trigonometric function, or the like. By doing so, it is possible to suppress the occurrence of gradation inversion in the corrected image. Alternatively, the correction coefficient t can be determined as shown in FIG. 15 (b). FIG. 15B shows an example of the correction coefficient determined by the preset threshold values th1, th2, and th3. In this example, t increases monotonically in the section where the brightness value Y is 0 ≦ Y <th3, t = 1 in the section where th3 ≦ Y <th1, t decreases monotonically in the section where th1 ≦ Y <th2, and th2 ≦ Y. In the interval, the correction coefficient t is determined so that t = 0. By setting the correction coefficient t as shown in FIG. 15B, it is possible to suppress that the noise in the dark portion having a small brightness value in addition to the overexposure is emphasized by the lighting process. Further, by expressing the increase in the interval where the luminance value Y is 0 ≦ Y <th3 and the decrease in the section where th1 ≦ Y <th2 by using a quadratic curve, a trigonometric function, or the like, FIG. As in the case of, the occurrence of gradation inversion can be suppressed.

以上のように、本実施形態の処理によれば、あらかじめ輝度値の高い領域を補正した場合に発生する白とびや、暗部を補正した際に発生するノイズ増加を抑制することができる。 As described above, according to the processing of the present embodiment, it is possible to suppress overexposure that occurs when a region having a high luminance value is corrected in advance and noise increase that occurs when a dark portion is corrected.

［実施形態３］
実施形態１、実施形態２ではシーンに対して仮想光源を付与することにより暗く写っている被写体を明るくするようなライティング処理を行う例について説明した。実施形態３では、ストロボの発光の影響などで平坦に写ってしまった被写体に対して影を付与することにより、被写体の立体感を強調する方法について説明する。 [Embodiment 3]
In the first and second embodiments, an example of performing a lighting process for brightening a dark subject by applying a virtual light source to the scene has been described. In the third embodiment, a method of emphasizing the three-dimensional effect of the subject by adding a shadow to the subject which is projected flat due to the influence of the light emission of the strobe or the like will be described.

本実施形態の撮像装置１０１の構成と、基本的な処理の流れは実施形態１と同様であるので説明を省略する。実施形態３において実施形態１と異なる点は、ステップＳ１２０２で行われる画素値補正の処理が異なる点である。以下、本実施形態のステップＳ１２０２で行われる処理について説明する。本実施形態ノステップＳ１２０２では、実施形態１とは異なり、以下の式（５）に基づいて画素値の補正が行われる。 Since the configuration of the image pickup apparatus 101 of the present embodiment and the basic processing flow are the same as those of the first embodiment, the description thereof will be omitted. The difference between the third embodiment and the first embodiment is that the pixel value correction process performed in step S1202 is different. Hereinafter, the process performed in step S1202 of the present embodiment will be described. In the present embodiment Nostep S1202, unlike the first embodiment, the pixel value is corrected based on the following equation (5).

式（３）との違いは、ｋ’ｍに応じてカラー画像データの画素値が小さくなるように画素値を補正している点である。すなわち、本実施形態で行われるのは、仮想光源からの距離に応じて画素値を減算するライティング処理である。ｋ’ｍは照明の明るさα、位置Ｑ、姿勢Ｕおよび画素（ｘ、ｙ）に対応する距離値、法線ベクトルＶに基づいて決定する。例えば式（６）のように求めることができる。 The difference from the equation (3) is that the pixel value is corrected so that the pixel value of the color image data becomes smaller according to k'm. That is, what is performed in this embodiment is a lighting process in which the pixel value is subtracted according to the distance from the virtual light source. k'm is determined based on the brightness α of the illumination, the position Q, the attitude U, the distance value corresponding to the pixel (x, y), and the normal vector V. For example, it can be obtained as in Eq. (6).

式（４）との違いは、主に法線ベクトルＮ（ｉ，ｊ）とＶ（ｉ，ｊ）のなす角の影響である。式（４）では法線ベクトルＮが仮想光源方向を向いているほどｋの値は大きくなったが、式（６）では逆に法線ベクトルＮが仮想光源方向を向いているほどｋの値は小さくなる。つまり、式（６）により、仮想照明に近く法線ベクトルＮが仮想光源方向を向いていない画素ほど強い影を付与することができる。これにより、図１６に示す補正画像１６０１のように、法線画像データに基づき顔の頬や鼻にのみ影を付与することが可能となる。 The difference from the equation (4) is mainly the influence of the angle formed by the normal vectors N (i, j) and V (i, j). In equation (4), the value of k increased as the normal vector N pointed in the direction of the virtual light source, whereas in equation (6), the value of k increased as the normal vector N pointed in the direction of the virtual light source. Becomes smaller. That is, according to the equation (6), a pixel closer to the virtual illumination and the normal vector N does not face the direction of the virtual light source can give a stronger shadow. As a result, as in the corrected image 1601 shown in FIG. 16, it is possible to add shadows only to the cheeks and nose of the face based on the normal image data.

以上の処理によれば、ストロボの発光などの影響により平坦に写ってしまった被写体に対して立体感が出るように影を付与する補正を行うことができる。 According to the above processing, it is possible to perform correction for adding a shadow to a subject that appears flat due to the influence of light emission of a strobe or the like so as to give a three-dimensional effect.

［実施形態４］
上記の実施形態では、シーンに仮想的な光源を付与するリライティング処理と、画像に影を付与するリライティング処理とについて説明した。実施形態４では、上記の２つの処理を撮影条件に基づいて切り替える方法について説明する。図１７は、実施形態４における画像処理部２０９の動作手順を示すフローチャートである。実施形態１と比較し、新たにステップＳ１７０１とステップＳ１７０２とが加わっている点が異なる。 [Embodiment 4]
In the above embodiment, the rewriting process of giving a virtual light source to the scene and the rewriting process of giving a shadow to the image have been described. In the fourth embodiment, a method of switching between the above two processes based on the shooting conditions will be described. FIG. 17 is a flowchart showing an operation procedure of the image processing unit 209 according to the fourth embodiment. The difference is that step S1701 and step S1702 are newly added as compared with the first embodiment.

ステップＳ１７０１では、ライティング部３０５が、実光源に関する情報を取得する。ここで、実光源とは被写体を撮像する空間において実際に存在する光源のことである。本実施形態では、制御部２０６がユーザによるストロボ使用の指示や、ストロボ１０４からの入力信号に基づいてストロボ発光の有無を判定し、画像データの撮像時にストロボが用いられたかどうかをライティング部３０５に出力するとする。画像データの撮像時にストロボが用いられたと判定された場合は所定の位置Ｑ’に配置されたストロボが発光しているものとして取得される。なお、実光源の情報の取得方法はこれに限られず、例えば選択された被写体の顔領域の画素の平均輝度を求め、平均輝度が閾値以上である場合には撮像時にストロボが発光されたものとして判定してもよい。 In step S1701, the lighting unit 305 acquires information about the actual light source. Here, the actual light source is a light source that actually exists in the space where the subject is imaged. In the present embodiment, the control unit 206 determines whether or not the strobe is emitted based on the user's instruction to use the strobe and the input signal from the strobe 104, and informs the lighting unit 305 whether or not the strobe is used when capturing the image data. Suppose you want to output. If it is determined that the strobe has been used at the time of capturing the image data, it is acquired as if the strobe arranged at the predetermined position Q'is emitting light. The method of acquiring the information of the actual light source is not limited to this. For example, the average brightness of the pixels in the face region of the selected subject is obtained, and if the average brightness is equal to or more than the threshold value, it is assumed that the strobe is emitted at the time of imaging. You may judge.

ステップＳ１７０２では、ライティング部３０５が、ステップＳ１７０１で取得された実光源情報に基づいて、ライティング処理のモードを設定する。このステップでは、ステップＳ１７０１で、撮像時にストロボが発光されていないと判定された場合は、実施形態１に示す仮想光源を付与するライティングモードが設定される。そして、ステップＳ１７０１で、撮像時にストロボが発光されていると判定された場合は、実施形態３に示す影を付与するライティングモードが設定される。 In step S1702, the lighting unit 305 sets the lighting processing mode based on the actual light source information acquired in step S1701. In this step, if it is determined in step S1701 that the strobe is not emitting light at the time of imaging, a lighting mode for applying the virtual light source shown in the first embodiment is set. Then, in step S1701, when it is determined that the strobe is emitting light at the time of imaging, the lighting mode for imparting the shadow shown in the third embodiment is set.

そして、ステップＳ４１１では、ライティング部３０５が、ステップＳ１７０２で設定されたライティングモードに対応するライティング処理をカラー画像データに対して行い、補正画像データを生成する。 Then, in step S411, the lighting unit 305 performs the lighting process corresponding to the lighting mode set in step S1702 on the color image data to generate the corrected image data.

以上が本実施形態における処理の流れである。以上の処理によれば、被写体を撮像した時の光源の状態に応じて、適切なライティング処理を選択することができる。 The above is the flow of processing in this embodiment. According to the above processing, an appropriate lighting processing can be selected according to the state of the light source when the subject is imaged.

なお、本実施形態の処理は上記に限られるものではない。例えば、ステップＳ１７０１において実光源情報としてストロボ光の位置Ｑ’を取得し、ステップＳ４１１のライティング処理において、照明パラメータの初期値としてストロボ光の位置Ｑ’を入力するようにしてもよい。また、カラー画像データにおいて輝度が所定の閾値よりも大きな領域にはストロボ以外の実光源が存在しているとし、検出された実光源が被写体よりも撮像装置１０１に近い位置に存在する場合に、ライティングモードを影付与モードとするようにしてもよい。また、カラー画像データから実光源の位置を取得して、照明パラメータの初期値に入力してもよい。 The processing of this embodiment is not limited to the above. For example, the position Q'of the strobe light may be acquired as the actual light source information in step S1701, and the position Q'of the strobe light may be input as the initial value of the lighting parameter in the lighting process of step S411. Further, it is assumed that an actual light source other than the strobe exists in a region where the brightness is larger than a predetermined threshold value in the color image data, and when the detected actual light source exists at a position closer to the image pickup apparatus 101 than the subject. The lighting mode may be set to the shadow addition mode. Alternatively, the position of the actual light source may be acquired from the color image data and input to the initial value of the illumination parameter.

＜その他の実施形態＞
本発明の実施形態は上記に示す実施形態に限定されるものではない。例えば、ライティング処理において法線画像データを用いずに、被写体の距離情報を直接用いてライティング処理を行うようにしてもよい。この場合は、上記の式とは異なる計算式を用いる必要があるため処理が煩雑になるが、本発明と同様の効果を得ることができる。また、その際に顔法線モデルの代わりに所定の顔の３Ｄモデルを保持しておいてもよい。すなわち、本発明の実施において、被写体の３次元形状を示す情報を広く用いることが可能である。また、顔法線モデルの代わりに所定の顔の３Ｄモデルを保持しておき、変形した３Ｄモデルに基づいて法線情報を取得するようにしてもよい。 <Other Embodiments>
The embodiment of the present invention is not limited to the embodiment shown above. For example, the lighting process may be performed by directly using the distance information of the subject without using the normal image data in the lighting process. In this case, since it is necessary to use a calculation formula different from the above formula, the processing becomes complicated, but the same effect as that of the present invention can be obtained. At that time, a 3D model of a predetermined face may be held instead of the face normal model. That is, in carrying out the present invention, it is possible to widely use information indicating the three-dimensional shape of the subject. Further, a 3D model of a predetermined face may be held instead of the face normal model, and normal information may be acquired based on the deformed 3D model.

本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 The present invention supplies a program that realizes one or more functions of the above-described embodiment to a system or device via a network or storage medium, and one or more processors in the computer of the system or device reads and executes the program. It can also be realized by the processing to be performed. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

３０１現像部
３０２距離補正部
３０３顔検出部
３０４法線補正部
３０５ライティング部 301 Development unit 302 Distance correction unit 303 Face detection unit 304 Normal correction unit 305 Lighting unit

上記課題を解決するために、本発明に係る画像処理装置は、被写体を撮像することにより得られた撮像画像を取得する取得手段と、前記撮像画像において前記被写体の顔の領域を顔領域として特定する特定手段と、前記撮像画像における前記顔領域の少なくとも一部の明るさが変更された補正画像を生成する生成手段を有し、前記生成手段は、前記顔領域のうち仮想光源を向かない領域に対応する画素ほど強い影が付与されるよう前記明るさを変更し、該変更の度合いを、前記顔領域において輝度値が相対的に小さい画素では抑制することを特徴とする。 In order to solve the above problems, the image processing apparatus according to the present invention specifies an acquisition means for acquiring an captured image obtained by imaging a subject and a region of the face of the subject as a face region in the captured image. The specific means for generating the corrected image in which the brightness of at least a part of the face region in the captured image is changed is provided, and the generating means is a region of the face region that does not face the virtual light source. The brightness is changed so that a stronger shadow is given to the pixel corresponding to the above, and the degree of the change is suppressed in the pixel having a relatively small brightness value in the face region.

Claims

An acquisition means for acquiring an image obtained by photographing a subject, and
A specific means for identifying the face area of the subject as a face area in the color image, and
An image processing apparatus comprising a generation means for generating a corrected image in which the brightness of at least a part of the face region in the color image is changed.

The image processing apparatus according to claim 1, wherein the generation means generates an image whose brightness is changed according to the shape of the face as the correction image.

In the generation means, the degree to which the brightness in the face region is changed is the degree to which the brightness in the subject for which the specific means has specified the face region and the background region excluding the region in the vicinity of the subject is changed. The image processing apparatus according to claim 1, wherein the corrected image is generated so as to be larger than the above.

The image processing apparatus according to claim 1, wherein the specifying means specifies only the face of a person as the face region among the subjects in the color image.

The image processing apparatus according to claim 1, wherein the generation means imparts a shadow according to a shape in the face region.

Claim 1 is characterized in that the degree of change from the brightness of the face region in the color image to the brightness of the face region in the corrected image depends on the brightness of the face region in the color image. The image processing apparatus according to.

In the corrected image, the distance between the first pixel included in the face region and the image pickup device included in the face region and the image pickup device in the first pixel is the same, and the distance from the image pickup device is the same, and the first pixel A claim characterized in that, for a second pixel corresponding to a normal direction different from the normal direction, the degree of change from the brightness of each of the first pixel and the second pixel in the color image is different. Item 6. The image processing apparatus according to any one of Items 1 to 6.

The first aspect of the present invention is characterized in that the generation means does not change the brightness according to the normal direction of the subject in a region other than the face region specified by the specific means and the vicinity of the face region. The image processing apparatus described.

A program that causes a computer to function as the image processing device according to any one of claims 1 to 8.

It is an image processing method that generates a corrected image in which the brightness of a color image obtained by photographing a subject is changed.
Acquire the color image and
In the color image, the face area of the subject is specified as the face area, and the face area is specified.
An image processing method for generating the corrected image in which the brightness of at least a part of the face region in the color image is changed.