JP2021164063A

JP2021164063A - Image processing apparatus, image processing method, and program

Info

Publication number: JP2021164063A
Application number: JP2020063880A
Authority: JP
Inventors: 正明松岡; Masaaki Matsuoka
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2020-03-31
Filing date: 2020-03-31
Publication date: 2021-10-11
Anticipated expiration: 2040-03-31
Also published as: JP7451264B2

Abstract

To enable suppression of generation of an artifact due to insufficient extraction accuracy of a subject region.SOLUTION: An image processing apparatus includes: map obtaining means (301) configured to obtain an evaluation value distribution corresponding to an image as an evaluation value map; map generating means (301) configured to generate a first subject map based on a subject region extracted from the image using the evaluation value map; degree obtaining means (303) configured to obtain a sparse degree indicating a degree of a sparse region included in the first subject map; and correction means (303, 305) configured to execute correction processing on the image using at least any one of the first subject map and a second subject map (307) generated without using the evaluation value map. The correction means (303, 305) executes the correction processing using the second subject map preferentially rather than the first subject map, as the sparse degree becomes higher.SELECTED DRAWING: Figure 2

Description

本発明は、撮像された画像に対する画像処理技術に関する。 The present invention relates to an image processing technique for captured images.

従来、撮像画像から被写体領域を抽出し、被写体領域内だけ明るさを補正したり、被写体領域以外に背景ぼかし効果を付与したりするカメラが知られている。特許文献１では、デフォーカス量分布に基づいて被写体領域を抽出し、被写体領域以外をぼかすことで電子的に背景ぼかし効果を調節する技術が開示されている。 Conventionally, there are known cameras that extract a subject area from a captured image, correct the brightness only in the subject area, or apply a background blur effect to the area other than the subject area. Patent Document 1 discloses a technique of extracting a subject region based on a defocus amount distribution and electronically adjusting the background blur effect by blurring a region other than the subject region.

特開２００８−１５７５４号公報Japanese Unexamined Patent Publication No. 2008-15754

しかしながら、上述の特許文献に開示された従来技術では、デフォーカス量分布のヒストグラムを解析して被写体領域のデフォーカス量範囲を決定するため、人物などの被写体と壁などの背景が接近している場合はデフォーカス量範囲が精度よく決定できない。結果、被写体領域に背景の一部が疎らに含まれたり、逆に被写体領域の一部が疎らに欠けたりして、画像補正や画像効果が疎らに適用され出力画像に斑状等のアーティファクトが発生してしまう。 However, in the prior art disclosed in the above-mentioned patent document, since the histogram of the defocus amount distribution is analyzed to determine the defocus amount range of the subject area, the subject such as a person and the background such as a wall are close to each other. In that case, the defocus amount range cannot be determined accurately. As a result, part of the background is sparsely included in the subject area, or conversely, part of the subject area is sparsely lacking, and image correction and image effects are applied sparsely, causing artifacts such as spots in the output image. Resulting in.

そこで、本発明は、被写体領域の抽出精度不足によるアーティファクト発生を抑圧可能にすることを目的とする。 Therefore, an object of the present invention is to make it possible to suppress the occurrence of artifacts due to insufficient extraction accuracy of the subject area.

本発明の画像処理装置は、画像に対応した評価値分布を評価値マップとして取得するマップ取得手段と、前記評価値マップを用いて前記画像から抽出した被写体領域に基づく第１の被写体マップを生成するマップ生成手段と、前記第１の被写体マップに含まれる疎ら領域の度合を表す、疎ら度合を取得する度合取得手段と、前記第１の被写体マップと、前記評価値マップを用いずに生成された第２の被写体マップとの、少なくともいずれかを用いて前記画像に補正処理を行う補正手段と、を有し、前記補正手段は、前記疎ら度合が高いほど前記第１の被写体マップよりも前記第２の被写体マップを優先的に用いて、前記補正処理を行うことを特徴とする。 The image processing apparatus of the present invention generates a map acquisition means for acquiring an evaluation value distribution corresponding to an image as an evaluation value map, and a first subject map based on a subject area extracted from the image using the evaluation value map. The map generation means for acquiring the degree of sparseness, which represents the degree of the sparse area included in the first subject map, the first subject map, and the evaluation value map are not used. The image has a correction means for correcting the image using at least one of the second subject map, and the correction means has a higher degree of sparseness than the first subject map. The correction process is performed by preferentially using the second subject map.

本発明によれば、被写体領域の抽出精度不足によるアーティファクト発生を抑圧可能になる。 According to the present invention, it is possible to suppress the occurrence of artifacts due to insufficient extraction accuracy of the subject area.

実施形態に係るデジタルカメラの構成例を示す図である。It is a figure which shows the structural example of the digital camera which concerns on embodiment. 撮像部の構成を説明するための図である。It is a figure for demonstrating the structure of the image pickup part. 画像処理部の構成例を示す図である。It is a figure which shows the structural example of the image processing part. 被写体領域抽出部の動作を説明するための図である。It is a figure for demonstrating operation of a subject area extraction part. 被写体マップ合成部の動作を説明するための図である。It is a figure for demonstrating the operation of the subject map synthesis part. 疎ら判定部の構成を説明するための図である。It is a figure for demonstrating the structure of the sparseness determination part. 膨張フィルタ部の構成例を示す図である。It is a figure which shows the structural example of the expansion filter part. 収縮フィルタ部の構成例を示す図である。It is a figure which shows the structural example of the shrinkage filter part. ＭＡＸ／ＭＥＤＩＡＮ／ＭＩＮフィルタ部の動作フローチャートである。It is an operation flowchart of the MAX / MEDIAN / MIN filter unit. 静止画撮影時の制御部の動作フローチャートである。It is an operation flowchart of the control unit at the time of still image shooting. 疎ら判定部の他の構成を説明するための図である。It is a figure for demonstrating another structure of a sparseness determination part.

以下、本発明の実施形態を、添付の図面に基づいて詳細に説明する。なお、以下の実施形態において示す構成は一例に過ぎず、本発明は図示された構成に限定されるものではない。同一の構成または処理については、同じ参照符号を付して説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. The configuration shown in the following embodiments is only an example, and the present invention is not limited to the illustrated configuration. The same configuration or processing will be described with the same reference numerals.

図１は、本発明実施形態の画像処理装置の一適用例としての撮像装置（以下、デジタルカメラ１００とする）の概略的な構成例を示したブロック図である。
制御部１０１は、例えばＣＰＵである。ＲＯＭ１０２は、書き換え可能な不揮発性メモリであり、デジタルカメラ１００が備える各ブロックの動作を制御する動作プログラムに加え、各ブロックの動作に必要なパラメータ等を記憶する。制御部１０１は、ＲＯＭ１０２から動作プログラムを読み出し、ＲＡＭ１０３に展開して実行することにより、本実施形態のデジタルカメラ１００が備える各ブロックの動作を制御する。ＲＡＭ１０３は、書き換え可能な揮発性メモリであり、デジタルカメラ１００が備える各ブロックの動作において出力されたデータの一時的な記憶領域として用いられる。 FIG. 1 is a block diagram showing a schematic configuration example of an image pickup apparatus (hereinafter referred to as a digital camera 100) as an application example of the image processing apparatus according to the embodiment of the present invention.
The control unit 101 is, for example, a CPU. The ROM 102 is a rewritable non-volatile memory, and stores, in addition to an operation program for controlling the operation of each block included in the digital camera 100, parameters and the like necessary for the operation of each block. The control unit 101 reads an operation program from the ROM 102, expands it into the RAM 103, and executes it to control the operation of each block included in the digital camera 100 of the present embodiment. The RAM 103 is a rewritable volatile memory, and is used as a temporary storage area for data output in the operation of each block included in the digital camera 100.

光学系１０４は、被写体等の光学像を撮像部１０５の撮像面上に結像させる。
撮像部１０５は、例えばＣＣＤやＣＭＯＳセンサ等の撮像素子であり、光学系１０４により撮像素子に結像された光学像を光電変換し、得られた撮像信号（アナログ信号）をＡ／Ｄ変換部１０６に出力する。
Ａ／Ｄ変換部１０６は、入力された撮像信号にＡ／Ｄ変換処理を適用し、得られた撮像データ（デジタル撮像信号）をＲＡＭ１０３に出力して記憶させる。 The optical system 104 forms an optical image of a subject or the like on the imaging surface of the imaging unit 105.
The image pickup unit 105 is, for example, an image pickup element such as a CCD or a CMOS sensor. The optical image image formed on the image pickup device by the optical system 104 is photoelectrically converted, and the obtained image pickup signal (analog signal) is converted to an A / D converter. Output to 106.
The A / D conversion unit 106 applies the A / D conversion process to the input imaging signal, outputs the obtained imaging data (digital imaging signal) to the RAM 103, and stores the obtained imaging data (digital imaging signal).

画像処理部１０７は、ＲＡＭ１０３に記憶されている撮像データに対して、ホワイトバランス調整、色補間、縮小／拡大、フィルタリングなど、様々な画像処理を適用し、得られた画像データをＲＡＭ１０３に出力して記憶させる。本実施形態に係る後述する補正処理は、画像処理部１０７において行われる。 The image processing unit 107 applies various image processing such as white balance adjustment, color interpolation, reduction / enlargement, and filtering to the imaged data stored in the RAM 103, and outputs the obtained image data to the RAM 103. And memorize it. The correction process described later according to this embodiment is performed by the image processing unit 107.

記録媒体１０８は、着脱可能なメモリカード等であり、画像処理部１０７で画像処理がなされてＲＡＭ１０３に記憶されている画像データや、Ａ／Ｄ変換部１０６でＡ／Ｄ変換された撮像データなどを記録画像として記録する。
表示部１０９は、液晶ディスプレイ（ＬＣＤ）等の表示デバイスであり、撮像部１０５で取り込まれた被写体像をスルー表示するなど、様々な情報を表示する。撮像部１０５で取り込まれた被写体像をスルー表示する場合、表示部１０９は、ＥＶＦ（電子ビューファインダ）として機能する。 The recording medium 108 is a detachable memory card or the like, such as image data that has been image-processed by the image processing unit 107 and stored in the RAM 103, image data that has been A / D-converted by the A / D conversion unit 106, and the like. Is recorded as a recorded image.
The display unit 109 is a display device such as a liquid crystal display (LCD), and displays various information such as a through display of a subject image captured by the image pickup unit 105. When the subject image captured by the imaging unit 105 is displayed through, the display unit 109 functions as an EVF (electronic viewfinder).

ピントマップ処理部１１０は、撮像部１０５による撮像信号を解析することで、被写体等のピント分布に関連する情報をピントマップとして生成し、そのピントマップのデータをＲＡＭ１０３に出力して記憶させる。ピントマップ処理部１１０におけるピントマップの生成処理の詳細は後述する。本実施形態の場合、ピントマップ処理部１１０が取得した被写体等のピント分布に関連する情報であるピントマップは、撮像された画像に対する評価値分布を表した評価値マップとして用いられる。 The focus map processing unit 110 analyzes the image pickup signal by the image pickup unit 105 to generate information related to the focus distribution of the subject or the like as a focus map, and outputs the focus map data to the RAM 103 for storage. The details of the focus map generation process in the focus map processing unit 110 will be described later. In the case of the present embodiment, the focus map, which is the information related to the focus distribution of the subject or the like acquired by the focus map processing unit 110, is used as an evaluation value map showing the evaluation value distribution for the captured image.

図２は、撮像部１０５の撮像面の構成例を説明するための図である。
画素２０２は、マイクロレンズ２０１と一対の光電変換部２０３、２０４とから構成される。図１の撮像部１０５の撮像面には、それらマイクロレンズ２０１と一対の光電変換部２０３、２０４とで構成された画素２０２が、二次元的に規則的に配列されている。図２に示す構成の撮像部１０５では、二次元的に規則的に配列された各画素２０２の一対の光電変換部２０３、２０４の出力から、一対の画像としてＡ像、Ｂ像が出力される。すなわち、撮像部１０５によれば、図１の光学系１０４の瞳の異なる領域を通過する一対の光束を一対の光学像として結像させて、それらを一対の画像であるＡ像およびＢ像として出力することができる。 FIG. 2 is a diagram for explaining a configuration example of the imaging surface of the imaging unit 105.
The pixel 202 is composed of a microlens 201 and a pair of photoelectric conversion units 203 and 204. On the imaging surface of the imaging unit 105 of FIG. 1, pixels 202 composed of the microlens 201 and a pair of photoelectric conversion units 203 and 204 are arranged two-dimensionally and regularly. In the imaging unit 105 having the configuration shown in FIG. 2, A image and B image are output as a pair of images from the outputs of the pair of photoelectric conversion units 203 and 204 of the pixels 202 arranged two-dimensionally regularly. .. That is, according to the imaging unit 105, a pair of light fluxes passing through different regions of the pupil of the optical system 104 of FIG. 1 are imaged as a pair of optical images, and these are formed as a pair of images, A image and B image. Can be output.

図１のピントマップ処理部１１０は、それらＡ像とＢ像との位相差分布、つまり視点がそれぞれ異なる二つの画像群から取得される視差情報分布を、評価値マップ（ピントマップ）として出力する。Ａ像とＢ像の位相差分布としては、例えば特許文献１に開示されている手法を用いたデフォーカス量分布を取得すればよい。 The focus map processing unit 110 of FIG. 1 outputs the phase difference distribution between the A image and the B image, that is, the parallax information distribution acquired from two image groups having different viewpoints as an evaluation value map (focus map). .. As the phase difference distribution between the A image and the B image, for example, the defocus amount distribution using the method disclosed in Patent Document 1 may be obtained.

図３は、画像処理部１０７の構成例を示すブロック図である。図３に示すように、画像処理部１０７は、被写体領域抽出部３０１、疎ら判定部３０２、被写体マップ合成部３０３、及び補正処理部３０５を有して構成されている。 FIG. 3 is a block diagram showing a configuration example of the image processing unit 107. As shown in FIG. 3, the image processing unit 107 includes a subject area extraction unit 301, a sparseness determination unit 302, a subject map composition unit 303, and a correction processing unit 305.

被写体領域抽出部３０１は、ピントマップ処理部１１０から入力されるピントマップ（評価値マップ）を用いて被写体領域を抽出して被写体マップを生成する被写体マップ取得処理を行う。そして、被写体領域抽出部３０１は、その被写体マップの情報を、被写体マップ合成部３０３と疎ら判定部３０２とに出力する。なお被写体領域は、例えば特許文献１に開示されているデフォーカス量分布を用いて抽出することができる。 The subject area extraction unit 301 performs a subject map acquisition process of extracting a subject area using a focus map (evaluation value map) input from the focus map processing unit 110 to generate a subject map. Then, the subject area extraction unit 301 outputs the information of the subject map to the subject map synthesizing unit 303 and the sparse determination unit 302. The subject region can be extracted using, for example, the defocus amount distribution disclosed in Patent Document 1.

図４（ａ）〜図４（ｄ）は、被写体領域抽出部３０１における被写体マップ生成処理の動作を説明するための図である。
図４（ａ）は図１の撮像部１０５の撮像面４０１上に結像された被写体像を説明するための図である。図４（ａ）の例では、主被写体である人物４１２にピントが合っていて、その手前に人物４１１が立っているとする。図４（ａ）の例の場合、主被写体の人物４１２は、部屋の壁の直前に立っているとする。また図４（ａ）の場合、部屋内には家具４１３もあり、その家具４１３は背面が壁につくように設置されている。このため、主被写体の人物４１２にピントが合っている場合、その家具４１３もピントが合った状態になっているとする。一方、人物４１１は、人物４１２よりも手前に立っているため、その人物４１１にはピントが合っていない。 4 (a) to 4 (d) are diagrams for explaining the operation of the subject map generation process in the subject area extraction unit 301.
FIG. 4A is a diagram for explaining a subject image formed on the imaging surface 401 of the imaging unit 105 of FIG. In the example of FIG. 4A, it is assumed that the person 412, which is the main subject, is in focus, and the person 411 stands in front of the person 412. In the case of the example of FIG. 4A, it is assumed that the person 412 of the main subject stands in front of the wall of the room. Further, in the case of FIG. 4A, there is also furniture 413 in the room, and the furniture 413 is installed so that the back surface is attached to the wall. Therefore, when the person 412 of the main subject is in focus, it is assumed that the furniture 413 is also in focus. On the other hand, since the person 411 stands in front of the person 412, the person 411 is out of focus.

図４（ｂ）は、ピントマップ４０２を表した図である。図４（ｂ）に示したピントマップ４０２はグレースケールで表現されており、デフォーカス量が大きいほど白く表され、デフォーカス量が小さいほどグレーに表されている。なお図４（ｂ）中の領域４２２は図４（ａ）の人物４１２に対応した領域であり、領域４２３は図４（ａ）の家具４１３に対応した領域であり、領域４２１は図４（ａ）の人物４１１に対応した領域である。図４（ａ）の主被写体の人物４１２と家具４１３及びその背後の壁は距離が近くそれぞれデフォーカス量が小さいため、領域４２２と４２３および壁はグレーで表され、一方、手前側の人物４１１はデフォーカス量が大きいため、領域４２１は白で表されている。 FIG. 4B is a diagram showing the focus map 402. The focus map 402 shown in FIG. 4B is represented in gray scale, and the larger the defocus amount is, the whiter it is, and the smaller the defocus amount is, the grayer it is. The area 422 in FIG. 4B is the area corresponding to the person 412 in FIG. 4A, the area 423 is the area corresponding to the furniture 413 in FIG. 4A, and the area 421 is the area corresponding to FIG. 4 (a). This is the area corresponding to the person 411 in a). Since the main subject person 412 and furniture 413 in FIG. 4A and the wall behind them are close to each other and the amount of defocus is small, the areas 422 and 423 and the wall are shown in gray, while the person 411 on the front side. Since the amount of defocus is large, the area 421 is represented by white.

図４（ｄ）は、デフォーカス量頻度分布を示した図である。図４（ｄ）の頻度分布４０４は人物４１１のデフォーカス量頻度分布を示し、頻度分布４０５は人物４１２および家具４１３のデフォーカス量頻度分布を、頻度分布４０６は壁のデフォーカス量頻度分布を示している。前述したように人物４１２と家具４１３は壁に近いため、人物４１２及び家具４１３のデフォーカス量頻度分布４０５と、壁のデフォーカス量頻度分布４０６との境界（頻度分布の谷部分）は、不鮮明になっている。一方、人物４１２及び家具４１３や壁から離れている人物４１１のデフォーカス量頻度分布４０４は、それら人物４１２及び家具４１３や壁のデフォーカス量頻度分布４０５，４０６と明確に区別可能になっている。 FIG. 4D is a diagram showing a defocus amount frequency distribution. The frequency distribution 404 in FIG. 4 (d) shows the defocus amount frequency distribution of the person 411, the frequency distribution 405 shows the defocus amount frequency distribution of the person 412 and the furniture 413, and the frequency distribution 406 shows the defocus amount frequency distribution of the wall. Shown. As described above, since the person 412 and the furniture 413 are close to the wall, the boundary between the defocus frequency distribution 405 of the person 412 and the furniture 413 and the defocus frequency distribution 406 of the wall (the valley part of the frequency distribution) is unclear. It has become. On the other hand, the defocus frequency distribution 404 of the person 412 and the furniture 413 or the person 411 away from the wall is clearly distinguishable from the defocus frequency distributions 405 and 406 of the person 412 and the furniture 413 or the wall. ..

図４（ｃ）は、図４（ｄ）のデフォーカス量頻度分布に基づいて生成される被写体マップ４０３を示した図である。被写体マップは、白が２５５で黒が０の８ビットの２値で表されるマップである。白部分は、被写体領域を表す被写体ラベルとして用いられ、黒部分は被写体領域外（非被写体領域）を表す非被写体ラベルとして用いられる。このように、被写体マップは、白部分で表される被写体ラベルと、黒部分で表される非被写体ラベルとの、少なくとも二つのラベル領域にクラス分けされている。被写体マップの生成時には、デフォーカス量０を含むデフォーカス量頻度分布４０５のピークを挟む一方の谷ｄ１から他方の谷ｄ２までのＬ２範囲に含まれる領域が白（２５５）の被写体ラベルで表される被写体領域となされる。Ｌ２範囲外（Ｌ１範囲やＬ３範囲）の領域は黒（０）の非被写体ラベルで表される非被写体領域となされる。なお、図４（ｃ）中の領域４３２は図４（ａ）の人物４１２に対応した領域であり、領域４３３は図４（ａ）の家具４１３に対応した領域である。 FIG. 4C is a diagram showing a subject map 403 generated based on the defocus amount frequency distribution of FIG. 4D. The subject map is an 8-bit binary map in which white is 255 and black is 0. The white portion is used as a subject label representing the subject area, and the black portion is used as a non-subject label representing the outside of the subject area (non-subject area). As described above, the subject map is classified into at least two label areas, that is, a subject label represented by a white portion and a non-subject label represented by a black portion. When the subject map is generated, the area included in the L2 range from one valley d1 to the other valley d2 sandwiching the peak of the defocus amount frequency distribution 405 including the defocus amount 0 is represented by a white (255) subject label. It is the subject area. The area outside the L2 range (L1 range or L3 range) is a non-subject area represented by a black (0) non-subject label. The area 432 in FIG. 4C is the area corresponding to the person 412 in FIG. 4A, and the area 433 is the area corresponding to the furniture 413 in FIG. 4A.

ただし、図４（ｄ）のデフォーカス量頻度分布例の場合、デフォーカス量頻度分布４０５と４０６との境界が不鮮明である。このため、図４（ｃ）の被写体マップ４０３では、人物４１２に対応した領域４３２および家具４１３に対応した領域４３３だけでなく、部屋の壁の一部が被写体ラベルを表す白（２５５）の領域４３４として生成されている。この領域４３４は、本来の主被写体ではない領域であるため、主被写体の人物４１２のように纏まった領域にはならず、疎らに散らばった斑状等の小領域になることが多い。以下、被写体マップにおいて疎らに散らばった斑状等の各小領域４３４を、疎ら領域と呼ぶことにする。 However, in the case of the defocus amount frequency distribution example of FIG. 4D, the boundary between the defocus amount frequency distributions 405 and 406 is unclear. Therefore, in the subject map 403 of FIG. 4C, not only the area 432 corresponding to the person 412 and the area 433 corresponding to the furniture 413, but also a white (255) area in which a part of the wall of the room represents the subject label. Generated as 434. Since this region 434 is a region that is not the original main subject, it is not a region that is organized like the person 412 of the main subject, but is often a small region such as sparsely scattered spots. Hereinafter, each small area 434 such as spots scattered sparsely in the subject map will be referred to as a sparse area.

図３に説明を戻す。
疎ら判定部３０２は、被写体領域抽出部３０１によって生成された被写体マップにおいて疎ら領域を検出し、被写体マップ内に疎ら領域がどの程度含まれているかを示す疎ら度合を判定する疎ら度合取得処理を行う。そして、疎ら判定部３０２は、その取得した疎ら度合を表す情報を、被写体マップ合成部３０３に出力する。 The explanation is returned to FIG.
The sparseness determination unit 302 detects a sparse area in the subject map generated by the subject area extraction unit 301, and performs a sparseness degree acquisition process for determining the degree of sparseness indicating how much the sparse area is included in the subject map. .. Then, the sparseness determination unit 302 outputs the acquired information indicating the degree of sparseness to the subject map composition unit 303.

本実施形態の場合、疎ら判定部３０２は、被写体マップに含まれる各疎ら領域の面積を求め、それら疎ら領域の面積を基に当該被写体マップの疎ら度合を判定する。疎ら度合は一例として０〜１００％の割合を示す値となされており、疎ら判定部３０２は、被写体マップの全面積に対して疎ら領域の面積が相対的に大きくなるほど、当該被写体マップの疎ら度合を高い値にする。図４（ｃ）に例示した被写体マップ４０３の場合、壁の一部の小領域４１４が疎ら領域として検出され、被写体マップに対して疎ら領域の面積が相対的に大きいほど高い値の疎ら度合が出力される。なお、疎ら判定部３０２の構成、疎ら領域検出、および疎ら度合の判定処理等の詳細な説明は後述する。 In the case of the present embodiment, the sparseness determination unit 302 obtains the area of each sparse area included in the subject map, and determines the degree of sparseness of the subject map based on the area of the sparse area. The degree of sparseness is set to a value indicating a ratio of 0 to 100% as an example, and the sparseness determination unit 302 increases the degree of sparseness of the subject map as the area of the sparse area becomes relatively larger than the total area of the subject map. To a high value. In the case of the subject map 403 illustrated in FIG. 4C, a small area 414 of a part of the wall is detected as a sparse area, and the larger the area of the sparse area with respect to the subject map, the higher the degree of sparseness. It is output. A detailed description of the configuration of the sparseness determination unit 302, the detection of the sparse area, the determination process of the degree of sparseness, and the like will be described later.

そして、被写体マップ合成部３０３は、被写体領域抽出部３０１からの被写体マップと、予め用意された代替シルエット３０７とを、疎ら度合に基づいて合成し、その合成後の被写体マップを補正処理部３０５に出力する。詳細は後述するが、本実施形態の場合、被写体マップ合成部３０３では、疎ら度合が高いほど、代替シルエット３０７が被写体マップ４０３よりも優先的に用いられるように合成された合成後被写体マップが生成される。そしてこの場合、補正処理部３０５では、疎ら度合が高いほど代替シルエット３０７が被写体マップ４０３よりも優先的に用いられるように合成された合成後被写体マップに基づく補正処理が行われることになる。 Then, the subject map synthesizing unit 303 synthesizes the subject map from the subject area extraction unit 301 and the alternative silhouette 307 prepared in advance based on the degree of sparseness, and the combined subject map is sent to the correction processing unit 305. Output. Although the details will be described later, in the case of the present embodiment, the subject map compositing unit 303 generates a post-composite subject map synthesized so that the higher the degree of sparseness, the more preferentially the alternative silhouette 307 is used over the subject map 403. Will be done. In this case, the correction processing unit 305 performs correction processing based on the synthesized subject map so that the alternative silhouette 307 is used preferentially over the subject map 403 as the degree of sparseness increases.

図５（ａ）〜図５（ｄ）は、被写体マップ合成部３０３の動作を説明するための図である。
図５（ｂ）は代替シルエット３０７の一例を示した図である。図５（ｂ）に示した代替シルエット３０７は、人型５１０を含む固定形状マップである。前述した被写体マップはピントマップ（評価値マップ）を用いて生成された第１の被写体マップであり、一方、代替シルエットは評価値マップを用いずに予め生成されている第２の被写体マップである。代替シルエット３０７の情報は、例えば図１のＲＯＭ１０２が保持しており、画像処理部１０７において利用される時に、図１のＲＡＭ１０３に展開されて被写体マップ合成部３０３に送られる。なお、疎ら度合を基に代替シルエット３０７が合成される場合、画像処理部１０７では、例えば撮像画像から既知の人物画像認識処理などで人型の領域を検出する。そして、被写体マップ合成部３０３では、その検出位置に、代替シルエット３０７の人型５１０の位置を合わせるようにして合成するものとする。 5 (a) to 5 (d) are diagrams for explaining the operation of the subject map compositing unit 303.
FIG. 5B is a diagram showing an example of the alternative silhouette 307. The alternative silhouette 307 shown in FIG. 5B is a fixed shape map including the humanoid 510. The subject map described above is the first subject map generated using the focus map (evaluation value map), while the alternative silhouette is the second subject map generated in advance without using the evaluation value map. .. The information of the alternative silhouette 307 is held by, for example, the ROM 102 of FIG. 1, and when it is used by the image processing unit 107, it is expanded in the RAM 103 of FIG. 1 and sent to the subject map compositing unit 303. When the alternative silhouette 307 is synthesized based on the degree of sparseness, the image processing unit 107 detects a humanoid region by, for example, a known person image recognition process from the captured image. Then, the subject map synthesizing unit 303 synthesizes the humanoid 510 of the alternative silhouette 307 so as to match the detected position.

図５（ａ）のグラフ５０１は、代替シルエット使用率と疎ら度合との関係を示した図である。図５（ａ）の縦軸が代替シルエット使用率［％］を示し、横軸が疎ら度合［％］を示している。グラフ５０１に示すように、疎ら度合が第１の閾値ＴＨ１未満である場合には代替シルエット使用率が０％となされ、疎ら度合が第２の閾値ＴＨ２以上である場合には代替シルエット使用率が１００％となされる。また、疎ら度合が第１の閾値ＴＨ１以上で第２の閾値ＴＨ２未満である場合には、疎ら度合が高くなるほど、代替シルエット使用率が高くなる。 Graph 501 of FIG. 5A is a diagram showing the relationship between the alternative silhouette usage rate and the degree of sparseness. The vertical axis of FIG. 5A shows the alternative silhouette usage rate [%], and the horizontal axis shows the degree of sparseness [%]. As shown in Graph 501, when the degree of sparseness is less than the first threshold value TH1, the alternative silhouette usage rate is set to 0%, and when the degree of sparseness is greater than or equal to the second threshold value TH2, the alternative silhouette usage rate is set to 0%. It is made 100%. Further, when the degree of sparseness is equal to or higher than the first threshold value TH1 and less than the second threshold value TH2, the higher the degree of sparseness, the higher the usage rate of the alternative silhouette.

図５（ｄ）は図４（ｃ）に示した被写体マップ４０３を示した図である。
図５（ｃ）は、被写体マップ合成部３０３において、図５（ｄ）の被写体マップ４０３と図５（ｂ）の代替シルエット３０７とを、図５（ａ）のグラフ５０１の疎ら度合を基に合成した後の合成後被写体マップ５０３を示した図である。なお、被写体マップ合成部３０３における合成処理の詳細は後述する。 FIG. 5D is a diagram showing the subject map 403 shown in FIG. 4C.
5 (c) shows the subject map 403 of FIG. 5 (d) and the alternative silhouette 307 of FIG. 5 (b) in the subject map synthesizer 303 based on the degree of sparseness of the graph 501 of FIG. 5 (a). It is a figure which showed the subject map 503 after composition after composition. The details of the compositing process in the subject map compositing unit 303 will be described later.

図５（ｃ）の例は、被写体マップの疎ら度合が例えば図５（ａ）の第２の閾値ＴＨ２以上であったため、代替シルエット使用率が１００％になされた場合の合成後被写体マップ５０３を示している。図５（ｃ）の例では、図５（ｄ）の被写体マップ４０３内の図４（ａ）の家具４１３に対応した領域４３３が白（２５５）の被写体レベルとはならず黒（０）の非被写体レベルになるが、疎ら領域４３４についてはすべて黒（０）の非被写体レベルになっている。この合成後被写体マップ５０３が後段の補正処理部３０５で後述する補正処理に用いられた場合、疎ら領域に補正が行われて不要なアーティファクトが発生してしまうのを防ぐことができることになる。 In the example of FIG. 5 (c), since the degree of sparseness of the subject map is, for example, the second threshold value TH2 or more of FIG. 5 (a), the combined subject map 503 when the alternative silhouette usage rate is 100% is used. Shown. In the example of FIG. 5 (c), the area 433 corresponding to the furniture 413 of FIG. 4 (a) in the subject map 403 of FIG. 5 (d) does not become the subject level of white (255) and is black (0). Although it is a non-subject level, all the sparse areas 434 are black (0) non-subject levels. When the combined subject map 503 is used in the correction processing described later in the correction processing unit 305 in the subsequent stage, it is possible to prevent the sparse region from being corrected and unnecessary artifacts from being generated.

図５の説明では、代替シルエット３０７は予め用意され加工等されずに被写体マップ４０３と合成される例を挙げたが、本実施形態はこれに限定されるものではない。代替シルエット３０７を加工して被写体マップ４０３と合成してもよい。例えば、図４（ａ）の主被写体の人物４１２の顔器官位置、関節位置や姿勢情報などを検出し、その検出結果を基に、代替シルエット３０７の人型５１０の位置と形状を、主被写体の人物４１２の位置と形状に合うように変形や拡大・縮小等するようにしても良い。その他にも、代替シルエットは撮像された画像の解析を行うことで生成されてもよい。例えば、機械学習に基づいた意味的領域分割などの手法を使って、主被写体の人物４１２の人物マップを検出し、その人物マップを代替シルエットとして用いても良い。 In the description of FIG. 5, the alternative silhouette 307 is prepared in advance and combined with the subject map 403 without being processed or the like, but the present embodiment is not limited to this. The alternative silhouette 307 may be processed and combined with the subject map 403. For example, the facial organ position, joint position, posture information, etc. of the person 412 of the main subject in FIG. 4A are detected, and the position and shape of the humanoid 510 of the alternative silhouette 307 are determined based on the detection results. It may be deformed, enlarged / reduced, etc. so as to match the position and shape of the person 412. In addition, the alternative silhouette may be generated by analyzing the captured image. For example, a person map of the person 412 of the main subject may be detected by using a technique such as semantic region division based on machine learning, and the person map may be used as an alternative silhouette.

図３に説明を戻す。
加算部３０４は、Ａ像３０８とＢ像３０９の一対の視差画像が入力され、それらＡ像３０８とＢ像３０９の一対の視差画像を加算する。加算部３０４による加算後の画像（加算画像）は補正処理部３０５に送られる。 The explanation is returned to FIG.
The addition unit 304 inputs a pair of parallax images of the A image 308 and the B image 309, and adds the pair of parallax images of the A image 308 and the B image 309. The image after addition by the addition unit 304 (addition image) is sent to the correction processing unit 305.

補正処理部３０５は、加算画像の明るさを、合成後被写体マップに基づいて補正する。補正処理部３０５における補正処理は、以下の式（１）の演算により表される。なお、式（１）において、Ｘは加算画像の画素値、Ｇは合成後被写体マップの画素値、Ｙは補正処理が行われた後の画像の画素値である。この補正処理部３０５による補正処理後の画像は、画像処理部１０７における補正後画像３１０として出力される。 The correction processing unit 305 corrects the brightness of the added image based on the subject map after composition. The correction processing in the correction processing unit 305 is expressed by the calculation of the following equation (1). In the equation (1), X is the pixel value of the added image, G is the pixel value of the subject map after composition, and Y is the pixel value of the image after the correction process is performed. The image after the correction processing by the correction processing unit 305 is output as the corrected image 310 in the image processing unit 107.

Ｙ＝Ｘ・（１＋Ｇ／２５５）式（１） Y = X · (1 + G / 255) Equation (1)

式（１）は加算画像の明るさを合成後被写体マップに基づいて補正する補正処理の演算例であるため、補正後画像３１０は、着目被写体つまり主被写体である人物４１２にライトを照らしたようなライティング補正効果が付与された画像となる。本実施形態では、ライティング補正効果を付与する補正処理を挙げたが、補正処理はこの例に限定されるものではない。例えば、合成後被写体マップに基づいて加算画像にシャープネス調整を行う補正処理でも良く、この場合、着目被写体のシャープネスが向上した画像の取得が可能となる。その他にも、着目被写体に対する補正処理ではなく、合成後被写体マップに基づいて着目被写体の領域外の背景領域について背景ぼかしや背景コントラスト調整を行うような補正処理でもよい。この場合、着目被写体の領域外の背景がぼけた画像や背景コントラストが調整された画像の取得が可能となる。またこれらライティング補正、シャープネス調整、背景ぼかし、背景コントラスト調整等は、それぞれ別個に行われても良いし、二つ以上が組み合わされて行われても良い。 Since the equation (1) is an calculation example of the correction process for correcting the brightness of the added image based on the subject map after composition, the corrected image 310 seems to illuminate the subject of interest, that is, the person 412 which is the main subject. The image will have a good lighting correction effect. In the present embodiment, the correction process for imparting the lighting correction effect is mentioned, but the correction process is not limited to this example. For example, a correction process that adjusts the sharpness of the added image based on the combined subject map may be used, and in this case, it is possible to acquire an image with improved sharpness of the subject of interest. In addition, instead of the correction processing for the subject of interest, the correction processing may be performed such that the background blur or the background contrast is adjusted for the background area outside the area of the subject of interest based on the combined subject map. In this case, it is possible to acquire an image in which the background outside the region of the subject of interest is blurred or an image in which the background contrast is adjusted. Further, these lighting corrections, sharpness adjustments, background blurring, background contrast adjustments, etc. may be performed separately, or may be performed in combination of two or more.

図６（ａ）は、図３の疎ら判定部３０２の構成例を示す図であり、図６（ｂ）〜図６（ｇ）は図６（ａ）の構成における動作を説明するための図である。
図６（ａ）に示すように、疎ら判定部３０２は、膨張フィルタ部６０１、収縮フィルタ部６０２、差分検出部６０３、差分検出部６０４、マップ統合部６０５、ＭＥＤＩＡＮフィルタ部６０６、及び疎ら度合算出部６０７を有する。 6 (a) is a diagram showing a configuration example of the sparseness determination unit 302 of FIG. 3, and FIGS. 6 (b) to 6 (g) are diagrams for explaining the operation in the configuration of FIG. 6 (a). Is.
As shown in FIG. 6A, the sparseness determination unit 302 includes an expansion filter unit 601, a contraction filter unit 602, a difference detection unit 603, a difference detection unit 604, a map integration unit 605, a median filter unit 606, and a sparseness degree calculation. It has a part 607.

膨張フィルタ部６０１は、入力被写体マップ６０８の白（２５５）の被写体ラベル部分を膨張させるフィルタ部である。図６（ｂ）は、入力被写体マップ６０８が前述の図４（ｃ）に示した被写体マップ４０３である場合に、その被写体マップ４０３を膨張フィルタ部６０１にて膨張フィルタ処理した後の、膨張後被写体マップ６１１を示した図である。すなわち図４（ｃ）の被写体マップ４０３に対して膨張フィルタ処理が行われた場合、被写体マップ４０３内で互いに近い白部分同士（被写体ラベル部分同士）が繋がった、図６（ｂ）に示すような膨張後被写体マップ６１１が生成される。本実施形態の場合、膨張後被写体マップ６１１は第３の被写体マップに相当する。 The expansion filter unit 601 is a filter unit that expands the white (255) subject label portion of the input subject map 608. FIG. 6B shows that when the input subject map 608 is the subject map 403 shown in FIG. 4C described above, the subject map 403 is expanded and filtered by the expansion filter unit 601 after expansion. It is a figure which showed the subject map 611. That is, when the expansion filter processing is performed on the subject map 403 of FIG. 4 (c), the white portions (subject label portions) that are close to each other in the subject map 403 are connected, as shown in FIG. 6 (b). After expansion, the subject map 611 is generated. In the case of the present embodiment, the expanded subject map 611 corresponds to the third subject map.

収縮フィルタ部６０２は、入力被写体マップ６０８の白（２５５）の被写体ラベルを収縮させるフィルタ部である。図６（ｃ）は、入力被写体マップ６０８が図４（ｃ）に示した被写体マップ４０３である場合に、その被写体マップ４０３を収縮フィルタ部６０２にて収縮フィルタ処理した後の、収縮後被写体マップ６１２を示した図である。すなわち図４（ｃ）の被写体マップ４０３に対して収縮フィルタ処理が行われた場合、被写体マップ４０３内で互いに近い黒部分同士が繋がった、図６（ｃ）に示すような収縮後被写体マップ６１２が生成される。本実施形態の場合、収縮後被写体マップ６１２は第４の被写体マップに相当する。 The shrink filter unit 602 is a filter unit that shrinks the white (255) subject label of the input subject map 608. FIG. 6 (c) shows a post-shrinkage subject map after the input subject map 608 is the subject map 403 shown in FIG. 4 (c) and the subject map 403 is shrink-filtered by the shrink filter unit 602. It is a figure which showed 612. That is, when the subject map 403 of FIG. 4 (c) is subjected to the shrinkage filter processing, the black portions close to each other in the subject map 403 are connected to each other, and the contracted subject map 612 as shown in FIG. 6 (c). Is generated. In the case of the present embodiment, the contracted subject map 612 corresponds to the fourth subject map.

差分検出部６０３は、膨張フィルタ処理前後の被写体マップで差分があるところを白（２５５）の被写体ラベルとし、それ以外を黒（０）の非被写体ラベルとするような差分検出処理を行う。図６（ｄ）は、膨張フィルタ処理前である図４（ｃ）の被写体マップ４０３と、膨張フィルタ処理後である図６（ｂ）の膨張後被写体マップ６１１とから、差分検出部６０３が差分検出処理を行った後の、差分検出マップ６１３を示した図である。すなわち図４（ｃ）の被写体マップ４０３と図６（ｂ）の膨張後被写体マップ６１１との差分検出処理が行われた場合、差分部分が白（２５５）となり、それ以外が黒（０）の非被写体ラベルとなった、図６（ｄ）に示すような差分検出マップ６１３が生成される。本実施形態の場合、差分検出マップ６１３は第１の疎ら領域マップに相当する。 The difference detection unit 603 performs the difference detection process so that the white (255) subject label is used for the difference in the subject map before and after the expansion filter processing, and the black (0) non-subject label is used for the other parts. In FIG. 6 (d), the difference detection unit 603 is different from the subject map 403 of FIG. 4 (c) before the expansion filter processing and the post-expansion subject map 611 of FIG. 6 (b) after the expansion filter processing. It is a figure which showed the difference detection map 613 after performing the detection process. That is, when the difference detection process between the subject map 403 of FIG. 4 (c) and the expanded subject map 611 of FIG. 6 (b) is performed, the difference portion becomes white (255) and the other portion becomes black (0). A difference detection map 613 as shown in FIG. 6D, which is a non-subject label, is generated. In the case of the present embodiment, the difference detection map 613 corresponds to the first sparse area map.

差分検出部６０４は、収縮フィルタ処理前後の被写体マップで差分があるところを白（２５５）の被写体ラベルとし、それ以外の黒（０）の非被写体ラベルとするような差分検出処理を行う。図６（ｅ）は、収縮フィルタ処理前である図４（ｃ）の被写体マップ４０３と、収縮フィルタ処理後である図６（ｃ）の収縮後被写体マップ６１２とから、差分検出部６０４が差分検出処理を行った後の、差分検出マップ６１４を示した図である。つまり図４（ｃ）の被写体マップ４０３と図６（ｃ）の収縮後被写体マップ６１２との差分検出処理によれば、差分部分が白（２５５）の被写体ラベルで、それ以外が黒（０）の非被写体ラベルとなる図６（ｅ）に示すような差分検出マップ６１４が生成される。本実施形態の場合、差分検出マップ６１４は第２の疎ら領域マップに相当する。 The difference detection unit 604 performs the difference detection process so that the white (255) subject label is used as the difference in the subject map before and after the shrinkage filter processing, and the other black (0) non-subject label is used. In FIG. 6 (e), the difference detection unit 604 is different from the subject map 403 of FIG. 4 (c) before the shrinkage filter processing and the post-shrinkage subject map 612 of FIG. 6 (c) after the contraction filter processing. It is a figure which showed the difference detection map 614 after performing the detection process. That is, according to the difference detection process between the subject map 403 of FIG. 4 (c) and the contracted subject map 612 of FIG. 6 (c), the difference portion is the subject label of white (255), and the rest is black (0). The difference detection map 614 as shown in FIG. 6E, which is the non-subject label of the above, is generated. In the case of the present embodiment, the difference detection map 614 corresponds to the second sparse area map.

これら差分検出部６０３、６０４における差分検出処理は、以下の式（２）で表される。なお、式（２）において、Ｘ０及びＸ１は差分検出部へ入力される被写体マップである。つまりＸ０とＸ１は、差分検出部６０３の場合には膨張フィルタ処理前後の被写体マップであり、差分検出部６０４の場合には膨張フィルタ処理前後の被写体マップである。また式（２）において、ＡＢＳは絶対値関数、Ｓは差分検出部の出力である。 The difference detection process in the difference detection units 603 and 604 is represented by the following equation (2). In the equation (2), X0 and X1 are subject maps input to the difference detection unit. That is, X0 and X1 are subject maps before and after the expansion filter processing in the case of the difference detection unit 603, and subject maps before and after the expansion filter processing in the case of the difference detection unit 604. Further, in the equation (2), ABS is an absolute value function, and S is an output of the difference detection unit.

Ｓ＝ＡＢＳ（Ｘ０−Ｘ１）式（２） S = ABS (X0-X1) Equation (2)

このように、膨張フィルタ部６０１による膨張フィルタ処理前後の被写体マップを用い、差分検出部６０３で差分検出処理を行うように構成することで、疎ら領域における黒部分を検出することができることになる。つまり膨張フィルタ部６０１の膨張フィルタ処理で被写体マップの疎ら領域の黒部分（非被写体ラベル）が変化（白の被写体ラベルに変化）することになり、さらに差分検出部６０３で差分検出処理で疎ら領域における黒部分を検出することができることになる。また、収縮フィルタ部６０２による収縮フィルタ処理前後の被写体マップを用い、差分検出部６０４で差分検出処理を行うように構成することで、疎ら領域における白部分を検出することができることになる。つまり収縮フィルタ部６０２の収縮フィルタ処理によって被写体マップの疎ら領域の白部分が変化（黒部分に変化）することになり、さらに差分検出部６０４で差分検出処理を行うことで、その疎ら領域における白部分を検出することができることになる。 In this way, by using the subject map before and after the expansion filter processing by the expansion filter unit 601 and configuring the difference detection unit 603 to perform the difference detection processing, it is possible to detect the black portion in the sparse region. That is, the black part (non-subject label) of the sparse area of the subject map changes (changes to the white subject label) by the expansion filter processing of the expansion filter unit 601, and further, the sparse area by the difference detection process by the difference detection unit 603. It will be possible to detect the black part in. Further, by using the subject map before and after the contraction filter processing by the contraction filter unit 602 and configuring the difference detection unit 604 to perform the difference detection process, the white portion in the sparse region can be detected. That is, the white part of the sparse area of the subject map changes (changes to the black part) by the shrinkage filter processing of the shrinkage filter unit 602, and the difference detection process of the difference detection unit 604 further changes the white part in the sparse area. The part can be detected.

なお本実施形態では、差分検出部６０３，６０４における差分検出処理を式（２）で表される演算としたが、この例に限定されるものではない。例えば、白（２５５）の部分をＴＲＵＥ（真）、黒（０）の部分をＦＡＬＳＥ（偽）としたうえで、ＸＯＲ（排他的論理和）の論理演算を行って差分検出を行うようにしても良い。 In the present embodiment, the difference detection process in the difference detection units 603 and 604 is an operation represented by the equation (2), but the present invention is not limited to this example. For example, after the white (255) part is TRUE (true) and the black (0) part is FALSE (false), the difference is detected by performing the logical operation of XOR (exclusive OR). Is also good.

差分検出部６０３による差分検出マップと、差分検出部６０４による差分検出マップとは、マップ統合部６０５に入力される。
マップ統合部６０５は、入力された差分検出マップのいずれかが白なら白（２５５）の被写体ラベルとし、それ以外を黒（０）の非被写体ラベルにして出力するマップ統合処理を行う。図６（ｆ）は、図６（ｄ）に示した差分検出マップ６１３と、図６（ｅ）に示した差分検出マップ６１４とを、マップ統合処理した後の統合マップ６１５を示した図である。 The difference detection map by the difference detection unit 603 and the difference detection map by the difference detection unit 604 are input to the map integration unit 605.
If any of the input difference detection maps is white, the map integration unit 605 sets the subject label as white (255), and sets the other labels as non-subject labels of black (0) and outputs the map integration process. FIG. 6 (f) is a diagram showing an integrated map 615 after the difference detection map 613 shown in FIG. 6 (d) and the difference detection map 614 shown in FIG. 6 (e) are subjected to map integration processing. be.

マップ統合部６０５におけるマップ統合処理は、以下の式（３）で表される。なお、式（３）において、Ｘ２およびＸ３はマップ統合部へ入力される差分検出マップである。また式（３）において、ＣＬＩＰは０以下なら０、２５５以上なら２５５に値をクリップするクリップ関数、Ｉはマップ統合部の出力である。 The map integration process in the map integration unit 605 is represented by the following equation (3). In the equation (3), X2 and X3 are difference detection maps input to the map integration unit. Further, in the equation (3), CLIP is a clip function that clips the value to 0 if it is 0 or less and 255 if it is 255 or more, and I is the output of the map integration unit.

Ｉ＝ＣＬＩＰ（Ｘ２＋Ｘ３）式（３） I = CLIP (X2 + X3) Equation (3)

マップ統合部６０５において式（３）で表されるようなマップ統合処理が行われることで、図４（ｃ）の被写体マップ４０３から疎ら領域全体を検出することができる。すなわち、図６（ｆ）に示した統合マップ６１５は、被写体マップ４０３から検出された疎ら領域マップ６１５となされている。 By performing the map integration process as represented by the equation (3) in the map integration unit 605, the entire sparse region can be detected from the subject map 403 of FIG. 4 (c). That is, the integrated map 615 shown in FIG. 6 (f) is a sparse area map 615 detected from the subject map 403.

なお本実施形態では、マップ統合部６０５において式（３）の演算を行う例を挙げたが、これに限定されるものではない。例えば白（２５５）の部分をＴＲＵＥとし、黒（０）の部分をＦＡＬＳＥとしたうえで、ＯＲ（論理和）の論理演算を行ってマップ統合を行うようにしても良い。 In the present embodiment, an example in which the calculation of the equation (3) is performed in the map integration unit 605 is given, but the present invention is not limited to this. For example, the white (255) part may be TRUE, the black (0) part may be FALSE, and then OR (logical sum) logical operation may be performed to integrate the map.

また本実施形態においては、差分検出部６０３による疎ら領域の黒部分検出結果と、差分検出部６０４による疎ら領域の白部分検出結果との両方を疎ら領域として評価する構成としたが、これに限定されるものではない。例えば、差分検出部６０３の検出結果だけ用いても良いし、逆に差分検出部６０４の検出結果だけ用いるようにしても良い。 Further, in the present embodiment, both the black portion detection result of the sparse region by the difference detection unit 603 and the white portion detection result of the sparse region by the difference detection unit 604 are evaluated as the sparse region, but the configuration is limited to this. It is not something that is done. For example, only the detection result of the difference detection unit 603 may be used, or conversely, only the detection result of the difference detection unit 604 may be used.

図６（ａ）のＭＥＤＩＡＮフィルタ部６０６は、マップ統合部６０５にて生成された疎ら領域マップ６１５に対し、メディアンフィルタをかけるフィルタ処理部である。図６（ｇ）は、図６（ｆ）に示した疎ら領域マップ６１５にメディアンフィルタ処理が行われた後の疎ら領域マップ６１６を示した図である。疎ら領域マップ６１５に対するメディアンフィルタ処理は、当該疎ら領域マップ６１５の孤立領域を除去する孤立領域除去処理となる。すなわち疎ら領域マップ６１５に対してメディアンフィルタ処理が行われると、疎ら領域マップ６１５内の細い線や細かい点等を除去することができ、被写体の輪郭部などで発生する疎ら領域マップの誤判定領域を除去することができる。 The MEDIAN filter unit 606 of FIG. 6A is a filter processing unit that applies a median filter to the sparse area map 615 generated by the map integration unit 605. FIG. 6 (g) is a diagram showing a sparse region map 616 after the median filter processing is performed on the sparse region map 615 shown in FIG. 6 (f). The median filter process for the sparse area map 615 is an isolated area removal process for removing the isolated area of the sparse area map 615. That is, when the median filter processing is performed on the sparse area map 615, fine lines and fine points in the sparse area map 615 can be removed, and an erroneous determination area of the sparse area map generated in the contour portion of the subject or the like can be removed. Can be removed.

図６（ａ）の疎ら度合算出部６０７は、メディアンフィルタ処理後の疎ら領域マップ６１６の白の被写体ラベル部分の画素数を計測し、その計測画素数がマップ内の全画素数に対して占める割合を、疎ら度合として算出するような疎ら度合算出処理を行う。そして、疎ら度合算出部６０７は、算出した疎ら度合６０９の情報を前述した図３の被写体マップ合成部３０３へ出力する。なお本実施形態では、疎ら度合を画素数の割合として算出したが、例えば疎ら領域マップの白の被写体ラベル部分の画素数をそのまま疎ら度合の値としても良い。 The sparseness degree calculation unit 607 of FIG. 6A measures the number of pixels of the white subject label portion of the sparse area map 616 after the median filter processing, and the measured pixel number occupies the total number of pixels in the map. The sparseness degree calculation process is performed so that the ratio is calculated as the sparseness degree. Then, the sparseness degree calculation unit 607 outputs the calculated information of the sparseness degree degree 609 to the subject map synthesis unit 303 of FIG. 3 described above. In the present embodiment, the degree of sparseness is calculated as a ratio of the number of pixels, but for example, the number of pixels of the white subject label portion of the sparse area map may be used as the value of the degree of sparseness as it is.

図７は、図６（ａ）の膨張フィルタ部６０１の構成例を示した図である。膨張フィルタ部６０１は、ＭＡＸフィルタ部７０１、ＭＥＤＩＡＮフィルタ部７０２、およびＭＩＮフィルタ部７０３を有する。
図８は、図６（ａ）の収縮フィルタ部６０２の構成例を示した図である。収縮フィルタ部６０２は、ＭＩＮフィルタ部８０１、ＭＥＤＩＡＮフィルタ部８０２、およびＭＡＸフィルタ部８０３を有する。
これら図７と図８に示されたＭＡＸフィルタ部７０１と８０３、ＭＥＤＩＡＮフィルタ部７０２と８０２、ＭＩＮフィルタ部７０３と８０１の各動作を、図９のフローチャートを用いて説明する。なお図９のフローチャートでは、ＭＡＸフィルタ、ＭＥＤＩＡＮフィルタ部、およびＭＩＮフィルタ部を区別せずに、単に、フィルタ部と呼ぶ。 FIG. 7 is a diagram showing a configuration example of the expansion filter unit 601 of FIG. 6A. The expansion filter unit 601 includes a MAX filter unit 701, a MEDIAn filter unit 702, and a MIN filter unit 703.
FIG. 8 is a diagram showing a configuration example of the contraction filter unit 602 of FIG. 6A. The contraction filter unit 602 includes a MIN filter unit 801 and a MEDIAn filter unit 802, and a MAX filter unit 803.
The operations of the MAX filter units 701 and 803, the MEDIA filter units 702 and 802, and the MIN filter units 703 and 801 shown in FIGS. 7 and 8 will be described with reference to the flowchart of FIG. In the flowchart of FIG. 9, the MAX filter, the MEDIAn filter unit, and the MIN filter unit are not distinguished and are simply referred to as a filter unit.

ステップＳ９０１において、フィルタ部は、図６（ａ）の入力被写体マップ６０８の着目画素それぞれについて、その着目画素の周辺画素の値を積算し、その積算値ΣＰｉｘと閾値ｔｈとを比較する。そして、フィルタ部は、積算値ΣＰｉｘが閾値ｔｈ以上であればステップＳ９０２へ処理を進めて白の値（２５５）を出力し、一方、積算値ΣＰｉｘが閾値ｔｈ未満である場合にはステップＳ９０３に処理を進めて黒の値（０）を出力する。 In step S901, the filter unit integrates the values of the peripheral pixels of the pixel of interest for each of the pixels of interest in the input subject map 608 of FIG. 6A, and compares the integrated value ΣPix with the threshold value th. Then, if the integrated value ΣPix is equal to or more than the threshold value th, the process proceeds to step S902 and outputs a white value (255), while if the integrated value ΣPix is less than the threshold value th, the process proceeds to step S903. The process proceeds and the black value (0) is output.

ここで、着目画素に対する周辺画素の範囲（参照範囲とする）が、着目画素を中心として縦横７×７画素の範囲である場合、閾値ｔｈを２５５×１＝２５５とすることで、当該フィルタ部はＭＡＸフィルタ部として動作する。また、閾値ｔｈを２５５×７×７＝１２４９５とすることで、当該フィルタ部はＭＩＸフィルタ部として動作する。また、閾値ｔｈを２５５×（７×７／２）＝２５５×（２４．５）＝２５５×２５＝６３７５とすることで、当該フィルタ部はＭＥＤＩＡＮフィルタ部として動作する。 Here, when the range of peripheral pixels (referred to as a reference range) with respect to the pixel of interest is a range of 7 × 7 pixels in the vertical and horizontal directions centered on the pixel of interest, the threshold value th is set to 255 × 1 = 255, so that the filter unit is concerned. Operates as a MAX filter unit. Further, by setting the threshold value th to 255 × 7 × 7 = 12495, the filter unit operates as a MIX filter unit. Further, by setting the threshold value th to 255 × (7 × 7/2) = 255 × (24.5) = 255 × 25 = 6375, the filter unit operates as a median filter unit.

この図９のフローチャートの動作により、図７の膨張フィルタ部６０１の場合、ＭＡＸフィルタ部７０１では、入力被写体マップ６０８の白（２５５）の被写体ラベル部分が一律に膨張し、疎ら領域の黒（０）の非被写体ラベル部分が白で埋まるようになる。その後、ＭＩＮフィルタ部７０３により被写体の輪郭部が膨張したところは収縮させて元に戻す。これにより、疎ら領域の黒（０）の非被写体ラベル部分だけを変化させることができる。さらに、ＭＥＤＩＡＮフィルタ部７０２により、疎ら領域の黒（０）の非被写体ラベル部分の埋め残しを白（２５５）で埋める。これにより、後段のＭＩＮフィルタ部７０３の処理で黒（０）の非被写体ラベル部分の埋め残しが再度広がることがない。 Due to the operation of the flowchart of FIG. 9, in the case of the expansion filter unit 601 of FIG. 7, in the MAX filter unit 701, the white (255) subject label portion of the input subject map 608 is uniformly expanded, and the black (0) of the sparse region is uniformly expanded. ) Non-subject label part will be filled with white. After that, the portion where the contour portion of the subject is expanded by the MIN filter unit 703 is contracted and restored. As a result, only the black (0) non-subject label portion in the sparse region can be changed. Further, the median filter unit 702 fills the unfilled portion of the black (0) non-subject label portion in the sparse region with white (255). As a result, the unfilled portion of the black (0) non-subject label portion does not spread again in the processing of the MIN filter unit 703 in the subsequent stage.

また図８の収縮フィルタ部６０２の場合、ＭＩＮフィルタ部８０１では、入力被写体マップ６０８の白（２５５）の部分が一律に収縮し、疎ら領域の白の被写体ラベル部分が黒（０）で埋まる。その後、ＭＡＸフィルタ部８０３により被写体輪郭部が収縮したところは膨張させて元に戻す。これにより、疎ら領域の白部分だけを変化させることができる。さらにＭＥＤＩＡＮフィルタ部８０２により、疎ら領域の白の被写体ラベル部分の埋め残しを黒で埋めることで、後段のＭＡＸフィルタ部８０３の処理で白部分の埋め残しが再度広がることがない。 Further, in the case of the contraction filter unit 602 of FIG. 8, in the MIN filter unit 801 the white (255) portion of the input subject map 608 is uniformly contracted, and the white subject label portion of the sparse region is filled with black (0). After that, the part where the subject contour portion is contracted by the MAX filter unit 803 is expanded and restored. As a result, only the white portion of the sparse region can be changed. Further, the median filter unit 802 fills the unfilled portion of the white subject label portion in the sparse region with black, so that the unfilled portion of the white portion does not spread again in the processing of the MAX filter unit 803 in the subsequent stage.

図１０は、図１のデジタルカメラ１００において、静止画撮影が行われる場合の制御部１０１の動作を説明するためのフローチャートである。
ステップＳ１００１の処理として、制御部１０１は、不図示のシャッターボタンがいわゆる半押し状態（ＳＷ１オン）になるまで、表示部１０９にＥＶＦ映像を表示させるＥＶＦ撮像制御を行う。 FIG. 10 is a flowchart for explaining the operation of the control unit 101 when still image shooting is performed in the digital camera 100 of FIG.
As a process of step S1001, the control unit 101 performs EVF imaging control for displaying the EVF image on the display unit 109 until the shutter button (not shown) is in a so-called half-pressed state (SW1 is on).

次にステップＳ１００２において、制御部１０１は、シャッターボタンが半押し状態（ＳＷ１オン）であるか否かを判定する。制御部１０１は、半押し状態（ＳＷ１オン）でないと判定した場合にはステップＳ１００１に処理を戻し、一方、ユーザにてシャッターボタンが操作されることで半押し状態（ＳＷ１オン）になっていると判定した場合にはステップＳ１００３に処理を進める。 Next, in step S1002, the control unit 101 determines whether or not the shutter button is in the half-pressed state (SW1 on). When the control unit 101 determines that it is not in the half-pressed state (SW1 on), the process returns to step S1001, while the shutter button is operated by the user to enter the half-pressed state (SW1 on). If it is determined, the process proceeds to step S1003.

ステップＳ１００３に進むと、制御部１０１は、光学系１０４のフォーカスレンズを駆動制御するオートフォーカス（ＡＦ）処理を行って、被写体にフォーカスを合わせるようにする。 Proceeding to step S1003, the control unit 101 performs an autofocus (AF) process for driving and controlling the focus lens of the optical system 104 to focus on the subject.

次にステップＳ１００４において、制御部１０１は、シャッターボタンがいわゆる全押し状態（ＳＷ２オン）であるか否かを判定する。制御部１０１は、全押し状態（ＳＷ２オン）でないと判定した場合にはステップＳ１００１に処理を戻し、一方、全押し状態（ＳＷ２オン）になっていると判定した場合にはステップＳ１００５に処理を進める。 Next, in step S1004, the control unit 101 determines whether or not the shutter button is in the so-called fully pressed state (SW2 on). When the control unit 101 determines that it is not in the fully pressed state (SW2 on), it returns the process to step S1001, while when it determines that it is in the fully pressed state (SW2 on), it performs the process in step S1005. Proceed.

ステップＳ１００５に進むと、制御部１０１は、各部を制御して静止画を撮像させる。
その後、ステップＳ１００６に進むと、制御部１０１は、画像処理部１０７を制御して本実施形態に係る補正処理を含む画像処理を行わせる。画像処理部１０７における補正処理は、前述の図３等を用いて説明したような補正処理であり、例えばライティング補正、シャープネス調整、背景ぼかし、背景コントラスト調整などの何れか若しくは二つ以上を組み合わせた補正処理である。 When the process proceeds to step S1005, the control unit 101 controls each unit to capture a still image.
After that, when the process proceeds to step S1006, the control unit 101 controls the image processing unit 107 to perform image processing including the correction processing according to the present embodiment. The correction process in the image processing unit 107 is a correction process as described with reference to FIG. 3 and the like described above, and is a combination of any or two or more of, for example, lighting correction, sharpness adjustment, background blurring, and background contrast adjustment. This is a correction process.

なお図１０のフローチャートの例では、撮像された静止画のみに補正処理を施す例を挙げたが、この例に限定されるものではない。例えば、ＥＶＦ撮像中に補正処理を行ってもよい。ＥＶＦ撮像中に補正処理を行うと、ユーザは記録される静止画の仕上がり具合を、ＥＶＦ映像を見ることで事前に確認しながら、レリーズを切ることができ、利便性が高い。 In the example of the flowchart of FIG. 10, an example in which correction processing is performed only on the captured still image is given, but the present invention is not limited to this example. For example, correction processing may be performed during EVF imaging. If the correction process is performed during the EVF imaging, the user can release the recorded still image while checking the finish condition of the recorded still image in advance by viewing the EVF image, which is highly convenient.

また前述した実施形態の説明では、被写体マップの疎ら領域の黒（非被写体ラベル）部分と白（被写体ラベル）部分をそれぞれ検出して統合することで疎ら領域を検出する構成としたが、これに限定されるものではない。例えば、図３に示した疎ら判定部３０２は、図１１（ａ）のような構成であっても良い。なお、図１１の構成例において、膨張フィルタ部６０１、収縮フィルタ部６０２、ＭＥＤＩＡＮフィルタ部６０６、疎ら度合算出部６０７、被写体マップ６０８、疎ら度合６０９は、図６と同様であるためそれらの説明は省略する。 Further, in the description of the above-described embodiment, the sparse region is detected by detecting and integrating the black (non-subject label) portion and the white (subject label) portion of the sparse region of the subject map. It is not limited. For example, the sparseness determination unit 302 shown in FIG. 3 may have a configuration as shown in FIG. 11 (a). In the configuration example of FIG. 11, the expansion filter unit 601, the contraction filter unit 602, the median filter unit 606, the sparseness degree calculation unit 607, the subject map 608, and the sparseness degree degree 609 are the same as those in FIG. Omit.

図１１の構成の疎ら判定部３０２において、膨張フィルタ部６０１からは前述の図６（ｂ）に示した膨張後被写体マップ６１１が出力され、収縮フィルタ部６０２からは前述の図６（ｃ）に示した収縮後被写体マップ６１２が出力される。これら膨張後被写体マップ６１１と収縮後被写体マップ６１２は、差分検出部１１０５に入力される。 In the sparseness determination unit 302 having the configuration of FIG. 11, the expansion filter unit 601 outputs the post-expansion subject map 611 shown in FIG. 6 (b), and the contraction filter unit 602 displays the above-mentioned FIG. 6 (c). After the contraction shown, the subject map 612 is output. The expanded subject map 611 and the contracted subject map 612 are input to the difference detection unit 1105.

差分検出部１１０５は、前述した式（２）と同様の演算を行って、それら膨張後被写体マップ６１１と収縮後被写体マップ６１２との差分を検出する。差分検出部１１０５による差分検出処理の結果、前述の図６（ｆ）に示したのと同様の疎ら領域マップ６１５が生成される。図１１の構成例の場合、差分検出部１１０５による差分検出処理結果のマップは第３の疎ら領域マップに相当する。 The difference detection unit 1105 performs the same calculation as the above-mentioned equation (2) to detect the difference between the expanded subject map 611 and the contracted subject map 612. As a result of the difference detection process by the difference detection unit 1105, a sparse region map 615 similar to that shown in FIG. 6 (f) described above is generated. In the case of the configuration example of FIG. 11, the map of the difference detection processing result by the difference detection unit 1105 corresponds to the third sparse area map.

ＭＥＤＩＡＮフィルタ部６０６では、疎ら領域マップ６１５の被写体輪郭部などの誤検出を排除して、前述の図６（ｇ）と同様の疎ら領域マップ６１６を出力する。
図１１の構成例の場合、被写体マップの差分検出処理およびマップ統合処理のための構成及び演算を減らすことができる。 The median filter unit 606 eliminates erroneous detection of the subject contour portion of the sparse area map 615 and outputs the sparse area map 616 similar to FIG. 6 (g) described above.
In the case of the configuration example of FIG. 11, the configuration and calculation for the difference detection process and the map integration process of the subject map can be reduced.

以上説明したように、本実施形態によれば、例えば被写体領域ごとに適応的に画像補正や画像効果を適用する画像処理装置において、被写体領域抽出精度不足によるアーティファクト発生を抑圧することが可能である。 As described above, according to the present embodiment, for example, in an image processing device that adaptively applies image correction and image effects to each subject area, it is possible to suppress the occurrence of artifacts due to insufficient subject area extraction accuracy. ..

前述した本実施形態では、ＭＡＸフィルタ部やＭＩＮフィルタ部などの空間フィルタで疎ら領域が変化するのを検出して疎ら度合を算出する構成としたが、これに限定されるものではない。例えば、被写体マップを縦や横に走査して白の被写体ラベルと黒の非被写体ラベルとがトグルするようなトグル回数を数えて疎ら度合を算出するようにしても良い。また白（被写体ラベル）と黒（非被写体ラベル）とがトグルする回数が多い領域について、白から黒へ変化したときに白が持続した幅を疎ら領域の白部分の数とし、黒から白へ変化した時に黒が持続した幅を疎ら領域の黒部分の数としてカウントしても良い。このように構成することで、空間フィルタを使うよりも演算コストを軽くすることができる。 In the above-described embodiment, the spatial filter such as the MAX filter unit and the MIN filter unit detects the change in the sparse region and calculates the degree of sparseness, but the present invention is not limited to this. For example, the degree of sparseness may be calculated by scanning the subject map vertically or horizontally and counting the number of times the white subject label and the black non-subject label are toggled. Also, for areas where white (subject label) and black (non-subject label) are frequently toggled, the width at which white persists when changing from white to black is defined as the number of white areas in the sparse area, and from black to white. The width in which black lasts when it changes may be counted as the number of black portions in the sparse region. With this configuration, the calculation cost can be reduced as compared with the use of a spatial filter.

また例えば、離散フーリエ変換（ＦＦＴ）や離散コサイン変換（ＤＣＴ）を用いて、被写体マップの疎ら領域に対応する周波数帯域を解析することで疎ら度合を算出するようにしても良い。すなわち疎ら判定部３０２は、評価値マップを用いて抽出した被写体マップを周波数領域に変換する周波数領域変換処理を行って周波数領域マップを生成し、その周波数領域マップに基づいて疎ら度合を算出する。この例の場合、疎ら判定部３０２は、予め疎ら周波数範囲が決定されており、被写体マップを小ブロックごとに分けて周波数変換処理した周波数領域マップの小ブロックごとに、その疎ら周波数範囲で所定の閾値以上の応答を示しているか否かを判定する。そして、疎ら判定部３０２は、所定以上の応答を示している小ブロックの数に応じて疎ら度合を算出する。より具体的に説明すると、疎ら判定部３０２は、被写体マップに対し、例えば８×８画素のブロックごとにＦＦＴ処理を実施して、疎ら領域に対応する周波数応答が閾値より高いブロックのブロック数を数え、それらのブロックの割合を疎ら度合とする。このように構成することで、空間フィルタを使うよりもきめ細かく疎ら領域の周波数帯域を決めることができる。 Further, for example, the degree of sparseness may be calculated by analyzing the frequency band corresponding to the sparse region of the subject map by using the discrete Fourier transform (FFT) or the discrete cosine transform (DCT). That is, the sparseness determination unit 302 performs frequency domain conversion processing for converting the subject map extracted using the evaluation value map into a frequency domain to generate a frequency domain map, and calculates the degree of sparseness based on the frequency domain map. In the case of this example, the sparseness determination unit 302 has a sparse frequency range determined in advance, and the subject map is divided into small blocks and frequency conversion processing is performed for each small block of the frequency domain map. It is determined whether or not the response is equal to or higher than the threshold value. Then, the sparseness determination unit 302 calculates the degree of sparseness according to the number of small blocks showing a response equal to or greater than a predetermined value. More specifically, the sparseness determination unit 302 performs FFT processing on the subject map, for example, for each block of 8 × 8 pixels, and determines the number of blocks of blocks having a frequency response higher than the threshold value corresponding to the sparse area. Count and let the percentage of those blocks be the degree of sparseness. With this configuration, the frequency band in the sparse region can be determined more finely than using a spatial filter.

また本実施形態では、評価値分布として位相差分布（例えばデフォーカス量分布によるピント分布）を用いているが、これに限定されるものではない。評価値分布は、例えば、Ａ像とＢ像のずれ量（つまり視差）を表すシフト量の分布であっても良い。なおシフト量は、検出ピッチ（同一種類の画素の配置ピッチ）をかけてマイクロメートルなどの長さの単位で表しても良い。また例えば、評価値分布は、デフォーカス量を焦点深度（２Ｆδもしくは１Ｆδ。Ｆは絞り値、δは許容錯乱円径）で正規化した値の分布であっても良い。なお、絞り値Ｆは像高中央付近の絞り値を代表値として全面固定値としても良いし、光学系１０４のケラレで周辺像高の絞り値が暗くなるのを加味した絞り値分布を適用するようにしても良い。 Further, in the present embodiment, the phase difference distribution (for example, the focus distribution based on the defocus amount distribution) is used as the evaluation value distribution, but the present invention is not limited to this. The evaluation value distribution may be, for example, a distribution of a shift amount representing the amount of deviation (that is, parallax) between the A image and the B image. The shift amount may be expressed in units of length such as micrometer by multiplying the detection pitch (arrangement pitch of pixels of the same type). Further, for example, the evaluation value distribution may be a distribution of values obtained by normalizing the defocus amount with the depth of focus (2Fδ or 1Fδ. F is the aperture value and δ is the permissible circle of confusion diameter). The aperture value F may be a fixed value on the entire surface with the aperture value near the center of the image height as a representative value, or an aperture value distribution that takes into account that the aperture value of the peripheral image height becomes dark due to the eclipse of the optical system 104 is applied. You may do so.

また本実施形態では、画像のピント情報分布、例えば位相差測距方式によるデフォーカス量分布を、評価値マップとして取得する例を挙げたが、これに限定されるものではない。例えば、評価値マップは、コントラスト測距方式による被写体距離つまりフォーカス位置を逐次異ならせて得られる画像群から取得されるコントラスト情報分布に基づいて生成されても良い。また例えば、評価値マップは、像面側のデフォーカス量を物体面側の距離値に変換した距離情報分布に基づいて生成されても良い。また距離情報分布を取得する際の測距の方式は、位相差測距方式、コントラスト測距方式あるいは画像特徴に基づくパッシブ方式に限定されない。例えば、測距の方式は、ＴＯＦ（ＴｉｍｅＯｆＦｌｉｇｈｔ）方式やストロボ反射光の有無を比較するようなアクティブ方式が用いられてもよい。さらには被写体距離によらない方式でも良く、例えば動きベクトル分布をマップ化したオプティカルフロー、色情報を基にラベリングした色ラベルマップ、機械学習に基づいた意味的領域分割などに基づいて被写体マップが生成されてもよい。意味的領域分割を利用する場合は、代替シルエットはそれ以外の方式を用いる必要があるが、人型の固定形状マップを用いるなど、シーン変化によりロバストな方式を選択するようにすればよい。すなわち、記評価値マップは、画像のピント情報分布、距離情報分布、動きベクトル情報分布、色ラベリング情報分布、もしくは機械学習による意味的領域分割の、少なくともいずれかを基に生成されてもよい。 Further, in the present embodiment, an example of acquiring the focus information distribution of an image, for example, the defocus amount distribution by the phase difference distance measurement method as an evaluation value map has been given, but the present invention is not limited to this. For example, the evaluation value map may be generated based on the contrast information distribution acquired from the image group obtained by sequentially changing the subject distance, that is, the focus position by the contrast ranging method. Further, for example, the evaluation value map may be generated based on the distance information distribution obtained by converting the defocus amount on the image plane side into the distance value on the object plane side. Further, the distance measurement method for acquiring the distance information distribution is not limited to the phase difference distance measurement method, the contrast distance measurement method, or the passive method based on image features. For example, as the distance measuring method, a TOF (Time Of Flight) method or an active method for comparing the presence or absence of strobe reflected light may be used. Furthermore, a method that does not depend on the subject distance may be used. For example, a subject map is generated based on an optical flow that maps a motion vector distribution, a color label map that is labeled based on color information, and a semantic region division based on machine learning. May be done. When using semantic region division, it is necessary to use another method for the alternative silhouette, but it is sufficient to select a robust method depending on the scene change, such as using a humanoid fixed shape map. That is, the evaluation value map may be generated based on at least one of the focus information distribution, the distance information distribution, the motion vector information distribution, the color labeling information distribution, and the semantic region division by machine learning of the image.

本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける一つ以上のプロセッサがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。
上述の実施形態は、何れも本発明を実施するにあたっての具体化の例を示したものに過ぎず、これらによって本発明の技術的範囲が限定的に解釈されてはならないものである。すなわち、本発明は、その技術思想、又はその主要な特徴から逸脱することなく、様々な形で実施することができる。 The present invention supplies a program that realizes one or more functions of the above-described embodiment to a system or device via a network or storage medium, and one or more processors in the computer of the system or device reads and executes the program. It can also be realized by the processing to be performed. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.
The above-described embodiments are merely examples of embodiment of the present invention, and the technical scope of the present invention should not be construed in a limited manner by these. That is, the present invention can be implemented in various forms without departing from the technical idea or its main features.

１００：デジタルカメラ、１０１：制御部、１０７：画像処理部、１１０：ピントマップ処理部、３０１：被写体領域抽出部、３０２：疎ら判定部、３０３：被写体マップ合成部、３０４：加算部、３０５：補正処理部 100: Digital camera, 101: Control unit, 107: Image processing unit, 110: Focus map processing unit, 301: Subject area extraction unit, 302: Sparseness determination unit, 303: Subject map composition unit, 304: Addition unit, 305: Correction processing unit

Claims

A map acquisition method that acquires the evaluation value distribution corresponding to the image as an evaluation value map,
A map generation means for generating a first subject map based on a subject area extracted from the image using the evaluation value map, and a map generation means.
A degree acquisition means for acquiring the degree of sparseness, which represents the degree of the sparse area included in the first subject map, and
It has a correction means for performing correction processing on the image using at least one of the first subject map and the second subject map generated without using the evaluation value map.
The image processing apparatus is characterized in that the correction processing is performed by preferentially using the second subject map over the first subject map as the degree of sparseness increases.

The map generation means generates the first subject map classified into at least two label areas, that is, a subject label representing a subject area and a non-subject label representing a non-subject.
The degree acquisition means
A third subject map in which the region where the subject labels are sparsely distributed is changed by a predetermined process on the first subject map is generated.
A fourth subject map in which the region where the non-subject labels are sparsely distributed is changed by a predetermined process on the first subject map is generated.
The image processing apparatus according to claim 1, wherein the degree of sparseness is calculated based on at least one of the third subject map and the fourth subject map.

The degree acquisition means
A first sparse area map is generated based on the first subject map and the third subject map.
A second sparse area map is generated based on the first subject map and the fourth subject map.
The image processing apparatus according to claim 2, wherein the degree of sparseness is calculated based on at least one of the first sparse area map and the second sparse area map.

The degree acquisition means generates a third sparse area map based on the third subject map and the fourth subject map, and calculates the degree of sparseness based on the third sparse area map. The image processing apparatus according to claim 2.

The predetermined process for generating the third subject map includes a filter process for contracting the label area of the subject label and then expanding at least the label area of the subject label. The image processing apparatus according to any one of claims 2 to 4.

The predetermined process for generating the fourth subject map is characterized by including a filter process of shrinking the label area of the non-subject label and then expanding at least the label area of the non-subject label. The image processing apparatus according to any one of claims 2 to 4.

The image processing apparatus according to any one of claims 3 to 6, wherein the degree acquisition means also performs an isolated area removing process for removing an isolated area from the sparse area map.

The map generation means generates the first subject map classified into at least two label areas, that is, a subject label representing a subject area and a non-subject label representing a non-subject.
The image processing apparatus according to claim 1, wherein the degree acquisition means calculates the degree of sparseness based on the number of times the subject label and the non-subject label toggle.

The first aspect of claim 1, wherein the degree acquisition means converts the first subject map into a frequency domain to generate a frequency domain map, and calculates the sparseness based on the frequency domain map. Image processing device.

The degree acquisition means converts the first subject map into a frequency domain for each small block to generate a frequency domain map, and determines a predetermined sparse frequency range for each small block of the frequency domain map. The image according to claim 9, wherein it is determined whether or not the response is equal to or higher than the threshold value, and the degree of sparseness is calculated according to the number of small blocks showing the response equal to or higher than a predetermined threshold value. Processing equipment.

The evaluation value map is characterized by including one of a focus information distribution, a distance information distribution, a motion vector information distribution, a color labeling information distribution, and a semantic region division by machine learning of the image. The image processing apparatus according to any one of claims 1 to 10.

The eleventh aspect of claim 11 is characterized in that the focus information distribution is a parallax information distribution acquired from image groups having different viewpoints, or a contrast information distribution acquired from image groups obtained by sequentially changing the focus positions. The image processing apparatus described.

The image processing according to claim 12, wherein the parallax information distribution includes one of a map based on a shift amount representing parallax, a map based on a defocus amount, and a map based on a distance value. Device.

An image processing method executed by an image processing device.
A map acquisition process that acquires the evaluation value distribution corresponding to the image as an evaluation value map, and
A map generation step of generating a first subject map based on a subject area extracted from the image using the evaluation value map, and a map generation step.
A degree acquisition step for acquiring the degree of sparseness, which represents the degree of the sparse area included in the first subject map, and
It has a correction step of performing correction processing on the image using at least one of the first subject map and the second subject map generated without using the evaluation value map.
The image processing method is characterized in that, in the correction step, the correction processing is performed by preferentially using the second subject map over the first subject map as the degree of sparseness increases.

A program for causing a computer to function as each means included in the image processing apparatus according to any one of claims 1 to 13.