JP5323165B2

JP5323165B2 - Stereoscopic image processing apparatus and stereoscopic image processing method

Info

Publication number: JP5323165B2
Application number: JP2011242953A
Authority: JP
Inventors: 久雄熊井; 永雄服部; 健一郎山本; 郁子椿; 幹生瀬戸
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2011-11-07
Filing date: 2011-11-07
Publication date: 2013-10-23
Anticipated expiration: 2031-11-07
Also published as: JP2013098944A; WO2013069624A1

Abstract

A three-dimensional image processing device pertaining to an embodiment of the present invention accepts an input three-dimensional image, calculates one or more parallax features in the three-dimensional image, and divides the three-dimensional image into multiple scenes according to the parallax features. In addition, the three-dimensional image processing device performs three-dimensional image correction processing on the basis of the scenes.

Description

本発明は、立体画像処理装置および立体画像処理方法に関するものである。 The present invention relates to a stereoscopic image processing apparatus and a stereoscopic image processing method.

人間は、一定の間隔を持つ２つの目により得られる視覚情報の違いから、空間を把握する能力を持つ。左右の眼による異なる視点から得られる像のずれを視差と呼ぶ。人間は、視差を手掛かりの一つとして、物体までの距離を把握している。実際、物体までの距離は視差から算出することが可能である。このことを利用して、右目用画像を右目に表示し、左目用画像を左目に表示する立体画像表示装置を用意し、右目用画像、左目用画像として視差を設けた画像を提示することにより立体視が可能であることが知られている。ここでは、立体視を意図して視差を設けた複数の画像のことを立体画像と称する。また、物体までの距離を「奥行」と表現する。立体画像を撮影したカメラの焦点距離やカメラの間隔が既知であれば、視差と奥行は互いに変換可能である。 Humans have the ability to grasp space from the difference in visual information obtained by two eyes having a fixed interval. The shift of images obtained from different viewpoints by the left and right eyes is called parallax. Humans grasp the distance to an object using parallax as a clue. In fact, the distance to the object can be calculated from the parallax. By using this, a stereoscopic image display device that displays a right-eye image and a left-eye image on the right eye is prepared, and an image with parallax is provided as a right-eye image and a left-eye image. It is known that stereoscopic viewing is possible. Here, a plurality of images provided with parallax for the purpose of stereoscopic viewing are referred to as stereoscopic images. The distance to the object is expressed as “depth”. The parallax and the depth can be converted to each other if the focal length of the camera that captured the stereoscopic image and the interval between the cameras are known.

人間は立体視において、視差に応じた両眼の光軸のなす角度、すなわち輻輳角の大きさを対象物までの距離に対応付けていると言われている。よって、左目用画像上のある被写体の位置より右目用画像上のその被写体の位置の方が、左目用画像上から見て相対的に右側にずれるよう視差を付けた画像を見せると、被写体が実際の表示面より遠景側にあるように知覚させることができる。逆に、左目用画像上のある被写体の位置より右目用画像上のその被写体の位置の方が、左目用画像上から見て相対的に左側にずれるよう視差を付けた画像を見せると、被写体が実際の表示面より近景側にあるように知覚させることができる。 It is said that humans associate the angle formed by the optical axes of both eyes according to parallax, that is, the size of the convergence angle, with the distance to the object in stereoscopic vision. Therefore, when an image with a parallax is displayed such that the position of the subject on the right-eye image is relatively shifted to the right when viewed from the left-eye image than the position of a certain subject on the left-eye image, the subject is It can be perceived as being on the far side of the actual display surface. On the contrary, when an image with parallax is displayed such that the position of the subject on the right-eye image is relatively shifted to the left when viewed from the left-eye image than the position of a certain subject on the left-eye image, Can be perceived as being closer to the foreground than the actual display surface.

以下では、左目用画像を基準として右目用画像上の被写体の右方向の視差のずれを、正の視差として説明する。被写体は、視差量が正の値であると表示画面より遠景方向に見える。視差量が０の場合はちょうど表示画面と同じ距離にあるように見える。視差量が負の値になる場合、すなわち左目用画像を基準として右目用画像上の被写体が左方向にずれている場合は、表示画面より近景方向に見える。視差量の単位としては、例えば画像におけるピクセル数を用いることができる。 In the following, the shift in the right-direction parallax of the subject on the right-eye image with reference to the left-eye image will be described as positive parallax. If the amount of parallax is a positive value, the subject appears far away from the display screen. When the amount of parallax is 0, it appears to be at the same distance as the display screen. When the amount of parallax becomes a negative value, that is, when the subject on the right-eye image is shifted leftward with respect to the left-eye image, it appears in the foreground direction from the display screen. As a unit of the amount of parallax, for example, the number of pixels in the image can be used.

しかし、立体画像の視差量を正の方向に大きくしすぎ、観察者の目の間隔、正確には無限遠を見ている時の瞳孔間の距離を超えた視差量とすると、自然界では起き得ない状態となり、立体視が不可能となるか、立体視できたとしても人体に強い負担を強いることになる。一方、視差量を負の方向に大きくしすぎると、極度な寄り目を観察者に強いることとなり、快適な立体視ができなくなる。またいずれの場合においても、視差量が正又は負の方向に大きくなるほど輻輳と目の焦点の調節との乖離が大きくなり、不自然な状態となるので、違和感を生じる。このように、立体画像の視差量がある一定の範囲では快適に立体視が可能であるが、視差量が大きくなると両目の画像が融合しなくなり、立体視が困難あるいは不可能となる。 However, if the parallax amount of the stereoscopic image is too large in the positive direction and the parallax amount exceeds the distance between the eyes of the observer, precisely the distance between the pupils when looking at infinity, it can occur in the natural world. In this state, stereoscopic viewing becomes impossible or even if stereoscopic viewing is possible, a heavy burden is imposed on the human body. On the other hand, if the amount of parallax is too large in the negative direction, extreme crossing will be forced on the viewer, and comfortable stereoscopic viewing will not be possible. In any case, as the amount of parallax increases in the positive or negative direction, the difference between the convergence and the focus adjustment of the eyes increases, resulting in an unnatural state, which causes a sense of discomfort. As described above, stereoscopic viewing can be comfortably performed within a certain range of the parallax amount of the stereoscopic image. However, when the parallax amount increases, the images of both eyes are not fused, and stereoscopic viewing becomes difficult or impossible.

特許文献１には、これに対する解決手段が開示されている。図２５は特許文献１に記載の発明の要部を図解したものであり、これを用いて説明する。図２５においては、視差計算部５００は左右眼用の画像から、画面全体について各座標における視差を計算した視差地図を計算する。計算方法は、左右画像の輝度パターンの相関を計算する相関マッチング法等を用いる。注視点計算部５０１において立体画像の視差の最大値、すなわち最遠景視差量を計算し、視差制御部５０２が画面表示部５０３の表示画面上での左右のずれ量を、その最遠景視差量が観察者の両眼間隔を越えないように設定する。両眼間隔は、成人の場合約６５ｍｍである。これにより、観察者の視線が平行より広がることがなくなり、両目の画像の視差が融合する範囲内になるように制御することができる。あるいは、注視点計算部５０１において立体画像の視差の最小値、すなわち最近景視差量を計算し、視差制御部５０２が画面表示部５０３の表示画面上での左右のずれ量を、その最近景視差量が、ある所定の大きさβ以下にならないように設定する方法が開示されている。これにより、観察者の視点が非常に近い位置になり３次元画像表示面からの目のピント情報と視線の輻輳角の大きな不一致状態をなくすことが出来、観察者が表示画像を両眼融合しやすくなるように左右画像を制御することができる。つまり、立体画像に対してこのような補正処理を行うことにより、常に視聴者に見やすい立体画像を呈示できる。 Patent Document 1 discloses a solution for this. FIG. 25 illustrates a main part of the invention described in Patent Document 1, and will be described using this. In FIG. 25, the parallax calculation unit 500 calculates a parallax map obtained by calculating the parallax at each coordinate for the entire screen from the left and right eye images. As a calculation method, a correlation matching method for calculating a correlation between luminance patterns of the left and right images is used. The gazing point calculation unit 501 calculates the maximum parallax value of the stereoscopic image, that is, the farthest view parallax amount, and the parallax control unit 502 calculates the left and right shift amount on the display screen of the screen display unit 503, and the farthest view parallax amount. Set so that the distance between the eyes of the observer is not exceeded. The distance between eyes is about 65 mm for adults. Thereby, the line of sight of the observer does not spread more than parallel, and control can be performed so that the parallax of the images of both eyes is within the range where they are fused. Alternatively, the gaze point calculation unit 501 calculates the minimum parallax value of the stereoscopic image, that is, the closest scene parallax amount, and the parallax control unit 502 calculates the right and left shift amount on the display screen of the screen display unit 503 as the nearest scene parallax. A method is disclosed in which the amount is set so as not to be equal to or less than a certain predetermined size β. As a result, the observer's viewpoint is located very close, and it is possible to eliminate a large mismatch state between the focus information of the eyes from the 3D image display surface and the convergence angle of the line of sight. The left and right images can be controlled so as to be easier. That is, by performing such correction processing on a stereoscopic image, a stereoscopic image that is always easy for the viewer to view can be presented.

また、例えば人物の上半身をアップで写した立体画像のように、被写体が画像の周辺部分にかかっており、その被写体が画面より手前に知覚されるような視差量を有する場合、その視差量にも関わらず手前に飛び出して見え難いという現象がある。視聴者にとって、表示装置の画面に表示された被写体が画面の枠部分で切り取られて手前に浮いて見えるという状況が違和感をもって感じられ、被写体が飛び出して知覚することを妨げてしまうことが原因と考えられる。 In addition, for example, when a subject is placed on the peripheral portion of the image, such as a stereoscopic image in which the upper body of a person is taken up, and the subject has a parallax amount that is perceived in front of the screen, the parallax amount Nevertheless, there is a phenomenon that it is difficult to see by jumping out to the front. For viewers, the situation that the subject displayed on the screen of the display device is cut off at the frame part of the screen and appears floating in front is caused by a sense of discomfort, preventing the subject from jumping out and perceiving Conceivable.

一方特許文献２には、表示される映像の種類に応じて適切に映像表示特性を調整する技術が開示されている。 On the other hand, Patent Document 2 discloses a technique for appropriately adjusting video display characteristics according to the type of video to be displayed.

図２６は特許文献２に記載の発明の要部を図解したものであり、これを用いて説明する。輝度情報検出部６０９は、受信部６０２から出力される映像信号Ｄｂに含まれる輝度信号Ｙの各画素における輝度より、輝度情報値Ｙｉを検出する。コンテンツ特徴検出部６１０は、輝度情報値Ｙｉをもとに各フレームの映像コンテンツの特徴を判定し、コンテンツ特徴判定値Ｊｉを複数コンテンツ特徴検出器６１１に出力する。複数コンテンツ特徴検出部６１１は複数フレーム分のコンテンツ特徴判定値Ｊｉより、変動の少ない映像コンテンツ判定（一時的変動の影響を受けにくいコンテンツ判定）を行い、より尤もらしい判定結果としての、複数コンテンツ特徴判定値Ｆｉを補正制御部６０６に出力する。補正制御部６０６は、複数コンテンツ特徴判定値Ｆｉを基にそれぞれのコンテンツの特徴に応じた映像補正値Ｐｉを映像補正部６０５に出力する。映像補正部６０５は、入力された映像補正パラメータＰｉを用いて映像信号Ｄｃに対して映像補正を行い、それを映像信号Ｄｄとして表示部６０４に出力する。このような構成により、入力された映像の種類に応じて適切な画質補正を行う。 FIG. 26 illustrates a main part of the invention described in Patent Document 2, and will be described using this. The luminance information detection unit 609 detects the luminance information value Yi from the luminance at each pixel of the luminance signal Y included in the video signal Db output from the receiving unit 602. The content feature detection unit 610 determines the feature of the video content of each frame based on the luminance information value Yi, and outputs the content feature determination value Ji to the multiple content feature detector 611. The multiple content feature detection unit 611 performs video content determination with less variation (content determination that is not easily affected by temporary variation) from the content feature determination value Ji for a plurality of frames, and the multiple content feature as a more likely determination result. The determination value Fi is output to the correction control unit 606. The correction control unit 606 outputs the video correction value Pi corresponding to the feature of each content to the video correction unit 605 based on the multiple content feature determination value Fi. The video correction unit 605 performs video correction on the video signal Dc using the input video correction parameter Pi, and outputs it to the display unit 604 as the video signal Dd. With such a configuration, appropriate image quality correction is performed according to the type of input video.

特開平７−１６７６３３号公報JP-A-7-167633 特開２００８−１３４２７７号公報JP 2008-134277 A

立体画像は上記のように、視差量が正の方向に大きすぎても負の方向に大きすぎても、立体視が困難あるいは不可能となる。このため立体画像は、あまり大きな視差を付け過ぎないように製作されているものも多い。また、立体画像は上記のように、被写体が画像の周辺部分にかかっておりその被写体が画面より手前に知覚されるような視差量を有する場合、その視差量にも関わらず手前に飛び出して見え難いという現象がある。このため立体画像は、被写体があまり画面より手前に飛び出し過ぎないように製作されているものも多い。しかし画一的に上記のような方針で立体画像を製作すると、奥行が感じられない立体画像や被写体が飛び出して見えることのない立体画像、すなわち立体感が乏しい立体画像が多くなり、立体画像の特性が生かされないことになる。 As described above, if the amount of parallax is too large in the positive direction or too large in the negative direction, it is difficult or impossible to make a stereoscopic image. For this reason, many stereoscopic images are manufactured so as not to add too much parallax. In addition, as described above, when a subject covers the peripheral part of the image and has a parallax amount that is perceived in front of the screen as described above, the stereoscopic image appears to pop out to the front regardless of the amount of parallax. There is a phenomenon that is difficult. For this reason, many stereoscopic images are produced so that the subject does not protrude too much in front of the screen. However, if a 3D image is produced according to the above-mentioned policy uniformly, there will be more 3D images in which the depth is not felt and 3D images in which the subject does not appear to pop out, that is, 3D images with a poor 3D effect. The characteristics will not be utilized.

これに対して、立体画像の表示時に立体感の乏しい立体画像に対して立体感の強調処理を行うことが考えられる。この時、全ての立体画像の立体感が乏しいわけではないため、立体感の乏しい立体画像に選択的に立体感の強調処理を行うことが望ましい。また、立体感が乏しい要因は様々であるので、立体画像の立体感の特徴に応じて立体感の強調処理の種類や強調量を変えることが望ましい。 On the other hand, it is conceivable to perform a stereoscopic effect enhancement process on a stereoscopic image with a poor stereoscopic effect when the stereoscopic image is displayed. At this time, since the stereoscopic effect of all the stereoscopic images is not poor, it is desirable to selectively perform the stereoscopic effect enhancement process on the stereoscopic image with poor stereoscopic effect. In addition, since there are various factors that cause a lack of stereoscopic effect, it is desirable to change the type and amount of enhancement processing for the stereoscopic effect according to the stereoscopic effect of the stereoscopic image.

しかしながら、特許文献１に記載の立体画像の調整機能は、立体画像を見やすく疲れにくくすること、すなわち立体視の安全性を考慮したものであって、立体感の強調を考慮したものではない。立体感の乏しい立体画像を判別する機能や立体画像の立体感の特徴を検出する機能を有しておらず、上記のような立体感の強調処理を適切に行うことはできない。 However, the stereoscopic image adjustment function described in Patent Document 1 is intended to make the stereoscopic image easy to see and less fatigue, that is, considering the safety of stereoscopic vision, and does not consider enhancing the stereoscopic effect. It does not have a function of discriminating a stereoscopic image with a poor stereoscopic effect or a function of detecting a feature of the stereoscopic effect of the stereoscopic image, and the above-described stereoscopic effect enhancement process cannot be appropriately performed.

また、特許文献２に記載の技術は、表示される映像の輝度情報を用いて映像の特徴を判定し、映像の種類に応じて適切に映像表示特性を調整する技術であるが、これは立体画像ではない通常の画像に対する技術である。立体感の乏しい立体画像を判別する機能や立体画像の立体感の特徴を検出する機能、立体感を強調する機能のいずれも有しておらず、上記のような立体感の強調処理を行うことはできない。 The technique described in Patent Document 2 is a technique for determining the characteristics of a video using luminance information of the displayed video and appropriately adjusting the video display characteristics according to the type of the video. This is a technique for a normal image that is not an image. It does not have a function to discriminate a stereoscopic image with poor stereoscopic effect, a feature to detect the stereoscopic effect of a stereoscopic image, or a function to enhance the stereoscopic effect, and performs the above-described stereoscopic effect enhancement processing. I can't.

本発明は上記課題を鑑みて成されたものであり、立体感の乏しい立体画像を判別すること及び立体画像の立体感の特徴に応じて立体感を強調する補正処理を行うことを目的とする。 The present invention has been made in view of the above problems, and an object of the present invention is to determine a stereoscopic image with a poor stereoscopic effect and to perform a correction process that enhances the stereoscopic effect according to the stereoscopic effect of the stereoscopic image. .

本発明の一態様によれば、本発明の立体画像処理装置は、立体画像のシーンを分類する立体画像処理装置であって、前記立体画像を入力する入力部と、前記立体画像から該立体画像の１以上の視差特徴量を算出する視差特徴量算出部と、前記視差特徴量により前記立体画像を複数のシーンに分類する立体画像分類部とを備えたことを特徴とする。 According to an aspect of the present invention, the stereoscopic image processing apparatus of the present invention is a stereoscopic image processing apparatus that classifies a scene of a stereoscopic image, the input unit for inputting the stereoscopic image, and the stereoscopic image from the stereoscopic image. A parallax feature amount calculation unit that calculates one or more parallax feature amounts, and a stereoscopic image classification unit that classifies the stereoscopic image into a plurality of scenes based on the parallax feature amount.

本発明の別の一態様によれば、本発明の立体画像処理装置は、前記シーンに基づき立体画像補正処理を行う画像補正部を備えても良い。 According to another aspect of the present invention, the stereoscopic image processing apparatus of the present invention may include an image correction unit that performs a stereoscopic image correction process based on the scene.

本発明の別の一態様によれば、前記立体画像補正処理は、前記立体画像の視差調整処理により立体感を強調する処理であっても良い。 According to another aspect of the present invention, the stereoscopic image correction process may be a process of enhancing a stereoscopic effect by a parallax adjustment process of the stereoscopic image.

本発明の別の一態様によれば、前記視差特徴量は、前記立体画像の最近景の視差量と最遠景の視差量の幅である視差範囲幅であっても良い。 According to another aspect of the present invention, the parallax feature amount may be a parallax range width that is a width between the parallax amount of the closest view and the parallax amount of the farthest view of the stereoscopic image.

本発明の別の一態様によれば、前記視差特徴量は、視差量に関する第１の定数より近景側の視差量により決定される近景側度数総和であっても良い。 According to another aspect of the present invention, the parallax feature amount may be a foreground-side frequency sum determined by a near-field parallax amount from a first constant related to the parallax amount.

本発明の別の一態様によれば、前記視差特徴量は、前記視差量に関する第１の定数より小さい視差量に関する第２の定数より近景側の視差量により決定される第二近景側度数総和であっても良い。 According to another aspect of the present invention, the parallax feature amount is determined based on a second foreground-side frequency sum determined by a near-field parallax amount from a second constant relating to a parallax amount smaller than a first constant relating to the parallax amount. It may be.

本発明の別の一態様によれば、前記視差特徴量は、視差量に関する第３の定数より近景側の、前記立体画像の周辺部における視差量により決定される画像周辺部近景側度数総和であっても良い。 According to another aspect of the present invention, the parallax feature amount is an image peripheral portion foreground side frequency sum determined by a parallax amount in the peripheral portion of the stereoscopic image closer to the foreground than the third constant related to the parallax amount. There may be.

本発明の別の一態様によれば、前記立体画像分類部は、前記視差範囲幅と第１の視差範囲幅定数を比較して前記立体画像を分類する処理を行い、前記画像補正部は、前記視差範囲幅が前記第１の視差範囲幅定数より小さいシーンに、前記立体画像の視差を大きくする視差調整処理を行っても良い。 According to another aspect of the present invention, the stereoscopic image classification unit performs a process of classifying the stereoscopic image by comparing the parallax range width with a first parallax range width constant, and the image correction unit includes: You may perform the parallax adjustment process which enlarges the parallax of the said stereo image to the scene where the said parallax range width is smaller than the said 1st parallax range width constant.

本発明の別の一態様によれば、前記立体画像分類部は、前記視差範囲幅と前記第１の視差範囲幅定数及びそれより大きい第２の視差範囲幅定数を比較して前記立体画像を分類する処理と、前記近景側度数総和と第１の近景側度数総和定数及びそれより大きい第２の近景側度数総和定数を比較して前記立体画像を分類する処理とを行い、前記画像補正処理部は、前記視差範囲幅が前記第１の視差範囲幅定数より大きくかつ前記第２の視差範囲幅定数より小さいシーンであって、かつ前記近景側度数総和が前記第１の近景側度数総和定数より大きくかつ前記第２の近景側度数総和定数より小さいシーンに、前記立体画像の視差を小さくする視差調整処理を行っても良い。 According to another aspect of the present invention, the stereoscopic image classification unit compares the parallax range width with the first parallax range width constant and a second parallax range width constant larger than the stereoscopic image. A process of classifying, and a process of classifying the stereoscopic image by comparing the foreground-side power sum total with the first foreground-side power sum constant and a larger second foreground-side power sum constant. The scene is a scene in which the parallax range width is larger than the first parallax range width constant and smaller than the second parallax range width constant, and the foreground side power sum is the first foreground side power sum constant A parallax adjustment process for reducing the parallax of the stereoscopic image may be performed on a scene that is larger and smaller than the second foreground-side power sum constant.

本発明の別の一態様によれば、前記立体画像分類部は、前記視差範囲幅と前記第１の視差範囲幅定数及びそれより大きい前記第２の視差範囲幅定数を比較して前記立体画像を分類する処理と、前記近景側度数総和と第３の近景側度数総和定数を比較して分類する処理と、前記第二近景側度数総和と第４の近景側度数総和定数を比較して前記立体画像を分類する処理とを行い、前記画像補正処理部は、前記視差範囲幅が前記第１の視差範囲幅定数より大きくかつ前記第２の視差範囲幅定数より小さいシーンであって、かつ前記近景側度数総和が前記第３の近景側度数総和定数より大きいシーンであって、かつ前記第二近景側度数総和が前記第４の近景側度数総和定数より小さいシーンに、前記立体画像の視差を小さくする視差調整処理を行っても良い。 According to another aspect of the present invention, the stereoscopic image classification unit compares the parallax range width with the first parallax range width constant and the second parallax range width constant larger than the stereoscopic image. , A process for comparing and classifying the foreground side power sum and the third foreground side power sum constant, and comparing the second foreground side power sum and the fourth foreground side power sum constant The image correction processing unit is a scene in which the parallax range width is larger than the first parallax range width constant and smaller than the second parallax range width constant; and The parallax of the stereoscopic image is applied to a scene in which a foreground side power sum is larger than the third foreground side power sum constant and the second foreground side power sum is smaller than the fourth foreground side power sum constant. Perform parallax adjustment processing to reduce Good.

本発明の別の一態様によれば、前記立体画像分類部は、前記視差範囲幅と前記第１の視差範囲幅定数を比較して前記立体画像を分類する処理と、前記画像周辺部近景側度数総和と第１の画像周辺部近景側度数総和定数を比較して前記立体画像を分類する処理とを行い、前記画像補正処理部は、前記視差範囲幅が前記第１の視差範囲幅定数より大きいシーンであって、かつ画像周辺部近景側度数総和が前記第１の画像周辺部近景側度数総和定数より大きいシーンに、前記立体画像の視差を大きくする視差調整処理を行っても良い。 According to another aspect of the present invention, the stereoscopic image classifying unit compares the parallax range width with the first parallax range width constant to classify the stereoscopic image, and the image peripheral portion near view side A process of classifying the stereoscopic image by comparing the frequency sum and the first image peripheral portion near-field-side power sum constant, and the image correction processing unit is configured such that the parallax range width is greater than the first parallax range width constant. A parallax adjustment process for increasing the parallax of the stereoscopic image may be performed on a scene that is a large scene and whose image peripheral portion near view side power sum is larger than the first image peripheral portion near view side power sum constant.

本発明の別の一態様によれば、本発明の立体画像処理装置は、立体画像のシーンを分類する立体画像処理装置であって、前記立体画像及び該立体画像の１以上の視差特徴量を入力する入力部と、前記視差特徴量により前記立体画像を複数のシーンに分類する立体画像分類部とを備えたことを特徴とする。 According to another aspect of the present invention, a stereoscopic image processing apparatus of the present invention is a stereoscopic image processing apparatus that classifies a scene of a stereoscopic image, and the stereoscopic image and at least one parallax feature amount of the stereoscopic image are obtained. An input unit for input and a stereoscopic image classification unit for classifying the stereoscopic image into a plurality of scenes based on the parallax feature amount are provided.

本発明の別の一態様によれば、本発明の立体画像処理装置は、前記シーンの変化により前記立体画像のシーンチェンジを検出するシーンチェンジ検出部を備えても良い。 According to another aspect of the present invention, the stereoscopic image processing apparatus of the present invention may include a scene change detection unit that detects a scene change of the stereoscopic image based on a change in the scene.

本発明の別の一態様によれば、本発明の立体画像処理方法は、立体画像のシーンを分類する立体画像処理方法であって、前記立体画像を入力するステップと、前記立体画像から該立体画像の１以上の視差特徴量を算出するステップと、前記視差特徴量により前記立体画像を複数のシーンに分類するステップとを有することを特徴とする。 According to another aspect of the present invention, a stereoscopic image processing method of the present invention is a stereoscopic image processing method for classifying a scene of a stereoscopic image, the step of inputting the stereoscopic image, and the stereoscopic image from the stereoscopic image. The method includes calculating one or more parallax feature amounts of the image and classifying the stereoscopic image into a plurality of scenes based on the parallax feature amount.

本発明の別の一態様によれば、本発明の立体画像処理方法は、立体画像のシーンを分類する立体画像処理方法であって、前記立体画像及び該立体画像の１以上の視差特徴量を入力するステップと、前記視差特徴量により前記立体画像を複数のシーンに分類するステップとを有することを特徴とする。 According to another aspect of the present invention, a stereoscopic image processing method of the present invention is a stereoscopic image processing method for classifying a scene of a stereoscopic image, wherein the stereoscopic image and one or more parallax feature quantities of the stereoscopic image are obtained. And a step of classifying the stereoscopic image into a plurality of scenes based on the parallax feature amount.

本発明の別の一態様によれば、本発明のプログラムは、コンピュータに立体画像のシーンを分類する立体画像処理を実行させるためのプログラムであって、前記立体画像を入力するステップと、前記立体画像から該立体画像の１以上の視差特徴量を算出するステップと、前記視差特徴量により前記立体画像を複数のシーンに分類するステップとを含むことを特徴とする。 According to another aspect of the present invention, the program of the present invention is a program for causing a computer to execute a stereoscopic image process for classifying a stereoscopic image scene, the step of inputting the stereoscopic image, and the stereoscopic image The method includes calculating one or more parallax feature amounts of the stereoscopic image from the image and classifying the stereoscopic image into a plurality of scenes based on the parallax feature amount.

本発明の別の一態様によれば、本発明のプログラムは、コンピュータに立体画像のシーンを分類する立体画像処理を実行させるためのプログラムであって、前記立体画像及び該立体画像の１以上の視差特徴量を入力するステップと、前記視差特徴量により前記立体画像を複数のシーンに分類するステップとを含むことを特徴とする。 According to another aspect of the present invention, a program of the present invention is a program for causing a computer to execute a stereoscopic image process for classifying a stereoscopic image scene, the stereoscopic image and at least one of the stereoscopic image. The method includes inputting a parallax feature value and classifying the stereoscopic image into a plurality of scenes based on the parallax feature value.

上記手段により、本発明に係る立体画像処理装置は、立体画像の視差特徴量を算出し、これを用いて立体画像を複数のシーンに分類することが可能となる。また、そのシーンに基づき立体画像の立体感を強調する補正処理を行うことが可能となる。 By the above means, the stereoscopic image processing apparatus according to the present invention can calculate the parallax feature amount of the stereoscopic image and use this to classify the stereoscopic image into a plurality of scenes. Further, it is possible to perform a correction process for enhancing the stereoscopic effect of the stereoscopic image based on the scene.

立体画像のフレームの概念を説明する図である。It is a figure explaining the concept of the frame of a stereo image. 第１の実施例に係る立体画像表示装置の構成を示すブロック図である。It is a block diagram which shows the structure of the stereo image display apparatus which concerns on a 1st Example. 第１の実施例に係る視差特徴量算出部の構成を示すブロック図である。It is a block diagram which shows the structure of the parallax feature-value calculation part which concerns on a 1st Example. 視差ヒストグラムの一例である。It is an example of a parallax histogram. 第１の実施例における立体画像の分類の種類とそれに対応する画像補正処理を示す図である。It is a figure which shows the kind of classification | category of the stereo image in 1st Example, and the image correction process corresponding to it. 立体画像とそれによる奥行き知覚を説明する図である。It is a figure explaining a stereo image and the depth perception by it. 立体画像とそれによる奥行き知覚を説明する図である。It is a figure explaining a stereo image and the depth perception by it. 立体画像とそれによる奥行き知覚を説明する図である。It is a figure explaining a stereo image and the depth perception by it. 第２の実施例に係る立体画像表示装置の構成を示すブロック図である。It is a block diagram which shows the structure of the stereo image display apparatus which concerns on a 2nd Example. 第２の実施例に係る視差特徴量算出部の構成を示すブロック図である。It is a block diagram which shows the structure of the parallax feature-value calculation part which concerns on a 2nd Example. 近景側度数総和について説明するための視差ヒストグラムを示した図である。It is the figure which showed the parallax histogram for demonstrating near view side frequency sum total. 第２の実施例における立体画像の分類の種類とそれに対応する画像補正処理を示す図である。It is a figure which shows the kind of classification | category of the stereo image in the 2nd Example, and the image correction process corresponding to it. 第２の実施例における視差調整量の決定方法を説明する図である。It is a figure explaining the determination method of the parallax adjustment amount in a 2nd Example. 第３の実施例に係る立体画像表示装置の構成を示すブロック図である。It is a block diagram which shows the structure of the stereo image display apparatus which concerns on a 3rd Example. 第３の実施例に係る視差特徴量算出部の構成を示すブロック図である。It is a block diagram which shows the structure of the parallax feature-value calculation part which concerns on a 3rd Example. 画像の周辺部分を説明する図である。It is a figure explaining the peripheral part of an image. 第３の実施例における立体画像の分類の種類とそれに対応する画像補正処理を示す図である。It is a figure which shows the kind of classification | category of the stereo image in a 3rd Example, and the image correction process corresponding to it. 第４の実施例に係る立体画像表示装置の構成を示すブロック図である。It is a block diagram which shows the structure of the stereo image display apparatus which concerns on a 4th Example. 第４の実施例に係る視差特徴量算出部の構成を示すブロック図である。It is a block diagram which shows the structure of the parallax feature-value calculation part which concerns on a 4th Example. 近景側度数総和及び第二近景側度数総和について説明するための視差ヒストグラムを示した図である。It is the figure which showed the parallax histogram for demonstrating a foreground side frequency sum total and a 2nd foreground side frequency sum total. 第４の実施例における立体画像の分類の種類とそれに対応する画像補正処理を示す図である。It is a figure which shows the kind of classification | category of the stereo image in 4th Example, and the image correction process corresponding to it. 第５の実施例に係る立体画像表示装置の構成を示すブロック図である。It is a block diagram which shows the structure of the stereo image display apparatus which concerns on a 5th Example. 第５の実施例に係るシーンチェンジ検出部の構成を示すブロック図である。It is a block diagram which shows the structure of the scene change detection part which concerns on a 5th Example. シーンチェンジ検出に用いるフレームと、それによって検出されるシーンチェンジの位置の関係を説明する図である。It is a figure explaining the relationship between the frame used for scene change detection, and the position of the scene change detected by it. 特許文献１に記載の視差制御方法を説明する図である。It is a figure explaining the parallax control method of patent document 1. FIG. 特許文献２に記載の映像表示装置を説明する図である。It is a figure explaining the video display apparatus of patent document 2. FIG.

以下、添付図面を参照しながら本発明の好適な立体画像処理装置の実施例について詳細に説明する。本発明は、フィールド信号とフレーム信号のいずれに対しても適用できるものであるが、フィールドとフレームは互いに類似の関係にあるため、フレーム信号を代表例として説明するものとする。 Hereinafter, preferred embodiments of a stereoscopic image processing apparatus of the present invention will be described in detail with reference to the accompanying drawings. The present invention can be applied to both a field signal and a frame signal, but since a field and a frame have a similar relationship to each other, the frame signal will be described as a representative example.

まず図１により、本発明における、右目用画像データと左目用画像データからなる立体画像の、フレームの概念について説明する。図１（ａ）に示すのは、1枚の画像の左半分に横方向の解像度を通常の１／２にした左目用画像（Ｌ）を、右半分に横方向の解像度を通常の１／２にした右目用画像（Ｒ）を格納した形式での立体画像である。サイドバイサイド形式と呼ばれている。この場合は、左目用画像（Ｌ）と右目用画像（Ｒ）からなる１枚の画像が１フレームになる。動画の場合は、フレームレートをＸフレーム／秒とすると、上記形式の画像が１秒間にＸ枚、すなわちフレームレートと同一の枚数必要である。 First, referring to FIG. 1, the concept of a frame of a stereoscopic image composed of right-eye image data and left-eye image data in the present invention will be described. FIG. 1A shows a left-eye image (L) in which the horizontal resolution is halved to the normal half on the left half of one image, and the horizontal resolution to the normal half in the right half. 3 is a stereoscopic image in a format storing a right-eye image (R) 2. It is called side-by-side format. In this case, one image composed of the left-eye image (L) and the right-eye image (R) is one frame. In the case of a moving image, if the frame rate is X frames / second, X images of the above format are required per second, that is, the same number as the frame rate.

図１（ｂ）に示すのは、1枚の画像の上半分に縦方向の解像度を通常の１／２にした左目用画像（Ｌ）を、下半分に縦方向の解像度を通常の１／２にした右目用画像（Ｒ）を格納した形式での立体画像である。トップアンドボトム形式と呼ばれている。この場合も、左目用画像（Ｌ）と右目用画像（Ｒ）からなる１枚の画像が１フレームになる。動画もサイドバイサイド形式と同様、この形式の画像が１秒間にフレームレートと同一の枚数必要である。またこの他にも、左目用画像と右目用画像を、解像度はそのまま上下につないで大きな１枚の画像とする、フレームパッキングと呼ばれる方式もある。これも動画の場合は、この形式の画像が１秒間にフレームレートと同一の枚数必要である。 FIG. 1B shows a left-eye image (L) in which the vertical resolution is halved in the upper half of one image, and the vertical resolution in the lower half of the normal 1 /. 3 is a stereoscopic image in a format storing a right-eye image (R) 2. It is called the top and bottom format. Also in this case, one image composed of the left-eye image (L) and the right-eye image (R) is one frame. Similarly to the side-by-side format, the number of images in this format is the same as the frame rate per second. In addition to this, there is a method called frame packing in which the left-eye image and the right-eye image are connected to each other in the vertical direction so as to form a single large image. In the case of a moving image, the same number of images of this format as the frame rate is required per second.

一方、図１（ｃ）に示すのは、左目用画像（Ｌ）と右目用画像（Ｒ）を時間軸上で交互に並べた形式での立体画像である。フレームシーケンシャル形式と呼ばれている。この場合は本願では図１（ｃ）に示すように、１枚の左目用画像（Ｌ）と１枚の右目用画像（Ｒ）をセットにして１フレームと数える。よって動画の場合は、フレームレートをＸフレーム／秒とすると、左目用画像（Ｌ）が１秒間にＸ枚、右目用画像（Ｒ）が１秒間にＸ枚、交互に並んでいることから、合計で２Ｘ枚の画像となる。 On the other hand, FIG. 1C shows a stereoscopic image in a form in which left-eye images (L) and right-eye images (R) are alternately arranged on the time axis. It is called the frame sequential format. In this case, in the present application, as shown in FIG. 1C, one left-eye image (L) and one right-eye image (R) are counted as one frame. Therefore, in the case of a moving image, assuming that the frame rate is X frames / second, the left-eye image (L) is alternately arranged X times per second, and the right-eye image (R) X times per second. The total is 2X images.

このように、本願においては、画像の実際の格納形式に関わらず、左目用画像と右目用画像のセットで１フレームと考える。さらには、ｎ視点（ただしｎ＞２とする）の映像からなるｎ眼画像の場合も、ｎ眼分の画像をセットにして１フレームと考える。 Thus, in the present application, a set of the left-eye image and the right-eye image is considered as one frame regardless of the actual image storage format. Furthermore, in the case of an n-eye image composed of images of n viewpoints (where n> 2), the image for n eyes is set as one frame.

次に、本発明における、シーン及びシーンチェンジの定義について説明する。 Next, the definition of scenes and scene changes in the present invention will be described.

本発明におけるシーンとは、同一の補正処理を施すことができるような画像群を指すものとする。言い換えると、同一のシーンに分類された画像であれば、同じ補正処理を施すことができる。 The scene in the present invention refers to an image group that can be subjected to the same correction processing. In other words, the same correction process can be applied to images classified into the same scene.

本発明では、立体画像の視差情報を用いてシーンの類似を判定し、分類している。２つの立体画像の視差情報が類似していれば、それらの画像は同一のシーンに分類される。 In the present invention, scene similarity is determined and classified using disparity information of a stereoscopic image. If the parallax information of two stereoscopic images is similar, the images are classified into the same scene.

立体画像が動画である場合、単一のカメラで撮影した連続する一連の画像群は、同一シーンである場合が多い。これら一連の画像群においては、視差情報は大きく変わらない場合が多いためである。 When the stereoscopic image is a moving image, a series of consecutive images captured by a single camera is often the same scene. This is because the disparity information is not largely changed in these series of images.

また、立体画像の動画においてシーンが変わることをシーンチェンジと呼ぶ。立体画像の視差は通常、シーンチェンジの前後においては不連続に変化する。シーンごとに、被写体が変わったり、被写体とカメラとの位置関係が変わったりするためである。 In addition, a scene change in a stereoscopic video is called a scene change. The parallax of a stereoscopic image usually changes discontinuously before and after a scene change. This is because the subject changes for each scene, or the positional relationship between the subject and the camera changes.

本発明の第１の実施例は、立体画像処理装置であって、立体画像の奥行の幅が狭いために立体感が乏しく感じられるシーンに対し、立体感を強調する画像補正処理を行うことを目的としている。このために、立体画像の視差の範囲の幅を立体画像の視差の特徴量として用いてシーン分類を行う。 1st Example of this invention is a stereo image processing apparatus, Comprising: The image correction process which emphasizes a stereo effect is performed with respect to the scene where a stereo effect is felt poor because the width | variety of the depth of a stereo image is narrow. It is aimed. For this purpose, scene classification is performed using the width of the parallax range of the stereoscopic image as the parallax feature amount of the stereoscopic image.

図２は、第１の実施例にかかる立体画像処理装置１Ａの構成を示すブロック図である。立体画像処理装置１Ａは、入力部２、立体画像処理部３、視差特徴量算出部４Ａ、立体画像分類部５Ａ、画像遅延部６、画像補正部７Ａ、表示制御部８、表示部９、メガネ同期部１０から構成される。また、ユーザが装着するシャッタメガネ１１を用いる。 FIG. 2 is a block diagram illustrating the configuration of the stereoscopic image processing apparatus 1A according to the first embodiment. The stereoscopic image processing apparatus 1A includes an input unit 2, a stereoscopic image processing unit 3, a parallax feature amount calculation unit 4A, a stereoscopic image classification unit 5A, an image delay unit 6, an image correction unit 7A, a display control unit 8, a display unit 9, and glasses. The synchronization unit 10 is configured. Further, the shutter glasses 11 worn by the user are used.

入力部２は、立体画像データを立体画像処理装置１Ａに入力する部分である。入力される立体画像データは、放送波によるもの、記録メディアから電子的に読み出されたもの、通信により取得されたものなど、どのようなものでも構わない。すなわち、入力部２は、放送電波の受信機装置であっても良いし、半導体メモリ読み出し装置、光ディスクや磁気ディスクの読み出し装置、ネットワークとの通信機能を持つものであっても良い。あるいは、立体画像処理装置１Ａは立体画像が撮影可能なカメラであり、入力部２は多眼カメラであっても良い。要するに、立体画像として解釈可能なデータを入力できるものであれば良い。また、立体画像データは、右目用画像データと左目用画像データからなるものであっても良いし、多眼表示用の多視点画像データであっても良い。また、画像データと奥行データ又は視差データからなるものであっても良い。さらに、立体画像データは、静止画であっても良いし、動画であっても良い。入力部２は、入力された立体画像データを立体画像処理部３に送る。立体画像データが静止画の場合は、同じ画像を連続して出力し、動画と同様に扱う。 The input unit 2 is a part that inputs stereoscopic image data to the stereoscopic image processing apparatus 1A. The input stereoscopic image data may be any data such as a broadcast wave, electronically read from a recording medium, or acquired through communication. That is, the input unit 2 may be a broadcast radio receiver device, or may have a semiconductor memory reading device, an optical disk or magnetic disk reading device, or a communication function with a network. Alternatively, the stereoscopic image processing apparatus 1A may be a camera that can capture a stereoscopic image, and the input unit 2 may be a multi-view camera. In short, any data can be used as long as it can input data that can be interpreted as a stereoscopic image. Further, the stereoscopic image data may be composed of right-eye image data and left-eye image data, or may be multi-view image data for multi-view display. Further, it may be composed of image data and depth data or parallax data. Furthermore, the stereoscopic image data may be a still image or a moving image. The input unit 2 sends the input stereoscopic image data to the stereoscopic image processing unit 3. When the stereoscopic image data is a still image, the same image is continuously output and handled in the same manner as a moving image.

立体画像処理部３は、入力部２により画像データの入力を受け、左目用画像データと右目用画像データに展開する。入力された画像データに付加情報がある場合は、付加情報を抽出する。付加情報は、撮影時のカメラパラメータ、視差情報や奥行き情報、視差のヒストグラム情報等であっても良い。前記の通り、入力部２に入力される立体画像データは、右目用画像データと左目用画像データからなるもののほか、様々な形式がありうる。立体画像データが右目用画像データと左目用画像データからなる場合は、それをそのまま用いる。多視点画像データである場合は、そこから２視点分のデータを選択し右目用画像データと左目用画像データとする。立体画像処理部３は、左目用画像データと右目用画像データを、視差特徴量算出部４Ａ及び画像遅延部６に送る。付加情報がある場合は、それも視差特徴量算出部４Ａに送る。なお、立体画像データが画像データと視差データからなる場合は、それらのデータをそのまま視差特徴量算出部４Ａに送ってもよい。また、立体画像データが画像データと奥行データからなる場合は、奥行データを視差データに変換し、画像データと視差データを視差特徴量算出部４Ａに送ってもよい。 The stereoscopic image processing unit 3 receives image data input from the input unit 2 and develops it into left-eye image data and right-eye image data. If there is additional information in the input image data, the additional information is extracted. The additional information may be camera parameters at the time of shooting, parallax information and depth information, parallax histogram information, and the like. As described above, the stereoscopic image data input to the input unit 2 may have various formats in addition to the right-eye image data and the left-eye image data. If the stereoscopic image data consists of right-eye image data and left-eye image data, they are used as they are. In the case of multi-viewpoint image data, data for two viewpoints is selected from the data and used as right-eye image data and left-eye image data. The stereoscopic image processing unit 3 sends the left-eye image data and the right-eye image data to the parallax feature amount calculation unit 4A and the image delay unit 6. If there is additional information, it is also sent to the parallax feature amount calculation unit 4A. When the stereoscopic image data is composed of image data and parallax data, these data may be sent to the parallax feature amount calculation unit 4A as they are. Further, when the stereoscopic image data includes image data and depth data, the depth data may be converted into parallax data, and the image data and the parallax data may be sent to the parallax feature amount calculation unit 4A.

視差特徴量算出部４Ａは、左目用画像データと右目用画像データから視差を算出し、さらに視差に関する特徴量である視差特徴量を抽出する。付加情報を用いて視差特徴量を抽出する場合もある。これらの処理の詳細な内容については後述する。視差特徴量算出部４Ａは、抽出した視差特徴量を立体画像分類部５Ａに送る。 The parallax feature amount calculation unit 4A calculates a parallax from the left-eye image data and the right-eye image data, and further extracts a parallax feature amount that is a feature amount related to the parallax. In some cases, a parallax feature amount is extracted using additional information. Details of these processes will be described later. The parallax feature amount calculation unit 4A sends the extracted parallax feature amount to the stereoscopic image classification unit 5A.

立体画像分類部５Ａは、視差特徴量を用いて立体画像を所定の種類に分類する。そして立体画像分類結果を画像補正部７Ａに出力する。 The stereoscopic image classification unit 5A classifies the stereoscopic image into a predetermined type using the parallax feature amount. Then, the stereoscopic image classification result is output to the image correction unit 7A.

画像遅延部６は、立体画像処理部３より入力された左目用画像データと右目用画像データを保持するメモリを有し、入力された画像データを１フレーム分遅延させて、画像補正部７Ａに出力する。すなわち、（ｎ＋１）フレーム目の画像が入力されているときは、ｎフレーム目の画像を出力する。ｎフレーム目の画像に対して視差特徴量算出部４Ａ、立体画像分類部５Ａで処理を行った結果は、ｎフレーム目の画像を全て処理した後でなければ得られない。よって、その結果を用いて同じｎフレーム目の画像に画像補正処理を施すためには、ｎフレーム目の画像データを保持して遅延して出力することが必要となる。 The image delay unit 6 has a memory for holding the left-eye image data and the right-eye image data input from the stereoscopic image processing unit 3, delays the input image data by one frame, and sends it to the image correction unit 7A. Output. That is, when the (n + 1) th frame image is input, the nth frame image is output. The result of processing the parallax feature quantity calculation unit 4A and the stereoscopic image classification unit 5A on the nth frame image can be obtained only after all the nth frame images have been processed. Therefore, in order to perform image correction processing on the same n-th frame image using the result, it is necessary to hold and delay the n-th frame image data.

画像補正部７Ａは、立体画像分類部５Ａから入力した立体画像分類結果を元に、画像補正処理を決定し、画像遅延部６から入力した左目用画像データと右目用画像データに対して画像補正処理を行う。そして画像補正処理後の左目用画像データと右目用画像データを表示制御部８に出力する。 The image correction unit 7A determines an image correction process based on the stereoscopic image classification result input from the stereoscopic image classification unit 5A, and performs image correction on the left-eye image data and the right-eye image data input from the image delay unit 6. Process. Then, the image data for left eye and the image data for right eye after image correction processing are output to the display control unit 8.

表示制御部８は、画像補正部７Ａから入力した画像補正処理後の左目用画像データと右目用画像データを、立体画像の提示方法に合わせた方式で、表示部９に出力する。また、観察者の装着したシャッタメガネ１１を表示部９における画像表示と同期して動作させるための同期信号をメガネ同期部１０に送る。 The display control unit 8 outputs the left-eye image data and the right-eye image data after the image correction process input from the image correction unit 7A to the display unit 9 in a manner that matches the stereoscopic image presentation method. In addition, a synchronization signal for operating the shutter glasses 11 worn by the observer in synchronization with image display on the display unit 9 is sent to the glasses synchronization unit 10.

表示部９は、表示制御部８から入力した画像を随時表示する。本実施例では、表示部９として液晶表示パネルを用い、左目用画像と右目用画像を交互に表示する。 The display unit 9 displays the image input from the display control unit 8 as needed. In this embodiment, a liquid crystal display panel is used as the display unit 9, and the left-eye image and the right-eye image are alternately displayed.

メガネ同期部１０は、表示制御部８から送られた同期信号を、赤外線や電波等を用いて、シャッタメガネ１１に対して送信する。 The glasses synchronization unit 10 transmits the synchronization signal sent from the display control unit 8 to the shutter glasses 11 using infrared rays or radio waves.

シャッタメガネ１１は、メガネ同期部１０から送信された同期信号を受信し、それに従って右目用及び左目用のシャッタを開閉する。観察者はこのシャッタメガネ１１をかけて表示部９を見る。本実施例では前記の通り、表示部９に液晶表示パネルを用い、左目用画像と右目用画像を交互に表示し、観察者の装着したシャッタメガネ１１と同期して立体視を行う方式を用いている。表示部９に左目用画像が表示されているときはシャッタメガネ１１の左目用シャッタを開、右目用シャッタを閉とすることにより左目用画像を左目に呈示し、右目用画像が表示されているときには左目用シャッタを閉、右目用シャッタを開とすることにより右目用画像を右目に呈示して、立体視を実現する。 The shutter glasses 11 receive the synchronization signal transmitted from the glasses synchronization unit 10 and open and close the shutters for the right eye and the left eye accordingly. An observer sees the display unit 9 wearing the shutter glasses 11. In the present embodiment, as described above, a liquid crystal display panel is used for the display unit 9, a left-eye image and a right-eye image are alternately displayed, and a stereoscopic view is performed in synchronization with the shutter glasses 11 worn by the observer. ing. When the left-eye image is displayed on the display unit 9, the left-eye image is presented to the left eye by opening the left-eye shutter of the shutter glasses 11 and closing the right-eye shutter, and the right-eye image is displayed. Sometimes the left-eye shutter is closed and the right-eye shutter is opened, so that the right-eye image is presented to the right eye to realize stereoscopic viewing.

さらに、視差特徴量算出部４Ａ、立体画像分類部５Ａ、画像補正部７Ａのより詳細な処理内容について説明する。 Furthermore, more detailed processing contents of the parallax feature amount calculation unit 4A, the stereoscopic image classification unit 5A, and the image correction unit 7A will be described.

図３は、視差特徴量算出部４Ａの構成を示すブロック図である。視差算出部４１、視差ヒストグラム作成部４２、視差範囲幅算出部４３から構成されている。 FIG. 3 is a block diagram illustrating a configuration of the parallax feature amount calculation unit 4A. It includes a parallax calculation unit 41, a parallax histogram creation unit 42, and a parallax range width calculation unit 43.

視差算出部４１は、左目用画像データと右目用画像データから、立体画像を構成する右目用画像及び左目用画像の対応点のずれ、即ち視差を、ブロックマッチング等を用いて画像全域にわたって求める。この時、画像内の画素毎に視差を求めても良いし、所定の大きさのブロック（例えば８×８ピクセルのブロック）毎に視差を求めても良い。なお、入力部２に入力される立体画像データの付加情報として、画素毎あるいは所定の大きさのブロック毎の視差が付加されていた場合は、視差算出部４１における処理をスキップしても良い。また、立体画像処理部３から立体画像データとして画像データと視差データが送られてきた場合は、視差算出部４１における処理をスキップする。また、付加情報として画素毎あるいは所定の大きさのブロック毎の奥行及び撮影時のカメラの間隔と焦点距離の情報が付加されていた場合は、カメラの間隔と焦点距離の情報を用いて、奥行を視差に変換しても良い。視差算出部４１は、視差を視差ヒストグラム作成部４２に送る。 The parallax calculation unit 41 obtains a shift between corresponding points of the right-eye image and the left-eye image constituting the stereoscopic image, that is, parallax from the left-eye image data and the right-eye image data over the entire image using block matching or the like. At this time, the parallax may be obtained for each pixel in the image, or the parallax may be obtained for each block of a predetermined size (for example, an 8 × 8 pixel block). Note that when the parallax for each pixel or each block having a predetermined size is added as the additional information of the stereoscopic image data input to the input unit 2, the processing in the parallax calculation unit 41 may be skipped. When image data and parallax data are sent from the stereoscopic image processing unit 3 as stereoscopic image data, the processing in the parallax calculation unit 41 is skipped. In addition, when additional information includes depth information for each pixel or block of a predetermined size and camera interval and focal length information at the time of shooting, information about the camera interval and focal length is used to determine the depth. May be converted into parallax. The parallax calculation unit 41 sends the parallax to the parallax histogram creation unit 42.

視差ヒストグラム作成部４２は、視差算出部４１で求めた画像全域の視差のデータから、その度数分布である視差ヒストグラムを作成する。視差ヒストグラムの代用として奥行きをヒストグラムにしたものでも良い。すなわち立体画像に表現されている表示物の視差またはそれと同等の量の度数分布が表されているものであれば良い。視差ヒストグラム作成部４２は、視差ヒストグラムを視差範囲幅算出部４３に送る。 The parallax histogram creation unit 42 creates a parallax histogram that is a frequency distribution from the parallax data of the entire image obtained by the parallax calculation unit 41. A depth histogram may be used instead of the parallax histogram. In other words, any display may be used as long as the parallax of the display object represented in the stereoscopic image or a frequency distribution of the same amount is represented. The parallax histogram creation unit 42 sends the parallax histogram to the parallax range width calculation unit 43.

視差範囲幅算出部４３は、視差ヒストグラム作成部４２から入力した視差ヒストグラムから、その画像における最近景の視差量と最遠景の視差量を求め、その幅を視差特徴量として立体画像分類部５Ａに出力する。このように、立体画像における被写体が存在する視差の範囲を、以下では視差範囲と呼ぶ。また、視差範囲の幅を視差範囲幅と呼び、Ｗと表記する。 The parallax range width calculation unit 43 obtains the parallax amount of the nearest scene and the parallax amount of the farthest view in the image from the parallax histogram input from the parallax histogram creation unit 42, and uses the width as the parallax feature amount to the stereoscopic image classification unit 5A. Output. In this manner, the parallax range in which a subject in a stereoscopic image exists is hereinafter referred to as a parallax range. Further, the width of the parallax range is referred to as a parallax range width and is denoted as W.

図４を用いて視差範囲幅算出部４３の処理の詳細を説明する。図４は視差ヒストグラムの一例を示している。横軸は視差量、縦軸は度数である。視差量が０の点を基準とし、０以上が遠景側、０未満が近景側としている。この視差ヒストグラムには、近景側に小さなピーク、遠景側に大きなピークがある。例えば遠い風景をバックにして近くにいる数人の人間を撮影した場合に、このようなヒストグラムの形となる。閾値Ｔｄは視差ヒストグラムの度数の閾値である。実際の視差ヒストグラムは図４のように滑らかではなく小さなピークが存在するので、その影響を除外するために、閾値Ｔｄを用いる。視差ヒストグラムと閾値Ｔｄの交点の中で最大の視差量を持つ点の視差量を最遠景の視差量とし、最小の視差量を持つ点の視差量を最近景の視差量とする。つまり、図４のように複数の交点がある場合でも、視差が最大及び最小の交点の視差量を最遠景の視差量及び最近景の視差量とする。そして、最遠景の視差量から最近景の視差量を引いたものを視差範囲幅Ｗとする。 Details of the processing of the parallax range width calculation unit 43 will be described with reference to FIG. FIG. 4 shows an example of a parallax histogram. The horizontal axis is the amount of parallax, and the vertical axis is the frequency. A point where the amount of parallax is 0 is used as a reference, 0 or more is the far view side, and less than 0 is the close view side. This parallax histogram has a small peak on the near side and a large peak on the far side. For example, when a few people nearby are photographed against a background of a distant landscape, such a histogram is formed. The threshold value Td is a threshold value of the frequency of the parallax histogram. Since the actual parallax histogram has a small peak that is not smooth as shown in FIG. 4, the threshold value Td is used to exclude the influence. The parallax amount of the point having the maximum parallax amount at the intersection of the parallax histogram and the threshold value Td is set as the parallax amount of the farthest view, and the parallax amount of the point having the minimum parallax amount is set as the parallax amount of the nearest scene. That is, even when there are a plurality of intersections as shown in FIG. 4, the parallax amount at the intersection where the parallax is maximum and minimum is set as the parallax amount of the farthest view and the parallax amount of the nearest scene. The parallax range width W is obtained by subtracting the parallax amount of the latest view from the parallax amount of the farthest view.

なお、ここでは一つの閾値Ｔｄを用いて最遠景の視差量及び最近景の視差量を検出したが、最近景の視差量及び最遠景の視差量で異なる閾値を用いて検出しても構わない。 Here, the disparity amount of the farthest view and the disparity amount of the closest view are detected using one threshold Td, but may be detected using different threshold values for the disparity amount of the nearest view and the disparity amount of the most distant view. .

立体画像分類部５Ａでは、視差範囲幅Ｗを用いて立体画像を分類する。本実施例では、視差範囲幅Ｗの大きさで２種類に分類する。判定のための閾値をＴｗ１Ａとすると、Ｗ＜Ｔｗ１Ａの場合、視差範囲幅の狭い立体画像と分類し、Ｔｗ１Ａ≦Ｗの場合、視差範囲幅の広い立体画像と分類する。この分類結果を画像補正部７Ａに出力する。 The stereoscopic image classification unit 5A classifies the stereoscopic image using the parallax range width W. In this embodiment, the parallax range width W is classified into two types. Assuming that the threshold for determination is Tw1A, if W <Tw1A, the stereoscopic image is classified as a narrow parallax range, and if Tw1A ≦ W, the stereo image is classified as a wide parallax range. The classification result is output to the image correction unit 7A.

このように視差範囲幅の狭い立体画像を分類するため、例えばＴｗ１Ａは以下のように定めることができる。人間が許容できる遠景側の最大視差量は、前記の通り画面上での視差量が無限遠を見ている時の瞳孔間の距離と等しくなる量である。これを遠景側最大視差量とする。一方、人間が許容できる近景側の最大視差量は一般に、人間の目が画面上を見たときの輻輳角と、被写体の見かけの位置を見たときの輻輳角との差である視差角差が１°を超えないような視差量が安全であるとされている。ここから、視差角差が１°となる近景側の視差量を近景側最大視差量とする。そして、遠景側最大視差量から近景側最大視差量を引いたものを最大視差範囲幅Ｗｍａｘとする。このＷｍａｘを基準にしてＴｗ１Ａを定めることができる。例えば、Ｗｍａｘの１０％の値をＴｗ１Ａとする。 In order to classify stereoscopic images with a narrow parallax range in this way, for example, Tw1A can be determined as follows. The maximum disparity amount on the distant view side that can be accepted by humans is an amount that the disparity amount on the screen is equal to the distance between the pupils when looking at infinity. This is the far-field maximum parallax amount. On the other hand, the maximum amount of parallax on the near side that humans can tolerate is generally the difference between the angle of convergence when the human eye looks at the screen and the angle of convergence when the apparent position of the subject is viewed. It is said that the amount of parallax that does not exceed 1 ° is safe. From here, the foreground side parallax amount at which the parallax angle difference is 1 ° is set as the foreground side maximum parallax amount. Then, the maximum disparity range width Wmax is obtained by subtracting the distant view side maximum disparity amount from the distant view side maximum disparity amount. Tw1A can be determined based on this Wmax. For example, a value of 10% of Wmax is Tw1A.

図５に、上記分類処理による２種類の分類に関する典型的な視差ヒストグラムの例を示す。各視差ヒストグラムの横軸は視差量、縦軸は度数であり、中央の細い縦線は、視差０の線である。また、各マスの点線以下の部分に、後述の画像補正処理の種類を記載している。 FIG. 5 shows an example of a typical parallax histogram relating to two types of classification by the classification process. The horizontal axis of each parallax histogram is the amount of parallax, the vertical axis is the frequency, and the thin vertical line at the center is a line of parallax 0. In addition, the types of image correction processing to be described later are described in portions below the dotted line of each square.

図５の左側の視差ヒストグラムはＷ＜Ｔｗ１Ａを満たし、視差範囲幅の狭い立体画像と分類されたものである。このように視差ヒストグラムが狭い範囲に収まっている。つまり、画像の中の被写体間の奥行の幅が狭く、立体感が乏しい立体画像である。図５の右側の視差ヒストグラムはＴｗ１Ａ≦Ｗを満たし、視差範囲幅の広い立体画像と分類されたものである。このように視差ヒストグラムが広い範囲に分布している。つまり、画像の中に様々な奥行の部分があり、立体感が豊かな立体画像である。 The parallax histogram on the left side of FIG. 5 satisfies the W <Tw1A and is classified as a stereoscopic image with a narrow parallax range width. Thus, the parallax histogram is within a narrow range. That is, it is a stereoscopic image with a narrow depth between subjects in the image and a poor stereoscopic effect. The parallax histogram on the right side of FIG. 5 is classified as a stereoscopic image that satisfies Tw1A ≦ W and has a wide parallax range. Thus, the parallax histogram is distributed over a wide range. That is, there are various depth portions in the image, and the stereoscopic image is rich in stereoscopic effect.

画像補正部７Ａは、視差範囲幅Ｗの大きさによる立体画像分類結果を用い、立体画像が視差範囲幅の狭いシーンと分類された場合に、立体感を強調するよう視差を調整する処理を行う。視差調整の手段として様々な方法が考えられるが、本実施例では左右の画像をそれぞれ水平方向にシフトさせることによる視差調整方法を用いる。図６〜図８を用いて、この原理を説明する。 The image correcting unit 7A uses the stereoscopic image classification result based on the size of the parallax range width W, and performs a process of adjusting the parallax so as to enhance the stereoscopic effect when the stereoscopic image is classified as a scene with a narrow parallax range width. . Various methods are conceivable as the parallax adjustment means. In this embodiment, a parallax adjustment method by shifting the left and right images in the horizontal direction is used. This principle will be described with reference to FIGS.

図６（ａ）は、丸と三角の二つの物体が写っている左右の画像からなる立体画像を示している。図６（ｂ）は、人間がこの立体画像を立体画像表示装置を用いて見たときに各物体の奥行がどのように知覚されるかを示している。表示された左右の画像上の各物体の位置関係により、三角の物体は立体画像表示装置より手前に、丸の物体は奥に知覚される。 FIG. 6A shows a stereoscopic image composed of left and right images in which two objects, a circle and a triangle, are shown. FIG. 6B shows how the depth of each object is perceived when a human views this stereoscopic image using a stereoscopic image display device. Due to the positional relationship between the objects on the displayed left and right images, a triangular object is perceived in front of the stereoscopic image display device, and a round object is perceived in the back.

図７（ａ）は、図６（ａ）の左目用画像を左にずらし右目用画像を右にずらすシフト処理を施した立体画像を示している。この処理により、視差は画面全体で一様に大きくなる。図７（ｂ）は、図７（ａ）の立体画像による各物体の奥行の知覚を示している。図７（ｂ）を図６（ｂ）と比較すると、いずれの物体もより遠景方向に移動して見える効果があることがわかる。また、丸の物体の方が三角の物体より多く遠景方向に移動しており、二つの物体の奥行の幅が広がっていることがわかる。つまり、より立体感が強調されて見える効果があることがわかる。 FIG. 7A shows a stereoscopic image that has been subjected to a shift process in which the left-eye image of FIG. 6A is shifted to the left and the right-eye image is shifted to the right. With this process, the parallax increases uniformly over the entire screen. FIG. 7B shows the perception of the depth of each object by the stereoscopic image of FIG. When FIG. 7B is compared with FIG. 6B, it can be seen that there is an effect that any object appears to move further in the distant view direction. In addition, it can be seen that the round object moves more in the distant direction than the triangular object, and the depth of the two objects is wider. That is, it can be seen that there is an effect that the stereoscopic effect is more emphasized.

図８（ａ）は逆に、図６（ａ）の左目用画像を右にずらし右目用画像を左にずらすシフト処理を施した立体画像を示している。この処理により、視差は画面全体で一様に小さくなる。図８（ｂ）は、図８（ａ）の立体画像による各物体の奥行の知覚を示している。図８（ｂ）を図６（ｂ）と比較すると、いずれの物体もより近景方向に移動して見える効果があることがわかる。また、丸の物体の方が三角の物体より多く近景方向に移動しており、二つの物体の奥行の幅が狭まっていることがわかる。つまり、より立体感が弱められて見える効果があることがわかる。 In contrast, FIG. 8A shows a stereoscopic image subjected to a shift process in which the left-eye image in FIG. 6A is shifted to the right and the right-eye image is shifted to the left. By this process, the parallax is uniformly reduced over the entire screen. FIG. 8B shows the perception of the depth of each object by the stereoscopic image of FIG. Comparing FIG. 8B with FIG. 6B, it can be seen that there is an effect that any object appears to move more in the foreground direction. It can also be seen that the round object moves more in the foreground direction than the triangular object, and the depth of the two objects is narrower. In other words, it can be seen that there is an effect that the three-dimensional effect is further weakened.

このように、単純に右目用画像と左目用画像の位置関係を水平方向にずらすシフト処理を施して視差を一様に増減させるだけでも、画像中の物体の知覚される位置を変える効果や、奥行の幅が変わることにより立体感を強調したり弱めたりする効果が得られる。この原理を用いると、物体が手前に飛び出しすぎて見える画像は奥にずらすように調整し、物体が奥すぎて見える画像は手前にずらすように調整する、といったことが可能である。また、意図的に立体感を強調したり弱めたりすることも可能である。なお、このシフト処理では視差を一様に増減させており、視差範囲幅は変わっていないが、奥行の幅は増減する効果がある、ということである。 In this way, the effect of changing the perceived position of an object in the image simply by performing a shift process that shifts the positional relationship between the right-eye image and the left-eye image in the horizontal direction to increase or decrease the parallax uniformly, By changing the width of the depth, an effect of enhancing or weakening the stereoscopic effect can be obtained. By using this principle, it is possible to adjust an image so that an object that is projected too far forward is shifted to the back, and to adjust an image that is viewed too deeply to the front. It is also possible to intentionally enhance or weaken the stereoscopic effect. In this shift process, the parallax is uniformly increased and decreased, and the parallax range width is not changed, but the depth width is increased and decreased.

画像補正部７Ａは、立体画像が視差範囲幅の狭いシーンと分類された場合、すなわち図５における「処理Ａ」と記載のあるマスに分類された場合に、左目用画像を左にずらし右目用画像を右にずらすシフト処理を行い、視差を画面全体で一様に増加させる。このシフト処理は上記で図７を用いて説明したように、奥行の幅を広げてより立体感を強調する効果がある。 When the stereoscopic image is classified as a scene with a narrow parallax range, that is, when the image correction unit 7A is classified into a square having “Processing A” in FIG. Shift processing is performed to shift the image to the right, and the parallax is increased uniformly over the entire screen. As described above with reference to FIG. 7, this shift process has the effect of widening the depth and enhancing the stereoscopic effect.

次に、上記の視差調整のためのシフト処理の調整量の決定方法を説明する。画像補正前の立体画像の立体感が乏しいほど、立体感をより強く強調する画像補正処理を施す、すなわち調整量を大きくすることが望ましい。上記シフト処理の調整量をａとしたとき、以下のような方法でａを定めることができる。 Next, a method for determining the adjustment amount of the shift process for parallax adjustment will be described. As the stereoscopic effect of the stereoscopic image before image correction is poorer, it is desirable to perform an image correction process that emphasizes the stereoscopic effect more strongly, that is, to increase the adjustment amount. When the adjustment amount of the shift process is a, a can be determined by the following method.

例えば、前記の視差範囲幅Ｗ及び定数ｃ１を用いて、ａ＝ｃ１／Ｗと定めることができる。視差範囲幅Ｗが狭く立体感が乏しい立体画像ほど、調整量ａが大きくなるようにしたものである。 For example, a = c1 / W can be determined using the parallax range width W and the constant c1. The adjustment amount “a” is increased as the stereoscopic image has a narrower parallax range width W and less stereoscopic effect.

また、視差ヒストグラムからその最頻値Ｈｍａｘを求め、定数ｃ２を用いて、ａ＝ｃ２×Ｈｍａｘと定めることもできる。図４にＨｍａｘを示す。一般に、視差範囲幅が狭いと視差ヒストグラムの最頻値の度数Ｈｍａｘは高くなる。そこでＨｍａｘが大きいほど調整量ａが大きくなるようにしたものである。 Further, the mode value Hmax can be obtained from the parallax histogram, and a = c2 × Hmax can be determined using the constant c2. FIG. 4 shows Hmax. In general, when the parallax range width is narrow, the frequency Hmax of the mode value of the parallax histogram increases. Therefore, the adjustment amount a is increased as Hmax is increased.

あるいは、上記の視差範囲幅Ｗと視差ヒストグラムの最頻値Ｈｍａｘの両方と定数ｃ３を用いて、ａ＝ｃ３／Ｗ×Ｈｍａｘと定めることもできる。さらには上記の方法を組み合わせ、例えばａ＝ｃ１／Ｗ＋ｃ２×Ｈｍａｘと定めても良い。 Alternatively, a = c3 / W × Hmax can be determined by using both the parallax range width W and the mode Hmax of the parallax histogram and the constant c3. Furthermore, the above methods may be combined, for example, a = c1 / W + c2 × Hmax.

立体画像がその他の分類をされた場合、すなわち図５において「−」と記載のあるマスに分類された場合は、画像補正部７Ａは画像補正処理を行わない。この場合は視差範囲幅が中程度以上であり、特に画像補正処理を行わなくても十分立体感が感じられるためである。 When the stereoscopic image is classified in other manners, that is, when the stereoscopic image is classified into a square having “−” in FIG. 5, the image correction unit 7A does not perform image correction processing. In this case, the parallax range width is medium or more, and a stereoscopic effect can be sufficiently felt without performing image correction processing.

以上のように、本実施例における立体画像処理装置１Ａは、立体画像の視差範囲幅Ｗを用いて、立体画像を奥行の幅の狭いシーンと奥行の幅の広いシーンに分類する。これにより、立体画像の視差量の特徴により立体画像の内容に関するシーン分類を行うことができるという効果を有する。このような分類は、従来の輝度情報等を用いた画像の分類手段では不可能であった。 As described above, the stereoscopic image processing apparatus 1A according to the present embodiment classifies a stereoscopic image into a scene having a narrow depth and a scene having a wide depth by using the parallax range width W of the stereoscopic image. Thereby, it has the effect that the scene classification regarding the content of the stereoscopic image can be performed by the feature of the parallax amount of the stereoscopic image. Such classification is not possible with conventional image classification means using luminance information or the like.

また本実施例における立体画像処理装置１Ａは、立体画像のシーン分類結果を用いて、奥行の幅の狭いシーン、すなわち立体感の乏しいシーンに対し、立体感を強調する画像補正処理を施す。これにより、立体感が少ないシーンに対して選択的に、そのシーンに適した立体感の強調処理を行うことができる、という効果を有する。 In addition, the stereoscopic image processing apparatus 1A according to the present embodiment performs image correction processing for enhancing the stereoscopic effect on a scene having a narrow depth, that is, a scene having a poor stereoscopic effect, using the scene classification result of the stereoscopic image. Accordingly, there is an effect that it is possible to selectively perform a stereoscopic effect enhancement process suitable for a scene with little stereoscopic effect.

また本実施例における立体画像処理装置１Ａは、視差範囲幅Ｗや最頻値Ｈｍａｘといった、立体画像の立体感に関連する特徴量を用いて上記シフト処理の調整量を決定する。これにより、各立体画像に対し最適な調整量を設定することができる、という効果を有する。 Further, the stereoscopic image processing apparatus 1A according to the present embodiment determines the adjustment amount of the shift processing using a feature amount related to the stereoscopic effect of the stereoscopic image, such as the parallax range width W and the mode value Hmax. Thereby, there is an effect that an optimal adjustment amount can be set for each stereoscopic image.

本発明の第２の実施例は、立体画像処理装置であって、第１の実施例で説明した画像補正処理に加えて、被写体の飛び出しが少ないために立体感が乏しく感じられるシーンに対し、被写体を手前に飛び出させて立体感を強調する画像補正処理を行うことを目的としている。このために第１の実施例と比べて、視差特徴量としてさらに、近景側に存在する被写体が画像内に占める割合を示す量を用いてシーン分類を行う。 The second embodiment of the present invention is a stereoscopic image processing apparatus, and in addition to the image correction processing described in the first embodiment, for a scene in which the stereoscopic effect is felt poor because the subject does not pop out. An object of the present invention is to perform image correction processing for emphasizing the three-dimensional effect by popping out the subject. For this reason, as compared with the first embodiment, scene classification is performed by using an amount indicating the ratio of the subject existing in the foreground side in the image as the parallax feature amount.

図９は、第２の実施例にかかる立体画像処理装置１Ｂの構成を示すブロック図である。第１の実施例に係る立体画像処理装置１Ａとは、視差特徴量算出部４Ｂ、立体画像分類部５Ｂ、画像補正部７Ｂが異なる。同一の部分に関する説明は省略し、異なる部分を中心に説明する。 FIG. 9 is a block diagram illustrating a configuration of a stereoscopic image processing apparatus 1B according to the second embodiment. The stereoscopic image processing apparatus 1A according to the first embodiment is different in a parallax feature amount calculation unit 4B, a stereoscopic image classification unit 5B, and an image correction unit 7B. The description about the same part is abbreviate | omitted, and it demonstrates centering on a different part.

図１０は、視差特徴量算出部４Ｂの構成を示すブロック図である。視差特徴量算出部４Ａと比較すると、近景側度数総和算出部４４が追加されている。 FIG. 10 is a block diagram illustrating a configuration of the parallax feature amount calculation unit 4B. Compared to the parallax feature amount calculation unit 4A, a foreground side power total calculation unit 44 is added.

近景側度数総和算出部４４は、視差ヒストグラム作成部４２から入力した視差ヒストグラムから、近景側の度数の総和である近景側度数総和Ｓを求め、これを視差特徴量として立体画像分類部５Ｂに出力する。 The foreground-side power sum total calculation unit 44 obtains a foreground-side power sum S that is the sum of the foreground-side frequencies from the parallax histogram input from the parallax histogram creation unit 42, and outputs this to the stereoscopic image classification unit 5B as a parallax feature amount. To do.

図１１は、近景側度数総和Ｓについて説明するための視差ヒストグラムを示した図である。横軸は視差量、縦軸は度数である。視差量が０の点を基準とし、０以上が遠景側、０未満が近景側としている。所定の閾値Ｔｄ１を用いて、視差ヒストグラムにおいて視差量がＴｄ１以下のビンの度数の総和を求め、これを近景側度数総和Ｓとする。図１１において網掛けを施した部分が、近景側度数総和Ｓに相当する。視差ヒストグラムが視差量−ｌからｍ（ｌ、ｍは自然数）の範囲で作成されており、視差ヒストグラムの各ビンの幅を１、各ビンの度数をｈ_ｉ（−ｌ≦ｉ≦ｍ）とすると、近景側度数総和Ｓは以下の式で求めることができる。
Ｓ＝Σｈ_ｉ（−ｌ≦ｉ≦Ｔｄ１）
一般的にはＴｄ１＝０とし、視差ヒストグラムの視差量０までの度数総和を算出する。しかしこれに限られるわけではなく、０以外の視差量までの度数総和を算出しても良い。例えば、視差推定には推定誤りがあることから、近景側の視差を持つ可能性がある画素まで広く含めて度数を求めるために、Ｔｄ＝１０として度数総和を求めても良い。 FIG. 11 is a diagram showing a parallax histogram for explaining the foreground-side power sum S. FIG. The horizontal axis is the amount of parallax, and the vertical axis is the frequency. A point where the amount of parallax is 0 is used as a reference, and 0 or more is the far side, and less than 0 is the near side. Using the predetermined threshold value Td1, the sum of the frequencies of bins whose parallax amount is equal to or less than Td1 is obtained in the parallax histogram, and this is used as the near-field-side frequency sum S. In FIG. 11, the shaded portion corresponds to the foreground side power sum S. Parallax histogram m from the parallax amount -l (l, m are natural numbers) is written in a range of the width of each bin of the parallax histogram 1, and the frequency of each bin _{h i (-l ≦ i ≦ m} ) Then, the foreground side power sum S can be obtained by the following equation.
S = Σh _i (−1 ≦ i ≦ Td1)
In general, Td1 = 0 is set, and the total frequency up to the parallax amount 0 in the parallax histogram is calculated. However, the present invention is not limited to this, and the total frequency up to a parallax amount other than 0 may be calculated. For example, since there is an estimation error in the parallax estimation, the frequency sum may be obtained by setting Td = 10 in order to obtain the frequency widely including pixels that may have a near-field parallax.

一方、視差範囲幅算出部４３は、前記の通り視差範囲幅Ｗを視差特徴量として出力する。従って、視差特徴量算出部４Ｂは視差特徴量として、視差範囲幅Ｗ及び近景側度数総和Ｓの２つの情報を立体画像分類部５Ｂに出力する。 On the other hand, the parallax range width calculation unit 43 outputs the parallax range width W as a parallax feature amount as described above. Therefore, the parallax feature amount calculation unit 4B outputs two pieces of information, that is, the parallax range width W and the foreground side power sum S as the parallax feature amount, to the stereoscopic image classification unit 5B.

立体画像分類部５Ｂは、視差特徴量である視差範囲幅Ｗ及び近景側度数総和Ｓを用いて立体画像を分類する。本実施例では、視差範囲幅Ｗの大きさで３種類に、近景側度数総和Ｓの大きさで３種類に、それぞれ分類する。従ってトータルでは３×３＝９種類に分類できる。この分類結果を画像補正部７Ｂに出力する。 The stereoscopic image classification unit 5B classifies the stereoscopic image using the parallax range width W and the foreground-side power sum S that are parallax feature amounts. In this embodiment, the parallax range width W is classified into three types, and the foreground side power sum S is classified into three types. Therefore, it can be classified into 3 × 3 = 9 types in total. The classification result is output to the image correction unit 7B.

視差範囲幅Ｗについては、判定のための閾値Ｔｗ１Ｂ、Ｔｗ２Ｂ（Ｔｗ１Ｂ＜Ｔｗ２Ｂ）を用い、以下のように分類する。
・Ｗ＜Ｔｗ１Ｂ：視差範囲幅の狭い立体画像
・Ｔｗ１Ｂ≦Ｗ≦Ｔｗ２Ｂ：視差範囲幅が中程度の立体画像
・Ｔｗ２Ｂ＜Ｗ：視差範囲幅の広い立体画像
このように視差範囲幅の狭い立体画像及び視差範囲幅の中程度の立体画像を分類するため、例えばＴｗ１Ｂとしては前記の最大視差範囲幅Ｗｍａｘの１０％の値を、Ｔｗ２ＢとしてはＷｍａｘの５０％の値を設定する。 The parallax range width W is classified as follows using threshold values Tw1B and Tw2B (Tw1B <Tw2B) for determination.
W <Tw1B: Stereo image with a narrow parallax range width Tw1B ≦ W ≦ Tw2B: Stereo image with a medium parallax range width Tw2B <W: Stereo image with a wide parallax range width In order to classify medium images with a medium parallax range width, for example, a value of 10% of the maximum parallax range width Wmax is set as Tw1B, and a value of 50% of Wmax is set as Tw2B.

近景側度数総和Ｓについては、判定のための閾値Ｔｓ１、Ｔｓ２（０≦Ｔｓ１＜Ｔｓ２）を用い、以下のように分類する。
・Ｓ＜Ｔｓ１：近景側に被写体がほとんど存在しない立体画像
・Ｔｓ１≦Ｓ≦Ｔｓ２：近景側に小さな被写体が存在する立体画像
・Ｔｓ２＜Ｓ：近景側に中程度以上の被写体が存在する立体画像
このように近景側に小さな被写体が存在するか否かを判定するために、閾値Ｔｓ１、Ｔｓ２はいずれも小さな値を設定する。例えば、Ｔｓ１は視差ヒストグラムの全度数の５％、Ｔｓ２は視差ヒストグラムの全度数の１０％のように設定する。 The near-field-side frequency sum S is classified as follows using threshold values Ts1 and Ts2 (0 ≦ Ts1 <Ts2) for determination.
S <Ts1: Stereo image with almost no subject on the foreground side Ts1 ≦ S ≦ Ts2: Stereo image with a small subject on the foreground side Ts2 <S: Stereo image with a medium or larger subject on the foreground side Thus, in order to determine whether or not there is a small subject on the foreground side, the threshold values Ts1 and Ts2 are both set to small values. For example, Ts1 is set to 5% of the total frequency of the parallax histogram, and Ts2 is set to 10% of the total frequency of the parallax histogram.

図１２に、上記分類処理による９種類のシーンの典型的な視差ヒストグラムの例を一覧表にして示す。行方向に視差範囲幅Ｗによる３つの分類を、列方向に近景側度数総和Ｓによる３つの分類を記載し、そのマトリックスで合計９種類の分類を示している。各視差ヒストグラムの横軸は視差量、縦軸は度数であり、中央の細い縦線は、視差０を表す。また、各マスの点線以下の部分に、後述の画像補正処理の種類を記載している。 FIG. 12 shows a list of examples of typical parallax histograms of nine types of scenes obtained by the above classification process. Three classifications according to the parallax range width W are described in the row direction, and three classifications according to the foreground-side frequency sum S are described in the column direction, and a total of nine classifications are shown in the matrix. The horizontal axis of each parallax histogram is the parallax amount, the vertical axis is the frequency, and the thin vertical line in the center represents parallax 0. In addition, the types of image correction processing to be described later are described in portions below the dotted line of each square.

まず視差範囲幅Ｗによる３つの分類から説明する。図１２の左側の列の視差ヒストグラムは、Ｗ＜Ｔｗ１Ｂを満たし、視差範囲幅の狭い立体画像と分類されたものである。それぞれ視差ヒストグラムの概形は異なるが、いずれも視差範囲幅が狭い。図１２の中央の列の視差ヒストグラムは、Ｔｗ１Ｂ≦Ｗ≦Ｔｗ２Ｂを満たし、視差範囲幅が中程度の立体画像と分類されたものである。それぞれ視差ヒストグラムの概形は異なるが、いずれも視差範囲幅は中程度である。図１２の右側の列の視差ヒストグラムは、Ｔｗ２Ｂ＜Ｗを満たし、視差範囲幅の広い立体画像と分類されたものである。それぞれ視差ヒストグラムの概形は異なるが、いずれも視差範囲幅は広い。 First, the three classifications based on the parallax range width W will be described. The parallax histogram in the left column of FIG. 12 is classified as a stereoscopic image that satisfies W <Tw1B and has a narrow parallax range width. Although the outlines of the parallax histograms are different, the parallax range width is narrow. The parallax histogram in the center column in FIG. 12 is classified as a stereoscopic image that satisfies Tw1B ≦ W ≦ Tw2B and has a medium parallax range width. Although the outlines of the parallax histograms are different from each other, the parallax range width is medium in all cases. The parallax histogram in the right column of FIG. 12 is classified as a stereoscopic image that satisfies Tw2B <W and has a wide parallax range. Although the outlines of the parallax histograms are different, the parallax range is wide.

次に近景側度数総和Ｓによる３つの分類について説明する。図１２の上側の行の視差ヒストグラムは、Ｓ＜Ｔｓ１を満たし、近景側に被写体が存在しない立体画像と分類されたものである。それぞれ視差ヒストグラムの幅は異なるが、いずれも視差０の線より左側には視差ヒストグラムの度数がほとんどない。図１２の中央の行の視差ヒストグラムは、Ｔｓ１≦Ｓ≦Ｔｓ２を満たし、近景側に比較的小さな被写体が存在する立体画像と分類されたものである。それぞれ視差ヒストグラムの幅は異なるが、いずれも視差０の線より左側に視差ヒストグラム少し存在する。図１２の下側の行の視差ヒストグラムは、Ｔｓ２＜Ｓを満たし、近景側に中程度以上の被写体が存在する立体画像と分類されたものである。それぞれ視差ヒストグラムの幅は異なるが、いずれも視差０の線より左側に視差ヒストグラムの多くの部分が存在する。 Next, three classifications based on the foreground-side frequency sum S will be described. The parallax histogram in the upper row in FIG. 12 is classified as a stereoscopic image that satisfies S <Ts1 and has no subject on the near view side. Although the parallax histograms have different widths, there is almost no frequency of the parallax histogram on the left side of the parallax 0 line. The parallax histogram in the center row of FIG. 12 is classified as a stereoscopic image that satisfies Ts1 ≦ S ≦ Ts2 and has a relatively small subject on the foreground side. Although the widths of the parallax histograms are different from each other, there are a few parallax histograms on the left side of the parallax line. The parallax histogram in the lower row of FIG. 12 is classified as a stereoscopic image that satisfies Ts2 <S and has a medium or higher subject on the near view side. Although the widths of the parallax histograms are different from each other, many portions of the parallax histogram exist on the left side of the parallax line.

画像補正部７Ｂは、上記のように９種類のシーンに分類された立体画像分類結果に従い、所定のシーンに対して画像補正処理を行う。補正手段は第１の実施例と同じく、左右の画像をそれぞれシフトさせることによる視差調整方法を用いる。 The image correction unit 7B performs image correction processing on a predetermined scene according to the stereoscopic image classification results classified into nine types of scenes as described above. As in the first embodiment, the correction means uses a parallax adjustment method by shifting the left and right images.

立体画像が、視差範囲幅が狭いシーンと分類された場合、すなわち図１２における「処理Ａ」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｂは左目用画像を左にずらし右目用画像を右にずらすシフト処理を行い、視差を画面全体で一様に増加させる。このシフト処理は上記で図７を用いて説明したように、奥行の幅を広げてより立体感を強調する効果がある。このシフト処理における調整量は、第１の実施例と同じ方法により決定することができる。 When the stereoscopic image is classified as a scene having a narrow parallax range width, that is, when the stereoscopic image is classified into one of the squares described as “Processing A” in FIG. 12, the image correction unit 7B shifts the left-eye image to the left. A shift process for shifting the right-eye image to the right is performed, and the parallax is increased uniformly over the entire screen. As described above with reference to FIG. 7, this shift process has the effect of widening the depth and enhancing the stereoscopic effect. The adjustment amount in this shift process can be determined by the same method as in the first embodiment.

立体画像が、視差範囲幅が中程度でかつ近景側に比較的小さな被写体が存在するシーンと分類された場合、すなわち図１２における「処理Ｂ」と記載のあるマスに分類された場合は、画像補正部７Ｂは左目用画像を右にずらし右目用画像を左にずらすシフト処理を行い、視差を画面全体で一様に減少させる。このシフト処理は上記で図８を用いて説明したように、画像中の被写体全体がより近景側に移動して飛び出して見える効果がある。 When the stereoscopic image is classified as a scene having a medium parallax range width and a relatively small subject on the foreground side, that is, when the stereoscopic image is classified into a square described as “Processing B” in FIG. The correction unit 7B performs a shift process of shifting the left-eye image to the right and the right-eye image to the left, and reduces the parallax uniformly over the entire screen. As described above with reference to FIG. 8, this shift processing has an effect that the entire subject in the image appears to move out to the near side.

ここで、視差範囲幅が中程度であることを条件としているのは、以下の理由による。前記の通り、立体画像において被写体が画像の周辺部分にかかっており、その被写体が画面より手前に知覚されるような視差量を有する場合、その視差量にも関わらず手前に飛び出して見え難いという現象がある。ここで、視差範囲幅が狭い立体画像に対して被写体を近景側に飛び出させる調整を行うと、被写体の全部、あるいは大部分が近景側に飛び出すことになり、飛び出した被写体が画像の周辺部にかかる可能性が高くなる。よって、被写体を飛び出させる画像補正処理を行ってもその効果が得られない可能性が高くなる。一方、視差範囲幅が広い場合は、そのままでも十分立体感が感じられるので、画像補正処理を行わなくても良い。以上の理由により視差範囲幅が中程度であることを条件としている。 Here, the condition that the parallax range width is medium is based on the following reason. As described above, in a stereoscopic image, when a subject covers the peripheral portion of the image and has a parallax amount that can be perceived in front of the screen, it is difficult to see the subject popping out in front of the screen regardless of the amount of parallax. There is a phenomenon. Here, if adjustment is performed to project the subject to the foreground side with respect to a stereoscopic image with a narrow parallax range width, all or most of the subject will be projected to the foreground side, and the projected subject will appear at the periphery of the image. This possibility increases. Therefore, there is a high possibility that the effect cannot be obtained even if image correction processing for projecting the subject is performed. On the other hand, when the parallax range width is wide, a stereoscopic effect can be sufficiently felt as it is, and therefore image correction processing need not be performed. For the above reason, the condition is that the parallax range width is medium.

また、近景側に比較的小さな被写体が存在することを条件としているのは、あまり大きな被写体を飛び出させて見えるようにすると、圧迫感が有り、かえって見難くなる事があるためである。また、被写体が大きいほど画像の周辺部分にかかる可能性が高く、被写体をより手前に飛び出させて迫力を増すという効果が得られない可能性が高いためである。 In addition, the reason that there is a relatively small subject on the near view side is that if a very large subject is projected and seen, there is a feeling of pressure and it may be difficult to see. Also, the larger the subject, the higher the possibility that it will be applied to the peripheral part of the image, and there is a high possibility that the effect of increasing the force by popping the subject further forward will not be obtained.

すなわち以上の処理は、被写体を飛び出させる画像補正処理の効果があると思われるシーンに対して選択的に当該画像補正処理を行うものである。 That is, the above processing selectively performs image correction processing on a scene that is considered to have an effect of image correction processing for projecting a subject.

このシフト処理における調整量は、例えば以下のような方法で定めることができる。 The adjustment amount in this shift process can be determined by the following method, for example.

図１３は、被写体が近景方向に移動するシフト処理の処理前後の視差ヒストグラムを示したものである。（ａ）がシフト処理前の視差ヒストグラム、（ｂ）がシフト処理後の視差ヒストグラムである。各視差ヒストグラムで網掛けされている部分は、視差ヒストグラムにおける近景側部分である。近景方向へ移動させるためシフト処理を行うことで、視差ヒストグラムは全体として左方向に移動し、シフト処理後の近景側度数総和の値も変わる。シフト処理後に被写体が画面より余りに飛び出すとかえって見難くなる。そこで、シフト処理後の近景側度数総和をＳ’、所定の閾値をＴｓ３として、Ｓ’＝Ｔｓ３となるシフト量を求め、これを調整量ａとすることで、被写体が飛び出すことによる迫力の増加と、見難さの増加とのバランスを取ることができる。 FIG. 13 shows a parallax histogram before and after the shift process in which the subject moves in the foreground direction. (A) is a parallax histogram before the shift process, and (b) is a parallax histogram after the shift process. The shaded portion in each parallax histogram is the near view side portion in the parallax histogram. By performing the shift process to move in the foreground direction, the parallax histogram moves to the left as a whole, and the value of the foreground side power sum after the shift process also changes. If the subject jumps out of the screen too much after the shift process, it will be difficult to see. Therefore, assuming that the foreground frequency sum after shift processing is S ′ and the predetermined threshold is Ts3, a shift amount where S ′ = Ts3 is obtained, and this is set as an adjustment amount a, thereby increasing the force due to the subject popping out. And the increase in difficulty of viewing.

Ｔｓ３としては、例えば視差ヒストグラムの全度数の５０％と定めることができる。Ｓ’が５０％の時、シフト処理後の画像中の被写体の半分は画面より手前に飛び出し、残り半分が画面より奥にあることになり、近景側および遠景側の被写体がバランスよく分布するためである。 Ts3 can be set to 50% of the total frequency of the parallax histogram, for example. When S ′ is 50%, half of the subjects in the image after the shift processing will pop out from the screen and the other half will be behind the screen, so that the foreground and far side subjects are distributed in a well-balanced manner. It is.

立体画像がその他の分類をされた場合、すなわち図１２において「−」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｂは画像補正処理を行わない。これらはいずれも視差範囲幅が中程度以上であり、特に画像補正処理を行わなくても十分立体感が感じられ、また上記の図１２の「処理Ｂ」の場合のようにさらに立体感を強調できる条件でもないためである。 When the stereoscopic image is classified in other manners, that is, when the stereoscopic image is classified into any of the squares marked with “−” in FIG. 12, the image correction unit 7B does not perform the image correction process. Each of these has a medium or larger parallax range width, and a sufficient stereoscopic effect can be felt without performing image correction processing. Further, as in the case of “Processing B” in FIG. 12, the stereoscopic effect is further enhanced. This is because it is not a possible condition.

以上のように、本実施例における立体画像処理装置１Ｂは、視差範囲幅Ｗ及び近景側度数総和Ｓという２種類の視差特徴量を用いて立体画像を９種類のシーンに分類している。これにより、立体画像の視差情報についてより多様な観点から、より細かく精度の高い分類を可能とする、という効果を有する。 As described above, the stereoscopic image processing apparatus 1B according to the present embodiment classifies a stereoscopic image into nine types of scenes using two types of parallax feature amounts, that is, the parallax range width W and the foreground-side power sum S. This has the effect of enabling more detailed and highly accurate classification of the parallax information of the stereoscopic image from various viewpoints.

また本実施例における立体画像処理装置１Ｂは、より細かく精度の高い分類を可能となったことから、近景側に比較的小さな被写体が存在する立体画像に対して、その被写体をより手前に飛び出させて迫力を増す画像補正処理を行うことがさらに可能となった。これにより、第１の実施例に比べてより多様な画像補正処理を、シーンの内容に応じて適切に行うことができるという効果を有する。また、立体感の強調処理の対象となるシーンがより多くなるという効果を有する。 In addition, since the stereoscopic image processing apparatus 1B according to the present embodiment enables finer and more accurate classification, the stereoscopic image processing apparatus 1B causes the subject to jump out to the near side with respect to the stereoscopic image in which a relatively small subject exists on the foreground side. This makes it possible to perform image correction processing that increases the power. Accordingly, there is an effect that more various image correction processes can be appropriately performed in accordance with the contents of the scene as compared with the first embodiment. Further, there is an effect that the number of scenes to be subjected to the stereoscopic effect enhancement process is increased.

本発明の第３の実施例は、立体画像処理装置であって、第２の実施例で説明した２種類の画像補正処理に加えて、画像の周辺部に近景側に飛び出した被写体が有るために立体感が感じられ難いシーンに対し、被写体を奥に移動させて立体感を感じやすくすることで、相対的に立体感を強調する画像補正処理を行うことを目的としている。このために第２の実施例と比べて、視差特徴量としてさらに、画像の周辺部において近景側に飛び出した被写体が画像周辺部に占める割合を示す量を用いてシーン分類を行う。またこの新たな視差特徴量を用いることで、第２の実施例で説明した被写体を手前に飛び出させて立体感を強調する画像補正処理も、処理対象とするシーンをより的確に分類することができる。 The third embodiment of the present invention is a stereoscopic image processing apparatus, and in addition to the two types of image correction processing described in the second embodiment, there is a subject that protrudes to the foreground side at the periphery of the image. The object of the present invention is to perform image correction processing that relatively enhances the stereoscopic effect by moving the subject to the back to make it easier to feel the stereoscopic effect in a scene where it is difficult to feel the stereoscopic effect. For this reason, as compared with the second embodiment, scene classification is performed using a parallax feature amount that further indicates the proportion of the subject that protrudes to the near view side in the peripheral portion of the image in the peripheral portion of the image. In addition, by using this new parallax feature amount, the image correction processing for emphasizing the stereoscopic effect by popping out the subject described in the second embodiment can more accurately classify scenes to be processed. it can.

図１４は、第３の実施例にかかる立体画像処理装置１Ｃの構成を示すブロック図である。第２の実施例に係る立体画像処理装置１Ｂとは、視差特徴量算出部４Ｃ、立体画像分類部５Ｃ、画像補正部７Ｃが異なる。同一の部分に関する説明は省略し、異なる部分を中心に説明する。 FIG. 14 is a block diagram illustrating a configuration of a stereoscopic image processing apparatus 1C according to the third embodiment. The stereoscopic image processing apparatus 1B according to the second embodiment is different in a parallax feature amount calculation unit 4C, a stereoscopic image classification unit 5C, and an image correction unit 7C. The description about the same part is abbreviate | omitted, and it demonstrates centering on a different part.

図１５は、視差特徴量算出部４Ｃの構成を示すブロック図である。視差特徴量算出部４Ｂと比較すると、画像周辺部視差ヒストグラム生成部４５と画像周辺部近景側度数総和算出部４６が追加されている。 FIG. 15 is a block diagram illustrating a configuration of the parallax feature amount calculation unit 4C. Compared with the parallax feature amount calculation unit 4B, an image peripheral part parallax histogram generation unit 45 and an image peripheral part near view side frequency total calculation unit 46 are added.

画像周辺部視差ヒストグラム生成部４５は、視差算出部４１で求めた画像全域の視差のデータのうち、画面の周辺部分の視差のデータのみを用いて、その度数分布である画像周辺部視差ヒストグラムを作成する。あるいは、画像周辺部視差ヒストグラムの代用として画像周辺部の奥行きをヒストグラムにしたものでも良い。すなわち立体画像に表現されている表示物の視差またはそれと同等の量の度数分布が表されているものであれば良い。画像周辺部視差ヒストグラム生成部４５は、画像周辺部視差ヒストグラムを画像周辺部近景側度数総和算出部４６に送る。 The image peripheral part parallax histogram generating unit 45 uses only the parallax data of the peripheral part of the screen among the parallax data of the entire image obtained by the parallax calculating unit 41, and calculates the image peripheral part parallax histogram having the frequency distribution. create. Alternatively, an image peripheral depth may be a histogram instead of the image peripheral parallax histogram. In other words, any display may be used as long as the parallax of the display object represented in the stereoscopic image or a frequency distribution of the same amount is represented. The image peripheral part parallax histogram generating unit 45 sends the image peripheral part parallax histogram to the image peripheral part foreground side frequency sum total calculating unit 46.

図１６は、画像の周辺部分を説明するものである。網掛け部分が、画像の周辺部分である。例えば、横１９２０ピクセル、縦１０８０ピクセルの画像において、画像の周辺の幅１００ピクセルの部分を、画像の周辺部分として用いることができる。 FIG. 16 illustrates the peripheral portion of an image. The shaded portion is the peripheral portion of the image. For example, in an image having a horizontal size of 1920 pixels and a vertical length of 1080 pixels, a portion having a width of 100 pixels around the periphery of the image can be used as the peripheral portion of the image.

画像周辺部近景側度数総和算出部４６は、画像周辺部視差ヒストグラム生成部４５から入力した画像周辺部視差ヒストグラムを用いて、その近景側の度数の総和である画像周辺部近景側度数総和Ｓｅを求め、これを視差特徴量として立体画像分類部５Ｃに出力する。その算出方法は、前記の近景側度数総和Ｓの算出方法と同様である。 The image peripheral portion foreground side frequency sum total calculation unit 46 uses the image peripheral portion parallax histogram input from the image peripheral portion parallax histogram generation unit 45 to calculate the image peripheral portion foreground side frequency total sum Se, which is the sum of the frequencies on the foreground side. This is obtained and output to the stereoscopic image classification unit 5C as a parallax feature amount. The calculation method is the same as the calculation method of the foreground side power sum S described above.

一方、視差範囲幅算出部４３及び近景側度数総和算出部４４は、前記の通りそれぞれ視差範囲幅Ｗと近景側度数総和Ｓを視差特徴量として出力する。従って、視差特徴量算出部４Ｃは視差特徴量として、視差範囲幅Ｗ、近景側度数総和Ｓ、画像周辺部近景側度数総和Ｓｅの３つの情報を立体画像分類部５Ｃに出力する。 On the other hand, the parallax range width calculation unit 43 and the foreground side power total calculation unit 44 output the parallax range width W and the foreground side power total S as the parallax feature amounts as described above. Therefore, the parallax feature amount calculation unit 4C outputs, as the parallax feature amount, three pieces of information of the parallax range width W, the foreground side power sum S, and the image peripheral portion foreground side power total Se to the stereoscopic image classification unit 5C.

立体画像分類部５Ｃは、視差特徴量である視差範囲幅Ｗ、近景側度数総和Ｓ、画像周辺部近景側度数総和Ｓｅを用いて立体画像を分類する。本実施例では、視差範囲幅Ｗの大きさで３種類に、近景側度数総和Ｓの大きさで３種類に、画像周辺部近景側度数総和Ｓｅの大きさで２種類にそれぞれ分類する。従って、トータルでは３×３×２＝１８種類に分類できる。この分類結果を画像補正部７Ｃに出力する。 The stereoscopic image classifying unit 5C classifies the stereoscopic image using the parallax range width W, which is a parallax feature amount, the foreground side power sum S, and the image peripheral portion foreground side power Se. In the present embodiment, the parallax range width W is classified into three types, the foreground side power sum total S is classified into three types, and the image peripheral portion near view side power sum Se is classified into two types. Therefore, it can be classified into 3 × 3 × 2 = 18 types in total. The classification result is output to the image correction unit 7C.

視差範囲幅Ｗ及び近景側度数総和Ｓを用いた分類の方法は、第２の実施例に記載の方法と同様である。 The classification method using the parallax range width W and the foreground-side power sum S is the same as the method described in the second embodiment.

画像周辺部近景側度数総和Ｓｅについては、判定のための閾値Ｔｓｅ１を用い、以下のように分類する。
・Ｓｅ≦Ｔｓｅ１：画像周辺部において近景側に被写体が存在しない立体画像
・Ｓｅ＞Ｔｓｅ１：画像周辺部において近景側に被写体が存在する立体画像
このように画像周辺部において近景側に被写体が存在するか否かを判定するために、閾値Ｔｓｅ１は小さな値を設定する。例えばＴｓｅ１＝０と設定するのが最も厳しい条件設定であり、画像周辺部ヒストグラムが近景部分に少しでも存在していれば、画像周辺部において近景側には被写体が存在すると判定することになる。条件を緩めるには、Ｔｓｅ１を０より大きな値、例えばＴｓｅ１を画像周辺部視差ヒストグラムの全度数の２％と設定すれば良い。 The image peripheral portion near view side frequency total Se is classified as follows using a threshold value Tse1 for determination.
Se ≦ Tse1: A stereoscopic image in which no subject is present on the foreground side in the peripheral portion of the image. Se> Tse1: A stereoscopic image in which a subject is present on the foreground side in the peripheral portion of the image. In order to determine whether or not, the threshold value Tse1 is set to a small value. For example, setting Tse1 = 0 is the strictest condition setting. If the image peripheral portion histogram exists even in the foreground portion, it is determined that the subject exists in the foreground side in the image peripheral portion. In order to relax the condition, Tse1 may be set to a value larger than 0, for example, Tse1 may be set to 2% of the total frequency of the image peripheral parallax histogram.

画像補正部７Ｃは、上記のように１８種類のシーンに分類された立体画像分類結果に従い、所定のシーンに対して画像補正処理を行う。補正手段は第１及び第２の実施例と同じく、左右の画像をそれぞれシフトさせることによる視差調整方法を用いる。 The image correction unit 7C performs image correction processing on a predetermined scene according to the stereoscopic image classification results classified into 18 types of scenes as described above. As in the first and second embodiments, the correction means uses a parallax adjustment method by shifting the left and right images.

図１７は、上記分類処理による１８種類のシーンを表に示したものである。３種類の視差関連情報のうち、画像周辺部近景側度数総和Ｓｅによる２種類の分類については、表を（ａ）と（ｂ）に分けて示している。各マスには後述の画像補正処理の種類を記載している。 FIG. 17 is a table showing 18 types of scenes obtained by the above classification process. Of the three types of parallax-related information, the table is divided into (a) and (b) for two types of classification based on the image peripheral portion near view side frequency sum Se. Each square describes the type of image correction processing described later.

立体画像が、視差範囲幅が狭いシーンと分類された場合、すなわち図１７における「処理Ａ」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｃは左目用画像を左にずらし右目用画像を右にずらすシフト処理を行い、視差を画面全体で一様に増加させる。このシフト処理は上記で図７を用いて説明したように、奥行の幅を広げてより立体感を強調する効果がある。このシフト処理における調整量は、第１の実施例と同じ方法により決定することができる。 When the stereoscopic image is classified as a scene with a narrow parallax range width, that is, when the stereoscopic image is classified into one of the squares described as “Processing A” in FIG. 17, the image correction unit 7 </ b> C shifts the left-eye image to the left. A shift process for shifting the right-eye image to the right is performed, and the parallax is increased uniformly over the entire screen. As described above with reference to FIG. 7, this shift process has the effect of widening the depth and enhancing the stereoscopic effect. The adjustment amount in this shift process can be determined by the same method as in the first embodiment.

立体画像が、視差範囲幅が中程度で、かつ近景側に比較的小さな被写体が存在し、かつ画像周辺部において近景側に被写体が存在しないシーンと分類された場合、すなわち図１７における「処理Ｂ」と記載のあるマスに分類された場合は、画像補正部７Ｃは左目用画像を右にずらし右目用画像を左にずらすシフト処理を行い、視差を画面全体で一様に減少させる。このシフト処理は上記で図８を用いて説明したように、画像中の被写体全体がより近景側に移動して飛び出して見える効果がある。 When the stereoscopic image is classified as a scene having a medium parallax range width, a relatively small subject on the foreground side, and no subject on the foreground side in the periphery of the image, that is, “Process B” in FIG. In the case where the image correction unit 7C is classified into the squares with "", the image correction unit 7C shifts the left-eye image to the right and shifts the right-eye image to the left to uniformly reduce the parallax over the entire screen. As described above with reference to FIG. 8, this shift processing has an effect that the entire subject in the image appears to move out to the near side.

ここで、視差範囲幅が中程度であること及び近景側に比較的小さな被写体が存在することを条件としているのは、第２の実施例で説明した図１２の「処理Ｂ」の場合と同じ理由による。また、画像周辺部において近景側に被写体が存在しないことを条件としているのは、前記の通り、立体画像において被写体が画像の周辺部分にかかっており、その被写体が画面より手前に知覚されるような視差量を有する場合、その視差量にも関わらず手前に飛び出して見え難いという現象があり、このような状況が発生しない場合にのみ被写体を手前に飛び出させる画像補正処理を行うためである。このように、第２の実施例における図１２の「処理Ｂ」の場合と比べて、画像周辺部において近景側に被写体が存在しないという条件をさらに追加することで、被写体を手前に飛び出させる画像補正処理を行うことが適切なシーンをさらに精度高く分類することが可能となる。 Here, the condition that the parallax range width is medium and that there is a relatively small subject on the foreground side is the same as in the case of “Process B” in FIG. 12 described in the second embodiment. Depending on the reason. Also, the condition that the subject does not exist on the foreground side in the peripheral portion of the image is that, as described above, the subject covers the peripheral portion of the image in the stereoscopic image, and the subject is perceived in front of the screen. This is because, when there is a large amount of parallax, there is a phenomenon that it is difficult to see the image by popping forward, regardless of the amount of parallax, and image correction processing is performed to make the subject pop out toward the front only when such a situation does not occur. In this way, compared to the case of “Processing B” in FIG. 12 in the second embodiment, an image that causes the subject to pop out toward the foreground by adding a condition that the subject does not exist on the foreground side in the peripheral portion of the image. It is possible to classify scenes suitable for performing the correction process with higher accuracy.

このシフト処理における調整量は、例えば以下のような方法で定めることができる。近景方向へ移動させるためシフト処理を行うことで、視差ヒストグラムは全体として左方向に移動し、シフト処理後の近景側度数総和の値や画像周辺部近景側度数総和の値も変わる。シフト処理後に被写体が画面より余りに飛び出すとかえって見難くなる。また、シフト処理後に画面周辺部において被写体が画面より手前に位置するようになると、やはり見難くなる。そこで、シフト処理後の近景側度数総和の値及びシフト処理後の画像周辺部近景側度数総和の値のそれぞれに関する条件により、調整量を決定する。 The adjustment amount in this shift process can be determined by the following method, for example. By performing the shift process to move in the foreground direction, the parallax histogram is moved in the left direction as a whole, and the value of the foreground side power sum and the value of the image peripheral portion foreground side power sum after the shift process also change. If the subject jumps out of the screen too much after the shift process, it will be difficult to see. Further, if the subject is positioned in front of the screen at the periphery of the screen after the shift process, it is difficult to see. Therefore, the amount of adjustment is determined according to the conditions relating to the value of the foreground-side power sum total after the shift process and the value of the image periphery near-field side power sum total after the shift process.

シフト処理後の近景側度数総和に関しては、第２の実施例において図１２の「処理Ｂ」と分類された場合と同様に、閾値Ｔｓ３を用いた処理により調整量を定めることができる。こうして求めた調整量をａ１とする。一方、シフト処理後の画像周辺部近景側度数総和の値に関しては、シフト処理後の画像周辺部近景側度数総和をＳｅ’、所定の閾値をＴｓｅ２として、Ｓｅ’＝Ｔｓｅ２となるシフト量を求め、これを調整量ａ２とする。Ｔｓｅ２は、Ｔｓｅ１と同様小さな値を用いる。最終的な調整量ａは、シフト処理後の近景側度数総和の値に関する上記の条件と画像周辺部近景側度数総和の値に関する上記の条件の両方を満たす必要がある。そこで、ａ１とａ２のうち小さいほうの値を調整量ａとして採用する。 As for the foreground frequency summation after the shift processing, the adjustment amount can be determined by the processing using the threshold value Ts3, as in the case of the classification as “processing B” in FIG. 12 in the second embodiment. The adjustment amount thus obtained is set as a1. On the other hand, with respect to the value of the image peripheral portion foreground side frequency sum after the shift processing, Se ′ = Tse2 is obtained as a shift amount, where Se ′ is the image peripheral portion foreground side frequency sum after the shift processing and the predetermined threshold is Tse2. This is the adjustment amount a2. Tse2 uses a small value like Tse1. The final adjustment amount a needs to satisfy both the above-described condition relating to the value of the foreground side power sum after the shift process and the above-described condition relating to the value of the image peripheral portion near-field side power sum. Therefore, the smaller value of a1 and a2 is adopted as the adjustment amount a.

立体画像が、視差範囲幅が中程度あるいは広くて、かつ画像周辺部において近景側に被写体が存在するシーンと分類された場合、すなわち図１７における「処理Ｃ」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｃは左目用画像を左にずらし右目用画像を右にずらすシフト処理を行い、視差を画面全体で一様に増加させる。このシフト処理は上記で図７を用いて説明したように、被写体はより遠景にあるように見える効果がある。この処理は、上記の図１７の「処理Ａ」と記載のあるマスのいずれかに分類された場合と類似するが、調整量が異なる。この場合は、画像周辺部において近景側に存在する被写体を画面より奥に移動させるのに必要な調整量で、シフト処理を行う。この処理により、少なくとも画像周辺部には画面より手前に位置する被写体は存在しなくなる。従って、前記の被写体が画像の周辺部分にかかっているとその被写体が手前に位置するものであっても手前に飛び出して見え難くなるという現象は発生せず、立体感が感じられやすくなる、という効果が得られる。言い換えれば、相対的に立体感を強調する効果が得られる。 When the stereoscopic image is classified as a scene having a medium or wide parallax range width and a subject on the foreground side in the periphery of the image, that is, one of the squares described as “Processing C” in FIG. When the images are classified, the image correcting unit 7C performs a shift process for shifting the left-eye image to the left and the right-eye image to the right to increase the parallax uniformly over the entire screen. As described above with reference to FIG. 7, this shift process has an effect that the subject appears to be in a distant view. This process is similar to the case where it is classified into one of the squares described as “Process A” in FIG. 17, but the adjustment amount is different. In this case, the shift process is performed with an adjustment amount necessary to move the subject existing on the near view side in the image periphery to the back of the screen. By this processing, at least the subject located in front of the screen does not exist at the periphery of the image. Therefore, if the subject is on the peripheral part of the image, even if the subject is located in front, the phenomenon of jumping out to the front and making it difficult to see does not occur, and it is easy to feel a stereoscopic effect An effect is obtained. In other words, an effect of relatively enhancing the stereoscopic effect can be obtained.

ここで、画像周辺部において近景側に被写体が存在することを条件としているのは、前記の通り立体画像において被写体が画像の周辺部分にかかっており、その被写体が画面より手前に知覚されるような視差量を有する場合、その視差量にも関わらず手前に飛び出して見え難という現象があるため、これを発生しないようにする画像補正処理を施すためである。また、視差範囲幅が中程度あるいは広いことを条件としているのは、視差範囲幅が狭い立体画像はそもそも立体感が乏しいために、上記の奥行の幅を広げる画像補正処理（処理Ａ）を優先して施すためである。 Here, the condition that the subject exists on the foreground side in the peripheral portion of the image is that the subject is in the peripheral portion of the image in the stereoscopic image as described above, and the subject is perceived in front of the screen. This is because when there is a large amount of parallax, there is a phenomenon in which it is difficult to see the image by popping forward, regardless of the amount of parallax, so that image correction processing is performed to prevent this phenomenon from occurring. Also, the reason that the parallax range width is medium or wide is that a stereoscopic image with a narrow parallax range width has a poor stereoscopic effect in the first place. Therefore, priority is given to the image correction processing (processing A) that widens the depth. This is to apply.

このシフト処理における調整量は、例えば以下のような方法で定めることができる。シフト処理後に画像周辺部において近景側に被写体が存在しないようになればよい事から、所定の閾値をＴｓｅ３として、シフト処理後の画像周辺部近景側度数総和Ｓｅ’＝Ｔｓｅ３となるシフト量を求め、調整量ａとする。Ｔｓｅ３は、Ｔｓｅ１と同様小さな値を用いる。 The adjustment amount in this shift process can be determined by the following method, for example. Since it is sufficient that the subject does not exist on the foreground side in the image periphery after the shift process, a predetermined threshold value is set as Tse3, and the shift amount at which the image periphery near view side frequency sum Se ′ = Tse3 after the shift process is obtained. The adjustment amount is a. Tse3 uses a small value like Tse1.

立体画像がその他の分類をされた場合、すなわち図１７において「−」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｃは画像補正処理を行わない。これらはいずれも視差範囲幅が中程度以上であり、特に画像補正処理を行わなくても十分立体感が感じられる。また、上記図１７の「処理Ｂ」の場合のように、さらに立体感を強調できる条件でもない。また、上記図１７の「処理Ｃ」の場合のように、画像周辺部において近景側に存在する被写体がありこれが立体感を阻害する、ということも無いためである。 When the stereoscopic image is classified in other ways, that is, when it is classified into one of the squares marked with “−” in FIG. 17, the image correction unit 7C does not perform the image correction process. All of these have a medium or larger parallax range width, and a sufficient stereoscopic effect can be felt without performing image correction processing. Further, as in the case of “Processing B” in FIG. Further, as in the case of “Processing C” in FIG. 17, there is a subject that exists in the foreground side at the periphery of the image, and this does not impede the stereoscopic effect.

以上のように、本実施例における立体画像処理装置１Ｃは、視差範囲幅Ｗ、近景側度数総和Ｓ、画像周辺部近景側度数総和Ｓｅという３種類の視差特徴量を用いて立体画像を１８種類のシーンに分類している。これにより、第２の実施例に比べてさらに細かく精度の高い分類を可能とする、という効果を有する。 As described above, the stereoscopic image processing apparatus 1 </ b> C according to the present embodiment uses 18 types of stereoscopic images using the three types of parallax feature amounts of the parallax range width W, the foreground side power sum S, and the image peripheral portion foreground side power total Se. It is classified into the scene. This has the effect of enabling finer and more accurate classification than in the second embodiment.

また本実施例における立体画像処理装置１Ｃは、さらに細かく精度の高い分類が可能になったことから、第２の実施例で説明した被写体を飛び出させて立体感を強調する画像補正処理の対象となるシーンを、より的確に分類することが可能となるという効果を有する。 In addition, since the stereoscopic image processing apparatus 1C according to the present embodiment enables finer and more accurate classification, it is possible to perform image correction processing that emphasizes the stereoscopic effect by popping out the subject described in the second embodiment. There is an effect that the scene can be classified more accurately.

さらに、本実施例における立体画像処理装置１Ｃは、視差特徴量として画像周辺部近景側度数総和Ｓｅを用いることで、被写体が画像の周辺部分にかかっているか否かを分類することができるようになり、この分類を用いて、立体感を強調するためのさらに別の画像補正処理を施すことができるという効果を有する。すなわち、画像補正処理の適用対象となるシーンがさらに多くなるという効果を有する。 Furthermore, the stereoscopic image processing apparatus 1C according to the present embodiment can classify whether or not the subject covers the peripheral portion of the image by using the image peripheral portion near view side frequency sum Se as the parallax feature amount. Thus, using this classification, it is possible to perform another image correction process for enhancing the stereoscopic effect. That is, there is an effect that the number of scenes to which the image correction process is applied increases.

本発明の第４の実施例は、立体画像処理装置であって、第２の実施例で説明した被写体を手前に飛び出させて立体感を強調する画像補正処理について、当該補正処理が対象とするシーンをさらに的確に分類することができるものである。このために第２の実施例と比べて、視差特徴量としてさらに、近景側に存在する被写体が画像内に占める割合を示すもう一つの量を用いてシーン分類を行う。 The fourth embodiment of the present invention is a stereoscopic image processing apparatus, and the correction processing is targeted for the image correction processing that emphasizes the stereoscopic effect by popping out the subject described in the second embodiment. The scene can be classified more accurately. For this reason, as compared with the second embodiment, scene classification is performed using another amount indicating the proportion of the subject existing in the foreground side in the image as the parallax feature amount.

図１８は、第４の実施例にかかる立体画像処理装置１Ｄの構成を示すブロック図である。第２の実施例に係る立体画像処理装置１Ｂとは、視差特徴量算出部４Ｄ、立体画像分類部５Ｄ、画像補正部７Ｄが異なる。同一の部分に関する説明は省略し、異なる部分を中心に説明する。 FIG. 18 is a block diagram illustrating a configuration of a stereoscopic image processing apparatus 1D according to the fourth embodiment. The stereoscopic image processing apparatus 1B according to the second embodiment is different in a parallax feature amount calculation unit 4D, a stereoscopic image classification unit 5D, and an image correction unit 7D. The description about the same part is abbreviate | omitted, and it demonstrates centering on a different part.

図１９は、視差特徴量算出部４Ｄの構成を示すブロック図である。視差特徴量算出部４Ｂと比較すると、第二近景側度数総和算出部４７が追加されている。 FIG. 19 is a block diagram illustrating a configuration of the parallax feature amount calculation unit 4D. Compared with the parallax feature amount calculation unit 4B, a second foreground side frequency total calculation unit 47 is added.

近景側度数総和算出部４４は第２の実施例と同様、視差ヒストグラム作成部４２から入力した視差ヒストグラムにおいて、視差量が閾値Ｔｄ１以下のビンの度数の総和である近景側度数総和Ｓを求め、これを視差特徴量として立体画像分類部５Ｄに出力する。また、第二近景側度数総和算出部４７は、視差ヒストグラム作成部４２から入力した視差ヒストグラムにおいて、視差量が閾値Ｔｄ２以下のビンの度数の総和である第二近景側度数総和Ｓ２を求め、これを視差特徴量として立体画像分類部５Ｄに出力する。ここで、Ｔｄ２＜Ｔｄ１である。 In the disparity histogram input from the disparity histogram creation unit 42, the foreground-side frequency sum total calculation unit 44 obtains a foreground side frequency summation S that is the sum of the frequencies of bins whose parallax amount is equal to or less than the threshold Td1, as in the second embodiment. This is output to the stereoscopic image classification unit 5D as a parallax feature amount. Further, the second foreground side frequency sum total calculation unit 47 obtains a second foreground side frequency total sum S2 that is the sum of the frequencies of bins whose parallax amount is equal to or less than the threshold Td2 in the parallax histogram input from the parallax histogram creation unit 42. Is output to the stereoscopic image classification unit 5D as a parallax feature amount. Here, Td2 <Td1.

図２０は、近景側度数総和Ｓ及び第二近景側度数総和Ｓ２について説明するための視差ヒストグラムを示した図である。横軸は視差量、縦軸は度数である。視差量が０の位置を実線で示す。視差量０以上が遠景側、０未満が近景側としている。また、閾値Ｔｄ１とＴｄ２の位置の一例を点線で示す。図２０（ａ）と（ｂ）は、視差ヒストグラムの近景側部分の形が異なる。 FIG. 20 is a diagram showing a parallax histogram for explaining the near-field side power sum S and the second foreground-side power sum S2. The horizontal axis is the amount of parallax, and the vertical axis is the frequency. A position where the amount of parallax is 0 is indicated by a solid line. The disparity amount of 0 or more is the far view side, and less than 0 is the close view side. An example of the positions of the threshold values Td1 and Td2 is indicated by a dotted line. FIGS. 20A and 20B are different in the shape of the near view side portion of the parallax histogram.

図２０の各視差ヒストグラムにおいて、縦横に網掛けされている部分と横に網掛けされている部分の和が、近景側度数総和Ｓに相当する。視差ヒストグラムが視差量−ｌからｍ（ｌ、ｍは自然数）の範囲で作成されており、視差ヒストグラムの各ビンの幅を１、各ビンの度数をｈ_ｉ（−ｌ≦ｉ≦ｍ）とすると、近景側度数総和Ｓは前記の通り以下の式で求めることができる。
Ｓ＝Σｈ_ｉ（−ｌ≦ｉ≦Ｔｄ１）
一方、視差ヒストグラムにおいて縦横に網掛けされている部分が、第二近景側度数総和Ｓ２に相当する。これは以下の式で求めることができる。
Ｓ２＝Σｈ_ｉ（−ｌ≦ｉ≦Ｔｄ２）
さらに図２０を用いて、視差ヒストグラムの形の違いにより近景側度数総和Ｓ及び第二近景側度数総和Ｓ２がどのような値になるかを説明する。図２０において仮に、Ｔｄ１＝−５ピクセル、Ｔｄ２＝−１５ピクセルであるとする。また、図２０（ａ）についてのＳ及びＳ２の算出値をＳａ、Ｓ２ａとし、図２０（ｂ）についてのＳ及びＳ２の算出値をＳｂ、Ｓ２ｂとする。また、図２０（ａ）の視差ヒストグラムと（ｂ）の視差ヒストグラムは、視差量がＴｄ１より大きい部分の形は同じであるが、視差量がＴｄ１より小さい部分の形が異なり、（ｂ）の方がピークの高さが低いが、より近景側に広がっているものとする。さらに、視差ヒストグラムの全度数は等しいとする。以上の条件においては、Ｔｄ１より近景側の領域の度数の総和は（ａ）と（ｂ）とで等しくなる。すなわち、Ｓａ＝Ｓｂである。また、Ｔｄ２＜Ｔｄ１であることから、Ｓａ≧Ｓ２ａ、Ｓｂ≧Ｓ２ｂとなる。 In each parallax histogram of FIG. 20, the sum of the portion shaded vertically and horizontally and the portion shaded horizontally corresponds to the near-field-side power sum S. Parallax histogram m from the parallax amount -l (l, m are natural numbers) is written in a range of the width of each bin of the parallax histogram 1, and the frequency of each bin _{h i (-l ≦ i ≦ m} ) Then, the foreground side power sum S can be obtained by the following equation as described above.
S = Σh _i (−1 ≦ i ≦ Td1)
On the other hand, the part shaded vertically and horizontally in the parallax histogram corresponds to the second foreground side power sum S2. This can be obtained by the following equation.
S2 = Σh _i (−1 ≦ i ≦ Td2)
Furthermore, FIG. 20 will be used to explain the values of the foreground-side power sum S and the second foreground-side power sum S2 depending on the shape of the parallax histogram. In FIG. 20, it is assumed that Td1 = −5 pixels and Td2 = −15 pixels. Further, the calculated values of S and S2 in FIG. 20A are Sa and S2a, and the calculated values of S and S2 in FIG. 20B are Sb and S2b. In addition, the parallax histogram in FIG. 20A and the parallax histogram in FIG. 20B have the same shape of the portion where the amount of parallax is larger than Td1, but the shape of the portion where the amount of parallax is smaller than Td1 is different. It is assumed that the peak has a lower height, but spreads closer to the foreground. Further, it is assumed that all frequencies of the parallax histogram are equal. Under the above conditions, the sum of the frequencies in the foreground area from Td1 is equal between (a) and (b). That is, Sa = Sb. Moreover, since Td2 <Td1, Sa ≧ S2a and Sb ≧ S2b.

ここで例えばＳａの値が、ヒストグラムの全度数の２０％、Ｓ２ａはそれより少なく８％であったとする。これに対して、Ｓｂの値は、上記の通りＳａと同じくヒストグラムの全度数の２０％である。一方Ｓ２ｂの値は、図から明らかなようにＳ２ａより大きく、例えばヒストグラムの全度数の１２％といった値になる。つまり、視差ヒストグラムがより近景側に広がっていると、第二近景側度数総和Ｓ２の値はより大きくなる。 Here, for example, it is assumed that the value of Sa is 20% of the total frequency of the histogram, and S2a is 8% less than that. On the other hand, the value of Sb is 20% of the total frequency of the histogram as described above. On the other hand, the value of S2b is larger than S2a as is apparent from the figure, and is, for example, 12% of the total frequency of the histogram. That is, when the parallax histogram spreads further toward the foreground side, the value of the second foreground side frequency sum S2 becomes larger.

このことから本実施例では、近景側度数総和Ｓに加えて第二近景側度数総和Ｓ２を用いることにより、上記のように近景側の視差ヒストグラムの形状の違いを判別することが可能となる。このような判別ができるのは、Ｔｄ２＜Ｔｄ１としているため、すなわち第二近景側度数総和Ｓ２は近景側度数総和Ｓに比べてより近景側の狭い部分のみの度数総和を算出しているためである。 Therefore, in this embodiment, by using the second foreground-side power sum S2 in addition to the foreground-side power sum S, the difference in the shape of the near-field parallax histogram can be determined as described above. This determination can be made because Td2 <Td1, that is, the second foreground side frequency sum S2 calculates the frequency sum of only the narrower portion on the foreground side than the foreground side frequency sum S. is there.

Ｔｄ１については第２の実施例と同様、一般的にはＴｄ１＝０とし、視差ヒストグラムの視差０までの度数総和を算出するが、これに限られるわけではなく、０以外の視差量までの度数総和を算出しても良い。Ｔｄ２も同様に、Ｔｄ２＜Ｔｄ１を満たす範囲で適宜設定して良い。 As in the second embodiment, Td1 is generally set to 0 for Td1, and the total frequency up to parallax 0 of the parallax histogram is calculated. However, the present invention is not limited to this, and the frequency up to the parallax amount other than 0 is calculated. The sum may be calculated. Similarly, Td2 may be appropriately set within a range satisfying Td2 <Td1.

一方、視差範囲幅算出部４３は、前記の通り視差範囲幅Ｗを視差特徴量として出力する。従って、視差特徴量算出部４Ｄは視差特徴量として、視差範囲幅Ｗ、近景側度数総和Ｓ及び第二近景側度数総和Ｓ２の３つの情報を立体画像分類部５Ｄに出力する。 On the other hand, the parallax range width calculation unit 43 outputs the parallax range width W as a parallax feature amount as described above. Accordingly, the parallax feature amount calculation unit 4D outputs, as the parallax feature amount, three pieces of information of the parallax range width W, the foreground side power sum S, and the second foreground side power sum S2 to the stereoscopic image classification unit 5D.

立体画像分類部５Ｄは、視差特徴量である視差範囲幅Ｗ、近景側度数総和Ｓ、第二近景側度数総和Ｓ２を用いて立体画像を分類する。本実施例では、視差範囲幅Ｗの大きさで３種類に、近景側度数総和Ｓ及び第二近景側度数総和Ｓ２の大きさを複合させて用いて２種類にそれぞれ分類する。従って、トータルでは３×２＝６種類に分類できる。この分類結果を画像補正部７Ｄに出力する。 The stereoscopic image classification unit 5D classifies the stereoscopic image using the parallax range width W, which is a parallax feature amount, the foreground-side power sum S, and the second foreground-side power sum S2. In the present embodiment, the size of the parallax range width W is classified into three types, and the foreground side power sum S and the second foreground side power sum S2 are combined and classified into two types. Therefore, it can be classified into 3 × 2 = 6 types in total. The classification result is output to the image correction unit 7D.

視差範囲幅Ｗを用いた分類の方法は、第２の実施例に記載の方法と同様である。 The classification method using the parallax range width W is the same as the method described in the second embodiment.

近景側度数総和Ｓ及び第二近景側度数総和Ｓ２については、判定のための閾値Ｔｓ３、Ｔｓ４（０≦Ｔｓ３、０≦Ｔｓ４）を用い、以下のように分類する。
・Ｓ＞Ｔｓ３かつＳ２＜Ｔｓ４：近景側に小さな被写体が存在しかつその被写体が手前に飛び出しすぎていない立体画像
・それ以外：それ以外の立体画像
Ｔｓ３とＴｓ４の適切な値は、近景側度数総和Ｓ及び第二近景側度数総和Ｓ２を算出するための閾値Ｔｄ１、Ｔｄ２の値によっても変わるため、適宜設定する必要がある。例えば前記の図２０に関する説明における例のように、Ｔｄ１＝−５ピクセル、Ｔｄ２＝−１５ピクセルであり、Ｓａ＝Ｓｂ＝ヒストグラムの全度数の２０％、Ｓ２ａ＝８％、Ｓ２ｂ＝１２％であったとする。これに対して、Ｔｓ３＝ヒストグラムの全度数の１５％、Ｔｓ４＝１０％の値を用いるとする。すると図２０（ａ）の視差ヒストグラムについてはＳａ＞Ｔｓ３かつＳ２ａ＜Ｔｓ４であり、図２０（ｂ）の視差ヒストグラムについてはＳｂ＞Ｔｓ３かつＳ２ｂ＞Ｔｓ４となる。よって、図２０（ａ）の視差ヒストグラムを持つ立体画像は、近景側に小さな被写体が存在しかつその被写体が手前に飛び出しすぎていない立体画像と分類され、図２０（ｂ）の視差ヒストグラムを持つ立体画像は、それ以外の立体画像と分類される。図２０（ａ）と（ｂ）の視差ヒストグラムの形を比較すると、（ｂ）の方がより近景側に被写体が飛び出している。上記の判定条件は、このような状況を判別するためのものである。 The foreground side frequency sum S and the second foreground side frequency sum S2 are classified as follows using threshold values Ts3 and Ts4 (0 ≦ Ts3, 0 ≦ Ts4) for determination.
S> Ts3 and S2 <Ts4: A stereoscopic image in which there is a small subject on the foreground side and the subject does not protrude too far forward. Other: other stereoscopic images. Appropriate values for Ts3 and Ts4 are the near side frequency. Since it varies depending on the values of the thresholds Td1 and Td2 for calculating the sum S and the second foreground side power sum S2, it is necessary to set them appropriately. For example, as in the example in the description of FIG. 20, Td1 = −5 pixels and Td2 = −15 pixels, Sa = Sb = 20% of the total frequency of the histogram, S2a = 8%, and S2b = 12%. Suppose. On the other hand, assume that Ts3 = 15% of the total frequency of the histogram and Ts4 = 10% are used. Then, Sa> Ts3 and S2a <Ts4 for the parallax histogram of FIG. 20A, and Sb> Ts3 and S2b> Ts4 for the parallax histogram of FIG. 20B. Therefore, the stereoscopic image having the parallax histogram of FIG. 20A is classified as a stereoscopic image in which a small subject exists in the foreground side and the subject does not protrude too far, and has the parallax histogram of FIG. The stereoscopic image is classified as any other stereoscopic image. When the shapes of the parallax histograms in FIGS. 20A and 20B are compared, the subject in FIG. 20B protrudes closer to the foreground side. The above determination condition is for determining such a situation.

画像補正部７Ｄは、上記のように６種類のシーンに分類された立体画像分類結果に従い、所定のシーンに対して、画像補正処理を行う。補正手段は第１〜第３の実施例と同じく、左右の画像をそれぞれシフトさせることによる視差調整方法を用いる。 The image correction unit 7D performs image correction processing on a predetermined scene according to the stereoscopic image classification result classified into the six types of scenes as described above. As in the first to third embodiments, the correction means uses a parallax adjustment method by shifting the left and right images.

図２１は、上記分類処理による６種類のシーンを表に示したものである。行方向に視差範囲幅Ｗによる３つの分類を、列方向に近景側度数総和Ｓ及び第二近景側度数総和Ｓ２による２つの分類を記載し、そのマトリックスで合計６種類のシーンを示している。各マスには後述の画像補正処理の種類を記載している。 FIG. 21 is a table showing six types of scenes obtained by the classification process. Three classifications according to the parallax range width W are described in the row direction, and two classifications according to the foreground-side power sum S and the second foreground-side power sum S2 are described in the column direction, and a total of six types of scenes are shown in the matrix. Each square describes the type of image correction processing described later.

立体画像が、視差範囲幅が狭いシーンと分類された場合、すなわち図２１における「処理Ａ」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｄは左目用画像を左にずらし右目用画像を右にずらすシフト処理を行い、視差を画面全体で一様に増加させる。このシフト処理は上記で図７を用いて説明したように、奥行の幅を広げてより立体感を強調する効果がある。このシフト処理における調整量は、第１の実施例と同じ方法により決定することができる。 When the stereoscopic image is classified as a scene having a narrow parallax range width, that is, when the stereoscopic image is classified into one of the squares described as “Processing A” in FIG. 21, the image correction unit 7D moves the left-eye image to the left. A shift process for shifting the right-eye image to the right is performed, and the parallax is increased uniformly over the entire screen. As described above with reference to FIG. 7, this shift process has the effect of widening the depth and enhancing the stereoscopic effect. The adjustment amount in this shift process can be determined by the same method as in the first embodiment.

立体画像が、視差範囲幅が中程度で、かつ近景側に小さな被写体が存在し、かつその被写体が手前に飛び出しすぎていないと分類された場合、すなわち図２１における「処理Ｂ」と記載のあるマスに分類された場合は、画像補正部７Ｄは左目用画像を右にずらし右目用画像を左にずらすシフト処理を行い、視差を画面全体で一様に減少させる。このシフト処理は上記で図８を用いて説明したように、画像中の被写体全体がより近景側に移動して飛び出して見える効果がある。 When a stereoscopic image is classified as having a medium parallax range width, a small subject on the near side, and the subject not protruding too far, that is, “Processing B” in FIG. 21 is described. When the image is classified into squares, the image correction unit 7D performs a shift process for shifting the left-eye image to the right and the right-eye image to the left, thereby reducing the parallax uniformly over the entire screen. As described above with reference to FIG. 8, this shift processing has an effect that the entire subject in the image appears to move out to the near side.

ここで、視差範囲幅が中程度であること及び近景側に比較的小さな被写体が存在することを条件としているのは、第２の実施例で説明した図１２における「処理Ｂ」の場合と同じ理由による。また、近景側の小さな被写体が手前に飛び出しすぎていないことを条件としているのは、画像補正処理前の立体画像において既に被写体が飛び出している場合、さらに被写体を飛び出させる画像補正処理を行うとかえって見難くなる可能性が高いためである。このように、第２の実施例における図１２における「処理Ｂ」の場合と比べて、近景側の小さな被写体が手前に飛び出しすぎていない、という条件をさらに追加することで、被写体を手前に飛び出させる画像補正処理を行うことが適切なシーンをさらに精度高く分類することが可能となる。 Here, the condition that the parallax range is medium and that there is a relatively small subject on the foreground side is the same as the case of “Process B” in FIG. 12 described in the second embodiment. Depending on the reason. In addition, the condition is that the small subject on the foreground side does not jump out too far, but if the subject has already popped out in the stereoscopic image before the image correction processing, image correction processing for further popping out the subject is performed. This is because it is likely to be difficult to see. In this way, compared to the case of “Processing B” in FIG. 12 in the second embodiment, by adding a condition that the small subject on the foreground side does not jump too far forward, the subject jumps forward. It is possible to classify scenes that are appropriate to be subjected to image correction processing with higher accuracy.

このシフト処理における調整量は、第３の実施例における図１７における「処理Ｂ」の場合と同様の方法で定めることができる。 The adjustment amount in this shift process can be determined by the same method as in the case of “Process B” in FIG. 17 in the third embodiment.

立体画像がその他の分類をされた場合、すなわち図２１において「−」と記載のあるマスのいずれかに分類された場合は、画像補正部７Ｄは画像補正処理を行わない。これらはいずれも視差範囲幅が中程度以上であり、特に画像補正処理を行わなくても十分立体感が感じられる。また、上記図２１における「処理Ｂ」の場合のように、さらに立体感を強調できる条件でもないためである。 When the stereoscopic image is classified in other ways, that is, when the stereoscopic image is classified into any of the squares marked with “−” in FIG. 21, the image correction unit 7D does not perform the image correction process. All of these have a medium or larger parallax range width, and a sufficient stereoscopic effect can be felt without performing image correction processing. Further, this is because it is not a condition for further enhancing the stereoscopic effect as in the case of “Processing B” in FIG.

以上のように、本実施例における立体画像処理装置１Ｄは、視差範囲幅Ｗ、近景側度数総和Ｓ及び第二近景側度数総和Ｓ２という３種類の視差特徴量を用いて立体画像を６種類のシーンに分類している。これにより、第２の実施例で説明した被写体を飛び出させて立体感を強調する画像補正処理の対象となるシーンを、より的確に分類することが可能となるという効果を有する。 As described above, the stereoscopic image processing apparatus 1D according to the present embodiment converts six types of stereoscopic images using the three types of parallax feature amounts of the parallax range width W, the foreground side power sum S, and the second foreground side power sum S2. Classified into scenes. Accordingly, there is an effect that it is possible to more accurately classify a scene to be subjected to an image correction process in which the subject described in the second embodiment is projected and the stereoscopic effect is enhanced.

本発明の第５の実施例は、第３の実施例にかかる立体画像処理装置に、シーン分類結果を用いたシーンチェンジ検出部を加えたものである。これにより、立体画像が動画である場合に、シーンチェンジを的確に検出すること、及び画像補正処理をより適切に行うことが可能になる。 In a fifth embodiment of the present invention, a scene change detection unit using a scene classification result is added to the stereoscopic image processing apparatus according to the third embodiment. As a result, when the stereoscopic image is a moving image, it is possible to accurately detect a scene change and perform image correction processing more appropriately.

図２２は、第５の実施例にかかる立体画像処理装置１Ｅの構成を示すブロック図である。第３の実施例に係る立体画像処理装置１Ｃとは、画像補正部７Ｅが異なり、シーンチェンジ検出部１２が追加されている。同一の部分に関する説明は省略し、異なる部分、追加された部分を中心に説明する。 FIG. 22 is a block diagram illustrating a configuration of a stereoscopic image processing apparatus 1E according to the fifth embodiment. The image correction unit 7E is different from the stereoscopic image processing apparatus 1C according to the third embodiment, and a scene change detection unit 12 is added. The description about the same part is abbreviate | omitted, and it demonstrates centering on a different part and the added part.

図２３は、シーンチェンジ検出部１２の構成を示すブロック図である。シーンチェンジ検出部１２は、立体画像分類結果遅延部１２１と立体画像分類結果比較部１２２からなる。シーンチェンジ検出部１２は、立体画像分類部５Ｃから入力した１８種類に分類された立体画像分類結果を用いて、立体画像にシーンチェンジが発生したか否かを判定し、その結果であるシーンチェンジ検出フラグＦｓを画像補正部７Ｅに出力する。 FIG. 23 is a block diagram illustrating a configuration of the scene change detection unit 12. The scene change detection unit 12 includes a stereoscopic image classification result delay unit 121 and a stereoscopic image classification result comparison unit 122. The scene change detection unit 12 determines whether or not a scene change has occurred in the stereoscopic image using the 18 types of stereoscopic image classification results input from the stereoscopic image classification unit 5C, and the scene change that is the result of the determination. The detection flag Fs is output to the image correction unit 7E.

立体画像分類結果遅延部１２１は、入力された立体画像分類結果を保存し、次に入力された立体画像分類結果に対して１フレーム遅延させた立体画像分類結果を立体画像分類結果比較部１２２に送る。 The stereoscopic image classification result delay unit 121 stores the input stereoscopic image classification result, and then inputs the stereoscopic image classification result delayed by one frame with respect to the input stereoscopic image classification result to the stereoscopic image classification result comparison unit 122. send.

立体画像分類結果比較部１２２は、入力された立体画像分類結果及び立体画像分類結果遅延部１２１により1フレーム分遅延された立体画像分類結果を入力し、フレーム間で分類結果に変化があるか否かを判定する。すなわち図２３に示したように、（ｎ＋１）フレーム目の立体画像分類結果と、ｎフレーム目の立体画像分類結果が異なっているか否かを判定する。この２つの立体画像分類結果が異なっている場合、シーンチェンジ検出フラグＦｓに１を設定する。これは、立体画像のｎフレームと（ｎ＋１）フレームの間でシーンチェンジを検出したことを意味する。一方、この２つの立体画像分類結果が同じであった場合は、Ｆｓに０を設定する。これは、シーンチェンジが検出されなかったことを意味する。 The stereoscopic image classification result comparison unit 122 inputs the input stereoscopic image classification result and the stereoscopic image classification result delayed by one frame by the stereoscopic image classification result delay unit 121, and whether there is a change in the classification result between frames. Determine whether. That is, as shown in FIG. 23, it is determined whether or not the (n + 1) th frame stereoscopic image classification result is different from the nth frame stereoscopic image classification result. If the two stereoscopic image classification results are different, 1 is set to the scene change detection flag Fs. This means that a scene change has been detected between n frames and (n + 1) frames of the stereoscopic image. On the other hand, when the two stereoscopic image classification results are the same, 0 is set to Fs. This means that no scene change has been detected.

画像補正部７Ｅは、画像補正部７Ｃと同様、立体画像分類部５Ｃにより１８種類に分類された立体画像分類結果に従い、所定の分類に対して、画像補正処理を行う。補正手段は、第１〜第４の実施例と同じく、左右の画像をそれぞれシフトさせることによる視差調整方法を用いている。 Similar to the image correction unit 7C, the image correction unit 7E performs image correction processing on a predetermined classification according to the stereoscopic image classification results classified into 18 types by the stereoscopic image classification unit 5C. As in the first to fourth embodiments, the correction means uses a parallax adjustment method by shifting the left and right images.

ここで、立体画像が動画の場合、被写体の視差はフレーム毎に変わっていくため、基本的にはフレーム毎に立体画像の視差の分布の状況に応じて視差の調整量を適切に設定すること、すなわちフレーム毎に視差調整処理のシフト量を変化させることが望ましい。しかしフレーム毎のシフト量の変化が大きいと奥行が頻繁に変化してしまい、観察者がそれに追従するのが困難になり、かえって見難くなるという問題がある。さらには、視差検出処理にエラーが発生すると、これによるシフト量の変化が加わってさらに見難くなるという問題が発生する。これに対しては、例えばシフト量に対して時間方向にローパスフィルタ処理を施して、シフト量の変化を滑らかにすると良い。 Here, when the stereoscopic image is a moving image, the parallax of the subject changes from frame to frame. Therefore, basically, the amount of parallax adjustment is appropriately set according to the situation of the parallax distribution of the stereoscopic image for each frame. That is, it is desirable to change the shift amount of the parallax adjustment processing for each frame. However, if the change in the shift amount for each frame is large, the depth is frequently changed, which makes it difficult for the observer to follow it, which makes it difficult to see. Furthermore, when an error occurs in the parallax detection process, a change in the shift amount due to the error is added to make it more difficult to see. For this, for example, the shift amount may be smoothed by performing low-pass filter processing on the shift amount in the time direction.

一方、シーンチェンジの前後においては、立体画像の被写体の視差が不連続に変わる場合がある。この場合、シーンチェンジ前のシーンと後のシーンを共に立体感豊かに表示するためには、シーンチェンジ時のシフト量の変化が大きくなるとしても、シーンチェンジ前後のそれぞれのシーンに適したシフト量を使用することが望ましい。そこで、画像補正部７Ｅは、シーンチェンジ検出部１２から入力されたシーンチェンジ検出フラグＦｓが１の場合、すなわちシーンチェンジが発生している場合は、上記のローパスフィルタ処理を一旦解除し、シフト量を不連続に変化させることを許容する。このような処理を行うことで、シーンチェンジ時以外はシフト量に対するローパスフィルタ処理によりシフト量を滑らかに変化させることで見苦しさを防止しすることと、シーンチェンジが発生した場合はシーンチェンジ後の立体画像のシフト量に素早く変更することを両立させた視差調整処理を実現できる。すなわち、より適切な視差調整処理を実現できる。 On the other hand, the parallax of the subject of the stereoscopic image may change discontinuously before and after the scene change. In this case, in order to display both the scene before and after the scene change with a rich stereoscopic effect, even if the change in the shift amount at the time of the scene change becomes large, the shift amount suitable for each scene before and after the scene change It is desirable to use Therefore, when the scene change detection flag Fs input from the scene change detection unit 12 is 1, that is, when a scene change has occurred, the image correction unit 7E temporarily cancels the above-described low-pass filter processing, and shift amount Is allowed to change discontinuously. By performing such processing, it is possible to prevent unsightly by changing the shift amount smoothly by low-pass filter processing for the shift amount except during a scene change, and if a scene change occurs, after the scene change It is possible to realize parallax adjustment processing that is compatible with quickly changing the shift amount of a stereoscopic image. That is, more appropriate parallax adjustment processing can be realized.

また、例えば画像の輝度情報の変化を用いてシーンチェンジ検出を行う従来技術があるが、この方法では視差量の変化を捉えることはできない。本実施例の立体画像処理装置１Ｄのように視差調整処理を行う場合には、視差量の変化を捉えることのできるシーンチェンジ検出方法を用いることが望ましい。例えば、シーンチェンジの前後で輝度情報には大きな変化は無いが視差量には変化がある場合、視差量の変化を捉えることのできるシーンチェンジ検出方法でなければ検出できないためである。シーンチェンジ検出部１２によるシーンチェンジ検出手段は、視差量の変化を捉えたシーンチェンジ検出を可能とするものである。 Further, for example, there is a conventional technique in which scene change detection is performed using a change in luminance information of an image, but this method cannot capture a change in the amount of parallax. When the parallax adjustment processing is performed as in the stereoscopic image processing apparatus 1D of the present embodiment, it is desirable to use a scene change detection method that can capture a change in the amount of parallax. For example, if there is no significant change in luminance information before and after a scene change but there is a change in the amount of parallax, it can be detected only by a scene change detection method that can capture the change in the amount of parallax. The scene change detection means by the scene change detection unit 12 enables scene change detection that captures a change in the amount of parallax.

なお、立体画像分類部５Ｃは入力された立体画像を１８種類に分類するが、画像補正部７Ｅは「処理Ａ」、「処理Ｂ」、「処理Ｃ」、「−」（処理無し）の４通りの画像補正処理を行っている。つまり、１８種類の分類のうちいくつかは同じ画像補正処理を行うため、まとめられている。しかしシーンチェンジにおいては、元の１８種類の分類を用いて、それに変化があった時にシーンチェンジと判定する。これは、たとえ同じ画像補正処理を行う立体画像と分類された場合であっても、シーンチェンジ時には補正量が異なる可能性があるため、分類が異なる場合にはシーンチェンジと判定すべきであるためである。 The stereoscopic image classification unit 5C classifies the input stereoscopic image into 18 types, but the image correction unit 7E has four types of “processing A”, “processing B”, “processing C”, and “−” (no processing). A normal image correction process is performed. That is, some of the 18 types of classifications are grouped together to perform the same image correction process. However, in the scene change, the original 18 types of classification are used, and when there is a change, the scene change is determined. This is because even if the image is classified as a stereoscopic image subjected to the same image correction processing, the correction amount may be different at the time of a scene change, so if the classification is different, it should be determined as a scene change. It is.

以上のように、本実施例における立体画像処理装置１Ｅは、立体画像のシーン分類結果を用いてシーンチェンジ検出を行う。これにより、立体画像の被写体の視差の変化を捉えたシーンチェンジ検出を可能とするという効果を有する。 As described above, the stereoscopic image processing apparatus 1E according to the present embodiment performs scene change detection using the scene classification result of the stereoscopic image. This has the effect of making it possible to detect scene changes that capture changes in the parallax of a subject in a stereoscopic image.

また、シーンチェンジ検出結果を立体画像に対する画像補正処理に用いることで、より適切な視差調整処理を実現できるという効果を有する。 Further, the use of the scene change detection result for the image correction process for the stereoscopic image has an effect that a more appropriate parallax adjustment process can be realized.

ところで、本実施例においては、立体画像の（ｎ＋１）フレーム目との立体画像分類結果とｎフレーム目の立体画像分類結果の変化を見てシーンチェンジを検出した。しかし検出方法はこれに限らない。例えば、立体画像の（ｎ＋１）フレーム目の立体画像分類結果と（ｎ−１）フレーム目の立体画像分類結果の変化や、（ｎ＋１）フレーム目の立体画像分類結果と（ｎ−２）フレーム目の立体画像分類結果の変化のように、２以上離れたフレーム間での立体画像分類結果の変化によりシーンチェンジを検出することも可能である。このような場合、シーンチェンジ検出に用いる２つのフレームのうち、時間的に遅いほうのフレームとその一つ前のフレームとの間のシーンチェンジを検出することになる。例えば、（ｎ＋１）フレーム目の立体画像分類結果と（ｎ−１）フレーム目の立体画像分類結果の変化によりシーンチェンジ検出する場合は、時間的に遅い（ｎ＋１）フレーム目と、その一つ前のｎフレーム目との間のシーンチェンジを検出することになる。 By the way, in this embodiment, the scene change is detected by looking at the change in the stereoscopic image classification result of the (n + 1) th frame of the stereoscopic image and the change of the stereoscopic image classification result of the nth frame. However, the detection method is not limited to this. For example, the change of the (n + 1) th frame stereoscopic image classification result and the (n−1) th frame stereoscopic image classification result, the (n + 1) th frame stereoscopic image classification result, and the (n−2) th frame. It is also possible to detect a scene change by a change in the stereoscopic image classification result between two or more frames, as in the change in the stereoscopic image classification result. In such a case, a scene change between a frame that is later in time and a frame immediately before the two frames used for scene change detection is detected. For example, when a scene change is detected based on a change in the (n + 1) th frame stereoscopic image classification result and the (n−1) th frame stereoscopic image classification result, the (n + 1) th frame later in time and the previous one A scene change between the nth frame and the nth frame is detected.

図２４を用いて、シーンチェンジ検出に用いるフレームと、それによって検出されるシーンチェンジの位置の関係をさらに説明する。図２４は、（ｎ−２）フレームから（ｎ＋２）フレームまでの各フレーム目を時系列順に示している。（ｎ−２）フレームからｎフレームまでが１つのシーン、ｎフレームと（ｎ＋１）フレームの間でシーンチェンジがあって立体画像分類結果が変わり、（ｎ＋１）フレームから（ｎ＋２）フレームが別のシーンである状況を図示している。 The relationship between the frame used for scene change detection and the position of the scene change detected thereby will be further described with reference to FIG. FIG. 24 shows the frames from the (n−2) frame to the (n + 2) frame in chronological order. (N-2) frame to n frame is one scene, there is a scene change between n frame and (n + 1) frame, the stereoscopic image classification result changes, and (n + 1) frame to (n + 2) frame is another scene The situation is illustrated.

隣接するフレーム間の立体画像分類結果の変化によりシーンチェンジを検出する場合、時間の経過につれ、
（１）（ｎ−２）フレームと（ｎ−１）フレームの間の立体画像分類結果の変化を判定
（２）（ｎ−１）フレームとｎフレームの間の立体画像分類結果の変化を判定
（３）ｎフレームと（ｎ＋１）フレームの間の立体画像分類結果の変化を判定
・・・
というように順に処理を行う。（１）、（２）の処理では変化が無く、（３）の処理で変化があるため、（３）の処理においてシーンチェンジを検出する。 When a scene change is detected by a change in the stereoscopic image classification result between adjacent frames, over time,
(1) Judgment of change in stereoscopic image classification result between (n-2) frame and (n-1) frame (2) Judgment of change in stereoscopic image classification result between (n-1) frame and n frame (3) Judgment of change in stereoscopic image classification result between n frames and (n + 1) frames
The process is performed in order. Since there is no change in the processes (1) and (2) and there is a change in the process (3), a scene change is detected in the process (3).

一方、一つ離れたフレーム間の多次元統計量の距離によりシーンチェンジを検出する場合、時間の経過につれ、
（１’）（ｎ−２）フレームとｎフレームの間の立体画像分類結果の変化を判定
（２’）（ｎ−１）フレームと（ｎ＋１）フレームの間の立体画像分類結果の変化を判定
（３’）ｎフレームと（ｎ＋２）フレームの間の立体画像分類結果の変化を判定
・・・
というように順に処理を行う。（１’）の処理では変化が無く、（２’）の処理では変化があるため、（２’）の処理においてシーンチェンジを検出する。また、その次の（３’）の処理でも、ｎフレームと（ｎ＋２）フレームが違うシーンに属していることから、変化がある。このように、一つ離れたフレーム間の立体画像分類結果の変化によりシーンチェンジを検出する場合、一つのシーンチェンジに対し２度立体画像分類結果の変化がある。これは誤検出となるため、例えばある時点の処理でシーンチェンジ検出すると、その次のフレームでの処理結果は無視するようにすることで解決できる。 On the other hand, when a scene change is detected based on the distance of multi-dimensional statistics between one frame apart, over time,
(1 ′) Determine change in stereoscopic image classification result between (n−2) frame and n frame (2 ′) Determine change in stereoscopic image classification result between (n−1) frame and (n + 1) frame (3 ') Judgment of change in stereoscopic image classification result between n frames and (n + 2) frames ...
The process is performed in order. Since there is no change in the process (1 ′) and there is a change in the process (2 ′), a scene change is detected in the process (2 ′). In the next process (3 ′), there is a change because the n frame and the (n + 2) frame belong to different scenes. As described above, when a scene change is detected based on a change in the stereoscopic image classification result between frames that are separated by one, there is a change in the stereoscopic image classification result twice for one scene change. Since this is an erroneous detection, for example, when a scene change is detected in a process at a certain time, the process result in the next frame can be ignored.

このように、複数のフレーム間での視差に関する立体画像分類結果の変化によりシーンチェンジを検出することが可能である。 As described above, it is possible to detect a scene change based on a change in the stereoscopic image classification result regarding the parallax between a plurality of frames.

本実施例の立体画像処理装置１Ｅは、第３の実施例における立体画像処理装置１Ｃにシーンチェンジ検出部１２を追加したものである。同様に、第１の実施例における立体画像処理装置１Ａ、第２の実施例における立体画像処理装置１Ｂ、第４の実施例における立体画像処理装置１Ｄにシーンチェンジ検出部１２を追加した構成とすることも可能である。これらの場合も同様に、立体画像分類結果の変化によりシーンチェンジを検出可能である。 A stereoscopic image processing apparatus 1E according to the present embodiment is obtained by adding a scene change detection unit 12 to the stereoscopic image processing apparatus 1C according to the third embodiment. Similarly, the scene change detection unit 12 is added to the stereoscopic image processing apparatus 1A in the first embodiment, the stereoscopic image processing apparatus 1B in the second embodiment, and the stereoscopic image processing apparatus 1D in the fourth embodiment. It is also possible. In these cases as well, a scene change can be detected by a change in the stereoscopic image classification result.

上記の第１〜５の実施例では、立体感を強調する補正処理に関して説明した。しかし実際は、このような立体感の強調処理だけではなく、従来技術にあるような、立体画像の安全性を確保する補正処理も併せて行っても良い。 In the first to fifth embodiments, the correction process for enhancing the stereoscopic effect has been described. However, in practice, not only such a stereoscopic effect enhancement process, but also a correction process for ensuring the safety of a stereoscopic image as in the prior art may be performed.

上記の第１〜第５の実施例における視差範囲算出部４３では、視差ヒストグラムからその画像における最近景の視差量と最遠景の視差量の幅から、視差範囲幅を求めた。しかし視差範囲幅の求め方はこれに限らない。例えば、立体画像の視差マップから直接、最近景の視差量と最遠景の視差量を探索し、その幅を視差範囲幅としても良い。すなわち、どのような求め方であれ、立体画像の最近景の視差量と最遠景の視差量の幅を求め、それを視差範囲幅とすればよい。 In the parallax range calculation unit 43 in the first to fifth embodiments, the parallax range width is obtained from the parallax amount of the closest view and the parallax amount of the farthest view in the image from the parallax histogram. However, the method of obtaining the parallax range width is not limited to this. For example, the parallax amount of the nearest scene and the parallax amount of the farthest view may be searched directly from the parallax map of the stereoscopic image, and the width may be set as the parallax range width. In other words, whatever method is used, the width of the parallax amount of the most recent view and the parallax amount of the farthest view of the stereoscopic image may be obtained and used as the parallax range width.

上記の第１〜第５の実施例では、視差マップから視差特徴量を算出した。しかし、入力された立体画像データの付加情報として視差特徴量のデータが付加されていても良い。この場合、視差特徴量算出部４Ａ〜４Ｄにおける視差特徴量を求める処理は、スキップしても良い。 In the above first to fifth embodiments, the parallax feature amount is calculated from the parallax map. However, parallax feature amount data may be added as additional information of the input stereoscopic image data. In this case, the process of obtaining the parallax feature amount in the parallax feature amount calculation units 4A to 4D may be skipped.

上記の第１〜第５の実施例では、立体画像データが静止画の場合は、同じ画像を連続して出力し、動画と同様に扱った。しかしこれに限らず、立体画像データが静止画の場合に、該静止画を1枚ずつ処理しても良い。 In the first to fifth embodiments described above, when the stereoscopic image data is a still image, the same image is continuously output and handled in the same manner as a moving image. However, the present invention is not limited to this, and when the stereoscopic image data is a still image, the still image may be processed one by one.

上記の第１〜第５の実施例では、表示部９として、左目用画像と右目用画像を交互に表示する液晶表示パネルを用い、シャッタメガネ１１と組み合わせて立体画像を呈示している。しかし、立体画像の表示方式はこれに限定されるものではない。右目用画像と左目用画像を異なる偏光方式で表示し、左右で異なる偏光フィルタを有するメガネにより観察する方式を用いてもかまわない。また、レンチキュラレンズやパララックスバリアを用いた表示部を使用し、観察者がメガネをかけなくても立体視可能な方式を用いてもかまわない。 In said 1st-5th Example, the liquid crystal display panel which displays the image for left eyes and the image for right eyes alternately as the display part 9 is combined with the shutter glasses 11, and the three-dimensional image is shown. However, the stereoscopic image display method is not limited to this. A right-eye image and a left-eye image may be displayed using different polarization methods, and a method of observing with glasses having different polarization filters may be used. In addition, a display unit using a lenticular lens or a parallax barrier may be used, and a method that enables stereoscopic viewing without an observer wearing glasses may be used.

上記の第１〜第５の実施例では、立体画像のシーン分類結果を、立体感を強調する視差調整処理を施すことに用いた。しかし、本実施例による立体画像のシーン分類結果を他の用途に用いることも可能である。例えば、上記の立体画像分類機能により複数枚の立体画像の静止画をシーン分類して、シーンごとに分けて立体画像のサムネイルを画面に表示させ、観察者にどのような立体画像がどのようなシーンに属するかを一覧させる、という用い方も可能である。また、立体画像間の視差の変化が小さくなるように並べ替えて表示する、といった用い方も可能である。また、複数枚の立体画像を自動的に順次切り替えて表示する際に、立体感の乏しい立体画像は自動的に飛ばして表示することで、観察者には立体感の豊かな立体画像のみを呈示する、といった用い方も可能である。さらに、上記の立体画像分類機能を立体画像記録装置、立体画像編集装置、立体画像編集プログラム等に用いて、立体画像のシーンチェンジ点に自動的にインデックスを入れ、立体画像再生時や編集時の目印に用いることも可能である。 In the first to fifth embodiments described above, the scene classification result of the stereoscopic image is used for performing a parallax adjustment process that enhances the stereoscopic effect. However, the scene classification result of the stereoscopic image according to the present embodiment can be used for other purposes. For example, the above three-dimensional image classification function classifies still images of a plurality of three-dimensional images into scenes, displays thumbnails of the three-dimensional images for each scene, and displays what three-dimensional images to the observer. It is also possible to use a list of whether the scene belongs. Further, it is possible to use such that the images are rearranged and displayed so that the change in parallax between the stereoscopic images is reduced. In addition, when a plurality of stereoscopic images are automatically switched and displayed sequentially, stereoscopic images with poor stereoscopic effect are automatically skipped and displayed, so that only a stereoscopic image with a rich stereoscopic effect is presented to the observer. It is also possible to use it. Furthermore, the above-mentioned stereoscopic image classification function is used for a stereoscopic image recording device, a stereoscopic image editing device, a stereoscopic image editing program, etc., and automatically indexing the scene change points of the stereoscopic image, when reproducing or editing the stereoscopic image. It can also be used as a mark.

上記の第２〜第５の実施例では、近景側度数総和算出部４４又は第二近景側度数総和算出部４７は、視差ヒストグラム作成部４２で作成した視差ヒストグラムを用いて、近景側度数総和Ｓ又は第二近景側度数総和Ｓ２を算出した。しかし、視差算出部４１で求めた画像全域の視差のデータから直接、近景側度数総和Ｓ等を算出しても良い。例えば、画像全域の視差のデータが立体画像の画素毎の視差データである場合は、視差がＴｄ１以下の画素数を数えることで、近景側度数総和Ｓを算出できる。 In the second to fifth embodiments described above, the foreground side power sum calculation unit 44 or the second foreground side power sum calculation unit 47 uses the parallax histogram created by the parallax histogram creation unit 42 to use the foreground side power sum S. Or 2nd foreground side frequency sum total S2 was computed. However, the foreground-side power sum S or the like may be calculated directly from the parallax data of the entire image obtained by the parallax calculation unit 41. For example, when the parallax data of the entire image is parallax data for each pixel of the stereoscopic image, the foreground-side power sum S can be calculated by counting the number of pixels whose parallax is Td1 or less.

上記の第３及び第５の実施例では、画像周辺部近景側度数総和算出部４６において、画像周辺部視差ヒストグラム作成部４５で作成した画像周辺部視差ヒストグラムを用いて、画像周辺部近景側度数総和Ｓｅを算出した。しかし、視差算出部４１で求めた画像全域の視差のデータのうち画像の周辺部分の視差のデータを用いて直接、画像周辺部近景側度数総和Ｓｅを算出しても良い。 In the third and fifth embodiments described above, the image peripheral portion near view side frequency using the image peripheral portion parallax histogram created by the image peripheral portion parallax histogram creating portion 45 in the image peripheral portion foreground side frequency sum total calculating portion 46. Total Se was calculated. However, it is also possible to directly calculate the image peripheral portion near view side frequency total Se using the parallax data of the peripheral portion of the image among the parallax data of the entire image obtained by the parallax calculation unit 41.

上記の第１〜第５の実施例では、立体画像処理装置に関して説明した。しかしその他、立体画像表示装置、立体画像編集装置、立体画像処理方法、立体画像表示方法、立体画像編集方法等、立体画像に関する幅広い装置及び方法に利用することが可能である。 In the first to fifth embodiments, the stereoscopic image processing apparatus has been described. However, the present invention can be applied to a wide range of apparatuses and methods related to stereoscopic images, such as a stereoscopic image display apparatus, a stereoscopic image editing apparatus, a stereoscopic image processing method, a stereoscopic image display method, and a stereoscopic image editing method.

以上、第１〜第５の実施例について具体的に説明を行ったが、本発明はそれらに限定されるものではない。上述した５つの実施例にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施の形態についても本発明の技術的範囲に含まれる。 The first to fifth embodiments have been specifically described above, but the present invention is not limited to them. Embodiments obtained by appropriately combining the technical means disclosed in the five examples described above are also included in the technical scope of the present invention.

また、上記の各実施例において、添付図面に図示されている構成等については、あくまで一例であり、これらに限定されるものではなく、本発明の効果を発揮する範囲内で適宜変更することが可能である。その他、本発明の目的の範囲を逸脱しない限りにおいて適宜変更して実施することが可能である。 Further, in each of the above-described embodiments, the configuration and the like illustrated in the accompanying drawings are merely examples, and are not limited thereto, and may be appropriately changed within the scope of the effects of the present invention. Is possible. In addition, various modifications can be made without departing from the scope of the object of the present invention.

上記の各実施例の説明では、機能を実現するための各構成要素をそれぞれ異なる部位であるとして説明を行っているが、実際にこのように明確に分離して認識できる部位を有していなければならないわけではない。上記の各実施例の機能を実現する立体画像処理装置が、機能を実現するための各構成要素を、例えば実際にそれぞれ異なる部位を用いて構成していてもかまわないし、あるいは、全ての構成要素を一つのＬＳＩに実装していてもかまわない。すなわち、どういう実装形態であれ、機能として各構成要素を有していれば良い。 In the description of each of the above embodiments, each component for realizing the function is described as being a different part, but it should actually have a part that can be clearly separated and recognized in this way. It doesn't have to be. The stereoscopic image processing apparatus that implements the functions of the above embodiments may configure each component for realizing the function using, for example, different parts, or all the components. May be mounted on a single LSI. That is, what kind of mounting form should just have each component as a function.

また、上記の各実施例で説明した機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することにより各部の処理を行っても良い。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。 In addition, a program for realizing the functions described in the above embodiments is recorded on a computer-readable recording medium, and the program recorded on the recording medium is read into a computer system and executed. Processing may be performed. Here, the “computer system” includes an OS and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW system is used.

また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含むものとする。また前記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 The “computer-readable recording medium” refers to a storage device such as a flexible medium, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory in a computer system serving as a server or a client in that case, and a program that holds a program for a certain period of time are also included. The program may be a program for realizing a part of the above-described functions, or may be a program that can realize the above-described functions in combination with a program already recorded in a computer system.

本発明は、立体画像処理装置、立体画像表示装置、立体画像編集装置、立体画像処理方法、立体画像表示方法、立体画像編集方法等、立体画像に関する幅広い装置及び方法に利用することが可能である。 INDUSTRIAL APPLICABILITY The present invention can be used in a wide range of apparatuses and methods related to stereoscopic images, such as stereoscopic image processing apparatuses, stereoscopic image display apparatuses, stereoscopic image editing apparatuses, stereoscopic image processing methods, stereoscopic image display methods, and stereoscopic image editing methods. .

１Ａ〜１Ｅ立体画像処理装置
２入力部
３立体画像処理部
４Ａ〜４Ｄ視差特徴量算出部
５Ａ〜５Ｄ立体画像分類部
６画像遅延部
７Ａ〜７Ｅ画像補正部
８表示制御部
９表示部
１０メガネ同期部
１１シャッタメガネ
１２シーンチェンジ検出部
４１視差算出部
４２視差ヒストグラム作成部
４３視差範囲幅算出部
４４近景側度数総和算出部
４５画像周辺部視差ヒストグラム生成部
４６画像周辺部近景側度数総和算出部
４７第二近景側度数総和算出部
１２１立体画像分類結果遅延部
１２２立体画像分類結果比較部 1A to 1E Stereoscopic image processing device 2 Input unit 3 Stereoscopic image processing units 4A to 4D Parallax feature amount calculation units 5A to 5D Stereoscopic image classification unit 6 Image delay units 7A to 7E Image correction unit 8 Display control unit 9 Display unit 10 Glasses synchronization Unit 11 shutter glasses 12 scene change detection unit 41 parallax calculation unit 42 parallax histogram creation unit 43 parallax range width calculation unit 44 foreground side frequency sum total calculation unit 45 image peripheral part parallax histogram generation unit 46 image peripheral part near background side frequency total calculation unit 47 Second foreground side frequency sum total calculation unit 121 Stereo image classification result delay unit 122 Stereo image classification result comparison unit

Claims

A stereoscopic image processing apparatus for classifying a stereoscopic image scene,
An input unit for inputting the stereoscopic image;
A parallax feature amount calculation unit that calculates one or more parallax feature amounts of the stereoscopic image from the stereoscopic image;
A stereoscopic image classification unit that classifies the stereoscopic image into a plurality of scenes based on the parallax feature amount;
A stereoscopic image processing device in which at least one of the parallax feature amounts is calculated from two parallax amounts .

The stereoscopic image processing apparatus according to claim 1, further comprising an image correction unit that performs a stereoscopic image correction process based on the scene.

The stereoscopic image processing apparatus according to claim 2, wherein the parallax feature amount is a parallax range width that is a width of a parallax amount of the most recent view and a parallax amount of the farthest view of the stereoscopic image.

A stereoscopic image processing apparatus for classifying a stereoscopic image scene,
An input unit for inputting the stereoscopic image;
A parallax feature amount calculation unit that calculates one or more parallax feature amounts of the stereoscopic image from the stereoscopic image;
A stereoscopic image classification unit that classifies the stereoscopic image into a plurality of scenes based on the parallax feature amount;
The stereoscopic image processing apparatus, wherein the parallax feature amount is a foreground-side power sum determined by a near-field parallax amount from a first constant related to the parallax amount.

A stereoscopic image processing apparatus for classifying a stereoscopic image scene,
An input unit for inputting the stereoscopic image;
A parallax feature amount calculation unit that calculates one or more parallax feature amounts of the stereoscopic image from the stereoscopic image;
And a stereoscopic image classification section for classifying the three-dimensional image into a plurality of scenes by the parallax feature amount, the parallax feature amount is viewed from the second constants for smaller parallax amount than the first constant related Saryou the foreground side A stereoscopic image processing apparatus that is a second foreground side power sum determined by the amount of parallax.

A stereoscopic image processing apparatus for classifying a stereoscopic image scene,
An input unit for inputting the stereoscopic image;
A parallax feature amount calculation unit that calculates one or more parallax feature amounts of the stereoscopic image from the stereoscopic image;
A stereoscopic image classification unit that classifies the stereoscopic image into a plurality of scenes based on the parallax feature amount;
The stereoscopic image processing device, wherein the parallax feature amount is an image peripheral portion foreground side frequency sum determined by a parallax amount in the peripheral portion of the stereoscopic image closer to the foreground than a third constant related to the parallax amount.

The stereoscopic image classification unit performs a process of classifying the stereoscopic image by comparing the parallax range width with a first parallax range width constant;
The stereoscopic image processing apparatus according to claim 3, wherein the image correction unit performs a parallax adjustment process for increasing a parallax of the stereoscopic image in a scene in which the parallax range width is smaller than the first parallax range width constant.

The stereoscopic image classification unit compares the parallax range width with a first parallax range width constant and a second parallax range width constant larger than that, and classifies the stereoscopic image, and a first constant related to a parallax amount Performing a process of classifying the stereoscopic image by comparing the near-field side power sum determined by the near-field parallax amount with the first foreground-side power sum constant and a larger second foreground-side power sum constant,
The image correction processing unit is a scene in which the parallax range width is larger than the first parallax range width constant and smaller than the second parallax range width constant, and the near-field side power sum is the first foreground. The stereoscopic image processing apparatus according to claim 3, wherein a parallax adjustment process for reducing parallax of the stereoscopic image is performed on a scene that is larger than a side power sum constant and smaller than the second foreground side power sum constant.

The stereoscopic image classifying unit compares the parallax range width with a first parallax range width constant and the second parallax range width constant larger than the first parallax range width constant , From the process of comparing and classifying the near-field-side power sum constant determined by the disparity amount on the foreground side from the constant and the third near-field side power sum constant, and the second constant related to the disparity amount smaller than the first constant related to the disparity amount Performing a process of classifying the stereoscopic image by comparing the second foreground side power sum determined by the amount of parallax on the foreground side and the fourth foreground side power sum constant,
The image correction processing unit is a scene in which the parallax range width is larger than the first parallax range width constant and smaller than the second parallax range width constant, and the foreground-side power sum is the third
A parallax adjustment process for reducing the parallax of the stereoscopic image is performed on a scene that is larger than the foreground side power sum constant of the second scene and the second foreground side power sum is smaller than the fourth foreground side power sum constant. Item 4. The stereoscopic image processing apparatus according to Item 3.

The stereoscopic image classification unit compares the parallax range width with a first parallax range width constant to classify the stereoscopic image, and a peripheral portion of the stereoscopic image closer to the foreground than the third constant related to the amount of parallax Performing a process of classifying the stereoscopic image by comparing the image peripheral portion near view side power sum total determined by the amount of parallax in the first image peripheral portion near view side power sum constant,
The image correction processing unit is a scene in which the parallax range width is larger than the first parallax range width constant, and the image peripheral portion foreground side power sum is larger than the first image peripheral portion foreground side power sum constant. The stereoscopic image processing apparatus according to claim 3, wherein a parallax adjustment process for increasing a parallax of the stereoscopic image is performed on a scene.

The stereoscopic image processing apparatus according to claim 1, further comprising a scene change detection unit that detects a scene change of the stereoscopic image based on a change in the scene.

A stereoscopic image processing method for classifying a stereoscopic image scene,
Inputting the stereoscopic image;
Calculating one or more parallax feature quantities of the stereoscopic image from the stereoscopic image;
Classifying the stereoscopic image into a plurality of scenes according to the parallax feature amount, and
A stereoscopic image processing method in which at least one of the parallax feature amounts is calculated from two parallax amounts .

A stereoscopic image processing method for classifying a stereoscopic image scene,
Inputting one or more parallax feature quantities of the stereoscopic image from the stereoscopic image;
Classifying the stereoscopic image into a plurality of scenes according to the parallax feature amount, and
Wherein at least one of the parallax feature amount, the three-dimensional image processing method which has been calculated from the two parallax amount.

A program for causing a computer to execute stereoscopic image processing for classifying a stereoscopic image scene,
Inputting the stereoscopic image;
Calculating one or more parallax feature quantities of the stereoscopic image from the stereoscopic image;
Classifying the stereoscopic image into a plurality of scenes according to the parallax feature amount, and
At least one of the parallax feature amounts is a program calculated from two parallax amounts .

A program for causing a computer to execute stereoscopic image processing for classifying a stereoscopic image scene,
Inputting one or more parallax feature quantities of the stereoscopic image from the stereoscopic image;
Classifying the stereoscopic image into a plurality of scenes according to the parallax feature amount, and
At least one of the parallax feature amounts is a program calculated from two parallax amounts.