JP2014112794A

JP2014112794A - Stereoscopic image generating apparatus, stereoscopic image generating method, and program

Info

Publication number: JP2014112794A
Application number: JP2012266492A
Authority: JP
Inventors: Motohisa Ito; 元久伊藤
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2012-12-05
Filing date: 2012-12-05
Publication date: 2014-06-19

Abstract

PROBLEM TO BE SOLVED: To generate stereoscopic image data prevented from causing motion sickness.SOLUTION: A most projecting pixel presence possible region setting unit 11 sets an area which can include a pixel having a largest projecting amount, to inputted image data. A largest projecting amount calculation unit 12 calculates the largest projecting amount of a pixel in the set area. A stereoscopic image generating unit 13 determines the largest projecting amount of the pixel in the stereoscopic image data on the basis of the calculated largest projecting amount, and generates the stereoscopic image data from the image data on the basis of the determined largest projecting amount.

Description

本発明は、立体画像データを生成する技術に関するものである。 The present invention relates to a technique for generating stereoscopic image data.

デジタル技術の進歩により、立体画像の撮影や編集が可能な立体画像生成装置が普及しつつある。例えば、従来大掛かりな装置が必要であった立体画像の撮影は、手持ちのカムコーダで可能となっている。このような立体画像生成装置の普及に伴い、流布する立体画像の数も急激に増加している。 With the advancement of digital technology, stereoscopic image generating apparatuses capable of capturing and editing stereoscopic images are becoming widespread. For example, it is possible to shoot a stereoscopic image, which conventionally requires a large-scale device, with a hand-held camcorder. With the spread of such 3D image generation devices, the number of 3D images to be disseminated is increasing rapidly.

人間は、視差や輻輳又は眼球の焦点調整作用により立体感を覚えている。自然視では、視差や輻輳で認識した対象物の位置と、眼球の焦点調整作用で認識した対象物の位置とが一致しており、違和感を覚えることはない。しかし、立体画像表示装置で表示された立体画像を見ると、視差や輻輳で認識した対象物の位置は、立体画像表示装置の表示面に対して手前方向又は奥方向であるのに対して、眼球の焦点調整作用で認識した対象物の位置は、立体画像表示装置の表示面上になる。従って、対象物の位置情報に不一致が生じ、違和感を覚えることになる。さらに、対象物が移動すると、視差や輻輳で認識した対象物の位置と眼球の焦点調整作用で認識した対象物の位置との差異が変化する。この位置情報の差異の変化により上述した違和感はますます増加し、その結果、より大きな疲労感を覚えたり、又は、頭痛を感じたり吐き気を催す。このような状態を映像酔いと称している。即ち、立体画像の鑑賞は、平面画像の鑑賞と比較して、映像酔いを引き起こしやすいといえる。 Humans have a three-dimensional effect due to parallax, vergence, or focus adjustment of the eyeball. In natural vision, the position of the object recognized by parallax or convergence matches the position of the object recognized by the focus adjustment function of the eyeball, and there is no sense of incongruity. However, when viewing the stereoscopic image displayed on the stereoscopic image display device, the position of the object recognized by the parallax or the convergence is the front direction or the back direction with respect to the display surface of the stereoscopic image display device. The position of the object recognized by the focus adjustment function of the eyeball is on the display surface of the stereoscopic image display device. Therefore, the position information of the object is inconsistent, and the user feels uncomfortable. Further, when the object moves, the difference between the position of the object recognized by parallax or convergence and the position of the object recognized by the focus adjustment function of the eyeball changes. Due to the change in the positional information difference, the above-mentioned uncomfortable feeling increases more and more, and as a result, the user feels more tired or feels headache or nausea. Such a state is called video sickness. That is, it can be said that viewing a stereoscopic image is more likely to cause video sickness than viewing a planar image.

一般消費者が家庭用の機器で作成された立体画像は、映像酔いの防止に十分配慮されているとは限らない。従って、一般消費者により作成された立体画像は、映像酔いを引き起こす虞が比較的高いといえる。一方、専門的な知識を有する者が映像酔いの防止に十分配慮して作成した立体画像は、映像酔いを引き起こす虞は相対的に低いといえる。しかし、視聴者の体調や視聴環境によっては、映像酔いの防止に十分配慮して作成された立体画像であっても、映像酔いを引き起こす可能性が存在する。 A stereoscopic image created by a consumer with a household device is not always considered enough to prevent motion sickness. Therefore, it can be said that a stereoscopic image created by a general consumer has a relatively high risk of causing motion sickness. On the other hand, it can be said that a stereoscopic image created by a person with specialized knowledge with sufficient consideration for prevention of video sickness has a relatively low risk of causing video sickness. However, depending on the physical condition and viewing environment of the viewer, there is a possibility of causing video sickness even if the stereoscopic image is created with sufficient consideration for prevention of video sickness.

特許文献１及び２には、立体画像による映像酔いを軽減するための技術が開示されている。即ち、特許文献１に開示される技術は、視聴者の視線変移量が閾値を超えた場合、急激な変化の少ない画像に差替えて表示させることにより、視聴者の映像酔いを防止するものである。特許文献２に開示される技術は、視差調整情報に基づき視差量を調整し、調整後の視差量を基に立体画像を生成することにより、違和感を軽減し映像酔いを防止するものである。 Patent Documents 1 and 2 disclose techniques for reducing video sickness caused by stereoscopic images. In other words, the technique disclosed in Patent Document 1 prevents viewers from getting sick of video by replacing and displaying an image with little sudden change when the amount of gaze shift of the viewer exceeds a threshold value. . The technique disclosed in Patent Document 2 adjusts the amount of parallax based on parallax adjustment information and generates a stereoscopic image based on the adjusted amount of parallax, thereby reducing discomfort and preventing video sickness.

特開２０１０−２６６９８９号公報JP 2010-266989 A 特開２０１１−３５７１２号公報JP 2011-35712 A

しかしながら、特許文献１に開示される技術は、視聴者の視線変移量に基づき映像酔いの防止を図る技術であるため、視聴者の視線変移量を計測する手段を有しない立体画像表示装置では実施することができない。同様の理由で、特許文献１に開示される技術は、映像酔いの発生を抑止した立体画像の制作に適用することはできない。さらに、特許文献１に開示される技術は、視聴者が複数である場合、視線変移量は視聴者毎に異なることがあるため、対応が難しい。また、特許文献２に開示される技術は、視線変移は考慮していないため、視線変移により発生する映像酔いを防止することができない。 However, since the technique disclosed in Patent Document 1 is a technique for preventing video sickness based on the viewer's gaze shift amount, it is implemented in a stereoscopic image display apparatus that does not have a means for measuring the viewer's gaze shift amount. Can not do it. For the same reason, the technique disclosed in Patent Document 1 cannot be applied to the production of a stereoscopic image in which the occurrence of video sickness is suppressed. Furthermore, the technique disclosed in Patent Document 1 is difficult to deal with when there are a plurality of viewers because the amount of line-of-sight shift may differ for each viewer. In addition, since the technique disclosed in Patent Document 2 does not consider line-of-sight shift, video sickness caused by line-of-sight shift cannot be prevented.

そこで、本発明の目的は、映像酔いを防止した立体画像データを生成することにある。 Accordingly, an object of the present invention is to generate stereoscopic image data in which video sickness is prevented.

本発明の立体画像生成装置は、画像データを入力する入力手段と、前記入力手段により入力された前記画像データに対し、最大の飛び出し量を持つ画素が存在する可能性がある領域を設定する設定手段と、前記設定手段により設定された前記領域内における画素の最大の飛び出し量を算出する算出手段と、前記算出手段により算出された最大の飛び出し量に基づいて、立体画像データにおける画素の最大の飛び出し量を決定する決定手段と、前記決定手段により決定された最大の飛び出し量に基づいて、前記画像データから前記立体画像データを生成する生成手段とを有することを特徴とする。 The stereoscopic image generating apparatus of the present invention is configured to set an area for which a pixel having a maximum pop-out amount may exist for the image data input by the input means and the input means for inputting image data. Means for calculating the maximum pop-out amount of the pixel in the region set by the setting means, and based on the maximum pop-out amount calculated by the calculation means, the maximum pixel amount in the stereoscopic image data It has a determining means for determining the pop-out amount and a generating means for generating the stereoscopic image data from the image data based on the maximum pop-out amount determined by the determining means.

本発明によれば、映像酔いを防止した立体画像データを生成することが可能となる。 According to the present invention, it is possible to generate stereoscopic image data in which video sickness is prevented.

本発明の第１の実施形態に係る立体画像生成装置の構成を示す図である。It is a figure which shows the structure of the stereo image production | generation apparatus which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態における最大飛び出し画素存在可能領域設定部の詳細な構成を示す図である。It is a figure which shows the detailed structure of the largest protrusion pixel presence possible area | region setting part in the 1st Embodiment of this invention. 移動先座標予測部の詳細な構成を示す図である。It is a figure which shows the detailed structure of a movement destination coordinate estimation part. 最大飛び出し画素存在可能領域選択部の処理を示すフローチャートである。It is a flowchart which shows the process of the largest protrusion pixel presence possible area | region selection part. 立体画像生成部による立体画像データの視差量の決定処理を示すフローチャートである。It is a flowchart which shows the determination process of the parallax amount of the stereo image data by a stereo image production | generation part. 比較例における立体画像データと最大飛び出し画素の位置との一例を示す図である。It is a figure which shows an example of the stereo image data in a comparative example, and the position of the largest pop-out pixel. 比較例における立体画像データと最大飛び出し画素の位置との一例を示す図である。It is a figure which shows an example of the stereo image data in a comparative example, and the position of the largest pop-out pixel. 本発明の実施形態に係る立体画像生成装置により生成される立体画像データと最大飛び出し画素の位置との一例を示す図である。It is a figure which shows an example of the stereo image data produced | generated by the stereo image production | generation apparatus which concerns on embodiment of this invention, and the position of the largest pop-out pixel. 本実施形態に係る立体画像生成装置により生成される立体画像データと最大飛び出し画素の位置との一例を示す図である。It is a figure which shows an example of the stereo image data produced | generated by the stereo image production | generation apparatus which concerns on this embodiment, and the position of the largest pop-out pixel. 時刻ｎから時刻ｎ＋６までの、最大飛び出し画素の位置の変移について説明するための図である。It is a figure for demonstrating the transition of the position of the largest protrusion pixel from the time n to the time n + 6. 本発明の第２の実施形態に係る立体画像生成装置の構成を示す図である。It is a figure which shows the structure of the stereo image production | generation apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態における最大飛び出し画素存在可能領域設定部の詳細な構成を示す図である。It is a figure which shows the detailed structure of the largest protruding pixel presence possible area | region setting part in the 2nd Embodiment of this invention.

以下、本発明を適用した好適な実施形態を、添付図面を参照しながら詳細に説明する。但し、本発明の要旨を逸脱しない範囲で適宜構成を変更することが可能である。従って、本発明の適用範囲は、以下に説明する実施形態に限定されない。なお、以下の説明においては、同一の機能を有する構成には、原則として同一の符号を付し、重複した説明は省略するものとする。 DESCRIPTION OF EXEMPLARY EMBODIMENTS Hereinafter, preferred embodiments to which the invention is applied will be described in detail with reference to the accompanying drawings. However, the configuration can be changed as appropriate without departing from the gist of the present invention. Therefore, the scope of application of the present invention is not limited to the embodiments described below. In the following description, components having the same functions are denoted by the same reference numerals in principle, and redundant descriptions are omitted.

本発明の実施形態の説明に先立ち、立体画像の結像位置の関係について説明する。立体画像を眼球で見ると、両眼の視差や輻輳により脳内で画像の融合が起こり、あたかも表示面から像が飛び出しているように視聴者は認識する。このとき、飛び出し量は、視聴者に近い対象ほど大きく、視聴者から遠い対象ほど小さい。 Prior to the description of the embodiment of the present invention, the relationship between the imaging positions of a stereoscopic image will be described. When a stereoscopic image is viewed with an eyeball, the viewer recognizes that the images are fused in the brain due to parallax and convergence of both eyes, and the image is projected from the display surface. At this time, the pop-out amount is larger as the object is closer to the viewer and smaller as the object is farther from the viewer.

図１は、本発明の第１の実施形態に係る立体画像生成装置の構成を示す図である。図１に示すように、本実施形態に係る立体画像生成装置は、最大飛び出し画素存在可能領域設定部１１、最大飛び出し量算出部１２、及び、立体画像生成部１３を備える。なお、最大飛び出し画素存在可能領域設定部１１、最大飛び出し量算出部１２及び立体画像生成部１３は、立体画像生成装置内の不図示のＣＰＵが、ＲＯＭ等の記録媒体から必要なデータ及びプログラムを読み出して実行することにより実現する機能構成である。他の実施形態として、最大飛び出し画素存在可能領域設定部１１、最大飛び出し量算出部１２及び立体画像生成部１３のうちの少なくとも一部をハードウェアで実装してもよい。 FIG. 1 is a diagram illustrating a configuration of a stereoscopic image generation apparatus according to the first embodiment of the present invention. As shown in FIG. 1, the stereoscopic image generation apparatus according to the present embodiment includes a maximum pop-out pixel presence area setting unit 11, a maximum pop-out amount calculation unit 12, and a three-dimensional image generation unit 13. Note that the maximum pop-out pixel existence area setting unit 11, the maximum pop-out amount calculation unit 12, and the stereoscopic image generation unit 13 are such that a CPU (not shown) in the stereoscopic image generation apparatus downloads necessary data and programs from a recording medium such as ROM. This is a functional configuration realized by reading and executing. As another embodiment, at least a part of the maximum pop-out pixel presence area setting unit 11, the maximum pop-out amount calculation unit 12, and the stereoscopic image generation unit 13 may be implemented by hardware.

最大飛び出し画素存在可能領域設定部１１は、次時刻において最大の飛び出し量を持つ画素が存在する可能性がある領域を示す座標値１０２を算出して出力する。以下では、「最大の飛び出し量を持つ画素」を「最大飛び出し画素」と称し、また、「最大の飛び出し量を持つ画素が存在する可能性がある領域」を「最大飛び出し画素存在可能領域」と称する。 The maximum pop-out pixel existence area setting unit 11 calculates and outputs a coordinate value 102 indicating an area where a pixel having the maximum pop-out amount may exist at the next time. In the following, “pixels with the largest pop-out amount” will be referred to as “maximum pop-out pixels”, and “regions where pixels with the largest pop-out amount may exist” are referred to as “maximum pop-out pixel existence regions”. Called.

図２は、最大飛び出し画素存在可能領域設定部１１の詳細な構成を示す図である。図２に示すように、最大飛び出し画素存在可能領域設定部１１は、移動先座標検出部２１、移動先座標予測部２２、シーンチェンジ判定部２３、最大飛び出し画素存在可能領域選択部２４、及び、遅延部２５を備える。 FIG. 2 is a diagram illustrating a detailed configuration of the maximum pop-out pixel existence area setting unit 11. As shown in FIG. 2, the maximum pop-out pixel existence region setting unit 11 includes a movement destination coordinate detection unit 21, a movement destination coordinate prediction unit 22, a scene change determination unit 23, a maximum pop-out pixel existence region selection unit 24, and A delay unit 25 is provided.

移動先座標検出部２１は、現時刻の最大飛び出し画素が次時刻で存在する座標値、即ち、移動先座標を検出する。本実施形態では、現時刻の画像データと次時刻の画像データとの相関関係から画素の移動量や移動先座標を検出する、画像相関法を適用するが、これに限定されるものではない。なお、上記現時刻は、第１の時点の例であり、上記次時刻は、第２の時点の例である。また、上記現時刻の画像データは、第１の画像データの例であり、上記次時刻の画像データは、第２の画像データの例である。 The destination coordinate detection unit 21 detects the coordinate value at which the maximum pop-out pixel at the current time exists at the next time, that is, the destination coordinate. In the present embodiment, an image correlation method is used in which the pixel movement amount and the movement destination coordinates are detected from the correlation between the image data at the current time and the image data at the next time. However, the present invention is not limited to this. The current time is an example of the first time point, and the next time is an example of the second time point. The image data at the current time is an example of first image data, and the image data at the next time is an example of second image data.

遅延部２５は、次時刻の入力画像データ１０１と時刻を一致させるために、現時刻の入力画像データ１０１を一単位時間遅延させる。移動先座標検出部２１は、遅延部２５により遅延された現時刻の入力画像データ１１１と次時刻の入力画像データ１０１との相関関係をとり、移動先座標１１２と移動先座標の存在有無１１３とを検出する。ここで、最大飛び出し画素存在可能領域の座標値１０２を算出するには、現時刻の最大飛び出し画素の移動先座標とその存在有無のみが必要であり、それ以外の画素については不要である。従って、移動先座標検出部２１は、現時刻の最大飛び出し画素の座標値１０３が示す画素に対してのみ、移動先座標１１２と移動先座標の存在有無１１３とを検出する。移動先座標の存在有無１１３は、移動先座標１１２が存在する場合、ＴＲＵＥの値をとり、一方、移動先座標１１２が存在しない場合、ＦＡＬＳＥの値をとる。 The delay unit 25 delays the input image data 101 at the current time by one unit time in order to match the time with the input image data 101 at the next time. The destination coordinate detection unit 21 correlates the input image data 111 at the current time delayed by the delay unit 25 with the input image data 101 at the next time, and the destination coordinate 112 and the presence / absence 113 of the destination coordinate exist. Is detected. Here, in order to calculate the coordinate value 102 of the maximum pop-out pixel existence possible area, only the movement destination coordinate of the maximum pop-out pixel at the current time and its presence / absence are required, and other pixels are unnecessary. Accordingly, the movement destination coordinate detection unit 21 detects the movement destination coordinates 112 and the presence / absence of the movement destination coordinates 113 only for the pixel indicated by the coordinate value 103 of the maximum protruding pixel at the current time. The presence / absence 113 of the movement destination coordinate takes a value of TRUE when the movement destination coordinate 112 exists, and takes a value of FALSE when the movement destination coordinate 112 does not exist.

移動先座標予測部２２は、現時刻の最大飛び出し画素が次時刻で存在するであろう移動先座標を予測し、予測座標値１１４として出力する。本実施形態では、動きベクトルを用いて移動先座標を予測するが、これに限定されるものではない。 The destination coordinate prediction unit 22 predicts a destination coordinate where the maximum pop-out pixel at the current time will exist at the next time, and outputs it as a predicted coordinate value 114. In the present embodiment, the movement destination coordinates are predicted using the motion vector, but the present invention is not limited to this.

図３は、移動先座標予測部２２の詳細な構成を示す図である。図３に示すように、移動先座標予測部２２は、動きベクトル算出部２６、移動先座標演算部２８及び遅延部２５を備える。 FIG. 3 is a diagram illustrating a detailed configuration of the destination coordinate prediction unit 22. As illustrated in FIG. 3, the movement destination coordinate prediction unit 22 includes a motion vector calculation unit 26, a movement destination coordinate calculation unit 28, and a delay unit 25.

動きベクトル算出部２６は、次時刻の入力画像データ１０１と、遅延後の現時刻の入力画像データ１１１とから動きベクトル１１６を算出する。遅延部２５は、動きベクトル１１６の時刻を、現時刻の最大飛び出し画素の座標値１０３に合わせて、遅延後の動きベクトル１１６を生成する。移動先座標演算部２８は、遅延後の動きベクトル１１７と現時刻の最大飛び出し画素の座標値１０３とをベクトル演算して、予測座標値１１４を算出する。なお、予測座標値１１４の算出方法は、上述した動きベクトルを利用した方法に限定されない。 The motion vector calculation unit 26 calculates a motion vector 116 from the input image data 101 at the next time and the input image data 111 at the current time after the delay. The delay unit 25 generates the delayed motion vector 116 by matching the time of the motion vector 116 with the coordinate value 103 of the maximum pop-out pixel at the current time. The destination coordinate calculation unit 28 calculates a predicted coordinate value 114 by performing a vector calculation on the delayed motion vector 117 and the coordinate value 103 of the maximum pop-out pixel at the current time. Note that the method of calculating the predicted coordinate value 114 is not limited to the method using the motion vector described above.

シーンチェンジ判定部２３は、現時刻と次時刻との間でシーンチェンジの有無を判定し、シーンチェンジフラグ１１５を出力する。現時刻と次時刻との間でシーンチェンジが有る場合、シーンチェンジフラグ１１５はＴＲＵＥの値をとり、一方、シーンチェンジが無い場合、シーンチェンジフラグ１１５はＦＡＬＳＥの値をとる。なお、シーンチェンジの判定手法としては、例えば、次の文献により開示される動きベクトルを用いた手法を適用することが可能である。
＜文献＞
曽根原源等：動きベクトルを用いたシーン・チェンジ検出、映像情報メディア学会年次大会講演予稿集、１９９７−０７−２９、ｐｐ３０４（１９９７） The scene change determination unit 23 determines the presence / absence of a scene change between the current time and the next time, and outputs a scene change flag 115. When there is a scene change between the current time and the next time, the scene change flag 115 takes a value of TRUE, whereas when there is no scene change, the scene change flag 115 takes a value of FALSE. As a scene change determination method, for example, a method using a motion vector disclosed in the following document can be applied.
<Reference>
Sonehara et al .: Scene change detection using motion vectors, Proceedings of the Annual Conference of the Institute of Image Information and Television Engineers, 1997-07-29, pp 304 (1997)

最大飛び出し画素存在可能領域選択部２４は、移動先座標１１２、移動先座標の存在有無１１３及びシーンチェンジフラグ１１５に基づいて、以下の方針で、最大飛び出し画素存在可能領域の座標値１０２を生成する。ここで、Ｐｘを予測座標値１１４のＸ成分とし、Ｐｙを予測座標値１１４のＹ成分とする。また、表示面の水平方向の大きさをＤｘとし、垂直方向の大きさをＤｙとし、表示面の左上の画素を原点（０，０）とする。従って、画素の座標（Ｉｘ，Ｉｙ）がとる範囲は、｛Ｉｘ：０＜＝Ｉｘ＜＝Ｄｘ−１｝、｛Ｉｙ：０＜＝Ｉｙ＜＝Ｄｙ−１｝になる。 The maximum pop-out pixel existence area selection unit 24 generates the coordinate value 102 of the maximum pop-out pixel existence area based on the movement destination coordinates 112, the presence / absence of movement destination coordinates 113, and the scene change flag 115 according to the following policy. . Here, Px is the X component of the predicted coordinate value 114, and Py is the Y component of the predicted coordinate value 114. In addition, the horizontal size of the display surface is Dx, the vertical size is Dy, and the upper left pixel of the display surface is the origin (0, 0). Accordingly, the range of the pixel coordinates (Ix, Iy) is {Ix: 0 <= Ix <= Dx-1} and {Iy: 0 <= Iy <= Dy-1}.

・方針１
現時刻の最大飛び出し画素が次時刻で存在する場合（移動先座標の存在有無１１３＝＝ＴＲＵＥ）、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域の中心が移動先座標１１２となるように、最大飛び出し画素存在可能領域の座標値１０２を算出する。・ Policy 1
When the maximum pop-out pixel at the current time exists at the next time (presence / absence of movement destination coordinates 113 == TRUE), the maximum pop-out pixel existence area selection unit 24 determines that the center of the maximum protrusion pixel existence area is the movement destination coordinates 112 Then, the coordinate value 102 of the maximum pop-out pixel existence possible area is calculated.

・方針２
現時刻の最大飛び出し画素が次時刻で存在せず、且つ、シーンチェンジが有る場合（移動先座標の存在有無１１３＝＝ＴＲＵＥＡＮＤシーンチェンジフラグ１１５＝＝ＴＲＵＥ）、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域を表示画面全体に決定する。・ Policy 2
When the maximum pop-out pixel at the current time does not exist at the next time and there is a scene change (presence / absence of movement destination coordinates 113 == TRUE AND scene change flag 115 == TRUE), the maximum pop-out pixel existence area selection unit In step 24, the maximum pop-out pixel existence region is determined for the entire display screen.

・方針３
現時刻の最大飛び出し画素が次時刻で存在せず、シーンチェンジが無く、且つ、予測座標値１１４が表示画面内である場合｛移動先座標の存在有無１１３＝＝ＦＡＬＳＥＡＮＤシーンチェンジフラグ１１５＝＝ＦＡＬＳＥＡＮＤ（０＜＝Ｐｘ＜＝Ｄｘ−１）ＡＮＤ（０＜＝Ｐｙ＜＝Ｄｙ−１）｝、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域の中心が予測座標値１１４となるように、最大飛び出し画素存在可能領域の座標値１０２を算出する。・ Policy 3
When the maximum pop-out pixel at the current time does not exist at the next time, there is no scene change, and the predicted coordinate value 114 is within the display screen {presence / absence of movement destination coordinate 113 == FALSE AND scene change flag 115 == FALSE AND (0 <= Px <= Dx-1) AND (0 <= Py <= Dy-1)}, the maximum pop-out pixel existence area selection unit 24 has a predicted coordinate value at the center of the maximum pop-out pixel existence area. The coordinate value 102 of the maximum pop-out pixel existence possible area is calculated so as to be 114.

・方針４
現時刻の最大飛び出し画素が次時刻で存在せず、シーンチェンジが無く、且つ、予測座標値１１４が表示画面外である場合｛移動先座標の存在有無１１３＝＝ＦＡＬＳＥＡＮＤシーンチェンジフラグ１１５＝＝ＦＡＬＳＥＡＮＤ（０＞ＰｘＯＲＤｘ＜Ｐｘ）ＯＲ（０＞ＰｙＯＲＤｙ＜Ｐｙ）｝、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域の中心が現時刻の最大飛び出し画素の座標値１０３となるように、最大飛び出し画素存在可能領域の座標値１０２を算出する。・ Policy 4
When the maximum pop-out pixel at the current time does not exist at the next time, there is no scene change, and the predicted coordinate value 114 is outside the display screen {presence / absence of destination coordinate 113 == FALSE AND scene change flag 115 == FALSE AND (0> Px OR Dx <Px) OR (0> Py OR Dy <Py)}, the maximum pop-out pixel existence area selection unit 24 sets the center of the maximum pop-out pixel existence area to the maximum pop-out pixel at the current time. The coordinate value 102 of the maximum pop-out pixel existing area is calculated so that the coordinate value 103 is obtained.

現時刻の最大飛び出し画素が次時刻で存在せず、且つ、シーンチェンジが有る場合を除き、最大飛び出し画素存在可能領域の形状や大きさは、表示画面の大きさや特性に依存して決定される。例えば、視野角を用いて、最大飛び出し画素存在可能領域の形状や大きさを決定してもよい。 Except when the maximum pop-out pixel at the current time does not exist at the next time and there is a scene change, the shape and size of the maximum pop-out pixel existence area are determined depending on the size and characteristics of the display screen. . For example, the shape and size of the maximum pop-out pixel existence area may be determined using the viewing angle.

図４は、最大飛び出し画素存在可能領域選択部２４の処理を示すフローチャートである。ステップＳ１０１において、最大飛び出し画素存在可能領域選択部２４は、現時刻の最大飛び出し画素が次時刻に存在するか否かを判定する。最大飛び出し画素が次時刻に存在する場合、処理はステップＳ１０２に移行する。一方、最大飛び出し画素が次時刻に存在しない場合、処理はステップＳ１０３に移行する。 FIG. 4 is a flowchart showing processing of the maximum pop-out pixel existence possible area selection unit 24. In step S <b> 101, the maximum pop-out pixel existence area selection unit 24 determines whether or not the maximum pop-out pixel at the current time exists at the next time. If the maximum pop-out pixel exists at the next time, the process proceeds to step S102. On the other hand, if the maximum pop-out pixel does not exist at the next time, the process proceeds to step S103.

ステップＳ１０２において、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域の中心が移動先座標１１２となるように、最大飛び出し画素存在可能領域の座標値１０２を算出する（方針１）。ステップＳ１０３において、最大飛び出し画素存在可能領域選択部２４は、シーンチェンジが有るか否かを判定する。シーンチェンジが有る場合、処理はステップＳ１０４に移行する。一方、シーンチェンジが無い場合、処理はステップＳ１０５に移行する。 In step S102, the maximum pop-out pixel existence region selection unit 24 calculates the coordinate value 102 of the maximum pop-out pixel existence region so that the center of the maximum pop-out pixel existence region becomes the movement destination coordinate 112 (policy 1). . In step S103, the maximum pop-out pixel existence possible area selection unit 24 determines whether or not there is a scene change. If there is a scene change, the process proceeds to step S104. On the other hand, if there is no scene change, the process proceeds to step S105.

ステップＳ１０４において、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域を表示画面全体に決定する（方針２）。ステップＳ１０５において、最大飛び出し画素存在可能領域選択部２４は、予測座標値１１４が表示画面内であるか否かを判定する。予測座標値１１４が表示画面内である場合、処理はステップＳ１０６に移行する。一方、予測座標値１１４が表示画面内でない場合、処理はステップＳ１０７に移行する。 In step S104, the maximum pop-out pixel existence area selection unit 24 determines the maximum pop-out pixel existence area on the entire display screen (policy 2). In step S105, the maximum pop-out pixel existence possible area selection unit 24 determines whether or not the predicted coordinate value 114 is within the display screen. If the predicted coordinate value 114 is within the display screen, the process proceeds to step S106. On the other hand, if the predicted coordinate value 114 is not within the display screen, the process proceeds to step S107.

ステップＳ１０６において、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域の中心が予測座標値１１４となるように、最大飛び出し画素存在可能領域の座標値１０２を算出する（方針３）。ステップＳ１０７において、最大飛び出し画素存在可能領域選択部２４は、最大飛び出し画素存在可能領域の中心が現時刻の最大飛び出し画素の座標値１０３となるように、最大飛び出し画素存在可能領域の座標値１０２を算出する（方針４）。 In step S106, the maximum pop-out pixel existence area selection unit 24 calculates the coordinate value 102 of the maximum pop-out pixel existence area so that the center of the maximum pop-out pixel existence area becomes the predicted coordinate value 114 (policy 3). . In step S107, the maximum pop-out pixel existence area selection unit 24 sets the coordinate value 102 of the maximum pop-out pixel existence area so that the center of the maximum pop-out pixel existence area becomes the coordinate value 103 of the maximum pop-out pixel at the current time. Calculate (policy 4).

次に、最大飛び出し量算出部１２の処理について説明する。最大飛び出し量算出部１２は、入力画像データ１０１に対して最大飛び出し画素存在可能領域の範囲内で、最大飛び出し画素の座標値１０３と、最大飛び出し画素の飛び出し量１０４とを算出する。飛び出し量は視差に比例するため、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域の範囲内で最も大きな視差を持つ画素の座標値を最大飛び出し画素の座標値１０３とし、その視差量を最大飛び出し画素の飛び出し量１０４として決定する。最大飛び出し画素存在可能領域の範囲内で最も大きな視差を持つ画素が複数存在する場合、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域の中心に最も近い画素の座標値を最大飛び出し画素の座標値１０３として決定する。また、最大飛び出し量算出部１２は、その視差量を最大飛び出し画素の飛び出し量１０４として決定する。さらに、最も大きな視差を持つ画素が、最大飛び出し画素存在可能領域の中心から最も近い距離に複数存在する場合、最大飛び出し量算出部１２は、該当する画素の中から任意に選択した画素の座標値を最大飛び出し画素の座標値１０３として決定する。そして、最大飛び出し量算出部１２は、その視差量を最大飛び出し画素の飛び出し量１０４として決定する。 Next, the processing of the maximum pop-out amount calculation unit 12 will be described. The maximum pop-out amount calculation unit 12 calculates the maximum pop-out pixel coordinate value 103 and the maximum pop-out pixel pop-out amount 104 within the range of the maximum pop-out pixel existence area with respect to the input image data 101. Since the pop-out amount is proportional to the parallax, the maximum pop-out amount calculation unit 12 sets the coordinate value of the pixel having the largest parallax within the range of the maximum pop-out pixel existence area as the coordinate value 103 of the maximum pop-out pixel, and sets the parallax amount. The maximum pop-out pixel pop-out amount 104 is determined. When there are a plurality of pixels having the largest parallax within the range of the maximum pop-out pixel existence area, the maximum pop-out amount calculation unit 12 determines the coordinate value of the pixel closest to the center of the maximum pop-out pixel existence area as the maximum pop-out pixel existence area. The coordinate value 103 is determined. The maximum pop-out amount calculation unit 12 determines the parallax amount as the pop-out amount 104 of the maximum pop-out pixel. Further, when there are a plurality of pixels having the largest parallax at the closest distance from the center of the maximum pop-out pixel existence area, the maximum pop-out amount calculation unit 12 selects the coordinate value of the pixel arbitrarily selected from the corresponding pixels. Is determined as the coordinate value 103 of the maximum pop-out pixel. Then, the maximum pop-out amount calculation unit 12 determines the parallax amount as the pop-out amount 104 of the maximum pop-out pixel.

次に、立体画像生成部１３の処理について説明する。立体画像生成部１３は、入力画像データ１０１から立体画像データ１０５を生成する。立体画像データ１０５の生成にあたり、立体画像データ１０５の飛び出し量、即ち、立体画像データ１０５の視差量は、最大飛び出し画素の飛び出し量１０４を超えないように決定される。 Next, processing of the stereoscopic image generation unit 13 will be described. The stereoscopic image generation unit 13 generates stereoscopic image data 105 from the input image data 101. In generating the stereoscopic image data 105, the pop-out amount of the stereo image data 105, that is, the parallax amount of the stereo image data 105 is determined so as not to exceed the pop-out amount 104 of the maximum pop-out pixel.

図５は、立体画像生成部１３による立体画像データ１０５の視差量の決定処理を示すフローチャートである。ステップＳ２０１において、立体画像生成部１３は、入力画像データ１０１の或る画素が有する視差量、又は、当該入力画像データ１０１に付加される視差量が、最大飛び出し画素の飛び出し量１０４を超過しているか否かを判定する。最大飛び出し画素の飛び出し量１０４を超過している場合、処理はステップＳ２０２に移行する。一方、最大飛び出し画素の飛び出し量１０４を超過していない場合、即ち、入力画像データ１０１の或る画素が有する視差量、又は、当該入力画像データ１０１に付加される視差量が、最大飛び出し画素の飛び出し量１０４以下である場合、処理はステップＳ２０３に移行する。 FIG. 5 is a flowchart illustrating the determination process of the parallax amount of the stereoscopic image data 105 by the stereoscopic image generation unit 13. In step S <b> 201, the stereoscopic image generation unit 13 causes the amount of parallax included in a certain pixel of the input image data 101 or the amount of parallax added to the input image data 101 to exceed the pop-out amount 104 of the maximum pop-out pixel. It is determined whether or not. If the maximum pop-out pixel pop-out amount 104 is exceeded, the process proceeds to step S202. On the other hand, when the maximum pop-out pixel pop-out amount 104 is not exceeded, that is, the amount of parallax that a certain pixel of the input image data 101 has or the amount of parallax added to the input image data 101 is the maximum pop-out pixel. If the pop-out amount is 104 or less, the process proceeds to step S203.

ステップＳ２０２において、立体画像生成部１３は、立体画像データ１０５の視差量を、最大飛び出し画素の飛び出し量１０４に抑止して決定する。ステップＳ２０３において、立体画像生成部１３は、立体画像データ１０５の視差量を、入力画像データ１０１の画素が有する視差量、又は、当該入力画像データ１０１に付加される視差量に決定する。 In step S <b> 202, the stereoscopic image generation unit 13 determines the parallax amount of the stereoscopic image data 105 while suppressing the amount of protrusion 104 of the maximum protruding pixel. In step S <b> 203, the stereoscopic image generation unit 13 determines the amount of parallax of the stereoscopic image data 105 as the amount of parallax included in the pixels of the input image data 101 or the amount of parallax added to the input image data 101.

図６及び図７は、比較例における立体画像データと最大飛び出し画素の位置との一例を示す図である。これに対し、図８及び図９は、本実施形態に係る立体画像生成装置により生成される立体画像データと最大飛び出し画素の位置との一例を示す図である。以下、図６乃至図９を用いて、比較例と本実施形態とにおいて与えられる最大飛び出し画素の位置の違いについて説明する。 6 and 7 are diagrams illustrating an example of the stereoscopic image data and the position of the maximum protruding pixel in the comparative example. 8 and 9 are diagrams showing an example of the stereoscopic image data generated by the stereoscopic image generating apparatus according to the present embodiment and the position of the maximum pop-out pixel. Hereinafter, the difference in the position of the maximum pop-out pixel given in the comparative example and the present embodiment will be described with reference to FIGS.

先ず、図６及び図７を参照しながら、比較例における立体画像データと最大の飛び出し画素の位置との関係について説明する。図６は、時刻ｎから時刻ｎ＋３までの状態を示し、図７は、時刻ｎ＋３から時刻ｎ＋６までの状態を示している。 First, the relationship between the stereoscopic image data and the position of the largest pop-out pixel in the comparative example will be described with reference to FIGS. FIG. 6 shows a state from time n to time n + 3, and FIG. 7 shows a state from time n + 3 to time n + 6.

（時刻ｎ）
図６（ａ）は、時刻ｎの立体画像データを示している。図６（ａ）に示す時刻ｎの立体画像データには、人物Ａ４１のみが映っている。時刻ｎの立体画像データにおいて、最も手前に存在するのは、人物Ａ４１の鼻であるから、図６（ｅ）に示すように、最大飛び出し画素１２１は、人物Ａ４１の鼻に該当する。 (Time n)
FIG. 6A shows stereoscopic image data at time n. Only the person A41 is shown in the stereoscopic image data at time n shown in FIG. In the stereoscopic image data at time n, the nose of the person A41 is present in the foreground, so that the maximum pop-out pixel 121 corresponds to the nose of the person A41 as shown in FIG.

（時刻ｎ＋１）
図６（ｂ）は、時刻ｎ＋１の立体画像データを示している。図６（ｂ）に示す時刻ｎ＋１の立体画像データでは、人物Ａ４１の手前に人物Ｂ４２が出現している。時刻ｎ＋１の立体画像データにおいて、最も手前に存在するのは、人物Ｂ４２の鼻であるから、図６（ｆ）に示すように、最大飛び出し画素１２１は、人物Ｂ４２の鼻に該当する。 (Time n + 1)
FIG. 6B shows stereoscopic image data at time n + 1. In the stereoscopic image data at time n + 1 shown in FIG. 6B, a person B42 appears in front of the person A41. In the stereoscopic image data at the time n + 1, since the nose of the person B42 is present in the foreground, the maximum pop-out pixel 121 corresponds to the nose of the person B42 as shown in FIG.

（時刻ｎ＋２）
図６（ｃ）は、時刻ｎ＋２の立体画像データを示している。図６（ｃ）に示すように、時刻ｎ＋２の立体画像データでは、人物Ｂ４２が表示画面から消え、人物Ａ４１が手前に出ている。時刻ｎ＋２の立体画像データにおいて、最も手前に存在するのは、人物Ａ４１の鼻であるから、図６（ｇ）に示すように、最大飛び出し画素１２１は、人物Ａ４１の鼻に該当する。但し、人物Ａ４１が手前に移動しているため、最大飛び出し画素１２１の位置は、時刻ｎとは異なる。 (Time n + 2)
FIG. 6C shows stereoscopic image data at time n + 2. As shown in FIG. 6C, in the stereoscopic image data at time n + 2, the person B42 disappears from the display screen, and the person A41 comes out to the front. In the stereoscopic image data at the time n + 2, the nose of the person A41 is present in the foreground, and therefore the maximum pop-out pixel 121 corresponds to the nose of the person A41 as shown in FIG. However, since the person A41 has moved forward, the position of the maximum pop-out pixel 121 is different from the time n.

（時刻ｎ＋３）
図６（ｄ）及び図７（ａ）は、時刻ｎ＋３の立体画像データを示している。図６（ｄ）及び図７（ａ）に示す時刻ｎ＋３の立体画像データでは、ボールＡ４３が表示画面の右側で且つ人物Ａ４１の手前に出現する。時刻ｎ＋３の立体画像データにおいて、最も手前に存在するのは、ボールＡ４３であるから、図６（ｈ）及び図７（ｅ）に示すように、最大飛び出し画素１２１は、ボールＡ４３に該当する。 (Time n + 3)
FIG. 6D and FIG. 7A show the stereoscopic image data at time n + 3. In the stereoscopic image data at time n + 3 shown in FIGS. 6D and 7A, the ball A43 appears on the right side of the display screen and in front of the person A41. In the three-dimensional image data at time n + 3, the ball A43 is closest to the front, so that the maximum protruding pixel 121 corresponds to the ball A43, as shown in FIGS. 6 (h) and 7 (e).

（時刻ｎ＋４）
図７（ｂ）は、時刻ｎ＋４の立体画像データを示している。図７（ｂ）に示す時刻ｎ＋４の立体画像データでは、ボールＡ４３が人物Ａ４１の表示画面の前面に映り、ボールＢ４４が表示画面の右側に映っている。ボールＢ４４はボールＡ４３より手前に存在し、最も手前に存在するのはボールＢ４４であるから、図７（ｆ）に示すように、最大飛び出し画素１２１は、ボールＢ４４に該当する。 (Time n + 4)
FIG. 7B shows stereoscopic image data at time n + 4. In the stereoscopic image data at time n + 4 shown in FIG. 7B, the ball A43 is reflected in front of the display screen of the person A41, and the ball B44 is reflected on the right side of the display screen. Since the ball B44 exists in front of the ball A43, and the ball B44 exists in the foremost side, the maximum protruding pixel 121 corresponds to the ball B44 as shown in FIG.

（時刻ｎ＋５）
図７（ｃ）は、時刻ｎ＋５の立体画像データを示している。図７（ｃ）に示す時刻ｎ＋５の立体画像データでは、ボールＡ４３が表示画面から消え、人物Ａ４１とボールＢ４４とが映っている。ボールＢ４４は時刻ｎ＋４より更に手前に存在し、最も手前に存在するのはボールＢ４４であるから、図７（ｇ）に示すように、最大飛び出し画素１２１は、ボールＢ４４に該当する。但し、ボールＢ４４が手前に移動しているため、最大飛び出し画素１２１の位置は、時刻ｎ＋４とは異なる。 (Time n + 5)
FIG. 7C shows stereoscopic image data at time n + 5. In the stereoscopic image data at time n + 5 shown in FIG. 7C, the ball A43 disappears from the display screen, and the person A41 and the ball B44 are shown. Since the ball B44 is present before the time n + 4 and the ball B44 is present at the foremost position, the maximum protruding pixel 121 corresponds to the ball B44 as shown in FIG. However, since the ball B44 has moved forward, the position of the maximum pop-out pixel 121 is different from the time n + 4.

（時刻ｎ＋６）
図７（ｄ）は、時刻ｎ＋６の立体画像データを示している。図７（ｃ）及び図７（ｄ）に示すように、時刻ｎ＋５と時刻ｎ＋６との間にはシーンチェンジが存在し、時刻ｎ＋５と時刻ｎ＋６とは異なる映像になる。時刻ｎ＋６の立体画像データにおいて、最も手前に存在するのは自動車４５であるから、図７（ｈ）に示すように、最大飛び出し画素１２１は、自動車の左前方に該当する。 (Time n + 6)
FIG. 7D shows stereoscopic image data at time n + 6. As shown in FIGS. 7C and 7D, there is a scene change between time n + 5 and time n + 6, and the images at time n + 5 and time n + 6 are different. In the stereoscopic image data at time n + 6, since the automobile 45 is present in the foreground, the maximum pop-out pixel 121 corresponds to the left front of the automobile as shown in FIG.

次に、図８及び図９を参照しながら、本実施形態における立体画像データと最大の飛び出し画素の位置との関係について説明する。図８は、時刻ｎから時刻ｎ＋３までの状態を示し、図９は、時刻ｎ＋３から時刻ｎ＋６までの状態を示している。また、本実施形態では、最大飛び出し画素存在可能領域５１の形状を楕円としている。勿論、最大飛び出し画素存在可能領域５１の形状は、楕円に限定されるものではない。 Next, the relationship between the stereoscopic image data and the position of the largest pop-out pixel in this embodiment will be described with reference to FIGS. FIG. 8 shows a state from time n to time n + 3, and FIG. 9 shows a state from time n + 3 to time n + 6. In the present embodiment, the shape of the maximum pop-out pixel existence possible area 51 is an ellipse. Of course, the shape of the maximum protruding pixel existence region 51 is not limited to an ellipse.

（時刻ｎ）
図８（ａ）は、時刻ｎの立体画像データを示している。図８（ａ）に示す時刻ｎの立体画像データには、人物Ａ４１のみが映っている。時刻ｎの立体画像データにおいて、最も手前に存在するのは、人物Ａ４１の鼻であるから、図８（ｅ）に示すように、最大飛び出し画素１２１は、人物Ａ４１の鼻に該当する。 (Time n)
FIG. 8A shows stereoscopic image data at time n. Only the person A41 is shown in the stereoscopic image data at time n shown in FIG. In the three-dimensional image data at time n, since the nose of the person A41 is present in the foreground, the maximum pop-out pixel 121 corresponds to the nose of the person A41 as shown in FIG.

（時刻ｎ＋１）
図８（ｂ）は、時刻ｎ＋１の立体画像データを示している。図８（ｂ）に示す時刻ｎ＋１の立体画像データでは、人物Ａ４１の手前に人物Ｂ４２が出現している。図８（ｅ）に示す最大飛び出し画素１２１である人物Ａ４１の鼻は、時刻ｎ＋１の立体画像データにおいても存在している。従って、移動先座標の存在有無１１３の値は、ＴＲＵＥとなる。よって、最大飛び出し画素存在可能領域設定部１１は、方針１に従って、最大飛び出し画素の移動先座標１１２を中心に、最大飛び出し画素存在可能領域５１を設定する。そして、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域５１内で最大の飛び出し量を持つ画素を、最大飛び出し画素１２１に決定する。 (Time n + 1)
FIG. 8B shows stereoscopic image data at time n + 1. In the stereoscopic image data at time n + 1 shown in FIG. 8B, the person B42 appears in front of the person A41. The nose of the person A41, which is the maximum pop-out pixel 121 shown in FIG. 8E, also exists in the stereoscopic image data at time n + 1. Therefore, the value of presence / absence 113 of the movement destination coordinates is TRUE. Therefore, the maximum pop-out pixel existence area setting unit 11 sets the maximum pop-out pixel existence area 51 around the movement destination coordinates 112 of the maximum pop-out pixel according to the policy 1. Then, the maximum pop-out amount calculation unit 12 determines the pixel having the maximum pop-out amount within the maximum pop-out pixel existence area 51 as the maximum pop-out pixel 121.

図８（ｆ）に示すように、最大飛び出し画素１２１は、人物Ａ４１の鼻に該当する。立体画像生成部１３は、最大飛び出し画素１２１の飛び出し量以下になるように、飛び出し量を抑止した立体画像データを生成する。これにより、図７（ｆ）に示すように、人物Ｂ４２の鼻に最大飛び出し画素１２１が与えられた場合に比べ、人物Ｂ４２の飛び出し量が抑止される。 As shown in FIG. 8F, the maximum protruding pixel 121 corresponds to the nose of the person A41. The stereoscopic image generation unit 13 generates stereoscopic image data in which the pop-out amount is suppressed so as to be equal to or less than the pop-out amount of the maximum pop-out pixel 121. Accordingly, as shown in FIG. 7F, the pop-out amount of the person B42 is suppressed as compared with the case where the maximum pop-out pixel 121 is given to the nose of the person B42.

（時刻ｎ＋２）
図８（ｃ）は、時刻ｎ＋２の立体画像データを示している。図８（ｃ）に示す時刻ｎ＋２の立体画像データでは、人物Ｂ４２が表示画面から消え、人物Ａ４１が手前に出ている。時刻ｎ＋１の最大飛び出し画素１２１、即ち、人物Ａ４１の鼻は、時刻ｎ＋２の立体画像データにおいても存在している。従って、移動先座標の存在有無１１３の値は、ＴＲＵＥとなる。よって、最大飛び出し画素存在可能領域設定部１１は、方針１に従って、最大飛び出し画素の移動先座標１１２を中心に、最大飛び出し画素存在可能領域５１を設定する。そして、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域５１内で最大の飛び出し量を持つ画素を、最大飛び出し画素１２１に決定する。 (Time n + 2)
FIG. 8C shows stereoscopic image data at time n + 2. In the stereoscopic image data at time n + 2 shown in FIG. 8C, the person B42 disappears from the display screen, and the person A41 comes out to the front. The maximum protruding pixel 121 at time n + 1, that is, the nose of the person A41 also exists in the stereoscopic image data at time n + 2. Therefore, the value of presence / absence 113 of the movement destination coordinates is TRUE. Therefore, the maximum pop-out pixel existence area setting unit 11 sets the maximum pop-out pixel existence area 51 around the movement destination coordinates 112 of the maximum pop-out pixel according to the policy 1. Then, the maximum pop-out amount calculation unit 12 determines the pixel having the maximum pop-out amount within the maximum pop-out pixel existence area 51 as the maximum pop-out pixel 121.

図８（ｇ）に示すように、最大飛び出し画素１２１は人物Ａ４１の鼻に該当する。但し、人物Ａ４１が手前に移動しているため、最大飛び出し画素１２１の位置は、時刻ｎ＋１とは異なる。 As shown in FIG. 8G, the maximum protruding pixel 121 corresponds to the nose of the person A41. However, since the person A41 has moved forward, the position of the maximum pop-out pixel 121 is different from the time n + 1.

（時刻ｎ＋３）
図８（ｄ）及び図９（ａ）は、時刻ｎ＋３の立体画像データを示している。図８（ｄ）及び図９（ａ）に示す時刻ｎ＋３の立体画像データでは、ボールＡ４３が表示画面の右側で且つ人物Ａ４１の手前に出現している。時刻ｎ＋２の最大飛び出し画素１２１、即ち、人物Ａ４１の鼻は、時刻ｎ＋３の立体画像データ上に存在している。従って、移動先座標の存在有無１１３の値は、ＴＲＵＥとなる。よって、最大飛び出し画素存在可能領域設定部１１は、方針１に従って、最大飛び出し画素の移動先座標１１２を中心に、最大飛び出し画素存在可能領域５１を設定する。そして、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域５１内で最大の飛び出し量を持つ画素を、最大飛び出し画素１２１に決定する。 (Time n + 3)
FIG. 8D and FIG. 9A show stereoscopic image data at time n + 3. In the stereoscopic image data at time n + 3 shown in FIGS. 8D and 9A, the ball A43 appears on the right side of the display screen and in front of the person A41. The maximum protruding pixel 121 at time n + 2, that is, the nose of the person A41 is present on the stereoscopic image data at time n + 3. Therefore, the value of presence / absence 113 of the movement destination coordinates is TRUE. Therefore, the maximum pop-out pixel existence area setting unit 11 sets the maximum pop-out pixel existence area 51 around the movement destination coordinates 112 of the maximum pop-out pixel according to the policy 1. Then, the maximum pop-out amount calculation unit 12 determines the pixel having the maximum pop-out amount within the maximum pop-out pixel existence area 51 as the maximum pop-out pixel 121.

図８（ｈ）に示すように、最大飛び出し画素１２１は、人物Ａ４１の鼻に該当する。立体画像生成部１３は、最大飛び出し画素１２１の飛び出し量以下になるように、飛び出し量を抑止した立体画像データを生成する。これにより、図６（ｆ）に示すように、ボールＡ４３に最大飛び出し画素１２１が与えられた場合に比べ、ボールＡ４３の飛び出し量が抑止される。 As shown in FIG. 8H, the maximum protruding pixel 121 corresponds to the nose of the person A41. The stereoscopic image generation unit 13 generates stereoscopic image data in which the pop-out amount is suppressed so as to be equal to or less than the pop-out amount of the maximum pop-out pixel 121. As a result, as shown in FIG. 6F, the pop-out amount of the ball A43 is suppressed as compared with the case where the maximum pop-out pixel 121 is given to the ball A43.

（時刻ｎ＋４）
図９（ｂ）は、時刻ｎ＋４の立体画像データを示している。図９（ｂ）に示す時刻ｎ＋４の立体画像データでは、ボールＡ４３が人物Ａ４１の前面に映り、ボールＢ４４が表示画面の右側に映っている。時刻ｎ＋３の最大飛び出し画素１２１、即ち、人物Ａ４１の鼻は、時刻ｎ＋４の立体画像データ上では、ボールＡ４３に隠れて存在していない。従って、移動先座標の存在有無１１３の値は、ＦＡＬＳＥとなる。また、最大飛び出し画素の予測座標値１１４は、表示画面内に存在する。よって、最大飛び出し画素存在可能領域設定部１１は、方針３に従って、最大飛び出し画素の予測座標値１１４を中心に、最大飛び出し画素存在可能領域５１を設定する。そして、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域５１内で最大の飛び出し量を持つ画素を、最大飛び出し画素１２１に決定する。 (Time n + 4)
FIG. 9B shows stereoscopic image data at time n + 4. In the stereoscopic image data at time n + 4 shown in FIG. 9B, the ball A43 is reflected in front of the person A41 and the ball B44 is reflected on the right side of the display screen. The maximum protruding pixel 121 at time n + 3, that is, the nose of the person A41 does not exist behind the ball A43 on the stereoscopic image data at time n + 4. Therefore, the value of presence / absence 113 of the movement destination coordinates is FALSE. Further, the predicted coordinate value 114 of the largest pop-out pixel exists in the display screen. Therefore, the maximum pop-out pixel existence region setting unit 11 sets the maximum pop-out pixel existence region 51 around the predicted coordinate value 114 of the maximum pop-out pixel according to the policy 3. Then, the maximum pop-out amount calculation unit 12 determines the pixel having the maximum pop-out amount within the maximum pop-out pixel existence area 51 as the maximum pop-out pixel 121.

図９（ｆ）に示すように、最大飛び出し画素１２１は、ボールＡ４３に該当する。立体画像生成部１３は、最大飛び出し画素１２１の飛び出し量以下になるように、飛び出し量を抑止した立体画像データを生成する。これにより、図７（ｆ）に示すように、ボールＢ４４に最大飛び出し画素１２１が与えられた場合に比べ、ボールＢ４４の飛び出し量が抑止される。 As shown in FIG. 9F, the maximum protruding pixel 121 corresponds to the ball A43. The stereoscopic image generation unit 13 generates stereoscopic image data in which the pop-out amount is suppressed so as to be equal to or less than the pop-out amount of the maximum pop-out pixel 121. As a result, as shown in FIG. 7F, the amount of protrusion of the ball B44 is suppressed as compared with the case where the maximum protrusion pixel 121 is given to the ball B44.

（時刻ｎ＋５）
図９（ｃ）は、時刻ｎ＋５の立体画像データを示している。図９（ｃ）に示す時刻ｎ＋５の立体画像データでは、ボールＡ４３が表示画面から消え、人物Ａ４１とボールＢ４４とが映っている。時刻ｎ＋５において、ボールＢ４４は、時刻ｎ＋４より更に手前に存在する。時刻ｎ＋４の最大飛び出し画素１２１、即ち、ボールＡ４３は、時刻ｎ＋５において表示画面から出てしまい、表示画面上に存在していない。従って、移動先座標の存在有無１１３の値はＦＡＬＳＥとなり、且つ、最大飛び出し画素の予測座標値１１４も表示画面外となる。よって、最大飛び出し画素存在可能領域設定部１１は、方針４に従って、時刻ｎ＋４の最大飛び出し画素１２１の座標を中心に、最大飛び出し画素存在可能領域５１を設定する。そして、最大飛び出し量算出部１２は、最大飛び出し画素存在可能領域５１内で最大の飛び出し量を持つ画素を、最大飛び出し画素１２１に決定する。 (Time n + 5)
FIG. 9C shows stereoscopic image data at time n + 5. In the stereoscopic image data at time n + 5 shown in FIG. 9C, the ball A43 disappears from the display screen, and the person A41 and the ball B44 are shown. At time n + 5, the ball B44 is present before the time n + 4. The maximum pop-out pixel 121 at time n + 4, that is, the ball A43, has left the display screen at time n + 5 and does not exist on the display screen. Therefore, the value of presence / absence 113 of the movement destination coordinate is FALSE, and the predicted coordinate value 114 of the maximum pop-out pixel is also outside the display screen. Therefore, the maximum pop-out pixel existence region setting unit 11 sets the maximum pop-out pixel existence region 51 around the coordinates of the maximum pop-out pixel 121 at time n + 4 according to the policy 4. Then, the maximum pop-out amount calculation unit 12 determines the pixel having the maximum pop-out amount within the maximum pop-out pixel existence area 51 as the maximum pop-out pixel 121.

図９（ｇ）に示すように、最大飛び出し画素１２１は、人物Ａ４１の鼻に該当する。立体画像生成部１３は、最大飛び出し画素１２１の飛び出し量以下になるように、飛び出し量を抑止した立体画像データを生成する。これにより、図７（ｇ）に示すように、ボールＢ４４に最大飛び出し画素１２１が与えられた場合に比べ、ボールＢ４４の飛び出し量が抑止される。 As shown in FIG. 9G, the maximum pop-out pixel 121 corresponds to the nose of the person A41. The stereoscopic image generation unit 13 generates stereoscopic image data in which the pop-out amount is suppressed so as to be equal to or less than the pop-out amount of the maximum pop-out pixel 121. As a result, as shown in FIG. 7G, the pop-out amount of the ball B44 is suppressed as compared with the case where the maximum pop-out pixel 121 is given to the ball B44.

（時刻ｎ＋６）
図７（ｄ）は、時刻ｎ＋６の立体画像データを示している。時刻ｎ＋５と時刻ｎ＋６との間にシーンチェンジが存在するため、時刻ｎ＋５と時刻ｎ＋６とでは異なる映像になる。よって、最大飛び出し画素存在可能領域設定部１１は、方針２に従って、表示画面全体で最大飛び出し量を持つ画素を、最大飛び出し画素１２１に決定する。時刻ｎ＋６の立体画像データにおいて、最も手前に存在するのは自動車４５であるから、図７（ｈ）に示すように、最大飛び出し画素１２１は、自動車の左前方に該当する。 (Time n + 6)
FIG. 7D shows stereoscopic image data at time n + 6. Since there is a scene change between the time n + 5 and the time n + 6, the video is different at the time n + 5 and the time n + 6. Therefore, the maximum pop-out pixel existence area setting unit 11 determines the pixel having the maximum pop-out amount on the entire display screen as the maximum pop-out pixel 121 according to the policy 2. In the stereoscopic image data at time n + 6, since the automobile 45 is present in the foreground, the maximum pop-out pixel 121 corresponds to the left front of the automobile as shown in FIG.

次に、図１０を参照しながら、時刻ｎから時刻ｎ＋６までの、最大飛び出し画素１２１の位置の変移について説明する。図１０（ａ）は、比較例における最大飛び出し画素１２１が変移する経路を示している。図１０（ｂ）は、本実施形態における最大飛び出し画素１２１が変移する経路を示している。図１０において、カッコ内に記した数字は、最大飛び出し画素１２１の変移の順序を示す。 Next, transition of the position of the maximum pop-out pixel 121 from time n to time n + 6 will be described with reference to FIG. FIG. 10A shows a path along which the maximum protruding pixel 121 changes in the comparative example. FIG. 10B shows a path along which the maximum pop-out pixel 121 changes in the present embodiment. In FIG. 10, numbers in parentheses indicate the order of transition of the maximum pop-out pixel 121.

図１０（ａ）に示すように、比較例における最大飛び出し画素１２１の変移は大きく、しかも、変移方向が様々である。一方、図１０（ｂ）に示すように、本実施形態における最大飛び出し画素１２１の変移は小さい。また、変移方向は人物Ａ４１の移動方向に沿っている。 As shown in FIG. 10A, the transition of the maximum protruding pixel 121 in the comparative example is large, and the transition direction is various. On the other hand, as shown in FIG. 10B, the transition of the maximum protruding pixel 121 in this embodiment is small. Further, the transition direction is along the movement direction of the person A41.

立体画像データにおいて飛び出し量が大きい領域に視聴者は注目するため、最大飛び出し画素１２１の変移が大きい場合、視線の変移も大きくなる。また、視線の変移が大きい場合、疲労の蓄積も大きく、映像酔いを引き起こしやすい。しかも、視線の変移方向が不規則である場合、映像酔いを引き起こしやすい。 Since the viewer pays attention to the region where the pop-out amount is large in the stereoscopic image data, the shift of the line of sight increases when the shift of the maximum pop-out pixel 121 is large. In addition, when the line of sight changes greatly, the accumulation of fatigue is large, and video sickness is likely to occur. In addition, when the line-of-sight shift direction is irregular, video sickness is likely to occur.

従って、本実施形態によれば、最大飛び出し画素１２１の変移を抑止し、立体画像データの視聴に伴う視線の変移量を抑止することが可能となる。また、本実施形態によれば、最大飛び出し画素１２１の変移方向を制限することにより、映像酔いの大きな要因である不規則な視線の変移を防止することが可能となる。 Therefore, according to the present embodiment, it is possible to suppress the shift of the maximum pop-out pixel 121 and to suppress the shift amount of the line of sight accompanying viewing of the stereoscopic image data. Further, according to the present embodiment, by restricting the transition direction of the maximum pop-out pixel 121, it is possible to prevent an irregular line-of-sight transition that is a major cause of video sickness.

次に、本発明の第２の実施形態について説明する。図１１は、本発明の第２の実施形態に係る立体画像生成装置の構成を示す図である。図１１に示す第２の実施形態に係る立体画像生成装置と、図１に示す第１の実施形態に係る立体画像生成装置との違いは、第２の実施形態における最大飛び出し画素存在可能領域設定部１１１１は、第１の実施形態で説明した最大飛び出し画素存在可能領域の設定機能に加え、最大飛び出し画素存在可能領域の形状や大きさを変更する機能を備える。即ち、第２の実施形態における最大飛び出し画素存在可能領域設定部１１１１は、入力される最大飛び出し画素存在可能領域属性１０６の値に従って、最大飛び出し画素存在可能領域の形状や大きさを変更する。 Next, a second embodiment of the present invention will be described. FIG. 11 is a diagram illustrating a configuration of a stereoscopic image generating apparatus according to the second embodiment of the present invention. The difference between the stereoscopic image generating apparatus according to the second embodiment shown in FIG. 11 and the stereoscopic image generating apparatus according to the first embodiment shown in FIG. 1 is that the maximum pop-out pixel existence region setting in the second embodiment is performed. The unit 1111 has a function of changing the shape and size of the maximum pop-out pixel existence area in addition to the function of setting the maximum pop-out pixel existence area described in the first embodiment. In other words, the maximum pop-out pixel existence area setting unit 1111 in the second embodiment changes the shape and size of the maximum pop-out pixel existence area according to the value of the input maximum pop-out pixel existence area attribute 106.

最大飛び出し画素存在可能領域属性１０６は、最大飛び出し画素存在可能領域の形状や大きさを表す属性値である。なお、最大飛び出し画素存在可能領域属性は、本実施形態に係る立体映像生成裝置のユーザにより任意に設定可能な値である。最大飛び出し画素存在可能領域属性の設定手法としては、例えば、リモコンやＯＳＤ（On-Screen-Display）を使用してユーザが任意の最大飛び出し画素存在可能領域属性を設定する手法が挙げられる。 The maximum pop-out pixel existence region attribute 106 is an attribute value representing the shape and size of the maximum pop-out pixel existence region. Note that the maximum pop-out pixel existence region attribute is a value that can be arbitrarily set by the user of the stereoscopic video generation apparatus according to the present embodiment. As a method for setting the maximum pop-out pixel existence region attribute, for example, there is a method in which the user sets an arbitrary maximum pop-out pixel existence region attribute using a remote controller or OSD (On-Screen-Display).

図１２は、第２の実施形態における最大飛び出し画素存在可能領域設定部１１１１の詳細な構成を示す図である。図１１に示す第２の実施形態における最大飛び出し画素存在可能領域設定部１１１１と、図２に示す第１の実施形態における最大飛び出し画素存在可能領域設定部１１との違いは、最大飛び出し画素存在可能領域選択部１１２４の機能にある。 FIG. 12 is a diagram illustrating a detailed configuration of the maximum pop-out pixel existence area setting unit 1111 in the second embodiment. The difference between the maximum pop-out pixel existence region setting unit 1111 in the second embodiment shown in FIG. 11 and the maximum pop-out pixel existence region setting unit 11 in the first embodiment shown in FIG. This is in the function of the area selection unit 1124.

即ち、第２の実施形態における最大飛び出し画素存在可能領域選択部１１２４は、最大飛び出し画素存在可能領域属性１０６の値に従って、最大飛び出し画素存在可能領域の位置を決定し、最大飛び出し画素存在可能領域の座標値１０２を算出する。 That is, the maximum pop-out pixel existence area selection unit 1124 according to the second embodiment determines the position of the maximum pop-out pixel existence area according to the value of the maximum pop-out pixel existence area attribute 106, and determines the maximum pop-out pixel existence area. A coordinate value 102 is calculated.

ここで、最大飛び出し画素存在可能領域属性１０６の具体例について説明する。最大飛び出し画素存在可能領域属性１０６は、４ビットの値をとり、上位２ビットで形状を表し、下位２ビットで大きさを表すものとする。例えば、上位２ビットと形状との関係、及び、下位２ビットと大きさとの関係は、以下のようになる。
最大飛び出し画素存在可能領域属性１０６の上位２ビット
００不使用
０１円（ｃｉｒｃｌｅ）
１０角丸四角（ｒｏｕｎｄｅｄｒｅｃｔａｎｇｌｅ）
１１楕円（ｅｌｌｉｐｓｅ）
最大飛び出し画素存在可能領域属性１０６の下位２ビット
００不使用
０１小（表示画面上で水平方向の長さ５ｃｍ）
１０中（表示画面上で水平方向の長さ１０ｃｍ）
１１大（表示画面上で水平方向の長さ１５ｃｍ） Here, a specific example of the maximum pop-out pixel existence region attribute 106 will be described. The maximum pop-out pixel existence area attribute 106 takes a 4-bit value, and the shape is represented by the upper 2 bits and the size is represented by the lower 2 bits. For example, the relationship between the upper 2 bits and the shape and the relationship between the lower 2 bits and the size are as follows.
Upper 2 bits of the maximum pop-out pixel existence area attribute 106 00 Not used 01 Circle
10 rounded rectangle
11 Ellipse
Lower 2 bits of maximum pop-out pixel existence area attribute 106 00 Not used 01 Small (horizontal length 5 cm on display screen)
10 Medium (Length 10cm in the horizontal direction on the display screen)
11 large (horizontal length of 15 cm on the display screen)

なお、角丸四角及び楕円の水平方向の大きさと垂直方向の大きさとの比は、例えば、水平：垂直＝１．２：１．０とする。例えば、画素存在可能領域属性１０６＝１１１０である場合、最大飛び出し画素存在可能領域は、表示画面上で水平方向の長さが１０ｃｍの楕円となる。勿論、画素存在可能領域属性１０６の様式や画素存在可能領域の形状及び大きさが、上述した例に限定されるものではない。 Note that the ratio of the horizontal size and the vertical size of rounded squares and ellipses is, for example, horizontal: vertical = 1.2: 1.0. For example, when the pixel existence possibility area attribute 106 = 1110, the maximum pop-out pixel existence possibility area is an ellipse having a horizontal length of 10 cm on the display screen. Of course, the form of the pixel existence possible area attribute 106 and the shape and size of the pixel existence possible area are not limited to the above-described example.

また、本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（プログラム）を、ネットワーク又は各種記憶媒体を介してシステム或いは装置に供給し、そのシステム或いは装置のコンピュータ（またはＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。 The present invention can also be realized by executing the following processing. That is, software (program) that realizes the functions of the above-described embodiments is supplied to a system or apparatus via a network or various storage media, and a computer (or CPU, MPU, or the like) of the system or apparatus reads the program. It is a process to be executed.

１１、１１１１：最大飛び出し画素存在可能領域設定部、１２：最大飛び出し量算出部、１３：立体画像生成部、２１：移動先座標検出部、２２：移動先座標予測部、２３：シーンチェンジ判定部、２４、１１２４：最大飛び出し画素存在可能領域選択部、２５：遅延部、２６：動きベクトル算出部、２８：移動先座標演算部 11, 1111: Maximum pop-out pixel existence possible area setting unit, 12: Maximum pop-out amount calculation unit, 13: Stereo image generation unit, 21: Destination coordinate detection unit, 22: Destination coordinate prediction unit, 23: Scene change determination unit , 24, 1124: maximum pop-out pixel existence region selection unit, 25: delay unit, 26: motion vector calculation unit, 28: destination coordinate calculation unit

Claims

Input means for inputting image data;
Setting means for setting a region where a pixel having the maximum pop-out amount may exist for the image data input by the input means;
Calculating means for calculating the maximum amount of popping out of the pixels in the region set by the setting means;
Determining means for determining the maximum pop-out amount of the pixels in the stereoscopic image data based on the maximum pop-out amount calculated by the calculating means;
A stereoscopic image generating apparatus comprising: generating means for generating the stereoscopic image data from the image data based on the maximum pop-out amount determined by the determining means.

2. The stereoscopic image generation according to claim 1, wherein the determining unit determines a maximum pop-out amount of a pixel in the stereoscopic image data so as not to exceed a maximum pop-out amount calculated by the calculating unit. apparatus.

In the setting unit, a pixel having the maximum pop-out amount in the first image data input at the first time point by the input unit is input by the input unit at a second time point after the first time point. 3. The stereoscopic image generation apparatus according to claim 1, wherein the region is set based on a position of the pixel in the second image data when the second image data exists.

The setting means predicts a movement destination of the pixel at the second time point when a pixel with the maximum pop-out amount in the first image data does not exist in the second image data, and The stereoscopic image generating apparatus according to claim 3, wherein the area is set based on a destination.

The three-dimensional object according to claim 3 or 4, wherein the setting means sets the entire display screen as the region when a scene change exists between the first time point and the second time point. Image generation device.

The stereoscopic image generation according to any one of claims 1 to 5, further comprising changing means for changing at least one of the shape and size of the area set by the setting means. apparatus.

A stereoscopic image generation method executed by a stereoscopic image generation apparatus,
An input step for inputting image data;
A setting step for setting an area where there is a possibility that a pixel having a maximum pop-out amount exists for the image data input in the input step;
A calculation step for calculating a maximum amount of protrusion of the pixel in the region set by the setting step;
A determination step for determining a maximum amount of pop-up of pixels in the stereoscopic image data based on the maximum amount of pop-up calculated by the calculating step;
And a generating step of generating the stereoscopic image data from the image data based on the maximum pop-out amount determined in the determining step.

An input step for inputting image data;
A setting step for setting an area where there is a possibility that a pixel having a maximum pop-out amount exists for the image data input in the input step;
A calculation step for calculating a maximum amount of protrusion of the pixel in the region set by the setting step;
A determination step for determining a maximum amount of pop-up of pixels in the stereoscopic image data based on the maximum amount of pop-up calculated by the calculating step;
A program for causing a computer to execute a generation step of generating the stereoscopic image data from the image data based on the maximum pop-out amount determined by the determination step.