JP7047878B2 - Video encoding device

Info

Publication number: JP7047878B2
Application number: JP2020173828A
Authority: JP
Inventors: 章弘屋森
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2020-10-15
Filing date: 2020-10-15
Publication date: 2022-04-05
Anticipated expiration: 2036-12-27
Also published as: JP2021005906A

Description

The present invention relates to a video encoding device.

In video encoding, an image of a rectangular region is often divided into blocks, and motion compensation and frequency transform are performed block by block. A representative video coding standard is ISO/IEC 23008-2 High Efficiency Video Coding (HEVC).

In recent years, cameras for capturing video have become more capable, and there is a trend away from capturing rectangular regions toward wider-angle panoramic capture. It is also becoming possible to capture all-around panoramic video with a fisheye lens and 360° panoramic video with a plurality of cameras (see, for example, Non-Patent Documents 1 to 3).

A motion estimation method for panoramic video containing 360° omnidirectional video information is also known (see, for example, Patent Document 1). Also known are a method of determining a motion vector search area using a global vector between images, and a method of dividing a panoramic image to generate a panoramic image in which moving objects are easy to monitor (see, for example, Patent Documents 2 and 3).

Patent Document 1: Japanese National Publication of International Patent Application No. 2008-510359
Patent Document 2: Japanese Unexamined Patent Publication No. 2010-109917
Patent Document 3: Japanese Unexamined Patent Publication No. 2013-218432

Non-Patent Document 1: "Entaniya Fisheye Support Blog", [online], [retrieved November 16, 2016], Internet <URL: https://www.entapano.com/blog/360-degree-panoramic-video-camera/>
Non-Patent Document 2: "THETA", [online], [retrieved November 16, 2016], Internet <URL: https://theta360.com/ja/about/theta/>
Non-Patent Document 3: "Professional Plug&Play 360° Video Camera", [online], [retrieved November 16, 2016], Internet <URL: http://www.sphericam.com/sphericam2/>

When panoramic video with large motion is encoded, many motion vectors may be generated and the coding efficiency (compression ratio) may decrease.
In one aspect, an object of the present invention is to improve the coding efficiency of panoramic video.

In one proposal, a video encoding device includes a storage unit, a determination unit, a correction unit, and an encoding unit.
The storage unit stores a reference image used to encode an encoding-target panoramic image included in a panoramic video that combines a plurality of videos captured by a plurality of imaging devices.

When an encoding-target region in the encoding-target panoramic image is displaced relative to the reference image because the capture range of each of the plurality of videos has moved, the determination unit determines a vector representing the amount of displacement of the encoding-target region relative to the reference image. The correction unit generates a corrected encoding-target region by correcting the position of the encoding-target region within the encoding-target panoramic image based on the vector representing the amount of displacement.

The encoding unit encodes the image of the corrected encoding-target region using the reference image.

According to the embodiments, the coding efficiency of panoramic video can be improved.

FIG. 1 is a diagram showing a first unfolded panoramic image.
FIG. 2 is a diagram showing a second unfolded panoramic image.
FIG. 3 is a diagram showing a third unfolded panoramic image.
FIG. 4 is a functional configuration diagram of a first video encoding device.
FIG. 5 is a flowchart of a first video encoding process.
FIG. 6 is a functional configuration diagram of a second video encoding device.
FIG. 7 is a flowchart of a second video encoding process.
FIG. 8 is a functional configuration diagram of a video decoding device.
FIG. 9 is a flowchart of a video decoding process.
FIG. 10 is a configuration diagram of a panoramic video encoding system.
FIG. 11 is a functional configuration diagram showing a specific example of the video encoding device.
FIG. 12 is a diagram showing a correction process.
FIG. 13 is a diagram showing all-around panoramic video.
FIG. 14 is a diagram showing a correction process for all-around panoramic video.
FIG. 15 is a flowchart showing a specific example of the first video encoding process.
FIG. 16 is a flowchart of a correction process based on the number of generated motion vectors.
FIG. 17 is a flowchart of a correction process based on ME efficiency.
FIG. 18 is a flowchart of a video encoding process based on coding efficiency.
FIG. 19 is a flowchart of a correction process based on coding efficiency.
FIG. 20 is a functional configuration diagram showing a specific example of the video decoding device.
FIG. 21 is a flowchart showing a specific example of the video decoding process.
FIG. 22 is a flowchart showing a specific example of the second video encoding process.
FIG. 23 is a configuration diagram of an information processing device.

Hereinafter, embodiments will be described in detail with reference to the drawings.
When the panoramic image at each time point in a panoramic video is encoded, the captured panoramic image is distorted, so encoding it as is lowers the coding efficiency. It is therefore effective to correct the distortion, unfold the image into a rectangular panoramic image, divide the unfolded panoramic image into predetermined regions that serve as units of encoding, and encode each predetermined region.

The rectangular panoramic image at each time point may be called a frame or a picture, and the predetermined region serving as the unit of encoding may be called a block.

When the reference position used to divide the rectangular panoramic image into blocks is fixed relative to the captured panoramic image, panning, misalignment, or movement of the camera may produce video in which all blocks appear to move in the same direction between frames. If such panoramic images are encoded with the reference position kept fixed, an unnecessarily large number of motion vectors is generated and the coding efficiency decreases.

The motion estimation method of Patent Document 1 exploits the high correlation within panoramic video containing 360° omnidirectional video information: pixels of the left boundary portion of the picture are read as the pixels of the region beyond the right boundary of the picture. Likewise, pixels of the right boundary portion of the picture are read as the pixels of the region beyond the left boundary. This improves the image quality at the right and left boundary portions of the picture.
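As a rough illustration of the boundary handling attributed to Patent Document 1, the following Python sketch reads pixels with horizontal wraparound. The function name `read_pixel_wrapped` and the list-of-rows frame layout are assumptions made for this example and are not taken from the cited document.

```python
def read_pixel_wrapped(frame, x, y):
    """Return the pixel at column x, row y, wrapping x around the frame width.

    frame is a list of rows. A column beyond the right boundary is read from
    the left boundary portion, and a column beyond the left boundary is read
    from the right boundary portion, as for a 360-degree panorama.
    """
    width = len(frame[0])
    return frame[y][x % width]


frame = [[10, 20, 30, 40],
         [50, 60, 70, 80]]

right_of_right = read_pixel_wrapped(frame, 4, 0)   # one column past the right edge
left_of_left = read_pixel_wrapped(frame, -1, 1)    # one column past the left edge
```

Here the column just past the right edge resolves to column 0, and the column just past the left edge resolves to the last column.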

This motion estimation method presupposes that the camera capturing the omnidirectional panoramic video is fixed in position and does not move significantly. That is, it targets panoramic video that looks out over 360° of surroundings from a given point.

In recent years, however, handheld or vehicle-mounted all-around cameras such as those shown in Non-Patent Documents 1 to 3 have come into use.

FIG. 1 shows an example of a first unfolded panoramic image, generated by unfolding a panoramic image captured with the camera of Non-Patent Document 1. This camera can capture a circular panoramic image 101 using a single convex lens (fisheye lens).

For example, when the panoramic image 101 is captured outdoors with the convex lens pointing straight up, the sky appears in the central portion of the panoramic image 101, so the unfolded panoramic image 103 can be generated from the region excluding that central portion. In this case, the unfolded panoramic image 103 is generated by cutting the panoramic image 101 along the boundary line 102 and converting it into a rectangular shape.
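The conversion from a circular image to a rectangular one can be pictured as a polar-to-rectangular resampling. The following Python sketch is one possible reading under stated assumptions, not the procedure of the embodiment: the cut line is taken at angle 0, the excluded central portion is modeled by an inner radius `r_inner`, nearest-neighbor sampling is used, and the function name `unfold_circular` and all of its parameters are hypothetical.

```python
import math


def unfold_circular(image, cx, cy, r_inner, r_outer, out_w, out_h):
    """Unfold the ring between r_inner and r_outer into an out_h x out_w rectangle.

    Column u maps to the angle 2*pi*u/out_w measured from the cut line, and
    row v maps linearly from r_inner (row 0) to r_outer (last row).
    Nearest-neighbor sampling is used for brevity.
    """
    unfolded = []
    for v in range(out_h):
        radius = r_inner + (r_outer - r_inner) * v / max(out_h - 1, 1)
        row = []
        for u in range(out_w):
            angle = 2.0 * math.pi * u / out_w
            x = int(round(cx + radius * math.cos(angle)))
            y = int(round(cy + radius * math.sin(angle)))
            row.append(image[y][x])
        unfolded.append(row)
    return unfolded


# 9x9 test image whose pixel value encodes its own coordinates (10*y + x).
image = [[10 * y + x for x in range(9)] for y in range(9)]
panorama = unfold_circular(image, cx=4, cy=4, r_inner=2, r_outer=4,
                           out_w=4, out_h=3)
```

A practical implementation would also compensate for the lens projection and interpolate between pixels; both are omitted here.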

FIG. 2 shows an example of a second unfolded panoramic image, generated by unfolding panoramic images captured with the 360° camera of Non-Patent Document 2. This 360° camera can capture circular panoramic images 201 and 202 using two cameras arranged back to back.

For example, when a user riding in a vehicle shoots with one camera facing forward in the direction of travel, the panoramic image 201 shows the scenery ahead and the panoramic image 202 shows the scenery behind. In this case, converting the panoramic images 201 and 202 into rectangular shapes generates a front unfolded panoramic image 203 and a rear unfolded panoramic image 204. Furthermore, an all-around panoramic image 205 can be generated by joining the unfolded panoramic images 203 and 204.

FIG. 3 shows an example of a third unfolded panoramic image, generated by unfolding a panoramic image captured with the 360° camera of Non-Patent Document 3. Using six cameras arranged in a spherical configuration, this 360° camera can capture a panoramic image 301 that covers not only the full horizontal surroundings but the entire space, including the sky and the ground.

For example, when this 360° camera is mounted on the roof of a vehicle, region 311 in the panoramic image 301 shows the scenery ahead in the direction of travel, region 312 shows the scenery to the rear left, and regions 313 and 314 show the scenery to the rear right. In this case, converting the images of regions 311 and 312 into rectangular shapes generates a front unfolded panoramic image 321 and a rear-left unfolded panoramic image 322. Converting the images of regions 313 and 314 into a rectangular shape generates a rear-right unfolded panoramic image 323.

Furthermore, an all-around panoramic image can be generated by joining a plurality of unfolded panoramic images, including the unfolded panoramic images 321 to 323.

The panoramic image 101 of FIG. 1, the panoramic images 201 and 202 of FIG. 2, and the panoramic image 301 of FIG. 3 capture a wide field of view and are therefore often distorted. In contrast, the unfolded panoramic images 103, 203, and 204, the all-around panoramic image 205, and the unfolded panoramic images 321 to 323 have been converted into rectangular shapes and are free of distortion.

In video encoding, motion estimation (ME) and related processing are more efficient for distortion-free panoramic video than for distorted panoramic video. Unfolded panoramic images are therefore expected to be used increasingly often when panoramic video is encoded.

The ME efficiency is expressed, for example, by the absolute value of the difference between the image of each block in the encoding-target panoramic image and the image of the predicted block determined by the motion vector. In this case, the smaller the absolute difference, the better the ME efficiency.
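One common way to turn "the absolute value of the difference" into a single number per block is the sum of absolute differences (SAD). The text does not name SAD, so the Python sketch below is an assumption about how the measure might be computed; the function name `sum_abs_diff` and the sample pixel values are illustrative only.

```python
def sum_abs_diff(block, predicted):
    """Sum of absolute pixel differences between a block and its prediction."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block, predicted)
               for a, b in zip(row_a, row_b))


target_block = [[10, 12],
                [8, 9]]
good_prediction = [[10, 11],
                   [8, 10]]
poor_prediction = [[0, 0],
                   [0, 0]]

good_cost = sum_abs_diff(target_block, good_prediction)
poor_cost = sum_abs_diff(target_block, poor_prediction)
```

A lower cost corresponds to better ME efficiency, so `good_prediction` would be preferred over `poor_prediction`.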

Because video encoding includes block-based motion compensation and frequency transform, the information generated consists mainly of motion vector information and frequency-transformed difference information. Consequently, when the ME efficiency is roughly the same, the amount of motion vector information has a large effect on the coding efficiency. Moreover, a panoramic image with large motion generates more motion vector information than one with almost no motion. In other words, the larger the motion in a panoramic image, the larger the generated code amount and the lower the coding efficiency.

The quantization scale can be increased to hold down the generated code amount and keep the coding efficiency constant, but doing so degrades the quality of the panoramic image restored from the encoded panoramic image.

As described above, when panning, misalignment, or movement of the camera produces video in which all blocks appear to move in the same direction between frames, the motion estimation method of Patent Document 1 generates many motion vectors and the coding efficiency decreases.

FIG. 4 shows an example functional configuration of a first video encoding device of the embodiment. The video encoding device 401 of FIG. 4 includes a storage unit 411, a determination unit 412, a correction unit 413, and an encoding unit 414. The storage unit 411 stores a reference panoramic image 421 used to encode an encoding-target panoramic image. The encoding-target panoramic image is obtained by unfolding a panoramic image included in panoramic video captured by an imaging device. The determination unit 412, the correction unit 413, and the encoding unit 414 perform a video encoding process using the reference panoramic image 421.

FIG. 5 is a flowchart showing an example of the first video encoding process performed by the video encoding device 401 of FIG. 4. First, the determination unit 412 determines a vector representing the amount of displacement of the encoding-target panoramic image relative to the reference panoramic image 421 (step 501).

Next, the correction unit 413 generates a corrected encoding-target panoramic image by correcting the position of each of a plurality of encoding-target regions in the encoding-target panoramic image based on the vector representing the amount of displacement (step 502). The encoding unit 414 then encodes the image of each of the plurality of encoding-target regions in the corrected encoding-target panoramic image using the reference panoramic image 421 (step 503).
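To make step 502 more concrete, the Python sketch below moves the origin of every block by one displacement vector. This passage does not fix the sign convention or the handling at the frame edges, so adding the vector, wrapping horizontally (as suits an all-around panorama), and clamping vertically are all assumptions of this example, as is the name `corrected_block_origins`.

```python
def corrected_block_origins(frame_w, frame_h, block_size, global_vector):
    """Return the top-left corner of each block after applying global_vector.

    Horizontal positions wrap around the frame width; vertical positions are
    clamped so that each block stays inside the frame.
    """
    gx, gy = global_vector
    origins = []
    for y in range(0, frame_h, block_size):
        for x in range(0, frame_w, block_size):
            new_x = (x + gx) % frame_w
            new_y = min(max(y + gy, 0), frame_h - block_size)
            origins.append((new_x, new_y))
    return origins


shifted_right = corrected_block_origins(16, 8, 8, (3, 0))
shifted_left = corrected_block_origins(16, 8, 8, (-3, 0))
```

With a 16x8 frame and 8x8 blocks, the two block origins move from columns 0 and 8 to columns 3 and 11 for a displacement of +3, and to columns 13 and 5 for a displacement of -3.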

Such a video encoding device 401 can improve the coding efficiency of panoramic video.

FIG. 6 shows an example functional configuration of a second video encoding device of the embodiment. The video encoding device 601 of FIG. 6 includes a storage unit 611, a determination unit 612, a correction unit 613, and an encoding unit 614. The storage unit 611 stores a reference image 621 used to encode an encoding-target panoramic image. The encoding-target panoramic image is a panoramic image included in panoramic video that combines a plurality of videos captured by a plurality of imaging devices. The determination unit 612, the correction unit 613, and the encoding unit 614 perform a video encoding process using the reference image 621.

FIG. 7 is a flowchart showing an example of the second video encoding process performed by the video encoding device 601 of FIG. 6. First, when an encoding-target region in the encoding-target panoramic image is displaced relative to the reference image 621 because the capture range of each of the plurality of videos has moved, the determination unit 612 determines a vector representing the amount of displacement of the encoding-target region relative to the reference image 621 (step 701).

Next, the correction unit 613 generates a corrected encoding-target region by correcting the position of the encoding-target region in the encoding-target panoramic image based on the vector representing the amount of displacement (step 702). The encoding unit 614 then encodes the image of the corrected encoding-target region using the reference image 621 (step 703).

Such a video encoding device 601 can improve the coding efficiency of panoramic video.

FIG. 8 shows an example functional configuration of a video decoding device of the embodiment. The video decoding device 801 of FIG. 8 includes a storage unit 811, an extraction unit 812, a decoding unit 813, and a correction unit 814. The video encoding device 401 of FIG. 4 generates an encoded panoramic image by encoding the encoding-target panoramic image. The video decoding device 801 decodes encoded panoramic video that includes the encoded panoramic images generated by the video encoding device 401.

The storage unit 811 stores a reference panoramic image 821 used to decode the encoded panoramic image. The extraction unit 812, the decoding unit 813, and the correction unit 814 perform a video decoding process using the reference panoramic image 821.

FIG. 9 is a flowchart showing an example of the video decoding process performed by the video decoding device 801 of FIG. 8. First, the extraction unit 812 extracts, from the encoded panoramic video, the vector representing the amount of displacement of the encoding-target panoramic image relative to the reference panoramic image 821 (step 901).

Next, the decoding unit 813 decodes each of a plurality of decoding-target regions in the encoded panoramic image using the reference panoramic image 821 to generate a decoded panoramic image (step 902). The correction unit 814 then restores the encoding-target panoramic image by correcting the position of each of the plurality of decoding-target regions in the decoded panoramic image based on the vector representing the amount of displacement (step 903).
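The relationship between the encoder-side correction and the restoration in step 903 can be illustrated with a deliberately simplified Python sketch. It treats the correction as a cyclic horizontal shift of the whole image and ignores coding loss; both simplifications, along with the names `shift_columns` and `restore_original_layout`, are assumptions of this example rather than details given in the text.

```python
def shift_columns(image, dx):
    """Cyclically shift every row of image by dx columns (positive = right)."""
    width = len(image[0])
    return [[row[(x - dx) % width] for x in range(width)] for row in image]


def restore_original_layout(decoded_image, global_vector_x):
    """Undo the encoder-side shift by applying the opposite displacement."""
    return shift_columns(decoded_image, -global_vector_x)


original = [[1, 2, 3, 4],
            [5, 6, 7, 8]]
corrected = shift_columns(original, 1)             # encoder-side position correction
restored = restore_original_layout(corrected, 1)   # decoder-side restoration
```

Because the decoder receives the same vector that the encoder used, applying the opposite displacement returns each region to its position in the original panoramic image.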

Such a video decoding device 801 can improve the coding efficiency of panoramic video.

FIG. 10 shows a configuration example of a panoramic video encoding system. The panoramic video encoding system 1001 of FIG. 10 includes a capture unit 1011, a combining unit 1012, an unfolding unit 1013, and a video encoding device 1014.

The capture unit 1011 includes one or more imaging devices. As a capture unit 1011 that includes one imaging device, the camera of Non-Patent Document 1 can be used, for example. As a capture unit 1011 that includes a plurality of imaging devices, the 360° camera of Non-Patent Document 2 or the 360° camera of Non-Patent Document 3 can be used, for example.

When the capture unit 1011 includes one imaging device, the combining unit 1012 outputs the all-around panoramic image at each time point included in the panoramic video captured by that imaging device. The unfolding unit 1013 divides the all-around panoramic image into a plurality of regions and generates an unfolded panoramic image by repeating a planar projective transformation region by region. In this case, the video encoding device 1014 corresponds to the video encoding device 401 of FIG. 4 and encodes the unfolded panoramic image at each time point to generate a bitstream of encoded panoramic video.

When the capture unit 1011 includes a plurality of imaging devices, on the other hand, the combining unit 1012 combines the images included in the videos captured by the respective imaging devices to generate an all-around panoramic image, and the unfolding unit 1013 converts the all-around panoramic image to generate an unfolded panoramic image. In this case, the video encoding device 1014 corresponds to the video encoding device 601 of FIG. 6 and encodes the unfolded panoramic image at each time point to generate a bitstream of encoded panoramic video.

When the capture unit 1011 is stationary, the all-around panoramic image as a whole always covers the same capture range, looking out over 360° of surroundings centered on the capture unit 1011. However, panning of the capture unit 1011 or the like may shift the starting point of the all-around panoramic image at each time point, which may shift the positions of the subjects shown. When the capture unit 1011 is moving, on the other hand, the capture range is expected to change over time, and the subjects shown change as well.

FIG. 11 shows a specific example of the video encoding device 1014 of FIG. 10. The video encoding device 1014 of FIG. 11 includes a changing unit 1101, a judgment unit 1102, a determination unit 1103, a changing unit 1104, a subtraction unit 1105, a transform and quantization unit (T/Q) 1106, and an entropy coding unit (ENT) 1107. The video encoding device 1014 further includes an inverse quantization and inverse transform unit (IQ/IT) 1108, an addition unit 1109, a motion compensation unit 1110, a predicted image generation unit 1111, and a frame memory 1112.

First, the operation in the case where the video encoding device 1014 corresponds to the video encoding device 401 of FIG. 4 will be described. In this case, the changing unit 1101 and the judgment unit 1102 correspond to the correction unit 413, and the determination unit 1103 corresponds to the determination unit 412. The subtraction unit 1105, the T/Q 1106, the ENT 1107, the IQ/IT 1108, the addition unit 1109, the motion compensation unit 1110, and the predicted image generation unit 1111 correspond to the encoding unit 414, and the frame memory 1112 corresponds to the storage unit 411.

The unfolded panoramic image generated by the unfolding unit 1013 is input to the video encoding device 1014 as the encoding-target panoramic image. The video encoding device 1014 encodes the encoding-target panoramic image and outputs the encoded panoramic image as a bitstream. The encoding-target panoramic image is divided into a plurality of blocks, and each block is input to the subtraction unit 1105 and the motion compensation unit 1110 as an encoding-target block. The encoding-target block corresponds to the encoding-target region.

Using the motion vectors output from the motion compensation unit 1110, the determination unit 1103 determines a global vector representing the amount of displacement of the encoding-target panoramic image relative to the reference panoramic image stored in the frame memory 1112. The determination unit 1103 then outputs the determined global vector to the judgment unit 1102, the changing unit 1104, and the ENT 1107.

The determination unit 1103 may use the global vector of an already encoded panoramic image as the global vector of the encoding-target panoramic image, or may determine the global vector based on the result of motion estimation performed on a reduced panoramic image, which requires less processing.
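This passage does not state how the block motion vectors are reduced to a single global vector. As one plausible sketch only, the Python example below picks the most frequent block motion vector; the rule itself and the function name `estimate_global_vector` are assumptions of this example.

```python
from collections import Counter


def estimate_global_vector(motion_vectors):
    """Return the most frequent (dx, dy) motion vector, or (0, 0) if none."""
    if not motion_vectors:
        return (0, 0)
    vector, _count = Counter(motion_vectors).most_common(1)[0]
    return vector


# Hypothetical block motion vectors: most blocks moved 4 pixels to the right.
block_vectors = [(4, 0), (4, 0), (3, 0), (4, 0), (0, 1)]
global_vector = estimate_global_vector(block_vectors)
```

Other reductions, such as a per-component median, would fit the description equally well. The mode is used here because it is unaffected by a small number of blocks that contain independently moving objects.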

Based on the global vector, the judgment unit 1102 judges whether to change the position of each encoding-target block in the encoding-target panoramic image, and outputs the global vector and the judgment result to the changing unit 1101.

When the judgment result indicates that the position of each encoding-target block is to be changed, the changing unit 1101 changes the position of each encoding-target block based on the global vector and outputs the changed encoding-target block to the subtraction unit 1105 and the motion compensation unit 1110. When the judgment result indicates that the position is not to be changed, the changing unit 1101 outputs each encoding-target block unchanged to the subtraction unit 1105 and the motion compensation unit 1110.

The subtraction unit 1105 outputs to the T/Q 1106 a prediction error signal representing the difference between the encoding-target block and the predicted block image output from the predicted image generation unit 1111. The T/Q 1106 converts the prediction error signal into a frequency signal by an orthogonal transform and quantizes the frequency signal to generate coefficient information. The orthogonal transform is, for example, a discrete cosine transform or a discrete wavelet transform. The T/Q 1106 then outputs the generated coefficient information to the ENT 1107 and the IQ/IT 1108.

ＩＱ／ＩＴ１１０８は、Ｔ／Ｑ１１０６から出力される係数情報を逆量子化して、周波数信号を生成し、逆直交変換によって周波数信号を再構成予測誤差信号に変換する。そして、ＩＱ／ＩＴ１１０８は、再構成予測誤差信号を加算部１１０９へ出力する。 IQ / IT1108 inversely quantizes the coefficient information output from T / Q1106 to generate a frequency signal, and converts the frequency signal into a reconstruction prediction error signal by inverse orthogonal transformation. Then, IQ / IT1108 outputs the reconstruction prediction error signal to the addition unit 1109.

加算部１１０９は、予測画像生成部１１１１から出力される予測ブロック画像と、再構成予測誤差信号とを加算することで、復号ブロック画像を生成し、生成した復号ブロック画像をフレームメモリ１１１２へ出力する。符号化対象ブロックの位置が変更されている場合、変更部１１０４は、グローバルベクトルに基づいて、復号ブロック画像の位置を変更前の符号化対象ブロックの位置に戻す。 The addition unit 1109 generates a decoded block image by adding the prediction block image output from the prediction image generation unit 1111 and the reconstructed prediction error signal, and outputs the generated decoded block image to the frame memory 1112. When the position of the coding target block has been changed, the changing unit 1104 returns the position of the decoded block image to the position of the coding target block before the change, based on the global vector.
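The local decoding loop formed by the subtraction unit 1105, the T/Q 1106, the IQ/IT 1108, and the addition unit 1109 can be sketched as follows. This is only a minimal one-dimensional illustration: the function names, the sample values, the uniform quantization step `qstep`, and the choice of an orthonormal DCT are all assumptions made for the example and are not taken from the embodiment.

```python
import math

def dct(x):
    """Orthonormal 1-D DCT-II of a list of samples."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        out.append(scale * sum(
            x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)))
    return out

def idct(c):
    """Inverse of dct() above (orthonormal DCT-III)."""
    n = len(c)
    out = []
    for i in range(n):
        out.append(sum(
            math.sqrt((1 if k == 0 else 2) / n) * c[k]
            * math.cos(math.pi * (i + 0.5) * k / n) for k in range(n)))
    return out

def encode_block(block, prediction, qstep):
    """Subtraction + transform + quantization: returns integer levels."""
    residual = [b - p for b, p in zip(block, prediction)]
    return [round(c / qstep) for c in dct(residual)]

def reconstruct_block(levels, prediction, qstep):
    """Inverse quantization + inverse transform + addition."""
    residual = idct([level * qstep for level in levels])
    return [p + r for p, r in zip(prediction, residual)]

# Hypothetical 8-sample block and its prediction.
block = [52, 55, 61, 66, 70, 61, 64, 73]
prediction = [50, 54, 60, 64, 68, 62, 65, 70]
levels = encode_block(block, prediction, qstep=2)
decoded = reconstruct_block(levels, prediction, qstep=2)
```

Because the encoder reconstructs `decoded` from the same quantized levels that are sent to the decoder, both sides hold the same reference data even though quantization is lossy.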

フレームメモリ１１１２は、復号ブロック画像を蓄積し、蓄積した復号ブロック画像を参照画像として、動き補償部１１１０及び予測画像生成部１１１１へ出力する。符号化対象パノラマ画像内の複数の符号化対象ブロックそれぞれから生成された複数の復号ブロック画像は、参照パノラマ画像に対応する。 The frame memory 1112 accumulates the decoded block image and outputs the accumulated decoded block image as a reference image to the motion compensation unit 1110 and the predicted image generation unit 1111. The plurality of decoded block images generated from each of the plurality of coded blocks in the coded panoramic image correspond to the reference panoramic image.

動き補償部１１１０は、参照画像を用いて符号化対象ブロックに対する動き推定を行うことで、動きベクトルを生成し、生成した動きベクトルを決定部１１０３へ出力する。動き推定では、例えば、１枚の参照パノラマ画像を用いた片方向予測、又は２枚以上の参照パノラマ画像を用いた双方向予測が行われる。そして、動き補償部１１１０は、参照パノラマ画像から動きベクトルが示す参照画像を取得して、予測画像生成部１１１１へ出力する。 The motion compensation unit 1110 generates a motion vector by performing motion estimation for the coded block using the reference image, and outputs the generated motion vector to the determination unit 1103. In motion estimation, for example, one-way prediction using one reference panoramic image or two-way prediction using two or more reference panoramic images is performed. Then, the motion compensation unit 1110 acquires the reference image indicated by the motion vector from the reference panoramic image and outputs it to the prediction image generation unit 1111.

予測画像生成部１１１１は、参照画像を用いて、符号化対象パノラマ画像内の既に符号化された周辺画素の画素値から、符号化対象ブロックのイントラ予測ブロック画像を生成する。また、予測画像生成部１１１１は、動き補償部１１１０から出力される参照画像をインター予測ブロック画像として使用する。そして、予測画像生成部１１１１は、イントラ予測ブロック画像又はインター予測ブロック画像のいずれか一方を選択し、選択した予測ブロック画像を減算部１１０５及び加算部１１０９へ出力する。 The prediction image generation unit 1111 uses the reference image to generate an intra prediction block image of the coding target block from the pixel values of the peripheral pixels already encoded in the coding target panorama image. Further, the prediction image generation unit 1111 uses the reference image output from the motion compensation unit 1110 as the inter-prediction block image. Then, the prediction image generation unit 1111 selects either the intra prediction block image or the inter prediction block image, and outputs the selected prediction block image to the subtraction unit 1105 and the addition unit 1109.

ＥＮＴ１１０７は、Ｔ／Ｑ１１０６から出力される係数情報、イントラ予測又はインター予測の予測モードの情報、及び決定部１１０３から出力されるグローバルベクトルの情報をエントロピー符号化する。エントロピー符号化では、信号中の各シンボルの出現頻度に応じて、可変長符号が割り当てられる。そして、ＥＮＴ１１０７は、可変長符号を含むビットストリームを出力する。 The ENT1107 entropy-codes the coefficient information output from the T / Q1106, the information in the prediction mode of the intra-prediction or the inter-prediction, and the information of the global vector output from the determination unit 1103. In entropy coding, a variable length code is assigned according to the frequency of appearance of each symbol in the signal. Then, the ENT 1107 outputs a bit stream including a variable length code.
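The idea of assigning variable-length codes according to symbol frequency can be illustrated with a small Huffman code-length calculation. Huffman coding is used here only as a familiar stand-in; HEVC itself uses context-adaptive binary arithmetic coding, and the function name and sample symbols below are assumptions for the example.

```python
import heapq
from collections import Counter

def huffman_code_lengths(symbols):
    """Return {symbol: code length in bits} for a Huffman code."""
    counts = Counter(symbols)
    if not counts:
        return {}
    if len(counts) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return {next(iter(counts)): 1}
    # Heap entries: (count, unique tie-breaker, {symbol: depth so far}).
    heap = [(count, idx, {sym: 0})
            for idx, (sym, count) in enumerate(counts.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        c1, _, d1 = heapq.heappop(heap)
        c2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every contained symbol one level deeper.
        merged = {s: depth + 1 for s, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (c1 + c2, next_id, merged))
        next_id += 1
    return heap[0][2]

# Hypothetical quantized coefficient levels: 0 is by far the most frequent.
levels = [0] * 8 + [1] * 3 + [2] * 2 + [3]
lengths = huffman_code_lengths(levels)
```

In this example the most frequent symbol receives the shortest code and the rarest symbols receive the longest codes.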

図１２は、変更部１１０１が行う補正処理の例を示している。この例では、全周囲パノラマ画像内の複数の位置を区別するために、０～９の番号が用いられている。全周囲パノラマ画像の場合、図１に示したように、左端の位置０の画像と右端の位置９の画像とが連続しているため、この連続性を利用して補正処理を行うことができる。 FIG. 12 shows an example of the correction process performed by the changing unit 1101. In this example, the numbers 0 to 9 are used to distinguish a plurality of positions in the omnidirectional panoramic image. In an omnidirectional panoramic image, as shown in FIG. 1, the image at the leftmost position 0 and the image at the rightmost position 9 are continuous, so the correction process can exploit this continuity.

参照パノラマ画像１２０１内の各被写体の位置に対して、符号化対象パノラマ画像１２０２内の各被写体の位置が一斉に右へずれた場合、位置０～位置７のブロックに対して同じ動きベクトル１２１１が生成される。しかし、このままでは、位置８及び位置９のブロックに対して、動きベクトル１２１１とは全く異なる動きベクトルが生成される。 When the positions of the subjects in the coding target panoramic image 1202 are all shifted to the right relative to the positions of the subjects in the reference panoramic image 1201, the same motion vector 1211 is generated for the blocks at positions 0 to 7. If nothing is done, however, motion vectors entirely different from the motion vector 1211 are generated for the blocks at positions 8 and 9.

そこで、決定部１１０３は、動きベクトル１２１１をグローバルベクトルに決定し、変更部１１０１は、符号化対象パノラマ画像１２０２内の各ブロックの位置を、動きベクトル１２１１に従って左へずらす。そして、変更部１１０１は、符号化対象パノラマ画像１２０２からはみ出した位置８及び位置９のブロックの画像を、符号化対象パノラマ画像１２０２の右端に付け加えることで、符号化対象パノラマ画像１２０３を生成する。この場合、符号化対象パノラマ画像１２０３の起点が位置８から位置０に変更され、参照パノラマ画像１２０１の起点に一致する。 Therefore, the determination unit 1103 determines the motion vector 1211 as a global vector, and the change unit 1101 shifts the position of each block in the coded target panorama image 1202 to the left according to the motion vector 1211. Then, the changing unit 1101 generates the coded target panorama image 1203 by adding the image of the block at the position 8 and the position 9 protruding from the coded target panorama image 1202 to the right end of the coded target panorama image 1202. In this case, the starting point of the coded panoramic image 1203 is changed from the position 8 to the position 0, which coincides with the starting point of the reference panoramic image 1201.

これにより、参照パノラマ画像１２０１内の各被写体に対する、符号化対象パノラマ画像１２０３内の各被写体の動きがほぼ０になるため、生成される動きベクトルを最小限に抑えることが可能になる。 As a result, the movement of each subject in the coded panoramic image 1203 with respect to each subject in the reference panoramic image 1201 becomes almost 0, so that the generated motion vector can be minimized.
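The correction of FIG. 12 amounts to a circular shift of the block row. The following is a minimal sketch under the assumption that each block is represented simply by its position number and that the global vector corresponds to a horizontal shift of two blocks; the function name is hypothetical.

```python
def shift_left_wrap(blocks, shift):
    """Rotate a row of blocks left by `shift`, re-attaching overflow on the right."""
    if not blocks:
        return []
    k = shift % len(blocks)
    return blocks[k:] + blocks[:k]

# Reference panoramic image 1201: positions 0..9, starting at position 0.
reference_1201 = list(range(10))

# Coding target panoramic image 1202: every subject has moved two positions
# to the right, so the row now starts at position 8.
target_1202 = [8, 9, 0, 1, 2, 3, 4, 5, 6, 7]

# Shifting left by the global shift moves positions 8 and 9 to the right end
# and yields image 1203, whose starting point matches the reference.
target_1203 = shift_left_wrap(target_1202, 2)
```

After the shift, every block of `target_1203` is aligned with the block at the same index in `reference_1201`, which is why the residual motion becomes close to zero.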

図１３は、人物を含む全周囲パノラマ映像の例を示している。パノラマ画像１３０１、パノラマ画像１３０２、及びパノラマ画像１３０３は、それぞれ、時刻Ｔ、時刻Ｔ＋１、及び時刻Ｔ＋２における符号化対象パノラマ画像を表す。これらの３枚のパノラマ画像に写っている人物の位置は変化しておらず、代わりに、背景の被写体の位置が時間の経過とともに右へずれている。この場合、動きのある背景の面積の方が静止している人物の面積よりも大きいため、背景が静止するように、各パノラマ画像の起点を左へずらすことが好ましい。 FIG. 13 shows an example of an omnidirectional panoramic image including a person. The panoramic image 1301, the panoramic image 1302, and the panoramic image 1303 represent the coded panoramic images at time T, time T + 1, and time T + 2, respectively. The position of the person in these three panoramic images has not changed, but instead the position of the subject in the background has shifted to the right over time. In this case, since the area of the moving background is larger than the area of the stationary person, it is preferable to shift the starting point of each panoramic image to the left so that the background is stationary.

図１４は、図１３の全周囲パノラマ映像に対する補正処理の例を示している。パノラマ画像１３０２の動き推定において、パノラマ画像１３０１が参照パノラマ画像として用いられる場合、決定部１１０３は、パノラマ画像１３０１の背景に対するパノラマ画像１３０２の背景の動きを表わす動きベクトルを、グローバルベクトルに決定する。 FIG. 14 shows an example of correction processing for the omnidirectional panoramic image of FIG. When the panoramic image 1301 is used as a reference panoramic image in the motion estimation of the panoramic image 1302, the determination unit 1103 determines a motion vector representing the motion of the background of the panoramic image 1302 with respect to the background of the panoramic image 1301 as a global vector.

そして、変更部１１０１は、パノラマ画像１３０２内の各ブロックの位置を、グローバルベクトルに従って左へずらし、パノラマ画像１３０２からはみ出したブロックの画像を右端に付け加えることで、パノラマ画像１４０１を生成する。パノラマ画像１３０２の代わりにパノラマ画像１４０１を符号化すれば、面積の大きな背景に対する動きベクトルはほとんど発生せず、面積の小さな人物に集中して動きベクトルが発生する。したがって、発生する動きベクトル情報を抑制することができ、符号化効率が向上する。 Then, the changing unit 1101 shifts the position of each block in the panoramic image 1302 to the left according to the global vector, and adds the image of the block protruding from the panoramic image 1302 to the right end to generate the panoramic image 1401. If the panoramic image 1401 is encoded instead of the panoramic image 1302, a motion vector for a background having a large area is hardly generated, and a motion vector is generated concentrated on a person having a small area. Therefore, the generated motion vector information can be suppressed, and the coding efficiency is improved.

パノラマ画像１３０３の動き推定において、パノラマ画像１３０２が参照パノラマ画像として用いられる場合も、同様の補正処理が行われる。この場合、決定部１１０３は、パノラマ画像１３０２の背景に対するパノラマ画像１３０３の背景の動きを表わす動きベクトルを、グローバルベクトルに決定する。 When the panoramic image 1302 is used as the reference panoramic image in the motion estimation of the panoramic image 1303, the same correction processing is performed. In this case, the determination unit 1103 determines a motion vector representing the movement of the background of the panoramic image 1303 with respect to the background of the panoramic image 1302 as a global vector.

そして、変更部１１０１は、パノラマ画像１３０３内の各ブロックの位置を、グローバルベクトルに従って左へずらし、パノラマ画像１３０３からはみ出したブロックの画像を右端に付け加えることで、パノラマ画像１４０２を生成する。パノラマ画像１３０３の代わりにパノラマ画像１４０２を符号化することで、符号化効率が向上する。 Then, the changing unit 1101 shifts the position of each block in the panoramic image 1303 to the left according to the global vector, and appends the images of the blocks that protrude from the panoramic image 1303 to the right end, thereby generating the panoramic image 1402. Encoding the panoramic image 1402 instead of the panoramic image 1303 improves the coding efficiency.

動画像符号化装置１０１４が生成した符号化パノラマ画像を復号する際は、グローバルベクトルに基づいて、符号化パノラマ画像内の各ブロックの位置を逆方向にずらすことで、パノラマ画像１３０２及びパノラマ画像１３０３を復元することができる。 When a coded panoramic image generated by the moving image coding device 1014 is decoded, the panoramic image 1302 and the panoramic image 1303 can be restored by shifting the position of each block in the coded panoramic image in the opposite direction based on the global vector.

このように、グローバルベクトルに基づいて符号化対象パノラマ画像の起点を変更することで、全周囲パノラマ画像を切断する境界線の位置を、画像の動きに合わせて適応的に変更することが可能になる。起点の変更は、前方向予測の動きに応じて実施することが好ましい。例えば、前方向予測ピクチャ（Ｐピクチャ）のみを対象として起点を変更してもよい。 In this way, by changing the starting point of the coding target panoramic image based on the global vector, the position of the boundary line at which the omnidirectional panoramic image is cut can be changed adaptively in accordance with the motion in the image. The starting point is preferably changed according to the motion of forward prediction. For example, the starting point may be changed only for forward prediction pictures (P pictures).

ＥＮＴ１１０７は、グローバルベクトルの情報をヘッダ情報としてビットストリームに挿入することができる。この場合、ビットストリームを受信した動画像復号装置は、ヘッダ情報を参照することでグローバルベクトルを取得し、符号化パノラマ画像内の各ブロックの位置を元の位置に戻すことができる。 The ENT1107 can insert the global vector information into the bitstream as header information. In this case, the moving image decoding device that has received the bitstream can acquire the global vector by referring to the header information and return the position of each block in the coded panoramic image to the original position.
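The round trip between the coding side and the decoding side can be sketched as follows. The dictionary standing in for the header information and the key name `global_shift` are assumptions made for this example; the actual header syntax is not modeled.

```python
def rotate_left(blocks, shift):
    """Rotate a row of blocks left by `shift` (a negative shift rotates right)."""
    if not blocks:
        return []
    k = shift % len(blocks)
    return blocks[k:] + blocks[:k]

def encode_side(blocks, global_shift):
    """Shift the blocks and record the shift in a stand-in header."""
    header = {"global_shift": global_shift}  # hypothetical header field
    return header, rotate_left(blocks, global_shift)

def decode_side(header, shifted_blocks):
    """Read the shift from the header and move the blocks back."""
    return rotate_left(shifted_blocks, -header["global_shift"])

original = [8, 9, 0, 1, 2, 3, 4, 5, 6, 7]
header, coded = encode_side(original, 2)
restored = decode_side(header, coded)
```

Since the decoder applies the same shift in the opposite direction, `restored` is identical to `original` regardless of the shift amount.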

図１５は、図１１の動画像符号化装置１０１４が行う動画像符号化処理の具体例を示すフローチャートである。動画像符号化装置１０１４は、符号化対象映像に含まれる各時刻の画像を符号化対象画像として、符号化対象画像をブロック毎に符号化する。 FIG. 15 is a flowchart showing a specific example of the moving image coding process performed by the moving image coding device 1014 of FIG. The moving image coding device 1014 encodes the coded target image block by block, using the image at each time included in the coded target image as the coded target image.

まず、変更部１１０１は、符号化対象映像が全周囲パノラマ映像であるか否かをチェックする（ステップ１５０１）。 First, the change unit 1101 checks whether or not the coded image is an omnidirectional panoramic image (step 1501).

符号化対象映像が全周囲パノラマ映像ではない場合（ステップ１５０１，Ｎｏ）、変更部１１０１は、符号化対象ブロックの位置を変更することなく、そのまま減算部１１０５及び動き補償部１１１０へ出力する。そして、動画像符号化装置１０１４は、符号化対象ブロックを符号化する（ステップ１５０２）。 When the coding target video is not an omnidirectional panoramic video (step 1501, No), the changing unit 1101 outputs the coding target block as it is to the subtraction unit 1105 and the motion compensation unit 1110 without changing its position. Then, the moving image coding device 1014 encodes the coding target block (step 1502).

一方、符号化対象映像が全周囲パノラマ映像である場合（ステップ１５０１，Ｙｅｓ）、決定部１１０３は、符号化対象パノラマ画像のグローバルベクトルを決定する（ステップ１５０５）。 On the other hand, when the coding target video is an omnidirectional panoramic video (step 1501, Yes), the determination unit 1103 determines the global vector of the coding target panoramic image (step 1505).

例えば、決定部１１０３は、符号化対象パノラマ画像内で発生する動きベクトルのうち、他の動きベクトルよりも多く発生する動きベクトルをグローバルベクトルに決定することができる。他の動きベクトルよりも多く発生する動きベクトルは、符号化対象パノラマ画像内で最も多く発生する動きベクトルであってもよい。この場合、動き補償部１１１０は、事前に符号化対象パノラマ画像内のすべてのブロックに対する動き推定を実施し、各ブロックの動きベクトルを求めて、決定部１１０３へ出力する。そして、決定部１１０３は、発生数が最も多い動きベクトルを選択して、グローバルベクトルに決定する。 For example, the determination unit 1103 can determine a motion vector that occurs more than other motion vectors among the motion vectors that occur in the coded panoramic image as a global vector. The motion vector that occurs more than the other motion vectors may be the motion vector that occurs most in the coded panoramic image. In this case, the motion compensation unit 1110 performs motion estimation for all the blocks in the coded panoramic image in advance, obtains the motion vector of each block, and outputs the motion vector to the determination unit 1103. Then, the determination unit 1103 selects the motion vector having the largest number of occurrences and determines it as a global vector.

発生数が最も多い動きベクトルを選択することで、符号化対象パノラマ画像全体の動きを表わすベクトルをグローバルベクトルに決定することができる。例えば、背景の面積が前景の面積よりも大きく、背景に一定の動きが見られる場合、背景の動きを表わすベクトルがグローバルベクトルとして採用される。また、前景がクローズアップされており、前景の面積が背景の面積よりも大きい場合は、前景の動きを表わすベクトルがグローバルベクトルとして採用される。 By selecting the motion vector with the largest number of occurrences, the vector representing the motion of the entire panoramic image to be encoded can be determined as the global vector. For example, when the area of the background is larger than the area of the foreground and a certain movement is seen in the background, the vector representing the movement of the background is adopted as the global vector. Further, when the foreground is close-up and the area of the foreground is larger than the area of the background, the vector representing the movement of the foreground is adopted as the global vector.
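Selecting the most frequently occurring motion vector can be sketched with a simple frequency count. The representation of a motion vector as an `(x, y)` tuple, the function name, and the sample vectors are assumptions for the example; the sign convention of the vectors is arbitrary here.

```python
from collections import Counter

def pick_global_vector(motion_vectors):
    """Return (most frequent vector, its occurrence count)."""
    if not motion_vectors:
        raise ValueError("at least one motion vector is required")
    counts = Counter(motion_vectors)
    vector, count = counts.most_common(1)[0]
    return vector, count

# Hypothetical per-block vectors for a ten-block row like FIG. 12:
# eight blocks share one vector, two blocks near the seam differ.
block_vectors = [(2, 0)] * 8 + [(-8, 0), (-8, 0)]
global_vector, num_gmv = pick_global_vector(block_vectors)
```

The returned count can also be reused as the number of occurrences of the global vector when a later step decides whether the correction is worthwhile.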

次に、変更部１１０１は、符号化対象ブロックが前方向予測ピクチャのブロックであるか否かをチェックする（ステップ１５０６）。符号化対象ブロックが前方向予測ピクチャのブロックではない場合（ステップ１５０６，Ｎｏ）、動画像符号化装置１０１４は、ステップ１５０２以降の処理を行う。 Next, the changing unit 1101 checks whether or not the coding target block is a block of a forward prediction picture (step 1506). When the coding target block is not a block of a forward prediction picture (step 1506, No), the moving image coding device 1014 performs the processing from step 1502 onward.

一方、符号化対象ブロックが前方向予測ピクチャのブロックである場合（ステップ１５０６，Ｙｅｓ）、変更部１１０１及び判定部１１０２は、グローバルベクトルに基づいて補正処理を行う（ステップ１５０７）。 On the other hand, when the coding target block is a block of a forward prediction picture (step 1506, Yes), the changing unit 1101 and the determination unit 1102 perform the correction process based on the global vector (step 1507).

この補正処理において、判定部１１０２が符号化対象ブロックを補正しないと判定した場合、動画像符号化装置１０１４は、ステップ１５０２以降の処理を行う。一方、判定部１１０２が符号化対象ブロックを補正すると判定した場合、変更部１１０１は、符号化対象ブロックの位置を、グローバルベクトルの向きにグローバルベクトルの大きさだけずらすことで、補正された符号化対象ブロックを生成する。 In this correction process, when the determination unit 1102 determines that the coding target block is not to be corrected, the moving image coding device 1014 performs the processing from step 1502 onward. On the other hand, when the determination unit 1102 determines that the coding target block is to be corrected, the changing unit 1101 generates a corrected coding target block by shifting the position of the coding target block in the direction of the global vector by the magnitude of the global vector.

次に、ＥＮＴ１１０７は、グローバルベクトルの情報を含むヘッダ情報を生成し（ステップ１５０８）、動画像符号化装置１０１４は、補正された符号化対象ブロックを符号化する（ステップ１５０９）。そして、変更部１１０４は、グローバルベクトルに基づいて、復号ブロック画像の位置を元の位置に戻す（ステップ１５１０）。このとき、変更部１１０４は、復号ブロック画像の位置を、グローバルベクトルとは逆の向きにグローバルベクトルの大きさだけずらすことで、補正された復号ブロック画像を生成して、フレームメモリ１１１２に書き込む。 Next, the ENT 1107 generates header information including the global vector information (step 1508), and the moving image coding device 1014 encodes the corrected coded block (step 1509). Then, the change unit 1104 returns the position of the decoded block image to the original position based on the global vector (step 1510). At this time, the change unit 1104 generates a corrected decoded block image by shifting the position of the decoded block image by the size of the global vector in the direction opposite to the global vector, and writes the corrected block image to the frame memory 1112.

次に、変更部１１０１は、符号化対象画像内のすべてのブロックを符号化したか否かをチェックする（ステップ１５０３）。符号化していないブロックが残っている場合（ステップ１５０３，Ｎｏ）、動画像符号化装置１０１４は、次のブロックを符号化対象ブロックとして、ステップ１５０１以降の処理を繰り返す。 Next, the changing unit 1101 checks whether or not all the blocks in the coding target image have been encoded (step 1503). When unencoded blocks remain (step 1503, No), the moving image coding device 1014 repeats the processing from step 1501 onward with the next block as the coding target block.

すべてのブロックを符号化した場合（ステップ１５０３，Ｙｅｓ）、変更部１１０１は、符号化対象映像に含まれるすべての画像を符号化したか否かをチェックする（ステップ１５０４）。符号化していない画像が残っている場合（ステップ１５０４，Ｎｏ）、動画像符号化装置１０１４は、次の画像を符号化対象画像として、ステップ１５０１以降の処理を繰り返す。そして、すべての画像を符号化した場合（ステップ１５０４，Ｙｅｓ）、動画像符号化装置１０１４は、処理を終了する。 When all the blocks have been encoded (step 1503, Yes), the changing unit 1101 checks whether or not all the images included in the coding target video have been encoded (step 1504). When unencoded images remain (step 1504, No), the moving image coding device 1014 repeats the processing from step 1501 onward with the next image as the coding target image. When all the images have been encoded (step 1504, Yes), the moving image coding device 1014 ends the process.

例えば、動画像符号化方式がＨＥＶＣである場合、ステップ１５０８において、ＥＮＴ１１０７は、ヘッダ情報に含まれるユーザデータを用いて、グローバルベクトルの情報を送信することができる。 For example, when the moving image coding method is HEVC, in step 1508, the ENT 1107 can transmit the global vector information by using the user data included in the header information.

具体的には、Supplemental Enhancement Information（ＳＥＩ）のpayloadType=5のuser_data_unregistered(payloadSize)を用いることができる。この場合、グローバルベクトルのＸ成分及びＹ成分のビット数として、それぞれ、log2_max_mv_length_horizontal及びlog2_max_mv_length_verticalで示されるビット数を確保すればよい。あるいは、そのsyntax elementの最大値である１６ビットを確保してもよい。 Specifically, user_data_unregistered (payloadSize) of payloadType = 5 of Supplemental Enhancement Information (SEI) can be used. In this case, as the number of bits of the X component and the Y component of the global vector, the number of bits indicated by log2_max_mv_length_horizontal and log2_max_mv_length_vertical may be secured, respectively. Alternatively, 16 bits, which is the maximum value of the syntax element, may be secured.

ユーザデータが他の目的で使用されることを考慮して、グローバルベクトルを示す所定の識別情報をグローバルベクトルの前に挿入したＳＥＩ（ユーザデータ）を用いることも好ましい。所定の識別情報としては、例えば、グローバルベクトルを表わす所定の文字列に対応するAmerican Standard Code for Information Interchange（ＡＳＣＩＩ）コードを用いることができる。 Considering that the user data is used for other purposes, it is also preferable to use SEI (user data) in which predetermined identification information indicating the global vector is inserted before the global vector. As the predetermined identification information, for example, an American Standard Code for Information Interchange (ASCII) code corresponding to a predetermined character string representing a global vector can be used.
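One possible layout of such user data can be sketched as follows. In the HEVC syntax, the payload of user_data_unregistered begins with a 16-byte UUID followed by free-form payload bytes; everything else here is an assumption for illustration, including the placeholder UUID, the ASCII tag `GMV`, and the use of signed 16-bit big-endian fields for the X and Y components. NAL unit wrapping and emulation prevention are not modeled.

```python
import struct

EXAMPLE_UUID = bytes(range(16))  # placeholder UUID, not a registered value
GMV_TAG = b"GMV"                 # hypothetical ASCII identifier string

def pack_gmv_payload(gx, gy, uuid=EXAMPLE_UUID, tag=GMV_TAG):
    """Build payload bytes: UUID, ASCII tag, then X and Y as signed 16-bit."""
    if len(uuid) != 16:
        raise ValueError("uuid must be exactly 16 bytes")
    return uuid + tag + struct.pack(">hh", gx, gy)

def unpack_gmv_payload(payload, tag=GMV_TAG):
    """Return (gx, gy), or None when the payload carries some other user data."""
    body = payload[16:]
    if not body.startswith(tag) or len(body) < len(tag) + 4:
        return None
    return struct.unpack(">hh", body[len(tag):len(tag) + 4])

payload = pack_gmv_payload(-2, 0)
vector = unpack_gmv_payload(payload)
```

Checking the tag before reading the vector lets a decoder ignore user data that was inserted for other purposes.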

図１５の動画像符号化処理によれば、全周囲パノラマ映像に含まれる前方向予測ピクチャを符号化する際に、符号化対象パノラマ画像全体の動きに応じて、その符号化対象パノラマ画像の起点を変更することができる。これにより、符号化対象ブロックに対して生成される動きベクトル情報を抑制することが可能になるとともに、符号化対象パノラマ画像内の全ブロックに対して適切な動き推定を行うことが可能になる。したがって、ＭＥ効率及び符号化効率が向上する。 According to the moving image coding process of FIG. 15, when a forward prediction picture included in an omnidirectional panoramic video is encoded, the starting point of the coding target panoramic image can be changed according to the motion of the entire coding target panoramic image. This makes it possible to suppress the motion vector information generated for the coding target blocks and to perform appropriate motion estimation for all the blocks in the coding target panoramic image. Therefore, the ME efficiency and the coding efficiency are improved.

図１６は、図１５のステップ１５０７において、符号化対象ブロックを補正するか否かを動きベクトルの発生数を用いて判定する補正処理の例を示すフローチャートである。まず、判定部１１０２は、符号化対象パノラマ画像内における動きベクトルの総数Ｎｕｍ＿ＡｌｌＭＶに対するグローバルベクトルの発生数Ｎｕｍ＿ＧＭＶの比率Ｒを、閾値ＴＨ１と比較する（ステップ１６０１）。 FIG. 16 is a flowchart showing an example of a correction process in which, in step 1507 of FIG. 15, it is determined whether or not to correct the coded block by using the number of motion vector generations. First, the determination unit 1102 compares the ratio R of the number of occurrences of global vectors Num_GMV to the total number of motion vectors Num_AllMV in the coded panoramic image with the threshold value TH1 (step 1601).

ＲがＴＨ１以上である場合（ステップ１６０１，Ｙｅｓ）、判定部１１０２は、符号化対象ブロックを補正すると判定する。そして、変更部１１０１は、グローバルベクトルを用いて符号化対象ブロックの位置をずらすことで、補正された符号化対象ブロックを生成し（ステップ１６０２）、動画像符号化装置１０１４は、ステップ１５０８以降の処理を行う。 When R is equal to or greater than TH1 (step 1601, Yes), the determination unit 1102 determines that the coding target block is to be corrected. Then, the changing unit 1101 generates a corrected coding target block by shifting the position of the coding target block using the global vector (step 1602), and the moving image coding device 1014 performs the processing from step 1508 onward.

一方、ＲがＴＨ１未満である場合（ステップ１６０１，Ｎｏ）、判定部１１０２は、符号化対象ブロックを補正しないと判定し、動画像符号化装置１０１４は、ステップ１５０２以降の処理を行う。 On the other hand, when R is less than TH1 (step 1601, No), the determination unit 1102 determines that the coding target block is not to be corrected, and the moving image coding device 1014 performs the processing from step 1502 onward.

図１６の補正処理によれば、決定部１１０３が決定したグローバルベクトルが符号化対象パノラマ画像内において所定値以上の比率で発生している場合に、符号化対象ブロックの位置が変更される。これにより、符号化対象パノラマ画像全体に発生している一定の動きに応じて、符号化対象ブロックの位置をずらすことができる。 According to the correction process of FIG. 16, when the global vector determined by the determination unit 1103 is generated at a ratio of a predetermined value or more in the coded target panoramic image, the position of the coded target block is changed. As a result, the position of the coded target block can be shifted according to the constant movement occurring in the entire coded target panoramic image.
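The determination of step 1601 reduces to a single ratio test, which can be sketched as follows. The function name and the sample threshold of 0.5 are assumptions; the source does not give a concrete value for TH1.

```python
def should_correct_by_ratio(num_gmv, num_all_mv, th1):
    """Return True when R = Num_GMV / Num_AllMV is at least TH1."""
    if num_all_mv <= 0:
        # No motion vectors at all: nothing to base a correction on.
        return False
    ratio = num_gmv / num_all_mv
    return ratio >= th1

# Hypothetical counts: 8 of 10 blocks share the global vector.
decision_majority = should_correct_by_ratio(8, 10, th1=0.5)
# Only 3 of 10 blocks share it: the motion is not global enough.
decision_minority = should_correct_by_ratio(3, 10, th1=0.5)
```

A ratio exactly equal to TH1 counts as a Yes in step 1601, which is why the comparison is inclusive.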

図１７は、図１５のステップ１５０７において、符号化対象ブロックを補正するか否かをＭＥ効率を用いて判定する補正処理の例を示すフローチャートである。 FIG. 17 is a flowchart showing an example of a correction process in which it is determined by using the ME efficiency whether or not to correct the coded block in step 1507 of FIG.

まず、判定部１１０２は、符号化対象パノラマ画像を補正しない場合の動き推定を、動き補償部１１１０に対して要求する（ステップ１７０１）。動き補償部１１１０は、符号化対象パノラマ画像内のすべてのブロックに対する動き推定を実施して、各ブロックの動きベクトルを求める。そして、動き補償部１１１０は、参照パノラマ画像と符号化対象パノラマ画像との間の差分絶対値和（ＳＡＤ）を計算し、計算したＳＡＤをＳＡＤ１として判定部１１０２へ出力する。 First, the determination unit 1102 requests the motion compensation unit 1110 to estimate the motion when the coded panoramic image is not corrected (step 1701). The motion compensation unit 1110 performs motion estimation for all blocks in the coded panoramic image and obtains a motion vector for each block. Then, the motion compensation unit 1110 calculates the difference absolute value sum (SAD) between the reference panoramic image and the coded target panoramic image, and outputs the calculated SAD as SAD1 to the determination unit 1102.

次に、判定部１１０２は、符号化対象パノラマ画像を補正した場合の動き推定を、動き補償部１１１０に対して要求する（ステップ１７０２）。動き補償部１１１０は、グローバルベクトルを用いて符号化対象ブロックの位置をずらすことで、補正された符号化対象ブロックを生成する。次に、動き補償部１１１０は、補正された符号化対象パノラマ画像内のすべてのブロックに対する動き推定を実施して、各ブロックの動きベクトルを求める。そして、動き補償部１１１０は、参照パノラマ画像と補正された符号化対象パノラマ画像との間のＳＡＤを計算し、計算したＳＡＤをＳＡＤ２として判定部１１０２へ出力する。 Next, the determination unit 1102 requests the motion compensation unit 1110 to estimate the motion when the coded panoramic image is corrected (step 1702). The motion compensation unit 1110 generates a corrected coded object block by shifting the position of the coded object block using a global vector. Next, the motion compensation unit 1110 performs motion estimation for all the blocks in the corrected panoramic image to be coded, and obtains a motion vector for each block. Then, the motion compensation unit 1110 calculates the SAD between the reference panoramic image and the corrected panoramic image to be encoded, and outputs the calculated SAD as SAD2 to the determination unit 1102.

次に、判定部１１０２は、ＳＡＤ１とＳＡＤ２を比較する（ステップ１７０３）。ＳＡＤ２がＳＡＤ１よりも小さい場合（ステップ１７０３，Ｙｅｓ）、判定部１１０２は、符号化対象ブロックを補正すると判定する。そして、変更部１１０１は、グローバルベクトルを用いて符号化対象ブロックの位置をずらすことで、補正された符号化対象ブロックを生成し（ステップ１７０４）、動画像符号化装置１０１４は、ステップ１５０８以降の処理を行う。 Next, the determination unit 1102 compares SAD1 with SAD2 (step 1703). When SAD2 is smaller than SAD1 (step 1703, Yes), the determination unit 1102 determines that the coding target block is to be corrected. Then, the changing unit 1101 generates a corrected coding target block by shifting the position of the coding target block using the global vector (step 1704), and the moving image coding device 1014 performs the processing from step 1508 onward.

一方、ＳＡＤ２がＳＡＤ１以上である場合（ステップ１７０３，Ｎｏ）、判定部１１０２は、符号化対象ブロックを補正しないと判定し、動画像符号化装置１０１４は、ステップ１５０２以降の処理を行う。 On the other hand, when SAD2 is equal to or greater than SAD1 (step 1703, No), the determination unit 1102 determines that the coding target block is not to be corrected, and the moving image coding device 1014 performs the processing from step 1502 onward.

図１７の補正処理によれば、決定部１１０３が決定したグローバルベクトルに基づいて符号化対象ブロックの位置をずらすことで、ＭＥ効率が向上する場合に、符号化対象ブロックの位置が変更される。これにより、符号化対象パノラマ画像のＭＥ効率を向上させることができる。 According to the correction process of FIG. 17, the position of the coded target block is changed when the ME efficiency is improved by shifting the position of the coded target block based on the global vector determined by the determination unit 1103. This makes it possible to improve the ME efficiency of the panoramic image to be encoded.
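The comparison of SAD1 and SAD2 can be sketched as follows. This is a simplification: the embodiment computes each SAD after per-block motion estimation, whereas the sketch compares the images directly with zero motion, which is enough to show the decision rule. The function names and pixel values are hypothetical.

```python
def sad(image_a, image_b):
    """Sum of absolute differences between two equally sized images."""
    return sum(abs(x - y)
               for row_a, row_b in zip(image_a, image_b)
               for x, y in zip(row_a, row_b))

def shift_rows_left(image, shift):
    """Circularly shift every row of the image left by `shift` samples."""
    shifted = []
    for row in image:
        k = shift % len(row)
        shifted.append(row[k:] + row[:k])
    return shifted

def should_correct_by_sad(reference, target, shift):
    """Return (decision, SAD1, SAD2); correct only when SAD2 < SAD1."""
    sad1 = sad(reference, target)                          # without correction
    sad2 = sad(reference, shift_rows_left(target, shift))  # with correction
    return sad2 < sad1, sad1, sad2

# One-row example: the target is the reference moved two samples to the right.
reference = [[10, 20, 30, 40, 50, 60]]
target = [[50, 60, 10, 20, 30, 40]]
decision, sad1, sad2 = should_correct_by_sad(reference, target, 2)
```

When SAD2 equals SAD1 the function returns False, matching the No branch of step 1703.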

図１８は、符号化対象パノラマ画像を補正するか否かを符号化効率を用いて判定する動画像符号化処理の例を示すフローチャートである。ステップ１８０１～ステップ１８０６の処理は、図１５のステップ１５０１～ステップ１５０６の処理と同様である。 FIG. 18 is a flowchart showing an example of a moving image coding process for determining whether or not to correct a panoramic image to be coded by using the coding efficiency. The processing of steps 1801 to 1806 is the same as the processing of steps 1501 to 1506 of FIG.

符号化対象ブロックが前方向予測ピクチャのブロックである場合（ステップ１８０６，Ｙｅｓ）、動画像符号化装置１０１４は、ステップ１８０７の処理と、ステップ１８０８～ステップ１８１１の処理とを行う。ステップ１８０７の処理は、符号化対象ブロックを補正しない場合のステップ１８０２の処理と同様である。 When the coded block is a block of a forward prediction picture (step 1806, Yes), the moving image coding device 1014 performs the process of step 1807 and the process of steps 1808 to 1811. The process of step 1807 is the same as the process of step 1802 when the coded block is not corrected.

ステップ１８０８において、変更部１１０１は、グローバルベクトルを用いて符号化対象ブロックの位置をずらすことで、補正された符号化対象ブロックを生成する。ステップ１８０９～ステップ１８１１の処理は、図１５において符号化対象ブロックを補正する場合のステップ１５０８～ステップ１５１０の処理と同様である。 In step 1808, the change unit 1101 generates the corrected coded object block by shifting the position of the coded object block using the global vector. The processing of steps 1809 to 1811 is the same as the processing of steps 1508 to 1510 when the coded block is corrected in FIG.

次に、変更部１１０１は、符号化対象画像内のすべてのブロックを符号化したか否かをチェックする（ステップ１８１２）。符号化していないブロックが残っている場合（ステップ１８１２，Ｎｏ）、動画像符号化装置１０１４は、次のブロックを符号化対象ブロックとして、ステップ１８０１以降の処理を繰り返す。 Next, the change unit 1101 checks whether or not all the blocks in the image to be encoded have been encoded (step 1812). When an unencoded block remains (step 1812, No), the moving image coding apparatus 1014 repeats the processing after step 1801 with the next block as the coding target block.

すべてのブロックを符号化した場合（ステップ１８１２，Ｙｅｓ）、変更部１１０１及び判定部１１０２は、符号化効率に基づく補正処理を行う（ステップ１８１３）。そして、動画像符号化装置１０１４は、ステップ１８０４以降の処理を行う。 When all the blocks are encoded (step 1812, Yes), the changing unit 1101 and the determination unit 1102 perform a correction process based on the coding efficiency (step 1813). Then, the moving image coding device 1014 performs the processing after step 1804.

図１９は、図１８のステップ１８１３における補正処理の例を示すフローチャートである。判定部１１０２は、Ｔ／Ｑ１１０６が周波数信号を量子化する際に用いる量子化スケールと、ＥＮＴ１１０７が生成するビットストリームの情報量とを用いて符号化効率を計算する。例えば、符号化対象パノラマ画像内のすべてのブロックに対する量子化スケールの平均値と、その符号化対象パノラマ画像から生成される符号の総ビット数との積を、その符号化対象パノラマ画像の符号化効率として用いることができる。 FIG. 19 is a flowchart showing an example of the correction process in step 1813 of FIG. 18. The determination unit 1102 calculates the coding efficiency using the quantization scale that the T/Q 1106 uses when quantizing the frequency signal and the amount of information in the bitstream generated by the ENT 1107. For example, the product of the average quantization scale over all the blocks in the coding target panoramic image and the total number of bits of the code generated from that coding target panoramic image can be used as the coding efficiency of that coding target panoramic image.

まず、判定部１１０２は、符号化対象パノラマ画像内のすべてのブロックに対するステップ１８０７の処理の結果から、符号化対象パノラマ画像を補正しない場合の符号化効率ＣＥ１を計算する（ステップ１９０１）。符号化効率ＣＥ１は、符号化対象パノラマ画像を補正しない場合の量子化スケールの平均値ＱＰ１と符号の総ビット数Ｉ１との積として求められる。 First, the determination unit 1102 calculates the coding efficiency CE1 when the coding target panoramic image is not corrected from the processing result of step 1807 for all the blocks in the coding target panoramic image (step 1901). The coding efficiency CE1 is obtained as the product of the average value QP1 of the quantization scale and the total number of bits I1 of the code when the panoramic image to be coded is not corrected.

次に、判定部１１０２は、符号化対象パノラマ画像内のすべてのブロックに対するステップ１８０８～ステップ１８１１の処理の結果から、符号化対象パノラマ画像を補正した場合の符号化効率ＣＥ２を計算する（ステップ１９０２）。符号化効率ＣＥ２は、符号化対象パノラマ画像を補正した場合の量子化スケールの平均値ＱＰ２と符号の総ビット数Ｉ２との積として求められる。 Next, the determination unit 1102 calculates the coding efficiency CE2 for the case where the coding target panoramic image is corrected, from the results of the processing of steps 1808 to 1811 for all the blocks in the coding target panoramic image (step 1902). The coding efficiency CE2 is obtained as the product of the average quantization scale QP2 and the total number of code bits I2 for the case where the coding target panoramic image is corrected.

次に、判定部１１０２は、ＣＥ１とＣＥ２を比較する（ステップ１９０３）。ＣＥ２がＣＥ１よりも小さい場合（ステップ１９０３，Ｙｅｓ）、判定部１１０２は、符号化対象パノラマ画像を補正すると判定する。そして、動画像符号化装置１０１４は、ステップ１８０８～ステップ１８１１の処理の結果を採用する（ステップ１９０５）。 Next, the determination unit 1102 compares CE1 and CE2 (step 1903). When CE2 is smaller than CE1 (step 1903, Yes), the determination unit 1102 determines that the coded panoramic image is corrected. Then, the moving image coding device 1014 adopts the result of the processing of steps 1808 to 1811 (step 1905).

一方、ＣＥ２がＣＥ１以上である場合（ステップ１９０３，Ｎｏ）、判定部１１０２は、符号化対象パノラマ画像を補正しないと判定し、動画像符号化装置１０１４は、ステップ１８０７の処理の結果を採用する（ステップ１９０４）。 On the other hand, when CE2 is CE1 or more (step 1903, No), the determination unit 1102 determines that the coded panoramic image is not corrected, and the moving image coding device 1014 adopts the result of the process of step 1807. (Step 1904).

図１９の補正処理によれば、決定部１１０３が決定したグローバルベクトルに基づいて符号化対象ブロックの位置をずらすことで、符号化効率が向上する場合に、符号化対象ブロックの位置が変更される。これにより、符号化対象パノラマ画像の符号化効率を向上させることができる。 According to the correction process of FIG. 19, the position of the coding target block is changed when the coding efficiency is improved by shifting the position of the coding target block based on the global vector determined by the determination unit 1103. .. This makes it possible to improve the coding efficiency of the panoramic image to be coded.
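The comparison of CE1 and CE2 can be sketched as follows. Because the measure is the product of the average quantization scale and the total bit count, a smaller value indicates better efficiency. The function names and sample numbers are assumptions for the example.

```python
def coding_cost(quant_scales, total_bits):
    """CE = (average quantization scale over all blocks) * (total code bits)."""
    if not quant_scales:
        raise ValueError("at least one quantization scale is required")
    average_qp = sum(quant_scales) / len(quant_scales)
    return average_qp * total_bits

def choose_result(uncorrected, corrected):
    """Each argument is (list of per-block quantization scales, total bits)."""
    ce1 = coding_cost(*uncorrected)
    ce2 = coding_cost(*corrected)
    # Adopt the corrected result only when CE2 is strictly smaller than CE1.
    return "corrected" if ce2 < ce1 else "uncorrected"

# Hypothetical results: same quantization scale, fewer bits after correction.
adopted = choose_result(([30, 30, 30], 1000), ([30, 30, 30], 800))
```

Both encodings of the picture must be available before this comparison is made, which is why the flow of FIG. 18 runs the uncorrected and corrected paths for every block first.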

図２０は、図８の動画像復号装置８０１の具体例を示している。図２０の動画像復号装置２００１は、抽出部２０１１、変更部２０１２、ブロック復号部２０１３、加算部２０１４、動き補償部２０１５、予測画像生成部２０１６、及びフレームメモリ２０１７を含む。 FIG. 20 shows a specific example of the moving image decoding device 801 of FIG. The moving image decoding device 2001 of FIG. 20 includes an extraction unit 2011, a change unit 2012, a block decoding unit 2013, an addition unit 2014, a motion compensation unit 2015, a prediction image generation unit 2016, and a frame memory 2017.

抽出部２０１１は、図８の抽出部８１２に対応し、変更部２０１２は、補正部８１４に対応し、ブロック復号部２０１３、加算部２０１４、動き補償部２０１５、及び予測画像生成部２０１６は、復号部８１３に対応し、フレームメモリ２０１７は、記憶部８１１に対応する。 The extraction unit 2011 corresponds to the extraction unit 812 of FIG. 8, the change unit 2012 corresponds to the correction unit 814, and the block decoding unit 2013, the addition unit 2014, the motion compensation unit 2015, and the prediction image generation unit 2016 are decoded. The frame memory 2017 corresponds to the storage unit 813 and corresponds to the storage unit 811.

The bit stream output by the moving image coding device 1014 of FIG. 11 is input to the moving image decoding device 2001 as an encoded panoramic video. The moving image decoding device 2001 decodes the encoded panoramic image at each time included in the encoded panoramic video and restores the panoramic video to be encoded.

抽出部２０１１は、符号化パノラマ映像に含まれるヘッダ情報からグローバルベクトルの情報を抽出して、変更部２０１２へ出力する。 The extraction unit 2011 extracts the global vector information from the header information included in the coded panoramic image and outputs it to the change unit 2012.

The block decoding unit 2013 inverse-quantizes the coefficient information of each block to be decoded in the encoded panoramic image to generate a frequency signal, and converts the frequency signal into a reconstructed prediction error signal by an inverse orthogonal transform. The block decoding unit 2013 then outputs the reconstructed prediction error signal to the addition unit 2014.

The addition unit 2014 generates a decoded block image by adding the prediction block image output from the prediction image generation unit 2016 to the reconstructed prediction error signal, and outputs the generated decoded block image to the frame memory 2017. When the extraction unit 2011 outputs a global vector for the encoded panoramic image, the change unit 2012 returns the position of the decoded block image, based on the global vector, to the position that the block to be encoded had before the change.
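As an illustration of the addition performed by the addition unit 2014, the following Python sketch adds a prediction block to a reconstructed prediction error block sample by sample. The function name reconstruct_block, the nested-list block representation, and the clipping to an assumed 8-bit sample range are illustrative assumptions and are not taken from the embodiment.

```python
def reconstruct_block(prediction, residual, bit_depth=8):
    """Add residual samples to prediction samples and clip to the valid sample range.

    prediction and residual are lists of rows of integers with the same shape.
    Clipping to [0, 2**bit_depth - 1] is an assumption of this sketch.
    """
    max_val = (1 << bit_depth) - 1
    return [
        [min(max(p + r, 0), max_val) for p, r in zip(pred_row, res_row)]
        for pred_row, res_row in zip(prediction, residual)
    ]


decoded = reconstruct_block([[100, 250]], [[-5, 10]])
```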

フレームメモリ２０１７は、復号ブロック画像を蓄積し、蓄積した復号ブロック画像を参照画像として、動き補償部２０１５及び予測画像生成部２０１６へ出力する。符号化パノラマ画像内の複数の復号対象ブロックそれぞれから生成された複数の復号ブロック画像は、復号パノラマ画像に対応する。そして、変更部２０１２によって位置を変更された後の複数の復号ブロック画像は、復元された符号化対象パノラマ画像を表し、参照パノラマ画像に対応する。 The frame memory 2017 accumulates the decoded block image, and outputs the accumulated decoded block image as a reference image to the motion compensation unit 2015 and the prediction image generation unit 2016. The plurality of decoded block images generated from each of the plurality of decoded blocks in the coded panoramic image correspond to the decoded panoramic image. The plurality of decoded block images after the position is changed by the change unit 2012 represent the restored panoramic image to be encoded and correspond to the reference panoramic image.

動き補償部２０１５は、参照パノラマ画像から動きベクトルが示す参照画像を取得して、予測画像生成部２０１６へ出力する。 The motion compensation unit 2015 acquires the reference image indicated by the motion vector from the reference panoramic image and outputs it to the prediction image generation unit 2016.

予測画像生成部２０１６は、参照画像を用いて、符号化パノラマ画像内の既に復号された周辺画素の画素値から、復号対象ブロックのイントラ予測ブロック画像を生成する。また、予測画像生成部２０１６は、動き補償部２０１５から出力される参照画像をインター予測ブロック画像として使用する。そして、予測画像生成部２０１６は、イントラ予測ブロック画像又はインター予測ブロック画像のいずれか一方を選択し、選択した予測ブロック画像を加算部２０１４へ出力する。 The prediction image generation unit 2016 generates an intra prediction block image of the decoding target block from the pixel values of the peripheral pixels already decoded in the coded panorama image using the reference image. Further, the prediction image generation unit 2016 uses the reference image output from the motion compensation unit 2015 as the inter-prediction block image. Then, the prediction image generation unit 2016 selects either the intra prediction block image or the inter prediction block image, and outputs the selected prediction block image to the addition unit 2014.

FIG. 21 is a flowchart showing a specific example of the moving image decoding process performed by the moving image decoding device 2001 of FIG. 20. The moving image decoding device 2001 treats the image at each time included in the encoded video as an image to be decoded, and decodes the image to be decoded block by block.

First, the moving image decoding device 2001 checks whether or not the video to be decoded is an omnidirectional panoramic video (step 2101). When the video to be decoded is not an omnidirectional panoramic video (step 2101, No), the moving image decoding device 2001 decodes the block to be decoded as it is (step 2102).

On the other hand, when the video to be decoded is an omnidirectional panoramic video (step 2101, Yes), the extraction unit 2011 extracts the global vector information from the header information (step 2105).

Next, the moving image decoding device 2001 checks whether or not the block to be decoded is a block of a forward prediction picture (step 2106). When the block to be decoded is not a block of a forward prediction picture (step 2106, No), the moving image decoding device 2001 performs the processing from step 2102 onward.

一方、復号対象ブロックが前方向予測ピクチャのブロックである場合（ステップ２１０６，Ｙｅｓ）、動画像復号装置２００１は、復号対象ブロックを復号する（ステップ２１０７）。そして、変更部２０１２は、グローバルベクトルに基づいて、復号ブロック画像の位置を変更前の符号化対象ブロックの位置に戻す（ステップ２１０８）。このとき、変更部２０１２は、復号ブロック画像の位置を、グローバルベクトルとは逆の向きにグローバルベクトルの大きさだけずらすことで、補正された復号ブロック画像を生成して、フレームメモリ２０１７に書き込む。 On the other hand, when the decoding target block is a block of the forward prediction picture (step 2106, Yes), the moving image decoding apparatus 2001 decodes the decoding target block (step 2107). Then, the change unit 2012 returns the position of the decoded block image to the position of the coded block before the change based on the global vector (step 2108). At this time, the change unit 2012 generates a corrected decoded block image by shifting the position of the decoded block image by the size of the global vector in the direction opposite to the global vector, and writes it in the frame memory 2017.
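The position restoration of step 2108 can be sketched in Python as follows. The sketch shifts a block position by the global vector in the opposite direction; the wrap-around at the image borders, the function name restore_block_position, and the (x, y) tuple representation are assumptions made for illustration and are not stated in the embodiment.

```python
def restore_block_position(pos, global_vector, width, height):
    """Shift a decoded block back by the global vector, in the opposite direction.

    pos and global_vector are (x, y) tuples in pixels. Wrapping around the
    image borders with the modulo operator is an assumption of this sketch.
    """
    x, y = pos
    gx, gy = global_vector
    return ((x - gx) % width, (y - gy) % height)


restored = restore_block_position((10, 5), (16, 0), width=64, height=32)
```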

Next, the moving image decoding device 2001 checks whether or not all the blocks in the image to be decoded have been decoded (step 2103). When undecoded blocks remain (step 2103, No), the moving image decoding device 2001 takes the next block as the block to be decoded and repeats the processing from step 2101 onward.

When all the blocks have been decoded (step 2103, Yes), the moving image decoding device 2001 checks whether or not all the images included in the video to be decoded have been decoded (step 2104). When undecoded images remain (step 2104, No), the moving image decoding device 2001 takes the next image as the image to be decoded and repeats the processing from step 2101 onward. When all the images have been decoded (step 2104, Yes), the moving image decoding device 2001 ends the process.
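The per-block branching of FIG. 21 can be summarized by the following Python sketch, which returns the sequence of actions taken for one block. The function name decode_actions and the action labels are hypothetical; only the branching order follows steps 2101, 2102, and 2105 to 2108 as described above.

```python
def decode_actions(is_panoramic: bool, is_forward_predicted: bool) -> list:
    """Return the actions applied to one block to be decoded, following FIG. 21."""
    if not is_panoramic:                      # step 2101, No
        return ["decode"]                     # step 2102
    actions = ["extract_global_vector"]       # step 2105
    if not is_forward_predicted:              # step 2106, No
        return actions + ["decode"]           # step 2102
    # step 2106, Yes: decode (step 2107), then restore the position (step 2108)
    return actions + ["decode", "restore_position"]
```

As noted for FIG. 21 elsewhere in the description, the global vector may also be extracted once per image rather than once per block; the sketch keeps the per-block form of the flowchart.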

Next, the operation in the case where the moving image coding device 1014 of FIG. 11 corresponds to the moving image coding device 601 of FIG. 6 will be described. In this case, the change unit 1101 and the determination unit 1102 correspond to the correction unit 613, and the determination unit 1103 corresponds to the determination unit 612. The subtraction unit 1105, the T/Q 1106, the ENT 1107, the IQ/IT 1108, the addition unit 1109, the motion compensation unit 1110, and the prediction image generation unit 1111 correspond to the coding unit 614, and the frame memory 1112 corresponds to the storage unit 611.

図１０の撮影部１０１１が移動している場合、時間の経過とともに全周囲パノラマ映像の撮影範囲が変化し、写っている被写体も変化することが想定される。この場合、全周囲パノラマ映像に含まれる各パノラマ画像の全体を展開パノラマ画像に変換しても、１つの動画として捉えることは難しい。そこで、図３に示したように、各パノラマ画像から一部の領域を抽出して矩形状に変換することで、抽出した部分領域の動画を生成することが効果的である。 When the photographing unit 1011 of FIG. 10 is moving, it is assumed that the photographing range of the panoramic image of the entire circumference changes with the passage of time, and the subject in the image also changes. In this case, even if the entire panoramic image included in the omnidirectional panoramic image is converted into a developed panoramic image, it is difficult to capture it as one moving image. Therefore, as shown in FIG. 3, it is effective to extract a partial region from each panoramic image and convert it into a rectangular shape to generate a moving image of the extracted partial region.

この場合、合成部１０１２は、撮影部１０１１に含まれる複数の撮像装置が撮影した複数の映像を組み合わせることで、符号化対象パノラマ映像を生成することができる。展開部１０１３が生成する展開パノラマ画像は、符号化対象パノラマ画像として、動画像符号化装置１０１４に入力される。 In this case, the compositing unit 1012 can generate a coded panoramic image by combining a plurality of images captured by a plurality of image pickup devices included in the photographing unit 1011. The unfolded panoramic image generated by the unfolding unit 1013 is input to the moving image coding device 1014 as a coded target panoramic image.

決定部１１０３は、動き補償部１１１０から出力される動きベクトルを用いて、フレームメモリ１１１２が記憶する参照画像に対する符号化対象ブロックのずれ量を表すグローバルベクトルを決定する。そして、決定部１１０３は、決定したグローバルベクトルを変更部１１０１へ出力する。 The determination unit 1103 uses the motion vector output from the motion compensation unit 1110 to determine a global vector representing the amount of deviation of the coded block with respect to the reference image stored in the frame memory 1112. Then, the determination unit 1103 outputs the determined global vector to the change unit 1101.

例えば、決定部１１０３は、符号化対象パノラマ画像内の特定のブロックの動きベクトルをグローバルベクトルに決定することができる。この場合、動き補償部１１１０は、事前に符号化対象パノラマ画像内のすべてのブロックに対する動き推定を実施し、各ブロックの動きベクトルを求めて、決定部１１０３へ出力する。そして、決定部１１０３は、特定のブロックの動きベクトルを選択して、グローバルベクトルに決定する。 For example, the determination unit 1103 can determine the motion vector of a specific block in the coded panoramic image as a global vector. In this case, the motion compensation unit 1110 performs motion estimation for all the blocks in the coded panoramic image in advance, obtains the motion vector of each block, and outputs the motion vector to the determination unit 1103. Then, the determination unit 1103 selects a motion vector of a specific block and determines it as a global vector.

By using the motion vector of a specific block as the global vector, the position of the partial region extracted from the panoramic image to be encoded can be shifted to follow the motion of that block. The image of the partial region displayed on the screen is thereby corrected so that the subject appearing in the specific block always appears at the same position on the screen, which makes it possible to generate a video that tracks a specific subject.

For example, when the moving image coding device 1014 includes a display device equipped with a touch panel, the determination unit 1103 can obtain the address of the block to which the position touched by the user on the screen belongs, and determine the specific block using that address. When the moving image coding device 1014 does not include a touch panel, the address of a block can be calculated from coordinates indicating a position on the screen, and the specific block can be designated using the calculated address.

例えば、ブロックサイズが１６×１６であり、画面内で指定する画素の座標が（ｘ，ｙ）である場合、ブロックのアドレスは［ｘ／１６，ｙ／１６］となる。また、ブロックサイズが６４×６４であり、画面内で指定する画素の座標が（ｘ，ｙ）である場合、ブロックのアドレスは［ｘ／６４，ｙ／６４］となる。 For example, if the block size is 16 × 16 and the coordinates of the pixels specified in the screen are (x, y), the block address is [x / 16, y / 16]. If the block size is 64 × 64 and the coordinates of the pixels specified in the screen are (x, y), the block address is [x / 64, y / 64].
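The address calculation above can be written as a short Python sketch. It assumes that the division in [x/16, y/16] is integer division that discards the remainder; the function name block_address is hypothetical.

```python
def block_address(x: int, y: int, block_size: int) -> tuple:
    """Return the (column, row) address of the block containing pixel (x, y).

    Integer (floor) division is assumed for non-negative pixel coordinates.
    """
    return (x // block_size, y // block_size)


addr_16 = block_address(100, 40, 16)   # block size 16 x 16
addr_64 = block_address(100, 40, 64)   # block size 64 x 64
```

For the pixel (100, 40), the sketch yields the address (6, 2) with 16 x 16 blocks and (1, 0) with 64 x 64 blocks.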

FIG. 22 is a flowchart showing a specific example of the moving image coding process in the case where the moving image coding device 1014 corresponds to the moving image coding device 601. The moving image coding device 1014 treats the image at each time included in the video to be encoded as an image to be encoded, and encodes the image to be encoded block by block.

When the video to be encoded is an omnidirectional panoramic video, the moving image coding device 1014 targets one or more partial regions extracted from the panoramic image to be encoded, and encodes the image of each partial region block by block.

まず、変更部１１０１は、符号化対象映像が全周囲パノラマ映像であるか否かをチェックする（ステップ２２０１）。 First, the change unit 1101 checks whether or not the coded image is an omnidirectional panoramic image (step 2201).

When the video to be encoded is not an omnidirectional panoramic video (step 2201, No), the change unit 1101 outputs the block to be encoded as it is to the subtraction unit 1105 and the motion compensation unit 1110. The moving image coding device 1014 then encodes the block to be encoded (step 2202).

一方、符号化対象映像が全周囲パノラマ映像である場合（ステップ２２０１，Ｙｅｓ）、決定部１１０３は、符号化対象パノラマ画像のグローバルベクトルを決定する（ステップ２２０５）。 On the other hand, when the coded target image is an omnidirectional panoramic image (step 2201, Yes), the determination unit 1103 determines the global vector of the coded target panoramic image (step 2205).

次に、変更部１１０１は、符号化対象パノラマ画像内において、符号化対象ブロックの位置を、グローバルベクトルとは逆の向きにグローバルベクトルの大きさだけずらすことで、新たな符号化対象ブロックを設定する（ステップ２２０６）。そして、動画像符号化装置１０１４は、設定された新たな符号化対象ブロックを符号化する（ステップ２２０７）。部分領域内の各ブロックの位置をグローバルベクトルとは逆の向きにずらすことにより、部分領域の起点を変更して、補正された部分領域を生成することができる。 Next, the changing unit 1101 sets a new coding target block by shifting the position of the coding target block in the direction opposite to the global vector by the size of the global vector in the coded panoramic image. (Step 2206). Then, the moving image coding device 1014 encodes the set new coded target block (step 2207). By shifting the position of each block in the subregion in the direction opposite to the global vector, the starting point of the subregion can be changed to generate the corrected subregion.
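The effect of step 2206 on a whole partial region can be sketched in Python as follows. The sketch moves the origin of a partial region by the global vector in the opposite direction and lists the resulting block positions; the function name shifted_block_positions, the raster ordering of the blocks, and the absence of any boundary handling are assumptions made for illustration.

```python
def shifted_block_positions(origin, cols, rows, block_size, global_vector):
    """Return the top-left positions of all blocks of a partial region after the shift.

    The region origin is moved opposite to the global vector; every block in
    the region therefore moves by the same amount. Boundary handling is omitted.
    """
    ox, oy = origin
    gx, gy = global_vector
    new_ox, new_oy = ox - gx, oy - gy
    return [
        (new_ox + c * block_size, new_oy + r * block_size)
        for r in range(rows)
        for c in range(cols)
    ]


positions = shifted_block_positions((32, 16), cols=2, rows=1, block_size=16, global_vector=(8, -4))
```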

次に、変更部１１０１は、符号化対象画像又は部分領域内のすべてのブロックを符号化したか否かをチェックする（ステップ２２０３）。符号化していないブロックが残っている場合（ステップ２２０３，Ｎｏ）、動画像符号化装置１０１４は、次のブロックを符号化対象ブロックとして、ステップ２２０１以降の処理を繰り返す。 Next, the change unit 1101 checks whether or not all the blocks in the image to be encoded or the partial region are encoded (step 2203). When the unencoded block remains (step 2203, No), the moving image coding apparatus 1014 repeats the processing after step 2201 with the next block as the coding target block.

すべてのブロックを符号化した場合（ステップ２２０３，Ｙｅｓ）、変更部１１０１は、符号化対象映像に含まれるすべての画像を符号化したか否かをチェックする（ステップ２２０４）。符号化していない画像が残っている場合（ステップ２２０４，Ｎｏ）、動画像符号化装置１０１４は、次の画像を符号化対象画像として、ステップ２２０１以降の処理を繰り返す。そして、すべての画像を符号化した場合（ステップ２２０４，Ｙｅｓ）、動画像符号化装置１０１４は、処理を終了する。 When all the blocks are encoded (step 2203, Yes), the changing unit 1101 checks whether or not all the images included in the coded target video are encoded (step 2204). When an unencoded image remains (step 2204, No), the moving image coding apparatus 1014 repeats the processes after step 2201 with the next image as the image to be encoded. Then, when all the images are encoded (step 2204, Yes), the moving image encoding device 1014 ends the process.

In the moving image coding process of FIG. 22, unlike the moving image coding process of FIG. 15, the process of returning the position of the decoded block image to its original position based on the global vector is not performed. Because the moving image decoding device does not perform that process either, the moving image coding device 1014 does not need to transmit the global vector information to the moving image decoding device.

図２２の動画像符号化処理によれば、撮影部１０１１が移動することで複数の撮像装置それぞれの映像の撮影範囲が変化した場合であっても、特定のブロックの動きに応じて、符号化対象パノラマ画像から抽出される部分領域の起点を変更することができる。これにより、部分領域内の各ブロックに対して生成される動きベクトル情報を抑制することが可能になるとともに、各ブロックに対して適切な動き推定を行うことが可能になる。したがって、ＭＥ効率及び符号化効率が向上する。 According to the moving image coding process of FIG. 22, even when the shooting range of the image of each of the plurality of imaging devices changes due to the movement of the shooting unit 1011, the coding is performed according to the movement of a specific block. The starting point of the partial area extracted from the target panoramic image can be changed. This makes it possible to suppress the motion vector information generated for each block in the partial region, and it is possible to perform appropriate motion estimation for each block. Therefore, ME efficiency and coding efficiency are improved.

図４、図６、及び図１１の動画像符号化装置の構成は一例に過ぎず、動画像符号化装置の用途又は条件に応じて一部の構成要素を省略又は変更してもよい。例えば、図１１の動画像符号化装置１０１４において、エントロピー符号化を行わない場合、ＥＮＴ１１０７を省略することができる。 The configuration of the moving image coding device of FIGS. 4, 6 and 11 is only an example, and some components may be omitted or changed depending on the use or conditions of the moving image coding device. For example, in the moving image coding device 1014 of FIG. 11, when entropy coding is not performed, ENT1107 can be omitted.

図８及び図２０の動画像復号装置の構成は一例に過ぎず、動画像復号装置の用途又は条件に応じて一部の構成要素を省略又は変更してもよい。図１０のパノラマ映像符号化システムの構成は一例に過ぎず、パノラマ映像符号化システムの用途又は条件に応じて一部の構成要素を省略又は変更してもよい。例えば、撮影部１０１１が１台の撮像装置のみを含む場合、合成部１０１２を省略することができる。 The configurations of the moving image decoding apparatus shown in FIGS. 8 and 20 are merely examples, and some components may be omitted or changed depending on the use or conditions of the moving image decoding apparatus. The configuration of the panoramic video coding system of FIG. 10 is only an example, and some components may be omitted or changed depending on the use or conditions of the panoramic video coding system. For example, when the photographing unit 1011 includes only one image pickup device, the compositing unit 1012 can be omitted.

The flowcharts shown in FIGS. 5, 7, 9, 15 to 19, 21, and 22 are merely examples, and some of the processing may be omitted or changed depending on the configuration or conditions of the moving image coding device or the moving image decoding device. For example, in the moving image coding process of FIG. 15, when the input video to be encoded is limited to omnidirectional panoramic video, the processing of steps 1501 and 1502 can be omitted. The determination unit 1103 does not need to perform the processing of step 1505 for every block, and may perform it only once for each panoramic image to be encoded.

In the correction process of FIG. 17, the motion compensation unit 1110 may calculate, instead of the SAD, another index representing the difference between the reference panoramic image and the panoramic image to be encoded, and the determination unit 1102 may use that index to determine whether or not to correct the block to be encoded.

図１８の動画像符号化処理において、入力される符号化対象映像が全周囲パノラマ映像に限られる場合、ステップ１８０１及びステップ１８０２の処理を省略することができる。決定部１１０３は、ステップ１８０５の処理をブロック毎に毎回行う必要はなく、１枚の符号化対象パノラマ画像に対して１回のみ、ステップ１８０５の処理を行ってもよい。 In the moving image coding process of FIG. 18, when the input coded target image is limited to the omnidirectional panoramic image, the processes of steps 1801 and 1802 can be omitted. The determination unit 1103 does not need to perform the process of step 1805 for each block, and may perform the process of step 1805 only once for one coded panoramic image.

図１９の補正処理において、判定部１１０２は、量子化スケールの平均値の代わりに、中央値、最大値等の別の統計値を用いて、符号化効率を計算してもよい。 In the correction process of FIG. 19, the determination unit 1102 may calculate the coding efficiency by using another statistical value such as a median value and a maximum value instead of the average value of the quantization scale.
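The choice of statistic mentioned above can be illustrated with Python's standard statistics module. The function name representative_quant_scale, the method labels, and the sample values are hypothetical; the sketch only shows how the mean, the median, or the maximum of per-block quantization scales could be selected.

```python
import statistics


def representative_quant_scale(scales, method="mean"):
    """Reduce per-block quantization scales to a single representative value."""
    reducers = {
        "mean": statistics.mean,      # average value
        "median": statistics.median,  # median value
        "max": max,                   # maximum value
    }
    return reducers[method](scales)


scales = [2, 4, 9]  # illustrative per-block quantization scales
```

The mean is sensitive to a few blocks with a very large quantization scale, whereas the median is not, which may be one reason to prefer one statistic over another.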

図２１の動画像復号処理において、入力される復号対象映像が全周囲パノラマ映像に限られる場合、ステップ２１０１及びステップ２１０２の処理を省略することができる。抽出部２０１１は、ステップ２１０５の処理をブロック毎に毎回行う必要はなく、１枚の復号対象画像に対して１回のみ、ステップ２１０５の処理を行ってもよい。 In the moving image decoding process of FIG. 21, when the input decoding target image is limited to the omnidirectional panoramic image, the processes of step 2101 and step 2102 can be omitted. The extraction unit 2011 does not need to perform the process of step 2105 for each block, and may perform the process of step 2105 only once for one image to be decoded.

図２２の動画像符号化処理において、入力される符号化対象映像が全周囲パノラマ映像に限られる場合、ステップ２２０１及びステップ２２０２の処理を省略することができる。決定部１１０３は、ステップ２２０５の処理をブロック毎に毎回行う必要はなく、１枚の符号化対象パノラマ画像に対して１回のみ、ステップ２２０５の処理を行ってもよい。 In the moving image coding process of FIG. 22, when the input coded target image is limited to the omnidirectional panoramic image, the processes of step 2201 and step 2202 can be omitted. The determination unit 1103 does not need to perform the process of step 2205 for each block, and may perform the process of step 2205 only once for one coded panoramic image.

図１～図３の展開パノラマ画像は一例に過ぎず、パノラマ映像符号化システムの構成又は条件に応じて、別の展開パノラマ画像を用いてもよい。 The developed panoramic images of FIGS. 1 to 3 are merely examples, and another developed panoramic image may be used depending on the configuration or conditions of the panoramic video coding system.

図１２～図１４の全周囲パノラマ画像は一例に過ぎず、撮影範囲に存在する被写体に応じて全周囲パノラマ画像は変化する。符号化対象パノラマ画像に写った被写体の移動方向は、水平方向に限られず、垂直方向又は斜め方向である場合もある。符号化対象パノラマ画像全体の動きが垂直方向又は斜め方向であっても、図５、図７、図１５、図１８、及び図２２の動画像符号化処理と図９及び図２１の動画像復号処理を適用することができる。符号化対象パノラマ画像の直前の時刻における符号化済みのパノラマ画像の代わりに、別の時刻における符号化済みのパノラマ画像を、参照パノラマ画像として用いてもよい。 The omnidirectional panoramic image of FIGS. 12 to 14 is only an example, and the omnidirectional panoramic image changes depending on the subject existing in the shooting range. The moving direction of the subject in the coded panoramic image is not limited to the horizontal direction, but may be a vertical direction or an oblique direction. Even if the movement of the entire panoramic image to be encoded is vertical or diagonal, the moving image coding processing of FIGS. 5, 7, 15, 18 and 22 and the moving image decoding of FIGS. 9 and 21 are performed. Processing can be applied. Instead of the coded panoramic image at the time immediately before the coded panoramic image, the coded panoramic image at another time may be used as the reference panoramic image.

FIG. 23 shows a configuration example of an information processing device (computer) used as the moving image coding device of FIG. 4, FIG. 6, or FIG. 11, or as the moving image decoding device of FIG. 8 or FIG. 20. The information processing device of FIG. 23 includes a Central Processing Unit (CPU) 2301, a memory 2302, an input device 2303, an output device 2304, an auxiliary storage device 2305, a medium drive device 2306, and a network connection device 2307. These components are connected to each other by a bus 2308.

The memory 2302 is, for example, a semiconductor memory such as a Read Only Memory (ROM), a Random Access Memory (RAM), or a flash memory, and stores programs and data used for the moving image coding process or the moving image decoding process. The memory 2302 can be used as the storage unit 411 of FIG. 4, the storage unit 611 of FIG. 6, the storage unit 811 of FIG. 8, the frame memory 1112 of FIG. 11, and the frame memory 2017 of FIG. 20.

ＣＰＵ２３０１（プロセッサ）は、例えば、メモリ２３０２を利用してプログラムを実行することにより、図４の決定部４１２、補正部４１３、及び符号化部４１４として動作する。ＣＰＵ２３０１は、メモリ２３０２を利用してプログラムを実行することにより、図６の決定部６１２、補正部６１３、及び符号化部６１４としても動作する。ＣＰＵ２３０１は、メモリ２３０２を利用してプログラムを実行することにより、図８の抽出部８１２、復号部８１３、及び補正部８１４としても動作する。 The CPU 2301 (processor) operates as a determination unit 412, a correction unit 413, and a coding unit 414 in FIG. 4 by executing a program using, for example, the memory 2302. The CPU 2301 also operates as the determination unit 612, the correction unit 613, and the coding unit 614 in FIG. 6 by executing the program using the memory 2302. The CPU 2301 also operates as the extraction unit 812, the decoding unit 813, and the correction unit 814 in FIG. 8 by executing the program using the memory 2302.

By executing the program using the memory 2302, the CPU 2301 also operates as the change unit 1101, the determination unit 1102, the determination unit 1103, the change unit 1104, the subtraction unit 1105, the T/Q 1106, and the ENT 1107 of FIG. 11. The CPU 2301 also operates as the IQ/IT 1108, the addition unit 1109, the motion compensation unit 1110, and the prediction image generation unit 1111.

By executing the program using the memory 2302, the CPU 2301 also operates as the extraction unit 2011, the change unit 2012, the block decoding unit 2013, the addition unit 2014, the motion compensation unit 2015, and the prediction image generation unit 2016 of FIG. 20.

入力装置２３０３は、例えば、キーボード、ポインティングデバイス等であり、ユーザ又はオペレータからの指示や情報の入力に用いられる。出力装置２３０４は、例えば、表示装置、プリンタ、スピーカ等であり、ユーザ又はオペレータへの問い合わせや処理結果の出力に用いられる。情報処理装置が動画像復号装置である場合、処理結果は、復元された符号化対象パノラマ映像であってもよい。 The input device 2303 is, for example, a keyboard, a pointing device, or the like, and is used for inputting instructions or information from a user or an operator. The output device 2304 is, for example, a display device, a printer, a speaker, or the like, and is used for making an inquiry to a user or an operator and outputting a processing result. When the information processing device is a moving image decoding device, the processing result may be the restored panoramic image to be encoded.

補助記憶装置２３０５は、例えば、磁気ディスク装置、光ディスク装置、光磁気ディスク装置、テープ装置等である。補助記憶装置２３０５は、ハードディスクドライブ又はフラッシュメモリであってもよい。情報処理装置は、補助記憶装置２３０５にプログラム及びデータを格納しておき、それらをメモリ２３０２にロードして使用することができる。 The auxiliary storage device 2305 is, for example, a magnetic disk device, an optical disk device, a magneto-optical disk device, a tape device, or the like. The auxiliary storage device 2305 may be a hard disk drive or a flash memory. The information processing device can store programs and data in the auxiliary storage device 2305 and load them into the memory 2302 for use.

媒体駆動装置２３０６は、可搬型記録媒体２３０９を駆動し、その記録内容にアクセスする。可搬型記録媒体２３０９は、メモリデバイス、フレキシブルディスク、光ディスク、光磁気ディスク等である。可搬型記録媒体２３０９は、Compact Disk Read Only Memory（ＣＤ－ＲＯＭ）、Digital Versatile Disk（ＤＶＤ）、又はUniversal Serial Bus（ＵＳＢ）メモリであってもよい。ユーザ又はオペレータは、この可搬型記録媒体２３０９にプログラム及びデータを格納しておき、それらをメモリ２３０２にロードして使用することができる。 The medium driving device 2306 drives the portable recording medium 2309 to access the recorded contents. The portable recording medium 2309 is a memory device, a flexible disk, an optical disk, a magneto-optical disk, or the like. The portable recording medium 2309 may be a Compact Disk Read Only Memory (CD-ROM), a Digital Versatile Disk (DVD), or a Universal Serial Bus (USB) memory. The user or the operator can store the programs and data in the portable recording medium 2309 and load them into the memory 2302 for use.

Thus, the computer-readable recording media that store the programs and data used for the processing include physical (non-transitory) recording media such as the memory 2302, the auxiliary storage device 2305, and the portable recording medium 2309.

ネットワーク接続装置２３０７は、Local Area Network（ＬＡＮ）、インターネット等の通信ネットワークに接続され、通信に伴うデータ変換を行う通信インタフェースである。情報処理装置が動画像符号化装置である場合、ネットワーク接続装置２３０７は、符号化パノラマ映像のビットストリームを動画像復号装置へ送信することができる。情報処理装置が動画像復号装置である場合、ネットワーク接続装置２３０７は、符号化パノラマ映像のビットストリームを動画像符号化装置から受信することができる。 The network connection device 2307 is a communication interface that is connected to a communication network such as a Local Area Network (LAN) or the Internet and performs data conversion associated with the communication. When the information processing device is a moving image coding device, the network connection device 2307 can transmit a bit stream of the coded panoramic video to the moving image decoding device. When the information processing device is a moving image decoding device, the network connection device 2307 can receive a bit stream of the coded panoramic image from the moving image coding device.

情報処理装置は、プログラム及びデータを外部の装置からネットワーク接続装置２３０７を介して受け取り、それらをメモリ２３０２にロードして使用することもできる。 The information processing device can also receive programs and data from an external device via the network connection device 2307 and load them into the memory 2302 for use.

なお、情報処理装置が図２３のすべての構成要素を含む必要はなく、用途又は条件に応じて一部の構成要素を省略することも可能である。例えば、ユーザ又はオペレータとのインタフェースが不要の場合は、入力装置２３０３及び出力装置２３０４を省略してもよい。また、情報処理装置が可搬型記録媒体２３０９にアクセスしない場合は、媒体駆動装置２３０６を省略してもよい。 It is not necessary for the information processing apparatus to include all the components shown in FIG. 23, and some components may be omitted depending on the intended use or conditions. For example, if the interface with the user or the operator is unnecessary, the input device 2303 and the output device 2304 may be omitted. If the information processing device does not access the portable recording medium 2309, the medium drive device 2306 may be omitted.

Although the disclosed embodiments and their advantages have been described in detail, those skilled in the art will be able to make various changes, additions, and omissions without departing from the scope of the invention as expressly set forth in the claims.

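The following appendices are further disclosed regarding the embodiments described with reference to FIGS. 1 to 23.
(Appendix 1)
A moving image coding device comprising:
a storage unit that stores a reference panoramic image used to encode a panoramic image to be encoded, the panoramic image to be encoded being obtained by unfolding a panoramic image included in a panoramic video captured by an imaging device;
a determination unit that determines a vector representing the amount of deviation of the panoramic image to be encoded with respect to the reference panoramic image;
a correction unit that generates a corrected panoramic image to be encoded by correcting, based on the vector representing the amount of deviation, the position of each of a plurality of regions to be encoded in the panoramic image to be encoded; and
a coding unit that encodes, using the reference panoramic image, the image of each of the plurality of regions to be encoded in the corrected panoramic image to be encoded.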
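(Appendix 2)
The moving image coding device according to Appendix 1, wherein the determination unit determines, as the vector representing the amount of deviation, the motion vector that occurs more frequently than the other motion vectors among the motion vectors of the plurality of regions to be encoded before correction.
(Appendix 3)
The moving image coding device according to Appendix 2, wherein the correction unit corrects the position of each of the plurality of regions to be encoded before correction based on the vector representing the amount of deviation when the ratio of the number of vectors representing the amount of deviation to the total number of motion vectors in the panoramic image to be encoded before correction is larger than a predetermined value.
(Appendix 4)
The moving image coding device according to Appendix 2, wherein the correction unit obtains a first difference based on a first motion estimation using the reference panoramic image and the panoramic image to be encoded before correction, obtains a second difference based on a second motion estimation using the reference panoramic image and the corrected panoramic image to be encoded, and, when the second difference is smaller than the first difference, corrects the position of each of the plurality of regions to be encoded before correction based on the vector representing the amount of deviation.
(Appendix 5)
The moving image coding device according to Appendix 2, wherein the correction unit obtains a quantization scale based on a first motion estimation using the reference panoramic image and the panoramic image to be encoded before correction, and an information amount of a code based on the first motion estimation; obtains a quantization scale based on a second motion estimation using the reference panoramic image and the corrected panoramic image to be encoded, and an information amount of a code based on the second motion estimation; and, when the product of the quantization scale based on the second motion estimation and the information amount of the code based on the second motion estimation is smaller than the product of the quantization scale based on the first motion estimation and the information amount of the code based on the first motion estimation, corrects the position of each of the plurality of regions to be encoded before correction based on the vector representing the amount of deviation.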
（付記６）
複数の撮像装置が撮影した複数の映像を組み合わせたパノラマ映像に含まれる符号化対象パノラマ画像を符号化するために用いられる参照画像を記憶する記憶部と、
前記複数の映像それぞれの撮影範囲が移動することによって、前記符号化対象パノラマ画像内の符号化対象領域が前記参照画像に対してずれた場合、前記参照画像に対する前記符号化対象領域のずれ量を表すベクトルを決定する決定部と、
前記符号化対象パノラマ画像内における前記符号化対象領域の位置を前記ずれ量を表すベクトルに基づいて補正することで、補正された符号化対象領域を生成する補正部と、
前記補正された符号化対象領域の画像を、前記参照画像を用いて符号化する符号化部と、
を備えることを特徴とする動画像符号化装置。
（付記７）
撮像装置が撮影したパノラマ映像に含まれるパノラマ画像を展開した符号化対象パノラマ画像を符号化することで生成される符号化パノラマ画像を含む、符号化パノラマ映像を復号する動画像復号装置であって、
前記符号化パノラマ画像を復号するために用いられる参照パノラマ画像を記憶する記憶部と、
前記参照パノラマ画像に対する前記符号化対象パノラマ画像のずれ量を表すベクトルを、前記符号化パノラマ映像から抽出する抽出部と、
前記符号化パノラマ画像内の複数の復号対象領域それぞれを前記参照パノラマ画像を用いて復号して、復号パノラマ画像を生成する復号部と、
前記復号パノラマ画像内における前記複数の復号対象領域それぞれの位置を前記ずれ量を表すベクトルに基づいて補正することで、前記符号化対象パノラマ画像を復元する補正部と、
を備えることを特徴とする動画像復号装置。
（付記８）
撮像装置が撮影したパノラマ映像に含まれるパノラマ画像を展開した符号化対象パノラマ画像を符号化するために用いられる参照パノラマ画像を用いて、前記参照パノラマ画像に対する前記符号化対象パノラマ画像のずれ量を表すベクトルを決定し、
前記符号化対象パノラマ画像内における複数の符号化対象領域それぞれの位置を前記ずれ量を表すベクトルに基づいて補正することで、補正された符号化対象パノラマ画像を生成し、
前記補正された符号化対象パノラマ画像内の前記複数の符号化対象領域それぞれの画像を、前記参照パノラマ画像を用いて符号化する、
処理をコンピュータに実行させる動画像符号化プログラム。
（付記９）
前記コンピュータは、補正前の前記複数の符号化対象領域それぞれの動きベクトルのうち、他の動きベクトルよりも多く発生する動きベクトルを、前記ずれ量を表すベクトルに決定することを特徴とする付記８記載の動画像符号化プログラム。
（付記１０）
複数の撮像装置が撮影した複数の映像それぞれの撮影範囲が移動することによって、前記複数の映像を組み合わせたパノラマ映像に含まれる符号化対象パノラマ画像を符号化するために用いられる参照画像に対して、前記符号化対象パノラマ画像内の符号化対象領域がずれた場合、前記参照画像に対する前記符号化対象領域のずれ量を表すベクトルを決定し、
前記符号化対象パノラマ画像内における前記符号化対象領域の位置を前記ずれ量を表すベクトルに基づいて補正することで、補正された符号化対象領域を生成し、
前記補正された符号化対象領域の画像を、前記参照画像を用いて符号化する、
処理をコンピュータに実行させる動画像符号化プログラム。
（付記１１）
撮像装置が撮影したパノラマ映像に含まれるパノラマ画像を展開した符号化対象パノラマ画像を符号化することで生成される符号化パノラマ画像を含む、符号化パノラマ映像を復号するコンピュータに、
前記符号化パノラマ画像を復号するために用いられる参照パノラマ画像に対する、前記符号化対象パノラマ画像のずれ量を表すベクトルを、前記符号化パノラマ映像から抽出し、
前記符号化パノラマ画像内の複数の復号対象領域それぞれを前記参照パノラマ画像を用いて復号して、復号パノラマ画像を生成し、
前記復号パノラマ画像内における前記複数の復号対象領域それぞれの位置を前記ずれ量を表すベクトルに基づいて補正することで、前記符号化対象パノラマ画像を復元する、
処理を実行させる動画像復号プログラム。
（付記１２）
コンピュータが、
撮像装置が撮影したパノラマ映像に含まれるパノラマ画像を展開した符号化対象パノラマ画像を符号化するために用いられる参照パノラマ画像を用いて、前記参照パノラマ画像に対する前記符号化対象パノラマ画像のずれ量を表すベクトルを決定し、
前記符号化対象パノラマ画像内における複数の符号化対象領域それぞれの位置を前記ずれ量を表すベクトルに基づいて補正することで、補正された符号化対象パノラマ画像を生成し、
前記補正された符号化対象パノラマ画像内の前記複数の符号化対象領域それぞれの画像を、前記参照パノラマ画像を用いて符号化する、
ことを特徴とする動画像符号化方法。
（付記１３）
前記コンピュータは、補正前の前記複数の符号化対象領域それぞれの動きベクトルのうち、他の動きベクトルよりも多く発生する動きベクトルを、前記ずれ量を表すベクトルに決定することを特徴とする付記１２記載の動画像符号化方法。
（付記１４）
コンピュータが、
複数の撮像装置が撮影した複数の映像それぞれの撮影範囲が移動することによって、前記複数の映像を組み合わせたパノラマ映像に含まれる符号化対象パノラマ画像を符号化するために用いられる参照画像に対して、前記符号化対象パノラマ画像内の符号化対象領域がずれた場合、前記参照画像に対する前記符号化対象領域のずれ量を表すベクトルを決定し、
前記符号化対象パノラマ画像内における前記符号化対象領域の位置を前記ずれ量を表すベクトルに基づいて補正することで、補正された符号化対象領域を生成し、
前記補正された符号化対象領域の画像を、前記参照画像を用いて符号化する、
ことを特徴とする動画像符号化方法。
（付記１５）
撮像装置が撮影したパノラマ映像に含まれるパノラマ画像を展開した符号化対象パノラマ画像を符号化することで生成される符号化パノラマ画像を含む、符号化パノラマ映像を復号するコンピュータが、
前記符号化パノラマ画像を復号するために用いられる参照パノラマ画像に対する、前記符号化対象パノラマ画像のずれ量を表すベクトルを、前記符号化パノラマ映像から抽出し、
前記符号化パノラマ画像内の複数の復号対象領域それぞれを前記参照パノラマ画像を用いて復号して、復号パノラマ画像を生成し、
前記復号パノラマ画像内における前記複数の復号対象領域それぞれの位置を前記ずれ量を表すベクトルに基づいて補正することで、前記符号化対象パノラマ画像を復元する、
ことを特徴とする動画像復号方法。 Further, the following appendices will be disclosed with respect to the embodiments described with reference to FIGS. 1 to 23.
(Appendix 1)
A moving image coding device comprising:
a storage unit that stores a reference panoramic image used to code a coding target panoramic image obtained by expanding a panoramic image included in a panoramic video captured by an imaging device;
a determination unit that determines a vector representing the deviation amount of the coding target panoramic image relative to the reference panoramic image;
a correction unit that generates a corrected coding target panoramic image by correcting the position of each of a plurality of coding target areas in the coding target panoramic image based on the vector representing the deviation amount; and
a coding unit that codes the image of each of the plurality of coding target areas in the corrected coding target panoramic image using the reference panoramic image.
(Appendix 2)
The moving image coding device according to Appendix 1, wherein the determination unit determines, as the vector representing the deviation amount, the motion vector that occurs more often than the other motion vectors among the motion vectors of the plurality of coding target areas before correction.
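The selection rule in Appendix 2 can be illustrated with a short sketch. It assumes each coding target area has one motion vector given as an integer (dx, dy) pair; the function name and the sample values are hypothetical, and the tie-breaking behavior (first vector seen wins) is an assumption the appendix does not specify.

```python
from collections import Counter

def dominant_motion_vector(motion_vectors):
    """Return (vector, count) for the most frequent motion vector.

    motion_vectors: list of (dx, dy) tuples, one per coding target area.
    Ties are resolved in favor of the vector encountered first (assumption).
    """
    if not motion_vectors:
        raise ValueError("at least one motion vector is required")
    vector, count = Counter(motion_vectors).most_common(1)[0]
    return vector, count

# Hypothetical motion vectors of five coding target areas, in pixels.
vectors = [(16, 0), (16, 0), (0, 0), (16, 0), (-4, 2)]
shift_vector, occurrences = dominant_motion_vector(vectors)
print(shift_vector, occurrences)
```

Here three of the five areas share the vector (16, 0), so that vector would be taken as the vector representing the deviation amount.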
(Appendix 3)
The moving image coding device according to Appendix 2, wherein the correction unit corrects the position of each of the plurality of coding target areas before correction based on the vector representing the deviation amount when the ratio of the number of vectors representing the deviation amount to the total number of motion vectors in the coding target panoramic image before correction is larger than a predetermined value.
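The ratio test in Appendix 3 might look like the following sketch. It reads "the number of vectors representing the deviation amount" as the number of motion vectors equal to the most frequent one; the function name and the default threshold of 0.5 are hypothetical, since the appendix only speaks of a predetermined value.

```python
from collections import Counter

def should_correct_by_ratio(motion_vectors, threshold=0.5):
    """Decide whether to apply the position correction.

    Returns (decision, vector, ratio), where ratio is the share of motion
    vectors equal to the most frequent one and decision is ratio > threshold.
    The threshold value is an illustrative assumption.
    """
    if not motion_vectors:
        raise ValueError("at least one motion vector is required")
    vector, count = Counter(motion_vectors).most_common(1)[0]
    ratio = count / len(motion_vectors)
    return ratio > threshold, vector, ratio

# Hypothetical motion vectors of five coding target areas.
vectors = [(16, 0), (16, 0), (0, 0), (16, 0), (-4, 2)]
decision, vector, ratio = should_correct_by_ratio(vectors)
print(decision, vector, ratio)
```

The comparison is strict, matching the wording "larger than a predetermined value": a ratio exactly equal to the threshold does not trigger the correction.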
(Appendix 4)
The moving image coding device according to Appendix 2, wherein the correction unit obtains a first difference based on a first motion estimation using the reference panoramic image and the coding target panoramic image before correction, obtains a second difference based on a second motion estimation using the reference panoramic image and the corrected coding target panoramic image, and, when the second difference is smaller than the first difference, corrects the position of each of the plurality of coding target areas before correction based on the vector representing the deviation amount.
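The comparison in Appendix 4 can be sketched under simplifying assumptions. The sketch treats one pixel row of the expanded panoramic image, models the position correction as a cyclic horizontal shift, and uses the sum of absolute differences (SAD) against the reference in place of a full motion estimation. All three choices, and every name below, are illustrative assumptions rather than the method of the embodiments.

```python
def cyclic_shift(row, dx):
    """Shift a row of pixels right by dx positions, wrapping around."""
    n = len(row)
    dx %= n
    return row[-dx:] + row[:-dx] if dx else list(row)

def sad(a, b):
    """Sum of absolute differences between two equally long rows."""
    return sum(abs(x - y) for x, y in zip(a, b))

def choose_correction(reference, target, dx):
    """Keep the shifted row only if it lowers the difference to the reference.

    Returns (row_to_code, corrected_flag).
    """
    first = sad(reference, target)        # difference before correction
    corrected = cyclic_shift(target, dx)
    second = sad(reference, corrected)    # difference after correction
    if second < first:
        return corrected, True
    return list(target), False

# Hypothetical data: the target is the reference displaced left by 2 pixels.
reference = [10, 20, 30, 40, 50, 60]
target = [30, 40, 50, 60, 10, 20]
row, corrected_flag = choose_correction(reference, target, 2)
print(row, corrected_flag)
```

In this example the difference drops from 160 to 0 after the shift, so the corrected row is selected.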
(Appendix 5)
The moving image coding device according to Appendix 2, wherein the correction unit obtains a quantization scale based on a first motion estimation using the reference panoramic image and the coding target panoramic image before correction, together with the information amount of the code based on the first motion estimation; obtains a quantization scale based on a second motion estimation using the reference panoramic image and the corrected coding target panoramic image, together with the information amount of the code based on the second motion estimation; and, when the product of the quantization scale and the information amount of the code based on the second motion estimation is smaller than the corresponding product based on the first motion estimation, corrects the position of each of the plurality of coding target areas before correction based on the vector representing the deviation amount.
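The criterion in Appendix 5 reduces to comparing two products, as the following minimal sketch shows. The function names and the sample figures are hypothetical; how the quantization scale and the code amount are measured is left to the encoder and is not modeled here.

```python
def coding_cost(quantization_scale, code_bits):
    """Product of quantization scale and information amount of the code."""
    return quantization_scale * code_bits

def prefer_corrected(first, second):
    """first, second: (quantization_scale, code_bits) obtained from the
    first (uncorrected) and second (corrected) motion estimation."""
    return coding_cost(*second) < coding_cost(*first)

# Hypothetical measurements for one picture.
before = (28, 120_000)   # product: 3,360,000
after = (24, 100_000)    # product: 2,400,000
print(prefer_corrected(before, after))
```

Using the product rather than the code amount alone keeps the comparison meaningful when rate control picks different quantization scales for the two trial encodings.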
(Appendix 6)
A moving image coding device comprising:
a storage unit that stores a reference image used to code a coding target panoramic image included in a panoramic video obtained by combining a plurality of videos captured by a plurality of imaging devices;
a determination unit that, when a coding target area in the coding target panoramic image is displaced relative to the reference image because the shooting range of each of the plurality of videos has moved, determines a vector representing the deviation amount of the coding target area relative to the reference image;
a correction unit that generates a corrected coding target area by correcting the position of the coding target area in the coding target panoramic image based on the vector representing the deviation amount; and
a coding unit that codes the image of the corrected coding target area using the reference image.
(Appendix 7)
A moving image decoding device that decodes a coded panoramic video including a coded panoramic image generated by coding a coding target panoramic image obtained by expanding a panoramic image included in a panoramic video captured by an imaging device, the moving image decoding device comprising:
a storage unit that stores a reference panoramic image used to decode the coded panoramic image;
an extraction unit that extracts, from the coded panoramic video, a vector representing the deviation amount of the coding target panoramic image relative to the reference panoramic image;
a decoding unit that generates a decoded panoramic image by decoding each of a plurality of decoding target areas in the coded panoramic image using the reference panoramic image; and
a correction unit that restores the coding target panoramic image by correcting the position of each of the plurality of decoding target areas in the decoded panoramic image based on the vector representing the deviation amount.
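The relationship between the encoder-side correction and the decoder-side correction in Appendix 7 can be sketched as a round trip. The sketch assumes the correction is a cyclic horizontal shift of a pixel row and that coding itself is lossless, so only the position handling is shown; the function names are hypothetical.

```python
def cyclic_shift(row, dx):
    """Shift a row of pixels right by dx positions, wrapping around."""
    n = len(row)
    dx %= n
    return row[-dx:] + row[:-dx] if dx else list(row)

def encoder_correct(row, dx):
    """Encoder side: move the areas by the shift before coding."""
    return cyclic_shift(row, dx)

def decoder_restore(decoded_row, dx):
    """Decoder side: undo the shift using the vector read from the stream."""
    return cyclic_shift(decoded_row, -dx)

# Hypothetical row of the coding target panoramic image and shift amount.
original = [30, 40, 50, 60, 10, 20]
dx = 2
corrected = encoder_correct(original, dx)
restored = decoder_restore(corrected, dx)
print(corrected, restored)
```

Because the decoder applies the opposite shift with the same vector, the restored row equals the original coding target row in this lossless setting.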
(Appendix 8)
A moving image coding program that causes a computer to execute processing comprising:
determining, using a reference panoramic image used to code a coding target panoramic image obtained by expanding a panoramic image included in a panoramic video captured by an imaging device, a vector representing the deviation amount of the coding target panoramic image relative to the reference panoramic image;
generating a corrected coding target panoramic image by correcting the position of each of a plurality of coding target areas in the coding target panoramic image based on the vector representing the deviation amount; and
coding the image of each of the plurality of coding target areas in the corrected coding target panoramic image using the reference panoramic image.
(Appendix 9)
The moving image coding program according to Appendix 8, wherein the computer determines, as the vector representing the deviation amount, the motion vector that occurs more often than the other motion vectors among the motion vectors of the plurality of coding target areas before correction.
(Appendix 10)
A moving image coding program that causes a computer to execute processing comprising:
determining, when a coding target area in a coding target panoramic image is displaced relative to a reference image because the shooting range of each of a plurality of videos captured by a plurality of imaging devices has moved, a vector representing the deviation amount of the coding target area relative to the reference image, the reference image being used to code the coding target panoramic image included in a panoramic video obtained by combining the plurality of videos;
generating a corrected coding target area by correcting the position of the coding target area in the coding target panoramic image based on the vector representing the deviation amount; and
coding the image of the corrected coding target area using the reference image.
(Appendix 11)
A moving image decoding program that causes a computer, which decodes a coded panoramic video including a coded panoramic image generated by coding a coding target panoramic image obtained by expanding a panoramic image included in a panoramic video captured by an imaging device, to execute processing comprising:
extracting, from the coded panoramic video, a vector representing the deviation amount of the coding target panoramic image relative to a reference panoramic image used to decode the coded panoramic image;
generating a decoded panoramic image by decoding each of a plurality of decoding target areas in the coded panoramic image using the reference panoramic image; and
restoring the coding target panoramic image by correcting the position of each of the plurality of decoding target areas in the decoded panoramic image based on the vector representing the deviation amount.
(Appendix 12)
A moving image coding method in which a computer:
determines, using a reference panoramic image used to code a coding target panoramic image obtained by expanding a panoramic image included in a panoramic video captured by an imaging device, a vector representing the deviation amount of the coding target panoramic image relative to the reference panoramic image;
generates a corrected coding target panoramic image by correcting the position of each of a plurality of coding target areas in the coding target panoramic image based on the vector representing the deviation amount; and
codes the image of each of the plurality of coding target areas in the corrected coding target panoramic image using the reference panoramic image.
(Appendix 13)
The moving image coding method according to Appendix 12, wherein the computer determines, as the vector representing the deviation amount, the motion vector that occurs more often than the other motion vectors among the motion vectors of the plurality of coding target areas before correction.
(Appendix 14)
A moving image coding method in which a computer:
determines, when a coding target area in a coding target panoramic image is displaced relative to a reference image because the shooting range of each of a plurality of videos captured by a plurality of imaging devices has moved, a vector representing the deviation amount of the coding target area relative to the reference image, the reference image being used to code the coding target panoramic image included in a panoramic video obtained by combining the plurality of videos;
generates a corrected coding target area by correcting the position of the coding target area in the coding target panoramic image based on the vector representing the deviation amount; and
codes the image of the corrected coding target area using the reference image.
(Appendix 15)
A moving image decoding method in which a computer that decodes a coded panoramic video including a coded panoramic image generated by coding a coding target panoramic image obtained by expanding a panoramic image included in a panoramic video captured by an imaging device:
extracts, from the coded panoramic video, a vector representing the deviation amount of the coding target panoramic image relative to a reference panoramic image used to decode the coded panoramic image;
generates a decoded panoramic image by decoding each of a plurality of decoding target areas in the coded panoramic image using the reference panoramic image; and
restores the coding target panoramic image by correcting the position of each of the plurality of decoding target areas in the decoded panoramic image based on the vector representing the deviation amount.

101, 201, 202, 301, 1301 to 1303, 1401, 1402 Panoramic image
102 Boundary line
103, 203, 204, 321 to 323 Expanded panoramic image
205 Omnidirectional panoramic image
311 to 314 Region
401, 601, 1014 Moving image coding device
411, 611, 811 Storage unit
412, 612, 1103 Determination unit
413, 613, 814 Correction unit
414, 614 Coding unit
421, 821, 1201 Reference panoramic image
621 Reference image
801, 2001 Moving image decoding device
812, 2011 Extraction unit
813 Decoding unit
1001 Panoramic video coding system
1011 Imaging unit
1012 Synthesis unit
1013 Expansion unit
1101, 1104, 2012 Change unit
1102 Judgment unit
1105 Subtraction unit
1109, 2014 Addition unit
1110, 2015 Motion compensation unit
1111, 2016 Predicted image generation unit
1112, 2017 Frame memory
1202, 1203 Coding target panoramic image
1211 Motion vector
2013 Block decoding unit
2301 CPU
2302 Memory
2303 Input device
2304 Output device
2305 Auxiliary storage device
2306 Medium drive device
2307 Network connection device
2308 Bus
2309 Portable recording medium

Claims

A moving image coding device comprising:
a storage unit that stores a reference image used to code a coding target panoramic image included in a panoramic video obtained by combining a plurality of videos captured by a plurality of imaging devices;
a determination unit that, when the coding target panoramic image is displaced relative to the reference image because the shooting range of each of the plurality of videos has moved, selects the motion vector of a specific coding target area from among the motion vectors of a plurality of coding target areas in the coding target panoramic image, and determines the motion vector of the specific coding target area as a vector representing the deviation amount of the plurality of coding target areas relative to the reference image;
a correction unit that generates corrected coding target areas by correcting the position of each of the plurality of coding target areas in the coding target panoramic image based on the vector representing the deviation amount; and
a coding unit that codes the images of the corrected coding target areas using the reference image.

The moving image coding device according to claim 1, wherein the specific coding target area is a coding target area corresponding to a position designated by a user among the plurality of coding target areas.
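Selecting the motion vector of the area at a user-designated position could be sketched as follows. The sketch assumes the coding target areas are square blocks on a regular grid and that the position is given in pixel coordinates; the function name, the block size, and the sample vectors are hypothetical.

```python
def vector_at_position(motion_vectors, block_size, x, y):
    """Return the motion vector of the block containing pixel (x, y).

    motion_vectors: 2-D list indexed as [block_row][block_col].
    block_size: side length of each square block in pixels (assumption).
    """
    if x < 0 or y < 0:
        raise ValueError("pixel coordinates must be non-negative")
    block_row, block_col = y // block_size, x // block_size
    return motion_vectors[block_row][block_col]

# Hypothetical 2 x 3 grid of per-block motion vectors, 64-pixel blocks.
grid = [
    [(0, 0), (16, 0), (16, 0)],
    [(-4, 2), (16, 0), (8, 0)],
]
print(vector_at_position(grid, 64, x=130, y=70))
```

The pixel (130, 70) falls in block column 2 and block row 1, so the vector (8, 0) of that block would be used as the vector representing the deviation amount for all areas.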

A moving image coding program that causes a computer to execute processing comprising:
selecting, when a coding target panoramic image is displaced relative to a reference image because the shooting range of each of a plurality of videos captured by a plurality of imaging devices has moved, the motion vector of a specific coding target area from among the motion vectors of a plurality of coding target areas in the coding target panoramic image, the reference image being used to code the coding target panoramic image included in a panoramic video obtained by combining the plurality of videos;
determining the motion vector of the specific coding target area as a vector representing the deviation amount of the plurality of coding target areas relative to the reference image;
generating corrected coding target areas by correcting the position of each of the plurality of coding target areas in the coding target panoramic image based on the vector representing the deviation amount; and
coding the images of the corrected coding target areas using the reference image.

The moving image coding program according to claim 3, wherein the specific coding target area is a coding target area corresponding to a position designated by a user among the plurality of coding target areas.

A moving image coding method in which a computer:
selects, when a coding target panoramic image is displaced relative to a reference image because the shooting range of each of a plurality of videos captured by a plurality of imaging devices has moved, the motion vector of a specific coding target area from among the motion vectors of a plurality of coding target areas in the coding target panoramic image, the reference image being used to code the coding target panoramic image included in a panoramic video obtained by combining the plurality of videos;
determines the motion vector of the specific coding target area as a vector representing the deviation amount of the plurality of coding target areas relative to the reference image;
generates corrected coding target areas by correcting the position of each of the plurality of coding target areas in the coding target panoramic image based on the vector representing the deviation amount; and
codes the images of the corrected coding target areas using the reference image.

The moving image coding method according to claim 5, wherein the specific coding target area is a coding target area corresponding to a position designated by a user among the plurality of coding target areas.