JP2018056686A

JP2018056686A - Image encoder, image encoding method and image encoding program, and image decoder, image decoding method and image decoding program

Info

Publication number: JP2018056686A
Application number: JP2016188129A
Authority: JP
Inventors: 佐藤　数史; Kazufumi Sato; 数史佐藤
Original assignee: Dwango Co Ltd
Current assignee: Dwango Co Ltd
Priority date: 2016-09-27
Filing date: 2016-09-27
Publication date: 2018-04-05
Also published as: WO2018061589A1

Abstract

PROBLEM TO BE SOLVED: To provide an image encoder which enables the increase in encoding efficiency when an image produced by projecting a 360-degree whole sky image onto a hexahedron is encoded in such a way that the image is allocated to regions in a frame, except some regions.SOLUTION: In an image encoder, a picture division part 104 divides each frame of image information into one or more of first tiles set on an allocation region to which an image is allocated, and one or more of second tiles set on a non-allocation region; a coded data generation part encodes an image of the one or more of first tiles to generate coded data; a syntax generation part (entropy encoding part 5) generates syntax including syntax elements which define each of first and second tiles in frame by horizontal and vertical positions of an upper left corner portion, and horizontal and vertical sizes; and a bit stream generation part (entropy encoding part 5) generates bet streams including coded data and the syntax.SELECTED DRAWING: Figure 6

Description

本開示は、画像符号化装置、画像符号化方法、及び画像符号化プログラム、並びに、画像復号装置、画像復号方法、及び画像復号プログラムに関する。 The present disclosure relates to an image encoding device, an image encoding method, an image encoding program, an image decoding device, an image decoding method, and an image decoding program.

H.264/MPEG-4 AVC (Advanced Video Coding)よりもさらに画像情報を高効率に符号化することができる圧縮符号化方式H.265/MPEG-H HEVC(High Efficiency Video Coding)（以下、HEVCと略記する）が標準化されている（非特許文献１参照）。HEVCにおいては、フレーム（ピクチャ）を構成する画素を符号化ツリーユニット（ＣＴＵ：Coding Tree Unit）と称されるブロックを分割単位として分割するタイルが規定されている。 H.265 / MPEG-H HEVC (High Efficiency Video Coding) (hereinafter referred to as HEVC), which can encode image information more efficiently than H.264 / MPEG-4 AVC (Advanced Video Coding). Are abbreviated to be standardized) (see Non-Patent Document 1). In HEVC, tiles are defined that divide pixels constituting a frame (picture) into blocks called coding tree units (CTUs) as division units.

非特許文献２には、図１に示すように、ヘッドマウントディスプレイによって視認する３６０度の全天画像４０１を六面体４０２の面４０２Ａ〜４０２Ｆに投影し、面４０２Ａ〜４０２Ｆに投影した画像を、図２に示すように、フレーム内の６つの領域に展開することが記載されている。図２において、ハッチングを付していない領域は画像が割り当てられている割り当て領域であり、ハッチングを付した領域は、画像が割り当てられていない非割り当て領域である。 In Non-Patent Document 2, as shown in FIG. 1, 360-degree all-sky image 401 visually recognized by a head-mounted display is projected onto surfaces 402A to 402F of hexahedron 402, and images projected onto surfaces 402A to 402F are illustrated in FIG. As shown in FIG. 2, it is described that the image is expanded into six regions in the frame. In FIG. 2, an area that is not hatched is an allocated area to which an image is assigned, and an area that is hatched is an unassigned area to which no image is assigned.

ヘッドマウントディスプレイを装着したユーザは、図２に示す画像をヘッドマウントディスプレイで視認することによって、目の前に球状に広がる仮想現実画像（ＶＲ画像）のコンテンツを鑑賞することができる。 A user wearing a head-mounted display can view the contents of a virtual reality image (VR image) spreading in a spherical shape in front of the eyes by viewing the image shown in FIG. 2 with the head-mounted display.

” High Efficiency Video Coding (HEVC) text specification draft 10 (for FDIS & Last Call)”, JCTVC-L1003, January 2013”High Efficiency Video Coding (HEVC) text specification draft 10 (for FDIS & Last Call)”, JCTVC-L1003, January 2013 ”GoPro test sequences for Virtual Reality video coding”, JVET-C0021, May 2016“GoPro test sequences for Virtual Reality video coding”, JVET-C0021, May 2016

ところが、HEVCのタイルでは、図２に示す画像を図３または図４に示すように分割して符号化することはできない。また、図２における非割り当て領域では、本来であれば符号化データを画像復号装置に送信して復号する必要がないが、HEVCではスキップモード、ダイレクトモード、イントラ予測におけるＤＣ（直流）モード等を用いて符号化する必要がある。従って、画像符号化装置は非割り当て領域でも符号を伝送して、画像復号装置は伝送された符号を復号しなければならない。 However, in the HEVC tile, the image shown in FIG. 2 cannot be divided and encoded as shown in FIG. 3 or FIG. In the non-allocation area in FIG. 2, originally, it is unnecessary to transmit the encoded data to the image decoding apparatus and decode it. However, in HEVC, skip mode, direct mode, DC (direct current) mode in intra prediction, etc. Need to be encoded. Therefore, the image encoding device must transmit the code even in the non-allocation area, and the image decoding device must decode the transmitted code.

実施形態は、３６０度の全天画像を六面体に投影した画像をフレーム内の一部の領域を除いた領域に割り当てて符号化する際の符号化効率を向上させ、符号化または復号の処理量を低減させることができる画像符号化装置、画像符号化方法、及び画像符号化プログラム、並びに、画像復号装置、画像復号方法、及び画像復号プログラムを提供することを目的とする。 The embodiment improves the encoding efficiency when encoding an image obtained by projecting a 360-degree all-sky image onto a hexahedron to an area excluding a part of the area in the frame, and the amount of encoding or decoding processing It is an object to provide an image encoding device, an image encoding method, an image encoding program, an image decoding device, an image decoding method, and an image decoding program.

実施形態の第１の態様によれば、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報が入力され、前記画像情報の各フレームを、前記割り当て領域に設定された１または複数の第１のタイルと、前記非割り当て領域に設定された１または複数の第２のタイルとに分割するピクチャ分割部と、前記第１のタイルの画像を符号化して符号化データを生成する符号化データ生成部と、前記第１及び第２のタイルそれぞれを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義した第１のシンタックス要素を含むシンタックスを生成するシンタックス生成部と、前記符号化データと前記シンタックスとを含むビットストリームを生成するビットストリーム生成部とを備えることを特徴とする画像符号化装置が提供される。 According to the first aspect of the embodiment, image information in a format including an allocated area in which an image is allocated in a frame and an unallocated area in which no image is allocated is input, and each frame of the image information is A picture dividing unit that divides into one or a plurality of first tiles set in the allocation area and one or a plurality of second tiles set in the non-allocation area, and an image of the first tile The encoded data generation unit that generates encoded data by encoding the image, and the first and second tiles are defined by the horizontal position and vertical position of the upper left corner in the frame, and the horizontal size and vertical size, respectively. Generating a syntax including the first syntax element, and generating a bitstream including the encoded data and the syntax The image coding apparatus is provided, characterized in that it comprises a appropriate bit stream generation unit.

実施形態の第２の態様によれば、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報の各フレームを、前記割り当て領域に設定された１または複数の第１のタイルと、前記非割り当て領域に設定された１または複数の第２のタイルとに分割し、前記第１のタイルの画像を符号化して符号化データを生成し、前記第１及び第２のタイルそれぞれを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義したシンタックス要素を含むシンタックスを生成し、前記符号化データと前記シンタックスとを含むビットストリームを生成することを特徴とする画像符号化方法が提供される。 According to the second aspect of the embodiment, each frame of image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated is set as the allocation area. Dividing into one or a plurality of first tiles and one or a plurality of second tiles set in the non-allocation area, encoding an image of the first tiles to generate encoded data, Each of the first and second tiles generates a syntax including syntax elements defined by a horizontal position and a vertical position of the upper left corner in the frame, a horizontal size and a vertical size, and the encoded data; An image encoding method is provided that generates a bitstream including the syntax.

実施形態の第３の態様によれば、コンピュータに、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報の各フレームを、前記割り当て領域に設定された１または複数の第１のタイルと、前記非割り当て領域に設定された１または複数の第２のタイルとに分割するステップと、前記第１のタイルの画像を符号化して符号化データを生成するステップと、前記第１及び第２のタイルそれぞれを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義したシンタックス要素を含むシンタックスを生成するステップと、前記符号化データと前記シンタックスとを含むビットストリームを生成するステップとを実行させることを特徴とする画像符号化プログラムが提供される。 According to the third aspect of the embodiment, each frame of image information in a format including an allocation area to which an image is allocated in a frame and a non-allocation area to which no image is allocated is assigned to the computer. Dividing into one or more first tiles set to 1 and a plurality of second tiles set to the non-allocation area, and encoding and encoding an image of the first tile Generating data, and generating a syntax including a syntax element for each of the first and second tiles defined by a horizontal position and a vertical position of the upper left corner in the frame, and a horizontal size and a vertical size. And generating a bitstream including the encoded data and the syntax. Image coding program is provided.

実施形態の第４の態様によれば、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報であり、前記画像情報の各フレームが、前記割り当て領域に設定された１または複数の第１のタイルと、前記非割り当て領域に設定された１または複数の第２のタイルとに分割され、前記第１のタイルの画像が符号化された符号化データと、前記第１及び第２のタイルそれぞれを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義した第１のシンタックス要素を含むシンタックスとを含むビットストリームを受信するビットストリーム受信部と、前記符号化データを復号して前記第１のタイルの復号画像を生成する復号部と、前記第１のシンタックス要素に基づいて、前記第１のタイルの復号画像と前記第２のタイルの単一画像とを合成して各フレームの復号画像情報を生成するピクチャ合成部とを備えることを特徴とする画像復号装置が提供される。 According to the fourth aspect of the embodiment, it is image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and each frame of the image information includes: The image is divided into one or more first tiles set in the allocated area and one or more second tiles set in the non-allocated area, and the image of the first tile is encoded A syntax including encoded data and a first syntax element that defines each of the first and second tiles by a horizontal position and a vertical position of the upper left corner in the frame, and a horizontal size and a vertical size; A bitstream receiving unit that receives a bitstream including: a decoding unit that decodes the encoded data to generate a decoded image of the first tile; and the first An image comprising: a picture composition unit that composes a decoded image of the first tile and a single image of the second tile based on a syntax element to generate decoded image information of each frame. A decoding device is provided.

実施形態の第５の態様によれば、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報であり、前記画像情報の各フレームが、前記割り当て領域に設定された１または複数の第１のタイルと、前記非割り当て領域に設定された１または複数の第２のタイルとに分割され、前記第１のタイルの画像が符号化された符号化データと、前記第１及び第２のタイルそれぞれを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義したシンタックス要素を含むシンタックスとを含むビットストリームを受信し、前記符号化データを復号して前記第１のタイルの復号画像を生成し、前記シンタックス要素に基づいて、前記第１のタイルの復号画像と前記第２のタイルの単一画像とを合成して各フレームの復号画像情報を生成することを特徴とする画像復号方法が提供される。 According to the fifth aspect of the embodiment, the image information has a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and each frame of the image information includes: The image is divided into one or more first tiles set in the allocated area and one or more second tiles set in the non-allocated area, and the image of the first tile is encoded A bit including encoded data and a syntax including a syntax element in which each of the first and second tiles is defined by the horizontal position and vertical position of the upper left corner in the frame and the horizontal size and vertical size. Receiving a stream, decoding the encoded data to generate a decoded image of the first tile, and decoding the first tile based on the syntax element. Image decoding method characterized by by combining the single image of the image and the second tile to generate a decoded image information of each frame is provided.

実施形態の第６の態様によれば、コンピュータに、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報であり、前記画像情報の各フレームが、前記割り当て領域に設定された１または複数の第１のタイルと、前記非割り当て領域に設定された１または複数の第２のタイルとに分割され、前記第１のタイルの画像が符号化された符号化データと、前記第１及び第２のタイルそれぞれを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義したシンタックス要素を含むシンタックスとを含むビットストリームを受信するステップと、前記符号化データを復号して前記第１のタイルの復号画像を生成受信するステップと、前記シンタックス要素に基づいて、前記第１のタイルの復号画像と前記第２のタイルの単一画像とを合成して各フレームの復号画像情報を生成するステップとを実行させることを特徴とする画像復号プログラムが提供される。 According to the sixth aspect of the embodiment, the image information is in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated in the computer. A frame is divided into one or a plurality of first tiles set in the allocated area and one or a plurality of second tiles set in the non-allocated area, and an image of the first tile is encoded. Encoded syntax data, and a syntax including syntax elements defined by the horizontal position and vertical position of the upper left corner of the frame, and the horizontal size and vertical size, respectively, of the first and second tiles. Receiving a bitstream including: decoding the encoded data to generate and receive a decoded image of the first tile; and And a step of generating a decoded image information of each frame by combining the decoded image of the first tile and the single image of the second tile based on a block element. A program is provided.

実施形態の画像符号化装置、画像符号化方法、及び画像符号化プログラム、並びに、画像復号装置、画像復号方法、及び画像復号プログラムによれば、３６０度の全天画像を六面体に投影した画像をフレーム内の一部の領域を除いた領域に割り当てて符号化する際の符号化効率を向上させ、符号化または復号の処理量を低減させることができる。 According to the image encoding device, the image encoding method, and the image encoding program of the embodiment, and the image decoding device, the image decoding method, and the image decoding program, an image obtained by projecting a 360-degree all-sky image onto a hexahedron is obtained. It is possible to improve encoding efficiency when encoding is performed by allocating to a region excluding a part of the region in the frame, and the processing amount of encoding or decoding can be reduced.

図１は、３６０度の全天画像と、３６０度の全天画像が投影される六面体を示す斜視図である。FIG. 1 is a perspective view showing a 360 degree whole sky image and a hexahedron onto which the 360 degree whole sky image is projected. 図２は、３６０度の全天画像を六面体に投影した画像をフレーム内の６つの領域に展開した状態を示す図である。FIG. 2 is a diagram illustrating a state in which an image obtained by projecting a 360-degree all-sky image onto a hexahedron is developed into six regions in the frame. 図３は、HEVCのタイルによって分割して符号化することができない第１の例を示す図である。FIG. 3 is a diagram illustrating a first example that cannot be divided and encoded by HEVC tiles. 図４は、図３は、HEVCのタイルによって分割して符号化することができない第２の例を示す図である。FIG. 4 is a diagram illustrating a second example in which FIG. 3 cannot be divided and encoded by HEVC tiles. 図５は、画像符号化装置及び画像復号装置を含む画像符号化・復号システムの全体的な構成例を示すブロック図である。FIG. 5 is a block diagram illustrating an overall configuration example of an image encoding / decoding system including an image encoding device and an image decoding device. 図６は、一実施形態の画像符号化装置を示すブロック図である。FIG. 6 is a block diagram illustrating an image encoding device according to an embodiment. 図７は、複数のスライスに分割されたフレームの構成例を示す図である。FIG. 7 is a diagram illustrating a configuration example of a frame divided into a plurality of slices. 図８は、複数のタイルに分割されたフレームの構成例を示す図である。FIG. 8 is a diagram illustrating a configuration example of a frame divided into a plurality of tiles. 図９は、スライスとタイルとの関係の例を示す図である。FIG. 9 is a diagram illustrating an example of the relationship between slices and tiles. 図１０は、各タイルの位置及び大きさを示す図である。FIG. 10 is a diagram showing the position and size of each tile. 図１１は、各タイルの位置、大きさ、スキップライルフラグを表形式で示す図である。FIG. 11 is a diagram showing the position, size, and skipline flag of each tile in a table format. 図１２は、あるタイルを符号化または復号する際に他のタイルを参照する一例を示す図である。FIG. 12 is a diagram illustrating an example in which another tile is referred to when a certain tile is encoded or decoded. 図１３は、一実施形態の画像符号化装置が伝送するシンタックスの一例を示す図である。FIG. 13 is a diagram illustrating an example of syntax transmitted by the image encoding device according to the embodiment. 図１４は、一実施形態の画像符号化装置の動作、一実施形態の画像符号化方法による処理、一実施形態の画像符号化プログラムで実行される処理を示すフローチャートである。FIG. 14 is a flowchart illustrating the operation of the image encoding device according to the embodiment, the processing by the image encoding method according to the embodiment, and the processing executed by the image encoding program according to the embodiment. 図１５Ａは、一実施形態の画像符号化プログラムを記憶する記憶部を備えるコンピュータの概略的な構成を示すブロック図である。FIG. 15A is a block diagram illustrating a schematic configuration of a computer including a storage unit that stores an image encoding program according to an embodiment. 図１５Ｂは、一実施形態の画像復号プログラムを記憶する記憶部を備えるコンピュータの概略的な構成を示すブロック図である。FIG. 15B is a block diagram illustrating a schematic configuration of a computer including a storage unit that stores an image decoding program according to an embodiment. 図１６は、一実施形態の画像復号装置を示すブロック図である。FIG. 16 is a block diagram illustrating an image decoding apparatus according to an embodiment. 図１７は、一実施形態の画像復号装置の動作、一実施形態の画像復号方法による処理、一実施形態の画像復号プログラムで実行される処理を示すフローチャートである。FIG. 17 is a flowchart illustrating the operation of the image decoding apparatus according to the embodiment, the processing by the image decoding method according to the embodiment, and the processing executed by the image decoding program according to the embodiment. 図１８は、第１の変形例を説明するための図である。FIG. 18 is a diagram for explaining the first modification. 図１９は、第２の変形例を説明するための図である。FIG. 19 is a diagram for explaining the second modification. 図２０は、第３の変形例を説明するための図である。FIG. 20 is a diagram for explaining a third modification.

以下、一実施形態の画像符号化装置、画像符号化方法、及び画像符号化プログラム、並びに、画像復号装置、画像復号方法、及び画像復号プログラムについて、添付図面を参照して説明する。 Hereinafter, an image encoding device, an image encoding method, an image encoding program, an image decoding device, an image decoding method, and an image decoding program according to an embodiment will be described with reference to the accompanying drawings.

まず、図５を用いて、画像符号化装置及び画像復号装置を含む画像符号化・復号システムの全体的な構成例について説明する。図５において、前処理装置５０は、ユーザによる操作に従って、一実施形態の画像符号化装置１００による画像情報の符号化のための各種の条件を設定する。画像符号化装置１００は、前処理装置５０からの設定情報に応じて、画像情報を符号化してビットストリームを出力する。ビットストリームは、符号化された画像情報及びシンタックスを含む。 First, an overall configuration example of an image encoding / decoding system including an image encoding device and an image decoding device will be described with reference to FIG. In FIG. 5, the preprocessing device 50 sets various conditions for encoding image information by the image encoding device 100 according to the embodiment in accordance with an operation by the user. The image encoding device 100 encodes the image information according to the setting information from the preprocessing device 50 and outputs a bit stream. The bit stream includes encoded image information and syntax.

送信装置１１０は、ビットストリームを所定の伝送路１２０に送信する。伝送路１２０は有線または無線であり、インターネットのような通信回線、電話回線、テレビジョン信号を送信する地上波放送または衛星放送用の電波のいずれでもよい。受信装置１５０は、伝送路１２０によって送信されたビットストリームを受信する。一実施形態の画像復号装置２００は、ビットストリームを復号して復号画像情報を出力する。 The transmission device 110 transmits a bit stream to a predetermined transmission path 120. The transmission path 120 is wired or wireless, and may be a communication line such as the Internet, a telephone line, or a radio wave for terrestrial broadcasting or satellite broadcasting that transmits a television signal. The receiving device 150 receives the bit stream transmitted through the transmission path 120. The image decoding apparatus 200 according to an embodiment decodes a bit stream and outputs decoded image information.

＜画像符号化装置＞
図６は、画像符号化装置１００の具体的な構成例を示す。図６において、画像符号化装置１００に入力される画像情報は、図２に示すような、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報である。 <Image encoding device>
FIG. 6 shows a specific configuration example of the image encoding device 100. In FIG. 6, the image information input to the image coding apparatus 100 is in a format including an allocation area where an image is allocated in a frame and a non-allocation area where no image is allocated, as shown in FIG. Image information.

並べ替え（Re-order）バッファ１には、デジタル信号の画像情報の画素が順に入力される。画像情報がアナログ信号であれば、並べ替えバッファ１の前段でＡ／Ｄ変換器によってデジタル信号に変換されていればよい。画像情報は例えば輝度信号Ｙ（以下、Ｙ信号）と色差信号Ｃｂ及びＣｒ（以下、Ｃｂ及びＣｒ信号）であり、４：２：０フォーマットのＹ，Ｃｂ，Ｃｒ映像信号を例とする。 Pixels of image information of digital signals are sequentially input to the reorder buffer 1. If the image information is an analog signal, it may be converted into a digital signal by the A / D converter in the previous stage of the rearrangement buffer 1. The image information is, for example, a luminance signal Y (hereinafter referred to as a Y signal) and color difference signals Cb and Cr (hereinafter referred to as a Cb and Cr signal).

並べ替えバッファ１は入力された画素を複数フレーム分蓄積し、必要に応じてフレーム（ピクチャ）を並べ替えて読み出す。画像情報の各フレームは、フレーム内の画素を用いて符号化するＩピクチャ、過去のフレーム内の画素を用いて予測符号化するＰピクチャ、過去及び未来のフレーム内の画素を用いて予測符号化するＢピクチャのいずれかに設定される。Ｉピクチャ、Ｐピクチャ、Ｂピクチャは前処理装置５０によって設定されてもよいし、画像符号化装置１００が予め定めた規則に従って選択してもよい。 The rearrangement buffer 1 accumulates input pixels for a plurality of frames, and rearranges and reads frames (pictures) as necessary. Each frame of image information is encoded using an I picture that is encoded using pixels in the frame, a P picture that is encoded predictively using pixels in a past frame, and a predictive encoding using pixels in past and future frames Is set to one of the B pictures. The I picture, P picture, and B picture may be set by the preprocessing device 50, or may be selected by the image encoding device 100 according to a predetermined rule.

並べ替えバッファ１は、後述するビットストリームを構成するシーケンスの構成単位となる複数のピクチャ群（ＧＯＰ）にＢピクチャが含まれる場合には、符号化のためにフレームの順を並び替えて読み出す。並べ替えバッファ１より読み出された各フレームの画素は、ピクチャ分割部１０４を介して減算器２に供給される。 The rearrangement buffer 1 rearranges the order of frames for encoding when a plurality of picture groups (GOPs), which are constituent units of a sequence constituting a bit stream to be described later, are included. The pixels of each frame read from the rearrangement buffer 1 are supplied to the subtracter 2 via the picture dividing unit 104.

タイル分割設定部１０１、スキップタイル設定部１０２、参照タイル設定部１０３、ピクチャ分割部１０４の動作を説明する前に、減算器２以降の動作を説明する。 Before describing the operations of the tile division setting unit 101, the skip tile setting unit 102, the reference tile setting unit 103, and the picture division unit 104, operations after the subtractor 2 will be described.

減算器２に入力される各フレームは、例えば水平６４画素、垂直６４画素のＣＴＵに分割され、各ＣＴＵは、再帰的な四分木ブロック分割に基づいて可変サイズの符号化ユニット（ＣＵ：Coding Unit）に分割されることがある。最大サイズのＣＵは最大符号化ユニット（ＬＣＵ：Largest Coding Unit）と称され、最小サイズのＣＵは最小符号化ユニット（ＳＣＵ：Smallest Coding Unit）と称される。 Each frame input to the subtracter 2 is divided into, for example, horizontal 64 pixel and vertical 64 pixel CTUs. Each CTU is divided into a variable size coding unit (CU: Coding) based on recursive quadtree block division. Unit). The maximum size CU is referred to as a maximum coding unit (LCU), and the minimum size CU is referred to as a minimum coding unit (SCU).

ＣＵは、イントラ予測部１４及びインター予測部１５における予測処理のために予測ユニット（ＰＵ：Prediction Unit）に分割される。また、ＣＵは、直交変換部３における直交変換処理及び量子化部４における量子化処理のために、再帰的な四分木ブロック分割に基づいて可変サイズの変換ユニット（ＴＵ：Transform Unit）に分割される。 The CU is divided into prediction units (PUs) for prediction processing in the intra prediction unit 14 and the inter prediction unit 15. Further, the CU is divided into variable-size transform units (TUs) based on recursive quadtree block division for orthogonal transformation processing in the orthogonal transformation unit 3 and quantization processing in the quantization unit 4. Is done.

以上の可変サイズのＣＵ、ＰＵ、ＴＵは、後述するコスト関数値が最小となるように選択される。画像情報の絵柄に応じてコスト関数値が最小となるＣＵ、ＰＵ、ＴＵは異なるから、各フレームの画素はサイズが異なるＣＵ、ＰＵ、ＴＵが混在した状態で分割される。 The variable size CU, PU, and TU described above are selected so that the cost function value described later is minimized. Since the CU, PU, and TU that minimize the cost function value are different according to the pattern of the image information, the pixels of each frame are divided in a state where CU, PU, and TU having different sizes are mixed.

減算器２は、ピクチャ分割部１０４より出力された原画像である画像情報のＣＵより、後述するイントラ予測部１４またはインター予測部１５によって生成された予測値（予測画像）を減算して、予測残差を生成する。減算器２は、予測残差を直交変換部３に供給する。 The subtracter 2 subtracts a prediction value (predicted image) generated by the intra prediction unit 14 or the inter prediction unit 15 (to be described later) from the CU of the image information that is the original image output from the picture dividing unit 104 to perform prediction. Generate a residual. The subtracter 2 supplies the prediction residual to the orthogonal transformation unit 3.

直交変換部３は、予測残差をＴＵ単位で直交変換して、予測残差を周波数領域の信号に変換する。直交変換部３は、直交変換として、イントラ予測が選択されてＴＵが水平４画素、垂直４画素の場合のみ離散サイン変換（ＤＳＴ）を用い、他の場合は離散コサイン変換（ＤＣＴ）を用いて、予測残差を直交変換する。直交変換部３は、直交変換係数を量子化部４に供給する。 The orthogonal transform unit 3 performs orthogonal transform on the prediction residual in units of TUs to convert the prediction residual into a frequency domain signal. The orthogonal transform unit 3 uses the discrete sine transform (DST) only when the intra prediction is selected and the TU is 4 pixels horizontal and 4 pixels vertical, and the discrete cosine transform (DCT) is used in other cases. The orthogonal transformation is performed on the prediction residual. The orthogonal transform unit 3 supplies the orthogonal transform coefficient to the quantization unit 4.

量子化部４は、直交変換係数を量子化してエントロピー符号化部５及び逆量子化部８に供給する。エントロピー符号化部５は、量子化された直交変換係数に対して発生確率に基づいて異なる長さの符号を割り当てて、直交変換係数をエントロピー符号化する。 The quantization unit 4 quantizes the orthogonal transform coefficient and supplies it to the entropy coding unit 5 and the inverse quantization unit 8. The entropy encoding unit 5 assigns codes having different lengths to the quantized orthogonal transform coefficients based on the occurrence probabilities, and entropy codes the orthogonal transform coefficients.

エントロピー符号化部５は、画像情報を符号化する際のシンタックスもエントロピー符号化する。シンタックスは、イントラ予測部１４またはインター予測部１５で選択された予測モード、動きベクトル、参照画素を特定するための情報等の各種のシンタックス要素を含む。また、シンタックスは、タイル分割設定部１０１、スキップタイル設定部１０２、参照タイル設定部１０３での設定情報を示すシンタックス要素も含む。 The entropy encoding unit 5 also entropy encodes the syntax for encoding image information. The syntax includes various syntax elements such as information for specifying a prediction mode, a motion vector, and a reference pixel selected by the intra prediction unit 14 or the inter prediction unit 15. The syntax also includes syntax elements indicating setting information in the tile division setting unit 101, the skip tile setting unit 102, and the reference tile setting unit 103.

エントロピー符号化部５は、一例として、コンテクスト適応算術符号（CABAC: Context-based Adaptive Binary Arithmetic Coding）を用いて直交変換係数及びシンタックスをエントロピー符号化することができる。 As an example, the entropy encoding unit 5 can entropy encode orthogonal transform coefficients and syntax using context adaptive arithmetic code (CABAC: Context-based Adaptive Binary Arithmetic Coding).

レート制御部７は、エントロピー符号化部５より出力される符号化データがオーバーフローまたはアンダーフローしないよう、量子化部４における量子化動作のレートを制御する。 The rate control unit 7 controls the rate of the quantization operation in the quantization unit 4 so that the encoded data output from the entropy encoding unit 5 does not overflow or underflow.

ＨＲＤ（Hypothetical Reference Decoder）バッファ６はエントロピー符号化部５より出力される符号化データよりなるビットストリームを一時的に蓄積して出力する。 An HRD (Hypothetical Reference Decoder) buffer 6 temporarily accumulates and outputs a bit stream composed of encoded data output from the entropy encoding unit 5.

逆量子化部８は、量子化された直交変換係数をＴＵ単位で逆量子化して、逆直交変換部９に供給する。逆量子化部８における逆量子化の動作は、量子化部４における量子化の動作とは逆の動作である。 The inverse quantization unit 8 inversely quantizes the quantized orthogonal transform coefficient in units of TU and supplies the quantized orthogonal transform coefficient to the inverse orthogonal transform unit 9. The inverse quantization operation in the inverse quantization unit 8 is an operation opposite to the quantization operation in the quantization unit 4.

逆直交変換部９は、入力された直交変換係数をＴＵ単位で逆直交変換して、予測残差を加算器１０に供給する。加算器１０は、入力された予測残差と、予測値選択部１６により選択されたイントラ予測部１４またはインター予測部１５によって生成された予測値とを加算して、復号信号を生成する。復号信号はループフィルタ１１及びフレームメモリ１２に供給される。 The inverse orthogonal transform unit 9 performs inverse orthogonal transform on the input orthogonal transform coefficient in units of TU, and supplies the prediction residual to the adder 10. The adder 10 adds the input prediction residual and the prediction value generated by the intra prediction unit 14 or the inter prediction unit 15 selected by the prediction value selection unit 16 to generate a decoded signal. The decoded signal is supplied to the loop filter 11 and the frame memory 12.

ループフィルタ１１は、復号信号の符号化ノイズを低減させる。ループフィルタ１１は、ブロックの境界に生じる歪を低減させるデブロッキング・フィルタと、リンギング歪を低減させる画素適応オフセットとを含む。ループフィルタ１１によってフィルタ処理された復号信号はフレームメモリ１２に供給される。 The loop filter 11 reduces coding noise of the decoded signal. The loop filter 11 includes a deblocking filter that reduces distortion generated at a block boundary and a pixel adaptive offset that reduces ringing distortion. The decoded signal filtered by the loop filter 11 is supplied to the frame memory 12.

フレームメモリ１２は、加算器１０より出力されたループフィルタ１１によるフィルタ処理を施していない復号信号と、ループフィルタ１１によるフィルタ処理が施された復号信号とを蓄積する。スイッチ１３は、フレームメモリ１２に蓄積されたフィルタ処理を施していない復号信号をイントラ予測部１４に供給し、フレームメモリ１２に蓄積されたフィルタ処理が施された復号信号をインター予測部１５に供給する。 The frame memory 12 stores the decoded signal output from the adder 10 that has not been subjected to the filter processing by the loop filter 11 and the decoded signal that has been subjected to the filter processing by the loop filter 11. The switch 13 supplies the decoded signal stored in the frame memory 12 that has not been subjected to the filter processing to the intra prediction unit 14 and supplies the decoded signal stored in the frame memory 12 to which the filter processing has been performed to the inter prediction unit 15. To do.

イントラ予測部１４は、Ｙ信号、Ｃｂ及びＣｒ信号のそれぞれで、ＴＵ単位で、複数の予測モードで予測値を生成する。但し、予測モードはＰＵ単位で選択される。イントラ予測部１４は、コスト関数値を算出して、コスト関数値が最小となるＣＵ、ＰＵ、ＴＵのサイズを選択し、かつ、コスト関数値が最小となる予測モードの予測値を選択する。 The intra prediction unit 14 generates predicted values in a plurality of prediction modes in units of TUs for each of the Y signal, the Cb signal, and the Cr signal. However, the prediction mode is selected in units of PUs. The intra prediction unit 14 calculates the cost function value, selects the size of the CU, PU, and TU that minimizes the cost function value, and selects the prediction value of the prediction mode that minimizes the cost function value.

インター予測部１５は、画像の動きを検出し、ＣＵのサイズを上限として、最小で水平８画素、垂直４画素または水平４画素、垂直８画素のＰＵから最大で水平６４画素、垂直６４画素のＰＵで、フレーム間動き補償予測を行って予測値を生成する。インター予測部１５は、過去のフレームもしくは未来のフレーム、または、過去及び未来のフレームを参照して予測値（予測画像）を生成する。過去及び未来のフレームはそれぞれ複数のフレームであってもよい。 The inter prediction unit 15 detects the motion of the image, and sets the maximum size of the CU to a maximum of 64 horizontal pixels and 64 vertical pixels from a PU of 8 horizontal pixels, 4 vertical pixels or 4 horizontal pixels, and 8 vertical pixels. A prediction value is generated by performing inter-frame motion compensation prediction in the PU. The inter prediction unit 15 generates a predicted value (predicted image) with reference to a past frame, a future frame, or a past and future frame. Each of the past and future frames may be a plurality of frames.

同様に、インター予測部１５は、コスト関数値を算出して、コスト関数値が最小となるＣＵ、ＰＵ、ＴＵのサイズを選択し、かつ、コスト関数値が最小となる予測モードの予測値を選択する。 Similarly, the inter prediction unit 15 calculates the cost function value, selects the size of the CU, PU, and TU that minimizes the cost function value, and calculates the prediction value of the prediction mode that minimizes the cost function value. select.

予測値選択部１６は、イントラ予測部１４で選択された予測値とインター予測部１５で選択された予測値とのうち、コスト関数値が小さい方を最終的な予測値として選択して減算器２及び加算器１０に供給する。 The prediction value selection unit 16 selects a subtraction unit having a smaller cost function value as a final prediction value among the prediction value selected by the intra prediction unit 14 and the prediction value selected by the inter prediction unit 15. 2 and the adder 10.

なお、Ｉピクチャを符号化する際にはイントラ予測部１４による予測値のみが用いられる。Ｐピクチャ及びＢピクチャを符号化する際には、イントラ予測部１４で選択された予測値とインター予測部１５で選択された予測値とのうちの小さい方の予測値が選択される。 Note that only the prediction value by the intra prediction unit 14 is used when encoding an I picture. When the P picture and the B picture are encoded, the smaller one of the prediction value selected by the intra prediction unit 14 and the prediction value selected by the inter prediction unit 15 is selected.

減算器２からエントロピー符号化部５までの部分、及び、逆量子化部８から予測値選択部１６までの部分は、画像情報を符号化して符号化データを生成する符号化データ生成部として機能する。 The part from the subtractor 2 to the entropy encoding unit 5 and the part from the inverse quantization unit 8 to the predicted value selection unit 16 function as an encoded data generation unit that encodes image information and generates encoded data. To do.

次に、タイル分割設定部１０１、スキップタイル設定部１０２、参照タイル設定部１０３、ピクチャ分割部１０４の動作を説明する。 Next, operations of the tile division setting unit 101, the skip tile setting unit 102, the reference tile setting unit 103, and the picture division unit 104 will be described.

画像情報は、タイル分割設定部１０１にも入力される。タイル分割設定部１０１は、各フレームを複数のタイルに分割するよう設定する。スキップタイル設定部１０２は、スキップするタイルを設定する。参照タイル設定部１０３は、参照するタイルを設定する。タイル分割設定部１０１〜参照タイル設定部１０３によって設定した各情報は、ピクチャ分割部１０４に供給される。 The image information is also input to the tile division setting unit 101. The tile division setting unit 101 sets each frame to be divided into a plurality of tiles. The skip tile setting unit 102 sets a tile to be skipped. The reference tile setting unit 103 sets a tile to be referred to. Each piece of information set by the tile division setting unit 101 to the reference tile setting unit 103 is supplied to the picture division unit 104.

タイル分割設定部１０１〜参照タイル設定部１０３は、予め設定されて条件でそれぞれの設定を実行してもよいし、前処理装置５０からの指示に応じてそれぞれの設定を実行してもよい。 The tile division setting unit 101 to the reference tile setting unit 103 may perform each setting under a preset condition, or may perform each setting according to an instruction from the preprocessing device 50.

ピクチャ分割部１０４に供給される画像情報は、１または複数のスライスに分割される。図７に示すように、複数のＣＴＵよりなるフレームは、少なくとも１つのスライスを含む。図７において、太実線はスライスの境界を示しており、フレームは例えば３つのスライスＳＬ０〜ＳＬ２に分割される。フレームをどのように複数のスライスに分割するかは、前処理装置５０によって設定される。スライスＳＬ０〜ＳＬ２はそれぞれ連続する少なくとも１つのＣＴＵを含む。 The image information supplied to the picture dividing unit 104 is divided into one or a plurality of slices. As shown in FIG. 7, a frame made up of a plurality of CTUs includes at least one slice. In FIG. 7, a thick solid line indicates a slice boundary, and the frame is divided into, for example, three slices SL0 to SL2. How the frame is divided into a plurality of slices is set by the preprocessing device 50. Each of the slices SL0 to SL2 includes at least one continuous CTU.

画像符号化装置１００はスライス単位で画像情報を符号化し、画像復号装置２００はスライス単位で画像情報を復号する。図７において、実線で示す矢印は符号化及び復号の順を示している。 The image encoding device 100 encodes image information in units of slices, and the image decoding device 200 decodes image information in units of slices. In FIG. 7, arrows indicated by solid lines indicate the order of encoding and decoding.

また、複数のＣＴＵよりなるフレームは、図８に示すように、タイル分割設定部１０１による設定によって、複数のタイルで分割することができる。図８において、太実線はタイルの境界を示しており、フレームは６つのタイルＴＬ０〜ＴＬ８に分割される。後述するようにスライスとタイルとの互いの関係に制約はあるものの、ここでは図７に示すスライスの分割とは無関係にタイルの分割を示している。 Also, a frame composed of a plurality of CTUs can be divided into a plurality of tiles by setting by the tile division setting unit 101 as shown in FIG. In FIG. 8, thick solid lines indicate tile boundaries, and the frame is divided into six tiles TL0 to TL8. As will be described later, although there is a restriction on the relationship between the slice and the tile, here, the division of the tile is shown regardless of the division of the slice shown in FIG.

画像符号化装置１００はタイル単位で画像情報を符号化し、画像復号装置２００はタイル単位で画像情報を復号する。図８において、実線で示す矢印は符号化及び復号の順を示している。タイルは、ＶＲ画像の符号化の他、並列処理またはＲＯＩ（Region Of Interest）符号化のために用いることができる。 The image encoding device 100 encodes image information in tile units, and the image decoding device 200 decodes image information in tile units. In FIG. 8, arrows indicated by solid lines indicate the order of encoding and decoding. The tile can be used for parallel processing or ROI (Region Of Interest) coding in addition to coding of a VR image.

図９（ａ）は、タイルＴＬ０の中にスライスＳＬ０及びＳＬ１が設定され、タイルＴＬ１の中にスライスＳＬ２及びＳＬ３が設定されている状態を示している。タイルはスライスのスーパーセットとすることができる。図９（ｂ）は、スライスＳＬ０の中にタイルＴＬ０及びＴＬ１が設定されている状態を示している。スライスはタイルのスーパーセットとすることができる。図９（ａ）及び図９（ｂ）に示すように、スライスとタイルとの互いの関係を設定することができる。 FIG. 9A shows a state in which slices SL0 and SL1 are set in the tile TL0, and slices SL2 and SL3 are set in the tile TL1. A tile can be a superset of slices. FIG. 9B shows a state where tiles TL0 and TL1 are set in the slice SL0. A slice can be a superset of tiles. As shown in FIGS. 9A and 9B, the relationship between slices and tiles can be set.

図９（ｃ）におけるスライスＳＬ０〜ＳＬ２とタイルＴＬ０及びＴＬ１とは、スライスとタイルとのうちの一方がスーパーセットで他方がサブセットの関係にないので、設定不可である。 The slices SL0 to SL2 and the tiles TL0 and TL1 in FIG. 9C cannot be set because one of the slices and tiles is not a superset and the other is not a subset.

図６において、タイル分割設定部１０１は、具体的に次のように各フレームに対してタイルの分割を設定する。タイル分割設定部１０１は、１つのフレームに含まれるタイルの個数を設定する。タイルの個数を４とし、図１０に示すように４つのタイルを設定する場合を例とする。 In FIG. 6, the tile division setting unit 101 sets tile division for each frame specifically as follows. The tile division setting unit 101 sets the number of tiles included in one frame. Assume that the number of tiles is 4, and four tiles are set as shown in FIG.

図１０において、タイル番号０〜３のタイルＴＬ０〜ＴＬ３は、図１１に示すように定義することができる。ハッチングを付したタイルＴＬ１及びＴＬ３は、図２における非割り当て領域に相当する。ハッチングを付していないタイルＴＬ０及びＴＬ２は、画像が割り当てられた割り当て領域である。 In FIG. 10, tiles TL0 to TL3 with tile numbers 0 to 3 can be defined as shown in FIG. The hatched tiles TL1 and TL3 correspond to the non-allocation areas in FIG. The tiles TL0 and TL2 that are not hatched are assigned areas to which images are assigned.

図１１に示すように、タイル番号０のタイルＴＬ０は、黒丸で示す左上角部の座標を水平位置Ｘ０、垂直位置Ｙ０とし、水平サイズｈ０、垂直サイズｖ０と定義される。タイルＴＬ０におけるスキップタイルフラグは０であり、スキップされないタイルであることを示す。 As shown in FIG. 11, the tile TL0 with tile number 0 is defined as a horizontal size h0 and a vertical size v0 with the coordinates of the upper left corner indicated by a black circle as the horizontal position X0 and the vertical position Y0. The skip tile flag in the tile TL0 is 0, indicating that the tile is not skipped.

タイル番号１のタイルＴＬ１は、黒丸で示す左上角部の座標を水平位置Ｘ１、垂直位置Ｙ１とし、水平サイズｈ１、垂直サイズｖ１と定義される。タイルＴＬ１におけるスキップタイルフラグは１であり、スキップされるタイルであることを示す。 The tile TL1 with the tile number 1 is defined as a horizontal size h1 and a vertical size v1 with the coordinates of the upper left corner indicated by a black circle as the horizontal position X1 and the vertical position Y1. The skip tile flag in the tile TL1 is 1, indicating that the tile is skipped.

タイル番号２のタイルＴＬ２は、黒丸で示す左上角部の座標を水平位置Ｘ２、垂直位置Ｙ２とし、水平サイズｈ１、垂直サイズｖ２と定義される。タイルＴＬ１におけるスキップタイルフラグは０である。タイル番号３のタイルＴＬ３は、黒丸で示す左上角部の座標を水平位置Ｘ３、垂直位置Ｙ３とし、水平サイズｈ１、垂直サイズｖ３と定義される。タイルＴＬ３におけるスキップタイルフラグは１である。 The tile TL2 of tile number 2 is defined as a horizontal size h1 and a vertical size v2 with the coordinates of the upper left corner indicated by a black circle as the horizontal position X2 and the vertical position Y2. The skip tile flag in the tile TL1 is 0. The tile TL3 with the tile number 3 is defined as a horizontal size h1 and a vertical size v3 with the coordinates of the upper left corner indicated by a black circle as the horizontal position X3 and the vertical position Y3. The skip tile flag in the tile TL3 is 1.

タイルＴＬ０〜ＴＬ３の水平位置及び垂直位置はＣＴＵ単位の座標である。タイルＴＬ０〜ＴＬ３の水平サイズ及び垂直サイズはＣＴＵ単位の大きさである。例えばＣＴＵの大きさが水平６４画素、垂直６４画素であり、水平サイズが１２８画素、垂直サイズが１９２画素であるとすると、伝送される水平サイズを示すシンタックス要素は２、垂直サイズを示すシンタックス要素は３となる。本実施形態においては、１つのフレームに含まれるタイルの個数を設定し、それぞれのタイルの左上角部の座標と、水平サイズ及び垂直サイズとを設定することにより、従来のHEVCでは設定できなかったタイルを設定することができる。 The horizontal and vertical positions of the tiles TL0 to TL3 are coordinates in CTU units. The horizontal size and vertical size of the tiles TL0 to TL3 are CTU units. For example, if the size of the CTU is 64 pixels horizontal and 64 pixels vertical, the horizontal size is 128 pixels, and the vertical size is 192 pixels, the syntax element indicating the transmitted horizontal size is 2, and the syntax element indicating the vertical size is 2. The tax element is 3. In this embodiment, the number of tiles included in one frame is set, and the coordinates of the upper left corner of each tile, the horizontal size, and the vertical size are set, which cannot be set in the conventional HEVC. Tiles can be set.

スキップタイル設定部１０２は、従来のHEVCでは設定できなかったスキップするタイルを設定することができる。図１０において、タイルＴＬ１及びＴＬ３をスキップするタイルと設定すれば、タイルＴＬ１及びＴＬ３を符号化する必要がなくなり、符号化効率を向上させることができる。 The skip tile setting unit 102 can set tiles to be skipped that cannot be set by the conventional HEVC. In FIG. 10, if the tiles TL1 and TL3 are set to be skipped tiles, it is not necessary to encode the tiles TL1 and TL3, and the encoding efficiency can be improved.

図１２に示すタイルＴＬ４とタイルＴＬ７とは、フレーム内では離れているものの、図１に示す六面体４０２では隣り合う面（４０２Ｆ及び４０２Ａ）に投影されている画像である。従って、タイルＴＬ４とタイルＴＬ７とは相関が高い。そこで、参照タイル設定部１０３は、タイルごとに、参照する１または複数のタイルを設定することができる。参照タイル設定部１０３が生成する参照するタイルの設定情報は、符号化するあるタイルと参照するタイルとの予測依存関係を示す情報である。 A tile TL4 and a tile TL7 illustrated in FIG. 12 are images projected on adjacent surfaces (402F and 402A) in the hexahedron 402 illustrated in FIG. 1 although they are separated in the frame. Therefore, the tile TL4 and the tile TL7 have a high correlation. Therefore, the reference tile setting unit 103 can set one or a plurality of tiles to be referenced for each tile. The reference tile setting information generated by the reference tile setting unit 103 is information indicating a prediction dependency relationship between a certain tile to be encoded and a reference tile.

図６に戻り、ピクチャ分割部１０４は、図１１に示すように設定されたタイル分割の設定情報に基づき、各フレームを分割する。ピクチャ分割部１０４は、画像情報のフォーマットに応じて、割り当て領域を１または複数のタイル（第１のタイル）に分割し、非割り当て領域を１または複数のタイル（第２のタイル）に分割すればよい。図２に示すフォーマットであれば、割り当て領域と非割り当て領域とをそれぞれ複数のタイルに分割することが必要である。 Returning to FIG. 6, the picture dividing unit 104 divides each frame based on the tile division setting information set as shown in FIG. The picture dividing unit 104 divides the allocated area into one or more tiles (first tile) and divides the non-allocated area into one or more tiles (second tile) according to the format of the image information. That's fine. In the format shown in FIG. 2, it is necessary to divide the allocation area and the non-allocation area into a plurality of tiles.

ピクチャ分割部１０４は、タイル分割とスキップするタイルと参照するタイルの設定情報とをエントロピー符号化部５に供給する。エントロピー符号化部５は、これらの設定情報を示すシンタックス要素を含むシンタックスを生成してエントロピー符号化する。エントロピー符号化部５は、シンタックス生成部及びビットストリーム生成部として機能する。 The picture dividing unit 104 supplies the tile division, the tile to be skipped, and the setting information of the tile to be referenced to the entropy encoding unit 5. The entropy encoding unit 5 generates a syntax including syntax elements indicating the setting information and performs entropy encoding. The entropy encoding unit 5 functions as a syntax generation unit and a bit stream generation unit.

図１３は、エントロピー符号化部５によって生成されて出力される、タイルに関するシンタックス要素を含むシンタックスの一例を示している。タイルに関するシンタックス要素は、ビットストリームのピクチャパラメータセット（Picture Parameter Set）に含まれて伝送されればよい。 FIG. 13 shows an example of a syntax including syntax elements related to tiles generated and output by the entropy encoding unit 5. The syntax element related to the tile may be transmitted by being included in the picture parameter set (Picture Parameter Set) of the bit stream.

タイルイネーブルフラグ（tiles_enabled_flag）が１のとき、シンタックスに、水平位置（hpos）、垂直位置（vpos）、水平サイズ（hsize）、垂直サイズ（vsize）、スキップタイルフラグ（skip_tile_flag）、参照タイル数（num_ref_tiles）、参照タイル番号（ref_tile_no）が設定される。 When the tile enable flag (tiles_enabled_flag) is 1, the syntax includes horizontal position (hpos), vertical position (vpos), horizontal size (hsize), vertical size (vsize), skip tile flag (skip_tile_flag), number of reference tiles ( num_ref_tiles) and reference tile number (ref_tile_no) are set.

スキップタイルフラグ０はスキップしないタイルであることを示し、スキップタイルフラグ１はスキップするタイルであることを示す。図１３に示すように、参照タイル数は０であってもよい。なお、図１３におけるloop_filter_across_tiles_enabled_flagは、タイル間にループフィルタ１１によるフィルタリングを施すか否かを設定するフラグである。 The skip tile flag 0 indicates that the tile is not skipped, and the skip tile flag 1 indicates that the tile is skipped. As shown in FIG. 13, the number of reference tiles may be zero. Note that loop_filter_across_tiles_enabled_flag in FIG. 13 is a flag for setting whether to perform filtering by the loop filter 11 between tiles.

以上のようにして、画像符号化装置１００は、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットのＶＲ画像の符号化効率を向上させ、符号化の処理量を低減させることができる。 As described above, the image encoding device 100 improves the encoding efficiency of a VR image in a format including an allocation area to which an image is allocated in a frame and a non-allocation area to which no image is allocated. The amount of processing can be reduced.

＜画像符号化方法＞
図１４に示すフローチャートを用いて、画像符号化装置１００で実行される本実施形態の画像符号化方法による処理を説明する。図１４は、タイル分割設定部１０１、スキップタイル設定部１０２、参照タイル設定部１０３で実行される処理を示している。 <Image coding method>
The process by the image coding method of this embodiment performed with the image coding apparatus 100 is demonstrated using the flowchart shown in FIG. FIG. 14 shows processing executed by the tile division setting unit 101, the skip tile setting unit 102, and the reference tile setting unit 103.

図１４において、タイル分割設定部１０１は、ステップS101にて、タイル数を設定し、ステップS102にて、まず、タイル番号を０に設定する。タイル分割設定部１０１は、ステップS103にて、水平位置及び垂直位置を設定し、ステップS104にて、水平サイズ及び垂直サイズを設定する。 In FIG. 14, the tile division setting unit 101 sets the number of tiles in step S101, and first sets the tile number to 0 in step S102. The tile division setting unit 101 sets a horizontal position and a vertical position in step S103, and sets a horizontal size and a vertical size in step S104.

スキップタイル設定部１０２は、ステップS105にて、各タイルに対してスキップタイルフラグを設定する。参照タイル設定部１０３は、ステップS106にて、スキップタイルフラグが１であるか否かを判定する。スキップタイルフラグが１でなければ（NO）、参照タイル設定部１０３は、ステップS107にて、参照タイル数を設定する。参照タイル数は０以上の数であり、０は他のタイルを参照しないことを示し、１以上の数はその数のタイルを参照することを示す。 In step S105, the skip tile setting unit 102 sets a skip tile flag for each tile. In step S106, the reference tile setting unit 103 determines whether or not the skip tile flag is 1. If the skip tile flag is not 1 (NO), the reference tile setting unit 103 sets the reference tile number in step S107. The number of reference tiles is a number of 0 or more, and 0 indicates that other tiles are not referred to, and a number of 1 or more indicates that the number of tiles is referred to.

参照タイル設定部１０３は、ステップS108にて、参照タイル数が０以外であるか否かを判定する。参照タイル数が０以外であれば（YES）、参照タイル設定部１０３は、ステップS109にて、参照タイル番号を設定する。その後、処理はステップS110に移行される。 In step S108, the reference tile setting unit 103 determines whether the reference tile number is other than zero. If the reference tile number is other than 0 (YES), the reference tile setting unit 103 sets a reference tile number in step S109. Thereafter, the process proceeds to step S110.

ステップS106にてスキップタイルフラグが１であれば（YES）、また、ステップS108にて参照タイル数が０以外でなければ（NO）（即ち、参照タイル数が０であれば）、処理はステップS110に移行される。 If the skip tile flag is 1 in step S106 (YES), and if the number of reference tiles is not 0 in step S108 (NO) (that is, if the number of reference tiles is 0), the process proceeds to step S106. Moved to S110.

タイル分割設定部１０１は、ステップS110にて、タイル番号がタイル数−１となったか否かを判定する。タイル番号がタイル数−１でなければ（NO）、まだタイルが残っているので、タイル分割設定部１０１は、ステップS111にて、タイル番号を１増加させ、処理をステップS103に戻す。 In step S110, the tile division setting unit 101 determines whether the tile number is equal to the number of tiles -1. If the tile number is not -1 (NO), since tiles still remain, the tile division setting unit 101 increments the tile number by 1 in step S111 and returns the process to step S103.

タイル分割設定部１０１、スキップタイル設定部１０２、参照タイル設定部１０３は、次のタイル番号のタイルでもステップS103〜S110の処理を繰り返す。ステップS110にてタイル番号がタイル数−１であれば（YES）、全てのタイルに対する設定が終了したので、タイル分割設定部１０１、スキップタイル設定部１０２、参照タイル設定部１０３は処理を終了させる。 The tile division setting unit 101, the skip tile setting unit 102, and the reference tile setting unit 103 repeat the processes of steps S103 to S110 for the tile with the next tile number. If the tile number is -1 in step S110 (YES), the setting for all tiles has been completed, so the tile division setting unit 101, the skip tile setting unit 102, and the reference tile setting unit 103 end the processing. .

＜画像符号化プログラム＞
図６に示す画像符号化装置１００による動作をコンピュータプログラム（画像符号化プログラム）によってコンピュータに実行させることができる。図１４に示す各処理を画像符号化プログラムによってコンピュータに実行させることができる。 <Image coding program>
The operation of the image encoding device 100 shown in FIG. 6 can be executed by a computer by a computer program (image encoding program). Each process shown in FIG. 14 can be executed by a computer using an image encoding program.

図１５Ａにおいて、コンピュータ３００は、中央処理装置（ＣＰＵ）３０１及び記憶部３０２を有する。コンピュータ３００には、操作部３１０が接続されている。記憶部３０２には、画像符号化プログラムが記憶されている。操作部３１０を図５に示す前処理装置５０として機能させることができる。ＣＰＵ３０１が画像符号化プログラムを実行させることによって、コンピュータ３００を、入力された画像情報を符号化する画像符号化装置１００として機能させることができる。 In FIG. 15A, the computer 300 includes a central processing unit (CPU) 301 and a storage unit 302. An operation unit 310 is connected to the computer 300. The storage unit 302 stores an image encoding program. The operation unit 310 can function as the preprocessing device 50 shown in FIG. By causing the CPU 301 to execute the image encoding program, the computer 300 can function as the image encoding device 100 that encodes the input image information.

記憶部３０２は、半導体メモリ、ハードディスクドライブ、光ディスク等の任意の非一時的な記憶媒体である。画像符号化プログラムは、インターネット等の通信回線を介してコンピュータ３００に提供されてもよい。 The storage unit 302 is an arbitrary non-temporary storage medium such as a semiconductor memory, a hard disk drive, or an optical disk. The image encoding program may be provided to the computer 300 via a communication line such as the Internet.

以上説明した画像符号化装置１００、画像符号化方法、画像符号化プログラムは、１つのフレーム内で、タイルはオーバラップせず、タイルが設定されていない領域が存在しないように画像情報を符号化するものとする。 The image encoding device 100, the image encoding method, and the image encoding program described above encode image information so that tiles do not overlap and there is no area in which no tile is set in one frame. It shall be.

＜画像復号装置＞
図１６は、画像復号装置２００の具体的な構成例を示す。図１６において、ＨＲＤバッファ２１はビットストリームを一時的に蓄積して、エントロピー復号部２２に供給する。ＨＲＤバッファ２１は、ビットストリームを受信するビットストリーム受信部として機能する。 <Image decoding device>
FIG. 16 shows a specific configuration example of the image decoding device 200. In FIG. 16, the HRD buffer 21 temporarily accumulates the bit stream and supplies it to the entropy decoding unit 22. The HRD buffer 21 functions as a bit stream receiving unit that receives a bit stream.

ビットストリームは、符号化データとシンタックスとを含む。符号化データは、前述のように、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットの画像情報を符号化した符号化データである。 The bit stream includes encoded data and syntax. As described above, the encoded data is encoded data obtained by encoding image information in a format including an allocated area in which an image is allocated in a frame and an unallocated area in which no image is allocated.

また、符号化データは、各フレームが、割り当て領域に設定された１または複数のタイルと、非割り当て領域に設定された１または複数のタイルとに分割され、割り当て領域に設定されたタイルの画像が符号化された符号化データである。 Also, the encoded data is divided into one or a plurality of tiles set in the allocation area and one or a plurality of tiles set in the non-allocation area, and an image of the tile set in the allocation area. Is the encoded data.

シンタックスは、それぞれのタイルを、フレーム内での左上角部の水平位置及び垂直位置と水平サイズ及び垂直サイズとで定義したシンタックス要素を含む。シンタックスは、非割り当て領域に設定されたタイルの画像を符号化しないことを示すスキップタイルフラグであるシンタックス要素を含むことが好ましい。シンタックスは、予測依存関係を示すシンタックス要素を含むことが好ましい。 The syntax includes syntax elements that define each tile with a horizontal position and vertical position of the upper left corner in the frame, and a horizontal size and vertical size. It is preferable that the syntax includes a syntax element that is a skip tile flag indicating that the image of the tile set in the non-allocation area is not encoded. The syntax preferably includes a syntax element indicating a prediction dependency.

エントロピー復号部２２は、ビットストリームに含まれる直交変換係数及びシンタックスをエントロピー復号する。 The entropy decoding unit 22 entropy decodes the orthogonal transform coefficients and syntax included in the bitstream.

直交変換係数は逆量子化部２３に供給される。イントラ予測とインター予測とのいずれが採用されたかを示す情報は、スイッチ３２に供給される。イントラ予測に関する情報は、イントラ予測部３０に供給される。インター予測に関する情報は、インター予測部３１に供給される。 The orthogonal transform coefficient is supplied to the inverse quantization unit 23. Information indicating whether intra prediction or inter prediction is adopted is supplied to the switch 32. Information regarding intra prediction is supplied to the intra prediction unit 30. Information regarding inter prediction is supplied to the inter prediction unit 31.

タイル分割に関する情報は、タイル分割復号部２０１に供給される。スキップタイルに関する情報は、タイル分割復号部２０１を介してスキップタイル設定部２０２に供給される。参照タイルに関する情報は、インター予測部３１と、タイル分割復号部２０１及びスキップタイル設定部２０２を介して参照タイル設定部２０３に供給される。 Information relating to tile division is supplied to the tile division decoding unit 201. Information regarding the skip tile is supplied to the skip tile setting unit 202 via the tile division decoding unit 201. Information on the reference tile is supplied to the reference tile setting unit 203 via the inter prediction unit 31, the tile division decoding unit 201, and the skip tile setting unit 202.

逆量子化部２３は、直交変換係数をＴＵ単位で逆量子化して、逆直交変換部２４に供給する。逆直交変換部２４は、逆量子化された直交変換係数をＴＵ単位で逆直交変換して、予測残差を加算器２５に供給する。加算器２５は、入力された予測残差と、スイッチ３２より供給されるイントラ予測部３０またはインター予測部３１によって生成された予測値とを加算して、復号信号を生成する。復号信号はループフィルタ２６及びフレームメモリ２８に供給される。 The inverse quantization unit 23 inversely quantizes the orthogonal transform coefficient in units of TU and supplies the inverse transform coefficient to the inverse orthogonal transform unit 24. The inverse orthogonal transform unit 24 performs inverse orthogonal transform on the inversely quantized orthogonal transform coefficient in units of TU, and supplies the prediction residual to the adder 25. The adder 25 adds the input prediction residual and the prediction value generated by the intra prediction unit 30 or the inter prediction unit 31 supplied from the switch 32 to generate a decoded signal. The decoded signal is supplied to the loop filter 26 and the frame memory 28.

ループフィルタ２６はループフィルタ１１と同様の構成であり、復号信号の符号化ノイズを低減させる。並べ替え（Re-order）バッファ２７は、ループフィルタ２６から供給される画素を複数フレーム分蓄積する。並べ替えバッファ２７は、フレームの順が並び替えられていれば、フレームを原画像の画像情報の順に並び替える。並べ替えバッファ２７より出力された各フレームのタイルを構成する画素は、ピクチャ合成部２０４に供給される。 The loop filter 26 has the same configuration as the loop filter 11 and reduces the encoding noise of the decoded signal. The re-order buffer 27 accumulates pixels supplied from the loop filter 26 for a plurality of frames. The rearrangement buffer 27 rearranges the frames in the order of the image information of the original image if the order of the frames is rearranged. The pixels constituting the tile of each frame output from the rearrangement buffer 27 are supplied to the picture composition unit 204.

フレームメモリ２８は、加算器２５より出力されたループフィルタ２６によるフィルタ処理を施していない復号信号と、ループフィルタ２６によるフィルタ処理が施された復号信号とを蓄積する。スイッチ２９は、フレームメモリ２８に蓄積されたフィルタ処理を施していない復号信号をイントラ予測部３０に供給し、フレームメモリ２８に蓄積されたフィルタ処理が施された復号信号をインター予測部３１に供給する。 The frame memory 28 stores the decoded signal output from the adder 25 that has not been subjected to the filter processing by the loop filter 26 and the decoded signal that has been subjected to the filter processing by the loop filter 26. The switch 29 supplies the decoded signal that has not been subjected to the filter processing accumulated in the frame memory 28 to the intra prediction unit 30, and supplies the decoded signal that has been subjected to the filter processing accumulated in the frame memory 28 to the inter prediction unit 31. To do.

イントラ予測部３０は、イントラ予測の予測モードを示す情報に従ってフレーム内予測を行って、Ｙ信号とＣｂ及びＣｒ信号のそれぞれの予測値を生成する。インター予測部３１は、インター予測に関する情報に従ってフレーム間予測を行って、Ｙ信号とＣｂ及びＣｒ信号のそれぞれの予測値を生成する。 The intra prediction unit 30 performs intra-frame prediction according to information indicating a prediction mode of intra prediction, and generates respective prediction values of the Y signal, the Cb signal, and the Cr signal. The inter prediction unit 31 performs inter-frame prediction in accordance with information related to inter prediction, and generates respective prediction values for the Y signal and the Cb and Cr signals.

インター予測部３１は、参照タイルに関する情報が入力されるとき、図１２に示すように、他のタイル（ここではＴＬ７）の画像を参照して、あるタイル（ここではＴＬ４）の画像の予測値を生成する。 When the information regarding the reference tile is input, the inter prediction unit 31 refers to an image of another tile (here, TL7) and predicts the image of a certain tile (here, TL4) as illustrated in FIG. Is generated.

スイッチ３２は、イントラ予測とインター予測とのいずれが採用されたかを示す情報に従って、イントラ予測部３０またはインター予測部３１によって生成された予測値を加算器２５に供給する。 The switch 32 supplies the adder 25 with the prediction value generated by the intra prediction unit 30 or the inter prediction unit 31 according to information indicating which of intra prediction and inter prediction is adopted.

エントロピー復号部２２からスイッチ３２までの部分は、符号化データを復号して、割り当て領域に設定されたタイルの復号画像を生成する復号部として機能する。 The portion from the entropy decoding unit 22 to the switch 32 functions as a decoding unit that decodes the encoded data and generates a decoded image of the tile set in the allocation area.

タイル分割復号部２０１は、図１１に示すように設定されたタイル分割の設定情報を復号して、スキップタイル設定部２０２に供給する。スキップタイル設定部２０２は、スキップタイルフラグに基づいてスキップするタイルを設定する。参照タイル設定部２０３は、参照するタイルを設定する。タイル分割の設定情報、スキップするタイルの情報、参照するタイルの情報は、ピクチャ合成部２０４に供給される。 The tile division decoding unit 201 decodes the tile division setting information set as illustrated in FIG. 11 and supplies the decoded information to the skip tile setting unit 202. The skip tile setting unit 202 sets a tile to be skipped based on the skip tile flag. The reference tile setting unit 203 sets a tile to be referred to. The tile division setting information, the tile information to be skipped, and the tile information to be referred to are supplied to the picture composition unit 204.

ピクチャ合成部２０４は、タイル分割の設定情報に基づいて、復号されたタイルの画像をフレーム内の割り当て領域に配置する。ピクチャ合成部２０４は、スキップするタイルの情報に基づいて、非割り当て領域に例えば黒の単一画像を配置する。ピクチャ合成部２０４はこのように複数のタイルをピクチャ合成して、図２に示すようなフォーマットの復号画像情報の各フレームの復号画像情報を生成する。復号画像情報は、必要に応じてＤ／Ａ変換器によってアナログ信号に変換される。 The picture composition unit 204 arranges the decoded tile image in the allocation area in the frame based on the tile division setting information. The picture composition unit 204 arranges, for example, a single black image in the non-allocation area based on the information on the tile to be skipped. The picture composition unit 204 synthesizes a plurality of tiles in this way, and generates decoded image information of each frame of the decoded image information in the format shown in FIG. The decoded image information is converted into an analog signal by a D / A converter as necessary.

以上のようにして、画像復号装置２００は、フレーム内に画像が割り当てられている割り当て領域と画像が割り当てられていない非割り当て領域とを含むフォーマットのＶＲ画像を復号する処理量を低減させることができる。 As described above, the image decoding apparatus 200 can reduce the amount of processing for decoding a VR image having a format including an allocation area to which an image is allocated in a frame and a non-allocation area to which no image is allocated. it can.

＜画像復号方法＞
図１７に示すフローチャートを用いて、画像復号装置２００で実行される本実施形態の画像復号方法による処理を説明する。図１７は、タイル分割復号部２０１、スキップタイル設定部２０２、参照タイル設定部２０３で実行される処理を示している。 <Image decoding method>
The process by the image decoding method of this embodiment performed with the image decoding apparatus 200 is demonstrated using the flowchart shown in FIG. FIG. 17 illustrates processing executed by the tile division decoding unit 201, the skip tile setting unit 202, and the reference tile setting unit 203.

図１７において、タイル分割復号部２０１は、ステップS201にて、タイル数を受信し、ステップS202にて、まず、タイル番号を０に設定する。タイル分割復号部２０１は、ステップS203にて、水平位置及び垂直位置を受信し、ステップS204にて、水平サイズ及び垂直サイズを受信する。 In FIG. 17, the tile division decoding unit 201 receives the number of tiles in step S201, and first sets the tile number to 0 in step S202. The tile division decoding unit 201 receives the horizontal position and the vertical position in step S203, and receives the horizontal size and the vertical size in step S204.

スキップタイル設定部２０２は、ステップS205にて、各タイルのスキップタイルフラグを受信する。参照タイル設定部２０３は、ステップS206にて、スキップタイルフラグが１であるか否かを判定する。スキップタイルフラグが１でなければ（NO）、参照タイル設定部２０３は、ステップS207にて、参照タイル数を受信する。 In step S205, the skip tile setting unit 202 receives the skip tile flag of each tile. In step S206, the reference tile setting unit 203 determines whether or not the skip tile flag is 1. If the skip tile flag is not 1 (NO), the reference tile setting unit 203 receives the reference tile number in step S207.

参照タイル設定部２０３は、ステップS208にて、参照タイル数が０以外であるか否かを判定する。参照タイル数が０以外であれば（YES）、参照タイル設定部２０３は、ステップS209にて、参照タイル番号を受信する。その後、処理はステップS210に移行される。 In step S208, the reference tile setting unit 203 determines whether the reference tile number is other than zero. If the number of reference tiles is other than 0 (YES), the reference tile setting unit 203 receives the reference tile number in step S209. Thereafter, the process proceeds to step S210.

ステップS206にてスキップタイルフラグが１であれば（YES）、また、ステップS208にて参照タイル数が０以外でなければ（NO）（即ち、参照タイル数が０であれば）、処理はステップS210に移行される。 If the skip tile flag is 1 in step S206 (YES), and if the reference tile number is not 0 in step S208 (NO) (that is, if the reference tile number is 0), the process proceeds to step S206. Moved to S210.

タイル分割復号部２０１は、ステップS210にて、タイル番号がタイル数−１となったか否かを判定する。タイル番号がタイル数−１でなければ（NO）、まだタイルが残っているので、タイル分割復号部２０１は、ステップS211にて、タイル番号を１増加させ、処理をステップS203に戻す。 In step S210, the tile division decoding unit 201 determines whether or not the tile number is equal to the number of tiles−1. If the tile number is not the number of tiles -1 (NO), since tiles still remain, the tile division decoding unit 201 increments the tile number by 1 in step S211 and returns the process to step S203.

タイル分割復号部２０１、スキップタイル設定部２０２、参照タイル設定部２０３は、次のタイル番号のタイルでもステップS203〜S210の処理を繰り返す。ステップS210にてタイル番号がタイル数−１であれば（YES）、全てのタイルの情報を受信したので、タイル分割復号部２０１、スキップタイル設定部２０２、参照タイル設定部２０３は処理を終了させる。 The tile division decoding unit 201, the skip tile setting unit 202, and the reference tile setting unit 203 repeat the processes in steps S203 to S210 for the tile with the next tile number. If the tile number is the number of tiles −1 in step S210 (YES), the tile division decoding unit 201, the skip tile setting unit 202, and the reference tile setting unit 203 end the processing because information on all tiles has been received. .

＜画像復号プログラム＞
図１６に示す画像復号装置における動作をコンピュータプログラム（画像復号プログラム）によってコンピュータに実行させることができる。図１７に示す各処理を画像復号プログラムによってコンピュータに実行させることができる。 <Image decoding program>
The operation of the image decoding apparatus shown in FIG. 16 can be executed by a computer using a computer program (image decoding program). Each process shown in FIG. 17 can be executed by a computer using an image decoding program.

図１５Ｂに示すように、コンピュータは図１５Ａと同様の構成であり、図１５Ａと共通部分の説明を省略する。図１５Ｂにおいて、記憶部３０２には、画像符号化プログラムに代えて画像復号プログラムが記憶されている。ＣＰＵ３０１が画像復号プログラムを実行させることによって、コンピュータ３００を、符号化されているビットストリームを復号する画像復号装置２００として機能させることができる。 As shown in FIG. 15B, the computer has the same configuration as that in FIG. 15A, and the description of the common parts with FIG. In FIG. 15B, the storage unit 302 stores an image decoding program instead of the image encoding program. By causing the CPU 301 to execute the image decoding program, the computer 300 can function as the image decoding device 200 that decodes the encoded bit stream.

記憶部３０２に画像符号化プログラム及び画像復号プログラムを記憶させて、コンピュータ３００を、画像符号化装置１００及び画像復号装置２００として機能させることも可能である。 It is also possible to store the image encoding program and the image decoding program in the storage unit 302 so that the computer 300 functions as the image encoding device 100 and the image decoding device 200.

＜第１の変形例＞
以上説明した本実施形態の画像符号化装置、画像符号化方法、及び画像符号化プログラム、並びに、画像復号装置、画像復号方法、及び画像復号プログラムにおいて、次のような第１の変形例とすることができる。 <First Modification>
In the image encoding device, the image encoding method, and the image encoding program, and the image decoding device, the image decoding method, and the image decoding program of the present embodiment described above, the following first modified example is used. be able to.

図１８に示すように、タイルを符号化して伝送する順番は、左上のタイルを最初としなくてもよい。図１８において、画像符号化装置１００はタイルＴＬ０〜ＴＬ５の順で符号化して符号化データを含むビットストリームを伝送する。画像復号装置２００は、タイルＴＬ０〜ＴＬ５の順で復号する。ヘッドマウントディスプレイを装着するユーザが最初に視認する視野に対応するタイルを最初に符号化し復号するタイルＴＬ０とする。 As shown in FIG. 18, the order of encoding and transmitting the tiles does not have to be the top left tile first. In FIG. 18, the image encoding device 100 encodes the tiles TL <b> 0 to TL <b> 5 in this order and transmits a bit stream including encoded data. The image decoding apparatus 200 decodes in the order of the tiles TL0 to TL5. A tile TL0 that first encodes and decodes a tile corresponding to the field of view that the user wearing the head-mounted display first visually recognizes.

スキップタイルフラグを伝送しなくてもよい。この場合、上記の説明では、１つのフレーム内でタイルが設定されていない領域が存在しないように画像情報を符号化するとしたが、タイルが設定されていない領域が存在してもよい。図１８において、ハッチングを付した非割り当て領域にタイルを設定しなくてもよい。 The skip tile flag need not be transmitted. In this case, in the above description, the image information is encoded so that there is no region in which no tile is set in one frame, but there may be a region in which no tile is set. In FIG. 18, it is not necessary to set a tile in a non-allocated area with hatching.

＜第２の変形例＞
図９（ｂ）で説明したように、スライスはタイルのスーパーセットとすることができるとしたが、スライスがタイルのスーパーセットであることを禁じてもよい。スライスがタイルのスーパーセットであることを禁じないと、図１９に示すように、互いに離れた位置にあるタイルＴＬ０及びＴＬ１のスーパーセットであるスライスＳＬ０が設定されることがある。互いに離れた位置にある複数のタイルで、スライスヘッダに含まれるパラメータが共有されないよう、スライスがタイルのスーパーセットであることを禁じるのがよい。 <Second Modification>
As described with reference to FIG. 9B, the slice can be a superset of tiles, but the slice may be prohibited from being a superset of tiles. If the slice is not prohibited from being a superset of tiles, a slice SL0 that is a superset of tiles TL0 and TL1 that are located at a distance from each other may be set as shown in FIG. It is advisable to prohibit a slice from being a superset of tiles so that multiple tiles that are far from each other do not share the parameters contained in the slice header.

＜第３の変形例＞
上記の説明では、１つのフレーム内でタイルはオーバラップしないとしたが、タイルのオーバラップを許容してもよい。図２０は、実線で示すタイルＴＬ０と一点鎖線で示すタイルＴＬ１とがオーバラップしている状態を示している。 <Third Modification>
In the above description, tiles do not overlap in one frame, but tile overlap may be allowed. FIG. 20 shows a state where the tile TL0 indicated by the solid line and the tile TL1 indicated by the alternate long and short dash line overlap.

ヘッドマウントディスプレイを装着しているユーザの視野が破線で示す領域ＡＲ０にあるとき、画像復号装置２００は、タイルＴＬ０のうちの領域ＡＲ０の部分の画像を復号してヘッドマウントディスプレイに表示する。ユーザの視野が領域ＡＲ０から破線で示す領域ＡＲ１に移動したとき、画像復号装置２００は、タイルＴＬ１の復号処理を開始するまでの間に、タイルＴＬ０のタイルＴＬ１とオーバラップしているオーバラップ領域の画像を復号して表示する。従って、ユーザの視野が移動したときの画像表示の遅延を低減させることができる。 When the field of view of the user wearing the head mounted display is in the area AR0 indicated by the broken line, the image decoding apparatus 200 decodes the image of the area AR0 in the tile TL0 and displays it on the head mounted display. When the user's field of view moves from the area AR0 to the area AR1 indicated by the broken line, the image decoding apparatus 200 overlaps the tile TL1 of the tile TL0 before starting the decoding process of the tile TL1. Decode the image and display it. Accordingly, it is possible to reduce a delay in image display when the user's visual field moves.

上述した本実施形態においては、画像情報としてＹ，Ｃｂ，Ｃｒ映像信号を例としたが、色空間はＹ，Ｃｂ，Ｃｒに限定されず、Ｙ，Ｃｏ，Ｃｇ映像信号であってもよい。画像情報は、輝度成分と２つの色差信号よりなる色成分を含む任意の色空間の映像信号であればよい。 In the present embodiment described above, Y, Cb, and Cr video signals are taken as examples of image information. However, the color space is not limited to Y, Cb, and Cr, and may be Y, Co, and Cg video signals. The image information may be a video signal in an arbitrary color space including a luminance component and a color component composed of two color difference signals.

本発明は以上説明した本実施形態に限定されるものではなく、本発明の要旨を逸脱しない範囲において種々変更可能である。画像符号化装置１００及び画像復号装置２００は、集積回路等のハードウェアで構成されてもよく、ソフトウェアで構成されてもよく、両者が混在してもよい。画像符号化方法及び画像復号方法は、集積回路またはコンピュータ等の任意のハードウェア資源が実行すればよい。 The present invention is not limited to the embodiment described above, and various modifications can be made without departing from the scope of the present invention. The image encoding device 100 and the image decoding device 200 may be configured by hardware such as an integrated circuit, may be configured by software, or both may be mixed. The image encoding method and the image decoding method may be executed by any hardware resource such as an integrated circuit or a computer.

１００画像符号化装置
１０１タイル分割設定部
１０２，２０２スキップタイル設定部
１０３，２０３参照タイル設定部
１０４ピクチャ分割部
２００画像復号装置
２０１タイル分割復号部
２０４ピクチャ合成部
３００コンピュータ
３０２記憶部 DESCRIPTION OF SYMBOLS 100 Image coding apparatus 101 Tile division | segmentation setting part 102,202 Skip tile setting part 103,203 Reference tile setting part 104 Picture division part 200 Image decoding apparatus 201 Tile division decoding part 204 Picture composition part 300 Computer 302 Storage part

Claims

Image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated is input, and each frame of the image information is set to one or more set in the allocation area A picture dividing unit for dividing the first tile into one or a plurality of second tiles set in the non-allocation area;
An encoded data generation unit that encodes the image of the first tile to generate encoded data;
Syntax generation for generating a syntax including a first syntax element for each of the first and second tiles defined by the horizontal position and vertical position of the upper left corner in the frame, and the horizontal size and vertical size. And
A bit stream generation unit that generates a bit stream including the encoded data and the syntax;
An image encoding device comprising:

The encoded data generation unit does not encode the image of the second tile,
The syntax generation unit generates a syntax including a second syntax element that is a skip tile flag indicating that the image of the second tile is not encoded. Image encoding device.

The picture dividing unit divides the allocation area into a plurality of first tiles,
The encoded data generation unit is based on setting information on whether or not to refer to at least one other tile of the plurality of first tiles when encoding each of the plurality of first tiles. And encoding each of the plurality of first tiles to generate encoded data,
The syntax generation unit indicates that when the encoded data generation unit encodes the first tile without referring to other tiles, the syntax generation unit does not refer to other tiles, and refers to the other tiles. The image encoding apparatus according to claim 2, wherein when one tile is encoded, a syntax including a third syntax element indicating a tile to be referred to is generated.

Each frame of image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and one or more first tiles set in the allocation area; Dividing into one or a plurality of second tiles set in the non-allocation area,
Encoding the image of the first tile to generate encoded data;
Each of the first and second tiles generates a syntax including syntax elements defined by a horizontal position and a vertical position of the upper left corner in the frame, and a horizontal size and a vertical size,
An image encoding method, comprising: generating a bitstream including the encoded data and the syntax.

On the computer,
Each frame of image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and one or more first tiles set in the allocation area; Dividing into one or more second tiles set in the unallocated area;
Encoding the image of the first tile to generate encoded data;
Each of the first and second tiles generating a syntax including syntax elements defined by a horizontal position and a vertical position of the upper left corner in the frame and a horizontal size and a vertical size;
Generating a bitstream including the encoded data and the syntax;
An image encoding program characterized in that

It is image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and each frame of the image information includes one or a plurality of frames set in the allocation area Divided into a first tile and one or a plurality of second tiles set in the non-allocation area, encoded data obtained by encoding an image of the first tile, and the first and second A bit stream receiving unit that receives a bit stream including a syntax including a first syntax element defined by a horizontal position and a vertical position of the upper left corner in the frame and a horizontal size and a vertical size of each tile of When,
A decoding unit that decodes the encoded data to generate a decoded image of the first tile;
A picture synthesis unit that synthesizes the decoded image of the first tile and the single image of the second tile based on the first syntax element to generate decoded image information of each frame;
An image decoding apparatus comprising:

The encoded data is encoded data in which the image of the second tile is not encoded,
The bitstream receiving unit receives a syntax including a second syntax element that is a skip tile flag indicating that the second tile is not encoded;
The image decoding apparatus according to claim 6, wherein the picture composition unit assigns the single image to the second tile based on the second syntax element.

The allocation area of the image information is divided into a plurality of first tiles;
The bitstream reception unit indicates whether each of the plurality of first tiles is encoded with reference to at least one other tile of the plurality of first tiles, Receiving a syntax including a third syntax element indicating the referenced tile when encoded with reference to
The decoding unit decodes each of the plurality of first tiles based on the third syntax element without referring to the other tiles when not encoded with reference to the other tiles, The image decoding apparatus according to claim 7, wherein when decoding is performed with reference to another tile, the decoding is performed with reference to the other tile.

It is image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and each frame of the image information includes one or a plurality of frames set in the allocation area Divided into a first tile and one or a plurality of second tiles set in the non-allocation area, encoded data obtained by encoding an image of the first tile, and the first and second Each of the tiles receives a bitstream including a syntax including a syntax element defined by a horizontal position and a vertical position of the upper left corner in the frame and a horizontal size and a vertical size;
Decoding the encoded data to generate a decoded image of the first tile;
Based on the syntax element, the decoded image information of each frame is generated by synthesizing the decoded image of the first tile and the single image of the second tile.

On the computer,
It is image information in a format including an allocation area in which an image is allocated in a frame and a non-allocation area in which no image is allocated, and each frame of the image information includes one or a plurality of frames set in the allocation area Divided into a first tile and one or a plurality of second tiles set in the non-allocation area, encoded data obtained by encoding an image of the first tile, and the first and second Receiving a bitstream that includes a syntax including a syntax element defined by a horizontal position and a vertical position of the upper left corner and a horizontal size and a vertical size of each of the tiles in the frame;
Decoding the encoded data to generate a decoded image of the first tile;
Combining the decoded image of the first tile and the single image of the second tile based on the syntax element to generate decoded image information of each frame;
An image decoding program characterized in that