JP2019024246A

JP2019024246A - Image encoding apparatus, image encoding method, image decoding apparatus, image decoding method, and program

Info

Publication number: JP2019024246A
Application number: JP2018192307A
Authority: JP
Inventors: 前田　充; Mitsuru Maeda; 充前田; 真悟志摩; Shingo Shima
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2018-10-11
Filing date: 2018-10-11
Publication date: 2019-02-14
Anticipated expiration: 2033-07-12
Also published as: JP6656335B2

Abstract

To designate a tile capable of independent processing and perform processing of a specific area at high speed when an image is divided into tiles and encoded and decoded, in hierarchical encoding of an image.SOLUTION: An image encoding device 100 for hierarchically encoding an input image in a plurality of hierarchies, includes: acquisition means for acquiring a first image generated from an input image and a second image different in resolution from the first image; and encoding means for performing encoding using a second region of the second image existing at a position relatively equal to a first region in the first image as a reference image when encoding the first region of the first image acquired by the acquiring means.SELECTED DRAWING: Figure 1

Description

本発明は空間解像度や画質が異なるレイヤの符号化及び復号に関する。特に動画像において、画像を複数の領域に分割し、分割した領域ごとに符号化及び復号を行う画像符号化及び復号技術に関する。 The present invention relates to encoding and decoding of layers having different spatial resolution and image quality. In particular, the present invention relates to an image encoding and decoding technique for dividing an image into a plurality of regions and encoding and decoding each divided region in a moving image.

動画像の圧縮記録の符号化方式として、Ｈ．２６４／ＭＰＥＧ−４ＡＶＣ（以下Ｈ．２６４）が知らされている。 As an encoding method for compression recording of moving images, H.264 is used. H.264 / MPEG-4 AVC (hereinafter H.264) is known.

近年、Ｈ．２６４の後継として、さらに高効率な符号化方式の国際標準化を行う活動が開始され、ＪＣＴ−ＶＣ（ＪｏｉｎｔＣｏｌｌａｂｏｒａｔｉｖｅＴｅａｍｏｎＶｉｄｅｏＣｏｄｉｎｇ）がＩＳＯ／ＩＥＣとＩＴＵ−Ｔの間で設立された。このＪＣＴ−ＶＣでは、ＨｉｇｈＥｆｆｉｃｉｅｎｃｙＶｉｄｅｏＣｏｄｉｎｇ符号化方式（以下、ＨＥＶＣ）の標準化が進められている（非特許文献１）。 In recent years, H.C. As a successor to H.264, an international standardization of a more efficient coding system was started, and JCT-VC (Joint Collaborative Team on Video Coding) was established between ISO / IEC and ITU-T. In this JCT-VC, standardization of the High Efficiency Video Coding encoding method (hereinafter referred to as HEVC) is underway (Non-Patent Document 1).

ＨＥＶＣでは、画像を矩形領域（タイル）に分割し、各領域を独立に符号化及び復号する、タイル分割方式という技術が採用されている。さらに、タイル分割方式において、１つ以上のタイルからなるＭｏｔｉｏｎＣｏｎｓｔｒａｉｎｅｄＴｉｌｅＳｅｔｓ（以下、ＭＣＴＳ）を用いて、当該ＭＣＴＳを、他のタイルに依存せずに独立に符号化及び復号することが提案された（非特許文献２）。そして、非特許文献２に記載の提案では、シーケンス単位でＭＣＴＳを設定可能であることが定義されている。即ち、同一のシーケンスであれば、各フレームにおけるＭＣＴＳの位置は相対的に等しい。そして、上記の提案では、処理対象のフレーム内のＭＣＴＳを符号化及び復号する場合に、他のフレーム内にある、当該ＭＣＴＳと相対的に等しい位置の画素群をフレーム間予測の対象とする。即ち、当該画素群以外の画素を動きベクトル探索において参照する参照画素としない。これにより、ＭＣＴＳにおける符号化及び復号の独立性を確保することができる。尚、画像におけるＭＣＴＳに含まれるタイルの位置は、ＳＥＩ（ＳｕｐｐｌｅｍｅｎｔａｌＥｎｈａｎｃｅｍｅｎｔＩｎｆｏｒｍａｔｉｏｎ）メッセージに含めて符号化される。 HEVC employs a technique called a tile division method in which an image is divided into rectangular regions (tiles), and each region is independently encoded and decoded. Furthermore, in the tile division method, using Motion Constrained Tile Sets (hereinafter referred to as MCTS) composed of one or more tiles, it has been proposed to independently encode and decode the MCTS without depending on other tiles. (Non-Patent Document 2). In the proposal described in Non-Patent Document 2, it is defined that the MCTS can be set in sequence units. That is, in the same sequence, the MCTS positions in each frame are relatively equal. And in said proposal, when encoding and decoding MCTS in the frame of a process target, the pixel group of the position relatively equal to the said MCTS in another frame is made into the object of inter-frame prediction. That is, pixels other than the pixel group are not used as reference pixels to be referenced in the motion vector search. Thereby, the independence of encoding and decoding in MCTS can be ensured. Note that the position of the tile included in the MCTS in the image is encoded by being included in a SEI (Supplemental Enhancement Information) message.

一方で、ＨＥＶＣの標準化においては、階層符号化への拡張も検討されている。階層符号化では、基本レイヤと拡張レイヤにおいて符号化対象のタイルをそれぞれ符号化する。そして、各レイヤで符号化されたタイルを多重化してビットストリームを生成する。上記のような階層符号化では、基本レイヤのタイルの境界位置と拡張レイヤのタイルの境界位置とは独立に設定することが可能である。そして、拡張レイヤの符号化対象のタイルを符号化する場合に、基本レイヤの対応するタイルを参照する必要があるため、基本レイヤにおける当該タイルの位置を特定する必要がある。そこで、拡張レイヤのＶＵＩ（ＶｉｄｅｏＵｓａｂｉｌｉｔｙＩｎｆｏｒｍａｔｉｏｎ）パラメータ（ｖｕｉ＿ｐａｒａｍｅｔｅｒｓ）としてｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を用いること提案されている（非特許文献３）。当該符号は、各レイヤ間でタイルの相対的な位置が一致しているか否かを表す、一致情報を符号化したものである。当該符号が１の場合、拡張レイヤのタイルの境界の位置が基本レイヤの対応するタイルの境界の位置と一致することを保証する。これにより、拡張レイヤのタイルの符号化及び復号において呼び出される基本レイヤのタイルの位置が特定できるため、拡張レイヤのタイルを独立に符号化及び復号することができ、高速な符号化及び復号を可能にする。尚、基本レイヤが最も上位のレイヤとなり、続く拡張レイヤが順に下位のレイヤとなる。 On the other hand, in the standardization of HEVC, expansion to hierarchical coding is also being studied. In hierarchical encoding, tiles to be encoded are encoded in the base layer and the enhancement layer, respectively. Then, the bit stream is generated by multiplexing the tiles encoded in each layer. In the hierarchical encoding as described above, the boundary position of the base layer tile and the boundary position of the enhancement layer tile can be set independently. When encoding a tile to be encoded in the enhancement layer, it is necessary to refer to the corresponding tile in the base layer, and thus it is necessary to specify the position of the tile in the base layer. Therefore, it has been proposed to use a tile_boundaries_aligned_flag code as a VUI (Video Usability Information) parameter (vui_parameters) of the enhancement layer (Non-patent Document 3). This code is obtained by encoding coincidence information that indicates whether or not the relative positions of tiles coincide between the layers. When the code is 1, it is guaranteed that the position of the tile boundary of the enhancement layer matches the position of the corresponding tile boundary of the base layer. As a result, the position of the base layer tile called in the encoding and decoding of the enhancement layer tile can be specified, so that the enhancement layer tile can be encoded and decoded independently, enabling high-speed encoding and decoding. To. Note that the base layer is the highest layer, and the subsequent extension layers are sequentially lower layers.

ＩＴＵ−ＴＨ．２６５（０４／２０１３）ＨｉｇｈｅｆｆｉｃｉｅｎｃｙｖｉｄｅｏｃｏｄｉｎｇITU-TH. 265 (04/2013) High efficiency video coding ＪＣＴ−ＶＣ寄書ＪＣＴＶＣ−Ｍ０２３５インターネット＜ｈｔｔｐ：／／ｐｈｅｎｉｘ．ｉｎｔ−ｅｖｒｙ．ｆｒ／ｊｃｔ／ｄｏｃ＿ｅｎｄ＿ｕｓｅｒ／ｄｏｃｕｍｅｎｔｓ／１３＿Ｉｎｃｈｅｏｎ／ｗｇ１１／＞JCT-VC Contribution JCTVC-M0235 Internet <http: // phenix. int-evry. fr / jct / doc_end_user / documents / 13_Incheon / wg11 /> ＪＣＴ−ＶＣ寄書ＪＣＴＶＣ−Ｍ０２０２インターネット＜ｈｔｔｐ：／／ｐｈｅｎｉｘ．ｉｎｔ−ｅｖｒｙ．ｆｒ／ｊｃｔ／ｄｏｃ＿ｅｎｄ＿ｕｓｅｒ／ｄｏｃｕｍｅｎｔｓ／１３＿Ｉｎｃｈｅｏｎ／ｗｇ１１／＞JCT-VC Contribution JCTVC-M0202 Internet <http: // phenix. int-evry. fr / jct / doc_end_user / documents / 13_Incheon / wg11 />

しかしながら、上記の非特許文献２に記載のＭＣＴＳでは、階層符号化に関して考慮されていない。即ち、タイルの境界及びＭＣＴＳの位置をレイヤ毎に設定できる場合に、各レイヤにおけるタイルの相対的な位置は不一致であることが考えられる。例えば、拡張レイヤの所定のタイルがＭＣＴＳに含まれる場合であって、基本レイヤの当該所定のタイルに対応する位置のタイルがＭＣＴＳに含まれない場合に、基本レイヤでは当該所定のタイルに対応する位置以外の、周囲のタイルも復号する必要がある。 However, the MCTS described in Non-Patent Document 2 does not consider hierarchical coding. That is, when the tile boundary and the MCTS position can be set for each layer, the relative positions of the tiles in each layer may be inconsistent. For example, when a predetermined tile of the enhancement layer is included in the MCTS and a tile at a position corresponding to the predetermined tile of the basic layer is not included in the MCTS, the basic layer corresponds to the predetermined tile. The surrounding tiles other than the position also need to be decoded.

ここで、図１３を用いて具体的に説明する。図１３はタイル分割の様子を表している。１３０１〜１３１０は、それぞれフレームを表す。各フレーム１３０１〜１３１０は、タイル番号０〜１１の１２個のタイルでそれぞれ構成される。以下、タイル番号１のタイルをタイル１と称す。番号が変化しても同様である。また、説明のため、基本レイヤでは各フレームのタイル分割は水平方向に２分割し、垂直方向に分割はしないものとする。さらに、拡張レイヤでは各フレームのタイル分割は、フレームを水平方向に４分割、垂直方向に３分割するものとする。また、図中の細枠はタイルの境界を表す。 Here, it demonstrates concretely using FIG. FIG. 13 shows a state of tile division. Reference numerals 1301 to 1310 denote frames. Each of the frames 1301 to 1310 includes 12 tiles having tile numbers 0 to 11, respectively. Hereinafter, the tile with tile number 1 is referred to as tile 1. The same is true even if the number changes. For the sake of explanation, in the base layer, the tile division of each frame is divided into two in the horizontal direction and not divided in the vertical direction. Furthermore, in the enhancement layer, the tile division of each frame is assumed to be divided into four in the horizontal direction and three in the vertical direction. A thin frame in the figure represents a tile boundary.

各フレーム１３０１、１３０３、１３０５、１３０７は、時刻ｔにおける各レイヤのフレームを表す。フレーム１３０１は時刻ｔの基本レイヤのフレームを表す。フレーム１３０５は時刻ｔの拡張第１階層（第１拡張レイヤ）のフレームを表す。フレーム１３０３はフレーム１３０１を局所復号した再構成画像を第１拡張レイヤの解像度に拡大したフレームを表す。フレーム１３０９は時刻ｔの拡張第２階層（第２拡張レイヤ）のフレームを表す。フレーム１３０７はフレーム１３０５の復号画像を第２拡張レイヤの解像度に拡大したフレームを表す。 Each frame 1301, 1303, 1305, 1307 represents a frame of each layer at time t. A frame 1301 represents a frame of the base layer at time t. A frame 1305 represents a frame in the extended first layer (first extended layer) at time t. A frame 1303 represents a frame obtained by enlarging the reconstructed image obtained by locally decoding the frame 1301 to the resolution of the first enhancement layer. A frame 1309 represents a frame in the extended second layer (second extended layer) at time t. A frame 1307 represents a frame obtained by enlarging the decoded image of the frame 1305 to the resolution of the second enhancement layer.

さらに、各フレーム１３０２、１３０４、１３０６、１３０８は、時刻ｔ＋δにおける各レイヤのフレームを表す。フレーム１３０２は時刻ｔ＋δの基本レイヤのフレームを表す。フレーム１３０６は時刻ｔ＋δの第１拡張レイヤのフレームを表す。フレーム１３０４はフレーム１３０２の復号画像を第１拡張レイヤの解像度に拡大したフレームを表す。フレーム１３１０は時刻ｔ＋δの第２拡張レイヤのフレームを表す。フレーム１３０８はフレーム１３０６の復号画像を第２拡張レイヤの解像度に拡大したフレームを表す。 Further, each frame 1302, 1304, 1306, 1308 represents a frame of each layer at time t + δ. A frame 1302 represents a frame of the base layer at time t + δ. A frame 1306 represents a frame of the first enhancement layer at time t + δ. A frame 1304 represents a frame obtained by enlarging the decoded image of the frame 1302 to the resolution of the first enhancement layer. Frame 1310 represents the frame of the second enhancement layer at time t + δ. A frame 1308 represents a frame obtained by enlarging the decoded image of the frame 1306 to the resolution of the second enhancement layer.

以下、拡張レイヤの各フレーム（フレーム１３０５、１３０６、１３０９、１３１０）のタイル５をＭＣＴＳのタイルとして説明する。図１３において、太枠はＭＣＴＳに属するタイル乃至はその対応位置を表す。 Hereinafter, the tile 5 of each frame (frames 1305, 1306, 1309, and 1310) of the enhancement layer will be described as an MCTS tile. In FIG. 13, a thick frame represents a tile belonging to the MCTS or its corresponding position.

図１３において、第２拡張レイヤのフレーム１３１０のＭＣＴＳ（タイル５）を復号するためには、第１拡張レイヤのフレーム１３０６のタイル５が復号されている必要がある。さらに、第１拡張レイヤのフレーム１３０６のタイル５を復号するためには、基本レイヤのフレーム１３０２のタイル０が復号されている必要がある。さらに、基本レイヤのフレーム１３０２のタイル０を復号するためには、フレーム１３０１を参照してフレーム間予測を行う必要があり、フレーム１３０１の全てのタイルを復号する必要がある。 In FIG. 13, in order to decode the MCTS (tile 5) of the frame 1310 of the second enhancement layer, the tile 5 of the frame 1306 of the first enhancement layer needs to be decoded. Further, in order to decode the tile 5 of the frame 1306 of the first enhancement layer, the tile 0 of the frame 1302 of the base layer needs to be decoded. Further, in order to decode the tile 0 of the frame 1302 of the base layer, it is necessary to perform inter-frame prediction with reference to the frame 1301, and it is necessary to decode all the tiles of the frame 1301.

即ち、従来技術において、時刻ｔ＋δにおける第２拡張レイヤのＭＣＴＳを復号する場合に、時刻ｔにおける基本レイヤのフレーム１３０２のタイル５の位置を示す領域（フレーム１３０２の点線部分）以外の領域を復号する必要がある。このため、階層符号化において、ＭＣＴＳ等を用いて所定のタイルを符号化及び復号する場合に、当該ＭＣＴＳの位置に対応するタイルだけを、独立して符号化及び復号するということができないという課題がある。 That is, in the conventional technique, when decoding the MCTS of the second enhancement layer at time t + δ, a region other than the region indicating the position of the tile 5 of the frame 1302 of the base layer at time t (dotted line portion of the frame 1302) is decoded. There is a need. Therefore, in hierarchical encoding, when a predetermined tile is encoded and decoded using MCTS or the like, it is not possible to independently encode and decode only the tile corresponding to the position of the MCTS. There is.

本発明は上述した課題を解決するためになされたものであり、階層符号化において、所定のタイルを他のタイルに依存せずに独立に符号化及び復号することを可能とすることを目的としている。 The present invention has been made in order to solve the above-described problems, and an object of the present invention is to make it possible to independently encode and decode a predetermined tile without depending on other tiles in hierarchical encoding. Yes.

本発明の画像符号化装置は、例えば、下記の構成を有する。すなわち、動画像を構成する画像を複数の階層で階層符号化する画像符号化装置であって、第１の画像とは階層が異なる第２の画像を生成する生成手段と、前記第１の画像における、１又は複数のタイルから構成される第１のタイルセットと、前記第２の画像における、１又は複数のタイルから構成される第２のタイルセットとを符号化する符号化手段と、情報符号化手段とを有し、前記第２のタイルセットは、前記第２の画像における前記第１のタイルセットに対応する位置にあり、前記符号化手段は、前記第１の画像においては前記第１のタイルセット以外を参照せずに前記第１のタイルセットを符号化するとともに、前記第２の画像においては前記第２のタイルセット以外を参照せずに前記第２のタイルセットを符号化し、前記符号化手段は、前記第２の画像の少なくとも一部の領域を参照して前記第１のタイルセットを符号化する場合、前記第２の画像においては前記第２のタイルセットのみを参照するよう制限して前記第１のタイルセットを符号化し、前記情報符号化手段は、前記第１のタイルセット及び前記第２タイルセットの復号処理に関する制限を示すＳＥＩメッセージを符号化する。 The image encoding apparatus of the present invention has the following configuration, for example. That is, an image encoding apparatus that hierarchically encodes an image constituting a moving image in a plurality of layers, the generation unit generating a second image having a layer different from the first image, and the first image Encoding means for encoding the first tile set composed of one or more tiles and the second tile set composed of one or more tiles in the second image, and information Encoding means, wherein the second tile set is in a position corresponding to the first tile set in the second image, and the encoding means is the first tile in the first image. The first tile set is encoded without referring to other than the one tile set, and the second tile set is encoded without referring to other than the second tile set in the second image. The encoding The stage restricts reference to only the second tile set in the second image when encoding the first tile set with reference to at least a partial region of the second image. Then, the first tile set is encoded, and the information encoding means encodes an SEI message indicating a restriction on decoding processing of the first tile set and the second tile set.

本発明の画像復号装置は、例えば、下記の構成を有する。すなわち、動画像を構成する画像を複数の階層で階層符号化して生成された符号化データを復号する画像復号装置であって、１又は複数のタイルから構成されるタイルセットの復号処理に関する制限を示すＳＥＩメッセージを復号する情報復号手段と、前記ＳＥＩメッセージに従って、前記第１の画像における第１のタイルセットと、前記第２の画像における第２のタイルセットとを復号する復号手段とを有し、前記第２のタイルセットは、前記第２の画像における前記第１のタイルセットに対応する位置にあり、前記復号手段は、前記第１の画像においては前記第１のタイルセット以外を参照せずに前記第１のタイルセットを復号するとともに、前記第２の画像においては前記第２のタイルセット以外を参照せずに前記第２のタイルセットを復号し、前記復号手段は、前記第２の画像の少なくとも一部の領域を参照して前記第１のタイルセットを復号する場合、前記第２の画像においては前記第２のタイルセットのみを参照するよう制限して前記第１のタイルセットを復号する。 The image decoding apparatus of the present invention has the following configuration, for example. That is, an image decoding apparatus that decodes encoded data generated by hierarchically encoding images constituting a moving image in a plurality of hierarchies. Information decoding means for decoding the indicated SEI message, and decoding means for decoding the first tile set in the first image and the second tile set in the second image according to the SEI message. The second tile set is at a position corresponding to the first tile set in the second image, and the decoding means refers to a part other than the first tile set in the first image. Without decoding the first tile set without referring to other than the second tile set in the second image. And when the decoding means decodes the first tile set with reference to at least a partial area of the second image, only the second tile set is referred to in the second image. And the first tile set is decoded.

本発明により、階層符号化において、独立に符号化及び復号が可能なタイルを設定することを可能にする。 The present invention makes it possible to set tiles that can be encoded and decoded independently in hierarchical encoding.

実施形態１における画像符号化装置１００の構成を示すブロック図1 is a block diagram illustrating a configuration of an image encoding device 100 according to a first embodiment. タイルの構成の一例を示す図A diagram showing an example of a tile configuration 実施形態１の画像符号化装置１００における画像符号化処理を表すフローチャート7 is a flowchart illustrating image encoding processing in the image encoding device 100 according to the first embodiment. 実施形態１における画像符号化装置４００の構成を示すブロック図1 is a block diagram illustrating a configuration of an image encoding device 400 according to Embodiment 1. FIG. 実施形態１の画像符号化装置４００における画像符号化処理を表すフローチャート7 is a flowchart illustrating image encoding processing in the image encoding device 400 according to the first embodiment. 実施形態２における画像表示装置６００の構成を示すブロック図FIG. 6 is a block diagram illustrating a configuration of an image display apparatus 600 according to the second embodiment. 実施形態２における画像復号部６０５の構成を示すブロック図A block diagram showing composition of picture decoding part 605 in Embodiment 2. 実施形態２の画像復号部６０５における画像復号処理を表すフローチャートThe flowchart showing the image decoding process in the image decoding part 605 of Embodiment 2. 実施形態２における画像復号部６０５装置の別な構成を示すブロック図The block diagram which shows another structure of the image decoding part 605 apparatus in Embodiment 2. FIG. 実施形態２の画像復号部６０５の別な形態における画像復号処理を表すフローチャートThe flowchart showing the image decoding process in another form of the image decoding part 605 of Embodiment 2. 本発明の画像符号化装置、又は画像復号装置に適用可能なコンピュータのハードウェアの構成例を示すブロック図The block diagram which shows the structural example of the hardware of the computer applicable to the image coding apparatus of this invention, or an image decoding apparatus. ビットストリームのｖｕｉ＿ｐａｒａｍｅｔｅｒｓに関するシンタックスを表す図The figure showing the syntax about vui_parameters of a bit stream タイルの構成の従来例の一例を示す図The figure which shows an example of the prior art example of the structure of a tile

以下、添付の図面を参照して、本願発明をその好適な実施形態に基づいて詳細に説明する。尚、以下の実施形態において示す構成は一例に過ぎず、本発明は図示された構成に限定されるものではない。
以下、ＭＣＴＳに含まれる各タイルのように、独立に符号化及び復号できるタイルを独立タイルと呼び、ＭＣＴＳのような独立タイルの集まりを独立タイルセットと呼ぶことにする。 Hereinafter, the present invention will be described in detail based on the preferred embodiments with reference to the accompanying drawings. The configurations shown in the following embodiments are merely examples, and the present invention is not limited to the illustrated configurations.
Hereinafter, tiles that can be encoded and decoded independently like each tile included in the MCTS are referred to as independent tiles, and a collection of independent tiles such as MCTS is referred to as an independent tile set.

＜実施形態１＞
以下、図１を用いて本実施形態に係る画像符号化装置を構成する各処理部の概要を説明する。図１は、本実施形態の画像符号化装置１００を示すブロック図である。 <Embodiment 1>
Hereinafter, the outline of each processing unit constituting the image coding apparatus according to the present embodiment will be described with reference to FIG. FIG. 1 is a block diagram showing an image encoding device 100 of the present embodiment.

図１における１０１は、画像（入力画像）を入力する端子（入力手段）である。入力画像は１フレームずつ入力されるものとする。１０２は１フレーム内の垂直方向及び水平方向のタイル分割の数、及び各タイルの位置を決定するタイル設定部である。さらに、タイル設定部１０２は分割されたタイルのうちいずれかを独立タイルとして符号化するか否かを決定する。以下、タイル設定部１０２によって設定される、水平方向タイルの分割数、垂直方向タイルの分割数、及び分割の位置を表す情報を、タイル分割情報と称す。また、当該タイル分割情報に関しては、非特許文献１において、ピクチャのヘッダデータであるＰｉｃｔｕｒｅＰａｒａｍｅｔｅｒＳｅｔ（ＰＰＳ）の記載部分に説明されているのでここでは説明を省略する。 Reference numeral 101 in FIG. 1 denotes a terminal (input means) for inputting an image (input image). The input image is input frame by frame. Reference numeral 102 denotes a tile setting unit that determines the number of tile divisions in the vertical and horizontal directions within one frame and the position of each tile. Further, the tile setting unit 102 determines whether or not to encode any one of the divided tiles as an independent tile. Hereinafter, the information set by the tile setting unit 102 and indicating the number of horizontal tile divisions, the number of vertical tile divisions, and the position of division is referred to as tile division information. The tile division information is described in Non-Patent Document 1 in the description part of Picture Parameter Set (PPS), which is the header data of a picture, so the description thereof is omitted here.

ここで、本実施形態におけるタイル分割の例を、図２を用いて説明する。本実施形態の図２において、１フレームを４Ｋ２Ｋ（水平方向４０９６画素×垂直方向２１６０画素）とする。以下、本実施形態では、水平方向４０９６画素×垂直方向２１６０画素を、４０９６×２１６０画素と表記する。画素数が変化しても同様である。さらに、図２における２０１〜２０６はそれぞれフレームを表す。各フレーム２０１〜２０６は、水平方向に４分割、垂直方向に３分割することにより、タイル番号０〜１１の１２個のタイルで構成される。即ち、１タイルのサイズは１０２４×７２０画素となる。但し、分割数はこれに限定されない。さらに、図２に示すフレーム２０１〜２０６の中の太枠で示されたタイル５及びタイル６をそれぞれ独立タイルとし、タイル５とタイル６とからなる領域を独立タイルセットとする。また、図２に示すフレーム２０１〜２０６中の細枠は、各タイルの境界を表す。また、図２に示す拡大画像における太枠はこれらの独立タイルセットに対応する位置を表す。さらに、図２から明らかなように、各レイヤにおいて、水平方向及び垂直方向のタイルの分割数、及び各タイルの相対的な位置はそれぞれ一致している。 Here, an example of tile division in the present embodiment will be described with reference to FIG. In FIG. 2 of the present embodiment, one frame is 4K2K (horizontal direction 4096 pixels × vertical direction 2160 pixels). Hereinafter, in this embodiment, 4096 pixels in the horizontal direction × 2160 pixels in the vertical direction will be referred to as 4096 × 2160 pixels. The same applies even if the number of pixels changes. Furthermore, 201 to 206 in FIG. 2 each represent a frame. Each of the frames 201 to 206 is composed of 12 tiles having tile numbers 0 to 11 by dividing the frame into four in the horizontal direction and three in the vertical direction. That is, the size of one tile is 1024 × 720 pixels. However, the number of divisions is not limited to this. Furthermore, the tile 5 and the tile 6 indicated by the thick frames in the frames 201 to 206 shown in FIG. 2 are respectively independent tiles, and an area composed of the tiles 5 and 6 is an independent tile set. Moreover, the thin frames in the frames 201 to 206 shown in FIG. 2 represent the boundaries of the tiles. Moreover, the thick frame in the enlarged image shown in FIG. 2 represents the position corresponding to these independent tile sets. Further, as is apparent from FIG. 2, in each layer, the number of tile divisions in the horizontal and vertical directions and the relative position of each tile are the same.

図２におけるフレーム２０１は、時刻ｔに入力された基本レイヤのフレームを表す。フレーム２０２は、時刻ｔ＋δに入力された基本レイヤのフレームを表す。時刻ｔ＋δにおいてフレーム２０１は符号化及び局所復号（逆量子化及び逆変換）済みであり、フレーム２０２を符号化する際には局所復号されたフレーム２０１を参照フレームとして用いることができる。 A frame 201 in FIG. 2 represents the frame of the base layer input at time t. A frame 202 represents a frame of the base layer input at time t + δ. At time t + δ, the frame 201 has been encoded and locally decoded (inverse quantization and inverse transformation), and when the frame 202 is encoded, the locally decoded frame 201 can be used as a reference frame.

フレーム２０３は、フレーム２０１を符号化した後に、局所復号を行うことで再構成画像の生成を行い、さらに当該再構成画像を拡張レイヤと同等のサイズに拡大した拡大画像である。フレーム２０４は、フレーム２０２を符号化した後に、局所復号を行うことで再構成画像の生成を行い、さらに当該再構成画像を拡張レイヤと同等のサイズに拡大した拡大画像である。 The frame 203 is an enlarged image obtained by generating a reconstructed image by performing local decoding after encoding the frame 201 and further expanding the reconstructed image to a size equivalent to that of the enhancement layer. The frame 204 is an enlarged image obtained by encoding the frame 202, generating a reconstructed image by performing local decoding, and further expanding the reconstructed image to a size equivalent to that of the enhancement layer.

フレーム２０５は、時刻ｔに入力された拡張レイヤのフレームを表す。フレーム２０６は、時刻ｔ＋δに入力された拡張レイヤのフレームを表す。 A frame 205 represents an enhancement layer frame input at time t. A frame 206 represents an enhancement layer frame input at time t + δ.

再び、図１の各処理部の説明に戻る。以下、時刻ｔ＋δのフレームを符号化対象のフレームとして説明する。 Returning to the description of each processing unit in FIG. Hereinafter, the frame at time t + δ will be described as a frame to be encoded.

タイル設定部１０２は、シーケンス単位で独立タイルを含むか否かの情報を表す独立タイルフラグを生成する。タイル設定部１０２は、符号化対象のフレームに独立タイルが含まれる場合に独立タイルフラグの値を１とし、符号化対象のフレームに独立タイルが含まれない場合に独立タイルフラグの値を０とする。さらに、タイル設定部１０２は、符号化対象のフレームに独立タイルが含まれる（独立タイルフラグの値が１）場合、当該独立タイルの位置を表す独立タイル位置情報を生成する。一般的に、独立タイル位置情報は、画像内のタイル番号で表されるが、本発明はこれに限定されない。そして、タイル設定部１０２は、生成した独立タイルフラグ及び独立タイル位置情報をタイル分割情報として後段へ出力する。本実施形態において、タイル設定部１０２から出力されたタイル分割情報は、拡張レイヤ分割部１０４、基本レイヤ分割部１０５、独立タイル判定部１０６、及びヘッダ符号化部１１４に入力される。 The tile setting unit 102 generates an independent tile flag representing information on whether or not an independent tile is included in sequence units. The tile setting unit 102 sets the value of the independent tile flag to 1 when the encoding target frame includes an independent tile, and sets the value of the independent tile flag to 0 when the encoding target frame does not include the independent tile. To do. Further, when an independent tile is included in the encoding target frame (the value of the independent tile flag is 1), the tile setting unit 102 generates independent tile position information indicating the position of the independent tile. In general, the independent tile position information is represented by a tile number in the image, but the present invention is not limited to this. Then, the tile setting unit 102 outputs the generated independent tile flag and independent tile position information to the subsequent stage as tile division information. In the present embodiment, the tile division information output from the tile setting unit 102 is input to the enhancement layer division unit 104, the base layer division unit 105, the independent tile determination unit 106, and the header encoding unit 114.

１０３は縮小部である。縮小部１０３は、端子１０１から入力した入力画像を予め決められたフィルタ等を用いて縮小し、解像度を低下させた縮小画像（基本レイヤ画像）を生成する。 Reference numeral 103 denotes a reduction unit. The reduction unit 103 reduces the input image input from the terminal 101 using a predetermined filter or the like, and generates a reduced image (base layer image) with reduced resolution.

１０４は拡張レイヤ分割部である。拡張レイヤ分割部１０４は、端子１０１から入力した入力画像を拡張レイヤの画像（拡張レイヤ画像）とし、タイル設定部１０２によって出力されたタイル分割情報に基づいて、当該拡張レイヤ画像を１つ以上のタイルに分割する。ここで、拡張レイヤ分割部１０４は、図２に示すように、入力されたフレーム２０６をタイル０〜１１の１２個のタイルに分割する。さらに、拡張レイヤ分割部１０４は、分割した各タイルを、タイル番号の順番（０、１、２、・・・、１１の順）で後段にそれぞれ出力する。 Reference numeral 104 denotes an enhancement layer dividing unit. The enhancement layer dividing unit 104 sets an input image input from the terminal 101 as an enhancement layer image (enhancement layer image), and based on the tile division information output by the tile setting unit 102, the enhancement layer image is converted into one or more enhancement layer images. Divide into tiles. Here, the enhancement layer dividing unit 104 divides the input frame 206 into 12 tiles 0 to 11 as shown in FIG. Further, the enhancement layer dividing unit 104 outputs each divided tile to the subsequent stage in the order of tile numbers (in the order of 0, 1, 2,..., 11).

１１４はヘッダ符号化部である。シーケンス単位及びピクチャ単位のヘッダ符号データを生成する。特に、ヘッダ符号化部１１４は、タイル設定部１０２で生成された独立タイルフラグと独立タイル位置情報とを入力し、ＭＣＴＳＳＥＩ（ＳＥＩメッセージ）を生成し、ＶＵＩパラメータ（ｖｕｉ＿ｐａｒａｍｅｔｅｒｓ）を符号化する。 Reference numeral 114 denotes a header encoding unit. Header code data in units of sequences and pictures is generated. In particular, the header encoding unit 114 receives the independent tile flag and the independent tile position information generated by the tile setting unit 102, generates MCTS SEI (SEI message), and encodes VUI parameters (vui_parameters).

１０５は基本レイヤ分割部である。基本レイヤ分割部１０５は、縮小部１０３によって生成された基本レイヤ画像を、タイル設定部１０２によって出力されたタイル分割情報に基づいて、１つ以上のタイルに分割する。即ち、基本レイヤ分割部１０５は、当該タイル分割情報に基づく各タイルの位置が縮小部１０３によって生成された基本レイヤ画像において相対的に等しい位置になるように、当該基本レイヤ画像をタイルに分割する。本実施形態において、基本レイヤ分割部１０５は、図２に示すように入力されたフレーム２０２をタイル０〜１１の１２個のタイルに分割する。さらに、基本レイヤ分割部１０５は、分割した各タイルを、タイル番号の順番で後段にそれぞれ出力する。また、基本レイヤ分割部１０５は、出力するタイル（符号化対象のタイル）の番号を独立タイル判定部１０６に通達する。 Reference numeral 105 denotes a base layer dividing unit. The base layer division unit 105 divides the base layer image generated by the reduction unit 103 into one or more tiles based on the tile division information output by the tile setting unit 102. That is, the base layer division unit 105 divides the base layer image into tiles such that the positions of the tiles based on the tile division information are relatively equal in the base layer image generated by the reduction unit 103. . In the present embodiment, the base layer dividing unit 105 divides the input frame 202 into 12 tiles 0 to 11 as shown in FIG. Further, the base layer division unit 105 outputs the divided tiles to the subsequent stage in the order of tile numbers. Further, the base layer division unit 105 notifies the independent tile determination unit 106 of the number of the tile to be output (tile to be encoded).

１０６は、符号化対象のタイル（符号化対象タイル）が独立タイルであるか否かを判定する、独立タイル判定部である。独立タイル判定部１０６は、タイル設定部１０２で生成された独立タイルフラグ及び独立タイル位置情報と、基本レイヤ分割部１０５から入力された符号化対象タイルの番号とに基づいて、符号化対象タイルが独立タイルであるか否かを判定する。ここで、独立タイルフラグが１であり、独立タイル位置情報によって独立タイルの位置がタイル５であり、符号化対象タイルがタイル５である場合に、独立タイル判定部１０６は、符号化対象タイルが独立タイルであると判定することができる。さらに、独立タイル判定部１０６は、判定結果を独立タイル符号化フラグとして後段に出力する。ここで、独立タイル判定部１０６は、符号化対象タイルが独立タイルである場合に当該独立タイル符号化フラグの値を１とし、符号化対象タイルが独立タイルではない場合に当該独立タイル符号化フラグの値を０とする。 Reference numeral 106 denotes an independent tile determination unit that determines whether or not an encoding target tile (encoding target tile) is an independent tile. The independent tile determination unit 106 determines whether the encoding target tile is based on the independent tile flag and the independent tile position information generated by the tile setting unit 102 and the encoding target tile number input from the base layer division unit 105. Determine whether it is an independent tile. Here, when the independent tile flag is 1, the position of the independent tile is the tile 5 according to the independent tile position information, and the encoding target tile is the tile 5, the independent tile determination unit 106 determines that the encoding target tile is It can be determined that the tile is an independent tile. Furthermore, the independent tile determination unit 106 outputs the determination result to the subsequent stage as an independent tile encoding flag. Here, the independent tile determination unit 106 sets the value of the independent tile encoding flag to 1 when the encoding target tile is an independent tile, and sets the independent tile encoding flag when the encoding target tile is not an independent tile. The value of is assumed to be 0.

１０７は、基本レイヤ分割部１０５から入力された、基本レイヤ画像の符号化対象タイルの画像を符号化する基本レイヤ符号化部である。基本レイヤ符号化部１０７は、独立タイル判定部１０６から入力された独立タイル符号化フラグに基づいて、符号化対象タイルを符号化し、基本レイヤ符号データを生成する。 Reference numeral 107 denotes a base layer encoding unit that encodes the encoding target tile image of the base layer image input from the base layer dividing unit 105. The base layer encoding unit 107 encodes the encoding target tile based on the independent tile encoding flag input from the independent tile determination unit 106, and generates base layer code data.

ここで、独立タイル符号化フラグが、符号化対象タイルが独立タイルであることを示す場合の基本レイヤ符号化部１０７における符号化処理について説明する。この場合、基本レイヤ符号化部１０７は局所復号済みの基本レイヤの再構成画像のうち当該符号化対象タイルを含む独立タイルセットの位置と相対的に等しい位置の画素のみを参照して予測及び符号化を行う。さらに、図２を例にとって説明すれば、フレーム２０２のタイル５を符号化対象とする場合、基本レイヤ符号化部１０７はフレーム２０１の独立タイルセット内のタイル５及びタイル６のみを参照して予測及び符号化を行う。一方、独立タイル符号化フラグが、符号化対象タイルが独立タイルでないことを示す場合、基本レイヤ符号化部１０７は局所復号済みの基本レイヤの再構成画像の全ての画素を参照して予測及び予測誤差等の符号化を行う。図２を用いて説明すれば、フレーム２０２のタイル２を符号化対象とする場合、基本レイヤ符号化部１０７はフレーム２０１の全てのタイル（タイル０〜１１）を参照して予測及び符号化を行う。 Here, the encoding process in the base layer encoding unit 107 when the independent tile encoding flag indicates that the encoding target tile is an independent tile will be described. In this case, the base layer encoding unit 107 performs prediction and encoding with reference to only a pixel at a position relatively equal to the position of the independent tile set including the encoding target tile in the reconstructed image of the base layer that has been locally decoded. Do. Further, referring to FIG. 2 as an example, when the tile 5 of the frame 202 is to be encoded, the base layer encoding unit 107 performs prediction with reference to only the tile 5 and the tile 6 in the independent tile set of the frame 201. And encoding. On the other hand, when the independent tile encoding flag indicates that the encoding target tile is not an independent tile, the base layer encoding unit 107 refers to all pixels of the reconstructed image of the base layer that has been locally decoded, and performs prediction and prediction. Encode errors and so on. Referring to FIG. 2, when the tile 2 of the frame 202 is to be encoded, the base layer encoding unit 107 performs prediction and encoding with reference to all the tiles (tiles 0 to 11) of the frame 201. Do.

さらに、基本レイヤ符号化部１０７は、予測のために用いられた予測モード、予測によって生成された予測誤差、当該予測誤差を符号化して生成した基本レイヤ符号データ等を後段に出力する。 Furthermore, the base layer encoding unit 107 outputs the prediction mode used for prediction, the prediction error generated by the prediction, the base layer code data generated by encoding the prediction error, and the like to the subsequent stage.

１０８は基本レイヤ符号化部１０７で生成された係数（予測モード及び予測誤差）等を入力し、当該予測誤差を局所復号して基本レイヤの再構成画像を生成する基本レイヤ再構成部である。さらに、基本レイヤ再構成部１０８は、生成した再構成画像を保持する。これは、基本レイヤ符号化部１０７及び拡張レイヤ符号化部１１２において、当該再構成画像を用いて予測を行うためである。 Reference numeral 108 denotes a base layer reconstruction unit that receives the coefficients (prediction mode and prediction error) generated by the base layer coding unit 107 and locally decodes the prediction error to generate a reconstructed image of the base layer. Furthermore, the base layer reconstruction unit 108 holds the generated reconstructed image. This is because the base layer encoding unit 107 and the enhancement layer encoding unit 112 perform prediction using the reconstructed image.

１０９は拡大部であり、基本レイヤの再構成画像を拡張レイヤのサイズに拡大する。図２において、拡大部１０９はフレーム２０１及びフレーム２０２の各々の再構成画像に対して拡大を行い、フレーム２０３及びフレーム２０４を生成する。 Reference numeral 109 denotes an enlargement unit that enlarges the reconstructed image of the base layer to the size of the enhancement layer. In FIG. 2, the enlargement unit 109 enlarges each reconstructed image of the frame 201 and the frame 202 to generate a frame 203 and a frame 204.

１１２は、拡張レイヤ分割部１０４から入力されたタイルの画像を符号化する拡張レイヤ符号化部である。拡張レイヤ符号化部１１２は、独立タイル判定部１０６から入力された独立タイル符号化フラグに基づいて参照画像を選択し、符号化対象タイルを符号化し、拡張レイヤ符号データを生成する。 Reference numeral 112 denotes an enhancement layer encoding unit that encodes a tile image input from the enhancement layer dividing unit 104. The enhancement layer encoding unit 112 selects a reference image based on the independent tile encoding flag input from the independent tile determination unit 106, encodes the encoding target tile, and generates enhancement layer code data.

ここで、独立タイル符号化フラグが１（符号化対象タイルが独立タイルである）の場合、拡張レイヤ符号化部１１２は基本レイヤの局所復号済みの再構成画像を拡大した拡大画像と、局所復号済みの拡張レイヤの再構成画像とを参照する。そして、拡張レイヤ符号化部１１２は、当該拡大画像及び当該再構成画像の各画像の独立タイルセットに含まれる画像を参照して予測及び符号化を行う。さらに、図２を例にとって説明すれば、フレーム２０６のタイル５を符号化対象とする場合、拡張レイヤ符号化部１１２は、フレーム２０４のタイル５及びタイル６とフレーム２０６のタイル５のうち局所復号済みの再構成画像とを参照して予測及び符号化を行う。一方、独立タイル符号化フラグが０（符号化対象タイルが独立タイルでない）の場合、拡張レイヤ符号化部１１２は、局所復号済みの基本レイヤの拡大画像及び局所復号済みの拡張レイヤの再構成画像を参照して独立タイルに限定せずに予測を行う。そして、拡張レイヤ符号化部１１２は、予測して生成された予測誤差等を符号化する。 Here, when the independent tile encoding flag is 1 (the encoding target tile is an independent tile), the enhancement layer encoding unit 112 expands the reconstructed image of the base layer that has been locally decoded, and local decoding. Reference is made to the reconstructed image of the completed enhancement layer. Then, the enhancement layer encoding unit 112 performs prediction and encoding with reference to images included in the independent tile set of each image of the enlarged image and the reconstructed image. Further, referring to FIG. 2 as an example, when the tile 5 of the frame 206 is to be encoded, the enhancement layer encoding unit 112 performs local decoding among the tile 5 and the tile 6 of the frame 204 and the tile 5 of the frame 206. Prediction and encoding are performed with reference to the completed reconstructed image. On the other hand, when the independent tile encoding flag is 0 (the encoding target tile is not an independent tile), the enhancement layer encoding unit 112 performs the locally decoded base layer enlarged image and the locally decoded enhancement layer reconstructed image. To make predictions without limiting to independent tiles. Then, the enhancement layer encoding unit 112 encodes a prediction error or the like generated by prediction.

さらに、拡張レイヤ符号化部１１２は、基本レイヤ符号化部１０７と同様に、予測のために用いられた予測モード、予測によって生成された予測誤差、当該予測誤差を符号化して生成した拡張レイヤ符号データ等を後段に出力する。 Further, similar to the base layer encoding unit 107, the enhancement layer encoding unit 112, the prediction mode used for prediction, the prediction error generated by the prediction, and the enhancement layer code generated by encoding the prediction error Output data etc. to the subsequent stage.

１１３は拡張レイヤ符号化部１１２によって符号化の途中で生成された係数（予測モード及び予測誤差）等を用いて局所復号を行い、拡張レイヤの再構成画像を生成する拡張レイヤ再構成部である。さらに、拡張レイヤ再構成部１１３は、拡張レイヤ符号化部１１２における符号化処理で用いるために、生成した再構成画像を保持する。 Reference numeral 113 denotes an enhancement layer reconstruction unit that performs local decoding using coefficients (prediction mode and prediction error) generated in the middle of coding by the enhancement layer coding unit 112 and generates a reconstructed image of the enhancement layer. . Furthermore, the enhancement layer reconstruction unit 113 holds the generated reconstructed image for use in the encoding process in the enhancement layer encoding unit 112.

１１０は基本レイヤ符号化部１０７で生成された基本レイヤ符号データ、拡張レイヤ符号化部１１２で生成された拡張レイヤ符号データ、ヘッダ符号化部１１４で生成されたヘッダ符号データを統合し、ビットストリームを生成する統合部である。また、１１１は、統合部１１０によって生成されたビットストリームを外部に出力する端子である。 110 integrates the base layer code data generated by the base layer encoding unit 107, the enhancement layer code data generated by the enhancement layer encoding unit 112, and the header code data generated by the header encoding unit 114, and a bit stream Is an integration unit that generates Reference numeral 111 denotes a terminal that outputs the bitstream generated by the integration unit 110 to the outside.

全体制御部１１５は、画像符号化装置内の各処理部の制御、及び各処理部間のパラメータ伝達を行う。尚、図１において、全体制御部１１５と画像符号化装置内の各処理部との間の結線を省略している。そして、全体制御部１１５は画像符号化装置内の各処理部の制御、及び各処理部間のパラメータの読み書きを、パラメータ信号線またはレジスタバスのいずれかを通じて行うことが可能である。また、本実施形態において、図１の全体制御部１１５は、画像符号化装置内に設置されているが、本発明はこれに限定されない。即ち、全体制御部１１５は、当該画像符号化装置外に設置され、当該画像符号化装置内の各処理部の制御、及び各処理部間のパラメータの読み書きを、パラメータ信号線またはレジスタバスのいずれかを通じて行ってもよい。 The overall control unit 115 controls each processing unit in the image encoding device and transmits parameters between the processing units. In FIG. 1, the connection between the overall control unit 115 and each processing unit in the image coding apparatus is omitted. The overall control unit 115 can control each processing unit in the image coding apparatus and read / write parameters between the processing units through either a parameter signal line or a register bus. In the present embodiment, the overall control unit 115 in FIG. 1 is installed in the image encoding apparatus, but the present invention is not limited to this. That is, the overall control unit 115 is installed outside the image encoding device, and controls each processing unit in the image encoding device and reads / writes parameters between the processing units using either a parameter signal line or a register bus. You may go through.

上述した画像符号化装置１００における、画像の符号化動作を図３に示したフローチャートを用いて以下に説明する。 An image encoding operation in the image encoding apparatus 100 described above will be described below with reference to the flowchart shown in FIG.

ステップＳ３０１にて、画像符号化装置１００は、ユーザによって指示された階層符号化の階層数を取得する。本実施形態では、拡張レイヤを１階層とし、全体で２階層（基本レイヤと１つの拡張レイヤ）の階層符号化を行うものとする。 In step S301, the image coding apparatus 100 acquires the number of layers of layer coding instructed by the user. In the present embodiment, it is assumed that the enhancement layer is one layer, and hierarchical coding of two layers (base layer and one enhancement layer) is performed as a whole.

ステップＳ３０２にて、タイル設定部１０２は符号化対象のフレーム内のタイル分割の数及び分割の位置を決定し、さらに当該符号化対象のフレーム内のいずれかのタイルを独立タイルとするか否かを決定する。また、本実施形態ではタイル５及びタイル６を独立タイルとし、タイル５とタイル６とを合わせて１つの独立タイルセットを構成する。従って本実施形態では、独立タイル判定部１０６は独立タイルフラグを１とする。ちなみに、符号化対象のフレーム内に独立タイルが含まれていない場合には、独立タイル判定部１０６は独立タイルフラグを０とする。さらに、独立タイル判定部１０６は、決定した独立タイルフラグを拡張レイヤ分割部１０４、基本レイヤ分割部１０５、独立タイル判定部１０６、及びヘッダ符号化部１１４に入力する。 In step S302, the tile setting unit 102 determines the number of tile divisions and the division positions in the encoding target frame, and determines whether or not any tile in the encoding target frame is an independent tile. To decide. In this embodiment, the tile 5 and the tile 6 are independent tiles, and the tile 5 and the tile 6 are combined to form one independent tile set. Therefore, in the present embodiment, the independent tile determination unit 106 sets the independent tile flag to 1. Incidentally, when an independent tile is not included in the encoding target frame, the independent tile determination unit 106 sets the independent tile flag to 0. Further, the independent tile determination unit 106 inputs the determined independent tile flag to the enhancement layer division unit 104, the base layer division unit 105, the independent tile determination unit 106, and the header encoding unit 114.

ステップＳ３０３にて、ヘッダ符号化部１１４は独立タイル判定部１０６から入力される独立タイルフラグを判定する。ヘッダ符号化部１１４が、独立タイルフラグが１であると判定した場合はステップＳ３０４の処理へ進み、独立タイルフラグが０であると判定した場合はステップＳ３０５の処理へ進む。 In step S303, the header encoding unit 114 determines the independent tile flag input from the independent tile determination unit 106. If the header encoding unit 114 determines that the independent tile flag is 1, the process proceeds to step S304. If the header encoding unit 114 determines that the independent tile flag is 0, the process proceeds to step S305.

ステップＳ３０４にて、ヘッダ符号化部１１４は、各タイルの位置の一致情報を表すｖｕｉ＿ｐａｒａｍｅｔｅｒｓのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を１に設定する。尚、当該ｖｕｉ＿ｐａｒａｍｅｔｅｒｓのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号は、各レイヤ間でタイルの相対的な位置が一致しているか否かを表す、一致情報を符号化したものである。 In step S304, the header encoding unit 114 sets the tile_boundaries_aligned_flag code of vui_parameters representing the matching information of the positions of the tiles to 1. Note that the tile_boundaries_aligned_flag code of the vui_parameters is obtained by encoding coincidence information indicating whether or not the relative positions of the tiles coincide between the layers.

ステップＳ３０５にて、ヘッダ符号化部１１４は、シーケンスヘッダの１つであるｖｉｄｅｏ＿ｐａｒａｍｅｔｅｒ＿ｓｅｔを符号化する。当該ｖｉｄｅｏ＿ｐａｒａｍｅｔｅｒ＿ｓｅｔ符号には、階層符号化の階層数を表すｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号が含まれる。尚、本実施形態において、ｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１は１となる。続いて、ヘッダ符号化部１１４は、Ｓｅｑｕｅｎｃｅｐａｒａｍｅｔｅｒｓｅｔ（非特許文献１に７．３．２．２に記載）を符号化する。Ｓｅｑｕｅｎｃｅｐａｒａｍｅｔｅｒｓｅｔ符号にはｖｕｉ＿ｐａｒａｍｅｔｅｒｓも含まれる。ｖｕｉ＿ｐａｒａｍｅｔｅｒｓにはステップＳ３０４で設定されたｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号が含まれる。統合部１１０は、これらの符号データ（ｖｉｄｅｏ＿ｐａｒａｍｅｔｅｒ＿ｓｅｔ符号及びＳｅｑｕｅｎｃｅｐａｒａｍｅｔｅｒｓｅｔ符号）を入力し、ビットストリームを生成する。さらに、統合部１１０は、生成した当該ビットストリームを、端子１１１を介して画像符号化装置１００の外へ出力する。 In step S305, the header encoding unit 114 encodes video_parameter_set, which is one of the sequence headers. The video_parameter_set code includes a vps_max_layers_minus1 code that represents the number of layers of hierarchical coding. In this embodiment, vps_max_layers_minus1 is 1. Subsequently, the header encoding unit 114 encodes a Sequence parameter set (described in 7.3.2.2 in Non-Patent Document 1). The sequence parameter set code also includes vui_parameters. The vui_parameters include the tile_boundaries_aligned_flag code set in step S304. The integration unit 110 inputs these code data (video_parameter_set code and Sequence parameter set code), and generates a bitstream. Further, the integration unit 110 outputs the generated bit stream to the outside of the image encoding device 100 via the terminal 111.

ステップＳ３０６にて、ヘッダ符号化部１１４はピクチャヘッダであるＰｉｃｔｕｒｅｐａｒａｍｅｔｅｒｓｅｔ（非特許文献１に７．４．３．３に記載）を符号化する。統合部１１０は、ピクチャヘッダの符号データ（Ｐｉｃｔｕｒｅｐａｒａｍｅｔｅｒｓｅｔ符号）を入力し、ビットストリームを生成する。さらに、統合部１１０は、生成した当該ビットストリームを、端子１１１を介して画像符号化装置１００の外へ出力する。 In step S306, the header encoding unit 114 encodes a picture parameter set (described in 7.4.3.3 in Non-Patent Document 1) that is a picture header. The integration unit 110 receives the code data (Picture parameter set code) of the picture header, and generates a bit stream. Further, the integration unit 110 outputs the generated bit stream to the outside of the image encoding device 100 via the terminal 111.

ステップＳ３０７にて、ヘッダ符号化部１１４は独立タイル判定部１０６から入力される独立タイルフラグを判定する。ヘッダ符号化部１１４が、独立タイルフラグが１であると判定した場合はステップＳ３０８の処理へ進み、独立タイルフラグが０であると判定した場合はステップＳ３０９の処理へ進む。 In step S307, the header encoding unit 114 determines the independent tile flag input from the independent tile determination unit 106. If the header encoding unit 114 determines that the independent tile flag is 1, the process proceeds to step S308. If the header encoding unit 114 determines that the independent tile flag is 0, the process proceeds to step S309.

ステップＳ３０８にて、符号化対象のシーケンスは独立タイルを含んでいるので、ヘッダ符号化部１１４はＭＣＴＳＳＥＩを符号化する。ＭＣＴＳＳＥＩ符号については非特許文献２の第２章に記載されている通りである。本実施形態において、１フレームに含まれる独立タイルセットは１つであるため、ｎｕｍ＿ｓｅｔｓ＿ｉｎ＿ｍｅｓｓａｇｅ＿ｍｉｎｕｓ１符号は０となる。また、ｍｃｔｓ＿ｉｄ符号は０とする。さらに、ｎｕｍ＿ｔｉｌｅ＿ｒｅｃｔｓ＿ｉｎ＿ｓｅｔ＿ｍｉｎｕｓ１符号は１となる。尚、ｎｕｍ＿ｔｉｌｅ＿ｒｅｃｔｓ＿ｉｎ＿ｓｅｔ＿ｍｉｎｕｓ１符号はＭＣＴＳに属する独立タイルの数を表す。本実施形態では独立タイルセットの中にタイル５とタイル６の２つのタイルが独立タイルとして含まれるので、ｎｕｍ＿ｔｉｌｅ＿ｒｅｃｔｓ＿ｉｎ＿ｓｅｔ＿ｍｉｎｕｓ１符号の値は１となる。また、ｔｏｐ＿ｌｅｆｔ＿ｔｉｌｅ＿ｉｎｄｅｘ符号及びｂｏｔｔｏｍ＿ｒｉｇｈｔ＿ｔｉｌｅ＿ｉｎｄｅｘ符号は独立タイルの位置を表すもので、本実施形態では前者の値は５であり、後者の値は６となる。ヘッダ符号化部１１４は、上記のように各ヘッダ情報を符号化して、ＭＣＴＳＳＥＩの符号を生成する。さらに、統合部１１０は、ヘッダ符号化部１１４で生成されたＭＣＴＳＳＥＩ符号を入力してビットストリームを生成し、当該ビットストリームを、端子１１１を介して画像符号化装置１００の外へ出力する。 In step S308, since the encoding target sequence includes independent tiles, the header encoding unit 114 encodes MCTS SEI. The MCTS SEI code is as described in Chapter 2 of Non-Patent Document 2. In this embodiment, since one independent tile set is included in one frame, the num_sets_in_message_minus1 code is 0. The mcts_id code is 0. Further, the num_tile_rects_in_set_minus1 code is 1. The num_tile_rects_in_set_minus1 code represents the number of independent tiles belonging to the MCTS. In the present embodiment, two tiles of the tile 5 and the tile 6 are included as independent tiles in the independent tile set, so that the value of the num_tile_rects_in_set_minus1 code is 1. Further, the top_left_tile_index code and the bottom_right_tile_index code represent the positions of independent tiles. In the present embodiment, the former value is 5 and the latter value is 6. The header encoding unit 114 encodes each piece of header information as described above to generate a MCTS SEI code. Further, the integration unit 110 inputs the MCTS SEI code generated by the header encoding unit 114 to generate a bit stream, and outputs the bit stream to the outside of the image encoding device 100 via the terminal 111.

ステップＳ３０９にて、縮小部１０３は入力画像を縮小し、基本レイヤ画像を生成する。尚、本実施形態では拡張レイヤが１階層であるため、縮小部１０３によって基本レイヤを生成するが、本発明はこれに限定されない。拡張レイヤが２階層以上（全体で３階層以上）の階層符号化の場合、縮小部１０３を複数設けてもよいし、１つの縮小部１０３で必要な階層数の画像を生成してもよい。 In step S309, the reduction unit 103 reduces the input image and generates a base layer image. In this embodiment, since the enhancement layer is one layer, the base layer is generated by the reduction unit 103, but the present invention is not limited to this. In the case of hierarchical encoding with two or more enhancement layers (three or more layers in total), a plurality of reduction units 103 may be provided, or one reduction unit 103 may generate images of the required number of layers.

ステップＳ３１０にて、基本レイヤ分割部１０５は画像の左上からタイル番号順で、符号化する基本レイヤのタイルの画像を抽出する。基本レイヤ分割部１０５は抽出した基本レイヤのタイルの画像を基本レイヤ符号化部１０７へ出力する。 In step S310, the base layer dividing unit 105 extracts base layer tile images to be encoded in the order of tile numbers from the upper left of the image. The base layer dividing unit 105 outputs the extracted base layer tile image to the base layer encoding unit 107.

ステップＳ３１１にて、独立タイル判定部１０６は、基本レイヤ分割部１０５から符号化対象タイルのタイル番号を入力する。さらに、独立タイル判定部１０６は、タイル設定部１０２から当該符号化対象タイルの独立タイル位置情報を入力する。尚、本実施形態において独立タイル位置情報は５と６である。独立タイル判定部１０６は入力された符号化対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。符号化対象タイルのタイル番号と独立タイル位置情報のタイル番号とが一致する場合、独立タイル判定部１０６は、符号化対象タイルが独立タイルであると判定し、独立タイル符号化フラグを１とし、ステップＳ３１２へ進む。一方、符号化対象タイルのタイル番号と独立タイル位置情報のタイル番号とが一致しない場合、独立タイル判定部１０６は、符号化対象タイルが独立タイルではないと判定し、独立タイル符号化フラグを０とし、ステップＳ３１３へ進む。 In step S 311, the independent tile determination unit 106 inputs the tile number of the encoding target tile from the base layer division unit 105. Further, the independent tile determination unit 106 inputs the independent tile position information of the encoding target tile from the tile setting unit 102. In this embodiment, the independent tile position information is 5 and 6. The independent tile determination unit 106 compares the input tile number of the encoding target tile with the tile number of the independent tile position information. When the tile number of the encoding target tile matches the tile number of the independent tile position information, the independent tile determination unit 106 determines that the encoding target tile is an independent tile, sets the independent tile encoding flag to 1, Proceed to step S312. On the other hand, if the tile number of the encoding target tile does not match the tile number of the independent tile position information, the independent tile determination unit 106 determines that the encoding target tile is not an independent tile, and sets the independent tile encoding flag to 0. And go to Step S313.

ステップＳ３１２にて、符号化対象タイルは基本レイヤの符号化対象のフレームにおける独立タイルである。このため、基本レイヤ符号化部１０７は、局所復号済みの基本レイヤの他のフレームにおける、当該符号化対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる再構成画像を参照してフレーム間予測及び符号化を行う。また、基本レイヤ符号化部１０７は、符号化対象のフレームの符号化対象タイル内の局所復号済みの再構成画像を参照してイントラ予測及び符号化を行う。図２において、フレーム２０２のタイル５を符号化する場合について説明する。基本レイヤ符号化部１０７は、基本レイヤ再構成部１０８に格納されているフレーム２０１のタイル５とタイル６、及びフレーム２０２のタイル５の局所復号済みの再構成画像を参照して予測及び符号化を行う。さらに、基本レイヤ符号化部１０７は、符号化して得られた基本レイヤの符号化対象タイルの符号データを基本レイヤ符号データとして統合部１１０に出力する。統合部１１０は基本レイヤ符号化部１０７から出力される基本レイヤ符号データと、ヘッダ符号化部１１４及び拡張レイヤ符号化部１１２から出力されるその他の符号データとを統合し、ビットストリームを生成する。そして、統合部１１０は、生成したビットストリームを、端子１１１を介して出力する。また、基本レイヤ再構成部１０８は、基本レイヤ符号化部１０７で符号化の途中で生成された係数（予測モード及び予測残差）等を用いて、基本レイヤの再構成画像を順次生成し、保持する。 In step S312, the encoding target tile is an independent tile in the base layer encoding target frame. Therefore, the base layer encoding unit 107 refers to the reconstructed image included in the independent tile set at a position relatively equal to the position of the encoding target tile in another frame of the base layer that has been locally decoded. Perform inter-frame prediction and encoding. In addition, the base layer encoding unit 107 performs intra prediction and encoding with reference to the reconstructed image that has been locally decoded in the encoding target tile of the encoding target frame. A case where the tile 5 of the frame 202 is encoded in FIG. 2 will be described. The base layer encoding unit 107 performs prediction and encoding with reference to the locally decoded reconstructed images of the tile 5 and the tile 6 of the frame 201 and the tile 5 of the frame 202 stored in the base layer reconstruction unit 108. I do. Furthermore, the base layer encoding unit 107 outputs the code data of the encoding target tile of the base layer obtained by encoding to the integrating unit 110 as base layer code data. The integration unit 110 integrates the base layer code data output from the base layer encoding unit 107 and the other code data output from the header encoding unit 114 and the enhancement layer encoding unit 112, and generates a bitstream. . Then, the integration unit 110 outputs the generated bit stream via the terminal 111. In addition, the base layer reconstruction unit 108 sequentially generates a reconstructed image of the base layer using the coefficients (prediction mode and prediction residual) generated in the middle of the encoding by the base layer encoding unit 107, Hold.

ステップＳ３１３にて、符号化対象タイルは基本レイヤの符号化対象のフレームにおける独立タイルではない。このため、基本レイヤ符号化部１０７は、局所復号済みの基本レイヤの他のフレームの画像全体を参照して、符号化対象タイルをフレーム間予測及び符号化する。図２において、フレーム２０２のタイル５を符号化する場合に、基本レイヤ符号化部１０７は、基本レイヤ再構成部１０８に格納されているフレーム２０１の全てのタイル及びフレーム２０２のタイル５の局所復号済みの再構成画像を参照して予測及び符号化する。さらに、基本レイヤ符号化部１０７は、生成した基本レイヤ符号データを統合部１１０に出力する。統合部１１０は、ステップＳ３１２における説明と同様に、基本レイヤ符号データとその他の符号データとを統合してビットストリームを生成し、当該ビットストリームを、端子１１１を介して出力する。さらに、基本レイヤ再構成部１０８は、基本レイヤ符号化部１０７で符号化の途中で生成された係数等を用いて、基本レイヤの再構成画像を順次生成し、保持する。 In step S313, the encoding target tile is not an independent tile in the base layer encoding target frame. For this reason, the base layer encoding unit 107 performs inter-frame prediction and encoding of the encoding target tile with reference to the entire image of another frame of the base layer that has been locally decoded. In FIG. 2, when encoding tile 5 of frame 202, base layer encoding section 107 performs local decoding of all tiles of frame 201 and tile 5 of frame 202 stored in base layer reconstruction section 108. Prediction and encoding are performed with reference to the already reconstructed image. Furthermore, the base layer encoding unit 107 outputs the generated base layer code data to the integration unit 110. Similarly to the description in step S 312, the integrating unit 110 integrates the base layer code data and other code data to generate a bit stream, and outputs the bit stream via the terminal 111. Furthermore, the base layer reconstructing unit 108 sequentially generates and retains base layer reconstructed images using the coefficients generated in the middle of the encoding by the base layer encoding unit 107.

ステップＳ３１４にて、全体制御部１１５は、基本レイヤの全てのタイルを符号化し終わったか否かを判定する。基本レイヤの全てのタイルの符号化処理が終わっていないと判定された場合（ステップＳ３１４のＮＯ）、ステップＳ３１０に戻り、基本レイヤ分割部１０５は次のタイル番号のタイルを抽出及び出力し、処理を続行する。一方、基本レイヤの全てのタイルの画像の符号化処理が終了していると判定された場合（ステップＳ３１４のＹＥＳ）、ステップＳ３１５に進む。 In step S314, overall control unit 115 determines whether all tiles of the base layer have been encoded. When it is determined that the encoding process for all the tiles of the base layer is not completed (NO in step S314), the process returns to step S310, and the base layer dividing unit 105 extracts and outputs the tile having the next tile number, and the process To continue. On the other hand, when it is determined that the encoding process for all the tile layer images has been completed (YES in step S314), the process proceeds to step S315.

ステップＳ３１５にて、拡張レイヤ分割部１０４は画像の左上からタイル番号順で、符号化する拡張レイヤのタイルの画像を抽出する。拡張レイヤ分割部１０４は抽出した拡張レイヤのタイルの画像を拡張レイヤ符号化部１１２へ出力する。 In step S315, the enhancement layer dividing unit 104 extracts tile layer images to be encoded in the order of tile numbers from the upper left of the image. The enhancement layer dividing unit 104 outputs the extracted enhancement layer tile image to the enhancement layer encoding unit 112.

ステップＳ３１６にて、独立タイル判定部１０６はステップＳ３１１における処理と同様に、入力された符号化対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。符号化対象タイルのタイル番号と独立タイル位置情報のタイル番号とが一致する場合、独立タイル判定部１０６は、符号化対象タイルが独立タイルであると判定し、独立タイル符号化フラグを１とし、ステップＳ３１７へ進む。一方、符号化対象タイルのタイル番号と独立タイル位置情報のタイル番号とが一致しない場合、独立タイル判定部１０６は、符号化対象タイルが独立タイルではないと判定し、独立タイル符号化フラグを０とし、ステップＳ３１９へ進む。 In step S316, the independent tile determination unit 106 compares the input tile number of the encoding target tile with the tile number of the independent tile position information, similarly to the processing in step S311. When the tile number of the encoding target tile matches the tile number of the independent tile position information, the independent tile determination unit 106 determines that the encoding target tile is an independent tile, sets the independent tile encoding flag to 1, The process proceeds to step S317. On the other hand, if the tile number of the encoding target tile does not match the tile number of the independent tile position information, the independent tile determination unit 106 determines that the encoding target tile is not an independent tile, and sets the independent tile encoding flag to 0. And go to step S319.

ステップＳ３１７にて、符号化対象タイルは拡張レイヤの符号化対象のフレームにおける独立タイルである。このため、拡大部１０９は、基本レイヤ再構成部１０８に格納されている、局所復号済みの基本レイヤの再構成画像から、符号化対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる再構成画像を入力する。拡大部１０９は、入力された再構成画像のみを用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ符号化部１１２に出力する。 In step S317, the encoding target tile is an independent tile in the encoding target frame of the enhancement layer. For this reason, the expansion unit 109 is included in the independent tile set at a position relatively equal to the position of the encoding target tile from the reconstructed image of the base layer that has been locally decoded and stored in the base layer reconstruction unit 108. Enter the reconstructed image. The enlargement unit 109 uses only the input reconstructed image to generate an enlarged image by filtering or the like, and outputs the enlarged image to the enhancement layer encoding unit 112.

ステップＳ３１８にて、拡張レイヤ符号化部１１２は、拡張レイヤ分割部１０４から入力された符号化対象タイルの画像を、基本レイヤの局所復号済みの再構成画像を参照対象として予測及び符号化する。即ち、拡張レイヤ符号化部１１２は、ステップＳ３１７で生成された拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ符号化部１１２は、拡張レイヤ再構成部１１３に格納されている局所復号済みの拡張レイヤのうち、符号化対象タイルの位置と相対的に等しい位置の独立タイルセットの再構成画像を参照対象として、符号化対象タイルのフレーム間予測を行う。さらに、拡張レイヤ符号化部１１２は、符号化対象タイル内の局所復号済みの再構成画像を参照対象としてイントラ予測を行う。拡張レイヤ符号化部１１２は、これらの予測によって得られた予測に関する情報（フレーム間予測によって得られた動きベクトル等）及び予測誤差を符号化する。さらに、拡張レイヤ再構成部１１３は、拡張レイヤ符号化部１１２による符号化の途中で生成された係数（予測モード及び予測残差）等を用いて、拡張レイヤの再構成画像を順次生成し、保持する。 In step S318, the enhancement layer encoding unit 112 predicts and encodes the image of the encoding target tile input from the enhancement layer dividing unit 104 using the reconstructed image of the base layer that has been locally decoded as a reference target. That is, the enhancement layer encoding unit 112 performs inter-layer prediction with reference to the enlarged image generated in step S317. Also, the enhancement layer encoding unit 112 is a reconstructed image of an independent tile set at a position relatively equal to the position of the encoding target tile among the locally decoded enhancement layers stored in the enhancement layer reconstruction unit 113. As a reference target, inter-frame prediction of the encoding target tile is performed. Furthermore, the enhancement layer encoding unit 112 performs intra prediction using a locally decoded reconstructed image in the encoding target tile as a reference target. The enhancement layer encoding unit 112 encodes information related to prediction (such as a motion vector obtained by inter-frame prediction) and a prediction error obtained by these predictions. Furthermore, the enhancement layer reconfiguration unit 113 sequentially generates reconstructed images of the enhancement layer using the coefficients (prediction mode and prediction residual) generated during the encoding by the enhancement layer encoding unit 112, Hold.

ステップＳ３１９にて、符号化対象タイルは拡張レイヤの符号化対象のフレームにおける独立タイルではない。このため、拡大部１０９は、基本レイヤ再構成部１０８に格納されている基本レイヤの再構成画像の全体を用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ符号化部１１２に出力する。 In step S319, the encoding target tile is not an independent tile in the enhancement target layer encoding target frame. For this reason, the enlarging unit 109 generates an enlarged image by performing filtering or the like using the entire reconstructed image of the base layer stored in the base layer reconstructing unit 108, and the enlarged image is encoded with an enhancement layer code. To the conversion unit 112.

ステップＳ３２０にて、拡張レイヤ符号化部１１２は、拡張レイヤ分割部１０４から入力された符号化対象タイルの画像を、基本レイヤの局所復号済みの再構成画像を参照して符号化する。即ち、拡張レイヤ符号化部１１２は、ステップＳ３１９で生成された拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ符号化部１１２は、拡張レイヤ再構成部１１３に格納されている局所復号済みの拡張レイヤの再構成画像を参照して、符号化対象タイルのフレーム間予測を行う。さらに、拡張レイヤ符号化部１１２は、符号化対象タイル内の局所復号済みの再構成画像を参照して、符号化対象タイルのイントラ予測を行う。拡張レイヤ符号化部１１２は、これらの予測によって得られた予測に関する情報及び予測誤差を符号化する。さらに、拡張レイヤ再構成部１１３は、拡張レイヤ符号化部１１２で符号化の途中で生成された係数等を用いて、拡張レイヤ再構成画像を順次生成し、保持する。 In step S320, enhancement layer encoding section 112 encodes the encoding target tile image input from enhancement layer dividing section 104 with reference to the base layer locally decoded reconstructed image. That is, the enhancement layer encoding unit 112 performs inter-layer prediction with reference to the enlarged image generated in step S319. Also, the enhancement layer encoding unit 112 refers to the reconstructed image of the enhancement layer that has been locally decoded and stored in the enhancement layer reconstruction unit 113, and performs interframe prediction of the encoding target tile. Furthermore, the enhancement layer encoding unit 112 performs intra prediction of the encoding target tile with reference to the locally decoded reconstructed image in the encoding target tile. The enhancement layer encoding unit 112 encodes information and prediction error related to prediction obtained by these predictions. Furthermore, the enhancement layer reconstruction unit 113 sequentially generates and holds enhancement layer reconstructed images using the coefficients and the like generated during the encoding by the enhancement layer encoding unit 112.

ステップＳ３２１にて、全体制御部１１５は、拡張レイヤの全てのタイルを符号化し終わったか否かを判定する。拡張レイヤの全てのタイルの符号化処理が終わっていないと判定した場合（ステップＳ３２１のＮＯ）、ステップＳ３１５に戻り、拡張レイヤ分割部１０４は次のタイル番号のタイルを抽出及び出力し、処理を続行する。一方、拡張レイヤの全てのタイルの画像の符号化処理が終了していると判定した場合（ステップＳ３２１のＹＥＳ）、ステップＳ３２２に進む。 In step S321, overall control unit 115 determines whether or not all tiles of the enhancement layer have been encoded. If it is determined that all the tiles of the enhancement layer have not been encoded (NO in step S321), the process returns to step S315, and the enhancement layer dividing unit 104 extracts and outputs the tile having the next tile number, and performs the process. continue. On the other hand, if it is determined that the image encoding process for all tiles in the enhancement layer has been completed (YES in step S321), the process proceeds to step S322.

ステップＳ３２２にて、全体制御部１１５は、端子１０１から入力されるシーケンスに含まれる全てのフレームの画像の符号化処理が終了したか否かを判定する。符号化処理を行っていないフレームが存在する場合は（ステップＳ３２２のＮＯ）、ステップＳ３０９に進み、次のフレームの処理を行う。符号化処理を行っていないフレームが存在しない場合は（ステップＳ３２２のＹＥＳ）、符号化処理を終了する。 In step S322, overall control unit 115 determines whether or not the encoding processing for all the frames included in the sequence input from terminal 101 has been completed. If there is a frame that has not been encoded (NO in step S322), the process proceeds to step S309 to process the next frame. If there is no frame that has not been encoded (YES in step S322), the encoding process is terminated.

以上の構成と動作により、独立タイル及び独立タイルセットを使用する場合において、拡張レイヤと基本レイヤの各タイルの相対的な位置を一致させることができる。即ち、基本レイヤで設定された独立タイルセットに含まれるタイルが、各拡張レイヤにおいて当該独立タイルセットと相対的に等しい位置の独立タイルセットに含まれるように設定する。これにより、階層符号化のいずれの階層においても、独立タイルの予測及び復号のために参照する画素を制限することができ、予測処理を高速化することができる。特に、注目領域等を独立タイルに設定することで、独立タイルは基本レイヤから拡張レイヤまで他のタイルを参照せずに、独立に符号化できるため、必要な部分を従来よりも高速に処理することが可能になる。 With the above configuration and operation, when using independent tiles and independent tile sets, the relative positions of the tiles of the enhancement layer and the base layer can be matched. That is, the tiles included in the independent tile set set in the base layer are set so as to be included in the independent tile set at a position relatively equal to the independent tile set in each expansion layer. Thereby, in any hierarchy of hierarchical encoding, the pixel referred for prediction and decoding of an independent tile can be restrict | limited, and a prediction process can be sped up. In particular, by setting the attention area etc. as independent tiles, independent tiles can be encoded independently without referring to other tiles from the base layer to the extended layer, so the necessary parts are processed faster than before. It becomes possible.

尚、本実施形態において、図２のように、符号化対象のフレームより時間的に前のフレームのみを参照フレームとして予測及び符号化する例を示したが、これに限定されない。即ち、複数フレームを参照して予測及び符号化する場合においても同様に参照されることは上記の説明から明白である。 In this embodiment, as shown in FIG. 2, an example is shown in which only a frame temporally prior to the encoding target frame is predicted and encoded as a reference frame, but the present invention is not limited to this. In other words, it is clear from the above description that the same reference is made in the case of prediction and encoding with reference to a plurality of frames.

また、本実施形態において、縮小部１０３及び拡大部１０９を用いた画像符号化装置１００について説明したが、本発明はこれに限定されない。即ち、縮小部１０３及び拡大部１０９を省略してもよい。または、縮小率及び拡大率を１として基本レイヤ符号化部１０７で設定される量子化パラメータよりも拡張レイヤ符号化部１１２で設定される量子化パラメータを小さくするようにしてもよい。これによって、ＳＮＲ階層符号化を行うことが可能になる。 In the present embodiment, the image coding apparatus 100 using the reduction unit 103 and the enlargement unit 109 has been described. However, the present invention is not limited to this. That is, the reduction unit 103 and the enlargement unit 109 may be omitted. Alternatively, the reduction parameter and the enlargement factor may be set to 1, and the quantization parameter set by the enhancement layer encoding unit 112 may be made smaller than the quantization parameter set by the base layer encoding unit 107. This makes it possible to perform SNR hierarchical coding.

また、本実施形態において、拡張レイヤの独立タイルセットのタイルを予測する場合に参照する拡大画像を、当該独立タイルセットと相対的に等しい位置の基本レイヤのタイルの画像だけで生成を行ったが、本発明はこれに限定されない。即ち、ステップＳ３１９のように基本レイヤの独立タイルの周辺の画素も参照対象としても構わない。 In the present embodiment, the enlarged image referred to when predicting the tiles of the independent tile set of the enhancement layer is generated using only the tile image of the basic layer at a position relatively equal to the independent tile set. However, the present invention is not limited to this. That is, as in step S319, the pixels around the independent tile of the base layer may be used as a reference target.

また、本実施形態において、基本レイヤと１階層の拡張レイヤの階層符号化（全体で２階層の階層符号化）を行うものとして説明したが、本発明はこれに限定されず、全体で３階層以上の階層符号化であっても構わない。この場合、縮小部１０３、拡張レイヤ分割部１０４、拡張レイヤ符号化部１１２、拡張レイヤ再構成部１１３、及び拡大部１０９を１つのセットとして、当該セットを拡張レイヤの階層数分だけ設けることにより、より多くの階層に対応することができる。また、図４に示すように、拡張レイヤ符号化部１１２、拡張レイヤ再構成部４１３、拡大部４０９、及び縮小部４０３を１つずつ有し、各拡張レイヤの符号化において、兼用で使用しても構わない。 Further, in the present embodiment, it has been described that the hierarchical encoding of the basic layer and the enhancement layer of one hierarchy (two hierarchical encoding as a whole) is performed, but the present invention is not limited to this, and three hierarchical layers as a whole. The above hierarchical encoding may be used. In this case, the reduction unit 103, the enhancement layer division unit 104, the enhancement layer encoding unit 112, the enhancement layer reconstruction unit 113, and the enlargement unit 109 are set as one set, and the set is provided for the number of enhancement layer hierarchies. , Can accommodate more hierarchies. Also, as shown in FIG. 4, the enhancement layer encoding unit 112, the enhancement layer reconstruction unit 413, the enlargement unit 409, and the reduction unit 403 are provided one by one, and are used for encoding each enhancement layer. It doesn't matter.

図４は、複数の階層の拡張レイヤを符号化可能な画像符号化装置であって、拡張レイヤ符号化部１１２、拡張レイヤ再構成部４１３、拡大部４０９、及び縮小部４０３を１つずつ有する画像符号化装置のブロック図である。図４において、図１の画像符号化装置１００の各処理部と同じ機能を果たすものについては同じ番号を付し、説明を省略する。４０１は階層符号化の階層数を設定する階層数設定部である。４０３は縮小部である。図１の縮小部１０３が端子１０１から入力した入力画像を縮小して１つの縮小画像を生成するのに対し、縮小部４０３は、階層数設定部４０１から入力した階層数に基づいて、入力画像を縮小して複数の階層の縮小画像を生成する。４０２はフレームメモリであり、縮小部４０３で生成された各階層の縮小画像を格納する。４０９は拡大部である。図１の拡大部１０９が基本レイヤの再構成画像を拡張レイヤのサイズに拡大して１つの拡大画像を生成するのに対し、拡大部１０９は階層数設定部４０１から入力した階層数に基づいて、当該再構成画像を拡大して複数の異なる解像度の階層の拡大画像を生成する。４１３は拡張レイヤ再構成部である。拡張レイヤ再構成部４１３は、階層数設定部４０１から階層数を入力し、拡張レイヤ符号化部１１２で生成された係数等を用いて拡張レイヤの再構成画像を生成し、当該再構成画像を拡大部４０９及び拡張レイヤ符号化部１１２へ出力する。４１０は統合部であり、階層数設定部４０１から階層数を入力し、当該階層数分の符号データをビットストリームに統合する。 FIG. 4 is an image encoding apparatus capable of encoding a plurality of enhancement layers, and includes an enhancement layer encoding unit 112, an enhancement layer reconstruction unit 413, an enlargement unit 409, and a reduction unit 403 one by one. It is a block diagram of an image coding apparatus. In FIG. 4, the same numbers are assigned to components that perform the same functions as the processing units of the image encoding device 100 in FIG. 1, and descriptions thereof are omitted. Reference numeral 401 denotes a layer number setting unit for setting the number of layers for layer encoding. Reference numeral 403 denotes a reduction unit. While the reduction unit 103 in FIG. 1 reduces the input image input from the terminal 101 to generate one reduced image, the reduction unit 403 uses the input image based on the number of hierarchies input from the hierarchy number setting unit 401. Are reduced to generate reduced images of a plurality of hierarchies. Reference numeral 402 denotes a frame memory, which stores the reduced image of each layer generated by the reduction unit 403. Reference numeral 409 denotes an enlarged portion. The enlargement unit 109 in FIG. 1 enlarges the reconstructed image of the base layer to the size of the extension layer to generate one enlarged image, whereas the enlargement unit 109 is based on the number of hierarchies input from the hierarchy number setting unit 401. The reconstructed image is enlarged to generate enlarged images having a plurality of different resolution layers. Reference numeral 413 denotes an enhancement layer reconstruction unit. The enhancement layer reconstruction unit 413 receives the number of layers from the layer number setting unit 401, generates a reconstruction image of the enhancement layer using the coefficients generated by the enhancement layer encoding unit 112, and the reconstructed image The result is output to expansion section 409 and enhancement layer encoding section 112. Reference numeral 410 denotes an integration unit that inputs the number of layers from the layer number setting unit 401 and integrates code data for the number of layers into a bitstream.

図４に示す画像符号化装置４００を用いて符号化を行う場合の、各処理部の動作を図５に示したフローチャートを用いて以下に説明する。図５は、図３のステップＳ３０９からステップＳ３２０の間を変更した部分のみを示している。図５において、図３のステップと同様の機能を果たすステップに関しては図３と同じ番号を付与し、説明を省略する。また、図３のステップＳ３０１にて、階層数設定部４０１は階層数を３に設定するとして、以下に説明する。尚、本発明において階層数は特に限定されない。また、ステップＳ３０５にて、ｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号を２としてヘッダ符号データの生成が行われるとする。 The operation of each processing unit when encoding is performed using the image encoding device 400 illustrated in FIG. 4 will be described below with reference to the flowchart illustrated in FIG. FIG. 5 shows only the part changed from step S309 to step S320 in FIG. In FIG. 5, steps having the same functions as those in FIG. 3 are given the same numbers as in FIG. Further, the following description will be made assuming that the number of layers setting unit 401 sets the number of layers to 3 in step S301 of FIG. In the present invention, the number of hierarchies is not particularly limited. In step S305, it is assumed that header code data is generated with the vps_max_layers_minus1 code set to 2.

ステップＳ５０１にて、縮小部４０３は１フレームの階層数分の縮小画像を生成する。本実施形態ではステップＳ３０１で階層数が３に設定されるため、縮小部４０３は１つの基本レイヤ画像と２つの拡張レイヤ画像とを生成する。即ち、縮小部４０３は、入力画像を縦横１／２にした拡張第１階層（第１拡張レイヤ）画像と、第１拡張レイヤ画像をさらに縦横１／２にした基本レイヤ画像とを生成する。ここで、縮小部４０３は、入力された解像度の画像を拡張第２階層（第２拡張レイヤ）画像とする。さらに、縮小部４０３は、基本レイヤ画像、第１拡張レイヤ画像、及び第２拡張レイヤ画像をそれぞれフレームメモリ４０２に出力する。 In step S501, the reduction unit 403 generates reduced images for the number of layers of one frame. In the present embodiment, since the number of hierarchies is set to 3 in step S301, the reduction unit 403 generates one base layer image and two enhancement layer images. That is, the reduction unit 403 generates an extended first layer (first enhancement layer) image in which the input image is halved in the vertical and horizontal directions, and a base layer image in which the first enhancement layer image is further halved in the vertical and horizontal directions. Here, the reduction unit 403 sets the input resolution image as an extended second layer (second extended layer) image. Further, the reduction unit 403 outputs the base layer image, the first enhancement layer image, and the second enhancement layer image to the frame memory 402, respectively.

尚、ステップＳ３１２からステップＳ３１４にて、前述の通り、全体制御部１１５は、フレームメモリ４０２から出力された基本レイヤ画像を符号化する。基本レイヤ再構成部１０８は符号化された画像を局所復号して再構成画像を生成する、これを保持しておく。 In steps S312 to S314, the overall control unit 115 encodes the base layer image output from the frame memory 402 as described above. The base layer reconstruction unit 108 locally decodes the encoded image to generate a reconstructed image, and holds this.

ステップＳ５０２にて、階層数設定部４０１は、ステップＳ３１２乃至ステップＳ３１３で符号化された基本レイヤ、又は後述するステップＳ５１８乃至ステップＳ５２０で符号化された階層の拡張レイヤを上位レイヤとする。さらに、階層数設定部４０１は、そのレイヤに続く符号化対象の拡張レイヤを下位レイヤとする。ここでは、まず、ステップＳ３１２乃至ステップＳ３１３で符号化された基本レイヤを上位レイヤとし、第１拡張レイヤを下位レイヤとして設定する。 In step S502, the layer number setting unit 401 sets the base layer encoded in steps S312 to S313 or the enhancement layer of the layer encoded in steps S518 to S520 described later as an upper layer. Further, the hierarchy number setting unit 401 sets the enhancement layer to be encoded following that layer as a lower layer. Here, first, the base layer encoded in steps S312 to S313 is set as an upper layer, and the first enhancement layer is set as a lower layer.

ステップＳ５１５にて、拡張レイヤ分割部１０４は符号化対象の階層の画像の左上からタイル番号順で、符号化する拡張レイヤのタイルの画像を抽出する。拡張レイヤ分割部１０４は抽出した拡張レイヤのタイルの画像を拡張レイヤ符号化部１１２へ出力する。ここでは、第１拡張レイヤ画像における符号化対象タイルの画像を抽出し、拡張レイヤ符号化部１１２に入力する。 In step S515, the enhancement layer dividing unit 104 extracts the tile images of the enhancement layer to be coded in order of tile numbers from the upper left of the image of the layer to be coded. The enhancement layer dividing unit 104 outputs the extracted enhancement layer tile image to the enhancement layer encoding unit 112. Here, the image of the encoding target tile in the first enhancement layer image is extracted and input to the enhancement layer encoding unit 112.

ステップＳ５１７にて、符号化対象タイルは符号化対象のフレームにおける独立タイルである。このため、拡大部４０９は、基本レイヤ再構成部１０８又は拡張レイヤ再構成部４１３に格納されている上位レイヤの再構成画像から、符号化対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる再構成画像を入力する。拡大部４０９は、入力された再構成画像のみを用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ符号化部１１２に入力する。ここでは、拡大部４０９は、基本レイヤ再構成部１０８に格納されている再構成画像から拡大画像を生成し、当該拡大画像を拡張レイヤ符号化部１１２に入力する。 In step S517, the encoding target tile is an independent tile in the encoding target frame. For this reason, the enlargement unit 409 determines the independent tile set at a position relatively equal to the position of the encoding target tile from the upper layer reconstructed image stored in the base layer reconstruction unit 108 or the enhancement layer reconstruction unit 413. The reconstructed image included in is input. The enlargement unit 409 uses only the input reconstructed image to generate an enlarged image by filtering or the like, and inputs the enlarged image to the enhancement layer encoding unit 112. Here, the enlargement unit 409 generates an enlarged image from the reconstructed image stored in the base layer reconstruction unit 108 and inputs the enlarged image to the enhancement layer encoding unit 112.

ステップＳ５１８にて、拡張レイヤ符号化部１１２は、拡張レイヤ分割部１０４から入力された符号化対象タイルの画像を、局所復号済みの再構成画像を参照して予測及び符号化する。即ち、拡張レイヤ符号化部１１２は、ステップＳ５１７で生成された拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ符号化部１１２は、拡張レイヤ再構成部４１３に格納されている局所復号済みの拡張レイヤの他のフレームにおいて符号化対象タイルの位置と相対的に等しい位置の独立タイルセットの再構成画像を参照して、フレーム間予測を行う。さらに、拡張レイヤ符号化部１１２は、符号化対象タイル内の局所復号済みの再構成画像を参照してイントラ予測を行う。拡張レイヤ符号化部１１２は、これらの予測によって得られた予測に関する情報（フレーム間予測によって得られた動きベクトル等）及び予測誤差を符号化する。さらに、拡張レイヤ再構成部４１３は、拡張レイヤ符号化部１１２による符号化の途中で生成された係数（予測モード及び予測残差）等を用いて、拡張レイヤの再構成画像を順次生成し、保持する。 In step S518, enhancement layer encoding section 112 predicts and encodes the encoding target tile image input from enhancement layer dividing section 104 with reference to the locally decoded reconstructed image. That is, the enhancement layer encoding unit 112 performs inter-layer prediction with reference to the enlarged image generated in step S517. Also, the enhancement layer encoding unit 112 reconstructs an independent tile set at a position relatively equal to the position of the encoding target tile in another frame of the enhancement layer that has been locally decoded and stored in the enhancement layer reconstruction unit 413. Inter-frame prediction is performed with reference to the constituent images. Furthermore, the enhancement layer encoding unit 112 performs intra prediction with reference to the locally decoded reconstructed image in the encoding target tile. The enhancement layer encoding unit 112 encodes information related to prediction (such as a motion vector obtained by inter-frame prediction) and a prediction error obtained by these predictions. Furthermore, the enhancement layer reconfiguration unit 413 sequentially generates reconstructed images of the enhancement layer using coefficients (prediction mode and prediction residual) generated in the middle of the encoding by the enhancement layer encoding unit 112, Hold.

ステップＳ５１９にて、符号化対象タイルは符号化対象のフレームにおける独立タイルではない。このため、拡大部４０９は、基本レイヤ再構成部１０８に格納されている基本レイヤの再構成画像の全体又は拡張レイヤ再構成部４１３に格納されている上位の拡張レイヤの再構成画像の全体を用いてフィルタリング等で拡大して拡大画像を生成する。さらに、拡大部４０９は、生成した拡大画像を拡張レイヤ符号化部１１２に出力する。ここでは、拡大部４０９は、基本レイヤ再構成部１０８に格納されている再構成画像から拡大画像を生成する。 In step S519, the encoding target tile is not an independent tile in the encoding target frame. Therefore, the enlarging unit 409 displays the entire base layer reconstructed image stored in the base layer reconstructing unit 108 or the entire upper layer reconstructed image stored in the enhancement layer reconstructing unit 413. The image is enlarged by filtering or the like to generate an enlarged image. Further, the enlargement unit 409 outputs the generated enlarged image to the enhancement layer encoding unit 112. Here, the enlargement unit 409 generates an enlarged image from the reconstructed image stored in the base layer reconstruction unit 108.

ステップＳ５２０にて、拡張レイヤ符号化部１１２は、拡張レイヤ分割部１０４から入力された符号化対象タイルの画像を、局所復号済みの再構成画像を参照して符号化する。即ち、拡張レイヤ符号化部１１２は、ステップＳ５１９で生成された拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ符号化部１１２は、拡張レイヤ再構成部４１３に格納されている局所復号済みの拡張レイヤの再構成画像を参照して、符号化対象タイルのフレーム間予測を行う。さらに、拡張レイヤ符号化部１１２は、符号化対象タイル内の局所復号済みの再構成画像を参照して、符号化対象タイルのイントラ予測を行う。拡張レイヤ符号化部１１２は、これらの予測によって得られた予測に関する情報及び予測誤差を符号化する。さらに、拡張レイヤ再構成部４１３は、拡張レイヤ符号化部１１２で符号化の途中で生成された係数等を用いて、拡張レイヤの再構成画像を順次生成し、保持する。 In step S520, enhancement layer encoding section 112 encodes the encoding target tile image input from enhancement layer dividing section 104 with reference to the locally decoded reconstructed image. That is, the enhancement layer encoding unit 112 performs inter-layer prediction with reference to the enlarged image generated in step S519. Also, the enhancement layer encoding unit 112 refers to the reconstructed image of the enhancement layer that has been locally decoded and stored in the enhancement layer reconstruction unit 413, and performs interframe prediction of the encoding target tile. Furthermore, the enhancement layer encoding unit 112 performs intra prediction of the encoding target tile with reference to the locally decoded reconstructed image in the encoding target tile. The enhancement layer encoding unit 112 encodes information and prediction error related to prediction obtained by these predictions. Furthermore, the enhancement layer reconstruction unit 413 sequentially generates and holds enhancement layer reconstructed images using the coefficients and the like generated during the encoding by the enhancement layer encoding unit 112.

ステップＳ５０３にて、全体制御部１１５は、階層数設定部４０１で設定された全ての階層について符号化が終了したか否かを判定する。全ての階層のタイルの符号化処理が終わっていないと判定した場合（ステップＳ５２１のＮＯ）、ステップＳ５０２に戻り、階層数設定部４０１は次の階層を下位レイヤに設定し、処理を続行する。一方、拡張レイヤの全てのタイルの画像の符号化処理が終了していると判定した場合（ステップＳ５２１のＹＥＳ）、ステップＳ５２３に進む。ここでは、第２拡張レイヤの符号化が終了していないと判定し、ステップＳ５０２に戻る。 In step S503, overall control unit 115 determines whether or not encoding has been completed for all layers set by layer number setting unit 401. When it is determined that the encoding process for all the tiles has not been completed (NO in step S521), the process returns to step S502, and the hierarchy number setting unit 401 sets the next hierarchy as a lower layer and continues the process. On the other hand, when it is determined that the image encoding process for all tiles in the enhancement layer has been completed (YES in step S521), the process proceeds to step S523. Here, it is determined that the encoding of the second enhancement layer has not ended, and the process returns to step S502.

ステップＳ５２２にて、全体制御部１１５は、端子１０１から入力されるシーケンスに含まれる全てのフレームの画像の符号化処理が終了したか否かを判定する。符号化処理を行っていないフレームが存在する場合は（ステップＳ５２２のＮＯ）、ステップＳ５０１に進み、次のフレームの処理を行う。符号化処理を行っていないフレームが存在しない場合は（ステップＳ５２２のＹＥＳ）、符号化処理を終了する。 In step S522, overall control unit 115 determines whether or not the encoding processing for all the frames included in the sequence input from terminal 101 has been completed. If there is a frame that has not been subjected to the encoding process (NO in step S522), the process proceeds to step S501 to process the next frame. If there is no frame that has not been subjected to the encoding process (YES in step S522), the encoding process ends.

以下、第２拡張レイヤ画像の符号化処理について説明する。即ち、ステップＳ５０２にて、階層数設定部４０１は、ステップＳ５１８乃至ステップＳ５２０で符号化された第１拡張レイヤ階層を上位レイヤとし、第２拡張レイヤを下位レイヤとして設定する。ステップＳ５１５にて、拡張レイヤ分割部１０４は第２拡張レイヤ画像における符号化対象タイルの画像を抽出し、拡張レイヤ符号化部１１２に入力する。 Hereinafter, the encoding process of the second enhancement layer image will be described. That is, in step S502, the layer number setting unit 401 sets the first enhancement layer layer encoded in steps S518 to S520 as an upper layer and the second enhancement layer as a lower layer. In step S515, enhancement layer dividing section 104 extracts an image of the encoding target tile in the second enhancement layer image, and inputs it to enhancement layer encoding section 112.

ステップＳ５１７にて、符号化対象タイルは符号化対象のフレームにおける独立タイルである。このため、拡大部４０９は、拡張レイヤ再構成部４１３に格納されている上位レイヤ（第１拡張レイヤ）の再構成画像から、符号化対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる再構成画像を入力する。拡大部４０９は、入力された独立タイルセットの再構成画像のみを用いて、フィルタリング等で拡大して上位レイヤ（第１拡張レイヤ）の拡大画像を生成し、当該拡大画像を拡張レイヤ符号化部１１２に入力する。ステップＳ５１８にて、拡張レイヤ符号化部１１２は、拡張レイヤ分割部１０４から入力された符号化対象の下位レイヤ（第２拡張レイヤ）のタイルの画像を、局所復号済みの再構成画像を参照して予測及び符号化する。即ち、拡張レイヤ符号化部１１２は、ステップＳ５１７で生成された上位レイヤ（第１拡張レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ符号化部１１２は、拡張レイヤ再構成部４１３に格納されている局所復号済みの下位レイヤ（第２拡張レイヤ）において符号化対象タイルの位置と相対的に等しい位置の独立タイルセットの画像を参照してフレーム間予測を行う。さらに、拡張レイヤ符号化部１１２は、符号化対象タイル内の下位レイヤ（第２拡張レイヤ）の局所復号済みの再構成画像を参照してイントラ予測を行う。拡張レイヤ符号化部１１２は、これらの予測によって得られた予測に関する情報（フレーム間予測によって得られた動きベクトル等）及び予測誤差を符号化する。さらに、拡張レイヤ再構成部４１３は、拡張レイヤ符号化部１１２で符号化の途中で生成された係数等を用いて、下位レイヤ（第２拡張レイヤ）の再構成画像を順次生成し、保持する。 In step S517, the encoding target tile is an independent tile in the encoding target frame. For this reason, the enlargement unit 409 converts an upper layer (first enhancement layer) reconstructed image stored in the enhancement layer reconstruction unit 413 into an independent tile set at a position relatively equal to the position of the encoding target tile. Input the reconstructed image to be included. The enlargement unit 409 uses only the reconstructed image of the input independent tile set to generate an enlarged image of a higher layer (first enhancement layer) by filtering or the like, and the enlarged image is an enhancement layer encoding unit 112. In step S518, enhancement layer encoding section 112 refers to the tiled image of the encoding target lower layer (second enhancement layer) input from enhancement layer dividing section 104, and the locally decoded reconstructed image. Predict and encode. That is, the enhancement layer encoding unit 112 performs inter-layer prediction with reference to the enlarged image of the upper layer (first enhancement layer) generated in step S517. Also, the enhancement layer encoding unit 112 is an independent tile set at a position relatively equal to the position of the encoding target tile in the locally decoded lower layer (second enhancement layer) stored in the enhancement layer reconstruction unit 413. The inter-frame prediction is performed with reference to the image. Furthermore, the enhancement layer encoding unit 112 performs intra prediction with reference to the locally decoded reconstructed image of the lower layer (second enhancement layer) in the encoding target tile. The enhancement layer encoding unit 112 encodes information related to prediction (such as a motion vector obtained by inter-frame prediction) and a prediction error obtained by these predictions. Furthermore, the enhancement layer reconfiguration unit 413 sequentially generates and holds a reconstructed image of the lower layer (second enhancement layer) using the coefficients and the like generated during the encoding by the enhancement layer encoding unit 112. .

一方、ステップＳ５１９にて、符号化対象タイルは符号化対象のフレームにおける独立タイルではない。このため、拡大部４０９は、拡張レイヤ再構成部４１３に格納されている上位の拡張レイヤ（第１拡張レイヤ）の再構成画像を用いてフィルタリング等で拡大して上位レイヤ（第１拡張レイヤ）の拡大画像を生成する。さらに、拡大部４０９は、生成した拡大画像を拡張レイヤ符号化部１１２に出力する。 On the other hand, in step S519, the encoding target tile is not an independent tile in the encoding target frame. Therefore, the enlargement unit 409 enlarges the upper layer (first enhancement layer) by filtering or the like using the reconstructed image of the upper enhancement layer (first enhancement layer) stored in the enhancement layer reconstruction unit 413. An enlarged image of is generated. Further, the enlargement unit 409 outputs the generated enlarged image to the enhancement layer encoding unit 112.

ステップＳ５２０にて、拡張レイヤ符号化部１１２は、拡張レイヤ分割部１０４から入力された符号化対象タイルの画像を局所復号済みの再構成画像を参照して符号化する。即ち、拡張レイヤ符号化部１１２は、ステップＳ５１９で生成された上位レイヤ（第１拡張レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ符号化部１１２は、拡張レイヤ再構成部４１３に格納されている局所復号済みの下位レイヤ（第２拡張レイヤ階層）の再構成画像を参照してフレーム間予測を行う。さらに拡張レイヤ符号化部１１２は、下位レイヤ（第２拡張レイヤ）の符号化対象タイル内の局所復号済みの再構成画像を参照してイントラ予測を行う。拡張レイヤ符号化部１１２は、これらの予測によって得られた予測に関する情報及び予測誤差を符号化する。さらに、拡張レイヤ再構成部４１３は、拡張レイヤ符号化部１１２で符号化の途中で生成された係数等を用いて、下位レイヤ（第２拡張レイヤ）の再構成生画像を順次生成し、保持する。 In step S520, enhancement layer encoding section 112 encodes the encoding target tile image input from enhancement layer dividing section 104 with reference to the locally decoded reconstructed image. That is, the enhancement layer encoding unit 112 performs inter-layer prediction with reference to the enlarged image of the upper layer (first enhancement layer) generated in step S519. Also, the enhancement layer encoding unit 112 performs inter-frame prediction with reference to the reconstructed image of the locally decoded lower layer (second enhancement layer hierarchy) stored in the enhancement layer reconstruction unit 413. Further, the enhancement layer encoding unit 112 performs intra prediction with reference to the locally decoded reconstructed image in the encoding target tile of the lower layer (second enhancement layer). The enhancement layer encoding unit 112 encodes information and prediction error related to prediction obtained by these predictions. Furthermore, the enhancement layer reconstruction unit 413 sequentially generates and holds the reconstructed raw image of the lower layer (second enhancement layer) using the coefficients and the like generated during the encoding by the enhancement layer encoding unit 112. To do.

ステップＳ５０３にて、全体制御部１１５は、階層数設定部４０１で設定された全ての階層について符号化が終了したか否かを判定し、終了していると判定した場合はステップＳ５２２へ進み、終了していないと判定した場合はステップＳ５０２に戻る。ここでは、第２拡張レイヤまでの符号化が終了しているため、ステップＳ５２２に進む。ステップＳ５２２にて、全てのフレームの符号化が終了すれば、符号化処理を終了する。 In step S503, the overall control unit 115 determines whether or not encoding has been completed for all the layers set by the number-of-layers setting unit 401. If it is determined that encoding has been completed, the overall control unit 115 proceeds to step S522. If it is determined that the process has not ended, the process returns to step S502. Here, since the encoding up to the second enhancement layer has been completed, the process proceeds to step S522. If the encoding of all the frames is completed in step S522, the encoding process is ended.

以上の動作によって、拡張レイヤが複数階層存在する場合においても、独立タイルセットを必要な符号データだけを復号し、最小の画像の参照のみで復号画像を再生できる符号データを生成できる。 With the above operation, even when there are a plurality of enhancement layers, it is possible to decode only the code data that requires the independent tile set and generate code data that can reproduce the decoded image only by referring to the minimum image.

また、ＭＣＴＳＳＥＩ符号がビットストリームに存在する場合、タイル位置一致情報であるｖｕｉ＿ｐａｒａｍｅｔｅｒｓのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号は１に必ずセットされる。即ち、ｖｕｉ＿ｐａｒａｍｅｔｅｒｓにおいて、ＭＣＴＳＳＥＩ符号がビットストリームに存在する場合、符号データとしてのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を省略することもできる。もし、ＭＣＴＳＳＥＩ符号がビットストリームに無ければ、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号の値を符号化し、その符号データがビットストリームに含まれる。ＭＣＴＳＳＥＩ符号がビットストリームにあれば、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号の値は符号化されず、復号側で必ず１の値が設定される。このようにすることで、冗長となるｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を削減することが可能になる。 When the MCTS SEI code is present in the bitstream, the tile_boundaries_aligned_flag code of vui_parameters, which is tile position match information, is always set to 1. That is, when MCTS SEI code is present in the bitstream in vui_parameters, the tile_boundaries_aligned_flag code as code data can be omitted. If the MCTS SEI code is not in the bitstream, the value of the tile_boundaries_aligned_flag code is encoded, and the code data is included in the bitstream. If the MCTS SEI code is in the bitstream, the value of the tile_boundaries_aligned_flag code is not encoded, and a value of 1 is always set on the decoding side. This makes it possible to reduce redundant tile_boundaries_aligned_flag codes.

また、階層符号化において、重要な領域を切り出し、その切り出された領域に独立タイルセットを適応させて符号化することで、重要な領域を高速に読み出せる符号データを生成することができる。 Further, in hierarchical encoding, an important area is cut out and encoded by applying an independent tile set to the extracted area, thereby generating code data capable of reading out the important area at high speed.

＜実施形態２＞
以下、図６を用いて本実施形態に係る画像復号装置を構成する各処理部の概要を説明する。図６は、本実施形態の画像復号部６０５を有する画像表示装置６００を示すブロック図である。本実施形態では、実施形態１で生成されたビットストリームを復号する場合を例にとり説明を行う。 <Embodiment 2>
Hereinafter, the outline of each processing unit constituting the image decoding apparatus according to the present embodiment will be described with reference to FIG. FIG. 6 is a block diagram showing an image display device 600 having the image decoding unit 605 of the present embodiment. In the present embodiment, the case where the bitstream generated in the first embodiment is decoded will be described as an example.

６０１は通信等によってビットストリームを入力するインターフェースである。６０２はインターフェース６０１から入力されたビットストリームや予め記録されていたビットストリームを格納する記憶部である。６０３はユーザによって指示されたビットストリームを表示するための、表示方法を指定する表示制御部である。表示制御部６０３は復号する階層（レイヤ）と復号する領域（表示領域）とを表示制御信号として画像復号部６０５に出力する。本実施形態において、復号する階層は階層数で表され、表示領域は表示するタイルの位置で表されるものとするが、本発明はこれに限定されない。６０４はセレクタであり、入力するビットストリームの入力先を指定する。６０５は本実施形態に係る画像復号部である。６０６は表示部であり、画像復号部６０５で生成された復号画像を表示する。 Reference numeral 601 denotes an interface for inputting a bit stream by communication or the like. A storage unit 602 stores a bit stream input from the interface 601 and a bit stream recorded in advance. Reference numeral 603 denotes a display control unit that designates a display method for displaying the bitstream designated by the user. The display control unit 603 outputs the decoding hierarchy (layer) and the decoding area (display area) to the image decoding unit 605 as display control signals. In this embodiment, the hierarchy to be decoded is represented by the number of hierarchies, and the display area is represented by the position of the tile to be displayed. A selector 604 designates an input destination of an input bit stream. Reference numeral 605 denotes an image decoding unit according to the present embodiment. Reference numeral 606 denotes a display unit that displays the decoded image generated by the image decoding unit 605.

次に、画像表示装置６００における画像の表示動作を以下に説明する。尚、表示制御部６０３が、ユーザによってビットストリームの基本レイヤ画像を復号及び表示すること指示された場合について説明する。これは監視カメラ等によって撮像された画像を符号化したビットストリームを入力して、当該撮像された画像の全体をモニタリングする場合に相当する。インターフェース６０１は、監視カメラ等からフレーム単位で入力されるビットストリーム（入力ビットストリーム）を受信し、記憶部６０２及びセレクタ６０４に出力する。記憶部６０２は、入力ビットストリームを記録し、セレクタ６０４は表示制御部６０３による指示によって、入力ビットストリームを画像復号部６０５に出力する。画像復号部６０５は、表示制御部６０３から表示制御信号として表示するレイヤ、及び表示するタイル等の情報を入力する。即ち、表示制御部６０３はビットストリームの基本レイヤの復号及び表示をユーザによって指示されるため、画像復号部６０５には、復号するレイヤが基本レイヤであることを示す情報と、表示領域が全てのタイルであることを示す情報とが入力される。 Next, an image display operation in the image display apparatus 600 will be described below. A case will be described in which the display control unit 603 is instructed by the user to decode and display the base layer image of the bitstream. This corresponds to a case where a bit stream obtained by encoding an image captured by a monitoring camera or the like is input and the entire captured image is monitored. The interface 601 receives a bit stream (input bit stream) input in units of frames from a monitoring camera or the like, and outputs it to the storage unit 602 and the selector 604. The storage unit 602 records the input bit stream, and the selector 604 outputs the input bit stream to the image decoding unit 605 according to an instruction from the display control unit 603. The image decoding unit 605 receives information such as a layer to be displayed as a display control signal and a tile to be displayed from the display control unit 603. That is, since the display control unit 603 is instructed by the user to decode and display the base layer of the bitstream, the image decoding unit 605 includes information indicating that the layer to be decoded is the base layer, and all display areas are displayed. Information indicating that it is a tile is input.

以下、図７を用いて本実施形態に係る画像復号部６０５を構成する各処理部の概要を説明する。図７は、本実施形態の画像復号部６０５を示すブロック図である。 Hereinafter, the outline of each processing unit constituting the image decoding unit 605 according to the present embodiment will be described with reference to FIG. FIG. 7 is a block diagram showing the image decoding unit 605 of the present embodiment.

図７における７０１は、セレクタ６０４から出力されたビットストリームを入力する端子である。説明を容易にするため、ビットストリームは、ヘッダデータや１フレームずつの符号データが入力されるものとする。本実施形態において、このフレーム単位の符号データには、１フレームを構成する全ての階層符号データが含まれているものとするが、本発明はこれに限定されず、スライス等の単位で入力されても構わない。また、フレームのデータ構成もこれに限定されない。 701 in FIG. 7 is a terminal for inputting the bit stream output from the selector 604. For ease of explanation, it is assumed that the bit stream is input with header data and code data for each frame. In the present embodiment, it is assumed that this frame-unit code data includes all the hierarchical code data constituting one frame, but the present invention is not limited to this, and is input in units such as slices. It doesn't matter. The data structure of the frame is not limited to this.

７０２は図６の表示制御部６０３から出力された復号に関する表示制御信号を入力する端子である。表示制御信号としては、復号するレイヤ及び復号するタイルの位置情報が入力される。さらに、端子７０２に入力された表示制御信号は、分離部７０４、基本レイヤ復号部７０７、拡張レイヤ復号部７１０に入力される。７０３はバッファであり、端子７０１から入力された１フレーム分の階層符号データを格納する。 Reference numeral 702 denotes a terminal for inputting a display control signal related to decoding output from the display control unit 603 in FIG. As the display control signal, the layer information to be decoded and the position information of the tile to be decoded are input. Further, the display control signal input to terminal 702 is input to separation section 704, base layer decoding section 707, and enhancement layer decoding section 710. A buffer 703 stores one frame of hierarchical code data input from the terminal 701.

７０４は分離部である。分離部７０４は、バッファ７０３から入力された階層符号データからヘッダ符号データ、基本レイヤ符号データ、各拡張レイヤ符号データを分離する。さらに、分離部７０４は、レイヤ毎に分離した階層符号データ（基本レイヤ符号データ及び各拡張レイヤ符号データ）を、タイル毎の符号データにそれぞれ分割して出力する。そして、分離されたそれぞれの符号データは、ヘッダ復号部７０５、基本レイヤ復号部７０７、拡張レイヤ復号部７１０に出力される。また、分離部７０４は、タイル毎に分離した符号データを各処理部へ出力する場合に、出力するタイル（復号対象のタイル）の番号をタイルの位置情報として独立タイル判定部７０６に出力する。 Reference numeral 704 denotes a separation unit. The separation unit 704 separates the header code data, base layer code data, and each enhancement layer code data from the hierarchical code data input from the buffer 703. Furthermore, the separation unit 704 divides the hierarchical code data (base layer code data and each enhancement layer code data) separated for each layer into code data for each tile and outputs the divided code data. Each separated code data is output to header decoding section 705, base layer decoding section 707, and enhancement layer decoding section 710. Further, when outputting the code data separated for each tile to each processing unit, the separation unit 704 outputs the number of the tile to be output (decoding target tile) to the independent tile determination unit 706 as tile position information.

７０５はヘッダ復号部である。ヘッダ復号部７０５は、シーケンス単位及びピクチャ単位のヘッダ符号データを復号し、復号に必要なパラメータを再生する。特に、ヘッダ符号データにＭＣＴＳＳＥＩ符号が存在する場合、ヘッダ復号部７０５はこれも復号する。特に、ヘッダ復号部７０５は、独立タイルフラグと独立タイル位置情報とを復号し、再生する。７０６は、復号対象のタイル（復号対象タイル）が独立タイルであるか否かを判定する独立タイル判定部である。独立タイル判定部７０６は、ヘッダ復号部７０５から入力した独立タイルフラグ及び独立タイル位置情報と、分離部７０４から入力した復号対象タイルの位置情報とに基づいて、復号対象タイルが独立タイルであるか否かを判定する。さらに、独立タイル判定部７０６は、判定結果を基本レイヤ復号部７０７及び拡張レイヤ復号部７１０に入力する。 Reference numeral 705 denotes a header decoding unit. The header decoding unit 705 decodes header code data in sequence units and picture units, and reproduces parameters necessary for decoding. In particular, when the MCTS SEI code exists in the header code data, the header decoding unit 705 also decodes this. In particular, the header decoding unit 705 decodes and reproduces the independent tile flag and the independent tile position information. Reference numeral 706 denotes an independent tile determination unit that determines whether a decoding target tile (decoding target tile) is an independent tile. The independent tile determination unit 706 determines whether the decoding target tile is an independent tile based on the independent tile flag and the independent tile position information input from the header decoding unit 705 and the decoding target tile position information input from the separation unit 704. Determine whether or not. Furthermore, the independent tile determination unit 706 inputs the determination result to the base layer decoding unit 707 and the enhancement layer decoding unit 710.

７０７は基本レイヤ復号部である。基本レイヤ復号部７０７は、分離部７０４で分離された基本レイヤのタイルの符号データを復号し、基本レイヤの復号画像を生成する。７０８はフレームメモリであり、基本レイヤ復号部７０７で生成された基本レイヤの各タイルの復号画像を保持する。７０９は拡大部であり、基本レイヤの復号画像を拡張レイヤの解像度に拡大して拡大画像を生成する。７２０はセレクタであり、基本レイヤの復号画像又は拡張レイヤの復号画のうち所望の復号画像を選択し、選択した復号画像を端子７１２に出力する。７１２は端子であり、セレクタ７２０から入力された復号画像を画像復号部６０５の外部に出力する。 Reference numeral 707 denotes a base layer decoding unit. The base layer decoding unit 707 decodes the code data of the tiles of the base layer separated by the separation unit 704, and generates a decoded image of the base layer. A frame memory 708 holds a decoded image of each tile of the base layer generated by the base layer decoding unit 707. Reference numeral 709 denotes an enlargement unit that enlarges the decoded image of the base layer to the resolution of the enhancement layer and generates an enlarged image. A selector 720 selects a desired decoded image from the decoded image of the base layer or the decoded image of the enhancement layer, and outputs the selected decoded image to the terminal 712. Reference numeral 712 denotes a terminal that outputs the decoded image input from the selector 720 to the outside of the image decoding unit 605.

７１０は拡張レイヤ復号部である。拡張レイヤ復号部７１０は、分離部７０４で分離された拡張レイヤのタイルの符号データを復号し、拡張レイヤの復号画像を生成する。７１１はフレームメモリであり、拡張レイヤ復号部７１０で生成された拡張レイヤの各タイルの復号画像を保持する。 Reference numeral 710 denotes an enhancement layer decoding unit. The enhancement layer decoding unit 710 decodes the code data of the enhancement layer tile separated by the separation unit 704, and generates an enhancement layer decoded image. Reference numeral 711 denotes a frame memory which holds a decoded image of each tile of the enhancement layer generated by the enhancement layer decoding unit 710.

全体制御部７１４は、画像復号部６０５の各処理部の制御、及び各処理部間のパラメータ伝達を行う。尚、図１において、全体制御部７１４と画像復号部６０５内の各処理部との間の結線を省略している。そして、全体制御部７１４は画像復号部６０５内の各処理部の制御、及び各処理部間のパラメータの読み書きを、パラメータ信号線またはレジスタバスのいずれかを通じて行うことが可能である。また、本実施形態において、図１の全体制御部７１４は、画像復号部６０５内に設置されているが、本発明はこれに限定されない。即ち、全体制御部７１４は、当該画像復号部６０５外に設置され、当該画像復号部６０５内の各処理部の制御、及び各処理部間のパラメータの読み書きを、パラメータ信号線またはレジスタバスのいずれかを通じて行ってもよい。 The overall control unit 714 controls each processing unit of the image decoding unit 605 and transmits parameters between the processing units. In FIG. 1, connection between the overall control unit 714 and each processing unit in the image decoding unit 605 is omitted. The overall control unit 714 can control each processing unit in the image decoding unit 605 and read / write parameters between the processing units through either a parameter signal line or a register bus. In the present embodiment, the overall control unit 714 in FIG. 1 is installed in the image decoding unit 605, but the present invention is not limited to this. That is, the overall control unit 714 is installed outside the image decoding unit 605, and controls each processing unit in the image decoding unit 605 and reads / writes parameters between the processing units using either a parameter signal line or a register bus. You may go through.

上述した画像復号部６０５における、画像の復号動作を図８に示したフローチャートを用いて以下に説明する。 The image decoding operation in the image decoding unit 605 described above will be described below with reference to the flowchart shown in FIG.

まず、復号対象のレイヤ（復号対象レイヤ）が基本レイヤのみの場合について述べる。ここでは、ユーザが表示制御部６０３に、インターフェース６０１から入力されるビットストリームにおいて基本レイヤの、復号及び表示を指示するとする。 First, the case where the decoding target layer (decoding target layer) is only the base layer will be described. Here, it is assumed that the user instructs the display control unit 603 to decode and display the base layer in the bitstream input from the interface 601.

ステップＳ８０１にて、端子７０１から入力された、ビットストリームの先頭に存在するヘッダ符号データは、バッファ７０３及び分離部７０４による処理を経てヘッダ復号部７０５に入力される。ヘッダ復号部７０５は、シーケンスヘッダの１つであるｖｉｄｅｏ＿ｐａｒａｍｅｔｅｒ＿ｓｅｔを復号する。このｖｉｄｅｏ＿ｐａｒａｍｅｔｅｒ＿ｓｅｔには階層符号化の階層数を表すｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号が含まれる。本実施形態において、ｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号は１である。続いてヘッダ復号部７０５は、Ｓｅｑｕｅｎｃｅｐａｒａｍｅｔｅｒｓｅｔ符号を復号する。Ｓｅｑｕｅｎｃｅｐａｒａｍｅｔｅｒｓｅｔ符号にはｖｕｉ＿ｐａｒａｍｅｔｅｒｓも含まれる。ｖｕｉ＿ｐａｒａｍｅｔｅｒｓにはタイル位置一致情報であるｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号が含まれている。本実施形態において、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号は１である。 In step S 801, the header code data that is input from the terminal 701 and exists at the head of the bitstream is input to the header decoding unit 705 through processing by the buffer 703 and the separation unit 704. The header decoding unit 705 decodes video_parameter_set, which is one of the sequence headers. This video_parameter_set includes a vps_max_layers_minus1 code that represents the number of layers of layer encoding. In the present embodiment, the vps_max_layers_minus1 code is 1. Subsequently, the header decoding unit 705 decodes the Sequence parameter set code. The sequence parameter set code also includes vui_parameters. The vui_parameters includes a tile_boundaries_aligned_flag code that is tile position match information. In the present embodiment, the tile_boundaries_aligned_flag code is 1.

ステップＳ８０２にて、ヘッダ復号部７０５は、Ｐｉｃｔｕｒｅｐａｒａｍｅｔｅｒｓｅｔ符号を復号する。これらのヘッダ符号データの復号については非特許文献１に詳細に記載されているのでここでは説明を省略する。 In step S802, the header decoding unit 705 decodes the Picture parameter set code. Since decoding of these header code data is described in detail in Non-Patent Document 1, description thereof is omitted here.

ステップＳ８０３にて、独立タイル判定部７０６は、復号対象のフレーム内に独立タイルがあるか否かを判定する。そして、判定結果を、独立タイルフラグとする。尚、実際には〜にＭＣＴＳＳＥＩの有無を判定する。ヘッダ符号データにＭＣＴＳＳＥＩが存在するのであれば、独立タイルフラグを１とし、ステップＳ８０４に進む。ヘッダ符号データにＭＣＴＳＳＥＩが存在しないのであれば、独立タイル可否フラグを０とし、ステップＳ８０５に進む。本実施形態では、ヘッダ符号データにＭＣＴＳＳＥＩが存在すると判断して、独立タイルフラグを１とし、ステップＳ８０４に進む。尚、復号対象のフレーム内に独立タイルが存在する場合、タイル位置一致情報であるｖｕｉ＿ｐａｒａｍｅｔｅｒｓのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号は１となっている必要がある。もし、ｖｕｉ＿ｐａｒａｍｅｔｅｒｓのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号が１でなければ、ヘッダ復号部７０５は、エラーを返して復号を停止するようにしても構わない。さらに、ヘッダ復号部７０５は、独立タイルフラグを独立タイル判定部７０６、基本レイヤ復号部７０７、及び拡張レイヤ復号部７１０に入力さする。 In step S803, the independent tile determination unit 706 determines whether there is an independent tile in the decoding target frame. Then, the determination result is an independent tile flag. Actually, it is determined whether or not MCTS SEI is present. If MCTS SEI exists in the header code data, the independent tile flag is set to 1, and the process proceeds to step S804. If MCTS SEI does not exist in the header code data, the independent tile availability flag is set to 0, and the process proceeds to step S805. In this embodiment, it is determined that MCTS SEI is present in the header code data, the independent tile flag is set to 1, and the process proceeds to step S804. When an independent tile exists in the decoding target frame, the tile_boundaries_aligned_flag code of vui_parameters that is tile position matching information needs to be 1. If the tile_boundaries_aligned_flag code of vui_parameters is not 1, the header decoding unit 705 may return an error and stop decoding. Further, the header decoding unit 705 inputs the independent tile flag to the independent tile determination unit 706, the base layer decoding unit 707, and the enhancement layer decoding unit 710.

ステップＳ８０４にて、ヘッダ復号部７０５はＭＣＴＳＳＥＩ符号を復号し、独立タイルフラグと独立タイル位置情報を取得する。 In step S804, the header decoding unit 705 decodes the MCTS SEI code, and acquires the independent tile flag and the independent tile position information.

ステップＳ８０５にて、分離部７０４は端子７０２から入力された表示部分にかかるタイルの位置情報を入力する。本実施形態において、基本レイヤ全体の表示が指示されている。このため、表示部分にかかるタイルは基本レイヤの全てのタイルとなる。即ち、分離部７０４は、基本レイヤの復号対象タイルの符号データを、タイル０からタイル番号順でバッファ７０３から抽出し、基本レイヤ復号部７０７に出力する。 In step S 805, the separation unit 704 inputs tile position information regarding the display portion input from the terminal 702. In this embodiment, display of the entire base layer is instructed. For this reason, the tiles related to the display portion are all the tiles of the base layer. That is, the separation unit 704 extracts the code data of the decoding target tile of the base layer from the buffer 703 in order of the tile number from the tile 0 and outputs the code data to the base layer decoding unit 707.

ステップＳ８０６にて、独立タイル判定部７０６は、分離部７０４から復号対象タイルの番号を入力する。また、独立タイル判定部７０６は、ヘッダ復号部７０５から独立タイル位置情報を入力する。本実施形態では、独立タイルセットは１つであり、独立タイル位置情報は５と６である。独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。そして、復号対象タイルのタイル番号が独立タイル位置情報のタイル番号と一致する場合（ステップＳ８０６のＹＥＳ）、独立タイル判定部７０６は復号対象タイルが独立タイルであると判定し、ステップＳ８０７に進む。復号対象タイルのタイル番号が独立タイル位置情報のタイル番号と一致しない場合（ステップＳ８０６のＮＯ）、復号対象タイルが独立タイルセットのタイルではないと判定し、ステップＳ８０８に進む。 In step S806, the independent tile determination unit 706 inputs the number of the decoding target tile from the separation unit 704. The independent tile determination unit 706 also receives the independent tile position information from the header decoding unit 705. In this embodiment, there is one independent tile set, and the independent tile position information is 5 and 6. The independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. If the tile number of the decoding target tile matches the tile number of the independent tile position information (YES in step S806), the independent tile determination unit 706 determines that the decoding target tile is an independent tile, and proceeds to step S807. If the tile number of the decoding target tile does not match the tile number of the independent tile position information (NO in step S806), it is determined that the decoding target tile is not an independent tile set tile, and the process proceeds to step S808.

ステップＳ８０７にて、復号対象タイルは基本レイヤの復号対象のフレームにおける独立タイルである。このため、基本レイヤ復号部７０７は、復号済みの基本レイヤの他のフレームにおける、当該復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の独立タイルと、復号対象タイル内の復号済み画素のみを参照して復号を行う。即ち、基本レイヤ復号部７０７は、フレームメモリ７０８に格納されている、復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の独立タイルの復号画像を参照してフレーム間予測を行う。さらに、基本レイヤ復号部７０７は、フレームメモリ７０８に格納されている復号対象タイル内の復号画像を参照してイントラ予測を行う。そして、基本レイヤ復号部７０７は、復号した基本レイヤの復号対象タイルの復号画像をフレームメモリ７０８に格納する。尚、当該復号画像は以後のタイルの復号時に参照される。また、基本レイヤ復号部７０７は、当該基本レイヤのタイルの復号画像をセレクタ７２０及び端子７１２を介して図６の表示部６０６へ出力する。 In step S807, the decoding target tile is an independent tile in the decoding target frame of the base layer. For this reason, the base layer decoding unit 707, in other frames of the decoded base layer, the independent tile in the independent tile set at a position relatively equal to the position of the decoding target tile, and the decoded in the decoding target tile Decoding is performed with reference to only pixels. That is, the base layer decoding unit 707 performs inter-frame prediction with reference to the decoded image of the independent tile in the independent tile set that is stored in the frame memory 708 and is relatively equal to the position of the decoding target tile. Further, the base layer decoding unit 707 performs intra prediction with reference to the decoded image in the decoding target tile stored in the frame memory 708. Then, the base layer decoding unit 707 stores the decoded image of the decoded base layer decoding target tile in the frame memory 708. The decoded image is referred to when decoding the tile thereafter. Also, the base layer decoding unit 707 outputs the decoded image of the tile of the base layer to the display unit 606 in FIG. 6 via the selector 720 and the terminal 712.

ステップＳ８０８にて、復号対象タイルは基本レイヤの復号対象のフレームにおける独立タイルではない。このため、基本レイヤ復号部７０７は、復号済みのフレームの基本レイヤの復号画像と復号対象のフレームの基本レイヤの復号済み画素を参照して復号を行う。即ち基本レイヤ復号部７０７は、フレームメモリ７０８に格納されている復号画像を参照してフレーム間予測を行う。さらに、基本レイヤ復号部７０７は、復号対象タイル内の復号済みの復号画像を参照してイントラ予測を行う。そして、基本レイヤ復号部７０７は、復号した基本レイヤの復号対象タイルの復号画像をフレームメモリ７０８に格納する。尚、当該復号画像は以降のタイルの復号時に参照される。また、基本レイヤ復号部７０７は、当該基本レイヤのタイルの復号画像をセレクタ７２０及び端子７１２を介して図６の表示部６０６へ出力する。 In step S808, the decoding target tile is not an independent tile in the decoding target frame of the base layer. For this reason, the base layer decoding unit 707 performs decoding with reference to the decoded image of the base layer of the decoded frame and the decoded pixels of the base layer of the decoding target frame. That is, the base layer decoding unit 707 performs inter-frame prediction with reference to the decoded image stored in the frame memory 708. Furthermore, the base layer decoding unit 707 performs intra prediction with reference to the decoded image that has been decoded in the decoding target tile. Then, the base layer decoding unit 707 stores the decoded image of the decoded base layer decoding target tile in the frame memory 708. The decoded image is referred to when decoding the subsequent tiles. Also, the base layer decoding unit 707 outputs the decoded image of the tile of the base layer to the display unit 606 in FIG. 6 via the selector 720 and the terminal 712.

ステップＳ８０９にて、全体制御部７１４は、基本レイヤの１フレーム分の全てのタイルの符号データを復号したか否かを判定する。基本レイヤの１フレーム分の全てのタイルの符号データの復号処理が終わっていないと判定された場合（ステップＳ８０９のＮＯ）、ステップＳ８０５に戻り、分離部７０４は次のタイルを抽出して出力し、処理を続行する。一方、基本レイヤの１フレーム分の全てのタイルの符号データの復号処理が終了していると判定された場合（ステップＳ８０９のＹＥＳ）、ステップＳ８１０に進む。 In step S809, the overall control unit 714 determines whether the code data of all tiles for one frame of the base layer have been decoded. If it is determined that the decoding process of the encoded data of all the tiles for one frame of the base layer has not been completed (NO in step S809), the process returns to step S805, and the separation unit 704 extracts and outputs the next tile. ,continue processing. On the other hand, if it is determined that the decoding process of the code data of all tiles for one frame of the base layer has been completed (YES in step S809), the process proceeds to step S810.

ステップＳ８１０にて、分離部７０４は、図６の表示制御部６０３から端子７０２を介して入力された表示制御信号に基づいて、復号及び表示するレイヤに拡張レイヤが含まれているか否かを判定する。拡張レイヤの復号及び表示が指示されている場合（ステップＳ８１０におけるＹＥＳ）はステップＳ８１１に進み、そうでない場合（ステップＳ８１０におけるＮＯ）はステップＳ８１８に進む。ここでは、基本レイヤのみの復号であることからステップＳ８１８に進み、拡張レイヤ復号部７１０は復号処理を行わない。 In step S810, the separation unit 704 determines whether or not the layer to be decoded and displayed includes an enhancement layer based on the display control signal input from the display control unit 603 in FIG. 6 via the terminal 702. To do. If decoding and display of the enhancement layer is instructed (YES in step S810), the process proceeds to step S811, and if not (NO in step S810), the process proceeds to step S818. Here, since it is decoding of only a base layer, it progresses to step S818 and the enhancement layer decoding part 710 does not perform a decoding process.

ステップＳ８１８にて、全体制御部７１４は、端子７０１から入力されるシーケンスに含まれる全てのフレームの基本レイヤの符号データ又は拡張レイヤの符号データの復号処理が終了したか否かを判定する。ここでは、全体制御部７１４が全てのフレームの基本レイヤの符号データの復号を終了したか否かを判定する。復号処理を行っていない基本レイヤ又は拡張レイヤの符号データが存在する場合は（ステップＳ８１８のＮＯ）、ステップＳ８０５に進み、次のフレームの処理を行う。復号処理を行っていないフレームの符号データが存在しない場合は（ステップＳ８１８のＹＥＳ）、復号処理を終了する。 In step S818, overall control unit 714 determines whether or not the decoding process of the base layer code data or the enhancement layer code data of all the frames included in the sequence input from terminal 701 has been completed. Here, the overall control unit 714 determines whether or not the decoding of the code data of the base layer of all frames has been completed. If there is base layer or enhancement layer code data that has not been decoded (NO in step S818), the process advances to step S805 to process the next frame. If there is no code data for a frame that has not been decoded (YES in step S818), the decoding process ends.

尚、画像復号部６０５によって復号された画像は、図６の表示部６０６に出力される。表示部６０６は、表示制御部６０３から基本レイヤの画像の表示が指示されることにより、画像復号部６０５から出力された基本レイヤの復号画像全体を表示する。 Note that the image decoded by the image decoding unit 605 is output to the display unit 606 of FIG. The display unit 606 displays the entire base layer decoded image output from the image decoding unit 605 when the display control unit 603 instructs the display of the base layer image.

また、ユーザの指示によって表示制御部６０３から記録されている映像の基本レイヤの表示が指示された場合、セレクタ６０４の入力を記憶部６０２とする。そして、表示制御部６０３は記憶部６０２から必要なビットストリームを選択し、セレクタ６０４に出力するよう制御する。 When the display control unit 603 instructs the display of the base layer of the recorded video according to the user's instruction, the input of the selector 604 is the storage unit 602. Then, the display control unit 603 performs control to select a necessary bit stream from the storage unit 602 and output it to the selector 604.

続いて、復号対象レイヤが拡張レイヤの場合について述べる。ユーザから表示制御部６０３に、インターフェース６０１から入力されるビットストリームの拡張レイヤの、復号と一部の表示を指示された場合の復号処理について説明する。これは、監視カメラ等によって撮影された画像の一部を詳細にモニタリングする場合に相当する。画像復号部６０５は、基本レイヤと拡張レイヤの、復号及び表示する領域に含まれるタイルの番号を表示制御部６０３から指示される。本実施形態では説明を簡単にするために、表示する領域に含まれるタイルを図２のタイル５とタイル６の領域とする。以下、画像復号部６０５における、拡張レイヤの画像の復号動作を、基本レイヤのみの復号及び表示を指示された場合と同様に、図８に示したフローチャートに基づいて説明する。また、基本レイヤのみの復号と同じ動作を行う部分は説明を簡略化する。 Next, a case where the decoding target layer is an enhancement layer will be described. Decoding processing when the user instructs the display control unit 603 to decode and partially display the enhancement layer of the bitstream input from the interface 601 will be described. This corresponds to a case where a part of an image taken by a monitoring camera or the like is monitored in detail. The image decoding unit 605 is instructed by the display control unit 603 to specify the tile numbers included in the decoding and display areas of the base layer and the enhancement layer. In this embodiment, in order to simplify the description, the tiles included in the display area are the areas of the tile 5 and the tile 6 in FIG. Hereinafter, the decoding operation of the enhancement layer image in the image decoding unit 605 will be described based on the flowchart shown in FIG. 8 in the same manner as when the decoding and display of only the base layer is instructed. Also, the description of the part that performs the same operation as the decoding of only the base layer is simplified.

ステップＳ８０１にて、基本レイヤのみの表示を指示された場合と同様に、ヘッダ復号部７０５は、ｖｉｄｅｏ＿ｐａｒａｍｅｔｅｒ＿ｓｅｔ及びＳｅｑｕｅｎｃｅｐａｒａｍｅｔｅｒｓｅｔを復号する。そして、ヘッダ復号部７０５はこれらの中のｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号及び、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を復号する。 In step S801, as instructed to display only the base layer, the header decoding unit 705 decodes the video_parameter_set and the Sequence parameter set. The header decoding unit 705 decodes the vps_max_layers_minus1 code and the tile_boundaries_aligned_flag code among them.

ステップＳ８０２にて、基本レイヤのみの表示時と同様に、ヘッダ復号部７０５は、Ｐｉｃｔｕｒｅｐａｒａｍｅｔｅｒｓｅｔ符号を復号する。 In step S802, the header decoding unit 705 decodes the picture parameter set code in the same manner as when only the base layer is displayed.

ステップＳ８０３にて、基本レイヤのみの表示時と同様に、ヘッダ復号部７０５はヘッダ符号データに独立タイルがあるか否かを判定する。 In step S803, as in the case of displaying only the base layer, the header decoding unit 705 determines whether there is an independent tile in the header code data.

ステップＳ８０４にて、基本レイヤのみの表示時と同様に、ヘッダ復号部７０５はＭＣＴＳＳＥＩ符号を復号し、独立タイルフラグと独立タイル位置情報を取得する。 In step S804, the header decoding unit 705 decodes the MCTS SEI code and acquires the independent tile flag and the independent tile position information in the same manner as when only the base layer is displayed.

ステップＳ８０５にて、分離部７０４は端子７０２から入力された表示部分にかかるタイルの位置情報を入力する。本説明では、表示を指示されているタイルの位置はタイル５とタイル６である。ここではまず、分離部７０４は、端子７０２から入力された、表示を指示されているタイルの位置情報に基づいて復号対象タイルをタイル５とし、当該タイル５の基本レイヤの符号データを抽出し、抽出した符号データを基本レイヤ復号部７０７に出力する。また、表示を指示されているタイル位置情報を独立タイル判定部７０６に入力する。 In step S 805, the separation unit 704 inputs tile position information regarding the display portion input from the terminal 702. In this description, the positions of the tiles that are instructed to be displayed are the tile 5 and the tile 6. Here, first, the separating unit 704 sets the decoding target tile as the tile 5 based on the position information of the tile instructed to be input, which is input from the terminal 702, and extracts the code data of the base layer of the tile 5, The extracted code data is output to base layer decoding section 707. Further, tile position information instructed to be displayed is input to the independent tile determination unit 706.

ステップＳ８０６にて、独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。ここで、復号対象タイルであるタイル５は独立タイルであるので、ステップ８０７に進む。 In step S806, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Here, since tile 5 that is the decoding target tile is an independent tile, the process proceeds to step 807.

ステップＳ８０７にて、復号対象タイルは独立タイルである。基本レイヤのみの表示時と同様に、基本レイヤ復号部７０７は、基本レイヤのタイル５の符号データを復号して復号画像を生成し、当該復号画像をフレームメモリ７０８へ格納する。尚、ここでは拡張レイヤの表示を行うので、基本レイヤ復号部７０７は、生成した復号画像の、端子７１２からの出力は行わない。但し、本発明はこれに限定されず、基本レイヤ復号部７０７が復号画像を出力することも可能である。その場合、基本レイヤ復号部７０７で生成される復号画像と、拡張レイヤ復号部７１０で生成される復号画像の両方を出力し、表示部６０６で選択して表示することも可能である。 In step S807, the decoding target tile is an independent tile. As in the case of displaying only the base layer, the base layer decoding unit 707 generates a decoded image by decoding the code data of the tile 5 of the base layer, and stores the decoded image in the frame memory 708. Since the enhancement layer is displayed here, the base layer decoding unit 707 does not output the generated decoded image from the terminal 712. However, the present invention is not limited to this, and the base layer decoding unit 707 can output a decoded image. In this case, both the decoded image generated by the base layer decoding unit 707 and the decoded image generated by the enhancement layer decoding unit 710 can be output and selected and displayed by the display unit 606.

ステップＳ８０９にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる基本レイヤの全てのタイルの符号データを復号したか否かを判定する。ここでは、タイル６の符号データの復号が終わっていないため、ステップＳ８０６に戻り、タイル６の基本レイヤの符号データの復号を行う。 In step S809, the overall control unit 714 determines whether the code data of all tiles of the base layer related to the display portion input from the separation unit 704 has been decoded. Here, since the decoding of the code data of the tile 6 has not been completed, the process returns to step S806, and the code data of the base layer of the tile 6 is decoded.

以下、タイル６の基本レイヤの符号データの復号について説明する。 Hereinafter, decoding of the code data of the base layer of the tile 6 will be described.

ステップＳ８０５にて、分離部７０４は、タイル６の基本レイヤの符号データを抽出する。ステップＳ８０６にて、独立タイル判定部７０６は復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較し、復号対象タイルであるタイル６は独立タイルであるため、ステップＳ８０７に進む。ステップＳ８０７にて、基本レイヤ復号部７０７は、タイル６の基本レイヤの符号データを復号し、復号画像をフレームメモリ７０８へ格納する。 In step S805, the separation unit 704 extracts the code data of the base layer of the tile 6. In step S806, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Since the tile 6 that is the decoding target tile is an independent tile, the process proceeds to step S807. In step S807, the base layer decoding unit 707 decodes the code data of the base layer of the tile 6 and stores the decoded image in the frame memory 708.

ステップＳ８０９にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる基本レイヤの全てのタイルの符号データを復号したと判定し、ステップＳ８１０に進む。 In step S809, the overall control unit 714 determines that the code data of all the tiles of the base layer related to the display portion input from the separation unit 704 has been decoded, and proceeds to step S810.

ステップＳ８１０にて、分離部７０４は、図６の表示制御部６０３から端子７０２を介して入力された表示制御信号に基づいて、表示するレイヤに拡張レイヤが含まれているか否かを判定する。ここでは、拡張レイヤまで表示するので、ステップＳ８１１に進む。 In step S810, the separation unit 704 determines whether or not an extension layer is included in the layer to be displayed, based on the display control signal input from the display control unit 603 in FIG. 6 via the terminal 702. Here, since the enhancement layer is displayed, the process advances to step S811.

ステップＳ８１１にて、ステップＳ８０５と同様に、分離部７０４は端子７０２から入力された表示部分にかかるタイルの位置情報を入力する。ここでは、表示を指示されているタイルの位置はタイル５とタイル６である。分離部７０４は、入力された、表示を指示されているタイルの位置情報に基づいて、復号対象タイルであるタイル５の拡張レイヤの符号データを抽出し、抽出した符号データを拡張レイヤ復号部７１０に出力する。また、表示を指示されているタイル位置情報を独立タイル判定部７０６に入力する。 In step S811, as in step S805, the separation unit 704 inputs tile position information relating to the display portion input from the terminal 702. Here, the positions of the tiles that are instructed to be displayed are the tile 5 and the tile 6. The separation unit 704 extracts the code data of the enhancement layer of the tile 5 that is the decoding target tile based on the input position information of the tile that is instructed to be displayed, and the enhancement layer decoding unit 710 extracts the extracted code data. Output to. Further, tile position information instructed to be displayed is input to the independent tile determination unit 706.

ステップＳ８１２にて、ステップＳ８０６と同様に、独立タイル判定部７０６は復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。各タイル番号が一致すればステップＳ８１３に進み、一致しなければステップＳ８１５に進む。ここでは、独立タイル位置情報は５と６である。したがって、独立タイル判定部７０６は、復号対象タイルであるタイル５は独立タイルセットのタイルであると判定し、ステップＳ８１３に進む。 In step S812, as in step S806, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. If the tile numbers match, the process proceeds to step S813, and if not, the process proceeds to step S815. Here, the independent tile position information is 5 and 6. Therefore, the independent tile determination unit 706 determines that the tile 5 that is the decoding target tile is a tile of the independent tile set, and proceeds to step S813.

ステップＳ８１３にて、復号対象タイルは拡張レイヤの復号対象のフレームにおける独立タイルである。拡大部７０９は、フレームメモリ７０８に格納されている、復号済みの基本レイヤの復号画像から復号対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる復号画像を入力する。拡大部７０９は、入力された独立タイルの復号画像のみを用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ復号部７１０に出力する。 In step S813, the decoding target tile is an independent tile in the decoding target frame of the enhancement layer. The enlargement unit 709 inputs a decoded image included in an independent tile set at a position that is relatively equal to the position of the decoding target tile from the decoded image of the base layer that is stored in the frame memory 708. The enlargement unit 709 uses only the input decoded image of the independent tile to generate an enlarged image by filtering or the like, and outputs the enlarged image to the enhancement layer decoding unit 710.

ステップＳ８１４にて、拡張レイヤ復号部７１０は、分離部７０４から入力された復号対象タイルの拡張レイヤ符号データを復号する。拡張レイヤ復号部７１０は、拡大部７０９から入力される拡大画像と、フレームメモリ７１１に格納された復号済みの拡張レイヤの復号画像と、復号対象タイルの復号済みの画素とを参照して復号画像を生成する。即ち、拡張レイヤ復号部７１０は、ステップＳ８１３で生成された基本レイヤの拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ７１１に格納されている拡張レイヤの復号画像のうち復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。図２を用いて具体的に説明すると、フレーム２０６のタイル５の復号を行う際に、フレーム２０４の拡大画像、復号済みのフレーム２０５のタイル５とタイル６の復号画像、及びフレーム２０６のタイル５の復号済み画素を参照して復号を行う。拡張レイヤ復号部７１０によって生成された拡張レイヤのタイルの復号画像はフレームメモリ７１１に出力され、フレームメモリ７１１で保持される。また、拡張レイヤ復号部７１０で生成された拡張レイヤの復号画像は、セレクタ７２０及び端子７１２を介して図６の表示部６０６に出力される。 In step S814, enhancement layer decoding section 710 decodes the enhancement layer code data of the decoding target tile input from separation section 704. The enhancement layer decoding unit 710 refers to the enlarged image input from the enlargement unit 709, the decoded enhancement layer decoded image stored in the frame memory 711, and the decoded pixel of the decoding target tile. Is generated. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the base layer generated in step S813. Also, the enhancement layer decoding unit 710 refers to the decoded image in the independent tile set at a position relatively equal to the position of the decoding target tile among the decoded images of the enhancement layer stored in the frame memory 711, and performs inter-frame prediction. I do. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. Specifically, referring to FIG. 2, when decoding the tile 5 of the frame 206, the enlarged image of the frame 204, the decoded images of the tiles 5 and 6 of the decoded frame 205, and the tile 5 of the frame 206. Decoding is performed with reference to the decoded pixels. The decoded image of the enhancement layer tile generated by the enhancement layer decoding unit 710 is output to the frame memory 711 and held in the frame memory 711. Also, the decoded image of the enhancement layer generated by the enhancement layer decoding unit 710 is output to the display unit 606 in FIG. 6 via the selector 720 and the terminal 712.

ステップＳ８１７にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる拡張レイヤの全てのタイルの符号データを復号したか否かを判定する。ここでは、タイル６の拡張レイヤの符号データの復号が終わっていないため、ステップＳ８１１に戻り、タイル６の拡張レイヤの符号データの復号を行う。 In step S817, overall control unit 714 determines whether or not the code data of all tiles of the enhancement layer relating to the display portion input from separation unit 704 has been decoded. Here, since the decoding of the code data of the enhancement layer of tile 6 has not been completed, the process returns to step S811, and the code data of the enhancement layer of tile 6 is decoded.

以下、タイル６の拡張レイヤの符号データの復号について説明する。 Hereinafter, decoding of the code data of the enhancement layer of tile 6 will be described.

ステップＳ８１１にて、タイル６の拡張レイヤの符号データを抽出する。ステップＳ８１２にて、独立タイル判定部７０６は復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。ここで、復号対象タイルであるタイル６は独立タイルであるため、ステップＳ８１３に進む。 In step S811, the code data of the enhancement layer of tile 6 is extracted. In step S812, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Here, since the tile 6 that is the decoding target tile is an independent tile, the process advances to step S813.

ステップＳ８１３にて、拡大部７０９は、入力された独立タイルの復号画像のみを用いて、拡大画像を生成する。 In step S813, the enlargement unit 709 generates an enlarged image using only the input decoded image of the independent tile.

ステップＳ８１４にて、拡張レイヤ復号部７１０は、タイル６の拡張レイヤの符号データを復号して復号画像を生成し、当該復号画像をフレームメモリ７１１へ格納する。拡張レイヤ復号部７１０は、タイル６の拡張レイヤの符号データの復号において、拡大部７０９から入力される拡大画像と、フレームメモリ７１１に格納された復号済みの拡張レイヤの復号画像と、復号対象タイルの復号済みの画素とを参照する。即ち、拡張レイヤ復号部７１０は、ステップＳ８１３で生成された基本レイヤの拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ７１１に格納されている拡張レイヤにおいて復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。図２を用いて具体的に説明すると、フレーム２０６のタイル６の復号を行う際に、フレーム２０４の拡大画像、復号済みのフレーム２０５のタイル５とタイル６の復号画像、及びフレーム２０６のタイル６の復号済み画素を参照して復号を行う。拡張レイヤ復号部７１０によって生成された拡張レイヤのタイルの復号画像はフレームメモリ７１１に出力され、フレームメモリ７１１で保持される。また、拡張レイヤ復号部７１０で生成された拡張レイヤの復号画像は、セレクタ７２０及び端子７１２を介して図６の表示部６０６に出力される。 In step S814, enhancement layer decoding section 710 decodes the enhancement layer code data of tile 6 to generate a decoded image, and stores the decoded image in frame memory 711. The enhancement layer decoding unit 710 performs decoding of the enhancement layer code data of the tile 6, the enlarged image input from the enlargement unit 709, the decoded enhancement layer decoded image stored in the frame memory 711, and the decoding target tile To the decoded pixel. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the base layer generated in step S813. Also, the enhancement layer decoding unit 710 performs inter-frame prediction with reference to the decoded image in the independent tile set at a position that is relatively equal to the position of the decoding target tile in the enhancement layer stored in the frame memory 711. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. Specifically, referring to FIG. 2, when decoding tile 6 of frame 206, an enlarged image of frame 204, decoded images of tile 5 and tile 6 of decoded frame 205, and tile 6 of frame 206. Decoding is performed with reference to the decoded pixels. The decoded image of the enhancement layer tile generated by the enhancement layer decoding unit 710 is output to the frame memory 711 and held in the frame memory 711. Also, the decoded image of the enhancement layer generated by the enhancement layer decoding unit 710 is output to the display unit 606 in FIG. 6 via the selector 720 and the terminal 712.

ステップＳ８１７にて、全体制御部７１４は、表示部分にかかる拡張レイヤの全てのタイルの符号データを復号したと判定し、ステップＳ８１８に進む。 In step S817, overall control unit 714 determines that the encoded data of all tiles of the enhancement layer related to the display portion has been decoded, and proceeds to step S818.

ステップＳ８１８にて、全体制御部７１４は、端子７０１から入力されるシーケンスに含まれる全てのフレームの、表示部分にかかるタイルの符号データの復号処理が終了したか否かを判定する。復号処理を行っていないフレームが存在する場合は（ステップＳ８１８のＮＯ）、ステップＳ８０５に進み、次のフレームの処理を行う。復号処理を行っていないフレームが存在しない場合は（ステップＳ８１８のＹＥＳ）、復号処理を終了する。 In step S818, overall control unit 714 determines whether or not the decoding process of the encoded data of the tiles related to the display portion of all the frames included in the sequence input from terminal 701 has been completed. If there is a frame that has not been decoded (NO in step S818), the process advances to step S805 to process the next frame. If there is no frame that has not been decoded (YES in step S818), the decoding process ends.

以上、表示領域（復号対象タイル）が独立タイルセットで構成されている場合について述べたが、独立タイルで構成されない場合について述べる。ステップＳ８０５までは前述のとおりである。 The case where the display area (decoding target tile) is configured with an independent tile set has been described above, but the case where it is not configured with an independent tile will be described. Steps up to step S805 are as described above.

ステップＳ８０６にて、独立タイル判定部７０６は、復号対象タイルが独立タイルではないと判定し、ステップＳ８０８に進む。ステップＳ８０８にて、復号対象レイヤが基本レイヤのみの場合と同様に、基本レイヤ復号部７０７は、基本レイヤのタイルを復号して復号画像を生成し、当該復号画像をフレームメモリ７０８に格納する。尚、ここでは拡張レイヤの表示を行うので、基本レイヤ復号部７０７は、生成した復号画像の、端子７１２からの出力は行わない。 In step S806, the independent tile determination unit 706 determines that the decoding target tile is not an independent tile, and the process advances to step S808. In step S808, as in the case where the decoding target layer is only the base layer, the base layer decoding unit 707 generates a decoded image by decoding the tiles of the base layer, and stores the decoded image in the frame memory 708. Since the enhancement layer is displayed here, the base layer decoding unit 707 does not output the generated decoded image from the terminal 712.

ステップＳ８０９にて、全体制御部７１４は、基本レイヤの１フレーム分の全てのタイルの符号データを復号したか否かを判定する。ここでは、全体制御部７１４は基本レイヤの１フレーム分の全てのタイルの符号化データを復号したと判定して、ステップＳ８１０へ進む。ステップＳ８１０にて、分離部７０４は、入力された表示制御信号に基づいて拡張レイヤまで表示することが指示されていると判定し、ステップＳ８１１に進む。ステップＳ８１１にて、分離部７０４は、表示部分にかかるタイルの位置情報を端子７０２から入力する。分離部７０４は入力された位置情報に基づいて、復号対象タイルであるタイルの拡張レイヤの符号データを抽出する。ステップＳ８１２にて、独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。ここでは、独立タイル判定部７０６は、復号対象タイルが独立タイルセットのタイルではない（復号対象タイルのタイル番号は独立タイル位置情報のタイル番号と一致しない）と判定し、ステップＳ８１５に進む。 In step S809, the overall control unit 714 determines whether the code data of all tiles for one frame of the base layer have been decoded. Here, overall control unit 714 determines that the encoded data of all tiles for one frame of the base layer has been decoded, and proceeds to step S810. In step S810, the separation unit 704 determines that it is instructed to display up to the enhancement layer based on the input display control signal, and proceeds to step S811. In step S 811, the separation unit 704 inputs tile position information related to the display portion from the terminal 702. Based on the input position information, the separation unit 704 extracts code data of the enhancement layer of the tile that is the decoding target tile. In step S812, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Here, the independent tile determination unit 706 determines that the decoding target tile is not a tile of the independent tile set (the tile number of the decoding target tile does not match the tile number of the independent tile position information), and the process proceeds to step S815.

ステップＳ８１５にて、復号対象タイルは独立タイルではない。拡大部７０９はフレームメモリ７０８に格納されている、復号済みの基本レイヤの復号画像から、復号対象タイルの位置と相対的に等しい位置の基本レイヤのタイルと、当該タイルの周辺の復号画像とを入力する。拡大部７０９は、入力された基本レイヤの復号画像を用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ復号部７１０に出力する。 In step S815, the decoding target tile is not an independent tile. The enlarging unit 709 extracts, from the decoded base layer decoded image stored in the frame memory 708, the base layer tile at a position relatively equal to the position of the decoding target tile and the decoded images around the tile. input. The enlarging unit 709 uses the input decoded image of the base layer to generate an enlarged image by filtering or the like, and outputs the enlarged image to the enhancement layer decoding unit 710.

ステップＳ８１６にて、拡張レイヤ復号部７１０は、分離部７０４から入力された復号対象タイルの拡張レイヤ符号データを復号する。拡張レイヤ復号部７１０は、拡大部７０９から入力される拡大画像と、フレームメモリ７１１に格納された復号済みの拡張レイヤの復号画像と、復号対象タイルの復号済みの画素とを参照して復号画像を生成する。即ち、拡張レイヤ復号部７１０は、ステップＳ８１５で生成された基本レイヤの拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ７１１に格納されている拡張レイヤの復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。拡張レイヤ復号部７１０によって生成された拡張レイヤのタイルの復号画像はフレームメモリ７１１に出力され、フレームメモリ７１１で保持される。また、拡張レイヤ復号部７１０で生成された拡張レイヤの復号画像は、端子７１２を介して図６の表示部６０６に出力される。 In step S816, enhancement layer decoding section 710 decodes the enhancement layer code data of the decoding target tile input from separation section 704. The enhancement layer decoding unit 710 refers to the enlarged image input from the enlargement unit 709, the decoded enhancement layer decoded image stored in the frame memory 711, and the decoded pixel of the decoding target tile. Is generated. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the base layer generated in step S815. Further, the enhancement layer decoding unit 710 performs inter-frame prediction with reference to the enhancement layer decoded image stored in the frame memory 711. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. The decoded image of the enhancement layer tile generated by the enhancement layer decoding unit 710 is output to the frame memory 711 and held in the frame memory 711. Also, the decoded image of the enhancement layer generated by the enhancement layer decoding unit 710 is output to the display unit 606 in FIG.

ステップＳ８１７にて、全体制御部７１４は、端子７０２から分離部７０４に入力された表示部分にかかるタイルの位置情報に基づく関係する全てのタイルが復号されたか否かを判定する。全てのタイルの復号処理が終わっていなければ（ステップＳ８１７のＮＯ）、ステップＳ８１１に戻り、分離部７０４は次のタイルを抽出して出力し、処理を続行する。表示部分にかかる全てのタイルの符号データの復号処理が終了していれば（ステップＳ８１７のＹＥＳ）、ステップＳ８１８に進む。 In step S817, the overall control unit 714 determines whether or not all related tiles have been decoded based on tile position information regarding the display portion input from the terminal 702 to the separation unit 704. If all tiles have not been decoded (NO in step S817), the process returns to step S811, and the separation unit 704 extracts and outputs the next tile, and continues the process. If the decoding process of the code data of all tiles related to the display portion has been completed (YES in step S817), the process proceeds to step S818.

ステップＳ８１８にて、全体制御部７１４は、全てのフレーム分の符号データの復号処理が終了したか否かを判定する。復号処理を行っていない符号データが存在する場合は（ステップＳ８１８のＮＯ）、ステップＳ８０５に進み、次のフレームの処理を行う。復号処理を行っていない符号データが存在しない場合は（ステップＳ８１８のＹＥＳ）、復号処理を終了する。 In step S818, overall control unit 714 determines whether or not the decoding processing of the code data for all the frames has been completed. If there is code data that has not been decoded (NO in step S818), the process proceeds to step S805 to process the next frame. If there is no code data that has not been decoded (YES in step S818), the decoding process ends.

図６に戻り、表示部６０６は表示制御部６０３から拡張レイヤの画像の表示を指示されている。このため、表示部６０６は、画像復号部６０５によって復号された拡張レイヤの復号画像を表示する。尚、拡張レイヤは基本レイヤよりも高解像度であるため、拡張レイヤの復号画像を表示することにより、表示部６０６は基本レイヤの画像の一部分を拡大表示したような効果が得ることができる。 Returning to FIG. 6, the display unit 606 is instructed by the display control unit 603 to display the enhancement layer image. Therefore, the display unit 606 displays the enhancement layer decoded image decoded by the image decoding unit 605. Since the enhancement layer has a higher resolution than the base layer, the display unit 606 can obtain the effect of enlarging and displaying a part of the base layer image by displaying the decoded image of the enhancement layer.

以上の構成と動作により、独立タイル及び独立タイルセットを使用する場合において、拡張レイヤと基本レイヤの各タイルの相対的な位置を一致させることができる。即ち、基本レイヤで独立復号タイルセットのタイルであれば、全ての拡張レイヤで当該タイルの位置と相対的に等しい位置のタイルは独立復号タイルセットのタイルとすることができる。これにより、階層符号化されたビットストリームを復号する場合に、いずれの階層においても独立タイルを最小の画像データの参照のみで復号できる。このように、予測において参照する画像データを減らすことにより、データの転送量を抑えたり、演算量を削減したり、低消費電力を実現することが可能となる。また、独立タイルの復号処理において基本レイヤから拡張レイヤまで各階層で、当該独立タイル以外のタイルを参照せずに独立に復号することにより、高速処理が可能となる。特に、符号化側で重要な領域に独立タイルセットを適応するように符号化してビットストリームを生成することで、当該ビットストリームを復号する場合において、当該重要な領域を高速に復号することができる。 With the above configuration and operation, when using independent tiles and independent tile sets, the relative positions of the tiles of the enhancement layer and the base layer can be matched. In other words, if the tile is an independent decoding tile set in the base layer, the tiles at positions relatively equal to the positions of the tiles in all enhancement layers can be used as the tiles of the independent decoding tile set. As a result, when decoding a hierarchically encoded bitstream, independent tiles can be decoded only by referring to the minimum image data in any hierarchy. As described above, by reducing the image data referred to in the prediction, it is possible to suppress the data transfer amount, reduce the calculation amount, and realize low power consumption. Further, in independent tile decoding processing, high-speed processing can be performed by independently decoding each layer from the basic layer to the enhancement layer without referring to tiles other than the independent tile. In particular, when a bit stream is generated by encoding so that an independent tile set is applied to an important area on the encoding side, the important area can be decoded at high speed when the bit stream is decoded. .

尚、本実施形態において、図２のように、復号対象のフレームより時間的に前のフレームのみを参照フレームとして予測及び復号する例を示したが、これに限定されない。即ち、複数フレームを参照して予測及び復号する場合においても同様に参照されることは上記の説明から明白である。 In the present embodiment, as shown in FIG. 2, an example is shown in which only a frame temporally prior to a decoding target frame is predicted and decoded as a reference frame, but the present invention is not limited to this. That is, it is clear from the above description that the same reference is made in the case of prediction and decoding with reference to a plurality of frames.

また、本実施形態において、拡大部７０９を用いた画像復号部６０５について説明したが、本発明はこれに限定されない。即ち、拡大部７０９を省略してもよい。または、拡大率を１とし、基本レイヤ復号部７０７で復号される量子化パラメータよりも拡張レイヤ復号部７１０で復号される量子化パラメータを小さくするようにしてもよい。これによって、ＳＮＲ階層データの復号を行うことが可能になる。 Moreover, although the image decoding part 605 using the expansion part 709 was demonstrated in this embodiment, this invention is not limited to this. That is, the enlargement unit 709 may be omitted. Alternatively, the expansion rate may be set to 1, and the quantization parameter decoded by the enhancement layer decoding unit 710 may be made smaller than the quantization parameter decoded by the base layer decoding unit 707. This makes it possible to decode the SNR layer data.

また、本実施形態において１フレームの符号データに全ての階層の符号データを含む例を取って説明したが、これに限定されず、レイヤ毎に入力されても構わない。例えば、記憶部６０２にレイヤ毎に符号データをまとめて格納しておき、拡張レイヤに関しては必要に応じてそこから符号データを切り出して読み出してももちろん構わない。 In the present embodiment, an example in which code data of all layers is included in one frame of code data has been described. However, the present invention is not limited to this and may be input for each layer. For example, the code data may be stored together in the storage unit 602 for each layer and the code data may be cut out and read out from the extension layer as necessary.

また、本実施形態において、基本レイヤと１階層の拡張レイヤ（全体で２階層）のある場合で説明したが、本発明はこれに限定されず、全体で３階層以上あっても構わない。この場合、拡張レイヤ復号部７１０、フレームメモリ７１１、及び拡大部７０９を１つのセットとして、当該セットを拡張レイヤの階層数分だけ設けることにより、より多くの階層に対応することができる。また、図９に示すように、拡張レイヤ復号部７１０、フレームメモリ９１１、及び拡大部９０９を１つずつ有し、各階層の復号において兼用で使用しても構わない。図９は、複数の階層の拡張レイヤを復号可能な画像復号装置であって、拡張レイヤ復号部７１０、フレームメモリ９１１、及び拡大部９０９を１つずつ有する画像復号装置のブロック図である。図９において、図７の画像復号部６０５の各処理部と同じ機能を果たすものについては同じ番号を付し、説明を省略する。９０８はフレームメモリであり、基本レイヤ復号部７０７で生成された復号画像を保持している。フレームメモリ９０８は、図７のフレームメモリ７０８とはセレクタ９２０への出力を行う機能が追加されていることが異なる。９０９は拡大部であり、図７の拡大部７０９とは、フレームメモリ９１１からの入力とフレームメモリ９０８からの入力を選択して入力が可能になっていることが異なる。９１１はフレームメモリであり、図７のフレームメモリ７１１とは、所望のタイルの符号データを拡大部９０９及びセレクタ９２０に出力する機能を付与されていることが異なる。９２０はセレクタであり、フレームメモリ９０８又はフレームメモリ９１１から所望の復号画像を選択して入力し、選択した復号画像を端子９１２に出力する。９１２は端子であり、セレクタ９２０から入力された復号画像を画像復号部６０５の外部に出力する。 Further, in the present embodiment, the case where there is a base layer and one extended layer (two layers in total) has been described, but the present invention is not limited to this, and there may be three or more layers in total. In this case, by providing the enhancement layer decoding unit 710, the frame memory 711, and the expansion unit 709 as one set and providing the set for the number of enhancement layer hierarchies, it is possible to deal with more hierarchies. Also, as shown in FIG. 9, one enhancement layer decoding unit 710, one frame memory 911, and one enlargement unit 909 may be provided and used in the decoding of each layer. FIG. 9 is a block diagram of an image decoding apparatus that can decode an enhancement layer of a plurality of hierarchies and includes an enhancement layer decoding unit 710, a frame memory 911, and an enlargement unit 909 one by one. In FIG. 9, the same functions as those of the respective processing units of the image decoding unit 605 in FIG. Reference numeral 908 denotes a frame memory, which holds the decoded image generated by the base layer decoding unit 707. The frame memory 908 is different from the frame memory 708 of FIG. 7 in that a function for outputting to the selector 920 is added. Reference numeral 909 denotes an enlargement unit, which is different from the enlargement unit 709 in FIG. 7 in that the input from the frame memory 911 and the input from the frame memory 908 are selected and input is possible. Reference numeral 911 denotes a frame memory, which is different from the frame memory 711 in FIG. 7 in that a function of outputting code data of a desired tile to the enlargement unit 909 and the selector 920 is given. A selector 920 selects and inputs a desired decoded image from the frame memory 908 or the frame memory 911, and outputs the selected decoded image to the terminal 912. Reference numeral 912 denotes a terminal which outputs the decoded image input from the selector 920 to the outside of the image decoding unit 605.

図９に示す画像復号部６０５を用いて復号処理を行う場合に、各処理部の動作を図１０に示したフローチャートを用いて以下に説明する。図１０は、図８のステップＳ８０５からステップＳ８１８を変更した部分のみを示している。図１０において、図８のステップと同様の機能を果たすステップに関しては図８と同じ番号を付与し、説明を省略する。また、本実施形態では実施形態１の図４に記載の画像符号化装置４００によって、図５の符号化方法で生成されたビットストリームであって、階層数が３であるビットストリームを復号する一例につて説明する。図８のステップＳ８０１からステップＳ８０４にて、前述の通り、ヘッダ復号部７０５はヘッダ符号データを復号する。ここではｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号は２である。 When the decoding process is performed using the image decoding unit 605 illustrated in FIG. 9, the operation of each processing unit will be described below with reference to the flowchart illustrated in FIG. FIG. 10 shows only a part obtained by changing step S805 to step S818 in FIG. 10, the steps having the same functions as those in FIG. 8 are given the same numbers as those in FIG. Further, in the present embodiment, an example of decoding a bitstream having the number of layers of 3 that is the bitstream generated by the encoding method of FIG. 5 by the image encoding apparatus 400 illustrated in FIG. 4 of the first embodiment. Will be described. In steps S801 to S804 in FIG. 8, the header decoding unit 705 decodes the header code data as described above. Here, the vps_max_layers_minus1 code is 2.

まず、復号対象レイヤが基本レイヤのみの場合について述べる。ここでは、ユーザが表示制御部６０３に、インターフェース６０１から入力されるビットストリームにおいて基本レイヤの全体の復号及び表示の開始を指示するとする。以下、前述の基本レイヤのみの表示時と同様に、図８のステップＳ８０５からステップＳ８０９によって、基本レイヤの１フレーム分の復号が終了しているとする。但し、基本レイヤ復号部７０７で生成された復号画像は全てフレームメモリ９０８に格納される。 First, the case where the decoding target layer is only the base layer will be described. Here, it is assumed that the user instructs the display control unit 603 to start decoding and display of the entire base layer in the bitstream input from the interface 601. Hereinafter, it is assumed that the decoding of one frame of the base layer is completed in steps S805 to S809 in FIG. However, all the decoded images generated by the base layer decoding unit 707 are stored in the frame memory 908.

ステップＳ１０１０にて、基本レイヤ復号部７０７又は拡張レイヤ復号部７１０は、復号済みの階層の階層数と表示制御部６０３が指示する表示する階層とを比較し、表示する階層が復号済みであるか否かを判定する。復号済みの階層数が表示する階層に達している場合（ステップＳ１０１０のＹＥＳ）、ステップＳ１００３に進み、達していない場合（ステップＳ１０１０のＮＯ）、ステップＳ１００１に進む。ここでは、分離部７０４が、端子７０２から入力される表示制御信号に基づいて、表示する階層は基本レイヤのみであると判定したとする。このため、基本レイヤ復号部７０７は、ステップＳ１０１０において表示する階層に達したと判断し、ステップＳ１００３に進む。 In step S1010, base layer decoding section 707 or enhancement layer decoding section 710 compares the number of decoded hierarchies with the display hierarchy indicated by display control section 603, and whether the display hierarchy has been decoded. Determine whether or not. If the number of decoded layers has reached the displayed layer (YES in step S1010), the process proceeds to step S1003. If not reached (NO in step S1010), the process proceeds to step S1001. Here, it is assumed that the separation unit 704 determines that the layer to be displayed is only the base layer based on the display control signal input from the terminal 702. For this reason, the base layer decoding unit 707 determines that the hierarchy to be displayed in step S1010 has been reached, and proceeds to step S1003.

ステップＳ１００３にて、セレクタ９２０は、復号された階層のうち、最下位の階層の復号画像を選択する。この場合、最下位の階層は基本レイヤであるので、セレクタ９２０は、フレームメモリ９０８から復号された基本レイヤの復号画像を読み出し、読み出した復号画像を端子９１２を介して図６の表示部６０６に出力する。そして、表示部６０６は、表示制御部６０３から基本レイヤの画像の表示が指示されることにより、画像復号部６０５から出力された基本レイヤの復号画像全体を表示する。 In step S1003, the selector 920 selects the decoded image of the lowest hierarchy among the decoded hierarchies. In this case, since the lowest layer is the base layer, the selector 920 reads the decoded image of the base layer decoded from the frame memory 908, and the read decoded image is displayed on the display unit 606 in FIG. Output. The display unit 606 displays the entire decoded image of the base layer output from the image decoding unit 605 when the display control unit 603 instructs the display of the image of the base layer.

続いて、復号対象レイヤが拡張レイヤの場合について述べる。ここでは、ユーザが表示制御部６０３に、インターフェース６０１から入力されるビットストリームにおいて拡張レイヤの復号と、拡張レイヤの復号画像の一部の表示を指示した場合の、復号について説明する。また、例として、表示する階層は第２拡張レイヤ（階層数は３）として説明を行う。さらに、本実施形態では説明を簡単にするために、表示する領域に含まれるタイルを図２のタイル５とタイル６の領域とする。復号動作については基本レイヤのみの復号及び表示が指示された場合と同様に、図１０に示したフローチャートに基づいて説明する。また、基本レイヤのみの復号と同じ動作を行う部分は説明を簡略化する。 Next, a case where the decoding target layer is an enhancement layer will be described. Here, a description will be given of decoding when the user instructs the display control unit 603 to decode the enhancement layer in the bitstream input from the interface 601 and display a part of the decoded image of the enhancement layer. In addition, as an example, the hierarchy to be displayed is described as the second enhancement layer (the number of hierarchies is 3). Further, in the present embodiment, for the sake of simplicity, the tiles included in the display area are the areas of the tile 5 and the tile 6 in FIG. The decoding operation will be described based on the flowchart shown in FIG. 10 in the same manner as when decoding and display of only the base layer is instructed. Also, the description of the part that performs the same operation as the decoding of only the base layer is simplified.

ステップＳ８０６にて、独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。ここでは、復号対象タイルであるタイル５は独立タイルであるので、ステップ８０７に進む。ステップＳ８０７にて、基本レイヤ復号部７０７は、基本レイヤのタイル５の復号データを復号して復号画像を生成し、当該復号画像をフレームメモリ９０８へ格納する。ステップＳ８０９にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる基本レイヤの全てのタイルの符号データを復号したか否かを判定する。 In step S806, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Here, since tile 5 that is the decoding target tile is an independent tile, the process proceeds to step 807. In step S807, the base layer decoding unit 707 decodes the decoded data of the tile 5 of the base layer to generate a decoded image, and stores the decoded image in the frame memory 908. In step S809, the overall control unit 714 determines whether the code data of all tiles of the base layer related to the display portion input from the separation unit 704 has been decoded.

ステップＳ１０１０にて、基本レイヤ復号部７０７又は拡張レイヤ復号部７１０は、復号済みの階層の階層数と表示制御部６０３が指示する表示する階層とを比較し、表示する階層が復号済みであるか否かを判定する。ここでは、端子７０２から入力される表示制御信号によれば、表示する階層は第２拡張レイヤ（階層数は３）である。したがって、拡張レイヤ復号部７１０は、表示する階層が復号済みでないと判断し、ステップＳ１００１に進む。 In step S1010, base layer decoding section 707 or enhancement layer decoding section 710 compares the number of decoded hierarchies with the display hierarchy indicated by display control section 603, and whether the display hierarchy has been decoded. Determine whether or not. Here, according to the display control signal input from the terminal 702, the layer to be displayed is the second enhancement layer (the number of layers is 3). Therefore, enhancement layer decoding section 710 determines that the hierarchy to be displayed has not been decoded, and proceeds to step S1001.

ステップＳ１００１にて、拡張レイヤ復号部７１０は、ステップＳ８０７乃至ステップＳ８０８で復号された基本レイヤ、又は後述するステップＳ１０１４乃至ステップＳ１０１６で復号された階層の拡張レイヤを上位レイヤとする。さらに、続く復号対象の拡張レイヤを下位レイヤとする。最初はステップＳ８０７乃至ステップＳ８０８で符号化された基本レイヤを上位レイヤとし、第１拡張レイヤを下位レイヤとして設定する。 In step S1001, enhancement layer decoding section 710 sets the base layer decoded in steps S807 to S808 or the enhancement layer of the hierarchy decoded in steps S1014 to S1016 described later as an upper layer. Further, the subsequent enhancement target decoding layer is set as a lower layer. Initially, the base layer encoded in steps S807 to S808 is set as an upper layer, and the first enhancement layer is set as a lower layer.

ステップＳ１０１１にて、分離部７０４は、端子７０２から入力された表示部分にかかるタイルの位置情報を入力する。本説明では、当該表示部分にかかるタイルの位置はタイル５とタイル６である。そして、分離部７０４は、端子７０２から入力された位置情報に基づいて、バッファ７０３に格納された階層符号データのうち復号対象タイルであるタイル５の下位レイヤ（第１拡張レイヤ）の符号データを抽出する。さらに、分離部７０４は、抽出した符号データを拡張レイヤ復号部７１０に出力する。また、分離部７０４は、そのタイル位置情報を独立タイル判定部７０６に入力する。 In step S 1011, the separation unit 704 inputs tile position information regarding the display portion input from the terminal 702. In this description, the positions of the tiles related to the display portion are the tile 5 and the tile 6. Then, based on the position information input from the terminal 702, the separation unit 704 converts the code data of the lower layer (first enhancement layer) of the tile 5 that is the decoding target tile among the hierarchical code data stored in the buffer 703. Extract. Furthermore, the separation unit 704 outputs the extracted code data to the enhancement layer decoding unit 710. Further, the separation unit 704 inputs the tile position information to the independent tile determination unit 706.

ステップＳ８１２にて、独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。タイル番号が一致すればステップＳ１０１３に進み、一致しなければステップＳ１０１５に進む。ここでは、独立タイル位置情報は５と６であり、復号対象タイルであるタイル５は独立タイル位置情報のタイル番号と一致する。したがって、独立タイル判定部７０６は、復号対象タイルが独立タイルセットのタイルであると判定し、ステップＳ１０１３に進む。 In step S812, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. If the tile numbers match, the process proceeds to step S1013. If the tile numbers do not match, the process proceeds to step S1015. Here, the independent tile position information is 5 and 6, and the tile 5 that is the decoding target tile matches the tile number of the independent tile position information. Therefore, the independent tile determination unit 706 determines that the decoding target tile is a tile of the independent tile set, and proceeds to step S1013.

ステップＳ１０１３にて、復号対象タイルは独立タイルである。拡大部９０９は、上位レイヤが基本レイヤであることから、フレームメモリ９０８に格納されている基本レイヤの復号画像から復号対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる独立タイルの復号画像を入力する。拡大部９０９は、入力された独立タイルの復号画像のみを用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ復号部７１０へ出力する。 In step S1013, the decoding target tile is an independent tile. Since the upper layer is the base layer, the enlargement unit 909 determines the independent tiles included in the independent tile set at a position relatively equal to the position of the decoding target tile from the decoded image of the base layer stored in the frame memory 908. Input the decoded image. The enlargement unit 909 uses only the input decoded image of the independent tile to generate an enlarged image by filtering or the like, and outputs the enlarged image to the enhancement layer decoding unit 710.

ステップＳ１０１４にて、ステップＳ８１４と同様に、拡張レイヤ復号部７１０は分離部７０４から入力された復号対象タイルの下位レイヤ（第１拡張レイヤ）の符号データを復号する。拡張レイヤ復号部７１０は、拡大部９０９から入力される拡大画像と、フレームメモリ９１１に格納された復号済みの拡張レイヤ（第１拡張レイヤ）の復号画像と、復号対象タイルの復号済みの画素とを参照して復号画像を生成する。即ち、拡張レイヤ復号部７１０は、ステップＳ１０１３で生成された上位レイヤ（基本レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ９１１に格納されている下位レイヤ（第１拡張レイヤ）の復号画像のうち復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。拡張レイヤ復号部７１０で復号された下位レイヤ（第１拡張レイヤ）のタイルの復号画像はフレームメモリ９１１に出力され、フレームメモリ９１１で保持される。 In step S1014, as in step S814, the enhancement layer decoding unit 710 decodes the code data of the lower layer (first enhancement layer) of the decoding target tile input from the separation unit 704. The enhancement layer decoding unit 710, the enlarged image input from the enlargement unit 909, the decoded image of the decoded enhancement layer (first enhancement layer) stored in the frame memory 911, and the decoded pixels of the decoding target tile To generate a decoded image. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the upper layer (base layer) generated in step S1013. Also, the enhancement layer decoding unit 710 displays the decoded image in the independent tile set at a position relatively equal to the position of the decoding target tile among the decoded images of the lower layer (first enhancement layer) stored in the frame memory 911. Inter-frame prediction is performed with reference. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. The decoded image of the lower layer (first enhancement layer) tile decoded by the enhancement layer decoding unit 710 is output to the frame memory 911 and held in the frame memory 911.

ステップＳ１０１７にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる下位レイヤ（第１拡張レイヤ）の全てのタイルの符号データを復号したか否かを判定する。ここでは、タイル６の拡張レイヤの符号データの復号が終わっていないため、ステップＳ１０１１に戻り、タイル６の下位レイヤ（第１拡張レイヤ）の符号データの復号を行う。 In step S1017, overall control unit 714 determines whether or not the code data of all tiles in the lower layer (first enhancement layer) related to the display portion input from separation unit 704 has been decoded. Here, since the decoding of the code data of the enhancement layer of tile 6 has not been completed, the process returns to step S1011 and the code data of the lower layer (first enhancement layer) of tile 6 is decoded.

以下、タイル６の下位レイヤ（第１拡張レイヤ）の符号データの復号について説明する。 Hereinafter, decoding of the code data of the lower layer (first enhancement layer) of the tile 6 will be described.

ステップＳ１０１１にて、分離部７０４は、バッファ７０３に格納された階層符号データのうち復号対象タイルであるタイル６の下位レイヤ（第１拡張レイヤ）の符号データを抽出する。ステップＳ８１２にて、独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。ここでは、独立タイル判定部７０６は、復号対象タイルであるタイル６が独立タイルであると判定し、ステップＳ１０１３に進む。 In step S1011, the separation unit 704 extracts code data of the lower layer (first enhancement layer) of the tile 6 that is the decoding target tile from the hierarchical code data stored in the buffer 703. In step S812, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Here, the independent tile determination unit 706 determines that the tile 6 that is the decoding target tile is an independent tile, and proceeds to step S1013.

ステップＳ１０１３にて、拡大部９０９は、入力された上位レイヤ（基本レイヤ）の独立タイルの復号画像のみを用いて、拡大画像を生成する。即ち、拡大部９０９はフレームメモリ９０８から復号画像を入力して、フィルタリング等で拡大して拡大画像を生成する。 In step S1013, the enlargement unit 909 generates an enlarged image using only the input decoded image of the independent layer of the higher layer (basic layer). That is, the enlargement unit 909 receives the decoded image from the frame memory 908 and enlarges it by filtering or the like to generate an enlarged image.

ステップＳ１０１４にて、拡張レイヤ復号部７１０は、タイル６の下位レイヤ（第１拡張レイヤ）の符号データを復号して復号画像を生成し、当該復号画像をフレームメモリ９１１へ格納する。拡張レイヤ復号部７１０は、タイル６の下位レイヤ（第１拡張レイヤ）の符号データの復号において、拡大部９０９から入力される拡大画像とフレームメモリ９１１に格納された復号済みの拡張レイヤの復号画像と復号対象タイルの復号済みの画素とを参照する。即ち、拡張レイヤ復号部７１０は、ステップＳ１０１３で生成された上位レイヤ（基本レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ９１１に格納されている下位レイヤ（第１拡張レイヤ）の復号画像のうち復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。さらに、拡張レイヤ復号部７１０で復号された下位レイヤ（第１拡張レイヤ）のタイルの復号画像はフレームメモリ９１１に出力され、フレームメモリ９１１で保持される。 In step S1014, enhancement layer decoding section 710 decodes the code data of the lower layer (first enhancement layer) of tile 6 to generate a decoded image, and stores the decoded image in frame memory 911. The enhancement layer decoding unit 710 decodes the encoded data of the lower layer (first enhancement layer) of the tile 6, the enlarged image input from the enlargement unit 909, and the decoded image of the decoded enhancement layer stored in the frame memory 911 And the decoded pixel of the decoding target tile. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the upper layer (base layer) generated in step S1013. Also, the enhancement layer decoding unit 710 displays the decoded image in the independent tile set at a position relatively equal to the position of the decoding target tile among the decoded images of the lower layer (first enhancement layer) stored in the frame memory 911. Inter-frame prediction is performed with reference. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. Further, the decoded image of the lower layer (first enhancement layer) tile decoded by the enhancement layer decoding unit 710 is output to the frame memory 911 and held in the frame memory 911.

ステップＳ１０１７にて、全体制御部７１４は、表示部分にかかる下位レイヤ（第１拡張レイヤ）の全てのタイルの符号データを復号したと判定し、ステップＳ１００２に進む。 In step S1017, overall control unit 714 determines that the encoded data of all tiles of the lower layer (first enhancement layer) related to the display portion has been decoded, and proceeds to step S1002.

ステップＳ１００２にて、全体制御部７１４は、復号されたｖｐｓ＿ｍａｘ＿ｌａｙｅｒｓ＿ｍｉｎｕｓ１符号で表される全ての階層について符号化が終了したか否かを判定する。全ての階層のタイルの復号処理が終わっていなければ（ステップＳ１００２のＮＯ）、ステップＳ１０１０に戻り、表示の判定を行う。全ての階層のタイルの復号処理が終了していれば（ステップＳ１００２のＹＥＳ）、ステップＳ１００３に進む。ここでは、拡張レイヤの復号処理が終了していないため、拡張レイヤ復号部７１０は、全ての階層のタイルの復号処理が終了していないと判定し、ステップＳ１０１０に戻る。 In step S1002, overall control unit 714 determines whether or not encoding has been completed for all layers represented by the decoded vps_max_layers_minus1 code. If decoding of tiles in all layers is not completed (NO in step S1002), the process returns to step S1010 to determine display. If decoding of tiles in all layers has been completed (YES in step S1002), the process proceeds to step S1003. Here, since the enhancement layer decoding process has not ended, the enhancement layer decoding unit 710 determines that the decoding process of tiles of all layers has not been completed, and returns to step S1010.

以下、第２拡張レイヤの復号を行う。即ち、ステップＳ１０１０にて、拡張レイヤ復号部７１０は、表示する階層が復号済みであるか否かを判定する。端子７０２から入力される表示制御信号によれば、表示する階層は第２拡張レイヤである。ここでは、拡張レイヤ復号部７１０は、第１拡張レイヤまでしか復号していない（第２拡張レイヤは復号済みでない）と判定するため、ステップＳ１００１に進む。ステップＳ１００１にて、拡張レイヤ復号部７１０は、ステップＳ１０１４乃至ステップＳ１０１６で復号された第１拡張レイヤを上位レイヤとし、第２拡張レイヤを下位レイヤとして設定する。 Hereinafter, decoding of the second enhancement layer is performed. That is, in step S1010, enhancement layer decoding section 710 determines whether or not the hierarchy to be displayed has been decoded. According to the display control signal input from the terminal 702, the display layer is the second enhancement layer. Here, enhancement layer decoding section 710 proceeds to step S1001 in order to determine that only the first enhancement layer has been decoded (the second enhancement layer has not been decoded). In step S1001, enhancement layer decoding section 710 sets the first enhancement layer decoded in steps S1014 to S1016 as an upper layer and the second enhancement layer as a lower layer.

ステップＳ１０１１にて、分離部７０４は、バッファ７０３に格納された階層符号データのうち下位レイヤ（第２拡張レイヤ）のタイルの符号データを抽出し、拡張レイヤ復号部７１０に入力する。ここではまず、分離部７０４は、タイル５の下位レイヤ（第２拡張レイヤ）の符号データを抽出し、抽出した符号データを拡張レイヤ復号部７１０に入力する。ステップＳ８１２にて、独立タイル判定部７０６は、復号対象タイルであるタイル５が独立タイルであると判定し、ステップＳ１０１３に進む。ステップＳ１０１３にて、拡大部９０９は、上位レイヤが拡張レイヤ（第１拡張レイヤ）である。このため、拡大部９０９は、フレームメモリ９０８に格納されている上位レイヤ（第１拡張レイヤ）の復号画像から復号対象タイルの位置と相対的に等しい位置の独立タイルセットに含まれる独立タイルの復号画像を入力する。即ち、拡大部９０９は入力された上位レイヤ（第１拡張レイヤ）の独立タイルの復号画像のみを用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ復号部７１０に入力する。 In step S 1011, the separation unit 704 extracts code data of a lower layer (second enhancement layer) tile from the hierarchical code data stored in the buffer 703, and inputs the code data to the enhancement layer decoding unit 710. Here, first, the separation unit 704 extracts code data of a lower layer (second enhancement layer) of the tile 5 and inputs the extracted code data to the enhancement layer decoding unit 710. In step S812, the independent tile determination unit 706 determines that the tile 5 that is the decoding target tile is an independent tile, and proceeds to step S1013. In step S1013, the enlargement unit 909 has the enhancement layer (first enhancement layer) as the upper layer. Therefore, the enlargement unit 909 decodes the independent tiles included in the independent tile set at a position relatively equal to the position of the decoding target tile from the decoded image of the upper layer (first enhancement layer) stored in the frame memory 908. Enter an image. That is, the enlargement unit 909 uses only the decoded image of the input independent tile of the upper layer (first enhancement layer) to generate an enlarged image by filtering or the like, and sends the enlarged image to the enhancement layer decoding unit 710. input.

ステップＳ１０１４にて、拡張レイヤ復号部７１０は、分離部７０４から入力された復号対象タイルの下位レイヤ（第２拡張レイヤ）の符号データを復号する。拡張レイヤ復号部７１０は、次の画像を参照して復号画像を生成する。即ち、拡張レイヤ復号部７１０は、拡大部９０９から入力される上位レイヤ（第１拡張レイヤ階層）の拡大画像と、フレームメモリ９１１に格納された復号済みの拡張レイヤ（第２拡張レイヤ）の復号画像と、復号対象タイルの復号済みの画素とを参照する。即ち、拡張レイヤ復号部７１０は、ステップＳ１０１３で生成された上位レイヤ（第１拡張レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ９１１に格納されている下位レイヤ（第１拡張レイヤ）の復号画像のうち復号対象タイルの位置と相対的に等しい位置の独立タイルセット内の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。さらに、拡張レイヤ復号部７１０で復号された下位レイヤ（第２拡張レイヤ）のタイルの復号画像はフレームメモリ９１１に出力され、フレームメモリ９１１で保持される。 In step S1014, enhancement layer decoding section 710 decodes the code data of the lower layer (second enhancement layer) of the decoding target tile input from separation section 704. The enhancement layer decoding unit 710 generates a decoded image with reference to the next image. That is, the enhancement layer decoding unit 710 decodes the enlarged image of the upper layer (first enhancement layer hierarchy) input from the enlargement unit 909 and the decoded enhancement layer (second enhancement layer) stored in the frame memory 911. The image and the decoded pixel of the decoding target tile are referred to. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the upper layer (first enhancement layer) generated in step S1013. Also, the enhancement layer decoding unit 710 displays the decoded image in the independent tile set at a position relatively equal to the position of the decoding target tile among the decoded images of the lower layer (first enhancement layer) stored in the frame memory 911. Inter-frame prediction is performed with reference. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. Further, the decoded image of the lower layer (second enhancement layer) tile decoded by the enhancement layer decoding unit 710 is output to the frame memory 911 and held in the frame memory 911.

ステップＳ１０１７にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる下位レイヤ（第２拡張レイヤ）の全てのタイルの符号データを復号したか否かを判定する。ここでは、タイル６の拡張レイヤの符号データの復号が終わっていないため、ステップＳ１０１１に戻り、タイル６の下位レイヤ（第２拡張レイヤ）の符号データの復号を行う。タイル６の下位レイヤの復号については、上位レイヤを第１拡張レイヤ階層とし、下位レイヤを第２拡張レイヤとすれば、前述のとおりタイル５の第２拡張レイヤの符号データの復号処理と同様であるので、説明を省略する。 In step S1017, overall control unit 714 determines whether or not the code data of all tiles in the lower layer (second enhancement layer) related to the display portion input from separation unit 704 has been decoded. Here, since the decoding of the code data of the enhancement layer of tile 6 has not been completed, the process returns to step S1011 and the code data of the lower layer (second enhancement layer) of tile 6 is decoded. The decoding of the lower layer of tile 6 is the same as the decoding process of the code data of the second enhancement layer of tile 5 as described above, assuming that the upper layer is the first enhancement layer hierarchy and the lower layer is the second enhancement layer. Since there is, description is abbreviate | omitted.

ステップＳ１００２にて、全体制御部７１４は、第２拡張レイヤまで復号したので、全ての階層のタイルの復号処理が終わったと判定し、ステップＳ１００３に進む。ステップＳ１００３にて、セレクタ９２０は、復号された階層のうち、最下位の階層の復号画像を選択する。この場合、最下位の階層は第２拡張レイヤであるので、セレクタ９２０はフレームメモリ９１１から第２拡張レイヤの復号画像を読み出し、当該第２拡張レイヤの復号画像を端子９１２を介して図６の表示部６０６に出力する。そして、表示部６０６は、表示制御部６０３から第２拡張レイヤの画像の表示が指示されることにより、画像復号部６０５から出力された第２拡張レイヤの復号画像全体を表示部６０６は表示する。 In step S1002, since the overall control unit 714 has decoded up to the second enhancement layer, the overall control unit 714 determines that the decoding process for all the tiles has been completed, and the process advances to step S1003. In step S1003, the selector 920 selects the decoded image of the lowest hierarchy among the decoded hierarchies. In this case, since the lowest hierarchy is the second enhancement layer, the selector 920 reads out the decoded image of the second enhancement layer from the frame memory 911 and sends the decoded image of the second enhancement layer to the terminal 912 in FIG. The data is output to the display unit 606. The display unit 606 displays the entire decoded image of the second enhancement layer output from the image decoding unit 605 when the display control unit 603 instructs the display of the second enhancement layer image. .

尚、上記において、表示する階層を第２拡張レイヤ（階層数は３）として説明を行った。しかしながら、階層符号化の符号データの階層数が３以上であり、表示する階層を第１拡張レイヤ（階層数は２）とした場合、第１拡張レイヤの復号が終了した（ステップＳ１００２にてＮＯ）後に、スタップＳ１０１０にてステップＳ１００３に進む。このため、第２拡張レイヤより上位の階層の符号データの復号の復号は行われない。 In the above description, the hierarchy to be displayed is described as the second enhancement layer (the number of hierarchies is 3). However, when the number of hierarchies of the encoded data of the hierarchical coding is 3 or more and the hierarchy to be displayed is the first enhancement layer (the number of hierarchies is 2), decoding of the first enhancement layer is completed (NO in step S1002). ) After that, the process proceeds to step S1003 in step S1010. For this reason, the decoding of the decoding of the code data of a hierarchy higher than a 2nd enhancement layer is not performed.

ステップＳ８０６にて、独立タイル判定部７０６は、復号対象タイルが独立タイルではないと判定し、ステップＳ８０８に進む。ステップＳ８０８にて、復号対象レイヤが基本レイヤのみの場合と同様に、基本レイヤ復号部７０７は、基本レイヤのタイルを復号して復号画像を生成し、当該復号画像をフレームメモリ９０８に格納する。 In step S806, the independent tile determination unit 706 determines that the decoding target tile is not an independent tile, and the process advances to step S808. In step S808, as in the case where the decoding target layer is only the base layer, the base layer decoding unit 707 generates a decoded image by decoding the tiles of the base layer, and stores the decoded image in the frame memory 908.

ステップＳ８０９にて、全体制御部７１４は、基本レイヤの１フレーム分の全てのタイルの符号データを復号したか否かを判定する。ここでは、基本レイヤ復号部７０７は基本レイヤの１フレーム分の全てのタイルの符号化データを復号したと判定して、ステップＳ１０１０へ進む。ステップＳ１０１０にて、基本レイヤ復号部７０７又は拡張レイヤ復号部７１０は、第２拡張レイヤまで表示するので、表示する階層が復号済みでないと判定し、ステップＳ１００１に進む。 In step S809, the overall control unit 714 determines whether the code data of all tiles for one frame of the base layer have been decoded. Here, base layer decoding section 707 determines that the encoded data of all tiles for one frame of the base layer has been decoded, and proceeds to step S1010. In step S1010, since base layer decoding section 707 or enhancement layer decoding section 710 displays up to the second enhancement layer, it determines that the layer to be displayed has not been decoded, and proceeds to step S1001.

ステップＳ１００１にて、拡張レイヤ復号部７１０は、ステップＳ８０８で復号された基本レイヤを上位レイヤとし、続く復号対象の拡張レイヤ（第１拡張レイヤ）を下位レイヤとする。ステップＳ１０１１にて、分離部７０４は、端子７０２から入力された表示部分にかかるタイルの位置情報を入力する。そして、分離部７０４は、入力された位置情報に基づいて、バッファ７０３に格納された階層符号データのうち復号対象タイルの下位レイヤ（第１拡張レイヤ）の符号データを抽出する。ステップＳ８１２にて、独立タイル判定部７０６は、復号対象タイルのタイル番号と独立タイル位置情報のタイル番号とを比較する。ここでは、復号対象タイルであるタイル５は独立タイル位置情報のタイル番号と一致しない。従って、独立タイル判定部７０６は、復号対象タイルが独立タイルセットのタイルではないと判定し、ステップＳ１０１５に進む。 In step S1001, enhancement layer decoding section 710 sets the base layer decoded in step S808 as an upper layer, and sets the subsequent decoding target enhancement layer (first enhancement layer) as a lower layer. In step S 1011, the separation unit 704 inputs tile position information regarding the display portion input from the terminal 702. Then, based on the input position information, the separation unit 704 extracts code data of a lower layer (first enhancement layer) of the decoding target tile from the hierarchical code data stored in the buffer 703. In step S812, the independent tile determination unit 706 compares the tile number of the decoding target tile with the tile number of the independent tile position information. Here, the tile 5 that is the decoding target tile does not match the tile number of the independent tile position information. Therefore, the independent tile determination unit 706 determines that the decoding target tile is not a tile of the independent tile set, and the process proceeds to step S1015.

ステップＳ１０１５にて、拡大部９０９はフレームメモリ９０８に格納されている上位レイヤ（基本レイヤ）の復号画像から、復号対象タイルの位置と相対的に等しい位置の基本レイヤのタイルと、当該タイルの周辺の復号画像とを入力する。拡大部９０９は、入力された基本レイヤのタイルの復号画像のみを用いて、フィルタリング等で拡大して拡大画像を生成し、当該拡大画像を拡張レイヤ復号部７１０に出力する。 In step S1015, the enlargement unit 909 determines, from the decoded image of the upper layer (basic layer) stored in the frame memory 908, the tile of the basic layer at a position relatively equal to the position of the decoding target tile, and the periphery of the tile The decoded image is input. The enlargement unit 909 uses only the input decoded image of the base layer tile to generate an enlarged image by filtering or the like, and outputs the enlarged image to the enhancement layer decoding unit 710.

ステップＳ１０１６にて、拡張レイヤ復号部７１０は、分離部７０４から入力された復号対象タイルの下位レイヤ（第１拡張レイヤ）の符号データを復号する。拡張レイヤ復号部７１０は、以下を参照して予測画像を生成する。即ち、拡大部９０９から入力される上位レイヤ（基本レイヤ）の拡大画像と、フレームメモリ９１１に格納された復号済みの下位レイヤ（第１拡張レイヤ）の復号画像と、復号対象タイルの下位レイヤ（第１拡張レイヤ）の復号済みの画素とを参照する。さらに、拡張レイヤ復号部７１０は、参照により生成した予測画像と復号した予測誤差から復号画像を生成する。即ち、拡張レイヤ復号部７１０は、ステップＳ１０１５で生成された上位レイヤ（基本レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ７１１に格納されている下位レイヤ（第１拡張レイヤ）の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、下位レイヤ（第１拡張レイヤ）の復号対象タイル内の復号画像を参照してイントラ予測を行う。拡張レイヤ復号部７１０によって生成された下位レイヤ（第１拡張レイヤ）のタイルの復号画像はフレームメモリ９１１に出力され、フレームメモリ９１１で保持される。 In step S1016, enhancement layer decoding section 710 decodes the code data of the lower layer (first enhancement layer) of the decoding target tile input from separation section 704. The enhancement layer decoding unit 710 generates a prediction image with reference to the following. That is, the enlarged image of the upper layer (basic layer) input from the enlargement unit 909, the decoded image of the decoded lower layer (first enhancement layer) stored in the frame memory 911, and the lower layer ( Reference is made to the decoded pixels of the first enhancement layer. Furthermore, the enhancement layer decoding unit 710 generates a decoded image from the predicted image generated by reference and the decoded prediction error. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the upper layer (base layer) generated in step S1015. Further, the enhancement layer decoding unit 710 performs inter-frame prediction with reference to the decoded image of the lower layer (first enhancement layer) stored in the frame memory 711. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile of the lower layer (first enhancement layer). The decoded image of the lower layer (first enhancement layer) tile generated by the enhancement layer decoding unit 710 is output to the frame memory 911 and held in the frame memory 911.

ステップＳ１０１７にて、全体制御部７１４は、表示部分にかかる下位レイヤ（第１拡張レイヤ）の全てのタイルの符号データの復号処理を終了したか否かを判定する。ここでは拡張レイヤ復号部７１０は、第１拡張レイヤの全てのタイルの符号データの復号処理を終了したと判定し、ステップＳ１００２に進む。ステップＳ１００２にて、全体制御部７１４は、全ての階層について復号処理が終了したか否かを判定する。ここでは、拡張レイヤ復号部７１０は、第２拡張レイヤの復号処理が終了していないと判定し、ステップＳ１０１０に戻る。 In step S1017, overall control unit 714 determines whether or not the decoding processing of the code data of all tiles of the lower layer (first enhancement layer) related to the display portion has been completed. Here, the enhancement layer decoding unit 710 determines that the decoding processing of the code data of all the tiles of the first enhancement layer has been completed, and proceeds to step S1002. In step S1002, overall control unit 714 determines whether or not the decoding process has been completed for all layers. Here, enhancement layer decoding section 710 determines that the decoding process of the second enhancement layer has not ended, and returns to step S1010.

以下、第２拡張レイヤの復号を行う。ステップＳ１０１０にて、拡張レイヤ復号部７１０は、表示する階層である第２拡張レイヤ階層の復号が終わっていないと判定し、ステップＳ１００１に進む。ステップＳ１００１にて、拡張レイヤ復号部７１０は、ステップＳ１０１６で復号された第１拡張レイヤを上位レイヤとし、第２拡張レイヤを下位レイヤとして設定する。ステップＳ１０１１にて、分離部７０４は、下位レイヤ（第２拡張レイヤ）の復号対象タイルの符号データを抽出し、当該符号データを拡張レイヤ復号部７１０へ出力する。ステップＳ８１２にて、独立タイル判定部７０６は、復号対象タイルが独立タイルセットのタイルではないと判定し、ステップＳ１０１５に進む。ステップＳ１０１５にて、拡大部９０９は、上位レイヤが拡張レイヤ（第１拡張レイヤ）であることから、フレームメモリ９０８に格納されている拡張レイヤ（第１拡張レイヤ）レイヤの復号画像を入力する。そして、拡大部９０９は、入力された上位レイヤ（第１拡張レイヤ）の復号画像を用いて、フィルタリング等で拡大して拡大画像を生成する。この時、復号対象タイルの位置と相対的に等しい位置のタイルと、当該タイルの周囲の画素を用いて拡大画像を生成してもよい。さらに、拡大部９０９は、生成した拡大画像を拡張レイヤ復号部７１０に入力する。 Hereinafter, decoding of the second enhancement layer is performed. In step S1010, enhancement layer decoding section 710 determines that decoding of the second enhancement layer hierarchy that is the hierarchy to be displayed has not been completed, and proceeds to step S1001. In step S1001, enhancement layer decoding section 710 sets the first enhancement layer decoded in step S1016 as the upper layer and the second enhancement layer as the lower layer. In step S1011, the separation unit 704 extracts code data of the decoding target tile of the lower layer (second enhancement layer), and outputs the code data to the enhancement layer decoding unit 710. In step S812, the independent tile determination unit 706 determines that the decoding target tile is not a tile of the independent tile set, and proceeds to step S1015. In step S1015, since the upper layer is the enhancement layer (first enhancement layer), the enlargement unit 909 inputs the decoded image of the enhancement layer (first enhancement layer) layer stored in the frame memory 908. Then, the enlarging unit 909 generates an enlarged image by enlarging by filtering or the like using the input decoded image of the upper layer (first enhancement layer). At this time, an enlarged image may be generated using a tile at a position relatively equal to the position of the decoding target tile and pixels around the tile. Furthermore, the enlargement unit 909 inputs the generated enlarged image to the enhancement layer decoding unit 710.

ステップＳ１０１４にて、拡張レイヤ復号部７１０は、分離部７０４から入力された復号対象タイルの下位レイヤ（第２拡張レイヤ）の符号データを復号する。拡張レイヤ復号部７１０は拡大部９０９から入力される上位レイヤ（第１拡張レイヤ）の拡大画像と、フレームメモリ９１１に格納された復号済みの拡張レイヤ（第２拡張レイヤ）の復号画像と、復号対象タイルの復号済みの画素とを参照して復号画像を生成する。即ち、拡張レイヤ復号部７１０はステップＳ１０１５で生成された上位レイヤ（第１拡張レイヤ）の拡大画像を参照してレイヤ間予測を行う。また、拡張レイヤ復号部７１０は、フレームメモリ９１１に格納されている下位レイヤ（第１拡張レイヤ）の復号画像を参照してフレーム間予測を行う。さらに、拡張レイヤ復号部７１０は、復号対象タイル内の復号画像を参照してイントラ予測を行う。拡張レイヤ復号部７１０で復号された下位レイヤ（第２拡張レイヤ）のタイルの復号画像はフレームメモリ９１１に出力され、フレームメモリ９１１で保持される。 In step S1014, enhancement layer decoding section 710 decodes the code data of the lower layer (second enhancement layer) of the decoding target tile input from separation section 704. The enhancement layer decoding unit 710 receives the enlarged image of the upper layer (first enhancement layer) input from the enlargement unit 909, the decoded image of the decoded enhancement layer (second enhancement layer) stored in the frame memory 911, and the decoding A decoded image is generated with reference to the decoded pixels of the target tile. That is, the enhancement layer decoding unit 710 performs inter-layer prediction with reference to the enlarged image of the upper layer (first enhancement layer) generated in step S1015. Further, the enhancement layer decoding unit 710 performs inter-frame prediction with reference to the decoded image of the lower layer (first enhancement layer) stored in the frame memory 911. Furthermore, the enhancement layer decoding unit 710 performs intra prediction with reference to the decoded image in the decoding target tile. The decoded image of the lower layer (second enhancement layer) tile decoded by the enhancement layer decoding unit 710 is output to the frame memory 911 and held in the frame memory 911.

ステップＳ１０１７にて、全体制御部７１４は、分離部７０４から入力された表示部分にかかる下位レイヤ（第２拡張レイヤ）の全てのタイルの符号データを復号したか否かを判定する。ここでは、拡張レイヤ復号部７１０は、第２拡張レイヤの全てのタイルの符号データの復号を終了したと判定し、ステップＳ１００２へ進む。ステップＳ１００２にて、全体制御部７１４は、第２拡張レイヤまで復号したので、全ての階層のタイルの復号処理が終わったと判定し、ステップＳ１００３に進む。ステップＳ１００３にて、セレクタ９２０は、復号された階層のうち、最下位の階層の復号画像を選択する。この場合、最下位の階層は第２拡張レイヤであるので、セレクタ９２０は、フレームメモリ９１１から復号画像を読み出し、読み出した復号画像を端子９１２を介して図６の表示部６０６に出力する。そして、表示部６０６は、表示制御部６０３から第２拡張レイヤの画像の表示が指示されることにより、画像復号部６０５から出力された第２拡張レイヤの復号画像を表示する。 In step S1017, overall control unit 714 determines whether or not the code data of all tiles in the lower layer (second enhancement layer) related to the display portion input from separation unit 704 has been decoded. Here, enhancement layer decoding section 710 determines that decoding of the code data of all tiles of the second enhancement layer has ended, and proceeds to step S1002. In step S1002, since the overall control unit 714 has decoded up to the second enhancement layer, the overall control unit 714 determines that the decoding process for all the tiles has been completed, and the process advances to step S1003. In step S1003, the selector 920 selects the decoded image of the lowest hierarchy among the decoded hierarchies. In this case, since the lowest hierarchy is the second enhancement layer, the selector 920 reads out the decoded image from the frame memory 911 and outputs the read out decoded image to the display unit 606 in FIG. The display unit 606 displays the second enhancement layer decoded image output from the image decoding unit 605 in response to an instruction to display the second enhancement layer image from the display control unit 603.

尚、上記において、表示する階層は第２拡張レイヤ（階層数は３）として説明を行った。しかしながら、階層符号化の符号データの階層数が３以上であり、表示する階層を第１拡張レイヤ（階層数は２）とした場合、第１拡張レイヤの復号が終了した（ステップＳ１００２にてＮＯ）後に、スタップＳ１０１０にてステップＳ１００３に進む。このため、第２拡張レイヤより上位の階層の符号データの復号は行われない。 In the above description, the hierarchy to be displayed is described as the second enhancement layer (the number of hierarchies is 3). However, when the number of hierarchies of the encoded data of the hierarchical coding is 3 or more and the hierarchy to be displayed is the first enhancement layer (the number of hierarchies is 2), decoding of the first enhancement layer is completed (NO in step S1002). ) After that, the process proceeds to step S1003 in step S1010. For this reason, the decoding of the code data of a hierarchy higher than a 2nd enhancement layer is not performed.

以上の構成と動作により、各拡張レイヤと基本レイヤの各レイヤにおいて独立復号タイルの相対的な位置を一致させることができる。即ち、基本レイヤで所定のタイルを独立タイルを設定した場合、各拡張レイヤにおいて、当該基本レイヤの独立タイルの位置と相対的に等しい位置のタイルは独立タイルとすることができる。これにより、階層符号化のいずれの階層においても、独立タイルの符号データの予測及び復号のために参照する画素を制限することができる。特に、図６において、表示制御部６０３によって表示を指示されたタイルが独立タイルであれば、記憶部６０２から必要な符号データを読み出し、画像復号部６０５は当該符号データのみを復号すれば良い。このため、従来よりも高速に処理することが可能になる。 With the above configuration and operation, the relative positions of the independent decoding tiles can be matched in each of the enhancement layer and the base layer. That is, when an independent tile is set as a predetermined tile in the base layer, in each extension layer, a tile at a position relatively equal to the position of the independent tile of the base layer can be an independent tile. This makes it possible to limit pixels to be referred to for prediction and decoding of code data of independent tiles in any layer of hierarchical encoding. In particular, in FIG. 6, if the tile instructed to be displayed by the display control unit 603 is an independent tile, necessary code data is read from the storage unit 602, and the image decoding unit 605 only has to decode the code data. For this reason, it becomes possible to process at higher speed than before.

さらに、ＭＣＴＳＳＥＩ符号がビットストリームに存在する場合、タイル位置一致情報であるｖｕｉ＿ｐａｒａｍｅｔｅｒｓのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号は１に必ずセットされる。即ち、ｖｕｉ＿ｐａｒａｍｅｔｅｒｓにおいて、ＭＣＴＳＳＥＩ符号がビットストリームに存在する場合、符号データとしてのｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を省略することができる。もし、ＭＣＴＳＳＥＩ符号がビットストリームに無ければ、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を復号し、後段の復号で参照される。ＭＣＴＳＳＥＩ符号がビットストリームにあれば、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号は符号化されていないので、復号側で必ず１の値を設定する。このようにすることで、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号が無くても同様に復号することが可能になる。 Further, when the MCTS SEI code is present in the bitstream, the tile_boundaries_aligned_flag code of vui_parameters which is tile position matching information is always set to 1. That is, when the MCTS SEI code is present in the bitstream in vui_parameters, the tile_boundaries_aligned_flag code as code data can be omitted. If the MCTS SEI code is not present in the bitstream, the tile_boundaries_aligned_flag code is decoded and referenced in the subsequent decoding. If the MCTS SEI code is in the bitstream, the tile_boundaries_aligned_flag code is not encoded, so the value of 1 is always set on the decoding side. By doing in this way, it becomes possible to perform the decoding similarly even if there is no tile_boundaries_aligned_flag code.

＜実施形態３＞
上記実施形態１及び実施形態２において、それぞれ図１、図４、図６、図７、及び図９に示した各処理部はハードウェアでもって構成しているものとして説明した。しかし、これらの図に示した各処理部で行なう処理をコンピュータプログラムで実行しても良い。 <Embodiment 3>
In the first embodiment and the second embodiment, each processing unit illustrated in FIGS. 1, 4, 6, 7, and 9 has been described as being configured by hardware. However, the processing performed in each processing unit shown in these figures may be executed by a computer program.

図１１は、上記実施形態１及び実施形態２に係る画像符号化装置及び画像復号装置の各処理部が行う処理を実行するコンピュータのハードウェアの構成例を示すブロック図である。 FIG. 11 is a block diagram illustrating a hardware configuration example of a computer that executes processing performed by each processing unit of the image encoding device and the image decoding device according to the first and second embodiments.

ＣＰＵ１１０１は、ＲＡＭ１１０２やＲＯＭ１１０３に格納されているコンピュータプログラムやデータを用いてコンピュータ全体の制御を行うと共に、上術した各実施形態に係る画像符号化装置及び画像復号装置が行うものとして上述した各処理を実行する。即ち、ＣＰＵ１１０１は、図１、図４、図６、図７、及び図９に示した各処理部として機能することになる。 The CPU 1101 controls the entire computer using computer programs and data stored in the RAM 1102 and the ROM 1103, and performs the above-described processes performed by the image encoding device and the image decoding device according to each of the above-described embodiments. Execute. That is, the CPU 1101 functions as each processing unit shown in FIGS. 1, 4, 6, 7, and 9.

ＲＡＭ１１０２は、外部記憶装置１１０６からロードされたコンピュータプログラムやデータ、Ｉ／Ｆ（インターフェース）１１０７を介して外部から取得したデータ等を一時的に記憶するためのエリアを有する。さらに、ＲＡＭ１１０２は、ＣＰＵ１１０１が各種の処理を実行する際に用いるワークエリアを有する。即ち、ＲＡＭ１１０２は、例えば、フレームメモリとして割当てたり、その他の各種のエリアを適宜提供したりすることができる。 The RAM 1102 has an area for temporarily storing computer programs and data loaded from the external storage device 1106, data acquired from the outside via an I / F (interface) 1107, and the like. Further, the RAM 1102 has a work area used when the CPU 1101 executes various processes. That is, the RAM 1102 can be allocated as, for example, a frame memory or can provide other various areas as appropriate.

ＲＯＭ１１０３は、本コンピュータの設定データや、ブートプログラム等を格納する。操作部１１０４は、キーボードやマウス等により構成されており、本コンピュータをユーザが操作することで、各種の指示をＣＰＵ１１０１に対して入力することができる。出力部１１０５は、ＣＰＵ１１０１による処理結果を表示させるための制御を行う。また、出力部１１０５は、例えば液晶ディスプレイで構成される表示部（不図示）において、ＣＰＵ１６０１による処理結果を表示するための制御を行う。 The ROM 1103 stores setting data of the computer, a boot program, and the like. The operation unit 1104 includes a keyboard, a mouse, and the like, and various instructions can be input to the CPU 1101 by the user operating the computer. The output unit 1105 performs control for displaying the processing result by the CPU 1101. Further, the output unit 1105 performs control for displaying a processing result by the CPU 1601 in a display unit (not shown) configured by, for example, a liquid crystal display.

外部記憶装置１１０６は、ハードディスクドライブ装置に代表される、大容量情報記憶装置である。外部記憶装置１１０６には、オペレーティングシステム（ＯＳ）や、図１、図４、図６、図７、及び図９に示した各部の機能をＣＰＵ１１０１に実現させるためのコンピュータプログラムが保存されている。さらには、外部記憶装置１１０６には、処理対象としての各画像データが保存されていても良い。 The external storage device 1106 is a large-capacity information storage device represented by a hard disk drive device. The external storage device 1106 stores an operating system (OS) and computer programs for causing the CPU 1101 to realize the functions of the units illustrated in FIGS. 1, 4, 6, 7, and 9. Furthermore, each image data as a processing target may be stored in the external storage device 1106.

外部記憶装置１１０６に保存されているコンピュータプログラムやデータは、ＣＰＵ１１０１による制御に従って適宜、ＲＡＭ１１０２にロードされ、ＣＰＵ１１０１による処理対象となる。Ｉ／Ｆ１１０７には、ＬＡＮやインターネット等のネットワーク、投影装置や表示装置等の他の機器を接続することができ、本コンピュータはこのＩ／Ｆ１１０７を介して様々な情報を取得したり、送出したりすることができる。バス１１０８は上述の各部を繋ぐ。 Computer programs and data stored in the external storage device 1106 are appropriately loaded into the RAM 1102 under the control of the CPU 1101 and are processed by the CPU 1101. The I / F 1107 can be connected to other devices such as a network such as a LAN or the Internet, a projection device, a display device, etc., and the computer acquires and sends various information via the I / F 1107. Can be. A bus 1108 connects the above-described units.

上述の構成における作動は、前述のフローチャートで説明した作動をＣＰＵ１１０１が中心となってその制御を行う。 The operation in the above-described configuration is controlled by the CPU 1101 centering on the operation described in the above flowchart.

＜その他の実施形態＞
尚、本発明を容易に実現するために、ビットストリームの先頭に近いレベルで独立タイルの有無を明示することは有用である。例えば、ｖｕｉ＿ｐａｒａｍｅｔｅｒｓを利用する方法について図１２を用いて説明する。図１２はｖｕｉ＿ｐａｒａｍｅｔｅｒｓのシンタックスを表す図である。ｖｕｉ＿ｐａｒａｍｅｔｅｒｓの中には、ビットストリームに必ず独立タイルが存在することを表すｍｏｔｉｏｎ＿ｃｏｎｓｔｒａｉｎｅｄ＿ｔｉｌｅ＿ｓｅｔｓ＿ｆｌａｇ符号が含まれている。この符号の値が１であれば、ＭＣＴＳＳＥＩを含み、ビットストリームに独立タイルが存在し、基本レイヤと拡張レイヤの各タイルの相対的な位置が一致することを示す。即ち、必ずｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号が１で固定なので、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号の符号化を行う必要が無い。一方、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号が０であれば、ＭＣＴＳＳＥＩを含まず、ビットストリームに独立タイルが存在しない。そのため、ｔｉｌｅ＿ｂｏｕｎｄａｒｉｅｓ＿ａｌｉｇｎｅｄ＿ｆｌａｇ符号を符号化する必要がある。このようなビットストリームを復号する画像復号装置は、ビットストリームに独立タイルが含まれるという情報を各タイルの復号処理を行う前に取得することができる。このため、特定の領域を復号する場合に、画像復号装置は独立タイルを用いて高速に復号処理をすることが可能である。さらに、その結果、部分拡大表示のようなアプリケーションが有効かどうかを、各タイルの復号処理を行う前に判断することができるようになる。 <Other embodiments>
In order to easily realize the present invention, it is useful to clearly indicate the presence or absence of an independent tile at a level close to the head of the bit stream. For example, a method using vui_parameters will be described with reference to FIG. FIG. 12 is a diagram illustrating the syntax of vui_parameters. In vui_parameters, a motion_constrained_tile_sets_flag code indicating that an independent tile always exists in the bitstream is included. If the value of this code is 1, it indicates that MCTS SEI is included, an independent tile exists in the bitstream, and the relative positions of the tiles of the base layer and the enhancement layer match. That is, since the tile_boundaries_aligned_flag code is always fixed at 1, there is no need to encode the tile_boundaries_aligned_flag code. On the other hand, if the tile_boundaries_aligned_flag code is 0, the MCTS SEI is not included and no independent tile exists in the bitstream. Therefore, it is necessary to encode the tile_boundaries_aligned_flag code. An image decoding apparatus that decodes such a bitstream can acquire information that an independent tile is included in the bitstream before performing the decoding process of each tile. For this reason, when decoding a specific area, the image decoding apparatus can perform high-speed decoding processing using independent tiles. Further, as a result, it is possible to determine whether an application such as partial enlarged display is valid before performing the decoding process of each tile.

尚、画像のサイズ、タイルの分割数、独立タイルの１フレームにおける位置は上記に示した各実施形態に限定されない。 The size of the image, the number of tile divisions, and the position of the independent tile in one frame are not limited to the above-described embodiments.

また、本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（プログラム）を、ネットワーク又は各種記憶媒体を介してシステム或いは装置に供給し、そのシステム或いは装置のコンピュータ（またはＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。 The present invention can also be realized by executing the following processing. That is, software (program) that realizes the functions of the above-described embodiments is supplied to a system or apparatus via a network or various storage media, and a computer (or CPU, MPU, or the like) of the system or apparatus reads the program. It is a process to be executed.

Claims

An image encoding device that hierarchically encodes images constituting a moving image in a plurality of layers,
Generating means for generating a second image having a different hierarchy from the first image;
Code that encodes a first tile set composed of one or more tiles in the first image and a second tile set composed of one or more tiles in the second image And
Information encoding means, and
The second tile set is in a position corresponding to the first tile set in the second image;
The encoding means encodes the first tile set without referring to other than the first tile set in the first image, and the second tile set in the second image. Encoding the second tile set without referring to
The encoding means refers to only the second tile set in the second image when encoding the first tile set with reference to at least a partial region of the second image. Encoding the first tile set with restrictions such that
The image encoding device, wherein the information encoding means encodes an SEI message indicating a restriction on decoding processing of the first tile set and the second tile set.

The image encoding apparatus according to claim 1, wherein the first image and the second image have different resolutions or image quality.

The first image is an enhancement layer;
The image coding apparatus according to claim 1, wherein the second image is a base layer.

The first tile set in the first image and the second tile set in the second image are present at the same position in each image. The image encoding device according to item 1.

The image coding apparatus according to any one of claims 1 to 4, wherein the SEI message includes at least information indicating a position of the first tile set.

An image decoding device that decodes encoded data generated by hierarchically encoding images constituting a moving image in a plurality of layers,
Information decoding means for decoding an SEI message indicating a restriction on decoding processing of a tile set composed of one or a plurality of tiles;
Decoding means for decoding a first tile set in the first image and a second tile set in the second image according to the SEI message;
The second tile set is in a position corresponding to the first tile set in the second image;
The decoding means decodes the first tile set without referring to the first tile set other than the first tile set in the first image, and the second image other than the second tile set. Decoding the second tile set without reference;
The decoding means is limited to refer to only the second tile set in the second image when the first tile set is decoded with reference to at least a partial region of the second image. Then, the image decoding apparatus characterized in that the first tile set is decoded.

The image decoding apparatus according to claim 6, wherein the first image and the second image have different resolutions or image quality.

The first image is an enhancement layer;
The image decoding apparatus according to claim 6 or 7, wherein the second image is a base layer.

The first tile set in the first image and the second tile set in the second image are in the same position in each image. 9. The image decoding device according to item 1.

The image decoding apparatus according to any one of claims 6 to 9, wherein the SEI message includes at least information indicating a position of the first tile set.

An image encoding method for hierarchically encoding images constituting a moving image in a plurality of layers,
A generation step of generating a second image having a hierarchy different from that of the first image;
Code that encodes a first tile set composed of one or more tiles in the first image and a second tile set composed of one or more tiles in the second image Conversion process,
An information encoding process, and
The second tile set is in a position corresponding to the first tile set in the second image;
In the encoding step, the first tile set is encoded in the first image without referring to other than the first tile set, and the second tile set is encoded in the second image. Encoding the second tile set without referring to
In the encoding step, when the first tile set is encoded with reference to at least a partial region of the second image, only the second tile set is referred to in the second image. Encoding the first tile set with restrictions such that
An image encoding method, wherein, in the information encoding step, an SEI message indicating a restriction regarding decoding processing of the first tile set and the second tile set is encoded.

An image decoding method for decoding encoded data generated by hierarchically encoding images constituting a moving image in a plurality of layers,
An information decoding step of decoding an SEI message indicating restrictions on decoding processing of a tile set composed of one or a plurality of tiles;
Decoding according to the SEI message a first tile set in the first image and a second tile set in the second image;
The second tile set is in a position corresponding to the first tile set in the second image;
In the decoding step, the first image is decoded without referring to other than the first tile set in the first image, and other than the second tile set in the second image. Decoding the second tile set without reference;
In the decoding step, when the first tile set is decoded with reference to at least a partial area of the second image, the second image is limited to refer to only the second tile set. And decoding the first tile set.

A program that causes a computer to function as each unit of the image encoding device according to any one of claims 1 to 5.

A program that causes a computer to function as each unit of the image decoding device according to any one of claims 6 to 10.