JP2012186763A

JP2012186763A - Video encoding device, video decoding device, video encoding method, and video decoding method

Info

Publication number: JP2012186763A
Application number: JP2011050206A
Authority: JP
Inventors: Kazuo Sugimoto; 和夫杉本; Akifumi Hattori; 亮史服部; Shunichi Sekiguchi; 俊一関口
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2011-03-08
Filing date: 2011-03-08
Publication date: 2012-09-27

Abstract

PROBLEM TO BE SOLVED: To obtain a video encoding device and a video encoding method that are capable of notifying a video decoding device side of a division state of CUs by signaling with a small number of bits. SOLUTION: A variable length encoding unit 13 performs variable length encoding on a maximum CU division layer number indicating a division layer number of a portion in the deepest layer of a quadtree structure in each LCU, and also performs variable length encoding on a division flag indicating whether a CU other than CUs belonging to the deepest layer is divided into a quadtree by a block division unit 2.

Description

The present invention relates to a video encoding device, a video decoding device, a video encoding method, and a video decoding method used in image compression coding technology, compressed image data transmission technology, and the like.

In conventional video coding schemes such as MPEG and the ITU-T H.26x series, each frame of a video signal is divided evenly into square blocks called "macroblocks". Intra-frame or inter-frame prediction, an orthogonal transform such as the DCT applied to the prediction difference signal, and quantization are carried out macroblock by macroblock, and entropy coding then yields the bitstream, which is the final compressed data.
In the schemes up to AVC/H.264, the macroblock size is 16 pixels × 16 lines on the luminance signal. Techniques have been reported that greatly improve coding performance by extending this size and adapting the sizes of the prediction target blocks and transform blocks (see, for example, Non-Patent Document 1).

In the conventional macroblock size extension scheme disclosed in Non-Patent Document 1, the macroblock is called an LCU (Largest Coding Unit). The document discloses a configuration in which the LCU is further divided and a motion vector is detected for each resulting block, and a configuration in which the orthogonal transform size is adapted. The CU (Coding Unit), which is the coding unit used for intra-frame and inter-frame coding, is obtained by the quadtree division shown in FIG. 13.

In FIG. 13, the coding block labeled "layer 0", whose luminance component has size $(M^0, M^0)$ (the superscript denotes the layer level), is the LCU. Starting from this LCU, the divided CUs are obtained by dividing hierarchically in a quadtree structure down to an upper limit on the number of layers that is set separately for each sequence (the sequence-level upper limit on the number of layers).
Because the division is a quadtree division, $(M^{n+1}, M^{n+1}) = (M^n/2, M^n/2)$ always holds.
In the conventional macroblock size extension scheme, the division state of the CUs is expressed by signaling, for each CU starting from the LCU, a division flag that indicates whether that CU is divided.
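The halving relation between layers can be illustrated with a short Python sketch. The LCU size of 64 and the function name `cu_size_at_layer` are assumptions made for illustration; the text above does not fix either.

```python
def cu_size_at_layer(lcu_size: int, layer: int) -> int:
    """Side length of a CU at the given quadtree layer (layer 0 is the LCU)."""
    if layer < 0:
        raise ValueError("layer must be non-negative")
    size = lcu_size >> layer  # each quadtree division halves the side length
    if size == 0 or (size << layer) != lcu_size:
        raise ValueError("LCU size cannot be halved evenly down to this layer")
    return size


# Assumed LCU size of 64 luma samples, divided down to layer 3.
sizes = [cu_size_at_layer(64, n) for n in range(4)]
print(sizes)  # [64, 32, 16, 8]
```

Each additional layer quarters the block area, so a deeper hierarchy gives finer CUs at the cost of more blocks whose division state must be described.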

For example, when an LCU is divided hierarchically in a quadtree structure as shown in FIG. 14, one LCU (layer 0), four CUs (layer 1), and twelve CUs (layer 2) are obtained. A division flag (1 = divided, 0 = not divided) is therefore prepared for each of the 17 blocks in total (LCU + CUs), indicating whether that block is divided.
Consequently, the video encoding device signals the 17 division flags using 17 bits in order to notify the video decoding device of the division state of the CUs.
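The count of 17 flags can be checked with a small Python sketch. The nested-list tree representation and the choice of which three layer-1 CUs are divided are assumptions; only the block counts per layer (1, 4, and 12) are taken from the text.

```python
LEAF = None  # an undivided block; a divided block is a list of four children


def count_blocks(node) -> int:
    """Number of blocks in the tree, one conventional division flag per block."""
    if node is LEAF:
        return 1
    return 1 + sum(count_blocks(child) for child in node)


# One LCU, four layer-1 CUs, and twelve layer-2 CUs: three of the four
# layer-1 CUs are divided (which three is an assumption of this sketch).
divided = [LEAF, LEAF, LEAF, LEAF]
lcu = [divided, divided, divided, LEAF]

print(count_blocks(lcu))  # 17
```

In the conventional scheme every one of these blocks carries a one-bit flag, which is where the 17 bits come from.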

T. Wiegand, W.-J. Han, B. Bross, J.-R. Ohm and G. J. Sullivan, “WD1: Working Draft 1 of High-Efficiency Video Coding,” Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, doc. JCTVC-C403, Guangzhou, China, October 2010.

Because the conventional video encoding device is configured as described above, the deeper the quadtree hierarchy, the more flexibly the division state of the CUs can be expressed. However, conveying the division state of the CUs to the video decoding device requires signaling with a large number of bits, which increases the required code amount.

The present invention has been made to solve the above problem, and an object thereof is to obtain a video encoding device and a video encoding method that can convey the division state of the CUs to the video decoding device by signaling with a small number of bits.
Another object of the present invention is to obtain a video decoding device and a video decoding method that can be applied to the above video encoding device.

A video encoding device according to the present invention comprises: block dividing means that divides an input image into macroblocks of a predetermined size and divides each macroblock hierarchically in a quadtree structure, thereby outputting coding processing unit blocks, which are blocks of a coding unit finer than the macroblock; predicted image generating means that generates a predicted image by performing prediction processing on each coding processing unit block output from the block dividing means, in the coding mode corresponding to that block; image compressing means that compresses the difference image between the coding processing unit block output from the block dividing means and the predicted image generated by the predicted image generating means, and outputs compressed data of the difference image; and variable-length encoding means that variable-length encodes the compressed data output from the image compressing means and the coding mode, and generates a bitstream in which the encoded data of the compressed data and of the coding mode are multiplexed. The variable-length encoding means variable-length encodes a maximum division layer number, which indicates the number of division layers at the deepest part of the quadtree structure in each macroblock, and also variable-length encodes division flags, each indicating whether a coding processing unit block other than those belonging to the deepest layer has been quadtree-divided by the block dividing means.

According to the present invention, the device comprises: block dividing means that divides an input image into macroblocks of a predetermined size and divides each macroblock hierarchically in a quadtree structure, thereby outputting coding processing unit blocks, which are blocks of a coding unit finer than the macroblock; predicted image generating means that generates a predicted image by performing prediction processing on each coding processing unit block output from the block dividing means, in the coding mode corresponding to that block; image compressing means that compresses the difference image between the coding processing unit block output from the block dividing means and the predicted image generated by the predicted image generating means, and outputs compressed data of the difference image; and variable-length encoding means that variable-length encodes the compressed data output from the image compressing means and the coding mode, and generates a bitstream in which the encoded data of the compressed data and of the coding mode are multiplexed. The variable-length encoding means is configured to variable-length encode a maximum division layer number, which indicates the number of division layers at the deepest part of the quadtree structure in each macroblock, and to variable-length encode division flags, each indicating whether a coding processing unit block other than those belonging to the deepest layer has been quadtree-divided by the block dividing means. As a result, the division state of the coding processing unit blocks can be conveyed to the video decoding device by signaling with a small number of bits.
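The signaling idea can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the procedure of FIG. 3: the nested-list tree, the depth-first flag order, and the decision to emit a flag for the LCU itself are all assumptions, and the variable-length code used for the maximum division layer number is not modeled.

```python
LEAF = None  # an undivided block; a divided block is a list of four children


def max_division_layers(node, layer: int = 0) -> int:
    """Number of division layers at the deepest part of the quadtree."""
    if node is LEAF:
        return layer
    return max(max_division_layers(child, layer + 1) for child in node)


def division_flags(node, deepest: int, layer: int = 0) -> list:
    """Flags for every block above the deepest layer (depth-first order assumed)."""
    if layer == deepest:
        return []  # blocks in the deepest layer are never divided: no flag needed
    if node is LEAF:
        return [0]
    flags = [1]
    for child in node:
        flags += division_flags(child, deepest, layer + 1)
    return flags


# Same shape as the 17-block example: 1 LCU, 4 layer-1 CUs, 12 layer-2 CUs.
divided = [LEAF, LEAF, LEAF, LEAF]
lcu = [divided, divided, divided, LEAF]

deepest = max_division_layers(lcu)
flags = division_flags(lcu, deepest)
print(deepest, flags)  # 2 [1, 1, 1, 1, 0]
```

Under these assumptions the encoder sends the maximum division layer number (2) plus 5 division flags, instead of the 17 flags of the conventional scheme, because the twelve blocks in the deepest layer need no flag.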

FIG. 1 is a block diagram showing a video encoding device according to Embodiment 1 of the present invention.
FIG. 2 is a flowchart showing the processing performed by the video encoding device according to Embodiment 1 of the present invention.
FIG. 3 is a flowchart showing the entropy encoding process for the CU division state.
FIG. 4 is a block diagram showing a video decoding device according to Embodiment 1 of the present invention.
FIG. 5 is a flowchart showing the processing performed by the video decoding device according to Embodiment 1 of the present invention.
FIG. 6 is a flowchart showing the entropy decoding process for the CU division state.
FIG. 7 is an explanatory diagram showing how an LCU is divided hierarchically to obtain a plurality of CUs.
FIG. 8(a) shows the distribution of partitions after division, and FIG. 8(b) is an explanatory diagram showing, as a quadtree graph, how coding modes $m(B_j^n)$ are assigned by the hierarchical division.
FIG. 9 is an explanatory diagram showing an example of the bitstream generated by the variable length encoding unit 13.
FIG. 10 is an explanatory diagram showing a code example for the maximum CU division layer number.
FIG. 11 is an explanatory diagram showing the division flags of 17 blocks (LCU + CUs).
FIG. 12 is an explanatory diagram showing an example of a Huffman code for the maximum CU division layer number.
FIG. 13 is an explanatory diagram showing the hierarchical division of an LCU by a quadtree structure.
FIG. 14 is an explanatory diagram showing the division flags of 17 blocks (LCU + CUs).

Embodiment 1.
Embodiment 1 describes a video encoding device that takes each frame image of a video as input, applies compression by orthogonal transform and quantization to the prediction difference signal obtained by motion-compensated prediction between neighboring frames, and then performs variable-length encoding to generate a bitstream, together with a video decoding device that decodes the bitstream output from that video encoding device.

The video encoding device of Embodiment 1 is characterized in that it adapts to local changes of the video signal in the spatial and temporal directions, divides the video signal into regions of various sizes, and performs intra-frame/inter-frame adaptive coding.
In general, the complexity of a video signal varies locally in space and time. Spatially, a given video frame may contain patterns with uniform signal characteristics over a relatively wide image region, such as sky or walls, mixed with patterns that have complex textures within a small image region, such as people or finely textured paintings.
Temporally, sky and walls show little local change in the time direction, whereas the outlines of moving people and objects undergo rigid or non-rigid motion over time, so their temporal change is large.

The encoding process reduces the overall code amount by using temporal and spatial prediction to generate a prediction difference signal with low signal power and entropy. If the prediction parameters can be applied uniformly to as large an image signal region as possible, the code amount of those parameters can be kept small.
On the other hand, if the same prediction parameters are applied to an image signal pattern that changes greatly in time and space, prediction errors increase, so the code amount of the prediction difference signal cannot be reduced.
For image signal patterns that change greatly in time and space, it is therefore preferable to make the prediction target region smaller and reduce the power and entropy of the prediction difference signal, even though this increases the amount of prediction parameter data.
To perform coding adapted to these general properties of video signals, the video encoding device of Embodiment 1 divides the video signal region hierarchically, starting from a predetermined maximum block size, and performs prediction processing and encoding of the prediction difference for each divided region.

The video signal processed by the video encoding device of Embodiment 1 may be any video signal whose frames consist of a two-dimensional horizontal and vertical array of digital samples (pixels). This includes color video signals in an arbitrary color space, such as a YUV signal composed of a luminance signal and two color difference signals or an RGB signal output from a digital image sensor, as well as monochrome image signals and infrared image signals.
The gradation of each pixel may be 8 bits, or may be 10 bits, 12 bits, or the like.
In the following description, however, the input video signal is assumed to be a YUV signal unless otherwise specified. It is also assumed to be a 4:2:0 format signal in which the two color difference components U and V are subsampled relative to the luminance component Y.
The unit of processing data corresponding to each frame of the video is called a "picture". In Embodiment 1, a "picture" is described as the signal of a progressively scanned video frame. When the video signal is an interlaced signal, however, a "picture" may be a field image signal, which is the unit that makes up a video frame.
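As a small illustration of the assumed 4:2:0 input format, the following Python sketch computes the plane dimensions of one picture. In 4:2:0 each color difference plane has half the luminance resolution both horizontally and vertically; the function names and the 1920×1080 picture size are assumptions chosen for the example.

```python
def plane_sizes_420(width: int, height: int) -> dict:
    """(width, height) of the Y, U, and V planes of one 4:2:0 picture."""
    if width <= 0 or height <= 0 or width % 2 or height % 2:
        raise ValueError("this sketch expects positive, even luma dimensions")
    chroma = (width // 2, height // 2)  # halved in both directions
    return {"Y": (width, height), "U": chroma, "V": chroma}


def samples_per_picture(width: int, height: int) -> int:
    """Total number of samples over all three planes."""
    return sum(w * h for w, h in plane_sizes_420(width, height).values())


print(plane_sizes_420(1920, 1080)["U"])  # (960, 540)
print(samples_per_picture(1920, 1080))   # 3110400
```

The two color difference planes together add half as many samples as the luminance plane, so a 4:2:0 picture carries 1.5 samples per pixel.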

FIG. 1 is a block diagram showing a video encoding device according to Embodiment 1 of the present invention.
In FIG. 1, the encoding control unit 1 determines the size of the LCU (macroblock), which is the coding unit used when motion-compensated prediction (inter-frame prediction) or intra prediction (intra-frame prediction) is performed (the LCU size). Within a preset upper limit on the number of division layers of the CUs (blocks of a coding unit finer than the LCU), which is set, for example, per sequence, picture, or slice, it also determines the maximum CU division layer number, which indicates the number of division layers at the deepest part of the quadtree structure in each LCU.
The encoding control unit 1 also determines the coding mode suited to each CU obtained by dividing the LCU down to the maximum CU division layer number. That is, it selects the coding mode suited to each CU from among one or more available coding modes (one or more intra coding modes and one or more inter coding modes).

When the block dividing unit 2 receives a video signal representing an input image, it divides the input image into LCUs of the LCU size determined by the encoding control unit 1, divides each LCU hierarchically in a quadtree structure down to the maximum CU division layer number determined by the encoding control unit 1 to obtain CUs, and outputs each CU. The block dividing unit 2 constitutes block dividing means.
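The first step of the block dividing unit, tiling the picture with LCUs, can be sketched as follows. The raster scan order and the rounding up at the right and bottom picture edges are assumptions of this sketch; the passage above does not state how pictures whose dimensions are not multiples of the LCU size are handled. The name `lcu_origins` is hypothetical.

```python
def lcu_origins(width: int, height: int, lcu_size: int) -> list:
    """Top-left (x, y) of every LCU needed to cover the picture, in raster order."""
    if width <= 0 or height <= 0 or lcu_size <= 0:
        raise ValueError("dimensions and LCU size must be positive")
    cols = -(-width // lcu_size)   # ceiling division (assumed edge handling)
    rows = -(-height // lcu_size)
    return [(col * lcu_size, row * lcu_size)
            for row in range(rows)
            for col in range(cols)]


origins = lcu_origins(1920, 1080, 64)  # assumed picture and LCU sizes
print(len(origins))  # 510
print(origins[-1])   # (1856, 1024)
```

Each of these LCUs would then be divided into CUs by the quadtree, down to the maximum CU division layer number chosen for that LCU.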

If the coding mode selected by the encoding control unit 1 is an intra coding mode, the changeover switch 3 outputs the CU divided by the block dividing unit 2 to the intra prediction unit 4; if the selected coding mode is an inter coding mode, it outputs the CU to the motion compensation prediction unit 5.
When the intra prediction unit 4 receives a CU divided by the block dividing unit 2 from the changeover switch 3, it performs intra prediction on that CU using the intra prediction parameters output from the encoding control unit 1 and generates a predicted image.
When the motion compensation prediction unit 5 receives a CU divided by the block dividing unit 2 from the changeover switch 3, it performs motion-compensated prediction on that CU using the inter prediction parameters output from the encoding control unit 1 and generates a predicted image.
The changeover switch 3, the intra prediction unit 4, and the motion compensation prediction unit 5 constitute predicted image generating means.

The subtracting unit 6 subtracts the predicted image generated by the intra prediction unit 4 or the motion compensation prediction unit 5 from the CU divided by the block dividing unit 2 to generate a difference image (= CU - predicted image), and outputs a prediction difference signal representing that difference image.
The transform/quantization unit 7 applies a transform (for example, the DCT (discrete cosine transform), or an orthogonal transform such as a KL transform whose basis has been designed in advance for a specific training sequence) to the prediction difference signal output from the subtracting unit 6, in units of the transform block size included in the prediction difference encoding parameters output from the encoding control unit 1. It then quantizes the transform coefficients of the prediction difference signal using the quantization parameter included in those prediction difference encoding parameters, and outputs the quantized transform coefficients as the compressed data of the difference image. The transform/quantization unit 7 constitutes image compressing means.

The inverse quantization/inverse transform unit 8 inversely quantizes the compressed data output from the transform/quantization unit 7 using the quantization parameter included in the prediction difference encoding parameters output from the encoding control unit 1. It then applies an inverse transform (for example, the inverse DCT (inverse discrete cosine transform) or an inverse KL transform) to the inversely quantized data in units of the transform block size included in those prediction difference encoding parameters, and outputs the result as a local decoded prediction difference signal.

The adding unit 9 adds the local decoded prediction difference signal output from the inverse quantization/inverse transform unit 8 and the prediction signal representing the predicted image generated by the intra prediction unit 4 or the motion compensation prediction unit 5, thereby generating a local decoded image signal representing a local decoded image.
The intra prediction memory 10 is a recording medium such as a RAM that stores the local decoded image represented by the local decoded image signal generated by the adding unit 9, as an image to be used by the intra prediction unit 4 in the next intra prediction process.
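The chain from the subtracting unit 6 through the adding unit 9 can be sketched in plain Python. This is an illustrative sketch only: it assumes a 2-D orthonormal DCT-II as the transform and a uniform scalar quantizer with step `qstep`, neither of which is fixed by the text (which also allows a KL transform and speaks only of a "quantization parameter"). All function names are hypothetical.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix (row k = frequency k)."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def transform_quantize(residual, qstep):
    """Role of unit 7: 2-D DCT followed by uniform quantization (assumed)."""
    c = dct_matrix(len(residual))
    coeffs = matmul(matmul(c, residual), transpose(c))
    return [[round(v / qstep) for v in row] for row in coeffs]


def dequantize_inverse(levels, qstep):
    """Role of unit 8: inverse quantization followed by the inverse 2-D DCT."""
    c = dct_matrix(len(levels))
    coeffs = [[v * qstep for v in row] for row in levels]
    return matmul(matmul(transpose(c), coeffs), c)


def encode_and_reconstruct(cu, prediction, qstep):
    n = len(cu)
    # Unit 6: difference image = CU - predicted image.
    residual = [[cu[y][x] - prediction[y][x] for x in range(n)] for y in range(n)]
    levels = transform_quantize(residual, qstep)
    decoded_residual = dequantize_inverse(levels, qstep)
    # Unit 9: local decoded image = predicted image + decoded difference.
    recon = [[prediction[y][x] + decoded_residual[y][x] for x in range(n)]
             for y in range(n)]
    return levels, recon


cu = [[18] * 4 for _ in range(4)]
prediction = [[10] * 4 for _ in range(4)]
levels, recon = encode_and_reconstruct(cu, prediction, qstep=10)
print(levels[0][0], round(recon[0][0], 6))  # 3 17.5
```

The reconstructed value 17.5 differs from the original 18 because quantization is lossy. The encoder stores this local decoded image rather than the original so that its predictions use the same reference data that the decoder will have.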

The loop filter unit 11 compensates for the coding distortion contained in the local decoded image signal generated by the adding unit 9, and outputs the local decoded image represented by the distortion-compensated local decoded image signal to the motion compensated prediction frame memory 12 as a reference image.
The motion compensated prediction frame memory 12 is a recording medium such as a RAM that stores the local decoded image filtered by the loop filter unit 11, as a reference image to be used by the motion compensation prediction unit 5 in the next motion-compensated prediction process.

The variable length encoding unit 13 variable-length encodes, for example, the compressed data output from the transform/quantization unit 7; the coding mode, the prediction difference encoding parameters, the maximum CU division layer number, and the division flags (flags indicating whether each CU other than those belonging to the deepest layer has been quadtree-divided by the block dividing unit 2) output from the encoding control unit 1; and the intra prediction parameters output from the intra prediction unit 4 or the inter prediction parameters output from the motion compensation prediction unit 5. It then generates a bitstream in which the encoded data of the compressed data, coding mode, prediction difference encoding parameters, maximum CU division layer number, division flags, and intra/inter prediction parameters are multiplexed. The variable length encoding unit 13 constitutes variable-length encoding means.

In FIG. 1, each component of the video encoding device, namely the encoding control unit 1, block dividing unit 2, changeover switch 3, intra prediction unit 4, motion compensation prediction unit 5, subtracting unit 6, transform/quantization unit 7, inverse quantization/inverse transform unit 8, adding unit 9, loop filter unit 11, and variable length encoding unit 13, is assumed to be implemented in dedicated hardware (for example, a semiconductor integrated circuit mounting a CPU, or a one-chip microcomputer). When the video encoding device is implemented on a computer, however, a program describing the processing of these components may be stored in the memory of the computer, and the CPU of the computer may execute the program stored in that memory.
FIG. 2 is a flowchart showing the processing performed by the video encoding device according to Embodiment 1 of the present invention.
FIG. 3 is a flowchart showing the entropy encoding process for the CU division state.

FIG. 4 is a block diagram showing a video decoding device according to Embodiment 1 of the present invention.
In FIG. 4, the variable length decoding unit 51 variable-length decodes, from the encoded data multiplexed in the bitstream, the compressed data, coding mode, prediction difference encoding parameters, and intra/inter prediction parameters of each hierarchically divided CU. It outputs the compressed data and the prediction difference encoding parameters to the inverse quantization/inverse transform unit 55, and outputs the coding mode and the intra/inter prediction parameters to the changeover switch 52.
The variable length decoding unit 51 also variable-length decodes the maximum CU division layer number and the division flags from the encoded data, and decodes the quadtree structure of the LCU from that maximum CU division layer number and those division flags.
The variable length decoding unit 51 constitutes variable-length decoding means.
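Rebuilding the LCU quadtree from the maximum CU division layer number and the division flags can be sketched as follows. This is not the procedure of FIG. 6; it assumes the flags arrive in depth-first order, that the LCU itself carries a flag, and that both values have already been variable-length decoded. The nested-list tree and the name `decode_quadtree` are hypothetical.

```python
LEAF = None  # an undivided block; a divided block is a list of four children


def decode_quadtree(deepest: int, flags):
    """Rebuild the quadtree from the maximum division layer number and flags."""
    it = iter(flags)

    def parse(layer):
        if layer == deepest:
            return LEAF  # deepest layer: no flag was sent, block is not divided
        if next(it) == 0:
            return LEAF
        return [parse(layer + 1) for _ in range(4)]

    try:
        tree = parse(0)
    except StopIteration:
        raise ValueError("too few division flags") from None
    if next(it, None) is not None:
        raise ValueError("unused division flags remain")
    return tree


tree = decode_quadtree(2, [1, 1, 1, 1, 0])
print(tree == [[LEAF] * 4, [LEAF] * 4, [LEAF] * 4, LEAF])  # True
```

With a maximum division layer number of 2, the five flags are enough to recover a tree of 17 blocks, because every block that reaches layer 2 is known to be undivided without reading a flag.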

When the encoding mode of the CU output from the variable length decoding unit 51 is an intra encoding mode, the changeover switch 52 outputs the intra prediction parameters output from the variable length decoding unit 51 to the intra prediction unit 53; when the encoding mode is an inter encoding mode, it outputs the inter prediction parameters output from the variable length decoding unit 51 to the motion compensation prediction unit 54.
The intra prediction unit 53 uses the intra prediction parameters output from the changeover switch 52 to perform intra prediction processing on the CU and generate a predicted image.
The motion compensation prediction unit 54 uses the inter prediction parameters output from the changeover switch 52 to perform motion compensation prediction processing on the CU and generate a predicted image.
The changeover switch 52, the intra prediction unit 53, and the motion compensation prediction unit 54 constitute predicted image generation means.
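The routing performed by the changeover switch 52 can be sketched as follows. This is only an illustration: the function name `route_prediction`, the mode labels "INTRA" and "INTER" used as strings, and the two callables standing in for the intra prediction unit 53 and the motion compensation prediction unit 54 are all hypothetical.

```python
from typing import Callable


def route_prediction(encoding_mode: str,
                     intra_params: dict,
                     inter_params: dict,
                     intra_predict: Callable[[dict], list],
                     motion_compensate: Callable[[dict], list]) -> list:
    """Forward the decoded parameters to the predictor that matches the mode.

    intra_predict and motion_compensate are placeholders (assumptions) for
    the intra prediction unit 53 and the motion compensation prediction
    unit 54; each returns a predicted image for the CU.
    """
    if encoding_mode == "INTRA":
        # Intra encoding mode: intra prediction parameters go to unit 53.
        return intra_predict(intra_params)
    if encoding_mode == "INTER":
        # Inter encoding mode: inter prediction parameters go to unit 54.
        return motion_compensate(inter_params)
    raise ValueError(f"unknown encoding mode: {encoding_mode!r}")
```

Only one of the two predictors runs for a given CU, which mirrors the either/or behaviour of the switch described above.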

The inverse quantization / inverse transform unit 55 inversely quantizes the compressed data of the encoding block output from the variable length decoding unit 51, using the quantization parameter included in the prediction difference encoding parameters output from the variable length decoding unit 51. It then performs inverse transform processing (for example, an inverse DCT (inverse discrete cosine transform) or an inverse KL transform) on the inversely quantized compressed data in units of the transform block size included in the prediction difference encoding parameters, and outputs the result as a decoded prediction difference signal (a signal indicating the difference image before compression).
The addition unit 56 adds the decoded prediction difference signal output from the inverse quantization / inverse transform unit 55 to the prediction signal indicating the predicted image generated by the intra prediction unit 53 or the motion compensation prediction unit 54, thereby generating a decoded image signal indicating a decoded image.
The inverse quantization / inverse transform unit 55 and the addition unit 56 constitute decoded image generation means.
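The path through units 55 and 56 can be illustrated with a minimal sketch. It assumes a square transform block, an orthonormal inverse DCT as the inverse transform, and a uniform quantization step `qstep`; the text only says that a quantization parameter is used, so the mapping to a step size, the rounding to integers, and the function names are assumptions.

```python
import math


def idct_2d(coeffs):
    """Inverse of the orthonormal 2-D DCT-II for a square block."""
    n = len(coeffs)

    def c(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for y in range(n):
        for x in range(n):
            s = 0.0
            for v in range(n):
                for u in range(n):
                    s += (c(u) * c(v) * coeffs[v][u]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[y][x] = s
    return out


def reconstruct_block(levels, qstep, prediction):
    """Inverse quantize, inverse transform, then add the prediction."""
    n = len(levels)
    # Inverse quantization (assumed uniform): level * step.
    coeffs = [[lv * qstep for lv in row] for row in levels]
    # Decoded prediction difference signal.
    residual = idct_2d(coeffs)
    # Addition unit 56: prediction + decoded difference.
    return [[int(round(prediction[y][x] + residual[y][x])) for x in range(n)]
            for y in range(n)]
```

With a 4x4 block whose only non-zero level is a DC level of 5 and a step of 8, the residual is a flat 10, so a flat prediction of 100 is reconstructed as 110.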

The intra prediction memory 57 is a recording medium such as a RAM that stores the decoded image indicated by the decoded image signal generated by the addition unit 56, as an image to be used by the intra prediction unit 53 in the next intra prediction process.
The loop filter unit 58 compensates for the encoding distortion included in the decoded image signal generated by the addition unit 56, and outputs the decoded image indicated by the compensated decoded image signal to the motion compensated prediction frame memory 59 as a reference image.
The motion compensated prediction frame memory 59 is a recording medium such as a RAM that stores the decoded image filtered by the loop filter unit 58, as a reference image to be used by the motion compensation prediction unit 54 in the next motion compensation prediction process.

FIG. 4 assumes that each of the components of the moving picture decoding apparatus, namely the variable length decoding unit 51, the changeover switch 52, the intra prediction unit 53, the motion compensation prediction unit 54, the inverse quantization / inverse transform unit 55, the addition unit 56, and the loop filter unit 58, is configured as dedicated hardware (for example, a semiconductor integrated circuit on which a CPU is mounted, or a one-chip microcomputer). When the moving picture decoding apparatus is configured as a computer, however, a program describing the processing of the variable length decoding unit 51, the changeover switch 52, the intra prediction unit 53, the motion compensation prediction unit 54, the inverse quantization / inverse transform unit 55, the addition unit 56, and the loop filter unit 58 may be stored in the memory of the computer, and the CPU of the computer may execute the program stored in that memory.
FIG. 5 is a flowchart showing the processing of the moving picture decoding apparatus according to Embodiment 1 of the present invention.
FIG. 6 is a flowchart showing the entropy decoding process of the CU division state.

Next, the operation will be described.
First, the processing performed by the moving picture encoding apparatus of FIG. 1 will be described.
The encoding control unit 1 first determines the size of the LCU (LCU size) used for encoding the picture to be encoded (current picture) and, within the preset upper limit of the number of CU division layers, the maximum number of CU division layers, which indicates the number of division layers of the deepest part of the quadtree structure in the LCU (step ST1 in FIG. 2).
The encoding control unit 1 also determines an encoding mode suitable for each CU obtained when the block dividing unit 2 divides the LCU until the maximum number of CU division layers is reached.
That is, it selects the encoding mode suitable for each CU from among the one or more available encoding modes (one or more intra encoding modes and one or more inter encoding modes).
Since the encoding mode selection process is a known technique, a detailed description is omitted. The encoding mode suitable for each CU may instead be determined after the block dividing unit 2, described later, has hierarchically divided the LCU into a plurality of CUs.

One conceivable way to determine the LCU size is, for example, to choose a size according to the resolution of the input image for all pictures.
Another is to quantify differences in the complexity of local motion in the input image as a parameter, and to choose a small LCU size for pictures with intense motion and a large LCU size for pictures with little motion.
For the maximum number of CU division layers, one conceivable method is, for example, to use more layers the more intense the motion of the input image, so that finer motion can be detected, and to use fewer layers when the input image has little motion.
The maximum number of CU division layers may also be fixed for each sequence.

On receiving a video signal indicating an input image, the block dividing unit 2 divides the input image into image regions of the LCU size determined by the encoding control unit 1.
For each image region of the LCU size (LCU), the block dividing unit 2 also divides the LCU hierarchically in a quadtree structure until the maximum number of CU division layers determined by the encoding control unit 1 is reached. This yields CUs, which are encoding unit blocks finer than the LCU, and each CU is output to the changeover switch 3 and the subtraction unit 6 (step ST2).

FIG. 7 is an explanatory diagram showing how an LCU is hierarchically divided to obtain a plurality of CUs.
The LCU is defined as the CU that, in its luminance component, has a pixel size of (L⁰, M⁰) and is labeled "0th layer" in FIG. 7.
In the example of FIG. 7, a plurality of CUs are obtained by dividing hierarchically in a quadtree structure, starting from the LCU, down to a separately defined depth (the maximum number of CU division layers).

At depth n, a CU is an image region of size (Lⁿ, Mⁿ).
Lⁿ and Mⁿ may be the same or different; the example of FIG. 7 shows the case Lⁿ = Mⁿ.
Hereinafter, the size of a CU is defined as its size (Lⁿ, Mⁿ) in the luminance component.

Because the block dividing unit 2 performs quadtree division, (Lⁿ⁺¹, Mⁿ⁺¹) = (Lⁿ/2, Mⁿ/2) always holds.
In a color video signal in which all color components have the same number of samples (4:4:4 format), such as an RGB signal, the size of every color component is (Lⁿ, Mⁿ); when the 4:2:0 format is handled, the size of the corresponding color difference component encoding block is (Lⁿ/2, Mⁿ/2).
Hereinafter, a CU in the n-th layer is denoted B_jⁿ (j: CU number in the n-th layer), and the encoding modes selectable for B_jⁿ are denoted m(B_jⁿ).
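The size relations above can be checked with a short sketch. The function names are hypothetical, and the LCU dimensions are assumed to be divisible by 2 at every layer used.

```python
def cu_size(l0: int, m0: int, n: int) -> tuple:
    """Luminance size (L^n, M^n) of a CU at layer n, given the LCU size (L^0, M^0).

    Each quadtree division halves both dimensions, so layer n divides by 2**n.
    """
    return l0 >> n, m0 >> n


def chroma_block_size(ln: int, mn: int, chroma_format: str) -> tuple:
    """Size of the color difference block that corresponds to a CU of size (L^n, M^n)."""
    if chroma_format == "4:4:4":
        # All color components have the same number of samples.
        return ln, mn
    if chroma_format == "4:2:0":
        # Color difference components are halved in both directions.
        return ln // 2, mn // 2
    raise ValueError(f"format not covered by this sketch: {chroma_format}")
```

For a 64x64 LCU, a CU in the second layer is 16x16 in luminance, and its 4:2:0 color difference blocks are 8x8.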

In the case of a color video signal composed of a plurality of color components, the encoding mode m(B_jⁿ) may be configured so that an individual mode is used for each color component. Hereinafter, unless otherwise specified, the encoding mode refers to the encoding mode for the luminance component of a CU of a YUV signal in 4:2:0 format.
The encoding modes m(B_jⁿ) include one or more intra encoding modes (collectively "INTRA") and one or more inter encoding modes (collectively "INTER"). As described above, the encoding control unit 1 selects, from all the encoding modes available for the picture or from a subset of them, the encoding mode with the highest encoding efficiency for the CU B_jⁿ.

For each LCU used for encoding the picture to be encoded (current picture), the encoding control unit 1 generates a CU division state such as that shown in FIG. 8, for example, and identifies the CUs B_jⁿ.
The shaded portions in FIG. 8(a) show the regions defined by B_jⁿ, and FIG. 8(b) shows, as a quadtree graph, how the encoding modes m(B_jⁿ) are assigned by the hierarchical division.
In FIG. 8(b), the nodes enclosed in squares are the nodes (CUs) to which an encoding mode m(B_jⁿ) is assigned.
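A quadtree graph of this kind can be walked to list the CUs that receive an encoding mode, that is, the leaf nodes. The sketch below is only an illustration under stated assumptions: the nested-list tree representation (None for an undivided CU, a list of four children for a divided one), the child order (top-left, top-right, bottom-left, bottom-right), and the function name are not taken from the source.

```python
def leaf_cus(node, n=0, x=0, y=0, size=64):
    """Return (layer, x, y, size) for every undivided CU under `node`.

    node: None for an undivided CU, or a list of four child nodes
          (assumed order: top-left, top-right, bottom-left, bottom-right).
    size: luminance width and height of the current (square) CU.
    """
    if node is None:
        # Leaf: this CU is one of the B_j^n to which a mode is assigned.
        return [(n, x, y, size)]
    half = size // 2
    offsets = [(0, 0), (half, 0), (0, half), (half, half)]
    leaves = []
    for child, (dx, dy) in zip(node, offsets):
        leaves.extend(leaf_cus(child, n + 1, x + dx, y + dy, half))
    return leaves
```

For example, `leaf_cus([None, [None] * 4, None, None])` describes a 64x64 LCU whose top-right quadrant is divided once more, giving three 32x32 CUs in layer 1 and four 16x16 CUs in layer 2.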

As shown in FIG. 8, a CU B_jⁿ is further divided into one or more partitions called PUs (Prediction Units: units of prediction processing).
Hereinafter, a partition belonging to B_jⁿ is denoted P_jiⁿ (i: PU number within the j-th CU of the n-th layer).
How B_jⁿ is divided into partitions P_jiⁿ is included as information in the encoding mode m(B_jⁿ).
Prediction processing is performed on every partition P_jiⁿ in accordance with the encoding mode m(B_jⁿ), but individual prediction parameters can be selected for each partition P_jiⁿ.

When the encoding control unit 1 has selected the optimal encoding mode m(B_jⁿ) for the partitions P_jiⁿ of each B_jⁿ, the changeover switch 3 outputs the partitions P_jiⁿ of the CU B_jⁿ divided by the block dividing unit 2 to the intra prediction unit 4 if the encoding mode m(B_jⁿ) is an intra encoding mode (step ST3).
If the encoding mode m(B_jⁿ) is an inter encoding mode (step ST3), the changeover switch 3 outputs the partitions P_jiⁿ of the CU B_jⁿ divided by the block dividing unit 2 to the motion compensation prediction unit 5.

On receiving a partition P_jiⁿ of the CU B_jⁿ from the changeover switch 3, the intra prediction unit 4 performs intra prediction processing on that partition using the intra prediction parameters corresponding to the encoding mode m(B_jⁿ) selected by the encoding control unit 1, and generates an intra predicted image P_jiⁿ (step ST4).
After generating the intra predicted image P_jiⁿ, the intra prediction unit 4 outputs it to the subtraction unit 6 and the addition unit 9. It also outputs the intra prediction parameters to the variable length encoding unit 13 so that the moving picture decoding apparatus of FIG. 4 can generate the same intra predicted image P_jiⁿ.
When the intra prediction processing of Embodiment 1 performs directional adaptive spatial prediction as defined in the AVC/H.264 standard (ISO/IEC 14496-10), the intra prediction parameters include information such as the prediction mode information selected for each PU.
The intra prediction processing of Embodiment 1 is not limited to the algorithm defined in the AVC/H.264 standard, but the intra prediction parameters must include the information needed for the moving picture encoding apparatus and the moving picture decoding apparatus to generate exactly the same intra predicted image.

On receiving a partition P_jiⁿ of the CU B_jⁿ from the changeover switch 3, the motion compensation prediction unit 5 searches for a motion vector using the partition P_jiⁿ and the reference image stored in the motion compensated prediction frame memory 12. Using the motion vector and the inter prediction parameters corresponding to the encoding mode m(B_jⁿ), it performs motion compensation prediction processing on the partition P_jiⁿ and generates an inter predicted image P_jiⁿ (step ST5).

After generating the inter predicted image P_jiⁿ, the motion compensation prediction unit 5 outputs it to the subtraction unit 6 and the addition unit 9. It also outputs the inter prediction parameters to the variable length encoding unit 13 so that the moving picture decoding apparatus of FIG. 4 can generate the same inter predicted image P_jiⁿ.
The inter prediction parameters used to generate the inter predicted image include the information listed below and are multiplexed into the bitstream by the variable length encoding unit 13 so that the moving picture decoding apparatus can generate exactly the same inter predicted image.
・Mode information describing the partition division within the CU B_jⁿ
・The motion vector of the partition P_jiⁿ, which is a PU
・When the motion compensated prediction frame memory 12 holds a plurality of reference images, reference image indication index information indicating which reference image is used for prediction
・When there are a plurality of motion vector predictor candidates, index information indicating which motion vector predictor is selected and used
・When there are a plurality of motion compensation interpolation filters, index information indicating which filter is selected and used
・When the motion vector of the partition P_jiⁿ, which is a PU, can indicate a plurality of pixel precisions (half pixel, 1/4 pixel, 1/8 pixel, etc.), selection information indicating which pixel precision is used
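The items listed above could be held in a record such as the following. This is a sketch of one possible data layout, not a structure defined by the source: every class and field name is hypothetical, and placing the conditional items on each partition rather than on the CU is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class PartitionMotion:
    """Per-PU information for one partition P_ji^n (field names are assumptions)."""
    motion_vector: Tuple[int, int]
    # Present only when several reference images are stored.
    reference_index: Optional[int] = None
    # Present only when several motion vector predictor candidates exist.
    mv_predictor_index: Optional[int] = None
    # Present only when several interpolation filters exist.
    interpolation_filter_index: Optional[int] = None
    # Present only when several pixel precisions are possible, e.g. "1/4".
    mv_precision: Optional[str] = None


@dataclass
class InterPredictionParams:
    """Inter prediction parameters for one CU B_j^n."""
    # Mode information describing how the CU is divided into partitions.
    partition_mode: str
    partitions: List[PartitionMotion]
```

Optional fields default to None so that a configuration with a single reference image, predictor, filter, or precision carries no index for it, matching the conditional wording of the list.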

When the intra prediction unit 4 or the motion compensation prediction unit 5 generates a predicted image (intra predicted image P_jiⁿ or inter predicted image P_jiⁿ), the subtraction unit 6 generates a difference image by subtracting that predicted image from the partition P_jiⁿ of the CU B_jⁿ divided by the block dividing unit 2, and outputs a prediction difference signal e_jiⁿ indicating the difference image to the transform / quantization unit 7 (step ST6).
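The subtraction in step ST6 is a sample-by-sample difference. A minimal sketch, assuming the partition and the predicted image are given as equally sized lists of rows (the function name is hypothetical):

```python
def prediction_difference(partition, prediction):
    """Difference image: partition samples minus predicted samples."""
    if len(partition) != len(prediction):
        raise ValueError("partition and prediction must have the same height")
    return [[p - q for p, q in zip(p_row, q_row)]
            for p_row, q_row in zip(partition, prediction)]
```

A perfect prediction gives an all-zero difference image, which is why better prediction leaves less data to transform and encode.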

On receiving the prediction difference signals e_jiⁿ from the subtraction unit 6, the transform / quantization unit 7 groups a plurality of prediction difference signals e_jiⁿ in units of the transform block size included in the prediction difference encoding parameters output from the encoding control unit 1 to obtain a prediction difference signal e_iⁿ. Based on the prediction difference encoding parameters, it then performs transform processing on e_iⁿ (for example, an orthogonal transform such as a DCT (discrete cosine transform) or a KL transform whose basis has been designed in advance for a specific learning sequence) to calculate transform coefficients.
The transform / quantization unit 7 also quantizes the transform coefficients using the quantization parameter included in the prediction difference encoding parameters, and outputs the quantized transform coefficients, as the compressed data of the difference image, to the inverse quantization / inverse transform unit 8 and the variable length encoding unit 13 (step ST7).
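Step ST7 can be illustrated with a small sketch that uses an orthonormal 2-D DCT as the orthogonal transform and a uniform quantizer. The step size `qstep` stands in for whatever step the quantization parameter implies; that mapping, the rounding rule, and the function names are assumptions.

```python
import math


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of rows."""
    n = len(block)

    def c(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for v in range(n):
        for u in range(n):
            s = 0.0
            for y in range(n):
                for x in range(n):
                    s += (block[y][x]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[v][u] = c(u) * c(v) * s
    return out


def transform_and_quantize(residual, qstep):
    """Transform a prediction difference block, then quantize the coefficients."""
    coeffs = dct_2d(residual)
    # Uniform quantization (assumed): divide by the step and round.
    return [[int(round(cf / qstep)) for cf in row] for row in coeffs]
```

A flat 4x4 difference block of value 10 with a step of 8 produces a single non-zero level, the DC level 5, and zeros elsewhere.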

The orthogonal transform is performed in units of TUs (Transform Units: orthogonal transform processing unit blocks) obtained by dividing a CU in a quadtree structure with the CU as the starting point.
As with the LCU, the encoding control unit 1 also determines the maximum TU size, the minimum TU size, and the maximum number of TU division layers (the number of division layers of the deepest part of the quadtree structure in the CU).
One conceivable way to determine the TU size is, for example, to choose a size according to the resolution of the input image for all pictures.
Another is to quantify differences in the complexity of local motion in the input image as a parameter, and to choose a small TU size for pictures with intense motion and a large TU size for pictures with little motion.
For the maximum number of TU division layers, one conceivable method is, for example, to use more layers the more intense the motion of the input image, so that the orthogonal transform can be applied in finer units, and to use fewer layers when the input image has little motion.
The maximum number of TU division layers may also be fixed for each sequence.
For each CU, the transform / quantization unit 7 hierarchically divides the CU into TUs until the maximum number of TU division layers determined above is reached, and performs the orthogonal transform and quantization described above on each TU.

On receiving the compressed data of the difference image from the transform / quantization unit 7, the inverse quantization / inverse transform unit 8 inversely quantizes the compressed data using the quantization parameter included in the prediction difference encoding parameters output from the encoding control unit 1. It then performs inverse transform processing (for example, an inverse DCT (inverse discrete cosine transform) or an inverse KL transform) on the inversely quantized compressed data in units of the transform block size included in the prediction difference encoding parameters, and outputs the result to the addition unit 9 as a local decoded prediction difference signal e_jiⁿ hat (because of the constraints of electronic filing, the "^" attached to an alphabetic character is written as "hat") (step ST8).

The prediction difference encoding parameters output from the encoding control unit 1 include, for each CU region, the quantization parameter used for encoding the prediction difference signal e_jiⁿ within that region and information indicating the TU division state.
The prediction difference encoding parameters are determined together with the encoding mode m(B_jⁿ) when the encoding control unit 1 determines that mode.
One quantization parameter may be assigned per LCU and shared by the CUs into which the LCU is divided, or the quantization parameter of each CU may be expressed as a difference from the quantization parameter of the LCU.
The TU division may also be determined in units of the PUs P_jiⁿ divided from the CU, rather than in units of the CU.
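The two ways of assigning quantization parameters described above (one value shared across the LCU, or a per-CU difference from the LCU value) can be sketched as follows. The function name and the list-based interface are hypothetical.

```python
def resolve_cu_qps(lcu_qp, num_cus, deltas=None):
    """Return the quantization parameter used by each CU of one LCU.

    deltas=None : every CU shares the single LCU quantization parameter.
    deltas=[...]: each CU's parameter is the LCU value plus its difference.
    """
    if deltas is None:
        return [lcu_qp] * num_cus
    if len(deltas) != num_cus:
        raise ValueError("one difference value is needed per CU")
    return [lcu_qp + d for d in deltas]
```

For instance, `resolve_cu_qps(30, 3)` gives every CU the value 30, while `resolve_cu_qps(30, 3, [0, -2, 1])` gives 30, 28, and 31.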

On receiving the local decoded prediction difference signal e_jiⁿ hat from the inverse quantization / inverse transform unit 8, the addition unit 9 adds it to the prediction signal indicating the predicted image (intra predicted image P_jiⁿ or inter predicted image P_jiⁿ) generated by the intra prediction unit 4 or the motion compensation prediction unit 5, thereby generating a local decoded image, which is a local decoded partition image P_jiⁿ hat or a local decoded encoding block image formed as a collection of such partition images (step ST9).
After generating the local decoded image, the addition unit 9 stores the local decoded image signal indicating it in the intra prediction memory 10 and outputs the signal to the loop filter unit 11. This local decoded image signal serves as the image signal for subsequent intra prediction.

The processing of steps ST3 to ST9 is repeated until processing of all the hierarchically divided CUs is complete; when all CUs have been processed, the process proceeds to step ST13 (steps ST10 and ST11).

On receiving the local decoded image signal from the addition unit 9, the loop filter unit 11 performs predetermined filtering on the signal to compensate for the encoding distortion it contains, and stores the local decoded image indicated by the compensated local decoded image signal in the motion compensated prediction frame memory 12 as a reference image (step ST12).
The filtering by the loop filter unit 11 may be performed on the local decoded image signal output from the addition unit 9 in units of LCUs or of individual CUs, or it may be performed for one whole screen at a time after the local decoded image signals corresponding to one screen of CUs have been output.

The variable length encoding unit 13 entropy-encodes the encoding mode, the prediction difference encoding parameters, the LCU size, the maximum number of CU division layers, the maximum TU size, the minimum TU size, the maximum number of TU division layers, and the division flags (the CU division state of the LCU) output from the encoding control unit 1; the compressed data and the TU division state of each CU output from the transform / quantization unit 7; and the intra prediction parameters output from the intra prediction unit 4 or the inter prediction parameters output from the motion compensation prediction unit 5.
The variable length encoding unit 13 multiplexes the encoded data resulting from the entropy encoding, namely the encoded data of the encoding mode, the prediction difference encoding parameters, the LCU size, the maximum number of CU division layers, the maximum TU size, the minimum TU size, the maximum number of TU division layers, the division flags (the CU division state of the LCU), the compressed data, the TU division state of each CU, and the intra prediction parameters / inter prediction parameters, to generate a bitstream (step ST13).
FIG. 9 is an explanatory diagram showing an example of the bitstream generated by the variable length encoding unit 13; it shows that the maximum number of CU division layers is included in the encoded data of each LCU.

The "entropy encoding process of the CU division state", which is the characteristic feature of Embodiment 1, is described in detail below.
In Embodiment 1, starting from the LCU (layer n = 0), the quadtree division of each CU is repeated recursively until the maximum number of CU division layers is reached.
First, the variable length encoding unit 13 encodes the maximum number of CU division layers (step ST21 in FIG. 3).

最大ＣＵ分割階層数の符号は、フレーム内符号化のみ行うピクチャ（あるいは、ピクチャを分割したスライス）に属するＬＣＵの場合、例えば、図１０の左側のテーブルを参照することで２値化値が決定され、フレーム間符号化を行うスライス又はピクチャに属するＬＣＵの場合、例えば、図１０の右側のテーブルを参照することで決定される。
フレーム内符号化では、予測が当たり難く、ＬＣＵは細かく分割され易いため、最大ＣＵ分割階層数が大きい程、短い符号を割り当てている。
一方、フレーム間符号化では、予測が比較的当たり易いので、最大ＣＵ分割階層数が小さい程、短い符号を割り当てることにより、効率よくＣＵ分割状態を符号化することができるようにしている。
また、図１０に示すように、シーケンス、ピクチャあるいはスライス毎に設定されたＣＵの分割階層数の上限の値に応じて、テーブルを切り替えて、ＣＵの分割階層数の上限の値までのインデックスしか持たないテーブルを使用することにより、符号の冗長度を最小にとどめることができ、効率よくＣＵの分割状態を符号化することができる。 For an LCU belonging to a picture (or a slice obtained by dividing a picture) in which only intra-frame coding is performed, the binarized value of the code for the maximum number of CU partition layers is determined by referring, for example, to the table on the left side of FIG. 10. For an LCU belonging to a slice or picture in which inter-frame coding is performed, it is determined by referring, for example, to the table on the right side of FIG. 10.
In intra-frame coding, prediction tends to miss and the LCU tends to be divided finely, so a shorter code is assigned as the maximum number of CU partition layers becomes larger.
In inter-frame coding, on the other hand, prediction is relatively likely to succeed, so a shorter code is assigned as the maximum number of CU partition layers becomes smaller, which allows the CU partition state to be encoded efficiently.
Further, as shown in FIG. 10, the table is switched according to the upper limit on the number of CU partition layers set for each sequence, picture, or slice. Using a table that has indexes only up to that upper limit keeps the code redundancy to a minimum, so the division state of the CU can be encoded efficiently.
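The tables of FIG. 10 are not reproduced in this text, so the following is only a minimal sketch of one binarization that is consistent with the description. It assumes a truncated unary code in which intra-frame coding gives the shortest code to the largest number of layers and inter-frame coding gives it to the smallest. The function name `binarize_max_layers` and the truncated unary choice are assumptions; the only code word confirmed by the text is "110" for three layers with an upper limit of 4 in inter-frame coding.

```python
def binarize_max_layers(layers, upper_limit, intra):
    """Truncated unary binarization of the maximum number of CU partition layers.

    Assumption: the table holds indexes 0 .. upper_limit - 1 only, and the
    last index omits the terminating "0", so no code word is wasted.
    """
    if not 1 <= layers <= upper_limit:
        raise ValueError("layers must lie between 1 and the upper limit")
    # Intra: more layers are more likely, so they get the shorter codes.
    # Inter: fewer layers are more likely, so they get the shorter codes.
    index = (upper_limit - layers) if intra else (layers - 1)
    last_index = upper_limit - 1
    return "1" * index + ("" if index == last_index else "0")


print(binarize_max_layers(3, 4, intra=False))  # 110
```

Because the table stops at the upper limit, lowering the upper limit shortens the longest code word, which is the redundancy reduction the paragraph above describes.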

可変長符号化部１３は、第０階層のＣＵ符号化処理に移行する（ステップＳＴ２２）。
可変長符号化部１３は、処理対象の階層ｎが０であると同時に最大ＣＵ分割階層数が２以上である場合（ステップＳＴ２０１）、処理対象の階層ｎに属するＣＵが四分木分割されているか否かを示す分割フラグを符号化せずに、当該ＣＵを四分木分割した第ｎ＋１階層に属するＣＵのＣＵ符号化処理をＺスキャン順に実行する（ステップＳＴ２０５）。
可変長符号化部１３は、処理対象の階層ｎが最大ＣＵ分割階層数に対応する最も深い階層でなければ（ステップＳＴ２０２）、処理対象の階層ｎに属する各々のＣＵが、四分木分割されているか否かを示す分割フラグを符号化する（ステップＳＴ２０３）。
例えば、ＣＵが四分木分割されている場合、分割フラグは“１”に符号化され、ＣＵが四分木分割されていない場合、分割フラグは“０”に符号化される。ＣＵの分割フラグは、例えば、ＬＣＵから始まるＺスキャン順に符号化される。ＣＵが四分木分割されていれば（ステップＳＴ２０４）当該ＣＵを四分木分割した第ｎ＋１階層に属するＣＵのＣＵ符号化処理をＺスキャン順に実行する（ステップＳＴ２０５）。当該ＣＵが四分木分割されていない場合には当該ＣＵ符号化処理を終了する。この際、当該ＣＵに属する符号化データを符号化するよう構成してもよいし、全てのＣＵ分割状態を符号化した後に各ＣＵの符号化データを符号化するように構成してもよい。
なお、処理対象の階層ｎが最大ＣＵ分割階層数に対応する最も深い階層である場合、当該階層に属するＣＵは、それ以上の四分木分割がなされないため、ＣＵの分割フラグは符号化されないで当該ＣＵ符号化処理を終了する。 The variable length coding unit 13 proceeds to the 0th layer CU coding process (step ST22).
When the processing target layer n is 0 and the maximum number of CU partition layers is 2 or more (step ST201), the variable length encoding unit 13 does not encode the division flag indicating whether the CU belonging to the processing target layer n is divided into quadtrees. Instead, it executes the CU encoding process, in Z-scan order, for the CUs belonging to the (n + 1)th layer obtained by dividing that CU into quadtrees (step ST205).
If the processing target layer n is not the deepest layer corresponding to the maximum number of CU partition layers (step ST202), the variable length encoding unit 13 encodes, for each CU belonging to the processing target layer n, a division flag indicating whether that CU is divided into quadtrees (step ST203).
For example, when the CU is divided into quadtrees, the division flag is encoded as “1”, and when the CU is not divided into quadtrees, the division flag is encoded as “0”. The division flags of the CUs are encoded, for example, in Z-scan order starting from the LCU. If the CU has been divided into quadtrees (step ST204), the CU encoding process for the CUs belonging to the (n + 1)th layer obtained by dividing the CU into quadtrees is executed in Z-scan order (step ST205). When the CU is not divided into quadtrees, the CU encoding process ends. At this point, the encoded data belonging to the CU may be encoded, or the encoded data of each CU may be encoded after all the CU division states have been encoded.
When the processing target layer n is the deepest layer corresponding to the maximum number of CU partition layers, the CUs belonging to that layer are not divided into quadtrees any further, so the CU encoding process ends without encoding a CU division flag.
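Steps ST201 to ST205 can be sketched as a short recursive routine. This is an illustration under stated assumptions, not the device's actual implementation: a CU is modelled as `None` when it is not divided and as a list of four child CUs in Z-scan order when it is divided, and the names `encode_division_flags` and `max_layers` are hypothetical.

```python
def encode_division_flags(cu, n, max_layers, bits):
    """Append the division flags of `cu` (layer n) and its descendants to `bits`.

    cu: None (not divided) or a list of four child CUs in Z-scan order.
    max_layers: maximum number of CU partition layers of the LCU, so the
    deepest layer is max_layers - 1.
    """
    if n == 0 and max_layers >= 2:
        # ST201: the LCU is known to be divided, so no flag is written.
        for child in cu:  # ST205: children in Z-scan order
            encode_division_flags(child, 1, max_layers, bits)
    elif n != max_layers - 1:
        # ST202 / ST203: not the deepest layer, so the flag is written.
        divided = cu is not None
        bits.append(1 if divided else 0)
        if divided:  # ST204 / ST205
            for child in cu:
                encode_division_flags(child, n + 1, max_layers, bits)
    # Deepest layer: no further division is possible, so no flag is written.


# Upper left, upper right, and lower right CUs divided once more; lower left not.
lcu = [[None] * 4, [None] * 4, None, [None] * 4]
bits = []
encode_division_flags(lcu, 0, 3, bits)
print(bits)  # [1, 1, 0, 1]
```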

最大ＣＵ分割階層数が“１”でない場合、上述したように、ＬＣＵは、少なくとも１度は四分木分割がなされることが自明であるため、ＬＣＵに対する分割フラグは符号化しないようにしている。
このように、最大ＣＵ分割階層数が“１”でない場合、ＬＣＵに対する分割フラグが符号化されないため、効率よくＣＵ分割状態を符号化することができる。 When the maximum number of CU partition layers is not “1”, as described above, it is obvious that the LCU performs quadtree partitioning at least once. Therefore, the partition flag for the LCU is not encoded. .
As described above, when the maximum number of CU partition layers is not “1”, the partition flag for the LCU is not encoded, so that the CU partition state can be efficiently encoded.

また、Ｚスキャン順にＣＵの分割フラグを符号化する際に、Ｚスキャン順で符号化されていない最後のＣＵを符号化する前までに、最大ＣＵ分割階層数の階層となるＣＵが発生していない場合、Ｚスキャン順で符号化されていない最後のＣＵが最大ＣＵ分割階層数に対応する階層まで分割されることが自明であるため、この場合にも、ＣＵ分割フラグを符号化しないようにしている。
このように、分割されることが自明であるＣＵに対しては、ＣＵの分割フラグが符号化されないため、効率よくＣＵ分割状態を符号化することができる。 In addition, when the CU division flags are encoded in Z-scan order, if no CU at the layer corresponding to the maximum number of CU partition layers has occurred by the time the last not-yet-encoded CU in Z-scan order is to be encoded, it is self-evident that this last CU is divided down to the layer corresponding to the maximum number of CU partition layers. In this case as well, the CU division flag is not encoded.
As described above, the CU division flag is not encoded for a CU whose division is self-evident, so the CU division state can be encoded efficiently.
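One way to read the rule for the last CU in Z-scan order is sketched below. The interpretation is an assumption: a CU counts as "last" when it and all of its ancestors are the fourth child in Z-scan order, and its flag is skipped while no CU at the deepest layer has occurred yet. The tree model (`None` for an undivided CU, a list of four children otherwise) and all names are hypothetical.

```python
def encode_with_inference(cu, n, max_layers, bits, state, is_last=True):
    """Write division flags, skipping any flag whose value is self-evident.

    state["reached"]: True once a CU at the deepest layer has occurred.
    is_last: True when no CU follows this one in Z-scan order in the LCU.
    """
    if n == max_layers - 1:
        state["reached"] = True  # deepest layer never carries a flag
        return
    divided = cu is not None
    # The deepest layer must occur somewhere; if it has not occurred yet and
    # nothing follows this CU, this CU has to be divided.
    inferred = is_last and not state["reached"]
    if not inferred:
        bits.append(1 if divided else 0)
    if divided:
        for i, child in enumerate(cu):
            encode_with_inference(child, n + 1, max_layers, bits, state,
                                  is_last and i == 3)


# Only the lower right CU of layer 1 is divided down to the deepest layer.
lcu = [None, None, None, [None, None, None, None]]
bits = []
encode_with_inference(lcu, 0, 3, bits, {"reached": False})
print(bits)  # [0, 0, 0]
```

Under this reading the LCU itself is the first "last CU", so the omission of the LCU flag when the maximum number of CU partition layers is not 1 falls out as a special case.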

これらのエントロピー符号化としては、算術符号化を適用するようしてもよいし、ハフマン符号化を適用するようにしてもよい。
以下、ＣＵ分割状態の符号化処理を図１１の具体例を用いて説明する。
図１１の例では、フレーム間符号化を行うスライス又はピクチャにおけるＬＣＵとして、ＬＣＵサイズが６４画素×６４画素サイズ、ＣＵの分割階層数の上限が４の場合、つまり、ＣＵのサイズとして６４画素×６４画素、３２画素×３２画素、１６画素×１６画素及び８画素×８画素が選択できるものとしている。
図１１の例では、処理対象のＬＣＵが３２画素×３２画素サイズのＣＵに四分木分割され、左上、右上、右下の３２画素×３２画素サイズのＣＵが、更に１６画素×１６画素サイズのＣＵに四分木分割されている。 As these entropy coding, arithmetic coding may be applied, or Huffman coding may be applied.
Hereinafter, the encoding process of the CU division state will be described using the specific example of FIG. 11.
In the example of FIG. 11, the LCU belongs to a slice or picture in which inter-frame coding is performed, the LCU size is 64 pixels × 64 pixels, and the upper limit on the number of CU partition layers is 4. That is, 64 pixels × 64 pixels, 32 pixels × 32 pixels, 16 pixels × 16 pixels, and 8 pixels × 8 pixels can be selected as the CU size.
In the example of FIG. 11, the LCU to be processed is divided into quadtrees to give CUs of 32 pixels × 32 pixels, and the upper left, upper right, and lower right CUs of 32 pixels × 32 pixels are each further divided into quadtrees to give CUs of 16 pixels × 16 pixels.

従来手法では、ＬＣＵ（第０階層）が四分木分割されるため、ＬＣＵに対するＣＵの分割フラグとして“１”を符号化する。
次に、第１階層において、Ｚスキャン順にＣＵの分割フラグを符号化する。即ち、左上、右上、左下、右下の３２画素×３２画素サイズのＣＵ毎に、ＣＵの分割フラグを符号化する。
図１１の例では、左上、右上、右下の３２画素×３２画素サイズのＣＵについての分割フラグとして“１”を符号化し、左下の３２画素×３２画素サイズのＣＵについての分割フラグとして“０”を符号化する。
１６画素×１６画素サイズのＣＵについては、それ以上の四分木分割が行われないため、１６×１６画素サイズのＣＵ毎に、ＣＵの分割フラグとして“０”を符号化する。
このように、従来手法では、第０階層でＣＵの分割フラグが１個、第１階層でＣＵの分割フラグが４個、第２階層でＣＵの分割フラグが１２個であり、合計で１７個のＣＵの分割フラグが符号化されてＣＵ分割状態の符号化処理が終了する。 In the conventional method, since the LCU (layer 0) is divided into quadtrees, “1” is encoded as a CU division flag for the LCU.
Next, in the first layer, the division flag of the CU is encoded in the Z scan order. That is, the CU division flag is encoded for each CU having a size of 32 pixels × 32 pixels in the upper left, upper right, lower left, and lower right.
In the example of FIG. 11, “1” is encoded as the division flag for the upper left, upper right, and lower right CUs of 32 pixels × 32 pixels, and “0” is encoded as the division flag for the lower left CU of 32 pixels × 32 pixels.
Since no further quadtree division is performed for the CUs of 16 pixels × 16 pixels, “0” is encoded as the CU division flag for each CU of 16 pixels × 16 pixels.
Thus, in the conventional method, one CU division flag is encoded in the 0th layer, 4 in the 1st layer, and 12 in the 2nd layer. A total of 17 CU division flags are encoded before the encoding process of the CU division state is completed.

一方、この実施の形態１の手法では、ＬＣＵが、最大で第２階層に属する１６画素×１６画素サイズのＣＵまで分割されるため、第０階層、第１階層、および第２階層の合計“３”の最大ＣＵ分割階層数に対応する “１１０”（３個の２値化値）を符号化する（図１０の右側のテーブルを参照）。この場合、ＬＣＵの最大ＣＵ分割階層数が“１”ではなく、ＬＣＵが必ず１度は分割されることが自明であるため、ＬＣＵ（第０階層）に対するＣＵの分割フラグは符号化しない。
次に、第１階層において、Ｚスキャン順にＣＵの分割フラグを符号化する。即ち、左上、右上、左下、右下の３２画素×３２画素サイズのＣＵ毎にＣＵの分割フラグを符号化する。
図１１の例では、左上、右上、右下の３２画素×３２画素サイズのＣＵについての分割フラグとして“１”を符号化し、左下の３２画素×３２画素サイズのＣＵについての分割フラグとして“０”を符号化する。
図１１の例では、ＬＣＵの最大ＣＵ分割階層数が“３”であり、これ以上のＣＵの分割が行われないことが明らかであるため、これ以上のＣＵの分割フラグは符号化しない。
このように、この実施の形態１による手法では、最大ＣＵ分割階層数を示す３個の２値化値と、第１階層におけるＣＵの分割フラグの４個が符号化対象となるため、合計で７個の２値化値が符号化されてＣＵ分割状態の符号化処理が終了する。 On the other hand, in the method of the first embodiment, the LCU is divided at most down to CUs of 16 pixels × 16 pixels belonging to the second layer. Therefore “110” (three binarized values), which corresponds to a maximum number of CU partition layers of “3” (the 0th, first, and second layers in total), is encoded (see the table on the right side of FIG. 10). In this case, the maximum number of CU partition layers of the LCU is not “1” and it is self-evident that the LCU is divided at least once, so the CU division flag for the LCU (0th layer) is not encoded.
Next, in the first layer, the division flag of the CU is encoded in the Z scan order. That is, the division flag of the CU is encoded for each CU having a size of 32 pixels × 32 pixels in the upper left, upper right, lower left, and lower right.
In the example of FIG. 11, “1” is encoded as the division flag for the upper left, upper right, and lower right CUs of 32 pixels × 32 pixels, and “0” is encoded as the division flag for the lower left CU of 32 pixels × 32 pixels.
In the example of FIG. 11, the maximum number of CU partition layers of the LCU is “3”, so it is clear that no further CU division is performed, and no further CU division flags are encoded.
As described above, in the method according to the first embodiment, the three binarized values indicating the maximum number of CU partition layers and the four CU division flags in the first layer are the items to be encoded. A total of seven binarized values are therefore encoded before the encoding process of the CU division state is completed.
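The two totals for the FIG. 11 example (17 flags conventionally, 7 binarized values with the method of the first embodiment) can be checked with a small counting sketch. The tree model (`None` for an undivided CU, a list of four children in Z-scan order otherwise) and the function names are assumptions made for illustration; the three bins for “110” are taken from the text.

```python
def conventional_flags(cu, n, upper_limit):
    """Flags written when every CU above the smallest selectable size has one."""
    if n == upper_limit - 1:
        return 0  # smallest CU size: cannot be divided, so no flag
    if cu is None:
        return 1  # a single "0" flag
    return 1 + sum(conventional_flags(c, n + 1, upper_limit) for c in cu)


def proposed_flags(cu, n, max_layers):
    """Flags written when the LCU flag and deepest-layer flags are omitted."""
    if n == max_layers - 1:
        return 0  # deepest layer of this LCU: no flag
    children = 0 if cu is None else sum(
        proposed_flags(c, n + 1, max_layers) for c in cu)
    own_flag = 0 if n == 0 else 1  # layer 0 is known to be divided here
    return own_flag + children


fig11 = [[None] * 4, [None] * 4, None, [None] * 4]
max_layer_bins = 3  # "110", as stated in the text

print(conventional_flags(fig11, 0, upper_limit=4))               # 17
print(max_layer_bins + proposed_flags(fig11, 0, max_layers=3))   # 7
```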

ＣＵ分割状態の符号化処理において、算術符号化を適用する場合には、最大ＣＵ分割階層数の符号化を図１０のテーブルを用いて２値化した後に、各２値化値（ｂｉｎ）に対して適切な条件付発生確率によって適応算術符号化を行うことで、さらに符号量を削減することができる。
ＣＵの分割状態は、画面の動きやテクスチャに依存して形成されるが、画面の動きやテクスチャは一般に空間近傍では類似する可能性が高い。
そこで、例えば、符号化対象のＬＣＵの上側の最大ＣＵ分割階層数をＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ、左側の最大ＣＵ分割階層数をＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔとし、その最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ，ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔに応じて、符号化対象のＬＣＵの最大ＣＵ分割階層数を表す符号のコード長を切り替えるようにしてもよい。
即ち、符号化対象のＬＣＵの上側及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ，ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔに応じて、符号化対象のＬＣＵの最大ＣＵ分割階層数の最初のｂｉｎの優勢シンボルの生起確率を保持するメモリを切り替えるようにしてもよい。 In the encoding process in the CU partitioning state, when applying arithmetic coding, the encoding of the maximum number of CU partitioning layers is binarized using the table of FIG. 10, and then each binarized value (bin) is set. On the other hand, by performing adaptive arithmetic coding with an appropriate conditional occurrence probability, the amount of codes can be further reduced.
The division state of the CU is formed depending on the motion and texture of the picture, and motion and texture are generally likely to be similar in spatially neighboring regions.
Therefore, for example, let MaxCUDepth_Above be the maximum number of CU partition layers of the LCU above the LCU to be encoded, and let MaxCUDepth_Left be that of the LCU to its left. The code length of the code representing the maximum number of CU partition layers of the LCU to be encoded may then be switched according to MaxCUDepth_Above and MaxCUDepth_Left.
That is, the memory that holds the occurrence probability of the dominant symbol of the first bin of the maximum number of CU partition layers of the LCU to be encoded may be switched according to MaxCUDepth_Above and MaxCUDepth_Left.

フレーム内符号化を行うスライス又はピクチャに含まれるＬＣＵの場合、最大ＣＵ分割階層数が大きくなる可能性が高いため、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの双方が大きい値をとる場合（例えば、双方とも“０”でない場合、あるいは、片方でもＣＵの分割階層数の上限である場合、または、双方の和が所定の閾値より大きい場合）、符号化対象のＬＣＵの最大ＣＵ分割階層数が大きい値をとる確率が高くなるような最初のｂｉｎの優勢シンボルおよび生起確率を適用する。
逆に、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの双方が大きい値をとることがない場合、符号化対象のＬＣＵの最大ＣＵ分割階層数が大きい値をとる確率が低くなるような最初のｂｉｎの優勢シンボルおよび生起確率を適用する。
このように、符号化対象のＬＣＵの最大ＣＵ分割階層数の最初のｂｉｎの優勢シンボルの生起確率を保持するメモリを切り替えることで、発生する確率の高いシンボルを少ない情報量で符号化することができるため、効率よくＣＵ分割状態を符号化することができる。 In the case of an LCU included in a slice or picture in which intra-frame coding is performed, the maximum number of CU partition layers is likely to be large. Therefore, when both MaxCUDepth_Above and MaxCUDepth_Left take large values (for example, when neither is “0”, when at least one of them equals the upper limit on the number of CU partition layers, or when their sum is larger than a predetermined threshold), a dominant symbol and occurrence probability are applied to the first bin such that the maximum number of CU partition layers of the LCU to be encoded has a high probability of taking a large value.
Conversely, when MaxCUDepth_Above and MaxCUDepth_Left do not both take large values, a dominant symbol and occurrence probability are applied to the first bin such that the maximum number of CU partition layers of the LCU to be encoded has a low probability of taking a large value.
In this way, by switching the memory holding the occurrence probability of the dominant symbol of the first bin of the maximum CU partition layer number of the encoding target LCU, it is possible to encode a symbol with a high probability of occurrence with a small amount of information. Therefore, the CU division state can be efficiently encoded.

また、フレーム間符号化を行うスライス又はピクチャに含まれるＬＣＵの場合、最大ＣＵ分割階層数が小さくなる可能性が高いため、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの双方が小さい値をとる場合（例えば、双方とも“０”である場合、あるいは、双方ともＣＵの分割階層数の上限でない場合、または、双方の和が所定の閾値より小さい場合）、符号化対象のＬＣＵの最大ＣＵ分割階層数が小さい値をとる確率が高くなるような最初のｂｉｎの優勢シンボルおよび生起確率を適用する。
逆に、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの双方が小さい値をとることがない場合、符号化対象のＬＣＵの最大ＣＵ分割階層数が小さい値をとる確率が低くなるような最初のｂｉｎの優勢シンボルおよび生起確率を適用する。
このように、符号化対象のＬＣＵの最大ＣＵ分割階層数の最初のｂｉｎの優勢シンボルの生起確率を保持するメモリを切り替えることで、発生する確率の高いシンボルを少ない情報量で符号化することができるため、効率よくＣＵ分割状態を符号化することができる。 In the case of an LCU included in a slice or picture in which inter-frame coding is performed, the maximum number of CU partition layers is likely to be small. Therefore, when both MaxCUDepth_Above and MaxCUDepth_Left take small values (for example, when both are “0”, when neither equals the upper limit on the number of CU partition layers, or when their sum is smaller than a predetermined threshold), a dominant symbol and occurrence probability are applied to the first bin such that the maximum number of CU partition layers of the LCU to be encoded has a high probability of taking a small value.
Conversely, when MaxCUDepth_Above and MaxCUDepth_Left do not both take small values, a dominant symbol and occurrence probability are applied to the first bin such that the maximum number of CU partition layers of the LCU to be encoded has a low probability of taking a small value.
In this way, by switching the memory holding the occurrence probability of the dominant symbol of the first bin of the maximum CU partition layer number of the encoding target LCU, it is possible to encode a symbol with a high probability of occurrence with a small amount of information. Therefore, the CU division state can be efficiently encoded.
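The switching of the probability memory for the first bin can be sketched as a context-index selection. The sketch below uses only the "sum compared with a threshold" variant of the example conditions; the function name `first_bin_context`, the two-context layout, and the threshold value in the usage lines are assumptions, not values given in the text.

```python
def first_bin_context(max_depth_above, max_depth_left, intra, threshold):
    """Return the index of the probability memory used for the first bin.

    Index 1 is the memory tuned for the likely case (large values for intra,
    small values for inter); index 0 is the memory for the opposite case.
    """
    total = max_depth_above + max_depth_left
    if intra:
        neighbors_large = total > threshold
        return 1 if neighbors_large else 0
    neighbors_small = total < threshold
    return 1 if neighbors_small else 0


# Hypothetical threshold of 4 for an upper limit of 4 layers.
print(first_bin_context(3, 3, intra=True, threshold=4))   # 1
print(first_bin_context(1, 1, intra=False, threshold=4))  # 1
```

Both neighbors are already encoded when the current LCU is processed, so a decoder can derive the same index without any extra signaling.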

ＣＵ分割状態の符号化処理において、ハフマン符号化を適用する場合には、図１２に示すように、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの値が大きい場合と小さい場合で、ハフマンテーブルを切り替えことで、算術符号化において、優勢シンボルの生起確率が高くなるような符号化シンボルには短い符号を割り当て、逆の場合には長い符号を割り当てることにより得られる効果と同様の効果が得られる。 When Huffman coding is applied in the encoding process of the CU division state, the Huffman table is switched, as shown in FIG. 12, depending on whether the values of MaxCUDepth_Above and MaxCUDepth_Left are large or small. This yields an effect similar to the one obtained in arithmetic coding by assigning a short code to symbols for which the occurrence probability of the dominant symbol is high and a long code in the opposite case.

この実施の形態１では、ＬＣＵ毎に、常に最大ＣＵ分割階層数を符号化するものを示したが、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔが所定の条件を満足する場合に限り、符号化対象のＬＣＵの最大ＣＵ分割階層数を可変長符号化し、所定の条件を満足しなければ、符号化対象のＬＣＵの最大ＣＵ分割階層数を可変長符号化せずに、ブロック分割部２から出力された全てのＣＵの分割フラグを可変長符号化するよう構成してもよい。 In the first embodiment, the maximum number of CU partition layers is always encoded for each LCU. Alternatively, the maximum number of CU partition layers of the LCU to be encoded may be variable length encoded only when MaxCUDepth_Above and MaxCUDepth_Left satisfy a predetermined condition. When the condition is not satisfied, the division flags of all the CUs output from the block division unit 2 may be variable length encoded without variable length encoding the maximum number of CU partition layers of the LCU to be encoded.

即ち、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの双方が小さい値をとる場合（例えば、双方とも“０”である場合、あるいは、双方ともＣＵの分割階層数の上限でない場合、または、双方の和が所定の閾値より小さい場合）、上述した符号化対象のＬＣＵの最大ＣＵ分割階層数を符号化する手法でＣＵ分割状態を符号化する。
逆に、上側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ａｂｏｖｅ及び左側の最大ＣＵ分割階層数ＭａｘＣＵＤｅｐｔｈ＿Ｌｅｆｔの双方が小さい値をとることがない場合、符号化対象のＬＣＵの最大ＣＵ分割階層数を可変長符号化せずに、ブロック分割部２から出力された全てのＣＵの分割フラグを可変長符号化することで、ＣＵ分割状態を符号化する。
このように構成することで、偶発的に最大ＣＵ分割階層数が最大となるＬＣＵが多数発生するような場合には、最大ＣＵ分割階層数のオーバーヘッドを少なくすることができ、効率よくＣＵ分割状態を符号化することができる。 That is, when both MaxCUDepth_Above and MaxCUDepth_Left take small values (for example, when both are “0”, when neither equals the upper limit on the number of CU partition layers, or when their sum is smaller than a predetermined threshold), the CU partition state is encoded by the above-described method of encoding the maximum number of CU partition layers of the LCU to be encoded.
Conversely, when MaxCUDepth_Above and MaxCUDepth_Left do not both take small values, the CU partition state is encoded by variable length encoding the division flags of all the CUs output from the block division unit 2, without variable length encoding the maximum number of CU partition layers of the LCU to be encoded.
With this configuration, when many LCUs whose maximum number of CU partition layers takes its largest value happen to occur, the overhead of the maximum number of CU partition layers can be reduced, and the CU partition state can be encoded efficiently.
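The choice between the two signaling modes can be sketched as a predicate on the two neighboring values. Only the threshold variant of the example conditions is shown; the function name `sends_max_layers` and the threshold value in the usage lines are assumptions.

```python
def sends_max_layers(max_depth_above, max_depth_left, threshold):
    """Decide how the CU partition state of the current LCU is signaled.

    True:  encode the maximum number of CU partition layers, then only the
           division flags that are not self-evident.
    False: skip the maximum number of CU partition layers and encode the
           division flags of all CUs.
    """
    return max_depth_above + max_depth_left < threshold


# Hypothetical threshold of 6.
print(sends_max_layers(1, 2, threshold=6))  # True
print(sends_max_layers(4, 4, threshold=6))  # False
```

Since the decision depends only on LCUs that are already encoded, the decoder can evaluate the same predicate and needs no mode flag.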

以上で明らかなように、図１の動画像符号化装置によれば、可変長符号化部１３が、各々のＬＣＵにおける四分木構造で階層が最も深い部分の分割階層数を示す最大ＣＵ分割階層数を可変長符号化するとともに、最も深い階層に属するＣＵ以外のＣＵがブロック分割部２によって四分木分割されているか否かを示す分割フラグを可変長符号化するように構成したので、少ないビット数のシグナリングでＣＵの分割状態を動画像復号装置側に伝えることができる効果を奏する。 As is apparent from the above, in the moving image encoding device of FIG. 1, the variable length encoding unit 13 performs variable length encoding on the maximum number of CU partition layers, which indicates the number of partition layers at the deepest part of the quadtree structure in each LCU. It also performs variable length encoding on the division flag indicating whether a CU other than the CUs belonging to the deepest layer is divided into quadtrees by the block division unit 2. This has the effect that the division state of the CUs can be conveyed to the moving image decoding device with signaling that uses a small number of bits.

次に、図５の画像復号装置の処理内容を説明する。
可変長復号部５１は、図１の画像符号化装置から出力されたビットストリームを入力すると、そのビットストリームに対する可変長復号処理を実施して（図５のステップＳＴ３１）、１フレーム以上のピクチャから構成されるシーケンス単位あるいはピクチャ単位にフレームサイズ（水平画素数、垂直ライン数）を復号する。
可変長復号部５１は、図１の符号化制御部１と同様の手順で、ＬＣＵサイズ及び最大ＣＵ分割階層数を決定する（ステップＳＴ３２）。
例えば、画像符号化装置において、ＬＣＵサイズが、入力画像の解像度に応じて決定されている場合、先に復号しているフレームサイズに基づいてＬＣＵサイズを決定する。
なお、ＬＣＵサイズ及び最大ＣＵ分割階層数がビットストリームに多重化されている場合には、そのビットストリームから復号した情報を参照する。
また、可変長復号部５１は、符号化データからＣＵの分割フラグを可変長復号して、その最大ＣＵ分割階層数とＣＵの分割フラグからＬＣＵの四分木構造を示すＣＵ分割状態を復号する。 Next, processing contents of the image decoding apparatus in FIG. 5 will be described.
When the variable length decoding unit 51 receives the bitstream output from the image encoding device in FIG. 1, it performs variable length decoding processing on the bitstream (step ST31 in FIG. 5) and decodes the frame size (the number of horizontal pixels and the number of vertical lines) in units of sequences, each consisting of one or more frames of pictures, or in units of pictures.
The variable length decoding unit 51 determines the LCU size and the maximum number of CU partition layers in the same procedure as the encoding control unit 1 in FIG. 1 (step ST32).
For example, in the image encoding apparatus, when the LCU size is determined according to the resolution of the input image, the LCU size is determined based on the previously decoded frame size.
In addition, when the LCU size and the maximum number of CU division layers are multiplexed in the bit stream, the information decoded from the bit stream is referred to.
The variable length decoding unit 51 also performs variable length decoding of the CU division flags from the encoded data, and decodes the CU partition state indicating the quadtree structure of the LCU from the maximum number of CU partition layers and the CU division flags.

以下、ＣＵ分割状態のエントロピー復号処理について詳述する。
まず、可変長復号部５１は、図１０のテーブルを参照して、最大ＣＵ分割階層数を復号する（図６のステップＳＴ５１）。
例えば、ＣＵの分割階層数の上限が４である場合において、符号化データがフレーム間符号化による“１１０”の２値化値であれば、最大ＣＵ分割階層数が“３”であるとして復号される。
可変長復号部５１は、第０階層のＣＵ復号処理に移行する（ステップＳＴ５２）。
可変長復号部５１は、処理対象の階層ｎが０であると同時に最大ＣＵ分割階層数が２以上である場合（ステップＳＴ５０１）、処理対象の階層ｎに属するＣＵが四分木分割されているか否かを示す分割フラグを復号せずに、当該ＣＵを四分木分割した第ｎ＋１階層に属するＣＵのＣＵ復号処理をＺスキャン順に実行する（ステップＳＴ５０５）。 Hereinafter, the entropy decoding process in the CU division state will be described in detail.
First, the variable length decoding unit 51 decodes the maximum number of CU division layers with reference to the table of FIG. 10 (step ST51 of FIG. 6).
For example, when the upper limit on the number of CU partition layers is 4 and the encoded data is the binarized value “110” for inter-frame coding, the maximum number of CU partition layers is decoded as “3”.
The variable length decoding unit 51 proceeds to the 0th layer CU decoding process (step ST52).
When the processing target layer n is 0 and the maximum number of CU partition layers is 2 or more (step ST501), the variable length decoding unit 51 does not decode the division flag indicating whether the CU belonging to the processing target layer n is divided into quadtrees. Instead, it executes the CU decoding process, in Z-scan order, for the CUs belonging to the (n + 1)th layer obtained by dividing that CU into quadtrees (step ST505).

可変長復号部５１は、処理対象の階層ｎが最大ＣＵ分割階層数に対応する最も深い階層でなければ（ステップＳＴ５０２）、処理対象の階層ｎに属する各々のＣＵの分割フラグを復号する（ステップＳＴ５０３）。
なお、ＣＵの分割フラグは、符号化と同じ順序で復号される。例えば、ＬＣＵから始まるＺスキャン順で復号される。
ＣＵの分割フラグが分割する旨を示している場合（ステップＳＴ５０４）、当該ＣＵを四分木分割した第ｎ＋１階層に属するＣＵのＣＵ復号処理をＺスキャン順に実行する（ステップＳＴ５０５）。当該ＣＵが四分木分割されていない場合には当該ＣＵ復号処理を終了する。この際、当該ＣＵに属する符号化データを復号するよう構成してもよいし、全てのＣＵ分割状態を復号した後に各ＣＵの符号化データを復号するように構成してもよい。
ただし、最大ＣＵ分割階層数に対応する最も深い階層に属するＣＵは、それ以上の四分木分割がなされないため、ＣＵの分割フラグは復号しないで当該ＣＵ復号処理を終了する。 If the processing target layer n is not the deepest layer corresponding to the maximum number of CU partition layers (step ST502), the variable length decoding unit 51 decodes the partition flag of each CU belonging to the process target layer n (step ST502). ST503).
Note that the division flag of the CU is decoded in the same order as encoding. For example, decoding is performed in the order of Z scan starting from LCU.
When the division flag of the CU indicates that the CU is to be divided (step ST504), the CU decoding process of the CU belonging to the (n + 1) th hierarchy obtained by dividing the CU into the quadtree is executed in the Z scan order (step ST505). If the CU is not divided into quadtrees, the CU decoding process is terminated. At this time, the encoded data belonging to the CU may be decoded, or the encoded data of each CU may be decoded after decoding all the CU partition states.
However, since the CU belonging to the deepest hierarchy corresponding to the maximum number of CU division hierarchies is not subjected to further quadtree division, the CU decoding process is terminated without decoding the CU division flag.
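Steps ST501 to ST505 mirror the encoder side and can be sketched as a recursive reader of division flags. As before, this is an illustration under assumptions: the decoded quadtree is returned as `None` for an undivided CU and as a list of four children in Z-scan order otherwise, `bits` is any iterator over already entropy-decoded flags, and the function name is hypothetical.

```python
def decode_division_flags(bits, n, max_layers):
    """Rebuild the quadtree of one LCU from its division flags.

    bits: iterator yielding 0 / 1 division flags in Z-scan order.
    max_layers: decoded maximum number of CU partition layers, so the
    deepest layer is max_layers - 1.
    """
    if n == 0 and max_layers >= 2:
        # ST501: the LCU is known to be divided; no flag is read.
        return [decode_division_flags(bits, 1, max_layers) for _ in range(4)]
    if n != max_layers - 1:
        # ST502 / ST503: not the deepest layer, so a flag is read.
        if next(bits) == 1:  # ST504 / ST505
            return [decode_division_flags(bits, n + 1, max_layers)
                    for _ in range(4)]
        return None
    # Deepest layer: no flag is read and the CU is not divided.
    return None


tree = decode_division_flags(iter([1, 1, 0, 1]), 0, 3)
print(tree[2] is None, len(tree[0]))  # True 4
```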

最大ＣＵ分割階層数が“１”でない場合、上述したように、ＬＣＵは、少なくとも１度は四分木分割がなされることが自明であるため、ＬＣＵに対する分割フラグは復号しない。
また、Ｚスキャン順にＣＵの分割フラグを復号する際に、Ｚスキャン順で復号されていない最後のＣＵを復号する前までに、最大ＣＵ分割階層数の階層となるＣＵが発生していない場合、Ｚスキャン順で復号されていない最後のＣＵが最大ＣＵ分割階層数に対応する階層まで分割されることが自明であるため、この場合にも、ＣＵの分割フラグを復号しない。
これにより、ＣＵ分割状態をエントロピー復号することができるが、動画像符号化装置において、符号化対象のＬＣＵの最大ＣＵ分割階層数を可変長符号化せずに、ブロック分割部２から出力された全てのＣＵの分割フラグを可変長符号化することで、ＣＵ分割状態を符号化している場合には、全てのＣＵの分割フラグを可変長復号し、その分割フラグからＣＵ分割状態をエントロピー復号する。 When the maximum number of CU partition layers is not “1”, as described above, it is obvious that the LCU is subjected to quadtree partitioning at least once. Therefore, the partition flag for the LCU is not decoded.
Further, when decoding the CU partition flag in the Z scan order, if there is no CU that has the maximum number of CU partition layers before decoding the last CU that has not been decoded in the Z scan order, Since it is obvious that the last CU that has not been decoded in the Z-scan order is divided up to the layer corresponding to the maximum number of CU division layers, the division flag of the CU is not decoded in this case as well.
The CU partition state can thus be entropy-decoded. However, when the moving image encoding device has encoded the CU partition state by variable length encoding the division flags of all the CUs output from the block division unit 2, without variable length encoding the maximum number of CU partition layers of the LCU to be encoded, the division flags of all the CUs are variable length decoded and the CU partition state is entropy-decoded from those division flags.

可変長復号部５１は、上記のようにしてＣＵ分割状態を復号すると、そのＣＵ分割状態から、階層的に分割されている各々のＣＵであるＢ_j ⁿを特定する（ステップＳＴ３３）。
また、可変長復号部５１は、符号化データから各々のＣＵであるＢ_j ⁿに対応する符号化モードｍ（Ｂ_j ⁿ）を復号し、その符号化モードｍ（Ｂ_j ⁿ）に含まれる情報に基づき、ＣＵを更に１つないし複数のＰＵに分割し、各ＰＵに割り当てられている予測パラメータ（符号化モード、圧縮データ、予測差分符号化パラメータ、イントラ予測パラメータ／インター予測パラメータなど）を復号する（ステップＳＴ３４）。
なお、ＣＵに割り当てられている符号化モードがイントラ符号化モードである場合、ＣＵに属している１つ以上のＰＵ毎に、イントラ予測パラメータを復号する。また、ＣＵに割り当てられている符号化モードがインター符号化モードである場合、ＣＵに属している１つ以上のＰＵ毎に、インター予測パラメータを復号する。
イントラ予測パラメータの復号は、画像符号化装置と同じ手順で、周辺の復号済みＰＵのイントラ予測パラメータに基づいて、復号対象であるＰＵのイントラ予測パラメータの予測値を算出し、その予測値を用いて復号する。 When the variable length decoding unit 51 decodes the CU division state as described above, the variable length decoding unit 51 specifies B _j ⁿ that is each CU divided hierarchically from the CU division state (step ST33).
Further, the variable length decoding unit 51 decodes, from the encoded data, the coding mode m (B _j ⁿ ) corresponding to each CU B _j ⁿ, further divides the CU into one or more PUs based on the information included in the coding mode m (B _j ⁿ ), and decodes the prediction parameters (coding mode, compressed data, prediction differential encoding parameters, intra prediction parameters / inter prediction parameters, and so on) assigned to each PU (step ST34).
Note that when the coding mode assigned to the CU is the intra coding mode, the intra prediction parameter is decoded for each of the one or more PUs belonging to the CU. When the coding mode assigned to the CU is the inter coding mode, the inter prediction parameter is decoded for each of the one or more PUs belonging to the CU.
The intra prediction parameter is decoded in the same procedure as in the image encoding device: a predicted value of the intra prediction parameter of the PU to be decoded is calculated based on the intra prediction parameters of neighboring decoded PUs, and decoding is performed using that predicted value.

切替スイッチ５２は、可変長復号部５１からＣＵであるＢ_j ⁿに属しているパーティションＰ_ji ⁿの符号化モードｍ（Ｂ_j ⁿ）がイントラ符号化モードである場合（ステップＳＴ３５）、可変長復号部５１から出力されたイントラ予測パラメータをイントラ予測部５３に出力する。
一方、パーティションＰ_ji ⁿの符号化モードｍ（Ｂ_j ⁿ）がインター符号化モードである場合（ステップＳＴ３５）、可変長復号部５１から出力されたインター予測パラメータを動き補償予測部５４に出力する。 When the coding mode m (B _j ⁿ ) of the partition P _ji ⁿ belonging to the CU B _j ⁿ, as output from the variable length decoding unit 51, is the intra coding mode (step ST35), the changeover switch 52 outputs the intra prediction parameter output from the variable length decoding unit 51 to the intra prediction unit 53.
On the other hand, when the coding mode m (B _j ⁿ ) of the partition P _ji ⁿ is the inter coding mode (step ST35), the changeover switch 52 outputs the inter prediction parameter output from the variable length decoding unit 51 to the motion compensation prediction unit 54.

イントラ予測部５３は、切替スイッチ５２からイントラ予測パラメータを受けると、そのイントラ予測パラメータを用いて、ＣＵであるＢ_j ⁿのパーティションＰ_ji ⁿに対するイントラ予測処理を実施して、イントラ予測画像Ｐ_ji ⁿを生成する（ステップＳＴ３６）。
動き補償予測部５４は、切替スイッチ５２からインター予測パラメータを受けると、そのインター予測パラメータを用いて、ＣＵであるＢ_j ⁿのパーティションＰ_ji ⁿに対する動き補償予測処理を実施して、インター予測画像Ｐ_ji ⁿを生成する（ステップＳＴ３７）。 When the intra prediction unit 53 receives the intra prediction parameter from the changeover switch 52, it uses the intra prediction parameter to perform an intra prediction process on the partition P _ji ⁿ of the CU B _j ⁿ and generates an intra prediction image P _ji ⁿ (step ST36).
Upon receiving the inter prediction parameter from the changeover switch 52, the motion compensation prediction unit 54 uses the inter prediction parameter to perform a motion compensation prediction process on the partition P _ji ⁿ of the CU B _j ⁿ and generates an inter prediction image P _ji ⁿ (step ST37).

逆量子化・逆変換部５５は、可変長復号部５１から変換処理単位であるＴＵ毎に圧縮データが出力され、可変長復号部５１から出力された予測差分符号化パラメータに含まれている量子化パラメータを用いて、ＴＵ単位の圧縮データを逆量子化する。
また、逆量子化・逆変換部５５は、その予測差分符号化パラメータに含まれている変換ブロックサイズ単位で、逆量子化の圧縮データの逆変換処理（例えば、逆ＤＣＴ（逆離散コサイン変換）や、逆ＫＬ変換等の逆変換処理）を実施することで、逆変換処理後の圧縮データを復号予測差分信号（圧縮前の差分画像を示す信号）として加算部５６に出力する（ステップＳＴ３８）。 The inverse quantization / inverse transform unit 55 receives the compressed data output from the variable length decoding unit 51 for each TU, which is the transform processing unit, and inversely quantizes the compressed data in TU units using the quantization parameter included in the prediction differential encoding parameter output from the variable length decoding unit 51.
Further, the inverse quantization / inverse transform unit 55 performs inverse transform processing (for example, inverse DCT (inverse discrete cosine transform) or inverse KL transform) on the inversely quantized compressed data in units of the transform block size included in the prediction differential encoding parameter, and outputs the compressed data after the inverse transform processing to the adder 56 as a decoded prediction difference signal (a signal indicating the difference image before compression) (step ST38).
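The two stages of step ST38 can be illustrated with a small floating-point sketch. The text does not specify the quantizer or the exact transform arithmetic, so the uniform step size, the orthonormal 2-D inverse DCT, and the function names below are all assumptions chosen for clarity, not the device's actual processing.

```python
import math


def dequantize(levels, step):
    """Uniform inverse quantization: scale each level by the step size (assumed)."""
    return [[level * step for level in row] for row in levels]


def inverse_dct_2d(coeffs):
    """Orthonormal 2-D inverse DCT of a square block (direct O(N^4) form)."""
    size = len(coeffs)

    def scale(k):
        return math.sqrt(1.0 / size) if k == 0 else math.sqrt(2.0 / size)

    block = [[0.0] * size for _ in range(size)]
    for x in range(size):
        for y in range(size):
            total = 0.0
            for u in range(size):
                for v in range(size):
                    total += (scale(u) * scale(v) * coeffs[u][v]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * size))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * size)))
            block[x][y] = total
    return block


# A 2 x 2 TU that holds only a DC level decodes to a flat residual block.
residual = inverse_dct_2d(dequantize([[2, 0], [0, 0]], step=4))
print([[round(value, 6) for value in row] for row in residual])  # [[4.0, 4.0], [4.0, 4.0]]
```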

予測差分符号化パラメータに含まれている量子化パラメータは、ビットストリームから抽出した符号化データからＣＵの単位で復元し、ＴＵサイズ情報は、ＣＵを起点として、ＬＣＵの分割と同様に、四分木分割で表現された分割情報の形式や、選択可能なＴＵサイズをインデックス情報として表現された形式などで、ビットストリームから抽出して復元する。
逆量子化・逆変換部５５は、このＴＵサイズの情報に基づいて、変換・量子化処理のブロックサイズを特定して処理を行う。なお、ＴＵサイズの情報は、ＣＵではなく、ＣＵを分割するＰＵを単位として決定するように構成されていてもよい。 The quantization parameter included in the prediction differential encoding parameter is restored in units of CUs from the encoded data extracted from the bitstream. The TU size information is extracted from the bitstream and restored in a form such as division information expressed as a quadtree division starting from the CU, in the same way as the division of the LCU, or a form in which the selectable TU sizes are expressed as index information.
Based on this TU size information, the inverse quantization / inverse transform unit 55 specifies the block size of the transform / quantization process and performs the process. Note that the TU size information may be configured to be determined in units of PUs that divide a CU, not CUs.

加算部５６は、逆量子化・逆変換部５５から復号予測差分信号を受けると、その復号予測差分信号とイントラ予測部５３又は動き補償予測部５４により生成された予測画像を示す予測信号を加算することで復号画像（ＣＵ内に含まれている１つないし複数の復号ＰＵ画像の集まり）を生成して、その復号画像を示す復号画像信号をイントラ予測用メモリ５７に格納するとともに、その復号画像信号をループフィルタ部５８に出力する（ステップＳＴ３９）。復号画像が以降のイントラ予測用の画像信号となる。 When the addition unit 56 receives the decoded prediction difference signal from the inverse quantization / inverse conversion unit 55, the addition unit 56 adds the decoded prediction difference signal and the prediction signal indicating the prediction image generated by the intra prediction unit 53 or the motion compensated prediction unit 54. Thus, a decoded image (a collection of one or a plurality of decoded PU images included in the CU) is generated, and a decoded image signal indicating the decoded image is stored in the intra prediction memory 57, and the decoding is performed. The image signal is output to the loop filter unit 58 (step ST39). The decoded image becomes an image signal for subsequent intra prediction.
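The addition performed by the adder 56 in step ST39 amounts to a sample-wise sum of the prediction signal and the decoded prediction difference signal. The sketch below assumes integer samples and clips the result to an 8-bit range; the clipping, the bit depth, and the function name `reconstruct` are assumptions that the text does not state.

```python
def reconstruct(prediction, residual, bit_depth=8):
    """Add the decoded prediction difference to the prediction, sample by sample.

    The sum is clipped to the valid sample range (assumed: 0 .. 2**bit_depth - 1).
    """
    max_value = (1 << bit_depth) - 1
    return [[min(max(p + r, 0), max_value) for p, r in zip(p_row, r_row)]
            for p_row, r_row in zip(prediction, residual)]


print(reconstruct([[100, 250]], [[-5, 10]]))  # [[95, 255]]
```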

The processing of steps ST33 to ST39 is repeated until all hierarchically divided CUs have been processed (step ST40).
On receiving the decoded image signal from the addition unit 56, the loop filter unit 58 performs the same filtering as the loop filter unit 11 of the image encoding apparatus to compensate for the coding distortion contained in the decoded image signal, and stores the decoded image represented by the compensated signal in the motion compensated prediction frame memory 59 as a reference image (step ST41).
The filtering by the loop filter unit 58 may be performed per LCU or per individual CU of the decoded image signal output from the addition unit 56, or it may be performed for one whole picture at a time after the decoded image signals for the LCUs of one picture have been output.

With the above configuration, the moving picture decoding apparatus of FIG. 4 can decode an efficiently encoded bitstream and reproduce the video signal.
Embodiment 1 has described an efficient method of encoding the CU division state and a method of suitably decoding it. The TU division state can be efficiently encoded and suitably decoded by the same method.
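The division-state signaling summarized above (a maximum number of division layers plus division flags for every block that is not in the deepest layer) can be sketched as follows. This is a minimal illustration under stated assumptions: the tree representation (`None` for an undivided block, a list of four children for a divided one) and the names `max_depth`, `encode_partition`, and `decode_partition` are hypothetical, and the variable-length coding of the values themselves is not modeled.

```python
def max_depth(node):
    """Number of division layers of the deepest part of the quadtree."""
    if node is None:  # undivided block
        return 0
    return 1 + max(max_depth(child) for child in node)


def encode_partition(tree):
    """Return (maximum number of division layers, list of division flags)."""
    depth_limit = max_depth(tree)
    flags = []

    def visit(node, depth):
        if depth == depth_limit:
            return  # deepest layer: cannot be divided further, no flag sent
        flags.append(0 if node is None else 1)
        if node is not None:
            for child in node:
                visit(child, depth + 1)

    visit(tree, 0)
    return depth_limit, flags


def decode_partition(depth_limit, flags):
    """Rebuild the quadtree from the maximum depth and the division flags."""
    flag_iter = iter(flags)

    def build(depth):
        if depth == depth_limit or next(flag_iter) == 0:
            return None
        return [build(depth + 1) for _ in range(4)]

    return build(0)


if __name__ == "__main__":
    # One block divided into four; its second child is divided once more.
    tree = [None, [None, None, None, None], None, None]
    depth_limit, flags = encode_partition(tree)
    print(depth_limit, flags)
    print(decode_partition(depth_limit, flags) == tree)
```

In this sketch the four blocks at the deepest layer carry no flag, because the decoder already knows from the maximum number of division layers that they cannot be divided further. The same functions could be applied to a TU quadtree rooted at a CU.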

Within the scope of the invention, any component of the embodiment may be modified or omitted.

1 encoding control unit, 2 block dividing unit (block dividing means), 3 changeover switch (predicted image generating means), 4 intra prediction unit (predicted image generating means), 5 motion compensated prediction unit (predicted image generating means), 6 subtraction unit (image compression means), 7 transform/quantization unit (image compression means), 8 inverse quantization/inverse transform unit, 9 addition unit, 10 intra prediction memory, 11 loop filter unit, 12 motion compensated prediction frame memory, 13 variable length encoding unit (variable length encoding means), 51 variable length decoding unit (variable length decoding means), 52 changeover switch (predicted image generating means), 53 intra prediction unit (predicted image generating means), 54 motion compensated prediction unit (predicted image generating means), 55 inverse quantization/inverse transform unit (decoded image generating means), 56 addition unit (decoded image generating means), 57 intra prediction memory, 58 loop filter unit, 59 motion compensated prediction frame memory.

Claims

A moving picture encoding apparatus comprising: block dividing means for dividing an input image into macroblocks of a predetermined size and hierarchically dividing each macroblock in a quadtree structure, thereby outputting encoding processing unit blocks, which are blocks of an encoding unit smaller than the macroblock; predicted image generating means for generating a predicted image by performing prediction processing on an encoding processing unit block output from the block dividing means, in the encoding mode corresponding to that encoding processing unit block; image compression means for compressing a difference image between the encoding processing unit block output from the block dividing means and the predicted image generated by the predicted image generating means, and outputting compressed data of the difference image; and variable length encoding means for variable-length encoding the compressed data output from the image compression means and the encoding mode, and generating a bitstream in which the encoded data of the compressed data and of the encoding mode are multiplexed,
wherein the variable length encoding means variable-length encodes a maximum number of division layers indicating the number of division layers of the deepest part of the quadtree structure in each macroblock, and variable-length encodes a division flag indicating whether each encoding processing unit block other than the encoding processing unit blocks belonging to the deepest layer has been quadtree-divided by the block dividing means.

The moving picture encoding apparatus according to claim 1, wherein, when variable-length encoding the maximum number of division layers of each macroblock, the variable length encoding means switches the code length of the code representing the maximum number of division layers of the macroblock to be encoded according to the maximum number of division layers of macroblocks located around the macroblock to be encoded.

The moving picture encoding apparatus according to claim 1 or 2, wherein the variable length encoding means variable-length encodes the maximum number of division layers of the macroblock to be encoded only when the maximum numbers of division layers of the macroblocks located above and to the left of the macroblock to be encoded satisfy a predetermined condition, and, when the predetermined condition is not satisfied, variable-length encodes the division flags of all encoding processing unit blocks output from the block dividing means without variable-length encoding the maximum number of division layers.

The moving picture encoding apparatus according to claim 1, wherein the block dividing means hierarchically divides each encoding processing unit block in a quadtree structure, thereby outputting orthogonal transform processing unit blocks, which are blocks of an orthogonal transform unit smaller than the encoding processing unit block, and
the variable length encoding means variable-length encodes a maximum number of division layers indicating the number of division layers of the deepest part of the quadtree structure in each encoding processing unit block, and variable-length encodes a division flag indicating whether each orthogonal transform processing unit block other than the orthogonal transform processing unit blocks belonging to the deepest layer has been quadtree-divided by the block dividing means.

A moving picture decoding apparatus comprising: variable length decoding means for variable-length decoding, from encoded data multiplexed in a bitstream, compressed data and an encoding mode for each hierarchically divided encoding processing unit block; predicted image generating means for generating a predicted image by performing prediction processing on an encoding processing unit block in the encoding mode for that block variable-length decoded by the variable length decoding means; and decoded image generating means for generating a pre-compression difference image from the compressed data for the encoded block variable-length decoded by the variable length decoding means, and adding the difference image and the predicted image generated by the predicted image generating means to generate a decoded image,
wherein the variable length decoding means variable-length decodes, from the encoded data, the maximum number of division layers of each macroblock, variable-length decodes a division flag indicating whether each encoding processing unit block other than the encoding processing unit blocks belonging to the deepest layer corresponding to the maximum number of division layers is quadtree-divided, and decodes the quadtree structure of the macroblock from the maximum number of division layers and the division flags.

The moving picture decoding apparatus according to claim 5, wherein, when variable-length decoding the maximum number of division layers of each macroblock, the variable length decoding means determines the code length of the code representing the maximum number of division layers of the macroblock to be decoded from the maximum number of division layers of macroblocks located around the macroblock to be decoded.

The moving picture decoding apparatus according to claim 5, wherein the variable length decoding means variable-length decodes, from the encoded data, the maximum number of division layers of the macroblock to be decoded only when the maximum numbers of division layers of the macroblocks located above and to the left of the macroblock to be decoded satisfy a predetermined condition, and, when the predetermined condition is not satisfied, variable-length decodes the division flags of all encoding processing unit blocks from the encoded data and decodes the quadtree structure of the macroblock to be decoded from the division flags.

The moving picture decoding apparatus according to claim 5, wherein the variable length decoding means variable-length decodes, from the encoded data, the maximum number of division layers of each encoding processing unit block, variable-length decodes a division flag indicating whether each orthogonal transform processing unit block other than the orthogonal transform processing unit blocks belonging to the deepest layer corresponding to the maximum number of division layers is quadtree-divided, and decodes the quadtree structure of the encoding processing unit block from the maximum number of division layers and the division flags.

A moving picture encoding method comprising: a block division processing step in which block dividing means divides an input image into macroblocks of a predetermined size and hierarchically divides each macroblock in a quadtree structure, thereby outputting encoding processing unit blocks, which are blocks of an encoding unit smaller than the macroblock; a predicted image generation processing step in which predicted image generating means generates a predicted image by performing prediction processing on an encoding processing unit block output in the block division processing step, in the encoding mode corresponding to that encoding processing unit block; an image compression processing step in which image compression means compresses a difference image between the encoding processing unit block output in the block division processing step and the predicted image generated in the predicted image generation processing step, and outputs compressed data of the difference image; and a variable length encoding processing step in which variable length encoding means variable-length encodes the compressed data output in the image compression processing step and the encoding mode, and generates a bitstream in which the encoded data of the compressed data and of the encoding mode are multiplexed,
wherein, in the variable length encoding processing step, a maximum number of division layers indicating the number of division layers of the deepest part of the quadtree structure in each macroblock is variable-length encoded, and a division flag indicating whether each encoding processing unit block other than the encoding processing unit blocks belonging to the deepest layer has been quadtree-divided in the block division processing step is variable-length encoded.

A moving picture decoding method comprising: a variable length decoding processing step in which variable length decoding means variable-length decodes, from encoded data multiplexed in a bitstream, compressed data and an encoding mode for each hierarchically divided encoding processing unit block; a predicted image generation processing step in which predicted image generating means generates a predicted image by performing prediction processing on an encoding processing unit block in the encoding mode for that block variable-length decoded in the variable length decoding processing step; and a decoded image generation processing step in which decoded image generating means generates a pre-compression difference image from the compressed data for the encoded block variable-length decoded in the variable length decoding processing step, and adds the difference image and the predicted image generated in the predicted image generation processing step to generate a decoded image,
wherein, in the variable length decoding processing step, the maximum number of division layers of each macroblock is variable-length decoded from the encoded data, a division flag indicating whether each encoding processing unit block other than the encoding processing unit blocks belonging to the deepest layer corresponding to the maximum number of division layers is quadtree-divided is variable-length decoded, and the quadtree structure of the macroblock is decoded from the maximum number of division layers and the division flags.