KR102544554B1

KR102544554B1 - Method and Apparatus for Expressing Position of Non-zero Coefficients

Info

Publication number: KR102544554B1
Application number: KR1020220127684A
Authority: KR
Inventors: 임정연; 신재섭; 이선영; 손세훈
Original assignee: 에스케이텔레콤 주식회사
Priority date: 2017-07-31
Filing date: 2022-10-06
Publication date: 2023-06-16
Also published as: KR20220140676A; KR20220140678A; KR102544545B1; KR102544550B1; KR20220140677A; KR102544530B1; KR20220140675A; WO2019027200A1

Abstract

본 개시는 계수 블록 내 비-제로 계수들의 위치를 효율적으로 표현하는 것과 관련되어 있다. 본 개시의 일 측면에 따르면, 복수의 서브 블록들로 분할 가능한 계수 블록을 부호화함에 있어서, 계수 블럭에 대해, 유효 블록으로 결정된 블록을 단일 유효 계수에 도달될 때까지 균등한 크기의 작은 블록들로 반복적으로(recursively) 분할하면서, 생성된 각 블록들에 대해 유효 블록인지 여부를 시그널링한다. 본 개시의 다른 측면에 따르면, 복수의 서브 블록들로 분할 가능한 계수 블록을 부호화함에 있어서, 마지막 유효 서브 블록의 위치를 기초로 계수 블록 내 비-제로 계수들의 위치를 효율적으로 표현하는 방법이 제공된다. The present disclosure is concerned with efficiently representing the location of non-zero coefficients within a coefficient block. According to one aspect of the present disclosure, in encoding a coefficient block divisible into a plurality of sub-blocks, for a coefficient block, a block determined as an effective block is divided into small blocks of equal size until a single effective coefficient is reached. While dividing recursively, it signals whether or not it is a valid block for each generated block. According to another aspect of the present disclosure, in encoding a coefficient block divisible into a plurality of sub-blocks, a method of efficiently expressing the position of non-zero coefficients in a coefficient block based on the position of the last valid sub-block is provided. .

Description

Method and Apparatus for Expressing Position of Non-zero Coefficients

본 발명은 영상 부호화 또는 복호화에 관한 것으로, 보다 상세하게는, 비-제로 계수들의 위치를 표현하는 것과 관련되어 있다.The present invention relates to video encoding or decoding, and more particularly to representing the location of non-zero coefficients.

이 부분에 기술된 내용은 단순히 본 실시예에 대한 배경 정보를 제공할 뿐 종래 기술을 구성하는 것은 아니다. The contents described in this part merely provide background information on the present embodiment and do not constitute prior art.

영상 압축 기법들은 공간적 예측 및/또는 시간적 예측을 포함하여 비디오 시퀀스들 내에 내재된 리던던시(redundancy)를 감소시키거나 제거한다. 블록-기반 비디오 코딩에서는, 비디오 프레임 또는 슬라이스가 블록들로 파티셔닝 (partitioning) 될 수도 있다. 각각의 블록은 더 파티셔닝될 수 있다. 인트라-코딩된 (I) 프레임 또는 슬라이스 내의 블록들은 동일한 프레임 또는 슬라이스 내의 이웃하는 블록들 내의 참조 샘플들에 대한 공간적 예측을 사용하여 인코딩된다. 인터-코딩된 (P 또는 B) 프레임 또는 슬라이스 내의 블록들은 동일한 프레임 또는 슬라이스 내의 이웃하는 블록들 내의 참조 샘플들에 대한 공간적 예측 또는 다른 참조 프레임들 내의 참조 샘플들에 대한 시간적 예측을 사용할 수도 있다. 공간적 또는 시간적 예측은 결과적으로 부호화될 블록에 대한 예측 블록을 생성한다. 잔차 데이터(residual data)는 부호화될 원래의 블록과 예측 블록 사이의 픽셀 차분들을 나타낸다.Image compression techniques include spatial prediction and/or temporal prediction to reduce or remove redundancy inherent within video sequences. In block-based video coding, a video frame or slice may be partitioned into blocks. Each block can be further partitioned. Blocks within an intra-coded (I) frame or slice are encoded using spatial prediction with respect to reference samples within neighboring blocks within the same frame or slice. Blocks within an inter-coded (P or B) frame or slice may use spatial prediction with respect to reference samples within neighboring blocks within the same frame or slice or temporal prediction with respect to reference samples within other reference frames. Spatial or temporal prediction results in a predictive block for the block to be encoded. Residual data represents pixel differences between the original block to be encoded and the prediction block.

인터-코딩된 블록은 예측 블록을 형성하는 참조 샘플들의 블록을 가리키는 모션 벡터, 및 코딩된 블록과 예측 블록 사이의 차분을 나타내는 잔차 데이터에 따라 인코딩된다. 인트라-코딩된 블록은 인트라-코딩 모드 및 잔차 데이터에 따라 인코딩된다. 추가 압축을 위해, 잔차 데이터는 픽셀 도메인으로부터 변환 도메인으로 변환될 수도 있어, 잔차 변환 계수들을 발생시키고, 이 잔차 변환 계수들은 그 후에 양자화될 수도 있다. 2차원 어레이로 정렬되는 양자화된 변환 계수들은 특정한 순서로 스캐닝되어 엔트로피 코딩을 위한 변환 계수들의 1차원 벡터를 생성할 수도 있다.An inter-coded block is encoded according to a motion vector pointing to a block of reference samples forming a prediction block and residual data representing a difference between the coded block and the prediction block. An intra-coded block is encoded according to an intra-coding mode and residual data. For further compression, the residual data may be transformed from the pixel domain to the transform domain to generate residual transform coefficients, which may then be quantized. Quantized transform coefficients arranged in a two-dimensional array may be scanned in a specific order to generate a one-dimensional vector of transform coefficients for entropy coding.

본 발명은 복수의 (서브) 블록으로 분할 가능한 계수들의 어레이를 갖는 계수 블록에서 비-제로 계수들의 위치를 효율적으로 표현하는 데 그 주된 목적이 있다. The main object of the present invention is to efficiently represent the positions of non-zero coefficients in a coefficient block having an array of coefficients divisible into a plurality of (sub)blocks.

본 발명의 일 측면에 의하면, 계수 블록에서 비-제로(non-zero) 계수들인 유효 계수들(significant coefficients)의 분포를 부호화하는 것을 포함하는 영상 부호화 방법을 제공한다. 상기 영상 부호화 방법은, 현재의 계수 블록이 적어도 하나의 비-제로 계수를 갖는 유효 블록인지 여부를 가리키는 유효 플래그(significant flag)를 부호화하는 단계; 상기 유효 플래그가 상기 현재의 계수 블록이 유효 블록임을 가리키는 경우에, 상기 현재의 계수 블록에 대해, 유효 블록으로 결정된 서브 블록을, 단일의 유효 계수에 도달될 때까지, n개(n은 2 이상의 자연수)의 균등한 크기의 서브 블록들로 반복적으로(recursively) 분할하면서, 상기 n개의 서브 블록들이 각각 유효 블록인지 여부를 결정하는 단계; 상기 n개의 서브 블록들의 크기가 기설정된 임계값보다 크거나 같으면, 상기 n개의 서브 블록들에 관련된 유효 플래그의 부호화를 스킵(skip)하는 단계; 및 상기 n개의 서브 블록들의 크기가 기설정된 임계값보다 작으면, 상기 n개의 서브 블록들에 관련된 유효 플래그를 부호화하는 단계를 포함한다. According to one aspect of the present invention, an image encoding method is provided, which includes encoding a distribution of significant coefficients that are non-zero coefficients in a coefficient block. The video encoding method may include encoding a significant flag indicating whether a current coefficient block is a valid block having at least one non-zero coefficient; When the valid flag indicates that the current coefficient block is a valid block, for the current coefficient block, sub-blocks determined as valid blocks are selected in n (n is 2 or more determining whether each of the n sub-blocks is a valid block while recursively dividing the sub-blocks into equal-sized sub-blocks of a natural number); skipping encoding of valid flags related to the n sub-blocks if the sizes of the n sub-blocks are greater than or equal to a preset threshold; and encoding valid flags related to the n sub-blocks if the sizes of the n sub-blocks are smaller than a preset threshold.

본 발명의 다른 측면에 의하면, 계수 블록에서 비-제로 계수들인 유효 계수들의 분포를 결정하는 것을 포함하는 영상 복호화 방법을 제공한다. 상기 영상 복호화 방법은, 비트 스트림으로부터, 현재의 계수 블록이 적어도 하나의 비-제로 계수를 갖는 유효 블록인지 여부를 가리키는 유효 플래그(significant flag)를 파싱하는 단계; 상기 유효 플래그가 상기 현재의 계수 블록이 유효 블록임을 가리키는 경우에, 상기 현재의 계수 블록에 대해, 유효 블록으로 결정된 서브 블록을, 단일의 유효 계수에 도달될 때까지, n개(n은 2 이상의 자연수)의 균등한 크기의 서브 블록들로 반복적으로(recursively) 분할하면서, 상기 n개의 서브 블록들이 각각 유효 블록인지 여부를 결정하는 단계를 포함한다. 여기서, 상기 n개의 서브 블록들이 각각 유효 블록인지 여부를 결정하는 단계는, 상기 n개의 서브 블록들의 크기가 기설정된 임계값보다 크거나 같으면, 상기 n개의 서브 블록들에 관련된 유효 플래그의 파싱을 스킵하는 단계; 및 상기 n개의 서브 블록들의 크기가 기설정된 임계값보다 작으면, 상기 n개의 서브 블록들에 관련된 유효 플래그들을 파싱하는 단계를 더 포함할 수 있다.According to another aspect of the present invention, an image decoding method including determining a distribution of significant coefficients that are non-zero coefficients in a coefficient block is provided. The video decoding method may include parsing a significant flag indicating whether a current coefficient block is a valid block having at least one non-zero coefficient, from a bit stream; When the valid flag indicates that the current coefficient block is a valid block, for the current coefficient block, sub-blocks determined as valid blocks are selected in n (n is 2 or more natural number) of equally sized sub-blocks, and determining whether each of the n sub-blocks is a valid block. In the step of determining whether each of the n sub-blocks is a valid block, if the size of the n sub-blocks is greater than or equal to a preset threshold, parsing of valid flags related to the n sub-blocks is skipped. doing; and parsing valid flags related to the n sub-blocks when the sizes of the n sub-blocks are smaller than a preset threshold.

본 발명의 또 다른 측면에 의하면, 계수 블록에서 비-제로 계수들인 유효 계수들의 분포를 결정하는 것을 포함하는 영상 복호화 장치로서, 메모리; 및 하나 이상의 프로세서들을 포함하며, 상기 하나 이상의 프로세서들은, 다음과 같은 단계들을 포함하는 방법을 수행하도록 구성된다. 상기 방법은 비트 스트림으로부터, 현재의 계수 블록이 적어도 하나의 비-제로 계수를 갖는 유효 블록인지 여부를 가리키는 유효 플래그(significant flag)를 파싱하는 단계; 상기 유효 플래그가 상기 현재의 계수 블록이 유효 블록임을 가리키는 경우에, 상기 현재의 계수 블록에 대해, 유효 블록으로 결정된 서브 블록을, 단일의 유효 계수에 도달될 때까지, n개의 균등한 크기의 서브 블록들로 반복적으로(recursively) 분할하면서, 상기 n개의 서브 블록들이 각각 유효 블록인지 여부를 결정하는 단계를 수행하도록 구성된다. 여기서, 상기 n개의 서브 블록들이 각각 유효 블록인지 여부를 결정하는 단계는, 상기 n개의 서브 블록들의 크기가 기설정된 임계값보다 크거나 같으면, 상기 n개의 서브 블록들에 관련된 유효 플래그의 파싱을 스킵하는 단계; 및 상기 n개의 서브 블록들의 크기가 기설정된 임계값보다 작으면, 상기 n개의 서브 블록들에 관련된 유효 플래그들을 파싱하는 단계를 포함한다. According to another aspect of the present invention, an image decoding apparatus comprising determining a distribution of significant coefficients that are non-zero coefficients in a coefficient block, comprising: a memory; and one or more processors, the one or more processors configured to perform a method comprising the following steps. The method includes parsing a significant flag from the bit stream indicating whether a current coefficient block is a valid block having at least one non-zero coefficient; When the valid flag indicates that the current coefficient block is a valid block, for the current coefficient block, sub-blocks determined as valid blocks are divided into n equally sized sub-blocks until a single effective coefficient is reached. and performing the step of determining whether each of the n sub-blocks is a valid block while recursively dividing into blocks. In the step of determining whether each of the n sub-blocks is a valid block, if the size of the n sub-blocks is greater than or equal to a preset threshold, parsing of valid flags related to the n sub-blocks is skipped. doing; and parsing valid flags related to the n sub-blocks when the sizes of the n sub-blocks are smaller than a preset threshold.

본 발명의 또 다른 측면에 의하면, 복수의 서브 블록들의 계수들로 분할가능한 계수들의 어레이를 갖는 계수 블록을 부호화하는 방법을 제공한다. 상기 계수 블록의 부호화 방법은, 마지막 유효 서브 블록을 결정하는 단계, 여기서 상기 마지막 유효 서브 블록은 적어도 하나의 비-제로(non-zero) 계수를 갖는 서브 블록 스캔 순서상 마지막 서브 블록임; 상기 결정된 마지막 유효 서브 블록의 위치를 가리키는 정보를 부호화하는 단계; 상기 서브 블록 스캔 순서상 상기 마지막 유효 서브 블록에 선행하는 서브 블록들 중 상기 계수 블록의 좌상귀 서브 블록을 제외한 각각의 서브 블록이 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록인지 여부를 가리키는 각각의 서브 블록에 대한 신택스 요소를 부호화하는 단계; 및 상기 마지막 유효 서브 블록의 계수들과, 상기 좌상귀 서브 블록의 계수들과, 상기 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록임을 가리키는 서브 블록들의 계수들을 부호화하는 단계를 포함한다.According to another aspect of the present invention, a method of encoding a coefficient block having an array of coefficients divisible into coefficients of a plurality of sub-blocks is provided. The coding method of the coefficient block may include determining a last valid subblock, wherein the last valid subblock is a last subblock in a subblock scan order having at least one non-zero coefficient; encoding information indicating the location of the determined last valid subblock; Each subblock indicating whether each of the subblocks preceding the last valid subblock in the subblock scan order, except for the upper left subblock of the coefficient block, is a valid subblock having at least one non-zero coefficient. Encoding syntax elements for sub-blocks; and encoding the coefficients of the last valid subblock, the coefficients of the upper left subblock, and the coefficients of the subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient.

본 발명의 또 다른 측면에 의하면, 복수의 서브 블록들의 계수들로 분할가능한 계수들의 어레이를 갖는 계수 블록을 복호화하는 방법을 제공한다. 상기 계수 블록의 복호화 방법은, 마지막 유효 서브 블록의 위치를 가리키는 정보를 복호화하는 단계 - 상기 마지막 유효 서브 블록은 적어도 하나의 비-제로 계수를 갖는 상기 서브 블록 스캔 순서상 마지막 서브 블록임 - ; 상기 서브 블록 스캔 순서상 상기 마지막 유효 서브 블록에 선행하는 서브 블록들 중 상기 계수 블록의 좌상귀 서브 블록을 제외한 각각의 서브 블록이 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록인지 여부를 가리키는 각각의 서브 블록에 대한 신택스 요소를 복호화하는 단계; 및 상기 마지막 유효 서브 블록의 계수들과, 상기 좌상귀 서브 블록의 계수들과, 상기 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록임을 가리키는 서브 블록들의 계수들을 복호화하는 단계를 포함한다.According to another aspect of the present invention, a method of decoding a coefficient block having an array of coefficients divisible into coefficients of a plurality of sub-blocks is provided. The method of decoding the coefficient block may include decoding information indicating a position of a last valid subblock, wherein the last valid subblock is the last subblock having at least one non-zero coefficient in the subblock scan order; Each subblock indicating whether each of the subblocks preceding the last valid subblock in the subblock scan order, except for the upper left subblock of the coefficient block, is a valid subblock having at least one non-zero coefficient. decoding syntax elements for sub-blocks; and decoding coefficients of the last valid subblock, coefficients of the upper left subblock, and coefficients of subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient.

본 발명의 또 다른 측면에 의하면, 복수의 서브 블록들의 계수들로 분할가능한 계수들의 어레이를 갖는 계수 블록을 복호화하는 장치를 제공한다. 상기 장치는 메모리; 및 하나 이상의 프로세서들을 포함하며, 상기 하나 이상의 프로세서들은, 다음과 같은 단계들을 포함하는 방법을 수행하도록 구성된다. 상기 방법은, 마지막 유효 서브 블록의 위치를 가리키는 정보를 복호화하는 단계 - 상기 마지막 유효 서브 블록은 적어도 하나의 비-제로 계수를 갖는 상기 서브 블록 스캔 순서상 마지막 서브 블록임 - ; 상기 서브 블록 스캔 순서상 상기 마지막 유효 서브 블록에 선행하는 서브 블록들 중 상기 계수 블록의 좌상귀 서브 블록을 제외한 각각의 서브 블록이 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록인지 여부를 가리키는 각각의 서브 블록에 대한 신택스 요소를 복호화하는 단계; 및 상기 마지막 유효 서브 블록의 계수들과, 상기 좌상귀 서브 블록의 계수들과, 상기 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록임을 가리키는 서브 블록들의 계수들을 복호화하는 단계를 포함한다.According to another aspect of the present invention, an apparatus for decoding a coefficient block having an array of coefficients divisible into coefficients of a plurality of sub-blocks is provided. The device includes a memory; and one or more processors, the one or more processors configured to perform a method comprising the following steps. The method includes decoding information indicating a location of a last valid sub-block, wherein the last valid sub-block is a last sub-block in the sub-block scan order having at least one non-zero coefficient; Each subblock indicating whether each of the subblocks preceding the last valid subblock in the subblock scan order, except for the upper left subblock of the coefficient block, is a valid subblock having at least one non-zero coefficient. decoding syntax elements for sub-blocks; and decoding coefficients of the last valid subblock, coefficients of the upper left subblock, and coefficients of subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient.

도 1은 본 개시의 기술들을 구현할 수 있는 영상 부호화 장치에 대한 예시적인 블록도이다.
도 2는 현재 블록의 주변블록에 대한 예시도이다.
도 3은 본 개시의 기술들을 구현할 수 있는 영상 복호화 장치에 대한 예시적인 블록도이다.
도 4는 정사각형 계수 블록의 양자화된 계수들의 부호화에 이용되는 예시적인 스캔 방식들을 나타낸 도면이다.
도 5는 diagonal 스캔 방식에 대한 더 상세한 서브 블록 및 계수들의 스캔 순서를 예시한 도면이다.
도 6은 32×32 계수 블록의 일례를 도시한 도면이다.
도 7은 도 6의 32×32 계수 블록에서 스캔 순서상 마지막 유효 서브 블록에 선행하는 서브 블록들을 표시한 도면이다.
도 8은 도 6의 32×32 계수 블록에서 스캔 순서상 마지막 유효 서브 블록에 후행하는 서브 블록들을 표시한 도면이다.
도 9는 본 발명의 일 실시예에 따른 영상 부호화 장치가 계수 블록을 부호화하는 방법을 도시한 흐름도이다.
도 10은 본 발명의 일 실시예에 따른 영상 복호화 장치가 계수 블록을 부호화하는 방법을 도시한 흐름도이다.
도 11은, 도 6에 예시된 32×32 블록에 대해, 유효 블록을 단일 유효 계수에 도달될 때까지 계층적으로 분할함에 따라 생성된 블록들을 표시한 도면이다.
도 12는 도 11의 계수 블록에서 생성되는 블록들을 diagonal 방식으로 스캔하여 구성한 트리이다.
도 13은 3개의 계층으로 구성된 간단한 구조의 트리를 예시한 도면이다.
도 14는 64×64 계수 블록을 예시한 도면이다.
도 15는 영상 복호화 장치가 큰 계수 블록에 대한 처리 과정을 도시한 흐름도이다. 1 is an exemplary block diagram of an image encoding apparatus capable of implementing the techniques of this disclosure.
2 is an exemplary diagram of neighboring blocks of a current block.
3 is an exemplary block diagram of a video decoding apparatus capable of implementing the techniques of this disclosure.
4 is a diagram illustrating exemplary scan schemes used for encoding quantized coefficients of a square coefficient block.
5 is a diagram illustrating a more detailed scan sequence of subblocks and coefficients for a diagonal scan method.
6 is a diagram showing an example of a 32×32 coefficient block.
FIG. 7 is a diagram showing subblocks preceding the last valid subblock in the scan order in the 32×32 coefficient block of FIG. 6 .
FIG. 8 is a diagram showing subblocks following the last effective subblock in scan order in the 32×32 coefficient block of FIG. 6 .
9 is a flowchart illustrating a method of encoding a coefficient block by an image encoding apparatus according to an embodiment of the present invention.
10 is a flowchart illustrating a method of encoding a coefficient block by an image decoding apparatus according to an embodiment of the present invention.
FIG. 11 is a diagram showing blocks generated by hierarchically dividing an effective block until a single effective coefficient is reached for the 32×32 block illustrated in FIG. 6 .
12 is a tree formed by scanning blocks generated from the coefficient blocks of FIG. 11 in a diagonal manner.
13 is a diagram illustrating a tree of a simple structure composed of three layers.
14 is a diagram illustrating a 64x64 coefficient block.
15 is a flowchart illustrating a process of processing a large coefficient block by an image decoding apparatus.

이하, 본 발명의 일부 실시예들을 예시적인 도면을 통해 상세하게 설명한다. 각 도면의 구성 요소들에 식별 부호를 부가함에 있어서, 동일한 구성요소들에 대해서는 비록 다른 도면상에 표시되더라도 가능한 한 동일한 부호를 가지도록 하고 있음에 유의해야 한다. 또한, 본 발명을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.Hereinafter, some embodiments of the present invention will be described in detail through exemplary drawings. It should be noted that in adding identification codes to components of each drawing, the same components have the same symbols as much as possible even if they are displayed on different drawings. In addition, in describing the present invention, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present invention, the detailed description will be omitted.

본 개시의 기술들은, 일반적으로, 변환 및 양자화의 결과물인 양자화된 계수들의 어레이인 계수 블록에 대해 비-제로 계수(즉, 유효 계수; significant coefficient)들의 위치를 효율적으로 표현하는 것과 관련된다. The techniques of this disclosure generally relate to efficiently representing the location of non-zero coefficients (ie, significant coefficients) with respect to a coefficient block, which is an array of quantized coefficients that result from transform and quantization.

도 1은 본 개시의 기술들을 구현할 수 있는 영상 부호화 장치에 대한 예시적인 블록도이다.1 is an exemplary block diagram of an image encoding apparatus capable of implementing the techniques of this disclosure.

영상 부호화 장치는 블록 분할부(110), 예측부(120), 감산기(130), 변환부(140), 양자화부(145), 부호화부(150), 역양자화부(160), 역변환부(165), 가산기(170), 필터부(180) 및 메모리(190)를 포함한다. 영상 부호화 장치는 각 구성요소는 하드웨어 칩으로 구현될 수 있으며, 또는 소프트웨어로 구현되고 마이크로프로세서가 각 구성요소에 대응하는 소프트웨어의 기능을 실행하도록 구현될 수도 있다.The image encoding apparatus includes a block division unit 110, a prediction unit 120, a subtractor 130, a transform unit 140, a quantization unit 145, an encoding unit 150, an inverse quantization unit 160, an inverse transform unit ( 165), an adder 170, a filter unit 180, and a memory 190. Each component of the image encoding apparatus may be implemented as a hardware chip or may be implemented as software and a microprocessor may execute software functions corresponding to each component.

블록 분할부(110)는 영상을 구성하는 각 픽처(picture)를 복수의 CTU(Coding Tree Unit)으로 분할한 이후에, CTU를 트리 구조(tree structure)를 이용하여 반복적으로(recursively) 분할한다. 트리 구조에서의 리프 노드(leaf node)가 부호화의 기본 단위인 CU (coding unit)가 된다. 트리 구조로는 상위 노드(혹은 부모 노드)가 동일한 크기의 네 개의 하위 노드(혹은 자식 노드)로 분할되는 쿼드트리(QuadTree, QT), 또는 이러한 QT 구조 및 상위 노드가 두 개의 하위 노드로 분할되는 바이너리트리(BinaryTree, BT) 구조를 혼용한 QTBT (QuadTree plus BinaryTree) 구조가 사용될 수 있다. 즉, CTU를 다수의 CU로 분할하기 위해 QTBT를 사용할 수 있다.The block divider 110 divides each picture constituting an image into a plurality of Coding Tree Units (CTUs), and then recursively divides the CTUs using a tree structure. A leaf node in the tree structure becomes a coding unit (CU), which is a basic unit of encoding. As a tree structure, a QuadTree (QT) in which a parent node (or parent node) is divided into four subnodes (or child nodes) of the same size, or a QT structure and a parent node are divided into two subnodes A QTBT (QuadTree plus BinaryTree) structure using a binary tree (BinaryTree, BT) structure may be used. That is, QTBT can be used to divide a CTU into multiple CUs.

QTBT (QuadTree plus BinaryTree) 구조에서, CTU는 먼저 QT 구조로 분할될 수 있다. 쿼드트리 분할은 분할 블록(splitting block)의 크기가 QT에서 허용되는 리프 노드의 최소 블록 크기(MinQTSize)에 도달 할 때까지 반복 될 수 있다. 쿼드트리의 리프 노드가 BT에서 허용되는 루트 노드의 최대 블록 크기(MaxBTSize)보다 크지 크지 않은 경우, BT 구조로 더 파티셔닝될 수 있다. BT에서는 복수의 분할 타입이 존재할 수 있다. 예컨대, 일부 예시에서, 해당 노드의 블록을 동일 크기의 두 개 블록으로 가로로 분할하는 타입(즉, symmetric horizontal splitting)과 세로로 분할하는 타입(즉, symmetric vertical splitting) 두 가지가 존재할 수 있다. 또한, 해당 노드의 블록을 서로 비대칭 형태의 두 개의 블록으로 분할하는 타입이 추가로 더 존재할 수도 있다. 비대칭 형태로는 해당 노드의 블록을 1:3의 크기 비율을 가지는 두 개의 직사각형 블록으로 분할하는 형태를 포함할 수 있고, 혹은 해당 노드의 블록을 대각선 방향으로 분할하는 형태를 포함할 수도 있다.In the QTBT (QuadTree plus BinaryTree) structure, the CTU may first be divided into the QT structure. Quadtree splitting can be repeated until the size of the splitting block reaches the minimum block size (MinQTSize) of leaf nodes allowed by QT. If the leaf node of the quad tree is not larger than the maximum block size (MaxBTSize) of the root node allowed in BT, it can be further partitioned into a BT structure. In BT, a plurality of partition types may exist. For example, in some examples, there may be two types of horizontally splitting a block of a corresponding node into two equal-sized blocks (ie, symmetric horizontal splitting) and a vertical splitting type (ie, symmetric vertical splitting). In addition, a type in which a block of a corresponding node is divided into two blocks having an asymmetric shape may additionally exist. The asymmetric form may include a form in which the block of the corresponding node is divided into two rectangular blocks having a size ratio of 1:3, or a form in which the block of the corresponding node is divided in a diagonal direction.

블록 분할부(110)가 QTBT 구조에 의해 CTU를 분할하여 생성하는 분할 정보는 부호화부(150)에 의해 부호화되어 영상 복호화 장치로 전달된다. Split information generated by the block division unit 110 by dividing the CTU by the QTBT structure is encoded by the encoder 150 and transmitted to the video decoding apparatus.

이하에서는, 부호화 또는 복호화하고자 하는 CU(즉, QTBT의 리프 노드)에 해당하는 블록을 '현재 블록'이라 칭한다.Hereinafter, a block corresponding to a CU to be encoded or decoded (ie, a leaf node of QTBT) is referred to as a 'current block'.

예측부(120)는 현재 블록을 예측하여 예측블록을 생성한다. 예측부(120)는 인트라 예측부(122)와 인터 예측부(124)를 포함한다. The prediction unit 120 predicts a current block and generates a prediction block. The prediction unit 120 includes an intra prediction unit 122 and an inter prediction unit 124 .

일반적으로, 픽처 내 현재 블록들은 각각 예측적으로 코딩될 수 있다. 현재 블록의 예측은 (현재 블록을 포함하는 픽처으로부터의 데이터를 사용하는) 인트라 예측 기술 또는 (현재 블록을 포함하는 픽처에 대해 이전에 코딩된 픽처로부터의 데이터를 사용하는) 인터 예측 기술을 사용하여 일반적으로 수행될 수 있다. 인터 예측은 단방향 예측과 양방향 예측 모두를 포함한다.In general, each current block in a picture can be coded predictively. Prediction of the current block is performed using intra-prediction techniques (using data from the picture containing the current block) or inter-prediction techniques (using data from pictures previously coded for the picture containing the current block). can be done normally. Inter prediction includes both uni-prediction and bi-prediction.

각각의 인터 예측된 블록에 대해, 움직임 정보 세트가 이용 가능할 수 있다. 한 세트의 움직임 정보는 순방향(forward) 및 역방향(backward) 예측 방향에 대한 움직임 정보를 포함할 수 있다. 여기서, 순방향 및 역방향 예측 방향은 양방향(bi-directional) 예측 모드의 2개의 예측 방향이고, 용어 "순방향" 및 "역방향"은 반드시 기하학적 의미를 가질 필요는 없다. 대신에, 이들은 일반적으로 참조 픽처가 현재 픽처 전에("역방향") 또는 후에("순방향")에 표시될지 여부에 대응한다. 일부 예에서, "순방향" 및 "역방향" 예측 방향은 현재 픽처의 참조 픽처 리스트 0(RefPicList0) 및 참조 픽처 리스트 1(RefPicList1)에 대응할 수 있다. For each inter predicted block, a set of motion information may be available. A set of motion information may include motion information for forward and backward prediction directions. Here, forward and backward prediction directions are two prediction directions of a bi-directional prediction mode, and the terms "forward direction" and "backward direction" do not necessarily have geometric meanings. Instead, they generally correspond to whether the reference picture is to be presented before (“backwards”) or after (“forward”) the current picture. In some examples, the “forward” and “backward” prediction directions may correspond to reference picture list 0 (RefPicList0) and reference picture list 1 (RefPicList1) of the current picture.

각 예측 방향에 대해, 움직임 정보는 참조 인덱스 및 움직임 벡터를 포함한다. 참조 인덱스는 현재 참조 픽처 리스트 (RefPicList0 또는 RefPicList1) 내의 참조 픽처를 식별하는데 사용될 수 있다. 움직임 벡터는 수평(x) 및 수직(y) 성분을 갖는다. 일반적으로, 수평 성분은 참조 블록의 x 좌표를 위치 시키는데 필요한, 현재 픽처에서의 현재 블록의 위치에 상대적인 참조 픽처 내의 수평 변위(horizontal displacement)를 나타낸다. 수직 성분은 참조 블록의 y 좌표를 위치 시키는데 필요한, 현재 블록의 위치에 상대적인 참조 픽처 내의 수직 변위(vertical displacement)를 나타낸다.For each prediction direction, the motion information includes a reference index and a motion vector. A reference index can be used to identify a reference picture in the current reference picture list (RefPicList0 or RefPicList1). A motion vector has horizontal (x) and vertical (y) components. In general, the horizontal component represents the horizontal displacement in the reference picture relative to the position of the current block in the current picture, necessary to locate the x-coordinate of the reference block. The vertical component represents a vertical displacement in the reference picture relative to the position of the current block, required to locate the y-coordinate of the reference block.

인터 예측부(124)는 현재 픽처보다 먼저 부호화 및 복호화된 참조 픽처 내에서 현재 블록과 가장 유사한 블록을 탐색하고, 그 탐색된 블록을 이용하여 현재 블록에 대한 예측블록을 생성한다. 그리고, 현재 픽처 내의 현재 블록과 참조 픽처 내의 예측블록 간의 변위(displacement)에 해당하는 움직임벡터(motion vector)를 생성한다. 일반적으로, 움직임 추정은 루마(luma) 성분에 대해 수행되고, 루마 성분에 기초하여 계산된 모션 벡터는 루마 성분 및 크로마 성분 모두에 대해 사용된다. 현재 블록을 예측하기 위해 사용된 참조 픽처에 대한 정보 및 움직임벡터에 대한 정보를 포함하는 움직임 정보는 부호화부(150)에 의해 부호화되어 영상 복호화 장치로 전달된다. The inter-prediction unit 124 searches for a block most similar to the current block within the encoded and decoded reference picture prior to the current picture, and generates a prediction block for the current block using the searched block. Then, a motion vector corresponding to displacement between the current block in the current picture and the prediction block in the reference picture is generated. In general, motion estimation is performed on the luma component, and a motion vector calculated based on the luma component is used for both the luma component and the chroma component. Motion information including information on reference pictures and motion vectors used to predict the current block is encoded by the encoder 150 and transmitted to the video decoding apparatus.

움직임 정보를 부호화하는 데에 소요되는 비트량을 최소화하기 위해 다양한 방법이 사용될 수 있다.Various methods may be used to minimize the amount of bits required to encode motion information.

예컨대, 현재블록의 참조 픽처와 움직임벡터가 주변블록의 참조 픽처 및 움직임벡터와 동일한 경우에는 그 주변블록을 식별할 수 있는 정보를 부호화함으로써, 현재블록의 움직임 정보를 복호화 장치로 전달할 수 있다. 이러한 방법을 '머지 모드 (merge mode)'라 한다.For example, when the reference picture and motion vector of the current block are the same as the reference picture and motion vector of the neighboring block, the motion information of the current block can be transmitted to the decoding apparatus by encoding information capable of identifying the neighboring block. This method is called 'merge mode'.

머지 모드에서, 인터 예측부(124)는 현재블록의 주변블록들로부터 기 결정된 개수의 머지 후보블록(이하, '머지 후보'라 함)들을 선택한다. In the merge mode, the inter prediction unit 124 selects a predetermined number of merge candidate blocks (hereinafter referred to as 'merge candidates') from neighboring blocks of the current block.

머지 후보를 유도하기 위한 주변블록으로는, 도 2에 도시된 바와 같이, 현재 픽처 내에서 현재블록에 인접한 좌측블록(L), 상단블록(A), 우상단블록(AR), 좌하단블록(BL), 좌상단블록(AL) 중에서 전부 또는 일부가 사용될 수 있다. 또한, 현재블록이 위치한 현재 픽처가 아닌 참조 픽처(현재블록을 예측하기 위해 사용된 참조 픽처와 동일할 수도 있고 다를 수도 있음) 내에 위치한 블록이 머지 후보로서 사용될 수도 있다. 예컨대, 참조 픽처 내에서 현재블록과 동일 위치에 있는 블록(co-located block) 또는 그 동일 위치의 블록에 인접한 블록들이 머지 후보로서 추가로 더 사용될 수 있다. Neighboring blocks for deriving a merge candidate include a left block (L) adjacent to the current block in the current picture, an upper block (A), an upper right block (AR), and a lower left block (BL), as shown in FIG. ), all or part of the upper left block AL may be used. Also, a block located in a reference picture (which may be the same as or different from a reference picture used to predict the current block) other than the current picture in which the current block is located may be used as a merge candidate. For example, a block co-located with the current block in the reference picture or blocks adjacent to the co-located block may be additionally used as a merge candidate.

인터 예측부(124)는 이러한 주변블록들을 이용하여 기 결정된 개수의 머지 후보를 포함하는 머지 리스트를 구성한다. 머지 리스트에 포함된 머지 후보들 중에서 현재블록의 움직임정보로서 사용할 머지 후보를 선택하고 선택된 후보를 식별하기 위한 머지 인덱스 정보를 생성한다. 생성된 머지 인덱스 정보는 부호화부(150)에 의해 부호화되어 복호화 장치로 전달된다.The inter prediction unit 124 constructs a merge list including a predetermined number of merge candidates using these neighboring blocks. Among the merge candidates included in the merge list, a merge candidate to be used as motion information of the current block is selected, and merge index information for identifying the selected candidate is generated. The generated merge index information is encoded by the encoder 150 and transmitted to the decoding device.

움직임 정보를 부호화하는 또 다른 방법은 차분 움직임벡터를 부호화하는 것이다.Another method of encoding motion information is encoding differential motion vectors.

이 방법에서, 인터 예측부(124)는 현재블록의 주변블록들을 이용하여 현재블록의 움직임벡터에 대한 예측 움직임벡터 후보들을 유도한다. 예측 움직임벡터 후보들을 유도하기 위해 사용되는 주변블록으로는, 도 5에 도시된 현재 픽처 내에서 현재블록에 인접한 좌측블록(L), 상단블록(A), 우상단블록(AR), 좌하단블록(BL), 좌상단블록(AL) 중에서 전부 또는 일부가 사용될 수 있다. 또한, 현재블록이 위치한 현재 픽처가 아닌 참조 픽처(현재블록을 예측하기 위해 사용된 참조 픽처와 동일할 수도 있고 다를 수도 있음) 내에 위치한 블록이 예측 움직임벡터 후보들을 유도하기 위해 사용되는 주변블록으로서 사용될 수도 있다. 예컨대, 참조 픽처 내에서 현재블록과 동일 위치에 있는 블록(co-located block) 또는 그 동일 위치의 블록에 인접한 블록들이 사용될 수 있다.In this method, the inter prediction unit 124 derives predicted motion vector candidates for the motion vector of the current block by using neighboring blocks of the current block. Neighboring blocks used to derive predictive motion vector candidates include a left block (L), an upper block (A), an upper right block (AR), and a lower left block adjacent to the current block in the current picture shown in FIG. 5 ( BL), and all or part of the upper left block (AL) may be used. In addition, a block located in a reference picture (which may be the same as or different from the reference picture used to predict the current block) other than the current picture in which the current block is located will be used as a neighboring block used to derive motion vector candidates. may be For example, a block co-located with the current block in the reference picture or blocks adjacent to the co-located block may be used.

인터 예측부(124)는 이 주변블록들의 움직임벡터를 이용하여 예측 움직임벡터 후보들을 유도하고, 예측 움직임벡터 후보들을 이용하여 현재블록의 움직임벡터에 대한 예측 움직임벡터를 결정한다. 그리고, 현재블록의 움직임벡터로부터 예측 움직임벡터를 감산하여 차분 움직임벡터를 산출한다. The inter-prediction unit 124 derives predicted motion vector candidates using the motion vectors of the neighboring blocks, and determines a predicted motion vector for the motion vector of the current block using the predicted motion vector candidates. Then, a differential motion vector is calculated by subtracting the predicted motion vector from the motion vector of the current block.

예측 움직임벡터는 예측 움직임벡터 후보들에 기 정의된 함수(예컨대, 중앙값, 평균값 연산 등)를 적용하여 구할 수 있다. 이 경우, 영상 복호화 장치도 기 정의된 함수를 알고 있다. 또한, 예측 움직임벡터 후보를 유도하기 위해 사용하는 주변블록은 이미 부호화 및 복호화가 완료된 블록이므로 영상 복호화 장치도 그 주변블록의 움직임벡터도 이미 알고 있다. 그러므로 영상 부호화 장치는 예측 움직임벡터 후보를 식별하기 위한 정보를 부호화할 필요가 없다. 따라서, 이 경우에는 차분 움직임벡터에 대한 정보와 현재블록을 예측하기 위해 사용한 참조 픽처에 대한 정보가 부호화된다.The predicted motion vector may be obtained by applying a predefined function (eg, median value, average value operation, etc.) to predicted motion vector candidates. In this case, the video decoding apparatus also knows the predefined function. In addition, since a neighboring block used to derive a predicted motion vector candidate is a block that has already been encoded and decoded, the video decoding apparatus also knows the motion vector of the neighboring block. Therefore, the video encoding apparatus does not need to encode information for identifying a predictive motion vector candidate. Therefore, in this case, information on differential motion vectors and information on reference pictures used to predict the current block are encoded.

한편, 예측 움직임벡터는 예측 움직임벡터 후보들 중 어느 하나를 선택하는 방식으로 결정될 수도 있다. 이 경우에는 차분 움직임벡터에 대한 정보 및 현재블록을 예측하기 위해 사용한 참조 픽처에 대한 정보와 함께, 선택된 예측 움직임벡터 후보를 식별하기 위한 정보가 추가로 부호화된다.Meanwhile, the predicted motion vector may be determined by selecting one of the predicted motion vector candidates. In this case, along with information on differential motion vectors and information on reference pictures used to predict the current block, information for identifying the selected predictive motion vector candidate is additionally encoded.

인트라 예측부(122)는 현재 블록이 포함된 현재 픽처 내에서 현재 블록의 주변에 위치한 픽셀(참조 픽셀)들을 이용하여 현재 블록 내의 픽셀들을 예측한다. 예측 방향에 따라 복수의 인트라 예측모드가 존재하며, 각 예측모드에 따라 사용할 주변 픽셀과 연산식이 다르게 정의된다. 특히, 인트라 예측부(122)는 현재 블록을 부호화하는데 사용할 인트라 예측 모드를 결정할 수 있다. 일부 예들에서, 인트라 예측부(122)는 여러 인트라 예측 모드들을 사용하여 현재 블록을 인코딩하고, 테스트된 모드들로부터 사용할 적절한 인트라 예측 모드를 선택할 수도 있다. 예를 들어, 인트라 예측부(122)는 여러 테스트된 인트라 예측 모드들에 대한 레이트 왜곡(rate-distortion) 분석을 사용하여 레이트 왜곡 값들을 계산하고, 테스트된 모드들 중 최선의 레이트 왜곡 특징들을 갖는 인트라 예측 모드를 선택할 수도 있다. The intra predictor 122 predicts pixels in the current block using pixels (reference pixels) located around the current block in the current picture including the current block. A plurality of intra prediction modes exist according to prediction directions, and neighboring pixels to be used and operation expressions are defined differently according to each prediction mode. In particular, the intra prediction unit 122 may determine an intra prediction mode to be used for encoding the current block. In some examples, the intra prediction unit 122 may encode the current block using several intra prediction modes and select an appropriate intra prediction mode to use from the tested modes. For example, the intra prediction unit 122 calculates rate-distortion values using rate-distortion analysis for several tested intra-prediction modes, and selects a rate-distortion having the best rate-distortion characteristics among the tested modes. Intra prediction mode can also be selected.

인트라 예측부(122)는 복수의 인트라 예측 모드 중에서 하나의 인트라 예측 모드를 선택하고, 선택된 인트라 예측 모드에 따라 결정되는 주변 픽셀(참조 픽셀)과 연산식을 사용하여 현재 블록을 예측한다. 선택된 인트라 예측 모드에 대한 정보는 부호화부(150)에 의해 부호화되어 영상 복호화 장치로 전달된다.The intra predictor 122 selects one intra prediction mode from among a plurality of intra prediction modes, and predicts a current block using neighboring pixels (reference pixels) determined according to the selected intra prediction mode and an arithmetic expression. Information on the selected intra prediction mode is encoded by the encoder 150 and transmitted to the video decoding apparatus.

감산기(130)는 현재 블록으로부터 인트라 예측부(122) 또는 인터 예측부(124)에 의해 생성된 예측블록을 감산하여 잔차 블록을 생성한다.The subtractor 130 subtracts the prediction block generated by the intra prediction unit 122 or the inter prediction unit 124 from the current block to generate a residual block.

변환부(140)는 공간 영역의 픽셀 값들을 가지는 잔차 블록 내의 잔차 신호를 주파수 도메인의 변환 계수로 변환한다. 변환부(140)는 잔차 블록 내의 잔차 신호들을 현재 블록의 크기를 변환 단위로 사용하여 변환할 수 있으며, 또는 잔차 블록을 더 작은 복수의 서브블록을 분할하고 서브블록 크기의 변환 단위로 잔차 신호들을 변환할 수도 있다. 잔차 블록을 더 작은 서브블록으로 분할하는 방법은 다양하게 존재할 수 있다. 예컨대, 기정의된 동일한 크기의 서브블록으로 분할할 수도 있으며, 또는 잔차 블록을 루트 노드로 하는 QT(quadtree) 방식의 분할을 사용할 수도 있다. The transform unit 140 transforms the residual signal in the residual block having pixel values in the spatial domain into transform coefficients in the frequency domain. The transform unit 140 may transform the residual signals in the residual block by using the size of the current block as a transform unit, or divide the residual block into a plurality of smaller subblocks and transform the residual signals into subblock size transform units. can also be converted. There may be various methods of dividing a residual block into smaller subblocks. For example, it may be divided into subblocks having the same predefined size, or a QT (quadtree) method partitioning using a residual block as a root node may be used.

양자화부(145)는 변환부(140)로부터 출력되는 변환 계수들을 양자화하고, 양자화된 변환 계수들을 부호화부(150)로 출력한다.The quantization unit 145 quantizes transform coefficients output from the transform unit 140 and outputs the quantized transform coefficients to the encoder 150 .

부호화부(150)는 양자화된 변환 계수들을 CABAC 등의 부호화 방식을 사용하여 부호화하여 비트스트림을 생성한다. 이러한 부호화는 통상적으로 복수의 가용 스캔 패턴 중 하나를 이용하여 양자화된 변환 계수에 대해 수행된다.The encoder 150 encodes the quantized transform coefficients using a coding scheme such as CABAC to generate a bitstream. This encoding is typically performed on the quantized transform coefficients using one of a plurality of available scan patterns.

본 개시의 기술들의 일 측면은 일반적으로 변환 및 양자화의 결과물인 양자화된 계수들의 어레이인 계수 블록에 대해 비-제로 계수(즉, 유효 계수)들의 위치를 효율적으로 표현하는 것과 관련된다. 이와 같이, 본 개시의 소정의 기법들은 부호화부(150)에 의해 수행될 수도 있다. 즉, 예를 들어, 부호화부(150)는 아래의 도 6 내지 도 15에 대해 기술된 본 개시의 기법들을 수행할 수도 있다. 다른 예들에서, 부호화 장치의 하나 이상의 다른 유닛들이 추가적으로 본 개시의 기법들을 수행하는 데 관여할 수도 있다. One aspect of the techniques of this disclosure relates to efficiently representing the location of non-zero coefficients (ie, significant coefficients) with respect to a coefficient block, which is generally an array of quantized coefficients that result from transform and quantization. As such, certain techniques of the present disclosure may be performed by the encoder 150 . That is, for example, the encoder 150 may perform the techniques of the present disclosure described with respect to FIGS. 6 to 15 below. In other examples, one or more other units of the encoding device may additionally be involved in performing the techniques of this disclosure.

또한, 부호화부(150)는 블록 분할과 관련된 CTU size, MinQTSize, MaxBTSize, MaxBTDepth, MinBTSize, QT 분할 플래그, BT 분할 플래그, 분할 타입 등의 정보를 부호화하여, 영상 복호화 장치가 영상 부호화 장치와 동일하게 블록을 분할할 수 있도록 한다.In addition, the encoder 150 encodes information such as CTU size, MinQTSize, MaxBTSize, MaxBTDepth, MinBTSize, QT splitting flag, BT splitting flag, and splitting type related to block splitting so that the video decoding apparatus can perform the same operation as the video encoding apparatus. Blocks can be split.

부호화부(150)는 현재 블록이 인트라 예측에 의해 부호화되었는지 아니면 인터 예측에 의해 부호화되었는지 여부를 지시하는 예측 타입에 대한 정보를 부호화하고, 예측 타입에 따라 인트라 예측정보 또는 인터 예측정보를 부호화한다. The encoder 150 encodes prediction type information indicating whether the current block is coded by intra prediction or inter prediction, and encodes intra prediction information or inter prediction information according to the prediction type.

현재블록이 인트라 예측된 경우에는 인트라 예측정보로서 인트라 예측 모드에 대한 신택스 요소(syntax element)를 부호화한다. 현재블록이 인터 예측된 경우, 부호화부(150)는 인터 예측정보에 대한 신택스 요소를 부호화한다. 인터 예측정보에 대한 신택스 요소는 다음을 포함한다.When the current block is intra predicted, a syntax element for an intra prediction mode is encoded as intra prediction information. When the current block is inter-predicted, the encoder 150 encodes syntax elements for inter-prediction information. Syntax elements for inter prediction information include the following.

(1) 현재블록의 움직임정보가 머지 모드로 부호화되는지 아니면 차분 움직임벡터를 부호화하는 모드로 부호화되는지 여부를 지시하는 모드 정보(1) Mode information indicating whether the motion information of the current block is coded in a merge mode or a differential motion vector encoding mode.

(2) 움직임정보에 대한 신택스 요소 (2) Syntax elements for motion information

움직임정보가 머지 모드에 의해 부호화되는 경우, 부호화부(150)는 머지 후보들 중 어느 후보가 현재블록의 움직임정보를 추출하기 위한 후보로서 선택되는지를 지시하는 머지 인덱스 정보를 움직임정보에 대한 신택스 요소로 부호화한다. When motion information is encoded by merge mode, the encoder 150 uses merge index information indicating which of merge candidates is selected as a candidate for extracting motion information of the current block as a syntax element for the motion information. Encode.

반면, 움직임정보가 차분 움직임벡터를 부호화하는 모드에 의해 부호화되는 경우, 차분 움직임벡터에 대한 정보 및 참조 픽처에 대한 정보를 움직임정보에 대한 신택스 요소로 부호화한다. 만약, 예측 움직임벡터가 복수의 예측 움직임벡터 후보들 중 어느 하나의 후보를 선택하는 방식으로 결정되는 경우에는, 움직임정보에 대한 신택스 요소는 그 선택된 후보를 식별하기 위한 예측 움직임벡터 식별 정보를 추가로 더 포함한다.On the other hand, when motion information is coded using a differential motion vector encoding mode, information on differential motion vectors and information on reference pictures are encoded as syntax elements for motion information. If the predicted motion vector is determined by selecting one of a plurality of predicted motion vector candidates, the syntax element for the motion information further includes predictive motion vector identification information for identifying the selected candidate. include

역양자화부(160)는 양자화부(145)로부터 출력되는 양자화된 변환 계수들을 역양자화하여 변환 계수들을 생성한다. 역변환부(165)는 역양자화부(160)로부터 출력되는 변환 계수들을 주파수 도메인으로부터 공간 도메인으로 변환하여 잔차블록을 복원한다.The inverse quantization unit 160 inversely quantizes the quantized transform coefficients output from the quantization unit 145 to generate transform coefficients. The inverse transform unit 165 transforms transform coefficients output from the inverse quantization unit 160 from a frequency domain to a spatial domain to restore a residual block.

가산부(170)는 복원된 잔차블록과 예측부(120)에 의해 생성된 예측블록을 가산하여 현재 블록을 복원한다. 복원된 현재 블록 내의 픽셀들은 다음 순서의 블록을 인트라 예측할 때 참조 픽셀로서 사용된다.The adder 170 restores the current block by adding the restored residual block and the predicted block generated by the predictor 120. Pixels in the reconstructed current block are used as reference pixels when intra-predicting the next block.

필터부(180)는 블록 단위의 부호화/복호화로 인해 발생하는 블록킹 현상(blocking artifact)을 제거하기 위해 복원된 블록 간의 경계를 디블록킹 필터링하고 메모리(190)에 저장한다. 한 픽처 내의 모든 블록들이 복원되면, 복원된 픽처는 이후에 부호화하고자 하는 픽처 내의 블록을 인터 예측하기 위한 참조 픽처로 사용된다.The filter unit 180 performs deblocking filtering on boundaries between reconstructed blocks in order to remove blocking artifacts generated due to block-by-block encoding/decoding and stores them in the memory 190. When all blocks in one picture are reconstructed, the reconstructed picture is used as a reference picture for inter-prediction of blocks in the picture to be encoded later.

이하에서는 영상 복호화 장치에 대해 설명한다.Hereinafter, a video decoding apparatus will be described.

도 3은 본 개시의 기술들을 구현할 수 있는 영상 복호화 장치의 예시적인 블록도이다.3 is an exemplary block diagram of a video decoding apparatus capable of implementing the techniques of this disclosure.

영상 복호화 장치는 복호화부(310), 역양자화부(320), 역변환부(330), 예측부(340), 가산기(350), 필터부(360) 및 메모리(370)를 포함한다. 도 2의 영상 부호화 장치와 마찬가지로, 영상 복호화 장치는 각 구성요소가 하드웨어 칩으로 구현될 수 있으며, 또는 소프트웨어로 구현되고 마이크로프로세서가 각 구성요소에 대응하는 소프트웨어의 기능을 실행하도록 구현될 수도 있다.The image decoding apparatus includes a decoding unit 310, an inverse quantization unit 320, an inverse transform unit 330, a prediction unit 340, an adder 350, a filter unit 360, and a memory 370. Like the video encoding apparatus of FIG. 2 , each component of the video decoding apparatus may be implemented as a hardware chip or implemented as software and a microprocessor may execute software functions corresponding to each component.

복호화부(310)는 영상 부호화 장치로부터 수신한 비트스트림을 복호화하여 블록 분할과 관련된 정보를 추출하여 복호화하고자 하는 현재 블록을 결정하고, 현재 블록을 복원하기 위해 필요한 예측 정보와 잔차신호에 대한 정보 등을 추출한다.The decoder 310 decodes the bitstream received from the video encoding device, extracts information related to block division, determines a current block to be decoded, predictive information required to reconstruct the current block, and information on a residual signal, etc. extract

복호화부(310)는 SPS (Sequence Parameter Set) 또는 PPS (Picture Parameter Set)로부터 CTU size에 대한 정보를 추출하여 CTU의 크기를 결정하고, 픽처를 결정된 크기의 CTU로 분할한다. 그리고 CTU를 트리 구조의 최상위 레이어, 즉, 루트 노드로 결정하고, CTU에 대한 분할 정보를 추출함으로써 CTU를 트리 구조를 이용하여 분할한다. 예컨대, QTBT 구조를 사용하여 CTU를 분할하는 경우, 먼저 QT의 분할과 관련된 제1 플래그(QT_split_flag)를 추출하여 각 노드를 하위 레이어의 네 개의 노드로 분할한다. 그리고, QT의 리프 노드에 해당하는 노드에 대해서는 BT의 분할과 관련된 제2 플래그(BT_split_flag) 및 분할 타입 정보를 추출하여 해당 리프 노드를 BT 구조로 분할한다.The decoder 310 determines the size of the CTU by extracting information about the CTU size from a Sequence Parameter Set (SPS) or a Picture Parameter Set (PPS), and divides the picture into CTUs of the determined size. In addition, the CTU is divided using the tree structure by determining the CTU as the top layer of the tree structure, that is, the root node, and extracting division information for the CTU. For example, when splitting a CTU using a QTBT structure, first, a first flag (QT_split_flag) related to splitting of QT is extracted and each node is split into four nodes of a lower layer. Then, for a node corresponding to a leaf node of QT, a second flag (BT_split_flag) related to BT splitting and split type information are extracted to split the corresponding leaf node into a BT structure.

한편, 복호화부(310)는 트리 구조의 분할을 통해 복호화하고자 하는 현재 블록을 결정하게 되면, 현재 블록이 인트라 예측되었는지 아니면 인터 예측되었는지를 지시하는 예측 타입에 대한 정보를 추출한다. Meanwhile, when the decoder 310 determines a current block to be decoded through partitioning of a tree structure, it extracts information about a prediction type indicating whether the current block is intra-predicted or inter-predicted.

예측 타입 정보가 인트라 예측을 지시하는 경우, 복호화부(310)는 현재블록의 인트라 예측정보(인트라 예측 모드)에 대한 신택스 요소를 추출한다. When the prediction type information indicates intra prediction, the decoder 310 extracts syntax elements for intra prediction information (intra prediction mode) of the current block.

예측 타입 정보가 인터 예측을 지시하는 경우, 복호화부(310)는 인터 예측정보에 대한 신택스 요소를 추출한다. 먼저, 현재블록의 움직임정보가 복수의 부호화 모드 중 어느 모드에 의해 부호화되었는지 여부를 지시하는 모드 정보를 추출한다. 여기서, 복수의 부호화 모드는 스킵모드를 포함한 머지 모드 및 차분 움직임벡터 부호화 모드를 포함한다. 모드 정보가 머지 모드를 지시하는 경우, 복호화부(310)는 머지 후보들 중 어느 후보로부터 현재블록의 움직임벡터를 유도할지 여부를 지시하는 머지 인덱스 정보를 움직임정보에 대한 신택스 요소로서 추출한다. 반면, 모드 정보가 차분 움직임벡터 부호화 모드를 지시하는 경우, 복호화부(310)는 차분 움직임벡터에 대한 정보 및 현재블록의 움직임벡터가 참조하는 참조 픽처에 대한 정보를 움직임벡터에 대한 신택스 요소로서 추출한다. 한편, 영상 부호화 장치가 복수의 예측 움직임벡터 후보들 중에서 어느 하나의 후보를 현재블록의 예측 움직임벡터로 사용한 경우에는 예측 움직임벡터 식별정보가 비트스트림에 포함된다. 따라서 이 경우에는, 차분 움직임벡터에 대한 정보와 참조 픽처에 대한 정보뿐만 아니라 예측 움직임벡터 식별정보도 움직임벡터에 대한 신택스 요소로서 추출한다.When the prediction type information indicates inter prediction, the decoder 310 extracts syntax elements for the inter prediction information. First, mode information indicating whether the motion information of the current block has been encoded by which mode among a plurality of encoding modes is extracted. Here, the plurality of encoding modes include a merge mode including a skip mode and a differential motion vector encoding mode. When the mode information indicates the merge mode, the decoder 310 extracts, as a syntax element for the motion information, merge index information indicating which of merge candidates to derive the motion vector of the current block from. On the other hand, when the mode information indicates the differential motion vector encoding mode, the decoding unit 310 extracts information on the differential motion vector and information on a reference picture referred to by the motion vector of the current block as syntax elements for the motion vector. do. Meanwhile, when an image encoding apparatus uses one of a plurality of predictive motion vector candidates as a predictive motion vector of a current block, predictive motion vector identification information is included in the bitstream. Therefore, in this case, not only information on the differential motion vector and information on the reference picture, but also prediction motion vector identification information is extracted as a syntax element for the motion vector.

한편, 복호화부(310)는 잔차신호에 대한 정보로서 현재 블록의 양자화된 변환계수들에 대한 정보를 추출한다. 본 개시의 기술들의 다른 측면은, 일반적으로, 변환 및 양자화의 결과물인 양자화된 계수들의 어레이인 계수 블록에 대해 비-제로 계수(즉, 유효 계수)들의 위치를 효율적으로 복호화하는 것과 관련된다. 따라서, 본 개시의 소정의 기법들은 복호화부(310)에 의해 수행될 수도 있다. 즉, 예를 들어, 복호화부(310)는 아래의 도 4 내지 도 15에 대해 기술된 본 개시의 기법들을 수행할 수도 있다. 다른 예들에서, 부호화 장치의 하나 이상의 다른 유닛들이 추가적으로 본 개시의 기법들을 수행하는 데 관여할 수도 있다. Meanwhile, the decoder 310 extracts information on quantized transform coefficients of the current block as information on the residual signal. Another aspect of the techniques of this disclosure relates generally to efficiently decoding the location of non-zero coefficients (ie, significant coefficients) relative to a coefficient block, which is an array of quantized coefficients that result from transform and quantization. Accordingly, certain techniques of this disclosure may be performed by decryption unit 310 . That is, for example, the decoder 310 may perform the techniques of the present disclosure described with respect to FIGS. 4 to 15 below. In other examples, one or more other units of the encoding device may additionally be involved in performing the techniques of this disclosure.

역양자화부(320)는 양자화된 변환계수들을 역양자화하고 역변환부(330)는 역양자화된 변환계수들을 주파수 도메인으로부터 공간 도메인으로 역변환하여 잔차신호들을 복원함으로써 현재 블록에 대한 잔차블록을 생성한다.The inverse quantization unit 320 inversely quantizes the quantized transform coefficients, and the inverse transform unit 330 inversely transforms the inverse quantized transform coefficients from the frequency domain to the spatial domain to restore residual signals, thereby generating a residual block for the current block.

예측부(340)는 인트라 예측부(342) 및 인터 예측부(344)를 포함한다. 인트라 예측부(342)는 현재 블록의 예측 타입인 인트라 예측일 때 활성화되고, 인터 예측부(344)는 현재 블록의 예측 타입인 인트라 예측일 때 활성화된다.The prediction unit 340 includes an intra prediction unit 342 and an inter prediction unit 344 . The intra predictor 342 is activated when intra prediction is the prediction type of the current block, and the inter predictor 344 is activated when intra prediction is the prediction type of the current block.

인트라 예측부(342)는 복호화부(310)로부터 추출된 인트라 예측 모드에 대한 신택스 요소로부터 복수의 인트라 예측 모드 중 현재 블록의 인트라 예측 모드를 결정하고, 인트라 예측 모드에 따라 현재 블록 주변의 참조 픽셀들을 이용하여 현재 블록을 예측한다. The intra-prediction unit 342 determines an intra-prediction mode of the current block from among a plurality of intra-prediction modes from the syntax element for the intra-prediction mode extracted from the decoder 310, and reference pixels around the current block according to the intra-prediction mode. Predict the current block using .

인터 예측부(344)는 복호화부(310)로부터 추출된 인트라 예측 모드에 대한 신택스 요소를 이용하여 현재 블록의 움직임정보를 결정하고, 결정된 움직임정보를 이용하여 현재 블록을 예측한다.The inter predictor 344 determines motion information of the current block using the syntax element for the intra prediction mode extracted from the decoder 310 and predicts the current block using the determined motion information.

먼저, 인터 예측부(344)는 복호화부(310)로부터 추출된 인터 예측에서의 모드 정보를 확인한다. 모드 정보가 머지 모드를 지시하는 경우, 인터 예측부(344)는 현재블록의 주변블록을 이용하여 기 결정된 개수의 머지 후보를 포함하는 머지 리스트를 구성한다. 인터 예측부(344)가 머지 리스트를 구성하는 방법은 영상 부호화 장치의 인터 예측부(124)와 동일하다. 그리고, 복호화부(310)으로부터 전달된 머지 인덱스 정보를 이용하여 머지 리스트 내의 머지 후보들 중에서 하나의 머지 후보를 선택한다. 그리고 선택된 머지 후보의 움직임정보, 즉, 머지 후보의 움직임벡터와 참조 픽처를 현재블록의 움직임벡터와 참조픽처로 설정한다. First, the inter prediction unit 344 checks mode information in the inter prediction extracted from the decoding unit 310. When the mode information indicates a merge mode, the inter predictor 344 constructs a merge list including a predetermined number of merge candidates using neighboring blocks of the current block. The inter predictor 344 configures the merge list in the same way as the inter predictor 124 of the video encoding apparatus. Then, one merge candidate is selected from among merge candidates in the merge list using the merge index information transmitted from the decoder 310 . Then, the motion information of the selected merge candidate, that is, the motion vector and reference picture of the merge candidate are set as the motion vector and reference picture of the current block.

반면, 모드 정보가 차분 움직임벡터 부호화 모드를 지시하는 경우, 인터 예측부(344)는 현재블록의 주변블록들의 움직임벡터를 이용하여 예측 움직임벡터 후보들을 유도하고, 예측 움직임벡터 후보들을 이용하여 현재블록의 움직임벡터에 대한 예측 움직임벡터를 결정한다. 인터 예측부(344)가 예측 움직임벡터 후보들을 유도하는 방법은 영상 부호화 장치의 인터 예측부(124)와 동일하다. 만약, 영상 부호화 장치가 복수의 예측 움직임벡터 후보들 중에서 어느 하나의 후보를 현재블록의 예측 움직임벡터로 사용한 경우에는 움직임정보에 대한 신택스 요소는 예측 움직임벡터 식별정보를 포함한다. 따라서, 이 경우에, 인터 예측부(344)는 예측 움직임벡터 후보들 중 예측 움직임벡터 식별정보에 의해 지시되는 후보를 예측 움직임벡터로 선택할 수 있다. 그러나, 영상 부호화 장치가 복수의 예측 움직임벡터 후보들에 기 정의된 함수를 사용하여 예측 움직임벡터를 결정한 경우에는, 인터 예측부는 영상 부호화 장치와 동일한 함수를 적용하여 예측 움직임벡터를 결정할 수도 있다. 현재블록의 예측 움직임벡터가 결정되면, 인터 예측부(344)는 예측 움직임벡터와 복호화부(310)로부터 전달된 차분 움직임벡터를 가산하여 현재블록의 움직임벡터를 결정한다. 그리고 복호화부(310)로부터 전달된 참조픽처에 대한 정보를 이용하여 현재블록의 움직임벡터가 참조하는 참조픽처를 결정한다.On the other hand, when the mode information indicates the differential motion vector encoding mode, the inter predictor 344 derives motion vector prediction candidates using the motion vectors of neighboring blocks of the current block, and uses the motion vectors of the prediction motion vector candidates for the current block. Determines a predicted motion vector for the motion vector of . A method for the inter predictor 344 to derive predictive motion vector candidates is the same as that of the inter predictor 124 of the video encoding apparatus. If an image encoding apparatus uses one of a plurality of predictive motion vector candidates as a predictive motion vector of a current block, a syntax element for motion information includes predictive motion vector identification information. Therefore, in this case, the inter predictor 344 may select a candidate indicated by the predicted motion vector identification information among the predicted motion vector candidates as the predicted motion vector. However, when the video encoding apparatus determines the predicted motion vector by using a function predefined for a plurality of predicted motion vector candidates, the inter predictor may determine the predicted motion vector by applying the same function as that of the video encoding apparatus. When the predicted motion vector of the current block is determined, the inter predictor 344 determines the motion vector of the current block by adding the predicted motion vector and the differential motion vector transmitted from the decoder 310 . Then, a reference picture referred to by the motion vector of the current block is determined using the information on the reference picture transmitted from the decoder 310 .

머지 모드 또는 차분 움직임벡터 부호화 모드에서 현재블록의 움직임벡터와 참조픽처가 결정되면, 인터 예측부(342)는 참조픽처 내에서 움직임벡터가 지시하는 위치의 블록을 이용하여 현재블록의 예측블록을 생성한다.When the motion vector of the current block and the reference picture are determined in merge mode or differential motion vector coding mode, the inter-prediction unit 342 generates a prediction block of the current block using the block at the position indicated by the motion vector in the reference picture. do.

가산기(350)는 역변환부로부터 출력되는 잔차블록과 인터 예측부 또는 인트라 예측부로부터 출력되는 예측블록을 가산하여 현재 블록을 복원한다. 복원된 현재 블록 내의 픽셀들은 이후에 복호화할 블록을 인트라 예측할 때의 참조픽셀로서 활용된다.The adder 350 restores the current block by adding the residual block output from the inverse transform unit and the prediction block output from the inter prediction unit or intra prediction unit. Pixels in the reconstructed current block are used as reference pixels when intra-predicting a block to be decoded later.

필터부(360)는 블록 단위의 복호화로 인해 발생하는 블록킹 현상(blocking artifact)를 제거하기 위해 복원된 블록 간의 경계를 디블록킹 필터링하고 메모리(370)에 저장한다. 한 픽처 내의 모든 블록들이 복원되면, 복원된 픽처는 이후에 복호화하고자 하는 픽처 내의 블록을 인터 예측하기 위한 참조 픽처로 사용된다.The filter unit 360 performs deblocking filtering on boundaries between reconstructed blocks and stores them in the memory 370 in order to remove blocking artifacts generated by block-by-block decoding. When all blocks in one picture are reconstructed, the reconstructed picture is used as a reference picture for inter-prediction of blocks in the picture to be decoded later.

전술한 바와 같이, 본 개시의 기법들은 변환 및 양자화의 결과물인 양자화된 계수들의 어레이인 계수 블록에 대해 비-제로 계수(즉, 유효 계수)들의 위치를 효율적으로 부호화하고 복호화하는 것과 관련되어 있다. 개시된 일부 기법들은 변환을 거치지 않은 잔차 블록에 대해 직접 적용될 수 있다.As noted above, the techniques of this disclosure are concerned with efficiently encoding and decoding the location of non-zero coefficients (i.e., significant coefficients) relative to a coefficient block, which is an array of quantized coefficients that result from transform and quantization. Some of the disclosed techniques can be applied directly to a residual block that has not undergone transformation.

H.265(HEVC) 표준에서는, 양자화 과정을 거친 양자화된 계수(quantized coefficient)들 중 비-제로(non-zero) 계수의 위치를 표현하기 위한 신택스(syntax)는 총 6개로 구성된다.In the H.265 (HEVC) standard, a total of six syntaxes are used to express the position of a non-zero coefficient among quantized coefficients that have undergone a quantization process.

1. last_sig_coeff_x_prefix 1. last_sig_coeff_x_prefix

2. last_sig_coeff_y_prefix 2. last_sig_coeff_y_prefix

3. last_sig_coeff_x_suffix 3. last_sig_coeff_x_suffix

4. last_sig_coeff_y_suffix 4. last_sig_coeff_y_suffix

5. coded_sub_block_flag 5. coded_sub_block_flag

6. sig_coeff_flag 6. sig_coeff_flag

앞의 네 개의 신택스는 계수 블록 내 스캔 순서상 가장 마지막 유효(비-제로) 계수(last significant coefficient)의 위치에 관한 것으로, 해당 위치에 대한 x 성분과 y 성분을 각각 별도로 표시하고, 각 성분은 접두어(prefix) 및 접미어(suffix)로 나누어서 표현된다. coded_sub_block_flag는 계수 블록을 복수의 서브 블록으로 분할하여 각 서브 블록이 비-제로 계수를 하나 이상 포함하고 있는지 여부를 가리키는 플래그이다. 여기서, coded_sub_block_flag는 해당 서브 블록 내 모든 계수들이 제로이면 "0", 하나 이상의 비-제로 계수가 존재하면 "1"로 표시된다. sig _ coeff _flag는 하나의 서브 블록 내 각 계수가 비-제로인지 제로인지를 가리키는 플래그이다. sig _ coeff _flag는 제로 계수의 경우 "0"으로 표시되고, 비-제로 계수이면 "1"로 표시된다. The previous four syntaxes relate to the position of the last significant coefficient in the scan order in the coefficient block, and separately indicate the x component and y component for that position, respectively, and each component is It is expressed by dividing it into a prefix and a suffix. coded_sub_block_flag is a flag indicating whether each sub-block includes one or more non-zero coefficients by dividing a coefficient block into a plurality of sub-blocks. Here, coded_sub_block_flag is displayed as “0” if all coefficients in the corresponding sub block are zero, and as “1” if one or more non-zero coefficients exist. sig_coeff_flag is a flag indicating whether each coefficient in one subblock is non-zero or zero. sig_coeff_flag is displayed as "0" for zero coefficients and as "1" for non-zero coefficients .

계수 블록에 대한 스캔 순서 상 마지막 유효 계수(Last significant coefficient)가 존재하는 마지막 유효 서브 블록(Last significant sub-block)에 선행하는 서브 블록에 한해서 coded_sub_block_flag 신택스가 시그널링된다. coded_sub_block_flag가 "1"인 경우에, 해당 서브 블록 내 모든 계수들 각각에 대한 sig _ coeff _flag 신택스가 시그널링된다. The coded_sub_block_flag syntax is signaled only in the sub-block preceding the last significant sub-block in which the last significant coefficient exists in the scan order for the coefficient block. When coded_sub_block_flag is “1”, sig _ coeff _flag syntax for each of all coefficients in the sub-block is signaled.

계수 블록내의 계수들의 부호화는 통상적으로 복수의 가용한 스캔 방식 중 하나를 이용하여 수행된다. 도 4는 정사각형 계수 블록의 양자화된 계수들의 부호화에 이용되는 예시적인 스캔 방식들을 나타낸 도면이다. 이들 스캔 방식은 up-right diagonal 방식, horizontal 방식, vertical 방식을 포함한다. 부호화 대상 블록이 화면간 예측 방식을 사용하여 부호화하는 경우, 해당 블록의 계수들은 up-right diagonal 방식으로 스캐닝되고, 부호화 대상 블록이 화면내 예측 방식으로 부호화한 경우는 화면내 예측 모드에 따라 상기 세 가지 형태 중 하나를 선택하여 해당 블록의 계수들을 스캐닝하게 된다. Coding of the coefficients within a coefficient block is typically performed using one of a plurality of available scan schemes. 4 is a diagram illustrating exemplary scan schemes used for encoding quantized coefficients of a square coefficient block. These scan methods include an up-right diagonal method, a horizontal method, and a vertical method. When the encoding target block is encoded using the inter-prediction method, the coefficients of the block are scanned in an up-right diagonal method, and when the encoding target block is encoded using the intra-prediction method, the coefficients of the block are scanned according to the intra-prediction mode. By selecting one of the types, the coefficients of the corresponding block are scanned.

예시된 스캔 방식은 계수 블록 내 서브 블록들 및 각 서브 블록 내 계수들에 대해서 동일한 형태의 스캔 패턴을 보인다. 예를 들어, horizontal 스캔 방식의 경우, 서브 블록들의 스캔 순서도 horizontal 방식이고, 각 서브 블록 내 계수들의 스캔 순서도 horizontal 방식이다. 도 5는 diagonal 스캔 방식에 대한 더 상세한 서브 블록 및 계수들의 스캔 순서를 보인다. 다만, 실제 비트스트림에 저장되는 순서는 스캔 순서의 역순으로 저장이 된다. 즉, 도 5의 255번 위치의 화소부터 0번 위치의 화소 순으로 비트스트림에 저장된다. 다시 말해, 이하에서 더 설명되는 바와 같이, 일단 마지막 유효 (비제로) 계수 위치가 알려지고 나면, 계수들은 스캔 패턴 내의 첫 번째 계수쪽으로 스캔 순서의 반대 방향으로 복호화될 수 있다.The illustrated scan scheme shows the same type of scan pattern for subblocks in a coefficient block and coefficients in each subblock. For example, in the case of the horizontal scan method, the scan order of subblocks is also the horizontal method, and the scan order of coefficients in each subblock is also the horizontal method. 5 shows a more detailed scan sequence of subblocks and coefficients for the diagonal scan method. However, the order of being stored in the actual bitstream is stored in the reverse order of the scan order. That is, they are stored in the bitstream in the order from the pixel at position 255 in FIG. 5 to the pixel at position 0. In other words, once the location of the last valid (non-zero) coefficient is known, the coefficients can be decoded in the opposite direction of the scan order towards the first coefficient in the scan pattern, as described further below.

도 6은 32×32 계수 블록의 일례를 도시한 도면이다. 여기서, 해당 32×32 계수 블록의 스캔 방식은 diagonal 방식을 사용하였고, 회색으로 마킹된 픽셀이 비-제로 계수를, 검정색으로 마킹된 픽셀은 스캔 순서상 마지막 비-제로 계수를 뜻하며, 그 이외의 흰색 계수들은 모두 제로 값을 가지게 된다. 비-제로 계수의 위치를 표시하는 전술한 3개의 정보들 중 coded_sub_block_flag 및 sig _ coeff _flag 신택스를 도 6에 예시된 블록에 적용하면, 이들 2개의 신택스를 표시하기 위해 필요한 비트수는 다음과 같다. coded_sub_block_flag는 마지막 유효 계수의 위치를 고려하여, 총 64개 서브 블록 중 30개의 서브 블록들(도 6에서 굵게 표시된 서브 블록들)에 대한 30비트가 필요하다. DC 성분이 포함된 좌상귀 서브 블록에 대해서는 coded_sub_block_flag가 시그널링되지 않는다. 도 6의 계수 블록은 하나 이상의 비-제로 계수를 포함하고 있는 서브 블록(즉, 유효 서브 블록)이 7개이며, 따라서 7개의 유효 서브 블록에 대한 sig _ coeff _flag를 표시하기 위해, 대략 112 비트(= 7×16; 마지막 유효 서브블록에 대해서는 16 비트보다 적은 비트가 필요할 수 있다)가 필요하다. 결과적으로, 도 6의 계수 블록에 대해 상기 두 개의 신택스를 표현하는 데 필요한 비트수는 대략 142 비트(= 30 + 112)이다.6 is a diagram showing an example of a 32×32 coefficient block. Here, the diagonal method was used for the scanning method of the 32×32 coefficient block, and pixels marked in gray represent non-zero coefficients, pixels marked in black represent the last non-zero coefficient in the scan order, and other The white coefficients all have zero values. If the coded_sub_block_flag and sig_coeff_flag syntaxes among the above three pieces of information indicating the location of non - zero coefficients are applied to the block illustrated in FIG . 6, the number of bits required to indicate these two syntaxes is as follows. coded_sub_block_flag requires 30 bits for 30 sub-blocks (sub-blocks marked in bold in FIG. 6) out of a total of 64 sub-blocks, considering the position of the last significant coefficient. Coded_sub_block_flag is not signaled for an upper-left sub-block including a DC component. The coefficient block of FIG . 6 has 7 subblocks (i.e., valid subblocks) containing one or more non-zero coefficients , so to indicate sig_coeff_flag for the 7 valid subblocks, approximately 112 bits (= 7×16; less than 16 bits may be required for the last valid subblock). As a result, the number of bits required to express the two syntaxes for the coefficient block of FIG. 6 is approximately 142 bits (= 30 + 112).

이상에서 설명한 바와 같이, 플래그 비트들의 오버헤드를 감소시키기 위해, 마지막 유효 계수 위치가 이용된다. 문제는, 현재 논의되고 있는 변환 블록의 사이즈가 64×64, 128×128 정도로 클 수 있기 때문에, 마지막 유효 계수의 위치를 x축 성분과 y축 성분으로 표현하는 신택스의 부호화에 상당한 비트수가 필요하다는 것이다. As described above, to reduce the overhead of flag bits, the last significant coefficient position is used. The problem is that since the size of the transform block currently being discussed can be as large as 64 × 64 or 128 × 128, a considerable number of bits are required for encoding the syntax expressing the position of the last significant coefficient as an x-axis component and a y-axis component. will be.

더 적은 비트를 요구하면서 유사한 기능을 제공하기 위해, 본 발명은 부호화 대상인 계수 블록 내에서 마지막 유효 서브 블록(last significant sub-block)의 위치를 가리키는 정보를 시그널링하는 방식을 제안한다. 여기서, 마지막 유효 서브 블록은 적어도 하나의 비-제로 계수를 갖는 서브 블록 스캔 순서상 마지막 서브 블록이다. 본 발명이 제안하는 마지막 유효 서브 블록의 위치를 효율적으로 표현하는 방식은 총 4가지이다.In order to provide a similar function while requiring fewer bits, the present invention proposes a method of signaling information indicating the location of a last significant sub-block within a coefficient block to be encoded. Here, the last valid subblock is the last subblock in the subblock scan order having at least one non-zero coefficient. There are a total of four methods for efficiently expressing the position of the last valid subblock proposed by the present invention.

(1) 마지막 유효 서브 블록의 계수 블록 내에서의 X축 인덱스 및 Y축 인덱스(1) X-axis index and Y-axis index within the coefficient block of the last valid subblock

(2) 스캔 순서상 마지막 유효 서브 블록에 선행하는 서브 블록들의 개수(2) The number of subblocks preceding the last valid subblock in scan order

(3) 스캔 순서상 마지막 유효 서브 블록에 후행하는 서브 블록들의 개수(3) The number of subblocks following the last valid subblock in scan order

(4) 계수 블록 내에서 마지막 유효 서브 블록의 위치에 따라 (2) 방식 또는 (3) 방식 중 유리한 방식을 결정하고, 결정된 방식을 지시하는 플래그와 결정된 방식에 따른 개수 정보를 시그널링(4) Determines an advantageous method among (2) method or (3) method according to the position of the last valid subblock in the coefficient block, and signals a flag indicating the determined method and number information according to the determined method.

예를 들어, 도 6에 예시된 32×32 계수 블록에 대해, 마지막 유효 서브 블록의 위치를 상기 제안하는 4가지 방식으로 각각 표현하면, 다음과 같다. For example, for the 32×32 coefficient block illustrated in FIG. 6, the positions of the last effective subblocks are expressed in the four methods proposed above as follows.

(1) (3, 4) - 도 7 또는 도 8을 참조하면, 계수 블록 내에서 스캔 순서상 마지막 유효 서브 블록은 좌상귀 서브블록으로부터 수평 및 수직 위치의 관점에서 X축 인덱스 3과 Y축 인덱스 4를 가진다. (1) (3, 4) - Referring to FIG. 7 or 8, the last effective subblock in the scan order within the coefficient block is X axis index 3 and Y axis index 4 in terms of horizontal and vertical positions from the upper left subblock. have

(2) 31 (diagonal 스캔 방식을 사용하는 경우) - 도 7을 참조하면, 스캔 순서상 마지막 유효 서브 블록에 선행하는 서브 블록들의 개수는 31개이다.(2) 31 (when using the diagonal scan method) - Referring to FIG. 7, the number of subblocks preceding the last effective subblock in the scan order is 31.

(3) 32 (diagonal 스캔 방식을 사용하는 경우) - 도 8을 참조하면, 스캔 순서상 마지막 유효 서브 블록에 후행하는 서브 블록들의 개수는 32개이다.(3) 32 (when using the diagonal scan method) - Referring to FIG. 8, the number of subblocks following the last effective subblock in the scan order is 32.

(4) 0 31 또는 1 32 - 여기서, 밑줄 표시된 "0", "1"은 (2) 및 (3) 중 어느 방식이 사용되는지를 지시하는 플래그 값이다.(4) 0 31 or 1 32 - Here, the underlined “ 0 ” and “ 1 ” are flag values indicating which method (2) or (3) is used.

이와 같이 마지막 유효 서브 블록의 위치를 시그널링하는 방식을 기초로, 영상 부호화 장치가 계수 블록을 부호화하는 방법과 영상 복호화 장치가 계수 블록을 복호화하는 전체적인 프로세스를 도 9 및 도 10를 참조하여 각각 설명한다. Based on the method of signaling the position of the last valid subblock as described above, a method of encoding a coefficient block by an image encoding apparatus and an overall process of decoding a coefficient block by an image decoding apparatus will be described with reference to FIGS. 9 and 10, respectively. .

도 9는 본 발명의 일 실시예에 따른 영상 부호화 장치가 계수 블록을 부호화하는 방법을 도시한 흐름도이다.9 is a flowchart illustrating a method of encoding a coefficient block by an image encoding apparatus according to an embodiment of the present invention.

먼저, 영상 부호화 장치는 부호화하고자 하는 현재의 계수 블록에서 마지막 유효 서브 블록(significant sub-block)을 결정한다(S910). 여기서 마지막 유효 서브 블록은 적어도 하나의 비-제로(non-zero) 계수를 갖는 서브 블록 스캔 순서상 마지막 서브 블록이다.First, the video encoding apparatus determines the last significant sub-block in the current coefficient block to be encoded (S910). Here, the last valid subblock is the last subblock in the subblock scan order having at least one non-zero coefficient.

영상 부호화 장치는 결정된 마지막 유효 서브 블록의 위치를 가리키는 정보를 부호화한다(S920). 일부 실시예에서, 마지막 유효 서브 블록의 위치를 가리키는 정보는 (1) 마지막 유효 서브 블록의 계수 블록 내에서의 X축 인덱스 및 Y축 인덱스 값일 수 있다. 다른 일부 실시예에서, 마지막 유효 서브 블록의 위치를 가리키는 정보는 스캔 순서상 마지막 유효 서브 블록에 선행하는 서브 블록들의 개수 정보일 수 있다. 또 다른 일부 실시예에서, 마지막 유효 서브 블록의 위치를 가리키는 정보는 스캔 순서상 마지막 유효 서브 블록에 후행하는 서브 블록들의 개수 정보일 수 있다. 또 다른 일부 실시예에서, 마지막 유효 서브 블록의 위치를 가리키는 정보는 스캔 순서상 마지막 유효 서브 블록에 선행하는 서브 블록들의 개수 정보 혹은 후행하는 서브 블록들의 개수 정보 중 선택된 하나의 정보와, 어느 정보가 선택되었는지 여부를 나타내는 플래그 값을 포함할 수 있다.The video encoding apparatus encodes information indicating the position of the determined last valid subblock (S920). In some embodiments, the information indicating the position of the last valid sub-block may be (1) an X-axis index and a Y-axis index value within a coefficient block of the last valid sub-block. In some other embodiments, the information indicating the position of the last valid sub-block may be information on the number of sub-blocks preceding the last valid sub-block in the scan order. In some other embodiments, the information indicating the location of the last valid subblock may be information on the number of subblocks following the last valid subblock in the scan order. In some other embodiments, the information indicating the location of the last valid sub-block is information selected from among information on the number of sub-blocks preceding the last valid sub-block in the scan order and information on the number of sub-blocks following the last valid sub-block, and which information is It may contain a flag value indicating whether it has been selected.

영상 부호화 장치는 상기 서브 블록 스캔 순서상 상기 마지막 유효 서브 블록에 선행하는 서브 블록들 중 상기 계수 블록의 좌상귀 서브 블록을 제외한 각각의 서브 블록이 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록인지 여부를 가리키는 각각의 서브 블록에 대한 신택스 요소를 부호화한다(S930). 이때, 상기 계수 블록의 좌상귀 서브 블록 및 상기 마지막 유효 서브 블록을 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록으로 설정할 수 있다.The image encoding apparatus determines whether each of the subblocks preceding the last valid subblock in the subblock scan order, excluding the upper left subblock of the coefficient block, is a valid subblock having at least one non-zero coefficient. Syntax elements for each sub-block pointing to are encoded (S930). In this case, the upper left subblock of the coefficient block and the last valid subblock may be set as valid subblocks having at least one non-zero coefficient.

영상 부호화 장치는 상기 계수 블록의 좌상귀 서브 블록의 계수들과, 상기 마지막 유효 서브 블록의 계수들과, 상기 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록임을 가리키는 서브 블록들의 계수들과 유효 서브 블록으로 설정된 서브 블록들의 계수들을 부호화한다(S940). 이들 계수를 부호화하는 과정은 해당 서브 블록들 내의 각 계수가 비-제로인지 제로인지를 가리키는 플래그를 부호화하는 과정을 포함할 수 있다. 영상 부호화 장치는 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록이 아니라고 가리키는 서브 블록들의 계수들의 부호화를 스킵한다. 또한, 영상 부호화 장치는 서브 블록 스캔 순서상 마지막 유효 서브 블록에 후행하는 서브 블록들의 계수들의 부호화를 스킵한다.The image encoding apparatus includes coefficients of the upper left subblock of the coefficient block, coefficients of the last valid subblock, coefficients of subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient, Coefficients of subblocks set as valid subblocks are encoded (S940). The process of encoding these coefficients may include a process of encoding a flag indicating whether each coefficient in the corresponding sub-blocks is non-zero or zero. The image encoding apparatus skips coding of coefficients of subblocks indicated by the syntax element not being a valid subblock having at least one non-zero coefficient. Also, the image encoding apparatus skips encoding of coefficients of subblocks following the last effective subblock in the subblock scanning order.

도 10은 본 발명의 일 실시예에 따른 영상 복호화 장치가 계수 블록을 부호화하는 방법을 도시한 흐름도이다.10 is a flowchart illustrating a method of encoding a coefficient block by an image decoding apparatus according to an embodiment of the present invention.

영상 복호화 장치는, 부호화된 비트스트림으로부터, 복호화하고자 하는 현재의 계수 블록에서 마지막 유효 서브 블록의 위치를 가리키는 정보를 복호화한다(S1010). The video decoding apparatus decodes information indicating the position of the last valid subblock in the current coefficient block to be decoded from the encoded bitstream (S1010).

영상 복호화 장치는, 부호화된 비트스트림으로부터, 서브 블록 스캔 순서상 마지막 유효 서브 블록에 선행하는 서브 블록들 중 상기 계수 블록의 좌상귀 서브 블록을 제외한 각각의 서브 블록이 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록인지 여부를 가리키는 각각의 서브 블록에 대한 신택스 요소를 복호화한다(S1020). 이때, 상기 계수 블록의 좌상귀 서브 블록 및 상기 마지막 유효 서브 블록을 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록으로 설정할 수 있다.The video decoding apparatus is configured to have at least one non-zero coefficient in each sub-block except for an upper-left sub-block of the coefficient block among sub-blocks that precede the last effective sub-block in a sub-block scan order from an encoded bitstream. A syntax element for each sub-block indicating whether it is a valid sub-block is decoded (S1020). In this case, the upper left subblock of the coefficient block and the last valid subblock may be set as valid subblocks having at least one non-zero coefficient.

영상 복호화 장치는, 부호화된 비트스트림으로부터, 상기 계수 블록의 좌상귀 서브 블록의 계수들과, 상기 마지막 유효 서브 블록의 계수들과, 상기 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록임을 가리키는 서브 블록들의 계수들과 유효 서브 블록으로 설정된 서브 블록들의 계수들을 복호화한다(S1030). 이들 계수를 복호화하는 과정은 해당 서브 블록들 내의 각 계수가 비-제로인지 제로인지를 가리키는 플래그를 복호화하는 과정을 포함할 수 있다. 영상 복호화 장치는 신택스 요소가 적어도 하나의 비-제로 계수를 갖는 유효 서브 블록이 아니라고 가리키는 서브 블록들의 계수들의 복호화를 스킵한다. 또한, 영상 복호화 장치는 서브 블록 스캔 순서상 마지막 유효 서브 블록에 후행하는 서브 블록들의 계수들의 복호화를 스킵한다.The video decoding apparatus determines, from an encoded bitstream, coefficients of an upper-left subblock of the coefficient block, coefficients of the last valid subblock, and the syntax element are a valid subblock having at least one non-zero coefficient. Coefficients of the indicated subblocks and coefficients of subblocks set as valid subblocks are decoded (S1030). The process of decoding these coefficients may include a process of decoding a flag indicating whether each coefficient in the corresponding sub-blocks is non-zero or zero. The video decoding apparatus skips decoding of coefficients of subblocks indicating that the syntax element is not a valid subblock having at least one non-zero coefficient. Also, the video decoding apparatus skips decoding coefficients of subblocks following the last valid subblock in the subblock scanning order.

이상에서 설명한 방법은 변환(transform) 후 비-제로 계수들의 위치가 대부분 저주파 성분에 모이고 고주파에는 위치하지 않는다는 점에 착안하여, 마지막 유효 계수 혹은 마지막 유효 서브 블록의 위치를 우선적으로 시그널링하는 방식이다. 이하에서는, 비-제로 계수들이 어디에 위치하든 그 위치를 효율적으로 표현할 수 있는 기법을 제안한다. 이러한 기법은 부호화 대상 블록의 변환(transform)이 스킵(skip)되어 고주파 및 저주파 성분(혹은 coefficients)이 아닌 잔차(residual) 값 자체를 보내는 경우에 더 유용할 수 있다. The method described above is a method of preferentially signaling the location of the last effective coefficient or the last valid subblock, noting that most of the locations of non-zero coefficients are concentrated in low-frequency components after transformation and are not located in high-frequency components. Hereinafter, a technique capable of efficiently expressing the location of non-zero coefficients is proposed. This technique may be more useful when the transform of the block to be encoded is skipped and thus a residual value itself is transmitted instead of high and low frequency components (or coefficients).

유효 블록의 계층적 분할Hierarchical division of valid blocks

본 발명에서 제안하는 다른 기법은 현재의 계수 블럭에 대해, 적어도 하나의 비-제로 계수를 포함하는 서브 블록(유효 블록)을 단일 유효 계수에 도달될 때까지 n(2 이상의 자연수) 개의 균등한 크기의 작은 서브 블록들(정사각형들)로 반복적으로(recursively) 분할하면서, 유효 플래그(significant_flag)를 사용하여 생성된 각 서브 블록들에 대해 유효 블록인지 여부를 시그널링한다. 유효 플래그는 하나의 서브 블록 내 비-제로 계수(즉, 유효 계수)가 하나 이상 존재하면 예컨대 "1"로, 모든 계수가 제로이면 예컨대 "0"으로 표현된다. 유효 플래그 값이 "0"이면, 해당 서브 블록은 더이상 분할되지 않으며, 그 하위 계층의 블록들에 대한 유효 플래그 값은 더이상 필요치 않다. 반대로, 유효 플래그 값이 "1"이면, 해당 블록은 n(2 이상의 자연수) 개의 서브 블록으로 분할되어, 그 하위 계층의 서브 블록들에 대한 유효 플래그 값을 시그널링 한다.Another technique proposed by the present invention is to divide sub-blocks (effective blocks) including at least one non-zero coefficient into n (2 or more natural numbers) equal sizes for a current coefficient block until a single significant coefficient is reached. While recursively dividing into small sub-blocks (squares) of , a valid flag ( significant_flag ) is used to signal whether or not it is a valid block for each generated sub-block. The valid flag is expressed as "1" when one or more non-zero coefficients (ie, valid coefficients) exist in one sub-block, and as "0" when all coefficients are zero. If the valid flag value is “0”, the corresponding sub-block is not further divided, and the valid flag values for the blocks of the lower layer are no longer needed. Conversely, if the valid flag value is “1”, the corresponding block is divided into n (a natural number greater than or equal to 2) subblocks, and the valid flag values for subblocks of the lower layer are signaled.

주어진 계수 블록의 유효 플래그 값이 "0"이면 해당 블록은 더이상 분할되지 않으며, 따라서 그 하위 계층의 블록들에 대한 유효 플래그 값은 더이상 필요치 않다. 반대로, 주어진 계수 블록의 유효 플래그 값이 "1"이면서 정사각형 모양의 계수 블록인 경우, 그 하위 계층의 4개의 정사각형 블록으로 분할되며, 그 4개의 블록에 대해 각각 비-제로 계수가 존재하는지 표현된다. 유효 블록이 단일 유효 계수에 도달될 때까지 반복적으로 분할되는 바, 생성된 블록의 크기는 계수 블록의 크기에서부터 1×1의 크기(즉, 단일 계수)까지를 포함한다. 예를 들어, 계수 블록이 8×8이라면, 유효 플래그가 결정되는 블록은 8×8, 4×4, 2×2, 1×1의 크기를 모두 포함한다.If the valid flag value of a given coefficient block is “0”, the corresponding block is not further divided, and thus the valid flag values for blocks in the lower layer are no longer needed. Conversely, if the effective flag value of a given coefficient block is “1” and it is a square-shaped coefficient block, it is divided into four square blocks of the lower layer, and it is expressed whether non-zero coefficients exist for each of the four blocks. . Since the effective block is repeatedly divided until reaching a single effective coefficient, the size of the generated block includes from the size of the coefficient block to the size of 1×1 (i.e., a single coefficient). For example, if the coefficient block is 8x8, the block for which the valid flag is determined includes all sizes of 8x8, 4x4, 2x2, and 1x1.

주어진 계수 블록의 유효 플래그 값이 "1"이면서 직사각형 모양의 계수 블록인 경우, 그 하위 계층의 정사각형 블록의 개수는 해당 계수 블록의 크기에 따라 달라질 수 있다. 예를 들어, 계수 블록이 16×8이라면, 하위 계층으로 분할되는 정사각형 블록은 2개의 8×8이 되고, 계수 블록이 4×16이라면, 하위 계층으로 분할되는 정사각형 블록은 4개의 4×4가 된다. 즉, 주어진 계수 블록이 M×kM 또는 kM×M 크기의 직사각형인 경우, 주어진 계수 블록은 먼저 n=k 개의 정사각형(M×M)으로 분할될 수 있다. 이때, k개의 정사각형의 순서는 계수 블록의 모양에 따라 좌에서 우 또는 상에서 하의 역순으로 진행된다. 예를 들어, 계수 블록이 가로로 긴 직사각형이면 그 하위 k 개의 정사각형 블록의 순서는 좌에서 우의 역순으로 진행되며, 계수 블록이 세로로 긴 직사각형이면 그 하위 k 개의 정사각형 블록의 순서는 상에서 하의 역순으로 진행된다. 직사각형 모양의 계수 블록을 그 하위 계층의 k 개의 정사각형으로 분할한 이후의 과정은 상기 정사각형 모양의 계수 블록의 그 하위 계층의 (4개의) 정사각형 서브 블록과 동일하게 진행된다. When a valid flag value of a given coefficient block is “1” and is a rectangular coefficient block, the number of square blocks in the lower layer may vary depending on the size of the corresponding coefficient block. For example, if the coefficient block is 16x8, the square block divided into lower layers becomes two 8x8s, and if the coefficient block is 4x16, the square blocks divided into lower layers become four 4x4s. do. That is, when a given coefficient block is a rectangle with a size of MxkM or kMxM, the given coefficient block can first be divided into n=k squares (MxM). At this time, the order of k squares proceeds from left to right or from top to bottom in reverse order according to the shape of the coefficient block. For example, if the coefficient block is a horizontally long rectangle, the order of the lower k square blocks proceeds in reverse order from left to right, and if the coefficient block is a vertically long rectangle, the order of the lower k square blocks proceeds in reverse order from top to bottom. It goes on. The process after dividing the rectangular coefficient block into k squares of the lower layer is performed in the same way as the (4) square subblocks of the lower layer of the square coefficient block.

도 11은, 도 6에 예시된 32×32 블록에 대해, 상기 제안된 기법을 적용하여 생성된 블록들을 도시한 도면이다. 제안된 방법에 따르면, 도 11에서 굵은 실선으로 표시된 블록들에 대한 유효 플래그들이 시그널링된다. 각 블록에 대한 계층 구조 및 각 블록의 유효 플래그 값을 트리(tree) 형식으로 표현하면 도 12와 같다. 도 12의 트리 내 노드는 여러 크기의 각 블록을 의미하며, 검정색으로 표시된 노드는 해당 블록의 유효 플래그 값이 "1"임을 의미하며, 흰색으로 표시된 노드는 해당 블록의 유효 플래그 값이 "0"임을 의미한다. 또한, 흰색으로 표시된 노드는 해당 노드는 리프 노드(leaf node)가 되어 하위 노드가 존재하지 않는다. 즉, 주어진 블록의 유효 플래그가 "0"이면 해당 블록은 리프 노드가 되고, 하위 계층의 블록이 존재하지 않는다. 여기서, 해당 블록은 부모 노드, 그 하위 계층의 블록들은 자식 노드라 지칭될 수 있다. 또한, 주어진 블록의 유효 플래그가 "1"이면 해당 블록은 항상 4개의 자식 노드를 가진다. FIG. 11 is a diagram illustrating blocks generated by applying the proposed technique to the 32×32 block illustrated in FIG. 6 . According to the proposed method, valid flags for blocks indicated by thick solid lines in FIG. 11 are signaled. A hierarchical structure of each block and a valid flag value of each block are expressed in a tree format as shown in FIG. 12 . Nodes in the tree of FIG. 12 refer to blocks of various sizes. A node marked in black means that the valid flag value of the corresponding block is "1", and a node marked in white means that the valid flag value of the corresponding block is "0". means that In addition, a node marked in white is a leaf node and has no subordinate nodes. That is, if the validity flag of a given block is “0”, the corresponding block becomes a leaf node and no lower layer blocks exist. Here, a corresponding block may be referred to as a parent node, and blocks of a lower layer may be referred to as child nodes. Also, if the validity flag of a given block is “1”, the corresponding block always has 4 child nodes.

도 12의 트리는 도 11의 계수 블록에서 생성되는 블록들을 diagonal 방식으로 스캔하여 구성하였다. 또한, 비트스트림의 저장 순서(즉, 스캔 순서의 역순; 좌상, 좌하, 우상, 우하의 역순)을 고려하여 트리를 구성하였다. 도 12에서 사용한 diagonal 방식은 예시적인 것으로, 앞서 언급한 기존 3가지 타입의 스캔 방식이 기존 조건에 따라 사용될 수 있다. 만약, diagonal이 아닌 다른 스캔 방식을 사용하는 경우에는 도 11의 계수 블록에 대한 트리는 도 12와 다르게 구성된다. 또한, z-스캔(역순) 방식으로 트리를 구성하고, 비트스트림의 저장 순서(즉, z-스캔 순서의 역순; 좌상, 우상, 좌하, 우하의 역순)로도 사용할 수 있다. The tree of FIG. 12 was constructed by scanning the blocks generated from the coefficient blocks of FIG. 11 in a diagonal manner. In addition, the tree was constructed in consideration of the storage order of the bitstream (that is, the reverse order of the scan order; the reverse order of upper left, lower left, upper right, and lower right). The diagonal method used in FIG. 12 is an example, and the three types of scan methods mentioned above may be used according to existing conditions. If a scan method other than diagonal is used, the tree for the coefficient block in FIG. 11 is configured differently from that in FIG. 12. In addition, the tree can be configured in the z-scan (reverse order) method and stored in the order of bitstream (ie, the reverse order of the z-scan order; the reverse order of top left, top right, bottom left, bottom right) can also be used.

계수 블록과 같은 크기의 블록에 대한 유효 플래그는 일종의 cbf(coded block flag)의 기능을 수행하는 것으로 볼 수 있다. 예를 들어, 도 11의 계수 블록의 경우에 32×32 블록에 대한 유효 플래그는 cbf 기능을 수행한다. 도 11에 예시된 계수 블록에 대한 32×32 블록의 유효 플래그 값이 "1"이므로, 하위 4개의 16×16 블록에 대한 유효 플래그들을 구한다. 이때, 4개의 16×16 블록은 diagonal 스캔 방식에 따라 좌상, 좌하, 우상, 우하 블록의 역순으로 스캔된다. 4개의 유효 플래그는 "0011"이 된다. 이 중 "1"에 해당하는 좌하 및 좌상 16×16 블록에 대한 하위 8×8 블록들의 유효 플래그들만 표현하면 된다. 좌하 16×16 블록의 하위 4개의 8×8 블록들을 위한 유효 플래그는 "0100"이고, 좌상 16×16 블록의 하위 4개의 8×8 블록들을 위한 유효 플래그들은 "1111"이 된다. 8×8 블록 중 유효 플래그가 "1"인 블록에 대해서만 하위 4×4 블록들의 유효 플래그 값을 표현하면 된다. 상기 작업을 블록의 크기가 1×1까지 반복한다. The valid flag for a block having the same size as the coefficient block can be regarded as performing a kind of cbf (coded block flag) function. For example, in the case of the coefficient block of FIG. 11, the valid flag for the 32×32 block performs the cbf function. Since the valid flag value of the 32×32 block for the coefficient block illustrated in FIG. 11 is “1”, valid flags for the lower four 16×16 blocks are obtained. At this time, the four 16×16 blocks are scanned in the reverse order of the upper left, lower left, upper right, and lower right blocks according to the diagonal scan method. The four valid flags become "0011". Among them, only the valid flags of the lower 8x8 blocks for the lower left and upper left 16x16 blocks corresponding to "1" need to be expressed. The valid flags for the lower 4 8x8 blocks of the lower left 16x16 block are "0100", and the valid flags for the lower 4 8x8 blocks of the upper left 16x16 block are "1111". Valid flag values of lower 4x4 blocks need only be expressed for blocks whose valid flag is “1” among 8×8 blocks. The above operation is repeated until the block size is 1×1.

도 11의 계수 블록에 사분법을 적용한 결과 생성되는 (32×32 ~ 1×1) 블록에 대한 유효 플래그는 아래 표와 같다. Valid flags for blocks (32×32 to 1×1) generated as a result of applying the quadrant method to the coefficient block of FIG. 11 are shown in the table below.

블록 크기block size 유효 플래그valid flag 32×32 블록32×32 blocks 1One 16×16 블록16×16 blocks 00110011 8×8 블록8×8 blocks 0100
1111 0100
1111 4×4 블록4x4 blocks 0100
0110 0001 0001 10010100
0110 0001 0001 1001 2×2 블록2×2 blocks 0010
0001 0001 0001 1000 0001 00010010
0001 0001 0001 1000 0001 0001 1×1 블록1×1 block 0100
1000 1000 0010 0001 1000 00010100
1000 1000 0010 0001 1000 0001

이하에서는 도 11의 계수 블록에 대한 도 12의 트리를 탐색하는 일례를 보인다. 도 12의 트리를 탐색하는 데에는 깊이-우선 탐색(depth-first search) 혹은 너비-우선 탐색(breadth-first search) 방식이 사용될 수 있다. 여기서, 깊이 우선 방식과 너비 우선 방식은 트리의 노드를 탐색하는 순서를 의미한다. 즉, 본 발명에서는, 깊이 우선 방식과 너비 우선 방식 중 사전에 결정된 방식에 따라 트리의 노드를 탐색하면서 해당 노드에 할당된 비트들을 읽어오고, 이를 통해 비트스트림(bit-stream)을 구성하게 된다. 예컨대, 도 13의 (a)에 예시된 트리 구조를 기준으로 설명하면, (1) 깊이 우선 방식으로 비트스트림을 구성한다면, A B C D F G H I E 순으로 비트들을 읽어오게 되며, (2) 너비 우선 방식으로 비트스트림을 구성한다면, A B C D E F G H I 순으로 비트들을 읽어오게 된다.Hereinafter, an example of searching the tree of FIG. 12 for the coefficient block of FIG. 11 is shown. A depth-first search or a breadth-first search may be used to search the tree of FIG. 12 . Here, the depth-first method and the breadth-first method refer to the order in which nodes of the tree are searched. That is, in the present invention, bits assigned to the corresponding node are read while searching for a node of the tree according to a predetermined method among the depth-first method and the breadth-first method, thereby forming a bit-stream. For example, referring to the tree structure illustrated in (a) of FIG. 13, (1) if a bitstream is configured in a depth-first manner, bits are read in the order of A B C D F G H I E, and (2) a bitstream in a width-first manner , the bits are read in the order of A B C D E F G H I.

따라서, 본 발명에서 제안하는 도 12의 트리를 너비 우선 탐색으로 탐색하여 비트스트림을 구성하면 아래와 같다.Accordingly, a bitstream is formed by searching the tree of FIG. 12 proposed in the present invention by breadth-first search as follows.

1. 11. 1

2. 00112. 0011

3. 0100 11113. 0100 1111

4. 0100 0110 0001 0001 1001 4. 0100 0110 0001 0001 1001

5. 0010 0001 0001 0001 1000 0001 0001 5. 0010 0001 0001 0001 1000 0001 0001

6. 0100 1000 1000 0010 0001 1000 00016. 0100 1000 1000 0010 0001 1000 0001

반대로, 도 12의 트리를 깊이 우선 탐색으로 탐색하여 비트스트림을 구성하면 하기와 같다.Conversely, when the tree of FIG. 12 is searched by depth-first search to construct a bitstream, it is as follows.

1. 11. 1

2. 00Feb. 00

3. 1 01 01 001 0100 0 00 00 3. 1 01 01 001 0100 0 00 00

4. 1 1 01 0001 1000 1 0001 1000 0 4. 1 1 01 0001 1000 1 0001 1000 0

5. 1 0001 0001 00105. 1 0001 0001 0010

6. 1 0001 1 0001 000 6. 1 0001 1 0001 000

7. 1 1 0001 1000 001 0001 00017. 1 1 0001 1000 001 0001 0001

예외 규칙exception rule

한편, 도 13의 (b)에 예시된 트리를 너비 우선 탐색 방식을 적용하면, "1" "0001" "1000"으로 비트스트림을 구성할 수 있다. 이때, 가장 상위 부모 노드의 유효 플래그 값이 "1"이고, 이는 해당 블록의 4개의 자식 노드 중 적어도 하나 이상의 자식 노드는 유효 플래그 값이 "1"이어야 함을 의미한다. 그런데 4개의 자식 노드 중 스캔 순서상 앞선 3개의 자식 노드의 유효 플래그 값이 모두 "0"이라면, 마지막 남은 하나의 자식 노드의 플래그 값은 반드시 "1"이어야 한다. 결과적으로 4번째 자식 노드의 플래그 값은 부호화 장치와 복호화 장치가 동일하게 "1"로 추론할 수 있는바, 굳이 시그널링될 필요가 없다. 도 13의 (b)에 예시된 트리에 이러한 예외 규칙을 적용하면, 비트스트림은 "1" "000" "1000"으로 구성된다. On the other hand, if the breadth-first search method is applied to the tree illustrated in (b) of FIG. 13, a bitstream can be configured with “1” “0001” “1000”. At this time, the valid flag value of the highest parent node is "1", which means that at least one child node among the four child nodes of the corresponding block must have a valid flag value of "1". However, if the valid flag values of the three preceding child nodes in the scan order among the four child nodes are all “0”, the flag value of the last remaining child node must be “1”. As a result, the flag value of the fourth child node can be equally inferred as “1” by the encoding device and the decoding device, so it does not need to be signaled. If this exception rule is applied to the tree illustrated in FIG. 13(b), the bitstream is composed of “1” “000” and “1000”.

이러한 예외 규칙을 적용하여, 도 12에 예시된 트리에 대해 너비 우선 탐색 방식으로 비트스트림을 구성하면 아래와 같다. 여기서, 시그널링되지 않는 총 9개의 "1" 값이 "-"으로 표시되었다. By applying these exception rules, a bitstream is configured in the breadth-first search method for the tree illustrated in FIG. 12 as follows. Here, a total of 9 “1” values that are not signaled are indicated by “-”.

1. 11. 1

2. 00112. 0011

3. 0100 11113. 0100 1111

4. 0100 0110 000- 000- 1001 4. 0100 0110 000 - 000 - 1001

5. 0010 000- 000- 000- 1000 000- 000- 5. 0010 000 - 000 - 000 - 1000 000 - 000 -

6. 0100 1000 1000 0010 000- 1000 000- 6. 0100 1000 1000 0010 000 - 1000 000 -

도 12의 트리에 상기 예외 규칙을 적용하여 깊이 우선 탐색으로 비트스트림을 구성하면 하기와 같다. 여기서, 시그널링되지 않는 총 9개의 "1" 값이 "-"으로 표시되었다. A bitstream is constructed by depth-first search by applying the exception rule to the tree of FIG. 12 as follows. Here, a total of 9 “1” values that are not signaled are indicated by “-”.

1. 11. 1

2. 00Feb. 00

3. 1 01 01 001 0100 0 00 00 3. 1 01 01 001 0100 0 00 00

4. 1 1 01 000- 1000 1 000- 1000 0 4. 1 1 01 000 - 1000 1 000 - 1000 0

5. 1 000- 000- 00105. 1 000 - 000 - 0010

6. 1 000- 1 000- 000 6. 1 000 - 1 000 - 000

7. 1 1 000- 1000 001 000- 000- 7. 1 1 000 - 1000 001 000 - 000 -

고주파 계수들의 of high frequency coefficients 제로화zeroing

본 개시에서 제안하는 또 다른 기법은 계수 블록(혹은 양자화된 변환 블록)의 크기가 크면, 계수 블록 중 고주파 성분에 대응되는 일정 부분의 계수 값들을, 그 실제 값과 상관없이, "0"으로 설정하는 것이다. Another technique proposed in the present disclosure sets the coefficient values of a certain portion of the coefficient block corresponding to the high-frequency component to “0”, regardless of the actual value, when the size of the coefficient block (or quantized transform block) is large. is to do

예를 들어, 계수 블록의 크기가 64×64인 경우, 좌상귀 32×32 블록을 제외한 나머지 블록(우상귀, 좌하귀, 및 우하귀 32×32 블록)의 모든 계수 값들이 강제적으로 제로화 된다. 도 14는 64×64 계수 블록을 예시하고 있으며, 기 설정된 임계 크기가 32×32라면, 좌상귀 32×32 블록을 제외한 영역에 위치한 비-제로 계수들(검정색으로 표시됨)은 모두 제로화 된다. 따라서 좌하귀 32×32 블록, 우상귀 32×32 블록, 및 우하귀 32×32 블록에 대한 유효 플래그는 모두 "0"으로 설정된다.For example, when the size of the coefficient block is 64×64, all coefficient values of the remaining blocks (upper right, lower left, and lower right 32×32 blocks) except for the upper left 32×32 block are forcibly set to zero. 14 illustrates a 64×64 coefficient block, and if the predetermined threshold size is 32×32, all non-zero coefficients (marked in black) located in an area except for the upper left 32×32 block are zeroed. Accordingly, valid flags for the lower left 32×32 block, the upper right 32×32 block, and the lower right 32×32 block are all set to “0”.

주어진 계수 블록이 M×kM 또는 kM×M 크기의 직사각형인 경우, 주어진 계수 블록은 k개의 M×M 블록들로 분할될 수 있는데, 기 설정된 임계 크기가 h×h (h ≤ M)라면, 가장 상측 h×h 블록 혹은 가장 좌측 h×h 블록을 제외한 나머지 블록들의 모든 계수 값들이 강제적으로 제로화 될 수 있다. 반대로, 기 설정된 임계 크기가 h×h (M < h < kM)라면, 가장 상측 M×h 블록 혹은 가장 좌측 h×M 블록을 제외한 나머지 블록들의 모든 계수 값들이 강제적으로 제로화 될 수 있다. 예컨대, 계수 블록이 16×4이고, 기 설정된 임계 크기가 8×8이라면, 4개의 4×4 블록 중 우측 2개의 4×4 블록의 비제로 계수들이 모두 제로화된다. When a given coefficient block is a rectangle with a size of M × kM or kM × M, the given coefficient block can be divided into k M × M blocks. If the predetermined threshold size is h × h (h ≤ M), the most All coefficient values of blocks other than the upper h×h block or the leftmost h×h block may be forcibly set to zero. Conversely, if the predetermined threshold size is h×h (M < h < kM), all coefficient values of blocks other than the uppermost M×h block or the leftmost h×M block may be forcibly set to zero. For example, if the coefficient block is 16×4 and the predetermined threshold size is 8×8, all non-zero coefficients of the right two 4×4 blocks among the four 4×4 blocks are zeroed.

도 14에 예시된 64×64 계수 블록에 대한 유효 플래그 값들을 diagonal 스캔 방식 (역순)과 너비 우선 탐색을 통해 표현하면 다음과 같다. 4개의 32×32 블록 중 스캔 순서상 앞선 3개의 블록은 비-제로 계수들이 실제 존재함에도 불구하고 "000"으로 설정되고, 마지막 네 번째 블록만 플래그 값이 "1"이 된다. 또한, 상기 예외 규칙을 적용하면 32×32 블록들에 대한 4 비트("0001")을 모두 시그널링 하지 않고 유추 가능한 정보가 된다. 나아가, 제로화된 블록들의 계수들은 시그널링되지 않는다.Expressing valid flag values for the 64×64 coefficient block illustrated in FIG. 14 through a diagonal scan method (in reverse order) and breadth-first search is as follows. Among the four 32×32 blocks, the first three blocks in the scan order are set to “000” even though non-zero coefficients actually exist, and the flag value of only the last fourth block is “1”. In addition, when the exception rule is applied, all 4 bits (“0001”) for 32×32 blocks are not signaled, and information that can be inferred is obtained. Furthermore, coefficients of zeroed blocks are not signaled.

블록 크기block size 유효 플래그valid flag 64×64 블록64×64 blocks 1One 32×32 블록32×32 blocks 00010001 16×16 블록16×16 blocks 11111111 8×8 블록8×8 block 0001 00010001 0001 1001 1111 1001 1111 4×4 블록4x4 blocks 0100 0001 0100 0100
0001 1000 0001 10010100 0001 0100 0100
0001 1000 0001 1001 2×2 블록2×2 blocks 0010 0100 0001 0010
1000 0010 0100 1000 00010010 0100 0001 0010
1000 0010 0100 1000 0001 1×1 블록1×1 block 0100 0010 0100 0001
0010 0100 0010 0001 00010100 0010 0100 0001
0010 0100 0010 0001 0001

이러한 고주파 블록의 제로화는 계수 블록의 손자 노드의 블록 크기가 여전히 임계값보다 큰 경우에, 손자 노드에 해당하는 4개의 블록에 재차 적용될 수 있다. The zeroing of these high-frequency blocks may be applied again to the four blocks corresponding to the grandchild nodes of the coefficient block when the block size of the grandchild nodes is still larger than the threshold value.

계수 블록의 크기가 클수록, 변환 후에는 비-제로 계수들이 대부분 저주파 성분에 집중되고, 고주파 성분에는 거의 존재하지 않는다는 점에서, 이러한 예외 규칙은, 큰 계수 블록의 부호화 효율을 높이는 데 특히 유용하다. 따라서, 변환(transform) 사이즈가 커질수록 제안된 제로화 기법에 의해 압축 성능을 더 증가시킬 수 있다. 이러한 제로화 기법은 도 6 내지 도 10을 참조하여 설명한 기법과 결합될 수도 있다.As the size of the coefficient block increases, most of the non-zero coefficients are concentrated in the low-frequency component after transformation and rarely exist in the high-frequency component, so this exception rule is particularly useful for increasing the coding efficiency of the large coefficient block. Therefore, as the transform size increases, compression performance can be further increased by the proposed zeroing technique. This zeroing technique may be combined with the technique described with reference to FIGS. 6 to 10 .

이상에서 설명한 기법과 규칙들을 사용하여, 도 11에 예시된 32×32 계수 블록에 대해, 비-제로 계수들의 분포를 표현하기 위해 시그널링되는 유효 플래그들 값들은 다음과 같다. 적용된 규칙들을 나열하면, 다음과 같다.Using the techniques and rules described above, for the 32×32 coefficient block illustrated in FIG. 11, valid flag values signaled to represent the distribution of non-zero coefficients are as follows. The list of applied rules is as follows.

- 고주파 블록들의 제로화: th 값을 64로 설정- Zeroing of high-frequency blocks: set th value to 64

- 스캔 방식: diagonal 방식을 사용- Scanning method: using the diagonal method

- 트리 탐색 방식: 너비 우선을 사용- Tree traversal method: use breadth first

- 예외 규칙 적용- Application of exception rules

1. 11. 1

2. 00112. 0011

3. 0100 11113. 0100 1111

4. 0100 0110 000- 000- 1001 4. 0100 0110 000 - 000 - 1001

6. 0100 1000 1000 0010 000- 1000 000- 6. 0100 1000 1000 0010 000 - 1000 000 -

이와 같이, 도 11에 예시된 32×32 계수 블록의 비-제로 계수들의 위치를 표현하는 데에는 총 80 비트가 소요된다. 앞서 설명한 바와 같이, 도 11의 계수 블록과 동일한 도 6의 계수 블록을 기존의 HEVC에서 채용된 방식으로 표현하면 마지막 유효 계수의 위치를 표현하기 위한 4개의 신택스를 제외하더라도, 나머지 2개의 신택스(coded_sub_block_flag 및 sig _ coeff _flag)에 필요한 비트수는 대략 140 비트가 된다. 140 비트에 마지막 유효 계수의 위치를 표현하기 위한 정보까지 더하게 되면 그 이상의 비트수가 필요하게 된다.As such, a total of 80 bits are required to express the positions of non-zero coefficients of the 32×32 coefficient block illustrated in FIG. 11 . As described above, if the coefficient block of FIG. 6, which is the same as the coefficient block of FIG. and sig _ coeff _flag ) is approximately 140 bits. If information for expressing the position of the last significant coefficient is added to 140 bits, more bits are required.

만약, 임계값(th )이 16이고 다른 조건들은 위와 동일하다면, 도 4에 예시된 32×32 계수 블록에 대한 유효 플래그들은 다음과 같다. If the threshold value ( th ) is 16 and other conditions are the same as above, valid flags for the 32×32 coefficient block illustrated in FIG. 4 are as follows.

1. 11. 1

2.2. --------

3. 11113. 1111

4. 0110 000- 000- 1001 4. 0110 000 - 000 - 1001

5. 000- 000- 000- 1000 000- 000- 5. 000 - 000 - 000 - 1000 000 - 000 -

6. 1000 1000 0010 000- 1000 000- 6. 1000 1000 0010 000 - 1000 000 -

여기서, 계수 블록 전체(즉, 32×32 블록)에 대한 유효 플래그 값은 "1"이고, 하위 노드인 16×16 블록은 임계값(th = 16)과 동일하므로 스캔 순서상 처음 세 개의 16×16 블록에 대한 플래그 값은 "000"으로 설정되며, 네 번째 블록에 대한 플래그 값은 "1"로 설정된다. 따라서, 4개의 16×16 블록에 대한 유효 플래그 "0001"는 시그널링 할 필요가 없다.Here, the valid flag value for the entire coefficient block (i.e., 32×32 block) is “1”, and the 16×16 block, which is a lower node, is equal to the threshold value ( th = 16), so the first three 16× The flag value for the 16th block is set to "000", and the flag value for the fourth block is set to "1". Therefore, the valid flag "0001" for the four 16x16 blocks does not need to be signaled.

도 15는 영상 복호화 장치가 큰 계수 블록에 대한 처리 과정을 도시한 흐름도이다. 도 15에서는 너비 우선 탐색 방식의 사용을 전제로 하고 있으나, 깊이 우선 탐색을 사용할 수도 있음에 유의한다. 또한, 간략화를 위해, 도 15에는 전술한 예외 규칙을 포함시키지 않았으며, 계수 블록이 정사각형 모양임을 가정하였다. 15 is a flowchart illustrating a process of processing a large coefficient block by an image decoding apparatus. Although FIG. 15 assumes the use of a breadth first search method, it should be noted that a depth first search may be used. Also, for simplicity, the aforementioned exception rule is not included in FIG. 15 and it is assumed that the coefficient block has a square shape.

영상 복호화 장치는 복호화하고자 하는 현재의 계수 블록의 사이즈(즉, TU 사이즈)을 N으로 설정한다. 영상 복호화 장치는 현재의 계수 블록에 대한 유효 플래그를 파싱하고, 해당 유효 플래그가 "0"이 아니면, 계수 블록을 4개의 블록으로 분할한다. 생성된 블록의 크기(N/2)가 임계값 th 이상이면, 영상 복호화 장치는, 유효 플래그의 파싱 없이, 생성된 4개의 블록에 대한 유효 플래그들을 "0001"로 설정한다. 반대로, 생성된 블록의 크기(N/2)가 임계값 th 미만이면, 영상 복호화 장치는 생성된 4개의 블록에 대한 유효 플래그들을 파싱한다. The video decoding apparatus sets the size (ie, TU size) of the current coefficient block to be decoded to N. The video decoding apparatus parses a valid flag for the current coefficient block and, if the valid flag is not “0”, divides the coefficient block into 4 blocks. If the size (N/2) of the generated block is greater than or equal to the threshold value th , the video decoding apparatus sets valid flags for the four generated blocks to “0001” without parsing the valid flags. Conversely, if the size (N/2) of the generated block is less than the threshold value th , the video decoding apparatus parses valid flags for the four generated blocks.

생성된 4개의 블록 각각에 대해, 해당 블록의 유효 플래그 값이 "0"이 아니면, 해당 블록을 단일 유효 계수(N=1)에 도달될 때까지 4개의 균등한 크기의 작은 블록들로 반복적으로(recursively) 분할하면서, 분할할 때마다 생성되는 4개의 블록에 대한 유효 플래그들을 파싱한다. For each of the four generated blocks, if the valid flag value of that block is not "0", that block is iteratively divided into four equally sized small blocks until a single valid count (N=1) is reached. While (recursively) partitioning, valid flags for the four blocks generated each time are parsed.

전술한 바와 같이, 도 11 내지 도 15을 참조하여 설명한 기법은 부호화 대상 블록의 변환(transform)이 스킵되어 고주파 및 저주파 성분(혹은 coefficients)이 아닌 잔차(residual) 값 자체를 보내는 경우에 더 유용할 수 있다. 이점에 착안하여, 영상 부호화 장치는 부호화 대상 블록의 변환이 스킵되는 경우에 도 11 내지 도 15을 참조하여 설명한 기법을 적용되고, 변환이 수행된 양자화된 변환 계수 블록에 대해서는 도 6 내지 도 10을 참조하여 설명한 기법을 적용하도록 구성될 수도 있다.As described above, the technique described with reference to FIGS. 11 to 15 is more useful when the transform of the encoding target block is skipped to transmit the residual value itself rather than the high and low frequency components (or coefficients). can In view of this, the video encoding apparatus applies the technique described with reference to FIGS. 11 to 15 when the transform of the encoding target block is skipped, and applies FIGS. 6 to 10 to the quantized transform coefficient block on which the transform is performed It may also be configured to apply the techniques described with reference.

이상의 설명은 본 실시예의 기술 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 실시예가 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 실시예의 본질적인 특성에서 벗어나지 않는 범위에서 다양한 수정 및 변형이 가능할 것이다. 따라서, 본 실시예들은 본 실시예의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예에 의하여 본 실시예의 기술 사상의 범위가 한정되는 것은 아니다. 본 실시예의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 실시예의 권리범위에 포함되는 것으로 해석되어야 할 것이다.The above description is merely an example of the technical idea of the present embodiment, and various modifications and variations can be made to those skilled in the art without departing from the essential characteristics of the present embodiment. Therefore, the present embodiments are not intended to limit the technical idea of the present embodiment, but to explain, and the scope of the technical idea of the present embodiment is not limited by these embodiments. The scope of protection of this embodiment should be interpreted according to the claims below, and all technical ideas within the scope equivalent thereto should be construed as being included in the scope of rights of this embodiment.

Claims

A method of encoding a coefficient block having an array of coefficients divisible into coefficients of a plurality of sub-blocks, comprising:
if the size of the coefficient block is greater than or equal to a preset threshold, setting all coefficients of the remaining region except for the rectangular region including the upper-left corner of the coefficient block to zero coefficients;
determining a last significant sub-block, wherein the last significant sub-block is the last sub-block in the sub-block scan order having at least one non-zero coefficient;
encoding information indicating the location of the determined last valid subblock;
Each subblock indicating whether each of the subblocks preceding the last valid subblock in the subblock scan order, except for the upper left subblock of the coefficient block, is a valid subblock having at least one non-zero coefficient. Encoding syntax elements for sub-blocks; and
Coefficients of the upper left sub-block, coefficients preceding the last non-zero coefficient in the last valid sub-block, and sub-blocks indicating that the syntax element is a valid sub-block having at least one non-zero coefficient Encoding the Coefficients
including,
The step of encoding coefficients of subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient indicates, for each of all coefficients in the given subblock, whether the given coefficient is a non-zero coefficient. Encoding a first flag;
Characterized in that the encoding order of the first flag for each of all coefficients in the given subblock is determined based on a diagonal scan order.
Encoding method of coefficient block.

A method of decoding a coefficient block having an array of coefficients divisible into coefficients of a plurality of sub-blocks, comprising:
if the size of the coefficient block is greater than or equal to a preset threshold, setting all coefficients of the remaining region except for the rectangular region including the upper-left corner of the coefficient block to zero coefficients;
decoding information indicating a location of a last non-zero coefficient in a sub-block scan sequence;
Each subblock indicating whether each subblock excluding the upper left subblock of the coefficient block among the subblocks preceding the last valid subblock in the subblock scan order is a valid subblock having at least one non-zero coefficient decoding a syntax element for a block, wherein the last valid sub-block is the sub-block having the last non-zero coefficient; and
Coefficients of the upper left sub-block, coefficients preceding the last non-zero coefficient in the last valid sub-block, and sub-blocks indicating that the syntax element is a valid sub-block having at least one non-zero coefficient decoding the coefficients
including,
The step of decoding coefficients of subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient indicates, for each of all coefficients in the given subblock, whether the given coefficient is a non-zero coefficient. Decrypting the first flag;
Characterized in that the decoding order of the first flag for each of all coefficients in the given subblock is determined based on a diagonal scan order.
Decoding method of coefficient block.

A computer-readable non-transitory recording medium storing a bitstream containing coded data for a coefficient block having an array of coefficients divisible into coefficients of a plurality of sub-blocks, the bitstream comprising:
if the size of the coefficient block is greater than or equal to a preset threshold, setting all coefficients of the remaining region except for the rectangular region including the upper-left corner of the coefficient block to zero coefficients;
decoding information indicating a location of a last non-zero coefficient in a sub-block scan sequence;
Each subblock indicating whether each subblock excluding the upper left subblock of the coefficient block among the subblocks preceding the last valid subblock in the subblock scan order is a valid subblock having at least one non-zero coefficient decoding a syntax element for a block, wherein the last valid sub-block is the sub-block having the last non-zero coefficient; and
Coefficients of the upper left sub-block, coefficients preceding the last non-zero coefficient in the last valid sub-block, and sub-blocks indicating that the syntax element is a valid sub-block having at least one non-zero coefficient decoding the coefficients
It is configured to be decrypted by a decryption process comprising
The step of decoding coefficients of subblocks indicating that the syntax element is a valid subblock having at least one non-zero coefficient indicates, for each of all coefficients in the given subblock, whether the given coefficient is a non-zero coefficient. Decrypting the first flag;
Characterized in that the decoding order of the first flag for each of all coefficients in the given subblock is determined based on a diagonal scan order.
A computer-readable, non-transitory recording medium.