KR102006804B1

KR102006804B1 - Method and apparatus for video encoding by motion prediction using arbitrary partition, and method and apparatus for video decoding by motion compensation using arbitrary partition

Info

Publication number: KR102006804B1
Application number: KR1020180065652A
Authority: KR
Inventors: 이선일; 천민수; 한우진
Original assignee: 삼성전자주식회사
Priority date: 2018-06-07
Filing date: 2018-06-07
Publication date: 2019-08-02
Also published as: KR20180069766A

Abstract

비디오 데이터를 최대 크기의 부호화 단위인 적어도 하나의 최대 부호화 단위로 분할하고, 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 인터 예측을 포함하여, 최대 부호화 단위의 적어도 하나의 분할 영역 별로, 계층적 구조의 심도별 부호화 단위에 기초하여 최대 부호화 단위의 비디오 데이터를 부호화하고, 부호화 결과가 출력될 심도인 부호화 심도를 결정하여, 최대 부호화 단위별로, 분할 영역 별 부호화 심도에 대응하는 부호화된 비디오 데이터, 부호화 심도 및 부호화 모드에 관한 정보를 포함하는 비트스트림을 출력하는 비디오 부호화 방법 및 장치가 개시된다.Hierarchical structure for each of the at least one partitioned region of the largest coding unit, including inter prediction using a partition in which video data is divided into at least one largest coding unit that is a coding unit having a maximum size, and the coding unit is split at an arbitrary ratio. Encodes video data of a maximum coding unit based on coding units according to depths, determines a coded depth that is a depth at which an encoding result is to be output, and encodes video data corresponding to a coded depth of each divided region according to the maximum coding units. Disclosed are a video encoding method and apparatus for outputting a bitstream including information about a depth and an encoding mode.

Description

Method and apparatus for video encoding by motion prediction using arbitrary partitions, and method and apparatus for video decoding by motion compensation using arbitrary partitions {Method and apparatus for video encoding by motion prediction using arbitrary partition, and method and apparatus for video decoding by motion compensation using arbitrary partition}

본 발명은 비디오의 부호화 및 복호화에 관한 것이다.The present invention relates to encoding and decoding of video.

고해상도 또는 고화질 비디오 컨텐트를 재생, 저장할 수 있는 하드웨어의 개발 및 보급에 따라, 고해상도 또는 고화질 비디오 컨텐트를 효과적으로 부호화하거나 복호화하는 비디오 코덱의 필요성이 증대하고 있다. 기존의 비디오 코덱에 따르면, 비디오는 소정 크기의 매크로블록에 기반하여 제한된 부호화 방식에 따라 부호화되고 있다.Background of the Invention [0002] As the development and dissemination of hardware capable of playing back and storing high-resolution or high-definition video content increases the need for video codecs to effectively encode or decode high-definition or high-definition video content. According to the conventional video codec, video is encoded according to a limited encoding method based on a macroblock of a predetermined size.

현존 비디오 코덱의 인터 예측은, 2Nx2N 크기의 매크로블록의 2Nx2N, 2NxN, Nx2N, NxN 크기의 파티션을 이용하여 움직임 벡터를 추정하여 매크로블록의 움직임을 추정한다.Inter prediction of an existing video codec estimates a motion of a macroblock by estimating a motion vector using partitions of 2Nx2N, 2NxN, Nx2N, and NxN sizes of macroblocks having a size of 2Nx2N.

본 발명은, 임의적 형태의 파티션을 이용한 인터 예측에 따른 비디오 부호화 및 비디오 복호화에 관한 것이다.The present invention relates to video encoding and video decoding according to inter prediction using an arbitrary type of partition.

본 발명의 일 실시예에 따른 비디오 부호화 방법은, 비디오 데이터를 최대 크기의 부호화 단위인 적어도 하나의 최대 부호화 단위로 분할하는 단계; 상기 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 인터 예측을 포함하여, 상기 최대 부호화 단위의 적어도 하나의 분할 영역 별로, 심도가 깊어짐에 따라 상위 심도의 부호화 단위가 분할되는 계층적 구조의 심도별 부호화 단위에 기초하여 상기 최대 부호화 단위의 비디오 데이터를 부호화하고, 부호화 결과가 출력될 심도인 부호화 심도를 결정하는 단계; 및 상기 최대 부호화 단위별로, 상기 분할 영역 별 부호화 심도에 대응하는 부호화된 비디오 데이터, 상기 부호화 심도 및 부호화 모드에 관한 정보를 포함하는 비트스트림을 출력하는 단계를 포함한다.A video encoding method according to an embodiment of the present invention includes: dividing video data into at least one maximum coding unit that is a coding unit having a maximum size; Depth-based encoding of a hierarchical structure in which coding units having higher depths are divided as depths are deepened in at least one divided region of the largest coding unit, including inter prediction using partitions in which the coding units are divided at random ratios. Encoding video data of the maximum coding unit based on a unit, and determining a coded depth which is a depth at which an encoding result is to be output; And outputting a bitstream including coded video data corresponding to the coded depths of the divided regions, the coded depth, and the coded mode, for each of the maximum coding units.

일 실시예에 따른 부호화 단위는 최대 크기 및 심도로 특징지어질 수 있다. The coding unit according to an embodiment may be characterized by a maximum size and depth.

심도란 부호화 단위가 계층적으로 분할되는 단계를 나타내며, 심도가 깊어질수록 심도별 부호화 단위는 최대 부호화 단위로부터 최소 부호화 단위까지 분할될 수 있다. 본 명세서에서는, 높은 심도 또는 상위 심도로부터 낮은 심도 또는 하위 심도의 방향으로 '심도가 깊어진다'고 정의한다. 심도가 깊어짐에 따라 최대 부호화 단위의 분할 횟수가 증가하고, 최대 부호화 단위의 분할 가능한 총 횟수가 '최대 심도'로 대응된다. 부호화 단위의 최대 크기 및 최대 심도가 미리 설정되어 있을 수 있다. Depth indicates a stage in which coding units are hierarchically divided. As the depth increases, the depth coding units can be divided from the maximum coding unit to the minimum coding unit. In the present specification, it is defined that the depth is deepened from a high depth or a high depth toward a low depth or a bottom depth. As the depth increases, the number of division of the maximum encoding unit increases and the total number of divisions of the maximum encoding unit corresponds to 'maximum depth'. The maximum size and the maximum depth of the encoding unit may be preset.

일 실시예에 따른 상기 부호화 심도 결정 단계는, 상기 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 선택적으로 결정하는 단계를 포함할 수 있다.The determining of the coding depth according to an embodiment may include selectively determining whether to perform inter prediction using a partition in which the coding unit is divided at an arbitrary ratio.

일 실시예에 따른 상기 출력 단계는, 상기 인터 예측을 위한 파티션 타입이 상기 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보를 상기 비트스트림에 포함할 수 있다.According to an embodiment, the outputting step may include, in the bitstream, information indicating whether a partition type for inter prediction includes a partition that splits the coding unit at an arbitrary ratio.

일 실시예에 따른 상기 임의적 비율로 분할된 파티션은, 상기 부호화 단위의 높이 및 너비 중 적어도 하나가 1 대 3 또는 3 대 1로 분할될 수 있다.According to an embodiment, at least one of the height and the width of the coding unit may be divided into one to three or three to one in the partition divided at an arbitrary ratio.

일 실시예에 따른 상기 최대 부호화 단위는, 16x16, 32x32, 64x64, 128x128 및 256x256 블록들 중 적어도 하나로 선택적으로 설정될 수 있다.According to an embodiment, the maximum coding unit may be selectively set to at least one of 16x16, 32x32, 64x64, 128x128, and 256x256 blocks.

일 실시예에 따른 상기 부호화 심도는, 해당 분할 영역의 상기 계층적 구조에 따른 심도별 부호화 단위들에 기초한 부호화 결과들 중 부호화 효율이 가장 높은 심도별 부호화 단위의 심도로 결정되며, 상기 최대 부호화 단위 내의 적어도 하나의 분할 영역마다 부호화 심도가 독립적으로 결정될 수 있다.According to an embodiment, the coding depth is determined as a depth of a coding unit according to a depth having a highest coding efficiency among coding results based on coding units according to depths according to the hierarchical structure of a corresponding partition, and the maximum coding unit The coding depth may be independently determined for each of at least one divided region within the frame.

본 발명의 일 실시예에 따른 비디오 복호화 방법은, 부호화된 비디오에 대한 비트스트림을 수신하여 파싱(parsing)하는 단계; 상기 비트스트림으로부터, 최대 부호화 단위별로 부호화된 비디오 데이터 및 상기 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보를 추출하는 단계; 및 상기 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보에 기초하여, 상기 최대 부호화 단위별로 적어도 하나의 부호화 심도별 부호화 단위마다, 상기 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 움직임 보상을 포함하는 복호화를 수행하는 단계를 포함하고, 상기 부호화 심도별 부호화 단위는, 상기 최대 부호화 단위의 적어도 하나의 분할 영역마다, 계층적 구조의 심도별 부호화 단위들의 심도들 중 하나로 결정된다.According to an embodiment of the present invention, there is provided a video decoding method including: receiving and parsing a bitstream of encoded video; Extracting video data encoded for each largest coding unit and information about a coded depth and an encoding mode for each maximum coding unit, from the bitstream; And motion compensation using a partition in which the coding unit is divided at an arbitrary ratio based on the information about the coded depth and the encoding mode for each maximum coding unit, for each of the at least one coding depth for each maximum coding unit. And performing decoding, wherein the coding units according to coding depths are determined as one of depths of coding units according to depths of a hierarchical structure, for each of at least one divided region of the maximum coding unit.

일 실시예에 따른 상기 추출 단계는, 상기 비트스트림으로부터 인터 예측을 위한 파티션 타입이 상기 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보를 더 추출할 수 있다.According to an embodiment, the extracting may further extract information indicating whether a partition type for inter prediction includes a partition that splits the coding unit at an arbitrary ratio from the bitstream.

일 실시예에 따른 상기 복호화 수행 단계는, 상기 비트스트림으로부터 상기 인터 예측을 위한 파티션 타입이 상기 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보에 기초하여, 상기 임의적 비율로 분할된 파티션을 이용하여 움직임 보상을 수행할지 여부를 선택적으로 결정할 수 있다.According to an embodiment of the present disclosure, the decoding may be performed based on information indicating whether a partition type for the inter prediction from the bitstream includes a partition that splits the coding unit at an arbitrary ratio. The partition may be used to selectively determine whether to perform motion compensation.

본 발명의 일 실시예에 따른 비디오 부호화 장치는, 비디오 데이터를 최대 크기의 부호화 단위인 적어도 하나의 최대 부호화 단위로 분할하는 최대 부호화 단위 분할부; 상기 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 인터 예측을 포함하여, 상기 최대 부호화 단위의 적어도 하나의 분할 영역 별로, 심도가 깊어짐에 따라 상위 심도의 부호화 단위가 분할되는 계층적 구조의 심도별 부호화 단위에 기초하여 상기 최대 부호화 단위의 비디오 데이터를 부호화하고, 부호화 결과가 출력될 심도인 부호화 심도를 결정하는 부호화부; 및 상기 최대 부호화 단위별로, 상기 분할 영역 별 부호화 심도에 대응하는 부호화된 비디오 데이터, 상기 부호화 심도 및 부호화 모드에 관한 정보를 포함하는 비트스트림을 출력하는 출력부를 포함한다According to an embodiment of the present invention, a video encoding apparatus includes: a maximum coding unit splitter configured to divide video data into at least one maximum coding unit that is a coding unit having a maximum size; Depth-based encoding of a hierarchical structure in which coding units having higher depths are divided as depths are deepened in at least one divided region of the largest coding unit, including inter prediction using partitions in which the coding units are divided at random ratios. An encoder which encodes video data of the maximum coding unit based on a unit, and determines an encoding depth that is a depth at which an encoding result is to be output; And an output unit for outputting a bitstream including coded video data corresponding to the coded depths of each divided region, the coded depth, and the coded mode, for each of the maximum coding units.

본 발명의 일 실시예에 따른 비디오 복호화 장치는, 부호화된 비디오에 대한 비트스트림을 수신하여 파싱하는 파싱부; 상기 비트스트림으로부터, 최대 부호화 단위별로 부호화된 비디오 데이터 및 상기 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보를 추출하는 추출부; 및 상기 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보에 기초하여, 상기 최대 부호화 단위별로 적어도 하나의 부호화 심도별 부호화 단위마다, 상기 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 움직임 보상을 포함하는 복호화를 수행하는 복호화부를 포함하고, 상기 부호화 심도별 부호화 단위는, 상기 최대 부호화 단위의 적어도 하나의 분할 영역마다, 계층적 구조의 심도별 부호화 단위들의 심도들 중 하나로 결정될 수 있다.According to an aspect of the present invention, there is provided a video decoding apparatus including: a parser for receiving and parsing a bitstream of encoded video; An extraction unit for extracting video data encoded for each largest coding unit and information about a coded depth and an encoding mode for each maximum coding unit, from the bitstream; And motion compensation using a partition in which the coding unit is divided at an arbitrary ratio based on the information about the coded depth and the encoding mode for each maximum coding unit, for each of the at least one coding depth for each maximum coding unit. And a decoding unit configured to perform decoding, wherein the coding units according to coding depths may be determined as one of depths of coding units according to depths of a hierarchical structure, for each of at least one divided region of the maximum coding unit.

본 발명은, 일 실시예에 따른 비디오 부호화 방법을 구현하기 위한 프로그램이 기록된 컴퓨터로 판독 가능한 기록매체를 포함한다. 또한 본 발명은, 일 실시예에 따른 비디오 복호화 방법을 구현하기 위한 프로그램이 기록된 컴퓨터로 판독 가능한 기록매체를 포함한다.The present invention includes a computer-readable recording medium having recorded thereon a program for implementing a video encoding method. The present invention also includes a computer-readable recording medium having recorded thereon a program for implementing a video decoding method.

도 1 은 본 발명의 일 실시예에 따른 비디오 부호화 장치의 블록도를 도시한다.
도 2 는 본 발명의 일 실시예에 따른 비디오 복호화 장치의 블록도를 도시한다.
도 3 은 본 발명의 일 실시예에 따른 부호화 단위의 개념을 도시한다.
도 4 는 본 발명의 일 실시예에 따른 부호화 단위에 기초한 영상 부호화부의 블록도를 도시한다.
도 5 는 본 발명의 일 실시예에 따른 부호화 단위에 기초한 영상 복호화부의 블록도를 도시한다.
도 6 는 본 발명의 일 실시예에 따른 심도별 부호화 단위 및 예측 단위를 도시한다.
도 7 은 본 발명의 일 실시예에 따른, 부호화 단위 및 변환 단위의 관계를 도시한다.
도 8 은 본 발명의 일 실시예에 따라, 심도별 부호화 정보들을 도시한다.
도 9 는 본 발명의 일 실시예에 따른 심도별 부호화 단위를 도시한다.
도 10a, 10b 및 10c는 본 발명의 일 실시예에 따른, 부호화 단위, 예측 단위 및 주파수 변환 단위의 관계를 도시한다.
도 11 은 본 발명의 일 실시예에 따른 부호화 단위별 부호화 정보를 도시한다.
도 12 는 본 발명의 일 실시예에 따른 비디오 부호화 방법의 흐름도를 도시한다.
도 13 은 본 발명의 일 실시예에 따른 비디오 복호화 방법의 흐름도를 도시한다.
도 14 는 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 따른 비디오 부호화 장치의 블록도를 도시한다.
도 15 는 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 의한 비디오 복호화 장치의 블록도를 도시한다.
도 16 은 일 실시예에 따라, 부호화 단위가 임의적 비율로 분할된 예시적 파티션들을 도시한다.
도 17 은 일 실시예에 따라, 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보가 포함된 시퀀스 파라미터 세트의 신택스를 도시한다.
도 18 은 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 따른 비디오 부호화 방법의 흐름도를 도시한다.
도 19 은 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 의한 비디오 복호화 방법의 흐름도를 도시한다.1 shows a block diagram of a video encoding apparatus according to an embodiment of the present invention.
2 shows a block diagram of a video decoding apparatus according to an embodiment of the present invention.
FIG. 3 illustrates a concept of an encoding unit according to an embodiment of the present invention.
4 is a block diagram of an image encoding unit based on an encoding unit according to an embodiment of the present invention.
5 is a block diagram of an image decoding unit based on an encoding unit according to an embodiment of the present invention.
FIG. 6 illustrates a depth-based coding unit and a prediction unit according to an embodiment of the present invention.
FIG. 7 shows a relationship between an encoding unit and a conversion unit according to an embodiment of the present invention.
FIG. 8 illustrates depth-specific encoding information, in accordance with an embodiment of the present invention.
FIG. 9 shows a depth encoding unit according to an embodiment of the present invention.
FIGS. 10A, 10B, and 10C illustrate the relationship between an encoding unit, a prediction unit, and a frequency conversion unit according to an embodiment of the present invention.
FIG. 11 shows encoding information for each encoding unit according to an embodiment of the present invention.
12 shows a flowchart of a video coding method according to an embodiment of the present invention.
13 shows a flowchart of a video decoding method according to an embodiment of the present invention.
14 is a block diagram of a video encoding apparatus according to inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.
FIG. 15 is a block diagram of an apparatus for video decoding by inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.
16 illustrates example partitions in which a coding unit is divided at an arbitrary ratio, according to an embodiment.
17 is a diagram illustrating syntax of a sequence parameter set including information indicating whether to include a partition that divides a coding unit at an arbitrary ratio, according to an embodiment.
18 is a flowchart of a video encoding method according to inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.
19 is a flowchart of a video decoding method by inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.

이하 도 1 내지 도 27을 참조하여 본 발명의 일 실시예에 따른 비디오 부호화 장치 및 비디오 복호화 장치, 비디오 부호화 방법 및 비디오 복호화 방법이 상술된다. 도 1 내지 도 13을 참조하여 본 발명의 일 실시예에 따라 공간적으로 계층적인 데이터 단위에 기반한 비디오의 부호화 및 비디오의 복호화가 후술되고, 이하 도 14 내지 19 를 참조하여, 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 따른 비디오 부호화 및 비디오 복호화가 후술된다. Hereinafter, a video encoding apparatus and a video decoding apparatus, a video encoding method, and a video decoding method according to an embodiment of the present invention will be described with reference to FIGS. 1 to 27. Coding of video and decoding of video based on spatially hierarchical data units according to an embodiment of the present invention will be described below with reference to FIGS. 1 through 13, and with reference to FIGS. 14 through 19, Video coding and video decoding according to inter prediction using a partitioned partition will be described below.

이하 도 1 내지 도 13을 참조하여 본 발명의 일 실시예에 따른 비디오 부호화 장치 및 비디오 복호화 장치, 비디오 부호화 방법 및 비디오 복호화 방법이 상술된다.Hereinafter, a video encoding apparatus, a video encoding apparatus, a video encoding method, and a video decoding method according to an embodiment of the present invention will be described with reference to FIGS. 1 to 13.

도 1 은 본 발명의 일 실시예에 따른 비디오 부호화 장치의 블록도를 도시한다.1 shows a block diagram of a video encoding apparatus according to an embodiment of the present invention.

일 실시예에 따른 비디오 부호화 장치(100)는 최대 부호화 단위 분할부(110), 부호화 심도 결정부(120) 및 출력부(130)를 포함한다.The video coding apparatus 100 according to an embodiment includes a maximum coding unit division unit 110, a coding depth determination unit 120, and an output unit 130.

최대 부호화 단위 분할부(110)는 영상의 현재 픽처를 위한 최대 크기의 부호화 단위인 최대 부호화 단위에 기반하여 현재 픽처를 구획할 수 있다. 현재 픽처가 최대 부호화 단위보다 크다면, 현재 픽처의 영상 데이터는 적어도 하나의 최대 부호화 단위로 분할될 수 있다. 영상 데이터는 적어도 하나의 최대 부호화 단위별로 부호화 심도 결정부(120)로 출력될 수 있다.The maximum coding unit division unit 110 may divide a current picture based on a maximum coding unit which is a coding unit of a maximum size for a current picture of an image. If the current picture is larger than the maximum encoding unit, the image data of the current picture may be divided into at least one maximum encoding unit. The image data may be output to the coding depth determination unit 120 for each of at least one maximum coding unit.

일 실시예에 따른 부호화 단위는 최대 크기 및 심도로 특징지어질 수 있다. 심도란 부호화 단위가 계층적으로 분할되는 단계를 나타내며, 심도가 깊어질수록 심도별 부호화 단위는 최대 부호화 단위로부터 최소 부호화 단위까지 분할될 수 있다. 최대 부호화 단위의 심도가 최상위 심도이며 최소 부호화 단위가 최하위 부호화 단위로 정의될 수 있다. 최대 부호화 단위는 심도가 깊어짐에 따라 심도별 부호화 단위의 크기는 감소하므로, 상위 심도의 부호화 단위는 복수 개의 하위 심도의 부호화 단위를 포함할 수 있다.The coding unit according to an embodiment may be characterized by a maximum size and depth. Depth indicates a stage in which coding units are hierarchically divided. As the depth increases, the depth coding units can be divided from the maximum coding unit to the minimum coding unit. The depth of the largest coding unit is the highest depth and the minimum coding unit may be defined as the lowest coding unit. As the maximum coding unit decreases as the depth increases, the size of the coding unit for each depth decreases, and thus, the coding unit of the higher depth may include coding units of a plurality of lower depths.

전술한 바와 같이 부호화 단위의 최대 크기에 따라, 현재 픽처의 영상 데이터를 최대 부호화 단위로 분할하며, 각각의 최대 부호화 단위는 심도별로 분할되는 부호화 단위들을 포함할 수 있다. 일 실시예에 따른 최대 부호화 단위는 심도별로 분할되므로, 최대 부호화 단위에 포함된 공간 영역(spatial domain)의 영상 데이터가 심도에 따라 계층적으로 분류될 수 있다. As described above, the image data of the current picture may be divided into maximum coding units according to the maximum size of the coding unit, and each maximum coding unit may include coding units divided by depths. Since the maximum coding unit is divided according to depths, image data of a spatial domain included in the maximum coding unit may be hierarchically classified according to depths.

최대 부호화 단위의 높이 및 너비를 계층적으로 분할할 수 있는 총 횟수를 제한하는 최대 심도 및 부호화 단위의 최대 크기가 미리 설정되어 있을 수 있다.The maximum depth and the maximum size of the coding unit that limit the total number of times of hierarchically dividing the height and the width of the maximum coding unit may be preset.

부호화 심도 결정부(120)는, 심도마다 최대 부호화 단위의 영역이 분할된 적어도 하나의 분할 영역을 부호화하여, 적어도 하나의 분할 영역 별로 최종 부호화 결과가 출력될 심도를 결정한다. 즉 부호화 심도 결정부(120)는, 현재 픽처의 최대 부호화 단위마다 심도별 부호화 단위로 영상 데이터를 부호화하여 가장 작은 부호화 오차가 발생하는 심도를 선택하여 부호화 심도로 결정한다. 결정된 부호화 심도 및 최대 부호화 단위별 영상 데이터는 출력부(130)로 출력된다.The coding depth determiner 120 encodes at least one divided area in which the area of the maximum coding unit is divided for each depth, and determines the depth at which the final coding result is output for each of at least one of the divided areas. That is, the coding depth determination unit 120 selects the depth at which the smallest coding error occurs, and determines the coding depth as the coding depth by coding the image data in units of depth coding for each maximum coding unit of the current picture. The determined coded depth and the image data for each maximum coding unit are output to the outputter 130.

최대 부호화 단위 내의 영상 데이터는 최대 심도 이하의 적어도 하나의 심도에 따라 심도별 부호화 단위에 기반하여 부호화되고, 각각의 심도별 부호화 단위에 기반한 부호화 결과가 비교된다. 심도별 부호화 단위의 부호화 오차의 비교 결과 부호화 오차가 가장 작은 심도가 선택될 수 있다. 각각의 최대화 부호화 단위마다 적어도 하나의 부호화 심도가 결정될 수 있다. Image data in the largest coding unit is encoded based on coding units according to depths according to at least one depth less than or equal to the maximum depth, and encoding results based on the coding units for each depth are compared. As a result of comparing the encoding error of the coding units according to depths, a depth having the smallest encoding error may be selected. At least one coding depth may be determined for each maximum coding unit.

최대 부호화 단위의 크기는 심도가 깊어짐에 따라 부호화 단위가 계층적으로 분할되어 분할되며 부호화 단위의 개수는 증가한다. 또한, 하나의 최대 부호화 단위에 포함되는 동일한 심도의 부호화 단위들이라 하더라도, 각각의 데이터에 대한 부호화 오차를 측정하고 하위 심도로의 분할 여부가 결정된다. 따라서, 하나의 최대 부호화 단위에 포함되는 데이터라 하더라도 위치에 따라 심도별 부호화 오차가 다르므로 위치에 따라 부호화 심도가 달리 결정될 수 있다. 따라서, 하나의 최대 부호화 단위에 대해 부호화 심도가 하나 이상 설정될 수 있으며, 최대 부호화 단위의 데이터는 하나 이상의 부호화 심도의 부호화 단위에 따라 구획될 수 있다.As the depth of the maximum coding unit increases, the coding unit is divided into hierarchically and the number of coding units increases. In addition, even in the case of coding units having the same depth included in one largest coding unit, a coding error of each data is measured, and whether or not division into a lower depth is determined. Therefore, even in the data included in one largest coding unit, since the encoding error for each depth is different according to the position, the coding depth may be differently determined according to the position. Accordingly, one or more coding depths may be set for one maximum coding unit, and data of the maximum coding unit may be partitioned according to coding units of one or more coding depths.

최대 부호화 단위의 예측 부호화 및 주파수 변환이 수행될 수 있다. 예측 부호화 및 주파수 변환도 마찬가지로, 최대 부호화 단위마다, 최대 심도 이하의 심도마다 심도별 부호화 단위를 기반으로 수행된다. The predictive encoding and frequency conversion of the maximum encoding unit can be performed. Likewise, predictive coding and frequency conversion are performed on the basis of the depth coding unit for each maximum coding unit and for each depth below the maximum depth.

최대 부호화 단위가 심도별로 분할될 때마다 심도별 부호화 단위의 개수가 증가하므로, 심도가 깊어짐에 따라 생성되는 모든 심도별 부호화 단위에 대해 예측 부호화 및 주파수 변환을 포함한 부호화가 수행되어야 한다. 이하 설명의 편의를 위해 적어도 하나의 최대 부호화 단위 중 현재 심도의 부호화 단위을 기반으로 예측 부호화 및 주파수 변환을 설명하겠다.Since the number of coding units per depth is increased every time the maximum coding unit is divided by the depth, the coding including the predictive coding and the frequency conversion should be performed for every depth coding unit as the depth increases. For convenience of explanation, predictive coding and frequency conversion will be described based on a coding unit of a current depth among at least one maximum coding unit.

일 실시예에 따른 비디오 부호화 장치(100)는, 영상 데이터의 부호화를 위한 데이터 단위의 크기 또는 형태를 다양하게 선택할 수 있다. 영상 데이터의 부호화를 위해서는 예측 부호화, 주파수 변환, 엔트로피 부호화 등의 단계를 거치는데, 모든 단계에 걸쳐서 동일한 데이터 단위가 사용될 수도 있으며, 단계별로 데이터 단위가 변경될 수도 있다.The video encoding apparatus 100 according to an exemplary embodiment may select various sizes or types of data units for encoding image data. To encode the image data, a step such as predictive encoding, frequency conversion, and entropy encoding is performed. The same data unit may be used for all steps, and the data unit may be changed step by step.

예를 들어 비디오 부호화 장치(100)는, 영상 데이터의 부호화를 위한 부호화 단위 뿐만 아니라, 부호화 단위의 영상 데이터의 예측 부호화를 수행하기 위해, 부호화 단위와 다른 데이터 단위를 선택할 수 있다. For example, the video coding apparatus 100 can select a coding unit and a data unit different from the coding unit in order to perform predictive coding of the video data of the coding unit as well as the coding unit for coding the video data.

최대 부호화 단위의 예측 부호화를 위해서는, 최대 부호화 단위의 심도별 부호화 단위의 부분적 데이터 단위를 기반으로 예측 부호화가 수행될 수 있다. 부호화 단위의 부분적 데이터 단위는, 부호화 단위 및 심도별 부호화 단위의 높이 및 너비 중 적어도 하나가 분할된 데이터 단위를 포함할 수 있다. For predictive coding of the maximum coding unit, predictive coding may be performed based on the partial data unit of the coding unit for each depth of the maximum coding unit. The partial data unit of the encoding unit may include a data unit in which at least one of the height and the width of the encoding unit and the depth encoding unit is divided.

예를 들어, 부호화 단위의 크기가 2Nx2N(단, N은 양의 정수)인 경우, 부분적 데이터 단위의 크기는 2Nx2N, 2NxN, Nx2N, NxN 등일 수 있다. 부호화 단위의 높이 또는 너비 중 적어도 하나를 반분하는 형태의 데이터 단위 이외에도 다양하게 분할한 형태의 데이터 단위를 기반으로 예측 부호화가 수행될 수도 있다. 이하, 예측 부호화의 기반이 되는 데이터 단위는 '예측 단위'라고 지칭될 수 있다.For example, when the size of the encoding unit is 2Nx2N (where N is a positive integer), the size of the partial data unit may be 2Nx2N, 2NxN, Nx2N, NxN, and the like. Prediction coding may be performed based on data units of various types, in addition to data units of a type in which at least one of the height and the width of an encoding unit is divided by half. Hereinafter, a data unit on which prediction encoding is based may be referred to as a 'prediction unit'.

부호화 단위의 예측 모드는, 인트라 모드, 인터 모드 및 스킵 모드 중 적어도 하나일 수 있다. 예를 들어 인트라 모드 및 인터 모드는, 2Nx2N, 2NxN, Nx2N, NxN 크기의 예측 단위에 대해서 수행될 수 있다. 또한, 스킵 모드는 2Nx2N 크기의 예측 단위에 대해서만 수행될 수 있다. 부호화 단위 이내의 하나의 예측 단위마다 독립적으로 부호화가 수행되어 부호화 오차가 가장 작은 예측 모드가 선택될 수 있다.The prediction mode of the encoding unit may be at least one of an intra mode, an inter mode, and a skip mode. For example, the intra mode and the inter mode can be performed for prediction units of 2Nx2N, 2NxN, Nx2N, and NxN sizes. In addition, the skip mode can be performed only for a prediction unit of 2Nx2N size. The encoding may be performed independently for each prediction unit within the coding unit to select a prediction mode having the smallest encoding error.

또한, 일 실시예에 따른 비디오 부호화 장치(100)는, 영상 데이터의 부호화를 위한 부호화 단위 뿐만 아니라, 부호화 단위와 다른 데이터 단위를 기반으로 부호화 단위의 영상 데이터의 주파수 변환을 수행할 수 있다.In addition, the video encoding apparatus 100 according to an exemplary embodiment may perform frequency conversion of image data of an encoding unit based on not only an encoding unit for encoding image data but also a data unit different from the encoding unit.

부호화 단위의 주파수 변환을 위해서는, 부호화 단위보다 작거나 같은 크기의 데이터 단위를 기반으로 주파수 변환이 수행될 수 있다. 예를 들어, 주파수 변환을 위한 데이터 단위는, 인트라 모드를 위한 데이터 단위 및 인터 모드를 위한 데이터 단위를 포함할 수 있다. 이하, 주파수 변환의 기반이 되는 데이터 단위는 '변환 단위'라고 지칭될 수 있다.For frequency conversion of a coding unit, frequency conversion may be performed based on a data unit having a size smaller than or equal to the coding unit. For example, a data unit for frequency conversion may include a data unit for intra mode and a data unit for inter mode. Hereinafter, the data unit on which the frequency conversion is based may be referred to as a 'conversion unit'.

부호화 심도별 부호화 정보는, 부호화 심도 뿐만 아니라 예측 관련 정보 및 주파수 변환 관련 정보가 필요하다. 따라서, 부호화 심도 결정부(120)는 최소 부호화 오차를 발생시킨 부호화 심도 뿐만 아니라, 부호화 심도의 부호화 단위를 예측 단위로 분할한 파티션 타입, 예측 단위별 예측 모드, 주파수 변환을 위한 변환 단위의 크기 등을 결정할 수 있다.The encoded information for each coded depth requires not only the coded depth but also prediction related information and frequency transform related information. Therefore, the coding depth determiner 120 not only determines the coding depth at which the minimum coding error is generated, but also the partition type in which the coding unit of the coding depth is divided into prediction units, the prediction unit-specific prediction mode, Can be determined.

부호화 심도 결정부(120)는 심도별 부호화 단위의 부호화 오차를 라그랑지 곱(Lagrangian Multiplier) 기반의 율-왜곡 최적화 기법(Rate-Distortion Optimization)을 이용하여 측정할 수 있다.The coding depth determiner 120 may measure encoding errors of coding units according to depths using a rate-distortion optimization technique based on a Lagrangian multiplier.

출력부(130)는, 부호화 심도 결정부(120)에서 결정된 적어도 하나의 부호화 심도에 기초하여 부호화된 최대 부호화 단위의 영상 데이터및 심도별 부호화 모드에 관한 정보를 비트스트림 형태로 출력한다. The output unit 130 outputs, in the form of a bit stream, video data of the maximum encoding unit encoded based on at least one encoding depth determined by the encoding depth determination unit 120 and information on the depth encoding mode.

부호화된 비디오 데이터는 영상의 레지듀얼 데이터의 부호화 결과일 수 있다.The encoded video data may be a result of encoding residual data of the video.

심도별 부호화 모드에 관한 정보는, 부호화 심도 정보, 부호화 심도의 부호화 단위의 예측 단위의 파티션 타입 정보, 예측 단위별 예측 모드 정보, 변환 단위의 크기 정보 등을 포함할 수 있다.The information on the depth-dependent coding mode may include coding depth information, partition type information of a prediction unit of a coding unit of coding depth, prediction mode information per prediction unit, size information of a conversion unit, and the like.

부호화 심도 정보는, 현재 심도로 부호화하지 않고 하위 심도의 부호화 단위로 부호화할지 여부를 나타내는 심도별 분할 정보를 이용하여 정의될 수 있다. 현재 부호화 단위의 현재 심도가 부호화 심도라면, 현재 부호화 단위는 현재 심도의 부호화 단위로 부호화되므로 현재 심도의 분할 정보는 더 이상 하위 심도로 분할되지 않도록 정의될 수 있다. 반대로, 현재 부호화 단위의 현재 심도가 부호화 심도가 아니라면 하위 심도의 부호화 단위를 이용한 부호화를 시도해보아야 하므로, 현재 심도의 분할 정보는 하위 심도의 부호화 단위로 분할되도록 정의될 수 있다.The coded depth information may be defined using depth-specific segmentation information indicating whether to encode to a coding unit of a lower depth without encoding to the current depth. If the current depth of the current coding unit is a coding depth, since the current coding unit is encoded in a coding unit of the current depth, split information of the current depth may be defined so that it is no longer divided into lower depths. On the contrary, if the current depth of the current coding unit is not the coding depth, encoding should be attempted using the coding unit of the lower depth, and thus split information of the current depth may be defined to be divided into coding units of the lower depth.

현재 심도가 부호화 심도가 아니라면, 하위 심도의 부호화 단위로 분할된 부호화 단위에 대해 부호화가 수행된다. 현재 심도의 부호화 단위 내에 하위 심도의 부호화 단위가 하나 이상 존재하므로, 각각의 하위 심도의 부호화 단위마다 반복적으로 부호화가 수행되어, 동일한 심도의 부호화 단위마다 재귀적(recursive) 부호화가 수행될 수 있다.If the current depth is not the coded depth, encoding is performed on the coding unit divided into the coding units of the lower depth. Since at least one coding unit of a lower depth exists in the coding unit of the current depth, encoding may be repeatedly performed for each coding unit of each lower depth, and recursive coding may be performed for each coding unit of the same depth.

하나의 최대 부호화 단위 안에 적어도 하나의 부호화 심도가 결정되며 부호화 심도마다 적어도 하나의 부호화 모드에 관한 정보가 결정되어야 하므로, 하나의 최대 부호화 단위에 대해서는 적어도 하나의 부호화 모드에 관한 정보가 결정될 수 있다. 또한, 최대 부호화 단위의 데이터는 심도에 따라 계층적으로 구획되어 위치 별로 부호화 심도가 다를 수 있으므로, 데이터에 대해 부호화 심도 및 부호화 모드에 관한 정보가 설정될 수 있다.Since at least one coding depth is determined in one maximum coding unit and information about at least one coding mode should be determined for each coding depth, information about at least one coding mode may be determined for one maximum coding unit. Since the data of the maximum encoding unit is hierarchically divided according to the depth and the depth of encoding may be different for each position, information on the encoding depth and the encoding mode may be set for the data.

따라서, 일 실시예에 따른 출력부(130)는, 최대 부호화 단위에 포함되어 있는 최소 부호화 단위마다 해당 부호화 정보를 설정할 수 있다. 즉, 부호화 심도의 부호화 단위는 동일한 부호화 정보를 보유하고 있는 최소 부호화 단위를 하나 이상 포함하고 있다. 이를 이용하여, 인근 최소 부호화 단위들이 동일한 심도별 부호화 정보를 갖고 있다면, 동일한 최대 부호화 단위에 포함되는 최소 부호화 단위일 수 있다.Accordingly, the output unit 130 according to the embodiment can set the corresponding encoding information for each minimum encoding unit included in the maximum encoding unit. That is, the coding unit of the coding depth includes at least one minimum coding unit that holds the same coding information. By using this, if the neighboring minimum encoding units have the same depth encoding information, it can be the minimum encoding unit included in the same maximum encoding unit.

예를 들어 출력부(130)를 통해 출력되는 부호화 정보는, 심도별 부호화 단위별 부호화 정보와 예측 단위별 부호화 정보로 분류될 수 있다. 심도별 부호화 단위별 부호하 정보는, 예측 모드 정보, 파티션 크기 정보를 포함할 수 있다. 예측 단위별로 전송되는 부호화 정보는 인터 모드의 추정 방향에 관한 정보, 인터 모드의 참조 영상 인덱스에 관한 정보, 움직임 벡터에 관한 정보, 인트라 모드의 크로마 성분에 관한 정보, 인트라 모드의 보간 방식에 관한 정보 등을 포함할 수 있다. 또한, 픽처, 슬라이스 또는 GOP별로 정의되는 부호화 단위의 최대 크기에 관한 정보 및 최대 심도에 관한 정보는 비트스트림의 헤더에 삽입될 수 있다.For example, the encoding information output through the output unit 130 may be classified into encoding information per depth unit and encoding information per prediction unit. The under-coding-by-depth coding unit information may include prediction mode information and partition size information. The encoding information to be transmitted for each prediction unit includes information about the estimation direction of the inter mode, information about the reference picture index of the inter mode, information on the motion vector, information on the chroma component of the intra mode, information on the interpolation mode of the intra mode And the like. Information on the maximum size of a coding unit defined for each picture, slice or GOP, and information on the maximum depth can be inserted into the header of the bitstream.

비디오 부호화 장치(100)의 가장 간단한 형태의 실시예에 따르면, 심도별 부호화 단위는 한 계층 상위 심도의 부호화 단위의 높이 및 너비를 반분한 크기의 부호화 단위이다. 즉, 현재 심도의 부호화 단위의 크기가 2Nx2N이라면, 하위 심도의 부호화 단위의 크기는 NxN 이다. 또한, 2Nx2N 크기의 현재 부호화 단위는 NxN 크기의 하위 심도 부호화 단위를 최대 4개 포함할 수 있다.According to an embodiment of the simplest form of the video encoding apparatus 100, a coding unit according to depths is a coding unit having a size in which a height and a width of a coding unit of one layer higher depth are divided by half. That is, if the size of the coding unit of the current depth is 2Nx2N, the size of the coding unit of the lower depth is NxN. In addition, the current coding unit having a size of 2N × 2N may include up to four lower depth coding units having a size of N × N.

따라서, 일 실시예에 따른 비디오 복호화 장치(100)는 현재 픽처의 특성을 고려하여 결정된 최대 부호화 단위의 크기 및 최대 심도를 기반으로, 각각의 최대 부호화 단위마다 최적의 형태 및 크기의 부호화 단위를 결정할 수 있다. 또한, 각각의 최대 부호화 단위마다 다양한 예측 모드, 주파수 변환 방식 등으로 부호화할 수 있으므로, 다양한 영상 크기의 부호화 단위의 영상 특성을 고려하여 최적의 부호화 모드가 결정될 수 있다.Therefore, the video decoding apparatus 100 according to an embodiment determines an encoding unit of an optimal shape and size for each maximum encoding unit based on the size and the maximum depth of the maximum encoding unit determined in consideration of the characteristics of the current picture . In addition, since each encoding unit can be encoded by various prediction modes, frequency conversion methods, and the like, an optimal encoding mode can be determined in consideration of image characteristics of encoding units of various image sizes.

따라서, 영상의 해상도가 매우 높거나 데이터량이 매우 큰 영상을 기존 매크로블록 단위로 부호화한다면, 픽처당 매크로블록의 수가 과도하게 많아진다. 이에 따라, 매크로블록마다 생성되는 압축 정보도 많아지므로 압축 정보의 전송 부담이 커지고 데이터 압축 효율이 감소하는 경향이 있다. 따라서, 일 실시예에 따른 비디오 부호화 장치는, 영상의 크기를 고려하여 부호화 단위의 최대 크기를 증가시키면서, 영상 특성을 고려하여 부호화 단위를 조절할 수 있으므로, 영상 압축 효율이 증대될 수 있다.Therefore, if an image having a very high image resolution or a very large data amount is encoded in units of existing macroblocks, the number of macroblocks per picture becomes excessively large. This increases the amount of compression information generated for each macroblock, so that the burden of transmission of compressed information increases and the data compression efficiency tends to decrease. Therefore, the video encoding apparatus according to an embodiment can increase the maximum size of the encoding unit in consideration of the image size, and adjust the encoding unit in consideration of the image characteristic, so that the image compression efficiency can be increased.

도 2 는 본 발명의 일 실시예에 따른 비디오 복호화 장치의 블록도를 도시한다.2 shows a block diagram of a video decoding apparatus according to an embodiment of the present invention.

일 실시예에 따른 비디오 복호화 장치(200)는 수신부(210), 영상 데이터 및 부호화 정보 추출부(220) 및 영상 데이터 복호화부(230)를 포함한다. 일 실시예에 따른 비디오 복호화 장치(200)의 각종 프로세싱을 위한 부호화 단위, 심도, 예측 단위, 변환 단위, 각종 부호화 모드에 관한 정보 등 각종 용어의 정의는, 도 1 및 비디오 부호화 장치(100)을 참조하여 전술한 바와 동일하다. The video decoding apparatus 200 includes a receiving unit 210, an image data and encoding information extracting unit 220, and an image data decoding unit 230. The definition of various terms such as coding unit, depth, prediction unit, conversion unit, and information on various coding modes for various processing of the video decoding apparatus 200 according to an embodiment is the same as that of FIG. 1 and the video coding apparatus 100 Are the same as described above.

수신부(205)는 부호화된 비디오에 대한 비트스트림을 수신하여 파싱(parsing)한다. 영상 데이터 및 부호화 정보 추출부(220)는 파싱된 비트스트림으로부터 최대 부호화 단위별로 영상 데이터를 추출하여 영상 데이터 복호화부(230)로 출력한다. 영상 데이터 및 부호화 정보 추출부(220)는 현재 픽처에 대한 헤더로부터 현재 픽처의 부호화 단위의 최대 크기에 관한 정보를 추출할 수 있다. The receiving unit 205 receives and parses the bitstream of the encoded video. The image data and encoding information extracting unit 220 extracts image data for each maximum encoding unit from the parsed bit stream and outputs the extracted image data to the image data decoding unit 230. The image data and encoding information extracting unit 220 can extract information on the maximum size of the encoding unit of the current picture from the header of the current picture.

또한, 영상 데이터 및 부호화 정보 추출부(220)는 파싱된 비트스트림으로부터 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보를 추출한다. 추출된 부호화 심도 및 부호화 모드에 관한 정보는 영상 데이터 복호화부(230)로 출력된다. 즉, 비트열의 영상 데이터를 최대 부호화 단위로 분할하여, 영상 데이터 복호화부(230)가 최대 부호화 단위마다 영상 데이터를 복호화하도록 할 수 있다. Also, the image data and encoding information extracting unit 220 extracts information on the encoding depth and the encoding mode for each maximum encoding unit from the parsed bitstream. The extracted information about the coded depth and the coding mode is output to the image data decoder 230. That is, the image data of the bit string may be divided into maximum coding units so that the image data decoder 230 may decode the image data for each maximum coding unit.

최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보는, 하나 이상의 부호화 심도 정보에 대해 설정될 수 있으며, 부호화 심도별 부호화 모드에 관한 정보는, 부호화 단위별 예측 단위의 파티션 타입 정보, 예측 모드 정보 및 변환 단위의 크기 정보 등을 포함할 수 있다. 또한, 부호화 심도 정보로서, 심도별 분할 정보가 추출될 수도 있다.Information on the coding depth and the coding mode per coding unit can be set for one or more coding depth information, and the information on the coding mode for each coding depth includes information on partition type information, prediction mode information, Size information of the conversion unit, and the like. In addition, as the encoding depth information, depth-based segmentation information may be extracted.

영상 데이터 및 부호화 정보 추출부(220)가 추출한 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보는, 일 실시예에 따른 비디오 부호화 장치(100)와 같이 부호화단에서, 최대 부호화 단위별 심도별 부호화 단위마다 반복적으로 부호화를 수행하여 최소 부호화 오차를 발생시키는 것으로 결정된 부호화 심도 및 부호화 모드에 관한 정보이다. 따라서, 비디오 복호화 장치(200)는 최소 부호화 오차를 발생시키는 부호화 방식에 따라 데이터를 복호화하여 영상을 복원할 수 있다.The information about the coded depth and the encoding mode according to the maximum coding units extracted by the image data and the encoding information extractor 220 may be encoded according to the depth according to the maximum coding unit, as in the video encoding apparatus 100 according to an embodiment. Information about a coded depth and an encoding mode determined to repeatedly perform encoding for each unit to generate a minimum encoding error. Therefore, the video decoding apparatus 200 may reconstruct an image by decoding data according to an encoding method that generates a minimum encoding error.

영상 데이터 및 부호화 정보 추출부(220)는 최소 부호화 단위별로 부호화 심도 및 부호화 모드에 관한 정보를 추출할 수 있다. 최소 부호화 단위별로, 해당 최대 부호화 단위의 부호화 심도 및 부호화 모드에 관한 정보가 기록되어 있다면, 동일한 부호화 심도 및 부호화 모드에 관한 정보를 갖고 있는 최소 부호화 단위들은 동일한 최대 부호화 단위에 포함되는 데이터 단위로 유추될 수 있다. 즉, 동일한 정보의 최소 부호화 단위를 모아 복호화하면, 부호화 오차가 가장 작은 부호화 심도의 부호화 단위를 기반으로 한 복호화가 가능하다.The image data and encoding information extracting unit 220 can extract information on the encoding depth and the encoding mode for each minimum encoding unit. If information on the coding depth and the coding mode of the corresponding maximum coding unit is recorded for each minimum coding unit, the minimum coding units having the same coding depth and information on the coding mode are estimated as data units included in the same maximum coding unit . That is, when the minimum coding unit of the same information is collected and decoded, decoding based on the coding unit of the coded depth having the smallest coding error is possible.

영상 데이터 복호화부(230)는 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보에 기초하여 각각의 최대 부호화 단위의 영상 데이터를 복호화하여 현재 픽처를 복원한다. 최대 부호화 단위별 부호화 심도 정보에 기초하여, 영상 데이터 복호화부(230)는 적어도 하나의 부호화 심도의 부호화 단위마다 영상 데이터를 복호화할 수 있다. 복호화 과정은 인트라 예측 및 움직임 보상을 포함하는 예측 과정, 및 주파수 역변환 과정을 포함할 수 있다.The image data decoder 230 reconstructs the current picture by decoding image data of each maximum coding unit based on the information about the coded depth and the encoding mode for each maximum coding unit. Based on the coded depth information for each maximum coding unit, the image data decoder 230 may decode the image data for each coding unit of at least one coded depth. The decoding process may include a prediction process including intra prediction and motion compensation, and an inverse frequency conversion process.

영상 데이터 복호화부(230)는, 부호화 단위별 예측 부호화를 위해, 부호화 심도별 부호화 단위의 예측 단위의 파티션 타입 정보 및 예측 모드 정보에 기초하여, 부호화 단위마다 각각의 예측 단위 및 예측 모드로 인트라 예측 또는 움직임 보상을 수행할 수 있다.The image data decoding unit 230 decodes each of the prediction units and prediction modes for each coding unit on the basis of the partition type information and the prediction mode information of the prediction unit of the coding unit for each coding depth for each coding unit, Or motion compensation.

또한, 영상 데이터 복호화부(230)는, 최대 부호화 단위별 주파수 역변환을 위해, 부호화 심도별 부호화 단위의 변환 단위의 크기 정보에 기초하여, 부호화 단위마다 각각의 변환 단위로 주파수 역변환을 수행할 수 있다.In addition, the image data decoding unit 230 may perform frequency inverse transform for each encoding unit on the basis of the size information of the conversion unit of each encoding depth-based encoding unit for frequency inverse conversion for each maximum encoding unit .

영상 데이터 복호화부(230)는 심도별 분할 정보를 이용하는 현재 최대 부호화 단위의 부호화 심도를 결정할 수 있다. 만약, 분할 정보가 현재 심도로 복호화할 것을 나타내고 있다면 현재 심도가 부호화 심도이다. 따라서, 영상 데이터 복호화부(230)는 현재 최대 부호화 단위의 영상 데이터에 대해 현재 심도의 부호화 단위를 예측 단위의 파티션 타입, 예측 모드 및 변환 단위 크기 정보를 이용하여 복호화할 수 있다. The image data decoding unit 230 can determine the coding depth of the current maximum encoding unit using the division information by depth. If the partition information indicates that the current depth is to be decoded, the current depth is the depth of the encoding. Therefore, the image data decoder 230 may decode the coding unit of the current depth using the partition type, the prediction mode, and the transformation unit size information of the prediction unit with respect to the image data of the current maximum coding unit.

즉, 최소 부호화 단위에 대해 설정되어 있는 부호화 정보를 관찰하여, 동일한 분할 정보를 포함한 부호화 정보를 보유하고 있는 최소 부호화 단위를 모아, 하나의 데이터 단위로 복호화할 수 있다. That is, it is possible to observe the encoding information set for the minimum encoding unit and to decode the minimum encoding units that hold the encoding information including the same division information, into one data unit.

일 실시예에 따른 비디오 복호화 장치(200)는, 부호화 과정에서 최대 부호화 단위마다 재귀적으로 부호화를 수행하여 최소 부호화 오차를 발생시킨 부호화 단위에 대한 정보를 획득하여, 현재 픽처에 대한 복호화에 이용할 수 있다. 즉, 최대 부호화 단위마다 최적 부호화 단위로 영상 데이터의 복호화가 가능해진다.The video decoding apparatus 200 according to an exemplary embodiment recursively performs encoding for each maximum encoding unit in the encoding process to obtain information on an encoding unit that has generated the minimum encoding error and can use the encoded information for decoding the current picture have. That is, it is possible to decode video data in the optimal encoding unit for each maximum encoding unit.

따라서, 높은 해상도의 영상 또는 데이터량이 과도하게 많은 영상이라도 부호화단으로부터 전송된 최적 부호화 모드에 관한 정보를 이용하여, 영상의 특성에 적응적으로 결정된 부호화 단위의 크기 및 부호화 모드에 따라 효율적으로 영상 데이터를 복호화하여 복원할 수 있다.Accordingly, even if an image with a high resolution or an excessively large amount of data is used, the information on the optimal encoding mode transmitted from the encoding end is used, and the image data is efficiently encoded according to the encoding unit size and encoding mode, Can be decoded and restored.

도 3 은 계층적 부호화 단위의 개념을 도시한다.FIG. 3 shows the concept of a hierarchical coding unit.

부호화 단위의 예는, 너비x높이가 64x64인 부호화 단위부터, 32x32, 16x16, 8x8, 및 4x4를 포함할 수 있다. 정사각형 형태의 부호화 단위 이외에도, 너비x높이가 64x32, 32x64, 32x16, 16x32, 16x8, 8x16, 8x4, 4x8인 부호화 단위들이 존재할 수 있다.An example of an encoding unit may include 32x32, 16x16, 8x8, and 4x4 from an encoding unit with a width x height of 64x64. In addition to the square coding units, there may be coding units having a width x height of 64x32, 32x64, 32x16, 16x32, 16x8, 8x16, 8x4, and 4x8.

비디오 데이터(310)에 대해서는, 해상도는 1920x1080, 부호화 단위의 최대 크기는 64, 최대 심도가 2로 설정되어 있다. 비디오 데이터(320)에 대해서는, 해상도는 1920x1080, 부호화 단위의 최대 크기는 64, 최대 심도가 4로 설정되어 있다. 비디오 데이터(330)에 대해서는, 해상도는 352x288, 부호화 단위의 최대 크기는 16, 최대 심도가 2로 설정되어 있다.With respect to the video data 310, the resolution is set to 1920 x 1080, the maximum size of the encoding unit is set to 64, and the maximum depth is set to 2. As for the video data 320, the resolution is set to 1920x1080, the maximum size of the coding unit is 64, and the maximum depth is 4. With respect to the video data 330, the resolution is set to 352 x 288, the maximum size of the encoding unit is set to 16, and the maximum depth is set to 2.

해상도가 높거나 데이터량이 많은 경우 부호화 효율의 향상 뿐만 아니라 영상 특성을 정확히 반형하기 위해 부호화 사이즈의 최대 크기가 상대적으로 큰 것이 바람직하다. 따라서, 비디오 데이터(330)에 비해, 해상도가 높은 비디오 데이터(310, 320)는 부호화 사이즈의 최대 크기가 64로 선택될 수 있다.It is preferable that the maximum size of the coding size is relatively large in order to improve the coding efficiency as well as to accurately characterize the image characteristics when the resolution or the data amount is large. Accordingly, the video data 310 or 320 having a higher resolution than the video data 330 may be selected to have a maximum size of 64.

최대 심도는 계층적 부호화 단위에서 총 계층수를 나타낸다. 따라서, 비디오 데이터(310)의 최대 심도는 2이므로, 비디오 데이터(310)의 부호화 단위(315)는 장축 크기가 64인 최대 부호화 단위로부터, 심도가 두 계층 깊어져서 장축 크기가 32, 16인 부호화 단위들까지 포함할 수 있다. 반면, 비디오 데이터(330)의 최대 심도는 2이므로, 비디오 데이터(330)의 부호화 단위(335)는 장축 크기가 16인 부호화 단위들로부터, 심도가 두 계층 깊어져서 장축 크기가 8, 4인 부호화 단위들까지 포함할 수 있다. The maximum depth indicates the total number of layers in the hierarchical encoding unit. Therefore, since the maximum depth of the video data 310 is 2, the encoding unit 315 of the video data 310 is encoded from a maximum encoding unit having a major axis size of 64, Units. On the other hand, since the maximum depth of the video data 330 is 2, the coding unit 335 of the video data 330 has two long depths of encoding from the coding units having the long axis size of 16, so that the long axis sizes are 8 and 4 It can include up to units.

비디오 데이터(320)의 최대 심도는 4이므로, 비디오 데이터(320)의 부호화 단위(325)는 장축 크기가 64인 최대 부호화 단위로부터, 심도가 네 계층 깊어져서 장축 크기가 32, 16, 8, 4인 부호화 단위들까지 포함할 수 있다. 심도가 깊어질수록 세부 정보의 표현능력이 향상될 수 있다.Since the maximum depth of the video data 320 is 4, the encoding unit 325 of the video data 320 has a depth of four layers from 32 to 16, 8, and 4 Encoding units. As the depth increases, the expressive power of the detailed information may be improved.

도 4 는 본 발명의 일 실시예에 따른 부호화 단위에 기초한 영상 부호화부의 블록도를 도시한다.4 is a block diagram of an image encoder based on coding units, according to an embodiment of the present invention.

일 실시예에 따른 영상 부호화부(400)는, 비디오 부호화 장치(100)의 부호화 심도 결정부(120)에서 영상 데이터를 부호화하는데 거치는 작업들을 포함한다. 즉, 인트라 예측부(410)는 현재 프레임(405) 중 인트라 모드의 부호화 단위에 대해 인트라 예측을 수행하고, 움직임 추정부(420) 및 움직임 보상부(425)는 인터 모드의 현재 프레임(405) 및 참조 프레임(495)를 이용하여 인터 추정 및 움직임 보상을 수행한다.The image encoding unit 400 according to an exemplary embodiment includes operations to encode image data in the encoding depth determination unit 120 of the video encoding apparatus 100. That is, the intraprediction unit 410 performs intraprediction on the intra-mode encoding unit of the current frame 405, and the motion estimation unit 420 and the motion compensation unit 425 perform intraprediction on the current frame 405 of the inter- And a reference frame 495. The inter-frame estimation and the motion compensation are performed using the reference frame and the reference frame.

인트라 예측부(410), 움직임 추정부(420) 및 움직임 보상부(425)로부터 출력된 데이터는 주파수 변환부(430) 및 양자화부(440)를 거쳐 양자화된 변환 계수로 출력된다. 양자화된 변환 계수는 역양자화부(460), 주파수 역변환부(470)을 통해 공간 영역의 데이터로 복원되고, 복원된 공간 영역의 데이터는 디블로킹부(480) 및 루프 필터링부(490)를 거쳐 후처리되어 참조 프레임(495)으로 출력된다. 양자화된 변환 계수는 엔트로피 부호화부(450)를 거쳐 비트스트림(455)으로 출력될 수 있다.The data output from the intraprediction unit 410, the motion estimation unit 420 and the motion compensation unit 425 is output as a quantized transform coefficient through the frequency transform unit 430 and the quantization unit 440. The quantized transform coefficients are reconstructed into spatial domain data through the inverse quantization unit 460 and the frequency inverse transform unit 470 and the data of the reconstructed spatial domain is passed through the deblocking unit 480 and the loop filtering unit 490 Processed and output to the reference frame 495. [ The quantized transform coefficients may be output to the bitstream 455 via the entropy encoder 450.

일 실시예에 따른 비디오 부호화 장치(100)에 적용되기 위해서는, 영상 부호화부(400)의 구성 요소들인 인트라 예측부(410), 움직임 추정부(420), 움직임 보상부(425), 주파수 변환부(430), 양자화부(440), 엔트로피 부호화부(450), 역양자화부(460), 주파수 역변환부(470), 디블로킹부(480) 및 루프 필터링부(490)가 모두, 최대 부호화 단위마다 최대 심도를 고려한 심도별 부호화 단위에 기반하여 작업을 수행하여야 한다. The motion estimation unit 420, the motion compensation unit 425, and the frequency transformation unit 420, which are components of the image encoding unit 400, are applied to the video encoding apparatus 100 according to an embodiment of the present invention. The quantization unit 440, the entropy encoding unit 450, the inverse quantization unit 460, the frequency inverse transform unit 470, the deblocking unit 480, and the loop filtering unit 490, The work should be performed based on the depth encoding unit considering the maximum depth.

특히, 인트라 예측부(410), 움직임 추정부(420) 및 움직임 보상부(425)는 부호화 단위의 최대 크기 및 심도를 고려하여 부호화 단위 내의 예측 단위 및 예측 모드를 결정하며, 주파수 변환부(430)는 부호화 단위의 최대 크기 및 심도를 고려하여 변환 단위의 크기를 고려하여야 한다.In particular, the intra prediction unit 410, the motion estimation unit 420, and the motion compensation unit 425 determine the prediction unit and the prediction mode in the coding unit in consideration of the maximum size and depth of the coding unit, and the frequency conversion unit 430 ) Should consider the size of the conversion unit considering the maximum size and depth of the encoding unit.

도 5 는 본 발명의 일 실시예에 따른 부호화 단위에 기초한 영상 복호화부의 블록도를 도시한다.5 is a block diagram of an image decoding unit based on an encoding unit according to an embodiment of the present invention.

비트스트림(505)이 파싱부(510)를 거쳐 복호화 대상인 부호화된 비디오 데이터 및 복호화를 위해 필요한 부호화에 관한 정보가 파싱된다. 부호화된 비디오 데이터는 엔트로피 복호화부(520) 및 역양자화부(530)를 거쳐 역양자화된 데이터로 출력되고, 주파수 역변환부(540)를 거쳐 공간 영역의 영상 데이터가 복원된다. The bit stream 505 passes through the parsing unit 510 and the encoded video data to be decoded and the encoding-related information necessary for decoding are parsed. The encoded video data is output as inverse quantized data through the entropy decoding unit 520 and the inverse quantization unit 530, and the image data in the spatial domain is restored through the frequency inverse transform unit 540.

공간 영역의 영상 데이터에 대해서, 인트라 예측부(550)는 인트라 모드의 부호화 단위에 대해 인트라 예측을 수행하고, 움직임 보상부(560)는 참조 프레임(585)를 함께 이용하여 인터 모드의 부호화 단위에 대해 움직임 보상을 수행한다.For the image data of the spatial domain, the intra prediction unit 550 performs intra prediction on the coding unit of the intra mode, and the motion compensator 560 uses the reference frame 585 together to apply the coding unit of the inter mode. Perform motion compensation for the

인트라 예측부(550) 및 움직임 보상부(560)를 거친 공간 영역의 데이터는 디블로킹부(570) 및 루프 필터링부(580)를 거쳐 후처리되어 복원 프레임(595)으로 출력될 수 있다. 또한, 디블로킹부(570) 및 루프 필터링부(580)를 거쳐 후처리된 데이터는 참조 프레임(585)으로서 출력될 수 있다.The data in the spatial domain that has passed through the intra prediction unit 550 and the motion compensation unit 560 may be post-processed through the deblocking unit 570 and the loop filtering unit 580 and output to the reconstruction frame 595. Further, the post-processed data via deblocking unit 570 and loop filtering unit 580 may be output as reference frame 585.

비디오 복호화 장치(200)의 영상 데이터 복호화부(230)에서 영상 데이터를 복호화하기 위해, 일 실시예에 따른 영상 복호화부(500)의 파싱부(510) 이후의 단계별 작업들이 수행될 수 있다.In order to decode the image data in the image data decoder 230 of the video decoding apparatus 200, step-by-step operations after the parser 510 of the image decoder 500 according to an embodiment may be performed.

일 실시예에 따른 비디오 복호화 장치(200)에 적용되기 위해서는, 영상 복호화부(500)의 구성 요소들인 파싱부(510), 엔트로피 복호화부(520), 역양자화부(530), 주파수 역변환부(540), 인트라 예측부(550), 움직임 보상부(560), 디블로킹부(570) 및 루프 필터링부(580)가 모두, 최대 부호화 단위마다 부호화 심도의 부호화 단위에 기반하여 작업을 수행하여야 한다. The entropy decoding unit 520, the inverse quantization unit 530, and the frequency inverse transforming unit 520, which are the components of the video decoding unit 500, in order to be applied to the video decoding apparatus 200 according to one embodiment. The intraprediction unit 550, the motion compensation unit 560, the deblocking unit 570 and the loop filtering unit 580 all have to perform an operation based on the encoding unit of the encoding depth for each maximum encoding unit .

특히, 인트라 예측부(550), 움직임 보상부(560)는 부호화 단위의 최대 크기 및 심도를 고려하여 부호화 단위 및 예측 모드를 결정하며, 주파수 역변환부(540)는 부호화 단위의 최대 크기 및 심도를 고려하여 변환 단위의 크기를 고려하여야 한다.In particular, the intra predictor 550 and the motion compensator 560 determine a coding unit and a prediction mode in consideration of the maximum size and depth of the coding unit, and the frequency inverse transformer 540 determines the maximum size and depth of the coding unit. Consideration should be given to the size of the transformation unit.

도 6 는 본 발명의 일 실시예에 따른 심도별 부호화 단위 및 예측 단위를 도시한다.FIG. 6 illustrates a depth-based coding unit and a prediction unit according to an embodiment of the present invention.

일 실시예에 따른 비디오 부호화 장치(100) 및 일 실시예에 따른 비디오 복호화 장치(200)는 영상 특성을 고려하기 위해 계층적인 부호화 단위를 사용한다. 부호화 단위의 최대 높이 및 너비, 최대 심도는 영상의 특성에 따라 적응적으로 결정될 수도 있으며, 사용자의 요구에 따라 다양하게 설정될 수도 있다. 미리 설정된 부호화 단위의 최대 크기에 따라, 심도별 부호화 단위의 크기가 결정될 수 있다.The video encoding apparatus 100 according to an embodiment and the video decoding apparatus 200 according to an embodiment use hierarchical coding units to consider image characteristics. The maximum height, width, and maximum depth of the coding unit may be adaptively determined according to the characteristics of the image, and may be variously set according to a user's request. According to the maximum size of the preset coding unit, the size of the coding unit for each depth may be determined.

일 실시예에 따른 부호화 단위의 계층 구조(600)는 부호화 단위의 최대 높이 및 너비가 64이며, 최대 심도가 4인 경우를 도시하고 있다. 일 실시예에 따른 부호화 단위의 계층 구조(600)의 세로축을 따라서 심도가 깊어지므로 심도별 부호화 단위의 높이 및 너비가 각각 분할한다. 또한, 부호화 단위의 계층 구조(600)의 가로축을 따라, 각각의 심도별 부호화 단위의 예측 부호화의 기반이 되는 부분적 데이터 단위인 예측 단위가 도시되어 있다.The hierarchical structure 600 of the encoding unit according to an embodiment shows a case where the maximum height and width of the encoding unit is 64 and the maximum depth is 4. Since the depth deepens along the vertical axis of the hierarchical structure 600 of the coding unit according to an embodiment, the height and the width of the coding unit for each depth are divided. In addition, along the horizontal axis of the hierarchical structure 600 of the coding unit, a prediction unit which is a partial data unit on which prediction coding of each depth coding unit is based is shown.

즉, 부호화 단위(610)는 부호화 단위의 계층 구조(600) 중 최대 부호화 단위로서 심도가 0이며, 부호화 단위의 크기, 즉 높이 및 너비가 64x64이다. 세로축을 따라 심도가 깊어지며, 크기 32x32인 심도 1의 부호화 단위(620), 크기 16x16인 심도 2의 부호화 단위(630), 크기 8x8인 심도 3의 부호화 단위(640), 크기 4x4인 심도 4의 부호화 단위(650)가 존재한다. 크기 4x4인 심도 4의 부호화 단위(650)는 최소 부호화 단위이다.That is, the coding unit 610 has a depth of 0 as the largest coding unit of the hierarchical structure 600 of the coding unit, and the size, ie, the height and width, of the coding unit is 64x64. A depth-1 encoding unit 620 having a size of 32x32, a depth-2 encoding unit 620 having a size 16x16, a depth-3 encoding unit 640 having a size 8x8, a depth 4x4 having a size 4x4, There is an encoding unit 650. An encoding unit 650 of depth 4 of size 4x4 is the minimum encoding unit.

각각의 심도별로 가로축을 따라, 부호화 단위의 예측 단위로서, 부분적 데이터 단위들이 배열된다. 즉, 심도 0의 크기 64x64의 부호화 단위(610)의 예측 단위는, 크기 64x64의 부호화 단위(610)에 포함되는 크기 64x64의 부분적 데이터 단위(610), 크기 64x32의 부분적 데이터 단위들(612), 크기 32x64의 부분적 데이터 단위들(614), 크기 32x32의 부분적 데이터 단위들(616)일 수 있다. 반대로 보면, 부호화 단위는 변환 단위들(610, 612, 614, 616)을 포함하는 최소 크기의 정사각형의 데이터 단위일 수 있다.The partial data units are arranged as a prediction unit of the encoding unit along the horizontal axis for each depth. That is, the prediction unit of the encoding unit 610 having a size of 64x64 with a depth of 0 is a partial data unit 610 having a size of 64x64, partial data units 612 having a size of 64x32 included in the encoding unit 610 of size 64x64, Partial data units 614 of size 32x64, and partial data units 616 of size 32x32. Conversely, the encoding unit may be a minimum-sized square data unit including the conversion units 610, 612, 614, and 616.

마찬가지로, 심도 1의 크기 32x32의 부호화 단위(620)의 예측 단위는, 크기 32x32의 부호화 단위(620)에 포함되는 크기 32x32의 부분적 데이터 단위(620), 크기 32x16의 부분적 데이터 단위들(622), 크기 16x32의 부분적 데이터 단위들(624), 크기 16x16의 부분적 데이터 단위들(626)일 수 있다. Likewise, the prediction unit of the encoding unit 620 of the size 32x32 of the depth 1 is the partial data unit 620 of the size 32x32, the partial data units 622 of the size 32x16, and the partial data units 620 of the size 32x32 included in the encoding unit 620 of the size 32x32, Partial data units 624 of size 16x32, partial data units 626 of size 16x16.

마찬가지로, 심도 2의 크기 16x16의 부호화 단위(630)의 예측 단위는, 크기 16x16의 부호화 단위(630)에 포함되는 크기 16x16의 부분적 데이터 단위(630), 크기 16x8의 부분적 데이터 단위들(632), 크기 8x16의 부분적 데이터 단위들(634), 크기 8x8의 부분적 데이터 단위들(636)일 수 있다. Likewise, the prediction unit of a 16x16 size 16x16 encoding unit is a 16x16 partial data unit 630, a 16x8 partial data unit 632, and a 16x16 partial data unit 630 included in the 16x16 encoding unit 630, Partial data units 634 of size 8x16, and partial data units 636 of size 8x8.

마찬가지로, 심도 3의 크기 8x8의 부호화 단위(640)의 예측 단위는, 크기 8x8의 부호화 단위(640)에 포함되는 크기 8x8의 부분적 데이터 단위(640), 크기 8x4의 부분적 데이터 단위들(642), 크기 4x8의 부분적 데이터 단위들(644), 크기 4x4의 부분적 데이터 단위들(646)일 수 있다. Similarly, the prediction unit of the coding unit 640 of size 8x8 having a depth of 3 includes the partial data unit 640 of the size 8x8 included in the coding unit 640 of the size 8x8, the partial data units 642 of the size 8x4, It may be partial data units 644 of size 4x8 and partial data units 646 of size 4x4.

마지막으로, 심도 4의 크기 4x4의 부호화 단위(650)는 최소 부호화 단위이며 최하위 심도의 부호화 단위이고, 해당 예측 단위도 크기 4x4의 데이터 단위(650)이다.Finally, a coding unit 650 of size 4x4 is the minimum coding unit and the coding unit of the lowest depth, and the prediction unit is a data unit 650 of size 4x4.

일 실시예에 따른 비디오 부호화 장치(100)의 부호화 심도 결정부(120)는, 최대 부호화 단위(610)의 부호화 심도를 결정하기 위해, 최대 부호화 단위(610)에 포함되는 각각의 심도의 부호화 단위마다 부호화를 수행하여야 한다. The coding depth determiner 120 of the video coding apparatus 100 according to an exemplary embodiment of the present invention determines the coding depth of the maximum coding unit 610 by multiplying the coding unit of each depth included in the maximum coding unit 610 Encoding is performed.

동일한 범위 및 크기의 데이터를 포함하기 위한 심도별 부호화 단위의 개수는, 심도가 깊어질수록 심도별 부호화 단위의 개수도 증가한다. 예를 들어, 심도 1의 부호화 단위 한 개가 포함하는 데이터에 대해서, 심도 2의 부호화 단위는 네 개가 필요하다. 따라서, 동일한 데이터의 부호화 결과를 심도별로 비교하기 위해서, 한 개의 심도 1의 부호화 단위 및 네 개의 심도 2의 부호화 단위를 이용하여 각각 부호화되어야 한다.The number of deeper coding units according to depths for including data having the same range and size increases as the depth increases. For example, four coding units of depth 2 are required for data included in one coding unit of depth 1. Therefore, in order to compare the encoding results of the same data for each depth, each of the coding units having one depth 1 and four coding units having four depths 2 should be encoded.

각각의 심도별 부호화를 위해서는, 부호화 단위의 계층 구조(600)의 가로축을 따라, 심도별 부호화 단위의 예측 단위들마다 부호화를 수행하여, 해당 심도에서 가장 작은 부호화 오차인 대표 부호화 오차가 선택될 수다. 또한, 부호화 단위의 계층 구조(600)의 세로축을 따라 심도가 깊어지며, 각각의 심도마다 부호화를 수행하여, 심도별 대표 부호화 오차를 비교하여 최소 부호화 오차가 검색될 수 있다. 최대 부호화 단위(610) 중 최소 부호화 오차가 발생하는 심도가 최대 부호화 단위(610)의 부호화 심도 및 파티션 타입으로 선택될 수 있다. For each depth coding, encoding may be performed for each prediction unit of a coding unit according to depths along a horizontal axis of the hierarchical structure 600 of the coding unit, and a representative coding error, which is the smallest coding error at a corresponding depth, may be selected. . In addition, a depth deeper along the vertical axis of the hierarchical structure 600 of the coding unit, the encoding may be performed for each depth, and the minimum coding error may be searched by comparing the representative coding error for each depth. The depth at which the minimum coding error occurs among the maximum coding units 610 can be selected as the coding depth and the partition type of the maximum coding unit 610. [

도 7 은 본 발명의 일 실시예에 따른, 부호화 단위 및 변환 단위의 관계를 도시한다. FIG. 7 shows a relationship between an encoding unit and a conversion unit according to an embodiment of the present invention.

일 실시예에 따른 비디오 부호화 장치(100) 또는 일 실시예에 따른 비디오 복호화 장치(200)는, 최대 부호화 단위마다 최대 부호화 단위보다 작거나 같은 크기의 부호화 단위로 영상을 부호화하거나 복호화한다. 부호화 과정 중 주파수 변환을 위한 변환 단위의 크기는 각각의 부호화 단위보다 크지 않은 데이터 단위를 기반으로 선택될 수 있다.The video encoding apparatus 100 according to an embodiment or the video decoding apparatus 200 according to an embodiment encodes or decodes an image in coding units having a size smaller than or equal to the maximum coding unit for each maximum coding unit. The size of the conversion unit for frequency conversion during encoding can be selected based on data units that are not larger than the respective encoding units.

예를 들어, 일 실시예에 따른 비디오 부호화 장치(100) 또는 일 실시예에 따른 비디오 복호화 장치(200)에서, 현재 부호화 단위(710)가 64x64 크기일 때, 32x32 크기의 변환 단위(720)를 이용하여 주파수 변환이 수행될 수 있다. For example, in the video encoding apparatus 100 or the video encoding apparatus 200 according to an embodiment, when the current encoding unit 710 is 64x64 size, the 32x32 conversion unit 720 The frequency conversion can be performed.

또한, 64x64 크기의 부호화 단위(710)의 데이터를 64x64 크기 이하의 32x32, 16x16, 8x8, 4x4 크기의 변환 단위들로 각각 주파수 변환을 수행하여 부호화한 후, 원본과의 오차가 가장 적은 변환 단위가 선택될 수 있다.In addition, the data of the encoding unit 710 of 64x64 size is encoded by performing the frequency conversion with the conversion units of 32x32, 16x16, 8x8, and 4x4 size of 64x64 or smaller, respectively, and then the conversion unit having the smallest error with the original Can be selected.

도 8 은 본 발명의 일 실시예에 따라, 심도별 부호화 정보들을 도시한다.FIG. 8 illustrates depth-specific encoding information, in accordance with an embodiment of the present invention.

일 실시예에 따른 비디오 부호화 장치(100)의 부호화 정보 부호화부는 부호화 모드에 관한 정보로서, 각각의 부호화 심도의 부호화 단위마다 파티션 타입에 관한 정보(800), 예측 모드에 관한 정보(810), 변환 단위 크기에 대한 정보(820)를 부호화하여 전송할 수 있다.The encoding information encoding unit of the video encoding apparatus 100 according to the embodiment is information on an encoding mode, and includes information on a partition type 800, information on a prediction mode 810, Information 820 on the unit size can be encoded and transmitted.

파티션 타입에 대한 정보(800)는, 현재 부호화 단위의 예측 부호화를 위해 예측 단위로서, 현재 부호화 단위가 분할된 타입에 대한 정보를 나타낸다. 예를 들어, 심도 0 및 크기 2Nx2N의 현재 부호화 단위 CU_0는, 크기 2Nx2N의 예측 단위(802), 크기 2NxN의 예측 단위(804), 크기 Nx2N의 예측 단위(806), 크기 NxN의 예측 단위(808) 중 어느 하나의 타입으로 분할되어 예측 단위로 이용될 수 있다. 이 경우 현재 부호화 단위의 파티션 타입에 관한 정보(800)는 크기 2Nx2N의 예측 단위(802), 크기 2NxN의 예측 단위(804), 크기 Nx2N의 예측 단위(806) 및 크기 NxN의 예측 단위(808) 중 하나를 나타내도록 설정된다.The partition type information 800 indicates information on a type in which the current encoding unit is divided as a prediction unit for predictive encoding of the current encoding unit. For example, the current encoding unit CU_0 of depth 0 and size 2Nx2N includes a prediction unit 802 of size 2Nx2N, a prediction unit 804 of size 2NxN, a prediction unit 806 of size Nx2N, a prediction unit 808 of size NxN ) And can be used as a prediction unit. In this case, information 800 regarding the partition type of the current encoding unit includes a prediction unit 802 of size 2Nx2N, a prediction unit 804 of size 2NxN, a prediction unit 806 of size Nx2N, and a prediction unit 808 of size NxN. Lt; / RTI >

예측 모드에 관한 정보(810)는, 각각의 예측 단위의 예측 모드를 나타낸다. 예를 들어 예측 모드에 관한 정보(810)를 통해, 파티션 타입에 관한 정보(800)가 가리키는 예측 단위가 인트라 모드(812), 인터 모드(814) 및 스킵 모드(816) 중 하나로 예측 부호화가 수행되는지 여부가 설정될 수 있다.The information 810 on the prediction mode indicates the prediction mode of each prediction unit. The prediction unit indicated by the information 800 relating to the partition type is predicted to be one of the intra mode 812, the inter mode 814 and the skip mode 816 through the prediction mode information 810, for example. Can be set.

또한, 변환 단위 크기에 관한 정보(820)는 현재 부호화 단위를 어떠한 변환 단위를 기반으로 주파수 변환을 수행할지 여부를 나타낸다. 예를 들어, 변환 단위는 제 1 인트라 변환 단위 크기(822), 제 2 인트라 변환 단위 크기(824), 제 1 인터 변환 단위 크기(826), 제 2 인트라 변환 단위 크기(828) 중 하나일 수 있다.In addition, the information 820 on the conversion unit size indicates whether to perform frequency conversion on the basis of which conversion unit the current encoding unit is performed. For example, the transform unit may be one of a first intra transform unit size 822, a second intra transform unit size 824, a first inter transform unit size 826, and a second intra transform unit size 828. have.

일 실시예에 따른 비디오 복호화 장치(200)의 부호화 정보 추출부는, 각각의 심도별 부호화 단위마다 파티션 타입에 관한 정보(800), 예측 모드에 관한 정보(810), 변환 단위 크기에 대한 정보(820)를 추출하여 복호화에 이용할 수 있다.The encoding information extracting unit of the video decoding apparatus 200 according to the embodiment extracts the information 800 about the partition type, the information 810 about the prediction mode, the information 820 about the conversion unit size ) Can be extracted and used for decoding.

도 9 는 본 발명의 일 실시예에 따른 심도별 부호화 단위를 도시한다. FIG. 9 shows a depth encoding unit according to an embodiment of the present invention.

심도의 변화를 나타내기 위해 분할 정보가 이용될 수 있다. 분할 정보는 현재 심도의 부호화 단위가 하위 심도의 부호화 단위로 분할될지 여부를 나타낸다. Partition information may be used to indicate changes in depth. The division information indicates whether the current-depth encoding unit is divided into lower-depth encoding units.

심도 0 및 2N_0x2N_0 크기의 부호화 단위의 예측 부호화를 위한 예측 단위(910)는 2N_0x2N_0 크기의 파티션 타입(912), 2N_0xN_0 크기의 파티션 타입(914), N_0x2N_0 크기의 파티션 타입(916), N_0xN_0 크기의 파티션 타입(918)을 포함할 수 있다. The prediction unit 910 for the prediction encoding of the coding units of depth 0 and 2N_0x2N_0 has a partition type 912 of 2N_0x2N_0 size, a partition type 914 of 2N_0xN_0 size, a partition type 916 of N_0x2N_0 size, a partition of size N_0xN_0 Type < / RTI >

파티션 타입마다, 한 개의 2N_0x2N_0 크기의 예측 단위, 두 개의 2N_0xN_0 크기의 예측 단위, 두 개의 N_0x2N_0 크기의 예측 단위, 네 개의 N_0xN_0 크기의 예측 단위마다 반복적으로 예측 부호화가 수행되어야 한다. 크기 2N_0x2N_0, 크기 N_0x2N_0 및 크기 2N_0xN_0 및 크기 N_0xN_0의 예측 단위에 대해서는, 인트라 모드 및 인터 모드로 예측 부호화가 수행될 수 있다. 스킵 모드는 크기 2N_0x2N_0의 예측 단위에 예측 부호화가 대해서만 수행될 수 있다.For each partition type, predictive encoding should be repeatedly performed for each prediction unit of 2N_0x2N_0 size, two 2N_0xN_0 size prediction units, two N_0x2N_0 size prediction units, and four N_0xN_0 size prediction units. For a prediction unit of size 2N_0x2N_0, size N_0x2N_0, size 2N_0xN_0, and size N_0xN_0, predictive coding may be performed in intra mode and inter mode. The skip mode can be performed only for predictive encoding in a prediction unit of size 2N_0x2N_0.

크기 N_0xN_0의 파티션 타입(918)에 의한 부호화 오차가 가장 작다면, 심도 0를 1로 변경하고(920), 심도 2 및 크기 N_0xN_0의 파티션 타입의 부호화 단위들(922, 924, 926, 928)에 대해 반복적으로 최소 부호화 오차를 검색해 나갈 수 있다. If the encoding error of the partition type 918 having the size N_0xN_0 is the smallest, the depth 0 is changed to 1 (920), and the depth 2 and the coding units 922, 924, 926, and 928 of the partition type having the size N_0xN_0 are changed. We can repeatedly search for the minimum coding error.

동일한 심도의 부호화 단위들(922, 924, 926, 928)에 대해 부호화가 반복적으로 수행되므로, 이중 하나만 예를 들어 심도 1의 부호화 단위의 부호화를 설명한다. 심도 1 및 크기 2N_1x2N_1 (=N_0xN_0)의 부호화 단위의 예측 부호화를 위한 예측 단위(930)는, 크기 2N_1x2N_1의 파티션 타입(932), 크기 2N_1xN_1의 파티션 타입(934), 크기 N_1x2N_1의 파티션 타입(936), 크기 N_1xN_1의 파티션 타입(938)을 포함할 수 있다. 파티션 타입마다, 한 개의 크기 2N_1x2N_1의 예측 단위, 두 개의 크기 2N_1xN_1의 예측 단위, 두 개의 크기 N_1x2N_1의 예측 단위, 네 개의 크기 N_1xN_1의 예측 단위마다 반복적으로 예측 부호화가 수행되어야 한다.Since encoding is repeatedly performed on the encoding units 922, 924, 926, and 928 of the same depth, encoding of only one of the encoding units of depth 1, for example, will be described. The prediction unit 930 for predicting the coding unit of the depth 1 and the size 2N_1x2N_1 (= N_0xN_0) includes a partition type 932 of size 2N_1x2N_1, a partition type 934 of size 2N_1xN_1, a partition type 936 of size N_1x2N_1, , And a partition type 938 of size N_1xN_1. For each partition type, prediction encoding must be repeatedly performed for each prediction unit of size 2N_1x2N_1, prediction units of two sizes 2N_1xN_1, prediction units of two sizes N_1x2N_1, and prediction units of four sizes N_1xN_1.

또한, 크기 N_1xN_1 크기의 파티션 타입(938)에 의한 부호화 오차가 가장 작다면, 심도 1을 심도 2로 변경하면서(940), 심도 2 및 크기 N_2xN_2의 부호화 단위들(942, 944, 946, 948)에 대해 반복적으로 최소 부호화 오차를 검색해 나갈 수 있다. Also, if the encoding error of the partition type 938 having the size N_1xN_1 is the smallest, the depth 1 is changed to the depth 2 (940), and the coding units 942, 944, 946, and 948 of the depth 2 and the size N_2xN_2 are used. We can repeatedly search for the minimum coding error.

최대 심도가 d인 경우, 심도별 분할 정보는 심도 d-1일 때까지 설정될 수 있다. 즉, 심도 d-1 및 크기 2N_(d-1)x2N_(d-1)의 부호화 단위의 예측 부호화를 위한 예측 단위(950)는, 크기 2N_(d-1)x2N_(d-1)의 파티션 타입(952), 크기 2N_(d-1)xN_(d-1)의 파티션 타입(954), 크기 N_(d-1)x2N_(d-1)의 파티션 타입(956), 크기 N_(d-1)xN_(d-1)의 파티션 타입(958)을 포함할 수 있다. If the maximum depth is d, the depth-based segmentation information can be set until the depth d-1. That is, the prediction unit 950 for predictive coding of the coding unit of the depth d-1 and the size 2N_ (d-1) x2N_ (d-1) A partition type 954 of size 2N_ (d-1) xN_ (d-1), a partition type 956 of size N_ (d-1) x2N_ 1) xN_ (d-1) < / RTI >

파티션 타입마다, 한 개의 크기 2N_(d-1)x2N_(d-1)의 예측 단위, 두 개의 크기 2N_(d-1)xN_(d-1)의 예측 단위, 두 개의 크기 N_(d-1)x2N_(d-1)의 예측 단위, 네 개의 크기 N_(d-1)xN_(d-1)의 예측 단위마다 반복적으로 예측 부호화를 통한 부호화가 수행되어야 한다. 최대 심도가 d이므로, 심도 d-1의 부호화 단위(952)는 더 이상 분할 과정을 거치지 않는다.(D-1) x2N_ (d-1), two predicted units of two sizes 2N_ (d-1) (d-1) x2N_ (d-1), four sizes N_ (d-1) xN_ (d-1). Since the maximum depth is d, the coding unit 952 of depth d-1 no longer undergoes the division process.

일 실시예에 따른 비디오 부호화 장치(100)는 부호화 단위(912)를 위한 부호화 심도를 결정하기 위해, 심도별 부호화 오차를 비교하여 가장 작은 부호화 오차가 발생하는 심도를 선택한다. The video coding apparatus 100 according to an exemplary embodiment compares depth-based coding errors to determine the depth of coding for the coding unit 912, and selects the depth at which the smallest coding error occurs.

예를 들어, 심도 0의 부호화 단위에 대한 부호화 오차는 파티션 타입(912, 914, 916, 918)마다 예측 부호화를 수행한 후 가장 작은 부호화 오차가 발생하는 예측 단위가 결정된다. 마찬가지로 심도 0, 1, ..., d-1 마다 부호화 오차가 가장 작은 예측 단위가 검색될 수 있다. 심도 d에서는, 크기 2N_dx2N_d의 부호화 단위이면서 예측 단위(960)를 기반으로 한 예측 부호화를 통해 부호화 오차가 결정될 수 있다. For example, a coding error for a coding unit of depth 0 is predicted for each partition type 912, 914, 916, and 918, and a prediction unit in which the smallest coding error occurs is determined. Similarly, a prediction unit having the smallest coding error can be searched for every depth 0, 1, ..., d-1. At the depth d, the coding error can be determined through predictive coding based on the prediction unit 960, which is a coding unit of size 2N_dx2N_d.

이런 식으로 심도 0, 1, ..., d-1, d의 모든 심도별 최소 부호화 오차를 비교하여 오차가 가장 작은 심도가 선택되어 부호화 심도로 결정될 수 있다. 부호화 심도 및 해당 심도의 예측 단위는 부호화 모드에 관한 정보로써 부호화되어 전송될 수 있다. 또한, 심도 0으로부터 부호화 심도에 이르기까지 부호화 단위가 분할되어야 하므로, 부호화 심도의 분할 정보만이 '0'으로 설정되고, 부호화 심도를 제외한 심도별 분할 정보는 '1'로 설정되어야 한다. In this way, the depth with the smallest error can be determined by comparing the minimum coding errors for all depths of depths 0, 1, ..., d-1, d, and can be determined as the coding depth. The coded depth and the prediction unit of the corresponding depth may be encoded and transmitted as information about an encoding mode. In addition, since the coding unit must be split from the depth 0 to the coded depth, only the split information of the coded depth is set to '0', and the split information for each depth except the coded depth should be set to '1'.

일 실시예에 따른 비디오 복호화 장치(200)의 부호화 정보 추출부(220)는 부호화 단위(912)에 대한 부호화 심도 및 예측 단위에 관한 정보를 추출하여 부호화 단위(912)를 복호화하는데 이용할 수 있다. 일 실시예에 따른 비디오 복호화 장치(200)는 심도별 분할 정보를 이용하여 분할 정보가 '0'인 심도를 부호화 심도로 파악하고, 해당 심도에 대한 부호화 모드에 관한 정보를 이용하여 복호화에 이용할 수 있다.The encoding information extracting unit 220 of the video decoding apparatus 200 according to an exemplary embodiment may extract information on the encoding depth and prediction unit of the encoding unit 912 and decode the encoding unit 912. [ The video decoding apparatus 200 according to an embodiment may identify a depth having split information of '0' as a coding depth using split information for each depth, and may use the decoding depth by using information about an encoding mode for a corresponding depth. have.

도 10a, 10b 및 10c는 본 발명의 일 실시예에 따른, 부호화 단위, 예측 단위 및 주파수 변환 단위의 관계를 도시한다.FIGS. 10A, 10B, and 10C illustrate the relationship between an encoding unit, a prediction unit, and a frequency conversion unit according to an embodiment of the present invention.

부호화 단위(1010)는, 최대 부호화 단위에 대해 일 실시예에 따른 비디오 부호화 장치(100)가 결정한 부호화 심도별 부호화 단위들이다. 예측 단위(1060)는 부호화 단위(1010) 중 각각의 부호화 심도별 부호화 단위의 예측 단위들이며, 변환 단위(1070)는 각각의 부호화 심도별 부호화 단위의 변환 단위들이다.The coding units 1010 are coding units according to coding depths determined by the video encoding apparatus 100 according to an embodiment with respect to the maximum coding unit. The prediction unit 1060 is a prediction unit of each coding depth unit among the coding units 1010 and the conversion unit 1070 is a conversion unit of each coding depth unit.

심도별 부호화 단위들(1010)은 최대 부호화 단위의 심도가 0이라고 하면, 부호화 단위들(1012, 1054)은 심도가 1, 부호화 단위들(1014, 1016, 1018, 1028, 1050, 1052)은 심도가 2, 부호화 단위들(1020, 1022, 1024, 1026, 1030, 1032, 1048)은 심도가 3, 부호화 단위들(1040, 1042, 1044, 1046)은 심도가 4이다. If the depth-based coding units 1010 have a depth of 0, the coding units 1012 and 1054 have a depth of 1, and the coding units 1014, 1016, 1018, 1028, 1050, and 1052 have depths. 2, coding units 1020, 1022, 1024, 1026, 1030, 1032, and 1048 have a depth of three, and coding units 1040, 1042, 1044, and 1046 have a depth of four.

예측 단위들(1060) 중 일부(1014, 1016, 1022, 1032, 1048, 1050, 1052, 1054)는 부호화 단위가 분할된 타입이다. 즉, 예측 단위(1014, 1022, 1050, 1054)는 2NxN의 파티션 타입이며, 예측 단위(1016, 1048, 1052)는 Nx2N의 파티션 타입, 예측 단위(1032)는 NxN의 파티션 타입이다. 즉, 심도별 부호화 단위들(1010)의 예측 단위는 각각의 부호화 단위보다 작거나 같다. A portion (1014, 1016, 1022, 1032, 1048, 1050, 1052, 1054) of the prediction units 1060 is a type in which the coding unit is divided. That is, the prediction units 1014, 1022, 1050 and 1054 are partition types of 2NxN, the prediction units 1016, 1048 and 1052 are partition types of Nx2N, and the prediction units 1032 are partition types of NxN. That is, the prediction unit of the deeper coding units 1010 is smaller than or equal to each coding unit.

변환 단위들(1070) 중 일부(1052)의 영상 데이터에 대해서는 부호화 단위에 비해 작은 크기의 데이터 단위로 주파수 변환 또는 주파수 역변환이 수행된다. 또한, 변환 단위(1014, 1016, 1022, 1032, 1048, 1050, 1052, 1054)는 예측 단위들(1060) 중 해당 예측 단위와 비교해보면, 서로 다른 크기 또는 형태의 데이터 단위이다. 즉, 일 실시예에 따른 비디오 부호화 장치(100) 및 일 실시예에 다른 비디오 복호화 장치(200)는 동일한 부호화 단위에 대한 인트라 예측/움직임 추정/움직임 보상 작업, 및 주파수 변환/역변환 작업이라 할지라도, 각각 별개의 데이터 단위를 기반으로 수행할 수 있다.The image data of a part 1052 of the conversion units 1070 is subjected to frequency conversion or frequency inverse conversion in units of data smaller in size than the encoding unit. The conversion units 1014, 1016, 1022, 1032, 1048, 1050, 1052, and 1054 are data units of different sizes or types when compared with the prediction units of the prediction units 1060. That is, the video encoding apparatus 100 according to the embodiment and the video decoding apparatus 200 according to an embodiment can perform the intra prediction / motion estimation / motion compensation operation for the same encoding unit and the frequency conversion / , Each based on a separate data unit.

도 11 은 본 발명의 일 실시예에 따른 부호화 단위별 부호화 정보를 도시한다.FIG. 11 shows encoding information for each encoding unit according to an embodiment of the present invention.

일 실시예에 따른 비디오 부호화 장치(100)의 출력부(130)는 부호화 단위별 부호화 정보를 출력하고, 일 실시예에 따른 비디오 복호화 장치(200)의 부호화 정보 추출부(220)는 부호화 단위별 부호화 정보를 추출할 수 있다.The output unit 130 of the video encoding apparatus 100 according to an exemplary embodiment outputs encoding information for each encoding unit and the encoding information extracting unit 220 of the video decoding apparatus 200 according to an embodiment extracts encoding information for each encoding unit It is possible to extract the encoding information.

부호화 정보는 부호화 단위에 대한 분할 정보, 파티션 타입 정보, 예측 모드 정보, 변환 단위 크기 정보를 포함할 수 있다. 도 11에 도시되어 있는 부호화 정보들은 일 실시예에 따른 비디오 부호화 장치(100) 및 일 실시예에 따른 비디오 복호화 장치(200)에서 설정할 수 있는 일례이다.The encoding information may include split information about a coding unit, partition type information, prediction mode information, and transformation unit size information. The encoding information shown in FIG. 11 is an example that can be set in the video encoding apparatus 100 according to the embodiment and the video encoding apparatus 200 according to an embodiment.

분할 정보는 해당 부호화 단위의 부호화 심도를 나타낼 수 있다. 즉, 분할 정보에 따라 더 이상 분할되지 않는 심도가 부호화 심도이므로, 부호화 심도에 대해서 파티션 타입 정보, 예측 모드, 변환 단위 크기 정보가 정의될 수 있다. 분할 정보에 따라 한 단계 더 분할되어야 하는 경우에는, 분할된 4개의 하위 심도의 부호화 단위마다 독립적으로 부호화가 수행되어야 한다.The split information may indicate a coded depth of a corresponding coding unit. That is, since depths that are no longer divided according to the division information are coding depths, partition type information, prediction mode, and conversion unit size information can be defined with respect to the coding depth. If it is to be further split by the split information, encoding should be performed independently for each coding unit of the divided four lower depths.

파티션 타입 정보는, 부호화 심도의 부호화 단위의 변환 단위의 파티션 타입을 2Nx2N, 2NxN, Nx2N 및 NxN 중 하나로 나타낼 수 있다. 예측 모드는, 인트라 모드, 인터 모드 및 스킵 모드 중 하나로 나타낼 수 있다. 인트라 모드 및 인터 모드는 파티션 타입 2Nx2N, 2NxN, Nx2N 및 NxN에서 정의될 수 있으며, 스킵 모드는 파티션 타입 2Nx2N에서만 정의될 수 있다. 변환 단위 크기는 인트라 모드에서 두 종류의 크기, 인터 모드에서 두 종류의 크기로 설정될 수 있다.As the partition type information, the partition type of the conversion unit of the coding unit of the coding depth can be represented by 2Nx2N, 2NxN, Nx2N and NxN. The prediction mode may be represented by one of an intra mode, an inter mode, and a skip mode. The intra mode and the inter mode can be defined in the partition types 2Nx2N, 2NxN, Nx2N and NxN, and the skip mode can be defined only in the partition type 2Nx2N. The conversion unit size may be set to two kinds of sizes in the intra mode and two kinds of sizes in the inter mode.

부호화 단위 내의 최소 부호화 단위마다, 소속되어 있는 부호화 심도의 부호화 단위별 부호화 정보를 수록하고 있을 수 있다. 따라서, 인접한 최소 부호화 단위들끼리 각각 보유하고 있는 부호화 정보들을 확인하면, 동일한 부호화 심도의 부호화 단위에 포함되는지 여부가 확인될 수 있다. 또한, 최소 부호화 단위가 보유하고 있는 부호화 정보를 이용하면 해당 부호화 심도의 부호화 단위를 확인할 수 있으므로, 최대 부호화 단위 내의 부호화 심도들의 분포가 유추될 수 있다.The encoding unit-specific encoding information of the belonging encoding depth may be stored for each minimum encoding unit in the encoding unit. Therefore, if encoding information held in each of the adjacent minimum encoding units is checked, it can be confirmed whether or not the encoding information is included in the encoding unit of the same encoding depth. In addition, since the encoding unit of the encoding depth can be identified by using the encoding information held in the minimum encoding unit, the distribution of encoding depths in the maximum encoding unit can be inferred.

따라서 이 경우 현재 부호화 단위가 주변 데이터 단위를 참조하여 예측하기 경우, 현재 부호화 단위에 인접하는 심도별 부호화 단위 내의 최소 부호화 단위의 부호화 정보가 직접 이용됨으로써 최소 부호화 단위의 데이터가 참조될 수 있다.In this case, when the current encoding unit is predicted with reference to the neighboring data unit, the encoding information of the minimum encoding unit in the depth encoding unit adjacent to the current encoding unit is directly used, so that the data of the minimum encoding unit can be referred to.

또 다른 실시예로, 심도별 부호화 단위의 부호화 정보가 심도별 부호화 단위 내 중 대표되는 최소 부호화 단위에 대해서만 저장되어 있을 수 있다. 이 경우 현재 부호화 단위가 주변 부호화 단위를 참조하여 예측되는 경우, 인접하는 심도별 부호화 단위의 부호화 정보를 이용하여, 심도별 부호화 단위 내에서 현재 부호화 단위에 인접하는 데이터가 검색됨으로써 참조될 수도 있다.In yet another embodiment, the encoding information of the depth encoding unit may be stored only for the minimum encoding unit represented in the depth encoding unit. In this case, when the current encoding unit is predicted by referring to the surrounding encoding unit, the data adjacent to the current encoding unit in the depth encoding unit may be retrieved using the encoding information of the adjacent depth encoding unit.

도 12 는 본 발명의 일 실시예에 따른 비디오 부호화 방법의 흐름도를 도시한다.12 shows a flowchart of a video coding method according to an embodiment of the present invention.

단계 1210에서, 현재 픽처는 적어도 하나의 최대 부호화 단위로 분할된다. 또한, 가능한 총 분할 횟수를 나타내는 최대 심도가 미리 설정될 수도 있다.In step 1210, the current picture is divided into at least one maximum encoding unit. In addition, a maximum depth indicating the total number of possible divisions may be set in advance.

단계 1220에서, 심도마다 최대 부호화 단위의 영역이 분할된 적어도 하나의 분할 영역시 부호화되어, 적어도 하나의 분할 영역 별로 최종 부호화 결과가 출력될 심도가 결정된다. 최대 부호화 단위가 단계별로 분할되며 심도가 깊어질 때마다, 하위 심도별 부호화 단위들마다 반복적으로 부호화가 수행되어야 한다. In step 1220, the area of the maximum coding unit is coded at least in one divided area for each depth, and the depth at which the final coding result is output for each of at least one of the divided areas is determined. The maximum encoding unit is divided into stages, and each time the depth is deepened, it is necessary to repeatedly perform encoding for each lower-depth encoding unit.

또한, 심도별 부호화 단위마다, 부호화 오차가 가장 작은 파티션 타입별 변환 단위가 결정되어야 한다. 부호화 단위의 최소 부호화 오차를 발생시키는 부호화 심도가 결정되기 위해서는, 모든 심도별 부호화 단위마다 부호화 오차가 측정되어 비교되어야 한다. For each depth-based coding unit, the conversion unit for each partition type having the smallest coding error should be determined. In order to determine the coding depth that causes the minimum coding error of the coding unit, the coding error should be measured and compared for each coding unit of each depth.

단계 1230에서, 최대 부호화 단위마다 적어도 하나의 분할 영역 별 최종 부호화 결과인 영상 데이터와, 부호화 심도 및 부호화 모드에 관한 정보가 출력된다. 부호화 모드에 관한 정보는 부호화 심도에 관한 정보 또는 분할 정보, 부호화 심도의 파티션 타입 정보, 예측 모드 정보 및 변환 단위 크기 정보 등을 포함할 수 있다. 부호화된 부호화 모드에 관한 정보는, 부호화된 비디오 데이터와 함께 복호화단으로 전송될 수 있다.In step 1230, video data as a final encoding result for each of at least one divided area and information on the coding depth and coding mode are output for each maximum coding unit. The information on the encoding mode may include information on the encoding depth or division information, partition type information of the encoding depth, prediction mode information, and conversion unit size information. Information on the encoded encoding mode can be transmitted to the decoding end together with the encoded video data.

도 13 은 본 발명의 일 실시예에 따른 비디오 복호화 방법의 흐름도를 도시한다.13 shows a flowchart of a video decoding method according to an embodiment of the present invention.

단계 1310에서, 부호화된 비디오에 대한 비트스트림이 수신되어 파싱된다. In step 1310, the bitstream for the encoded video is received and parsed.

단계 1320에서, 파싱된 비트스트림으로부터 최대 크기의 최대 부호화 단위에 할당되는 현재 픽처의 영상 데이터 및 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보가 추출된다. 최대 부호화 단위별 부호화 심도는, 현재 픽처의 부호화 과정에서 최대 부호화 단위별로 부호화 오차가 가장 적은 심도로 선택된 심도이다. 최대 부호화 단위별 부호화는, 최대 부호화 단위를 심도별로 계층적으로 분할한 적어도 하나의 데이터 단위에 기반하여 영상 데이터가 부호화된 것이다. 따라서, 부호화 단위별 부호화 심도를 파악한 후 각각의 영상 데이터를 복호화함으로써 영상의 부복호화의 효율성이 향상될 수 있다.In step 1320, the image data of the current picture allocated to the largest coding unit of the maximum size from the parsed bitstream, and information on the coding depth and coding mode of each coding unit are extracted. The coding depth for each maximum coding unit is the depth selected with the smallest coding error for each maximum coding unit in the process of coding the current picture. The encoding of the maximum encoding unit is the image data encoded based on at least one data unit obtained by dividing the maximum encoding unit hierarchically by depth. Accordingly, the decoding efficiency of the image can be improved by decoding each image data after determining the coding depth for each coding unit.

단계 1330에서, 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보에 기초하여 각각의 최대 부호화 단위의 영상 데이터가 복호화된다. 복호화된 영상 데이터는 재생 장치에 의해 재생되거나, 저장 매체에 저장되거나, 네트워크를 통해 전송될 수 있다.In step 1330, the image data of each maximum encoding unit is decoded based on the information on the encoding depth and the encoding mode for each maximum encoding unit. The decoded image data can be reproduced by a reproducing apparatus, stored in a storage medium, or transmitted via a network.

이하 도 14 내지 19 를 참조하여, 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 따른 비디오 부호화 및 비디오 복호화가 상술된다. Hereinafter, video encoding and video decoding according to inter prediction using a partition divided at an arbitrary ratio will be described with reference to FIGS. 14 to 19.

도 14 는 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 따른 비디오 부호화 장치의 블록도를 도시한다.14 is a block diagram of a video encoding apparatus according to inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.

일 실시예에 따른 비디오 부호화 장치(1400)는 최대 부호화 단위 분할부(1410), 부호화부(1420) 및 출력부(1430)를 포함한다. The video encoding apparatus 1400 according to an embodiment includes a maximum coding unit splitter 1410, an encoder 1420, and an output unit 1430.

최대 부호화 단위 분할부(1410)는 비디오 데이터를 최대 크기의 부호화 단위인 적어도 하나의 최대 부호화 단위로 분할한다. 최대 부호화 단위로 분할된 비디오 데이터는 부호화부(1420)로 출력된다. 최대 부호화 단위는 프레임 시퀀스, 프레임, 슬라이스, 부호화 단위 등의 데이터 단위 별로 미리 설정될 수 있다. The maximum coding unit splitter 1410 splits the video data into at least one maximum coding unit that is a coding unit having a maximum size. The video data divided into the largest coding units is output to the encoder 1420. The maximum coding unit may be preset for each data unit such as a frame sequence, a frame, a slice, and a coding unit.

일 실시예에 따른 최대 부호화 단위는, 16x16, 32x32, 64x64, 128x128 및 256x256 블록들 중 적어도 하나로 선택적으로 설정될 수 있다. The maximum coding unit according to an embodiment may be selectively set to at least one of 16x16, 32x32, 64x64, 128x128, and 256x256 blocks.

부호화부(1420)는, 최대 부호화 단위 분할부(1410)에 의해 분할된 최대 부호화 단위별 비디오 데이터를 부호화한다. 부호화부(1420)는, 최대 부호화 단위의 적어도 하나의 분할 영역 별로 계층적 구조의 심도별 부호화 단위들에 기초하여 부호화한다. 심도별 부호화 단위의 부호화 과정 중 인터 예측은, 심도별 부호화 단위가 포함하는 파티션을 이용하여 유사 영역을 검색하여 파티션의 움직임 정보를 추정함으로써 수행된다. The encoder 1420 encodes the video data for each largest coding unit divided by the maximum coding unit splitter 1410. The encoder 1420 encodes at least one split region of the maximum coding unit based on depth-based coding units of a hierarchical structure. Inter-prediction during the encoding process of the coding units according to depths is performed by searching for a similar region using a partition included in the coding units according to depths to estimate motion information of the partitions.

일 실시예에 따른 인터 예측은 부호화 단위가 임의적 비율로 분할된 파티션을 이용할 수 있다. 앞서 도 3 내지 도 11 등에서 도시된 예측 단위 및 파티션 타입의 예는, 2Nx2N 크기의 부호화 단위가 분할된 2Nx2N, 2NxN, Nx2N, NxN 크기의 파티션들을 포함한다. 이런 식으로 부호화 단위의 너비 및 높이 중 적어도 하나가 1 대 1의 비율로 분할된 파티션 뿐만 아니라, 일 실시예에 따른 부호화부(1420)는 임의적 비율 또는 비대칭적 비율로 분할된 파티션을 포함하는 파티션 타입에 따라 인터 예측을 수행할 수 있다. According to an embodiment, inter prediction may use a partition in which coding units are divided at an arbitrary ratio. Examples of the prediction unit and the partition type illustrated in FIG. 3 to FIG. 11 and the like include partitions having 2Nx2N, 2NxN, Nx2N, and NxN sizes in which coding units having a size of 2Nx2N are divided. In this way, as well as a partition in which at least one of the width and height of the coding unit is divided at a ratio of 1 to 1, the encoder 1420 according to an embodiment may include a partition including a partition divided at an arbitrary ratio or an asymmetric ratio. Inter prediction may be performed according to a type.

예를 들어, 일 실시예에 따른 부호화 단위의 임의적 비율로 분할된 파티션은, 부호화 단위의 높이 및 너비 중 적어도 하나가 1 대 3 또는 3 대 1로 분할된 파티션일 수 있다. 또한, 파티션으로 분할되는 임의적 비율은, 1 대 2, 2 대 1, 1 대 3, 3 대 1, 2 대 3, 3 대 2, 1 대 4, 4 대 1 등의 다양한 비율일 수 있다. For example, the partition divided at an arbitrary ratio of the coding unit according to an embodiment may be a partition in which at least one of the height and the width of the coding unit is divided into one to three or three to one. Also, the arbitrary ratio divided into partitions may be various ratios such as 1 to 2, 2 to 1, 1 to 3, 3 to 1, 2 to 3, 3 to 2, 1 to 4, 4 to 1, and the like.

일 실시예에 따른 파티션 타입은, 부호화 단위가 임의적 비율로 분할되는 파티션 뿐만 아니라 부호화 단위가 비대칭적으로 분할되는 파티션을 포함할 수 있다. 또한, 일 실시예에 따른 부호화 단위의 인터 예측을 위한 파티션 타입은 임의적 비율에 따라 일정한 방향으로 분할되는 파티션에 한정하여 포함하는 것은 아니며, 임의적 형태의 파티션을 포함할 수도 있다.The partition type according to an embodiment may include a partition in which a coding unit is split asymmetrically as well as a partition in which a coding unit is split at an arbitrary ratio. In addition, the partition type for inter prediction of a coding unit according to an embodiment is not limited to a partition divided in a predetermined direction according to an arbitrary ratio, and may include an arbitrary type partition.

일 실시예에 따른 부호화부(1420)는, 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 선택적으로 결정할 수 있다. 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 나타내는 정보는 별도로 부호화되어 비트스트림에 포함될 수 있다.The encoder 1420 according to an embodiment may selectively determine whether to perform inter prediction using a partition in which a coding unit is divided at an arbitrary ratio. Information indicating whether to perform inter prediction using a partition in which a coding unit is divided at an arbitrary ratio may be separately encoded and included in a bitstream.

일 실시예에 따른 부호화부(1420)는, 최대 부호화 단위의 비디오 데이터를 분할 영역별로 계층적 구조에 따른 심도별 부호화 단위들에 기초하여 부호화하고, 심도별 부호화 결과들을 비교하여 부호화 효율이 가장 높은 심도를 선택한다. 선택된 심도는 해당 최대 부호화 단위의 분할 영역에 대한 부호화 심도로서, 부호화 심도에 대한 정보는 해당 부호화 단위의 부호화 결과로서 부호화된다. 최대 부호화 단위 내의 적어도 하나의 분할 영역마다 부호화 심도가 독립적으로 결정되므로, 하나의 최대 부호화 단위에 대해 적어도 하나의 부호화 심도가 결정될 수 있다.The encoder 1420 according to an embodiment encodes video data of a maximum coding unit based on coding units according to depths according to a hierarchical structure for each divided region, and compares encoding results according to depths to obtain the highest coding efficiency. Select the depth of field. The selected depth is a coded depth of a divided region of the corresponding largest coding unit, and information on the coded depth is encoded as a coding result of the corresponding coding unit. Since a coded depth is independently determined for at least one divided region in the maximum coding unit, at least one coded depth may be determined for one maximum coding unit.

일 실시예에 따른 출력부(1430)는 최대 부호화 단위별로, 분할 영역 별 부호화 심도에 대응하는 부호화된 비디오 데이터, 부호화 심도 및 부호화 모드에 관한 정보를 포함하는 비트스트림을 출력한다. 일 실시예에 따른 출력부(1430)는, 인터 예측을 위한 파티션 타입이 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보를 비트스트림에 포함시킬 수 있다. 인터 예측을 위한 파티션 타입이 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보는, 프레임 시퀀스, 슬라이스, 부호화 단위 등 데이터 단위 별로 설정되어, 비트스트림의 시퀀스 파라미터 세트, 슬라이스 헤더, 부호화 단위별 부호화 정보 등에 삽입될 수 있다.According to an embodiment, the output unit 1430 outputs a bitstream including coded video data, coded depth, and information about a coded mode, corresponding to a coded depth of each divided region, for each maximum coding unit. The output unit 1430 according to an embodiment may include information indicating whether a partition type for inter prediction includes a partition that splits a coding unit at an arbitrary ratio into a bitstream. Information indicating whether the partition type for inter prediction includes a partition that splits a coding unit at an arbitrary ratio is set for each data unit such as a frame sequence, a slice, a coding unit, and the like. It may be inserted into the encoding information for each unit.

일 실시예에 따른 부호화 단위는 기존 매크로블록에 비해 훨씬 큰 데이터량을 수록하므로, 하나의 부호화 단위 안에는 각각 다른 영상 특성을 가진 영역들이 포함될 수 있다. 부호화 단위의 예측 부호화를 위해서는, 부호화 단위가 동일한 영상 특성의 영역끼리 분할되어 예측 부호화를 위한 파티션이 생성되는 것이 유리하다. Since a coding unit according to an embodiment includes a much larger data amount than a conventional macroblock, regions having different image characteristics may be included in one coding unit. For predictive coding of coding units, it is advantageous to generate partitions for predictive coding by dividing regions of image characteristics having the same coding unit.

부호화 단위의 중심을 기준으로 서로 다른 영상 특성의 영역으로 구분될 수도 있지만, 부호화 단위의 크기가 클수록 상호 구별되는 영역 간의 경계가 상하좌우 어느 한쪽으로 치우칠 가능성이 높다. 부호화 단위의 너비 및 높이를 일 대 일로 반분하는 파티션만이 이용되는 경우, 서로 구별되는 영역들 간의 경계가 한쪽에 치우친 부호화 단위를 정확히 예측 부호화하기 위해서는, 하나의 독립적인 영역만을 포함하는 작은 파티션을 생성하기 위해 현재 부호화 단위가 하위 심도의 부호화 단위로 분할될 수 밖에 없다. Although the image may be divided into regions having different image characteristics based on the center of the coding unit, the larger the size of the coding unit, the more likely the boundary between the regions to be separated to one of the top, bottom, left and right. When only partitions that split the width and height of coding units one-to-one are used, a small partition including only one independent region may be used to accurately predict and encode coding units in which boundaries between distinct regions are biased to one side. In order to generate, the current coding unit may be split into coding units having a lower depth.

그러나, 일 실시예에 따른 비디오 부호화 장치(1400)와 같이 임의적 비율로 분할되는 파티션을 이용한 인터 예측이 가능한 경우, 현재 심도별 부호화 단위를 하위 심도로 더 분할 필요 없이 현재 심도에서 어느 한 쪽으로 치우쳐 분할되는 파티션을 이용하여 인터 예측을 수행하는 것으로 충분하다. 따라서, 부호화 단위의 파티션이 부호화 단위의 높이 또는 너비를 일 대 일로 반분하는 파티션 뿐만 아니라, 임의적 비율로 분할하는 파티션 또는 임의적 형태의 파티션을 포함하는 경우, 대형 부호화 단위에 대한 보다 효율적이고 정확한 예측 부호화가 가능하다.However, when inter prediction using a partition divided at an arbitrary ratio is possible, such as the video encoding apparatus 1400 according to an exemplary embodiment, the coding unit for each current depth may be split in one direction at the current depth without further dividing into lower depths. It is sufficient to perform inter prediction using partitions to be used. Therefore, when the partition of the coding unit includes not only a partition that divides the height or width of the coding unit in one-to-one, but also a partition that divides in an arbitrary ratio or an arbitrary form of partition, more efficient and accurate predictive coding for a large coding unit. Is possible.

또한, 부호화 단위가 임의적 비율로 분할되는 파티션 또는 임의적 형태의 파티션에 의한 예측 부호화는, 비디오 부복호화기의 하드웨어 성능, 비디오 부복호화 서비스를 제공받는 사용자 요구, 비디오가 부호화된 비트스트림의 전송 환경에 따라 선택적으로 수행될 수 있다.In addition, predictive encoding by a partition having a coding unit divided at an arbitrary ratio or an arbitrary type of partition may be applied to a hardware performance of a video decoder, a user's request for receiving a video decoding service, and a transmission environment of a video encoded bitstream. May be optionally performed accordingly.

도 15 는 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 의한 비디오 복호화 장치의 블록도를 도시한다.FIG. 15 is a block diagram of an apparatus for video decoding by inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.

일 실시예에 따른 비디오 복호화 장치(1500)는, 파싱부(1510), 추출부(1520) 및 복호화부(1530)를 포함한다. 파싱부(1510)는, 부호화된 비디오에 대한 비트스트림을 수신하여 비트스트림의 심볼들을 파싱(parsing)한다. 추출부(1520)는, 파싱된 비트스트림으로부터, 최대 부호화 단위별로 부호화된 비디오 데이터, 및 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보를 추출한다. The video decoding apparatus 1500 according to an embodiment includes a parser 1510, an extractor 1520, and a decoder 1530. The parser 1510 receives the bitstream of the encoded video and parses the symbols of the bitstream. The extractor 1520 extracts video data encoded for each largest coding unit, and information about a coded depth and an encoding mode for each maximum coding unit, from the parsed bitstream.

일 실시예에 따른 추출부(1520)는, 비트스트림으로부터 인터 예측을 위한 파티션 타입이 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보를 더 추출할 수 있다. 인터 예측을 위한 파티션 타입이 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부는 비트스트림의 시퀀스 파라미터 세트, 슬라이스 헤더, 부호화 단위별 부호화 정보 등으로부터 추출될 수 있다.The extractor 1520 according to an embodiment may further extract information indicating whether a partition type for inter prediction includes a partition that splits a coding unit at an arbitrary ratio from the bitstream. Whether the partition type for inter prediction includes a partition that splits a coding unit at an arbitrary ratio may be extracted from a sequence parameter set of a bitstream, a slice header, encoding information for each coding unit, and the like.

복호화부(1520)는, 추출부(1520)에서 추출된 비디오 데이터 및 부호화 정보를 수신받아, 부호화 정보에 기초하여 비디오 데이터를 복호화한다. 구체적으로 일 실시예에 따른 복호화부(1520)는, 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보에 기초하여, 최대 부호화 단위별로 적어도 하나의 부호화 심도별 부호화 단위마다 비디오 데이터를 복호화한다. The decoder 1520 receives the video data and the encoding information extracted by the extraction unit 1520, and decodes the video data based on the encoding information. In more detail, the decoder 1520 according to an embodiment decodes video data for at least one coding unit for each coding depth based on the information about the coded depth and the encoding mode for each largest coding unit.

특히 일 실시예에 따른 복호화부(1520)는, 추출부(1520)에서 추출된 인터 예측을 위한 파티션 타입이 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보에 따라 선택적으로, 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 움직임 보상을 수행할 수 있다. In particular, the decoder 1520 according to an embodiment may selectively encode based on information indicating whether a partition type for inter prediction extracted by the extractor 1520 includes a partition that splits a coding unit at an arbitrary ratio. Motion compensation may be performed using a partition in which units are divided at an arbitrary ratio.

즉 일 실시예에 따른 복호화부(1520)는, 부호화 단위가 1 대 1로 분할된 파티션 뿐만 아니라 1 대 2, 2 대 1, 1 대 3, 3 대 1, 2 대 3, 3 대 2, 1 대 4, 4 대 1 등의 임의적 비율로 비대칭적으로 분할된 파티션을 포함하는 파티션 타입에 따라 예측된 움직임 벡터를 이용하여, 움직임 보상을 수행할 수 있다. 또한, 부호화 단위가 일정한 방향으로 분할된 파티션 뿐만 아니라 임의적 형태의 파티션을 이용하여 움직임 보상이 수행될 수도 있다.That is, the decoder 1520 according to an embodiment may include not only partitions in which coding units are divided into one-to-one, but also one-to-two, two-to-one, one-to-three, three-to-one, two-to-three, three-to-two and one. Motion compensation may be performed using a motion vector predicted according to a partition type including partitions asymmetrically divided at an arbitrary ratio of 4 to 4, 4 to 1, and the like. In addition, motion compensation may be performed by using an arbitrary partition as well as a partition in which coding units are divided in a predetermined direction.

일 실시예에 따른 복호화부(1520)는, 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행하며 부호화되었는지 여부를 파악하여, 선택적으로, 너비 및 높이가 임의적 비율인 파티션에 따라 움직임 보상을 수행할 수 있으므로, 다양한 영상 특성의 영역으로 구분되는 부호화 단위를 정확히 복원할 수 있다. The decoder 1520 according to an embodiment determines whether the coding unit is encoded by performing inter prediction using a partition divided at an arbitrary ratio, and optionally, moves the partition according to a partition having an arbitrary ratio of width and height. Since compensation may be performed, a coding unit divided into regions of various image characteristics may be accurately reconstructed.

일 실시예에 따른 비디오 복호화 장치(1500)는, 최대 부호화 단위별로 복호화된 비디오 데이터를 복원하여 재생할 수 있다.The video decoding apparatus 1500 according to an embodiment may restore and reproduce decoded video data for each largest coding unit.

따라서, 일 실시예에 따른 비디오 부호화 장치(1400) 및 일 실시예에 따른 비디오 복호화 장치(1500)와 같이 임의적 비율로 분할되는 파티션을 이용한 예측 부복호화가 가능한 경우, 현재 심도별 부호화 단위를 하위 심도로 더 분할 필요 없이 현재 심도에서 어느 한 쪽으로 치우쳐 분할되는 파티션을 이용하여 인터 예측을 수행할 수 있다. 이렇게 임의적 비율로 분할되는 파티션을 이용하여, 대형 부호화 단위에 대해 보다 정확하고 효율적으로 예측 부호화 또는 복호화할 수 있다.Therefore, when prediction encoding and decoding using a partition divided at an arbitrary ratio is possible, such as the video encoding apparatus 1400 and the video decoding apparatus 1500 according to an embodiment, a coding depth according to a current depth may be a lower depth. Inter prediction may be performed using partitions that are partitioned in one direction at the current depth without the need for further partitioning. By using a partition divided at an arbitrary ratio, it is possible to predictively encode or decode a large coding unit more accurately and efficiently.

도 16 은 일 실시예에 따라, 부호화 단위가 임의적 비율로 분할된 예시적 파티션들을 도시한다.16 illustrates example partitions in which a coding unit is divided at an arbitrary ratio, according to an embodiment.

일 실시예에 따라 부호화 단위의 예측 부호화를 위한 파티션 타입은, 부호화 단위의 높이 및 너비를 임의적 비율로 분할한 파티션을 포함할 수 있다. 예를 들어, 64x64 크기의 부호화 단위(1600)의 파티션은, 부호화 단위의 높이 및 너비 중 적어도 하나를 1 대 1로 분할한 64x32, 32x64, 32x32 파티션 뿐만 아니라, 1 대 3 또는 3대 1로 분할한 파티션들을 포함할 수 있다.According to an embodiment, a partition type for predictive encoding of a coding unit may include a partition obtained by dividing a height and a width of a coding unit at an arbitrary ratio. For example, a partition of a coding unit 1600 of size 64x64 may be divided into not only 64x32, 32x64, and 32x32 partitions in which at least one of the height and width of the coding unit is divided into one-to-one, but also divided into one-to-three or three-to-one. It can contain one partition.

즉, 64x64 크기의 부호화 단위(1600)의 파티션 타입은, 경우 1610 또는 1620과 같이 높이를 1 대 3 또는 3 대 1로 분할하여 생성된 64x16, 64x48 크기의 파티션들을 포함할 수 있다. 또한 64x64 크기의 부호화 단위(1600)의 파티션 타입은, 경우 1630 또는 1640과 같이 너비를 1 대 3 또는 3 대 1으로 분할하여 생성된 16x64, 48x64 크기의 파티션들을 포함할 수 있다.That is, the partition type of the coding unit 1600 of size 64x64 may include partitions of size 64x16 and 64x48 generated by dividing the height into one-to-three or three-to-one, such as 1610 or 1620. In addition, the partition type of the coding unit 1600 of size 64x64 may include partitions of size 16x64 or 48x64 generated by dividing the width into 1 to 3 or 3 to 1, such as 1630 or 1640.

도 17 은 일 실시예에 따라, 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보가 포함된 시퀀스 파라미터 세트의 신택스를 도시한다.17 is a diagram illustrating syntax of a sequence parameter set including information indicating whether to include a partition that divides a coding unit at an arbitrary ratio, according to an embodiment.

sequence_parameter_set은 현재 영상 슬라이스에 대한 시퀀스 파라미터 세트(1700)의 신택스를 나타낸다. 일 실시예에 따른 인터 예측을 위한 파티션 타입이 부호화 단위를 임의적 비율로 분할하는 파티션을 포함하는지 여부를 나타내는 정보가 영상 슬라이스에 대한 시퀀스 파라미터 세트(1700)의 신택스에 삽입되어 있는 일례가 상술된다. sequence_parameter_set represents the syntax of the sequence parameter set 1700 for the current image slice. An example in which information indicating whether a partition type for inter prediction includes a partition that splits a coding unit at an arbitrary ratio is inserted into the syntax of the sequence parameter set 1700 for an image slice.

picture_width는 입력 영상의 너비, picture_height는 입력 영상의 높이를 나타내는 신택스이며, max_coding_unit_size는 최대 부호화 단위의 크기, max_coding_unit_depth는 최대 심도를 나타내는 신택스이다.picture_width is the width of the input image, picture_height is the syntax indicating the height of the input image, max_coding_unit_size is the size of the maximum encoding unit, and max_coding_unit_depth is the syntax indicating the maximum depth.

일 실시예에서는, 시퀀스 파라미터의 예로서, 부호화 단위 레벨의 독립적 복호화 여부를 나타내는 정보(use_independent_cu_decode_flag), 부호화 단위 레벨의 독립적 파싱 여부를 나타내는 정보(use_independent_cu_parse_flag), 움직임 벡터 정확성 제어 동작의 이용가능성(use_mv_accuracy_control_flag), 임의적 방향성 인트라 예측 동작의 이용가능성(use_arbitrary_direction_intra_flag), 주파수 변환 상의 예측 부복호화 동작의 이용가능성(use_frequency_domain_prediction_flag), 회전변환 동작의 이용가능성(use_rotational_transform_flag), 트리 시그니피컨트 맵을 이용한 부복호화의 이용가능성(use_tree_significant_map_flag), 멀티 파라미터를 이용한 인트라 예측 부호화 동작의 이용가능성(use_multi_parameter_intra_prediction_flag), 개선된 움직임 벡터 예측 부호화 동작의 이용가능성(use_advanced_motion_vector_prediction_flag), 적응적 루프 필터링 동작의 이용가능성(use_adaptive_loop_filter_flag), 콰드트리 구조의 적응적 루프 필터링 동작의 이용가능성(use_quadtree_adaptive_loop_filter_flag), 양자화 파라미터의 델타값을 이용한 양자화 동작의 이용가능성(use_delta_qp_flag), 랜덤 노이즈 생성 동작의 이용가능성(use_random_noise_generation_flag), 부호화 단위의 인터 예측을 위한 임의적 형태의 파티션을 허용하는지 여부(use_arbitrary_motion_partition_flag)을 나타내는 정보들이 정의될 수 있다. 전술된 각종 동작의 이용가능성을 나타내는 신택스들은 현재 슬라이스의 부복호화 과정에서 해당 동작들이 이용되는지 여부를 정의하여 효율적인 부복호화가 가능하게 한다.(Use_independent_cu_decode_flag) indicating whether the coding unit level is independently decoded, use_independent_cu_decode_flag indicating whether to independently parse the coding unit level (use_independent_cu_parse_flag), availability (motion_mv_accuracy_control_flag) of the motion vector accuracy control operation, Availability of an arbitrary directional intra prediction operation (use_arbitrary_direction_intra_flag), availability of a prediction unit decoding operation on frequency conversion (use_frequency_domain_prediction_flag), availability of rotation conversion operation (use_rotational_transform_flag), availability of subdecryption using a tri- use_tree_significant_map_flag), the availability of the intra prediction encoding operation using the multi-parameter (use_multi_parameter_intra_prediction_flag), the availability of the improved motion vector prediction encoding operation (use_advanced_motion_vector_prediction_flag) Availability of adaptive loop filtering operation (use_adaptive_loop_filter_flag), availability of quadrature adaptive loop filtering operation (use_quadtree_adaptive_loop_filter_flag), availability of quantization operation using delta value of quantization parameter (use_delta_qp_flag), use of random noise generation operation Information indicating a possibility (use_random_noise_generation_flag) and whether to use an arbitrary type partition for inter prediction of a coding unit (use_arbitrary_motion_partition_flag) may be defined. Syntaxes indicating the availability of the above-described various operations enable efficient decryption by defining whether the corresponding operations are used in the current decoding process of the slice.

특히, 적응적 루프 필터링 동작의 이용가능성(use_adaptive_loop_filter_flag) 및 콰드트리 구조의 적응적 루프 필터링 동작의 이용가능성(use_quadtree_adaptive_loop_filter_flag)에 따라, 적응적 루프 필터의 필터 길이(alf_filter_length), 적응적 루프 필터 타입(alf_filter_type), 적응적 루프 필터 계수의 양자화를 위한 기준값(alf_qbits), 적응적 루프 필터링에서의 컬러 성분의 개수(alf_num_color)가 시퀀스 파라미터 세트(1700)에서 정의될 수 있다. In particular, according to the availability of the adaptive loop filtering operation (use_adaptive_loop_filter_flag) and the availability of the adaptive loop filtering operation of the quadtree structure (use_quadtree_adaptive_loop_filter_flag), the filter length of the adaptive loop filter (alf_filter_length), the adaptive loop filter type (alf_filter_type) ), Reference values (alf_qbits) for quantization of the adaptive loop filter coefficients, and the number of color components (alf_num_color) in adaptive loop filtering may be defined in the sequence parameter set 1700.

일 실시예에 따른 비디오 부호화 장치(1400) 및 일 실시예에 따른 비디오 복호화 장치(1500)에서 이용되는 부호화 단위의 심도, 코딩 툴 및 작동 모드의 대응 관계에 관한 정보는, 부호화 단위의 심도(uiDepth)에 따라 대응되는 인터 예측의 작동 모드(mvp_mode[uiDepth]) 및 트리 시그니피컨트 맵 중 시그니피컨트 맵의 종류를 나타내는 작동 모드(significant_map_mode[uiDepth])를 포함할 수 있다. 즉, 일 실시예에 따른 부호화 단위의 심도에 따른, 인터 예측 및 해당 작동 모드의 대응 관계 또는 트리 시그니피컨트 맵을 이용한 부복호화 및 해당 작동 모드의 대응 관계가 시퀀스 파라미터 세트(1700)에서 설정될 수 있다.The information on the correspondence relationship between the depth of the coding unit, the coding tool, and the operation mode used in the video coding apparatus 1400 and the video coding apparatus 1500 according to an embodiment is determined by the depth of the coding unit uiDepth (Mvp_mode [uiDepth]) corresponding to the inter prediction in accordance with the inter-prediction prediction mode (mvp_mode [uiDepth]) and the operation mode (significant_map_mode [uiDepth]) indicating the type of the signify map among the tri- That is, a correspondence relationship between inter prediction and a corresponding operation mode, or an encoding and decoding using a tree signature map and a corresponding relationship between corresponding operation modes according to a depth of a coding unit according to an embodiment may be set in the sequence parameter set 1700. Can be.

입력 샘플의 비트 뎁스(input_sample_bit_depth) 및 내부 샘플의 비트 뎁스(internal_sample_bit_depth)도 또한 시퀀스 파라미터 세트(1700)에서 설정될 수 있다. The bit depth (input_sample_bit_depth) of the input sample and the bit depth (internal_sample_bit_depth) of the internal sample may also be set in the sequence parameter set 1700.

일 실시예에 따른 비디오 복호화 장치(1500)는 시퀀스 파라미터를 판독하여, 부호화 단위의 인터 예측을 위한 임의적 형태의 파티션을 허용하는지 여부(use_arbitrary_motion_partition_flag)을 나타내는 정보를 추출하고, 해당 시퀀스에서 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 결정할 수 있다.The video decoding apparatus 1500 according to an embodiment may read a sequence parameter to extract information indicating whether a partition of arbitrary form for inter prediction of a coding unit is allowed (use_arbitrary_motion_partition_flag), and the coding unit may be arbitrarily selected in the sequence. It may be determined whether to perform inter prediction using a partition divided by a ratio.

일 실시예에 따른 비디오 부호화 장치(1400) 및 일 실시예에 따른 비디오 복호화 장치(1500)가 이용하는, 부호화 단위의 인터 예측을 위한 임의적 형태의 파티션을 허용하는지 여부(use_arbitrary_motion_partition_flag)를 나타내는 정보는, 도 22에서 도시된 시퀀스 파라미터 세트(1700)에 삽입된 실시예에 한정되지 않고, 최대 부호화 단위, 슬라이스, 프레임, 픽처, GOP 등의 단위로 설정되어 부복호화될 수 있다. Information indicating whether an arbitrary type of partition for inter prediction of a coding unit is used, which is used by the video encoding apparatus 1400 and the video decoding apparatus 1500 according to an embodiment, is shown in FIG. The present invention is not limited to the embodiment inserted into the sequence parameter set 1700 illustrated in FIG. 22, and may be set and encoded in units of a maximum coding unit, slice, frame, picture, GOP, and the like.

슬라이스 헤더에 부호화 단위의 인터 예측을 위한 임의적 형태의 파티션을 허용하는지 여부(use_arbitrary_motion_partition_flag)를 나타내는 정보가 '참'값이라면 해당 슬라이스에서 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행을 수행하고, '거짓'값을 가진다면 해당 슬라이스에서 부호화 단위의 너비 및 높이 중 적어도 하나가 1 대 1로 분할된 파티션만을 이용하여 인터 예측을 수행한다.If the information indicating whether an arbitrary type of partition for inter prediction of a coding unit is allowed (use_arbitrary_motion_partition_flag) is 'true' in a slice header, inter prediction is performed using a partition in which a coding unit is split at an arbitrary ratio in the slice. If the value has a 'false' value, inter prediction is performed using only a partition in which at least one of the width and the height of the coding unit is divided one-to-one in the slice.

도 18 은 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 따른 비디오 부호화 방법의 흐름도를 도시한다.18 is a flowchart of a video encoding method according to inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.

단계 1810에서, 비디오 데이터가 최대 크기의 부호화 단위인 적어도 하나의 최대 부호화 단위로 분할된다.In operation 1810, the video data is divided into at least one largest coding unit that is a coding unit having a maximum size.

단계 1820에서, 최대 부호화 단위의 적어도 하나의 분할 영역 별로 계층적 구조의 심도별 부호화 단위에 기초하여 최대 부호화 단위의 비디오 데이터가 부호화되고, 부호화 결과가 출력될 심도인 부호화 심도가 결정된다. 일 실시예에 따라 인터 예측은 부호화 단위가 임의적 비율로 분할된 파티션을 선택적으로 이용할 수 있다. 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부는 프레임 시퀀스, 프레임, 슬라이스, 부호화 단위 등의 데이터 단위 별로 설정될 수 있다.In operation 1820, video data of a maximum coding unit is encoded based on depth-based coding units of a hierarchical structure for at least one divided region of the maximum coding unit, and a coding depth that is a depth to which an encoding result is output is determined. According to an embodiment, inter prediction may selectively use partitions in which coding units are divided at an arbitrary ratio. Whether to perform inter prediction using a partition in which coding units are divided at arbitrary ratios may be set for each data unit such as a frame sequence, a frame, a slice, and a coding unit.

단계 1830에서, 최대 부호화 단위별로, 분할 영역 별 부호화 심도에 대응하는 부호화된 비디오 데이터, 부호화 심도 및 부호화 모드에 관한 정보를 포함하는 비트스트림이 출력될 수 있다. 일 실시예에 따라 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 나타내는 정보가 부호화되어 비트스트림에 삽입되어 출력될 수 있다. In operation 1830, a bitstream including encoded video data, an encoded depth, and information about an encoding mode corresponding to an encoded depth of each divided region may be output for each largest coding unit. According to an embodiment, information indicating whether to perform inter prediction using a partition in which a coding unit is divided at an arbitrary ratio may be encoded, inserted into a bitstream, and output.

도 19 은 일 실시예에 따라 임의적 비율로 분할된 파티션을 이용한 인터 예측에 의한 비디오 복호화 방법의 흐름도를 도시한다.19 is a flowchart of a video decoding method by inter prediction using a partition divided at an arbitrary ratio, according to an embodiment.

단계 1910에서, 부호화된 비디오에 대한 비트스트림이 수신되어 심볼들이 파싱된다. In step 1910, a bitstream for the encoded video is received and the symbols parsed.

단계 1920에서, 비트스트림으로부터, 최대 부호화 단위별로 부호화된 비디오 데이터, 및 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보가 추출된다. 일 실시예에 따라 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 나타내는 정보가 비트스트림으로부터 추출될 수도 있다. 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 나타내는 정보는 시퀀스 파라미터 세트, 슬라이스 헤더, 부호화 단위별 부호화 정보 등으로부터 추출될 수 있다.In operation 1920, video data encoded for each largest coding unit and information about a coded depth and an encoding mode for each maximum coding unit are extracted from the bitstream. According to an embodiment, information indicating whether to perform inter prediction using a partition in which a coding unit is divided at an arbitrary ratio may be extracted from the bitstream. Information indicating whether to perform inter prediction using a partition in which coding units are divided at arbitrary ratios may be extracted from a sequence parameter set, a slice header, and encoding information for each coding unit.

단계 1930에서, 최대 부호화 단위별 부호화 심도 및 부호화 모드에 관한 정보에 기초하여, 최대 부호화 단위별로 적어도 하나의 부호화 심도별 부호화 단위마다, 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 움직임 보상을 포함하는 복호화가 수행될 수 있다. 부호화 단위가 임의적 비율로 분할된 파티션을 이용한 움직임 보상을 포함하는 복호화가 수행될지 여부는, 앞서 비트스트림으로부터 추출된 부호화 단위가 임의적 비율로 분할된 파티션을 이용하여 인터 예측을 수행할지 여부를 나타내는 정보에 따라 선택적으로 결정될 수 있다.In operation 1930, based on the information about the coded depth and the encoding mode for each maximum coding unit, for each of the at least one coding unit for each coding depth, the motion compensation using a partition in which the coding unit is divided at an arbitrary ratio may be included. Decryption can be performed. Whether decoding including motion compensation using a partition in which a coding unit is divided at an arbitrary ratio is performed is information indicating whether to perform inter prediction using a partition in which a coding unit extracted from the bitstream is split at an arbitrary ratio. It can be determined optionally depending on.

일 실시예에 따른 비디오 부호화 방법 및 비디오 복호화 방법과 같이 임의적 비율로 분할되는 파티션을 이용한 인터 예측이 가능한 경우, 현재 심도별 부호화 단위를 하위 심도로 더 분할 필요 없이 현재 심도에서 어느 한 쪽으로 치우쳐 분할되는 파티션을 이용하여 인터 예측을 수행할 수 있다. When inter prediction using a partition divided at an arbitrary ratio is possible, such as a video encoding method and a video decoding method according to an embodiment, the coding unit for each current depth is split to be offset from one side of the current depth without further dividing into lower depths. Inter prediction may be performed using a partition.

또한, 부호화 단위의 파티션이 부호화 단위의 높이 또는 너비를 일 대 일로 반분하는 파티션 뿐만 아니라, 임의적 비율로 분할하는 파티션 또는 임의적 형태의 파티션을 포함할지 여부를 선택할 수 있으므로, 기존의 임의적 비율로 분할된 파티션을 지원하지 못하는 부복호화 시스템에서도 일 실시예에 따른 비디오 부호화 방법 및 비디오 복호화 방법을 이용할 수 있다. 따라서, 필요에 따라 일 실시예의 비디오 부복호화 방법에 따른 정확하고 효율적인 예측 부호화가 선택적으로 수행될 수 있다.In addition, since the partition of the coding unit may include not only a partition that splits the height or width of the coding unit in one-to-one, but also a partition that divides at an arbitrary ratio or an arbitrary type of partition, the partition of the coding unit may be divided at an existing arbitrary ratio. A video encoding method and a video decoding method according to an embodiment may also be used in a decoding system that does not support partitioning. Therefore, if necessary, accurate and efficient prediction encoding according to the video encoding and decoding method of an embodiment may be selectively performed.

한편, 상술한 본 발명의 실시예들은 컴퓨터에서 실행될 수 있는 프로그램으로 작성가능하고, 컴퓨터로 읽을 수 있는 기록매체를 이용하여 상기 프로그램을 동작시키는 범용 디지털 컴퓨터에서 구현될 수 있다. 상기 컴퓨터로 읽을 수 있는 기록매체는 마그네틱 저장매체(예를 들면, 롬, 플로피 디스크, 하드디스크 등) 및 광학적 판독 매체(예를 들면, 시디롬, 디브이디 등)와 같은 저장매체를 포함한다.The above-described embodiments of the present invention can be embodied in a general-purpose digital computer that can be embodied as a program that can be executed by a computer and operates the program using a computer-readable recording medium. The computer-readable recording medium includes a storage medium such as a magnetic storage medium (e.g., ROM, floppy disk, hard disk, etc.) and an optical reading medium (e.g., CD-ROM, DVD, etc.).

이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the disclosed embodiments should be considered in an illustrative rather than a restrictive sense. The scope of the present invention is defined by the appended claims rather than by the foregoing description, and all differences within the scope of equivalents thereof should be construed as being included in the present invention.

Claims

Generating asymmetric partition permission information indicating whether an asymmetric partition type can be used for inter prediction;
Generating split information of the coding unit indicating whether the height and the width of one coding unit are divided into smaller coding units among at least one coding unit included in the largest coding unit;
Generating information about a prediction mode of the coding unit to indicate a prediction mode representing one of a skip mode, an intra mode, and an inter mode;
Generating information about a partition type of the coding unit to indicate a partition type representing a height and a width of at least one prediction unit determined by partitioning the coding unit; And
Generating information on the size of at least one transform unit determined by partitioning the coding unit,
When the asymmetric partition type can be used, the partition type of the coding unit is one of the asymmetric partition type and the symmetric partition type,
If the asymmetric partition type cannot be used, the partition type of the coding unit is the symmetric partition type except the asymmetric partition type,
When the coding unit is not split into the smaller coding units, a prediction mode of the coding unit is determined and the prediction unit is generated from the coding unit,
And when the coding unit is not divided into the smaller coding unit, the transformation unit unit is generated from the coding unit.

In the video encoding apparatus,
Generate asymmetric partition permission information indicating whether an asymmetric partition type can be used for inter prediction, and split the height and width of one coding unit among at least one coding unit included in the largest coding unit A coding unit determiner configured to generate split information of the coding unit indicating whether to split into coding units having a size;
In order to indicate a prediction mode representing one of a skip mode, an intra mode, and an inter mode, information about a prediction mode of the coding unit is generated, and indicates a height and a width of at least one prediction unit determined by partitioning the coding unit. A prediction unit generating information on a partition type of the coding unit to indicate a partition type;
A transformation performing unit configured to generate information on the size of at least one transformation unit determined by partitioning the coding unit;
When the asymmetric partition type can be used, the partition type of the coding unit is one of the asymmetric partition type and the symmetric partition type,
If the asymmetric partition type cannot be used, the partition type of the coding unit is the symmetric partition type except the asymmetric partition type,
When the coding unit is not split into the smaller coding units, a prediction mode of the coding unit is determined and the prediction unit is generated from the coding unit,
And when the coding unit is not divided into the smaller coding unit, the transformation unit unit is generated from the coding unit.

Obtaining from the bitstream asymmetric partition permission information indicating whether a partition type for inter prediction includes an asymmetric partition type indicating an asymmetric prediction unit and a size of a coding unit;
Determining maximum coding units split from an image based on the information about the size of the coding unit;
Obtaining split information of a coding unit of a current depth from a bitstream among coding units split from one of the largest coding units among the maximum coding units;
Determining coding units of a lower depth generated by dividing the height and width of the coding unit of the current depth by splitting information of the coding unit of the current depth;
Obtaining partition type information of a coding unit of the current depth from the bitstream when split information of the coding unit of the current depth indicates undivision;
If an asymmetric partition type is allowed according to the asymmetric partition permission information, a non-partition prediction unit, a symmetric prediction unit, or an asymmetric for performing inter prediction on a coding unit of the current depth using the partition type information. Determining a prediction unit;
If the asymmetric partition type is not allowed according to the asymmetric partition permission information, the non-divisional prediction unit or the symmetric prediction unit for performing inter prediction on the coding unit of the current depth using the partition type information. Determining;
Performing inter prediction on a coding unit of the current depth using the determined prediction unit; And
Determining one or more transformation units from coding units of the current depth when split information of the coding units of the current depth indicates undivision; And
Performing inverse transform on the coding unit of the current depth using the transformation unit,
The non-divided prediction unit is a prediction unit having the same size as a coding unit of the current depth, and the symmetric prediction unit is a prediction unit generated by dividing one of the height and the width of the coding unit, and the asymmetric prediction unit is And a prediction unit generated by dividing one of a height and a width of the coding unit by an asymmetrical ratio.

A video decoding apparatus comprising:
Asymmetric partition permission information indicating whether a partition type for inter prediction includes an asymmetric partition type indicating an asymmetric prediction unit and information about a size of a coding unit are obtained from a bitstream, and Based on the information, determine the largest coding units split from the image, obtain split information of the coding unit of the current depth among the coding units split from one of the largest coding units, from the bitstream, and A coding unit determiner configured to determine coding units of a lower depth generated by dividing the height and the width of the coding unit of the current depth when split information of the coding unit of the current depth indicates splitting;
When the partition information of the coding unit of the current depth indicates undivision, when partition type information of the coding unit of the current depth is obtained from the bitstream, and if an asymmetric partition type is allowed according to the asymmetric partition permission information, The non-splitting prediction unit, the symmetric prediction unit, or the asymmetric prediction unit for performing inter prediction on the coding unit of the current depth is determined using the partition type information, and is asymmetric according to the asymmetric partition tolerance information. If the partition type is not allowed, the non-splitting prediction unit or the symmetric prediction unit for performing inter prediction on the coding unit of the current depth is determined using the partition type information, and the determined prediction unit is used. Perform inter prediction on the coding unit of the current depth It predicts execution unit; And
When the split information of the coding unit of the current depth indicates non-division, an inverse transform for determining one or more transformation units from the coding units of the current depth, and performing an inverse transformation on the coding units of the current depth using the transformation units. Including wealth,
The non-divided prediction unit is a prediction unit having the same size as a coding unit of the current depth, and the symmetric prediction unit is a prediction unit generated by dividing one of the height and the width of the coding unit, and the asymmetric prediction unit is And a prediction unit generated by dividing one of a height and a width of the coding unit by an asymmetrical ratio.

In a computer-readable recording medium storing a bitstream, the bitstream,
Asymmetric partition permission information to indicate whether an asymmetric partition type can be used for inter prediction;
Splitting information of the coding unit for indicating whether the height and the width of one coding unit are divided into coding units having a smaller size among at least one coding unit included in the largest coding unit;
Information on a partition type of the coding unit for indicating a partition type indicating a height and a width of at least one prediction unit determined by partitioning the coding unit; And
Includes information about the size of at least one transform unit determined by partitioning the coding unit,
If the asymmetric partition permission information indicates that an asymmetric partition type can be used, the information on the partition type of the coding unit indicates one of asymmetric partition types and symmetric partition types,
In case the asymmetric partition permission information indicates that the asymmetric partition type cannot be used, the information on the partition type of the coding unit indicates the symmetric partition types except for the asymmetric partition types,
When the coding unit is not divided into coding units having a smaller size, information about a partition type of the coding unit is included in the bitstream,
And when the coding unit is not divided into coding units having a smaller size, information about a size of a transformation unit of the coding unit is included in the bitstream.