KR102472462B1

KR102472462B1 - Method and apparatus for adaptive encoding and decoding based on image complexity

Info

Publication number: KR102472462B1
Application number: KR1020160131506A
Authority: KR
Inventors: 고현석; 김휘용; 강정원; 임성창; 이진호; 전동산; 최진수
Original assignee: 한국전자통신연구원
Priority date: 2015-10-13
Filing date: 2016-10-11
Publication date: 2022-12-01
Also published as: KR20220163912A; KR20170043461A

Abstract

영상 복잡도에 기반한 적응적 부호화 및 복호화를 위한 방법 및 장치가 개시된다. 부호화 장치 및 복호화 장치는 대상 유닛의 영상 복잡도에 기반하여 대상 유닛의 타입을 결정한다. 부호화 장치 및 복호화 장치는 대상 유닛에 대해 결정된 타입에 따라서 대상 유닛에 대한 인터 예측을 수행한다. 대상 유닛의 타입은 정적 유닛 및 동적 유닛을 포함한다. 대상 유닛의 타입에 따라서, 대상 유닛의 분할의 최대 깊이가 결정되고, 대상 유닛에 대한 탐색 범위가 결정된다.A method and apparatus for adaptive encoding and decoding based on image complexity are disclosed. The encoding device and the decoding device determine the type of the target unit based on the image complexity of the target unit. The encoding device and the decoding device perform inter prediction on the target unit according to the type determined for the target unit. Types of target units include static units and dynamic units. According to the type of the target unit, a maximum depth of segmentation of the target unit is determined, and a search range for the target unit is determined.

Description

Method and apparatus for adaptive encoding and decoding based on image complexity {METHOD AND APPARATUS FOR ADAPTIVE ENCODING AND DECODING BASED ON IMAGE COMPLEXITY}

아래의 실시예들은 영상의 복호화 방법, 복호화 장치, 부호화 방법 및 부호화 장치에 관한 것으로서, 보다 상세하게는 영상의 복잡도에 기반하여 부호화 및 복호화를 적응적으로 수행하는 방법 및 장치에 관한 것이다.The following embodiments relate to an image decoding method, a decoding apparatus, an encoding method, and an encoding apparatus, and more particularly, to a method and apparatus for adaptively performing encoding and decoding based on complexity of an image.

정보 통신 산업의 지속적인 발달을 통해 HD(High Definition) 해상도를 가지는 방송 서비스가 세계적으로 확산되었다. 이러한 확산을 통해, 많은 사용자들이 고해상도이며 고화질인 영상(image) 및/또는 비디오(video)에 익숙해지게 되었다.Through the continuous development of the information communication industry, broadcasting services having high definition (HD) resolution have been spread worldwide. Through this proliferation, many users have become accustomed to high-resolution and high-definition images and/or videos.

높은 화질에 대한 사용자들의 수요를 만족시키기 위하여, 많은 기관들이 차세대 영상 기기에 대한 개발에 박차를 가하고 있다. 에이치디티브이(High Definition TV; HDTV) 및 풀에이치디(Full HD; FHD) TV뿐만 아니라, FHD TV에 비해 4배 이상의 해상도를 갖는 울트라에이치디(Ultra High Definition; UHD) TV에 대한 사용자들의 관심이 증대하였고, 이러한 관심의 증대에 따라, 더 높은 해상도 및 화질을 갖는 영상에 대한 영상 부호화(encoding)/복호화(decoding) 기술이 요구된다.In order to satisfy users' demand for high image quality, many organizations are accelerating the development of next-generation video devices. Users' interest in Ultra High Definition (UHD) TV, which has four times higher resolution than FHD TV, as well as High Definition TV (HDTV) and Full HD (FHD) TV. has increased, and according to this increase in interest, an image encoding/decoding technology for an image having a higher resolution and quality is required.

영상 부호화(encoding)/복호화(decoding) 장치 및 방법은 고해상도 및 고화질의 영상에 대한 부호화/복호화를 수행하기 위해, 인터(inter) 예측(prediction) 기술, 인트라(intra) 예측 기술 및 엔트로피 부호화 기술 등을 사용할 수 있다. 인터 예측 기술은 시간적으로(temporally) 이전의 픽쳐 및/또는 시간적으로 이후의 픽쳐를 이용하여 현재 픽쳐에 포함된 픽셀의 값을 예측하는 기술일 수 있다. 인트라 예측 기술은 현재 픽쳐 내의 픽셀의 정보를 이용하여 현재 픽쳐에 포함된 픽셀의 값을 예측하는 기술일 수 있다. 엔트로피 부호화 기술은 출현 빈도가 높은 심볼에는 짧은 코드(code)를 할당하고, 출현 빈도가 낮은 심볼에는 긴 코드를 할당하는 기술일 수 있다.An image encoding/decoding device and method for encoding/decoding high-resolution and high-quality images, such as inter prediction technology, intra prediction technology, entropy encoding technology, etc. can be used. The inter-prediction technique may be a technique of predicting a value of a pixel included in a current picture using a temporally previous picture and/or a temporally subsequent picture. Intra-prediction technology may be a technology of predicting a value of a pixel included in a current picture by using pixel information in the current picture. The entropy coding technique may be a technique of allocating a short code to a symbol with a high frequency of occurrence and a long code to a symbol with a low frequency of occurrence.

영상의 부호화 및 복호화에 있어서, 예측은 원본 신호와 유사한 예측 신호를 생성하는 것을 의미할 수 있다. 예측은 크게 공간적(spatial) 복원(reconstructed) 영상을 참조하는 예측, 시간적(temporal) 복원 영상을 참조하는 예측 및 그 밖의 심볼에 대한 예측으로 분류될 수 있다. 말하자면, 시간적 참조는 시간적 복원 영상을 참조하는 것을 의미할 수 있고, 공간적 참조는 공간적 복원 영상을 참조하는 것을 의미할 수 있다.In video encoding and decoding, prediction may mean generating a prediction signal similar to an original signal. Prediction can be largely classified into prediction referring to a spatially reconstructed image, prediction referring to a temporally reconstructed image, and prediction of other symbols. In other words, temporal reference may mean referring to a temporally reconstructed image, and spatial reference may mean referring to a spatially reconstructed image.

현재 블록은 현재 부호화 또는 복호화의 대상인 블록일 수 있다. 현재 블록은 대상 블록 또는 대상 유닛으로 명명될 수 있다. 부호화에 있어서, 현재 블록은 부호화 대상 블록 또는 부호화 대상 유닛으로 명명될 수 있다. 복호화에 있어서, 현재 블록은 복호화 대상 블록 또는 복호화 대상 유닛으로 명명될 수 있다.The current block may be a block currently being encoded or decoded. The current block may be named a target block or target unit. In encoding, a current block may be named an encoding target block or an encoding target unit. In decoding, a current block may be named a decoding object block or a decoding object unit.

인터(inter) 예측은 시간적 참조 및 공간적 참조를 사용하여 현재 블록을 예측하는 기술일 수 있다. 인트라(intra) 예측은 공간적 참조만을 사용하여 현재 블록을 예측하는 기술일 수 있다.Inter prediction may be a technique of predicting a current block using temporal and spatial references. Intra prediction may be a technique of predicting a current block using only spatial reference.

일반적으로, 다수의 영상들의 블록들의 인터 예측에 있어서, 동일한 인터 예측이 사용된다. 예를 들면, 현재 블록을 포함하는 영상의 영상 특성이 고려되지 않은 채, 동일한 인터 예측이 적용될 수 있다. 항상 동일한 인터 예측이 사용될 경우, 부호화 장치의 복잡도가 증가할 수 있고, 부호화의 효율이 저하될 수 있다. 특히, 정적 영역을 포함하는 코딩(coding) 유닛의 예측에 있어서, 필요 이상의 세분화된 블록들에 대해 인터 예측들이 수행될 경우, 부호화 장치로부터 복호화 장치로 전송되어야 하는 움직임 정보의 양이 증가할 수 있고, 움직임 정보의 양의 증가에 의해 부호화 효율이 저하될 수 있다.In general, for inter prediction of blocks of multiple images, the same inter prediction is used. For example, the same inter prediction may be applied without considering the image characteristics of the image including the current block. When the same inter prediction is always used, the complexity of an encoding device may increase and encoding efficiency may decrease. In particular, in prediction of a coding unit including a static region, when inter prediction is performed on more subdivided blocks than necessary, the amount of motion information to be transmitted from the encoding device to the decoding device may increase. , encoding efficiency may decrease due to an increase in the amount of motion information.

일 실시예는 영상의 특성에 기반한 적응적 부호화 및 복호화를 수행하는 기술을 통해, 부호화 효율을 향상시키는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for improving encoding efficiency through a technique of performing adaptive encoding and decoding based on characteristics of an image.

일 실시예는 영상의 특성에 기반한 적응적 부호화 및 복호화를 수행하는 기술을 통해, 부호화 장치의 복잡도를 감소시키는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for reducing the complexity of an encoding device through a technique for performing adaptive encoding and decoding based on characteristics of an image.

일 실시예는 정적 영역으로 분류(classify)된 코딩 유닛이 쿼드트리(quadtree)에 기반하여 하위 유닛으로 분할되는 횟수를 제약하는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for limiting the number of times a coding unit classified as a static region is divided into subunits based on a quadtree.

일 실시예는 정적 영역으로 분류된 코딩 유닛이 하위 유닛으로 분할되는 횟수를 제약함으로써 코딩 유닛을 보다 큰 크기의 유닛으로서 부호화하는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for encoding a coding unit as a larger-sized unit by limiting the number of times a coding unit classified as a static region is divided into sub-units.

일 실시예는 정적 영역으로 분류된 코딩 유닛을 보다 큰 크기의 유닛으로서 부호화함으로써 부호화 장치의 복잡도를 감소시키는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for reducing the complexity of an encoding device by encoding a coding unit classified as a static region as a larger unit.

일 실시예는 정적 영역으로 분류된 코딩 유닛을 보다 큰 크기의 유닛으로서 부호화함으로써 부호화 장치로부터 복호화 장치로 전송되는 움직임 정보의 양을 감소시키는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for reducing the amount of motion information transmitted from an encoding device to a decoding device by encoding a coding unit classified as a static region as a larger unit.

일 실시예는, 정적 유닛 및 인접한 이웃 블록 간의 상관도(correlation)가 높다는 가정 하에서, 움직임 벡터의 비트 정밀도(bit precision) 및 탐색 영역(search range)을 감소시키는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for reducing bit precision and search range of a motion vector under the assumption that correlation between a static unit and an adjacent block is high.

일 실시예는 움직임 벡터의 비트 정밀도 및 탐색 영역을 감소시킴으로써 부호화 효율을 향상시키고, 부호화 장치의 복잡도를 감소시키는 방법 및 장치를 제공할 수 있다.An embodiment may provide a method and apparatus for improving encoding efficiency and reducing the complexity of an encoding device by reducing bit precision and a search region of a motion vector.

일 측에 있어서, 대상 유닛의 영상 복잡도에 기반하여 대상 유닛의 타입을 결정하는 단계; 및 상기 결정된 타입에 따라서 상기 대상 유닛에 대한 인터 예측을 수행하는 단계를 포함하는 대상 유닛에 대한 인터 예측 방법이 제공된다.In one aspect, determining a type of a target unit based on an image complexity of the target unit; and performing inter prediction on the target unit according to the determined type.

영상의 특성에 기반한 적응적 부호화 및 복호화를 수행하는 기술을 통해, 부호화 효율을 향상시키는 방법 및 장치가 제공된다. A method and apparatus for improving coding efficiency through a technique for performing adaptive encoding and decoding based on the characteristics of an image are provided.

영상의 특성에 기반한 적응적 부호화 및 복호화를 수행하는 기술을 통해, 부호화 장치의 복잡도를 감소시키는 방법 및 장치가 제공된다.A method and apparatus for reducing the complexity of an encoding device through a technique of performing adaptive encoding and decoding based on characteristics of an image are provided.

정적 영역으로 분류(classify)된 코딩 유닛이 쿼드크리(quadtree)에 기반하여 하위 유닛으로 분할되는 횟수를 제약하는 방법 및 장치가 제공된다.A method and apparatus for limiting the number of times a coding unit classified as a static region is divided into subunits based on a quadtree are provided.

정적 영역으로 분류된 코딩 유닛이 하위 유닛으로 분할되는 횟수를 제약함으로써 코딩 유닛을 보다 큰 크기의 유닛으로서 부호화하는 방법 및 장치를 제공가 제공된다.Provided is a method and apparatus for encoding a coding unit as a larger unit by limiting the number of times a coding unit classified as a static region is divided into sub-units.

정적 영역으로 분류된 코딩 유닛을 보다 큰 크기의 유닛으로서 부호화함으로써 부호화 장치의 복잡도를 감소시키는 방법 및 장치가 제공된다.A method and apparatus for reducing the complexity of an encoding device by encoding a coding unit classified as a static region as a larger unit are provided.

정적 영역으로 분류된 코딩 유닛을 보다 큰 크기의 유닛으로서 부호화함으로써 부호화 장치로부터 복호화 장치로 전송되는 움직임 정보의 양을 감소시키는 방법 및 장치가 제공된다.A method and apparatus for reducing the amount of motion information transmitted from an encoding apparatus to a decoding apparatus by encoding a coding unit classified as a static region as a larger unit are provided.

정적 유닛 및 인접한 이웃 블록 간의 상관도(correlation)가 높다는 가정 하에서, 움직임 벡터의 비트 정밀도(bit precision) 및 탐색 영역(search range)을 감소시키는 방법 및 장치가 제공된다.Provided is a method and apparatus for reducing bit precision and search range of a motion vector under the assumption that correlation between a static unit and an adjacent block is high.

움직임 벡터의 비트 정밀도 및 탐색 영역를 감소시킴으로써 부호화 효율을 향상시키고, 부호화 장치의 복잡도를 감소시키는 방법 및 장치가 제공된다.A method and apparatus for improving encoding efficiency and reducing the complexity of an encoding device by reducing the bit precision and search range of motion vectors are provided.

도 1은 본 발명이 적용되는 부호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.
도 2는 본 발명이 적용되는 복호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.
도 3은 영상을 부호화 및 복호화할 때의 영상의 분할 구조를 개략적으로 나타내는 도면이다.
도 4a 내지 도 7h는 코딩 유닛(CU)이 포함할 수 있는 예측 유닛(PU)의 형태를 도시한 도면이다.
도 5는 코딩 유닛(CU)에 포함될 수 있는 변환 유닛(TU)의 형태를 도시한 도면이다.
도 6은 인트라 예측 과정의 실시예를 설명하기 위한 도면이다.
도 7은 인터 예측 과정의 실시예를 설명하기 위한 도면이다.
도 8은 일 실시예에 따른 인터 예측 방법의 흐름도이다.
도 9는 일 예에 따른 현재 픽쳐 및 참조 픽쳐의 차분 영상을 나타낸다.
도 10은 일 예에 따른 대상 유닛 및 복원된 이웃 유닛들의 타입 플래그들을 나타낸다.
도 11은 일 실시예에 따른 부호화 장치 및/또는 복호화 장치를 구현하는 전자 장치의 구조도이다.1 is a block diagram showing a configuration according to an embodiment of an encoding device to which the present invention is applied.
2 is a block diagram showing a configuration according to an embodiment of a decoding device to which the present invention is applied.
3 is a diagram schematically illustrating a division structure of an image when encoding and decoding an image.
4A to 7H are diagrams illustrating a form of a prediction unit (PU) that may be included in a coding unit (CU).
5 is a diagram illustrating a form of a transform unit (TU) that may be included in a coding unit (CU).
6 is a diagram for explaining an embodiment of an intra prediction process.
7 is a diagram for explaining an embodiment of an inter prediction process.
8 is a flowchart of an inter prediction method according to an embodiment.
9 illustrates differential images of a current picture and a reference picture according to an example.
10 illustrates type flags of a target unit and reconstructed neighboring units according to an example.
11 is a structural diagram of an electronic device implementing an encoding device and/or a decoding device according to an embodiment.

후술하는 예시적 실시예들에 대한 상세한 설명은, 특정 실시예를 예시로서 도시하는 첨부 도면을 참조한다. 이들 실시예는 당업자가 실시예를 실시할 수 있기에 충분하도록 상세히 설명된다. 다양한 실시예들은 서로 다르지만 상호 배타적일 필요는 없음이 이해되어야 한다. 예를 들어, 여기에 기재되어 있는 특정 형상, 구조 및 특성은 일 실시예에 관련하여 본 발명의 정신 및 범위를 벗어나지 않으면서 다른 실시예로 구현될 수 있다. 또한, 각각의 개시된 실시예 내의 개별 구성요소의 위치 또는 배치는 실시예의 정신 및 범위를 벗어나지 않으면서 변경될 수 있음이 이해되어야 한다. 따라서, 후술하는 상세한 설명은 한정적인 의미로서 취하려는 것이 아니며, 예시적 실시예들의 범위는, 적절하게 설명된다면, 그 청구항들이 주장하는 것과 균등한 모든 범위와 더불어 첨부된 청구항에 의해서만 한정된다.DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS For detailed descriptions of exemplary embodiments described below, reference is made to the accompanying drawings, which illustrate specific embodiments by way of example. These embodiments are described in sufficient detail to enable those skilled in the art to practice the embodiments. It should be understood that the various embodiments are different, but need not be mutually exclusive. For example, specific shapes, structures, and characteristics described herein may be implemented in one embodiment in another embodiment without departing from the spirit and scope of the invention. Additionally, it should be understood that the location or arrangement of individual components within each disclosed embodiment may be changed without departing from the spirit and scope of the embodiment. Accordingly, the detailed description set forth below is not to be taken in a limiting sense, and the scope of the exemplary embodiments, if properly described, is limited only by the appended claims, along with all equivalents as claimed by those claims.

도면에서 유사한 참조부호는 여러 측면에 걸쳐서 동일하거나 유사한 기능을 지칭한다. 도면에서의 요소들의 형상 및 크기 등은 보다 명확한 설명을 위해 과장될 수 있다.Like reference numbers in the drawings indicate the same or similar function throughout the various aspects. The shapes and sizes of elements in the drawings may be exaggerated for clarity.

어떤 구성요소(component)가 다른 구성요소에 "연결되어" 있다거나 "접속되어" 있다고 언급된 때에는, 상기의 2개의 구성요소들이 서로 간에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있으나, 상기의 2개의 구성요소들의 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 또한, 예시적 실시예들에서 특정 구성을 "포함"한다고 기술하는 내용은 상기의 특정 구성 이외의 구성을 배제하는 것이 아니며, 추가적인 구성이 예시적 실시예들의 실시 또는 예시적 실시예들의 기술적 사상의 범위에 포함될 수 있음을 의미한다.When a component is referred to as being “connected” or “connected” to another component, the two components may be directly connected or connected to each other, but It will be understood that other components may exist in the middle of the two components. In addition, the description of "including" a specific configuration in the exemplary embodiments does not exclude configurations other than the specific configuration above, and additional configurations are the implementation of the exemplary embodiments or the technical idea of the exemplary embodiments. This means that it can be included in the scope.

제1 및 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기의 구성요소들은 상기의 용어들에 의해 한정되어서는 안 된다. 상기의 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하여 지칭하기 위해서 사용된다. 예를 들어, 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다.Terms such as first and second may be used to describe various elements, but the above elements should not be limited by the above terms. The above terms are used to distinguish and refer to one component from another. For example, without departing from the scope of rights, a first element may be termed a second element, and similarly, a second element may also be termed a first element.

또한 실시예들에 나타나는 구성요소들은 서로 다른 특징적인 기능들을 나타내기 위해 독립적으로 도시되는 것으로, 각 구성요소가 분리된 하드웨어나 하나의 소프트웨어 구성 단위로만 이루어짐을 의미하지 않는다. 즉, 각 구성요소는 설명의 편의상 각각의 구성요소로 나열된 것이다. 예를 들면, 구성요소들 중 적어도 두 개의 구성요소들이 하나의 구성요소로 합쳐질 수 있다. 또한, 하나의 구성요소가 복수의 구성요소들로 나뉠 수 있다. 이러한 각 구성요소의 통합된 실시예 및 분리된 실시예 또한 본질에서 벗어나지 않는 한 권리범위에 포함된다.In addition, the components appearing in the embodiments are shown independently to represent different characteristic functions, and each component does not mean that each component consists of only a separate hardware or one software unit. That is, each component is listed as each component for convenience of description. For example, at least two of the components may be combined into one component. Also, one component may be divided into a plurality of components. An integrated embodiment and a separate embodiment of each of these components are also included in the scope of rights unless departing from the essence.

또한, 일부의 구성요소는 본질적인 기능을 수행하는 필수적인 구성요소는 아니고 단지 성능을 향상시키기 위한 선택적 구성요소일 수 있다. 실시예들은 실시예의 본질을 구현하는데 필수적인 구성부만을 포함하여 구현될 수 있고, 예를 들면, 단지 성능 향상을 위해 사용되는 구성요소와 같은, 선택적 구성요소가 제외된 구조 또한 권리 범위에 포함된다.In addition, some of the components may not be essential components for performing essential functions, but may be optional components for improving performance. Embodiments may be implemented by including only components essential to implement the essence of the embodiment, and structures excluding optional components, such as components used only for performance improvement, are also included in the scope of rights.

이하에서는, 기술분야에서 통상의 지식을 가진 자가 실시예들을 용이하게 실시할 수 있도록 하기 위하여, 첨부된 도면을 참조하여 실시예들을 상세히 설명하기로 한다. 실시예들을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 명세서의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.Hereinafter, the embodiments will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily practice the embodiments. In describing the embodiments, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present specification, the detailed description will be omitted.

이하에서, 영상은 비디오(video)을 구성하는 하나의 픽쳐를 의미할 수 있으며, 비디오 자체를 나타낼 수도 있다. 예를 들면, "영상의 부호화 및/또는 복호화"는 "비디오의 부호화 및/또는 복호화"를 의미할 수 있으며, "비디오를 구성하는 영상들 중 하나의 영상의 부호화 및/또는 복호화"를 의미할 수도 있다.
Hereinafter, an image may mean one picture constituting a video, and may also indicate the video itself. For example, "encoding and/or decoding an image" may mean "encoding and/or decoding a video", and may mean "encoding and/or decoding one of images constituting a video". may be

우선, 실시예들에서 사용되는 용어를 설명한다.First, terms used in the embodiments will be described.

유닛(unit): "유닛"은 영상의 부호화 및 복호화의 단위를 나타낼 수 있다. 유닛 및 블록(block)의 의미들은 동일할 수 있다. 또한, 용어 "유닛" 및 "블록"은 서로 교체되어 사용될 수 있다.Unit: A “unit” may represent a unit of encoding and decoding of an image. The semantics of unit and block may be the same. Also, the terms “unit” and “block” may be used interchangeably.

유닛(또는, 블록)은 샘플의 MxN 배열일 수 있다. M 및 N은 각각 양의 정수일 수 있다. 유닛은 흔히 2차원의 샘플의 배열을 의미할 수 있다. 샘플은 픽셀 또는 픽셀 값일 수 있다. A unit (or block) can be an MxN array of samples. M and N may each be a positive integer. A unit may often mean a two-dimensional array of samples. A sample can be a pixel or a pixel value.

영상의 부호화 및 복호화에 있어서, 유닛은 하나의 영상의 분할에 의해 생성된 영역일 수 있다. 하나의 영상은 복수의 유닛들로 분할될 수 있다. 영상의 부호화 및 복호화에 있어서, 유닛의 종류에 따라서 유닛에 대한 기정의된 처리가 수행될 수 있다. 기능에 따라서, 유닛의 타입은 매크로 유닛(Macro Unit), 코딩 유닛(Coding Unit; CU), 예측 유닛(Prediction Unit; PU) 및 변환 유닛(transform Unit; TU) 등으로 분류될 수 있다. 하나의 유닛은 유닛에 비해 더 작은 크기를 갖는 하위 유닛으로 더 분할될 수 있다. In encoding and decoding of an image, a unit may be a region created by dividing one image. One image may be divided into a plurality of units. In encoding and decoding of an image, a predefined process for a unit may be performed according to the type of unit. Depending on the function, the unit type may be classified into a macro unit, a coding unit (CU), a prediction unit (PU), and a transform unit (TU). One unit can be further divided into sub-units having a smaller size relative to the unit.

블록 분할 정보는 유닛의 깊이(depth)에 관한 정보를 포함할 수 있다. 깊이 정보는 유닛이 분할되는 회수 및/또는 정도를 나타낼 수 있다. Block division information may include information about the depth of a unit. Depth information may indicate the number and/or degree of division of a unit.

하나의 유닛은 트리 구조(tree structure)에 기반하여 깊이 정보(depth)를 가지면서 계층적으로 복수의 하위 유닛들로 분할될 수 있다. 말하자면, 유닛 및 상기의 유닛의 분할에 의해 생성된 하위 유닛은 노드 및 상기의 노드의 자식 노드에 각각 대응할 수 있다. 각각의 분할된 하위 유닛은 깊이 정보를 가질 수 있다. 유닛의 깊이 정보는 유닛이 분할된 회수 및/또는 정도를 나타내므로, 하위 유닛의 분할 정보는 하위 유닛의 크기에 관한 정보를 포함할 수도 있다. One unit may be hierarchically divided into a plurality of sub-units while having depth information based on a tree structure. In other words, a unit and a sub-unit generated by division of the unit may correspond to a node and a child node of the node, respectively. Each divided sub-unit may have depth information. Since the depth information of a unit indicates the number and/or degree of division of the unit, the division information of the sub-unit may include information about the size of the sub-unit.

트리 구조에서, 가장 상위 노드는 분할되지 않은 최초의 유닛에 대응할 수 있다. 가장 상위 노드는 루트 노드(root node)로 칭해질 수 있다. 또한, 가장 상위 노드는 최소의 깊이 값을 가질 수 있다. 이 때, 가장 상위 노드는 레벨 0의 깊이를 가질 수 있다. In the tree structure, the highest node may correspond to the first undivided unit. The highest node may be referred to as a root node. Also, the highest node may have the smallest depth value. In this case, the highest node may have a depth of level 0.

레벨 1의 깊이를 갖는 노드는 최초의 유닛이 한 번 분할됨에 따라 생성된 유닛을 나타낼 수 있다. 레벨 2의 깊이를 갖는 노드는 최초의 유닛이 두 번 분할됨에 따라 생성된 유닛을 나타낼 수 있다. A node with a depth of level 1 may represent a unit created as the original unit is split once. A node with a depth of level 2 may represent a unit created as the original unit is split twice.

레벨 n의 깊이를 갖는 노드는 최초의 유닛이 n번 분할됨에 따라 생성된 유닛을 나타낼 수 있다. A node with a depth of level n may represent a unit created as the original unit is split n times.

리프 노드는 가장 하위의 노드일 수 있으며, 더 분할될 수 없는 노드일 수 있다. 리프 노드의 깊이는 최대 레벨일 수 있다. 예를 들면, 최대 레벨의 기정의된 값은 3일 수 있다. A leaf node may be the lowest node and may be a node that cannot be further divided. The depth of a leaf node may be the maximum level. For example, the predefined value of the maximum level may be 3.

변환 유닛(Transform Unit): 변환 유닛은 변환, 역변환, 양자화, 역양자화, 변환 계수 부호화, 및 변환 계수 복호화 등과 같은 잔차 신호(residual signal) 부호화 및/또는 잔여 신호 복호화에 있어서의 기본 유닛일 수 있다. 하나의 변환 유닛은 더 작은 크기를 갖는 다수의 변환 유닛들 분할될 수 있다.Transform Unit: The transform unit may be a basic unit in residual signal encoding and/or residual signal decoding, such as transform, inverse transform, quantization, inverse quantization, transform coefficient encoding, and transform coefficient decoding. . One transform unit may be divided into multiple transform units having smaller sizes.

예측 유닛(Prediction Unit) : 예측 유닛은 예측 또는 보상(compensation)의 수행에 있어서의 기본 단위일 수 있다. 예측 유닛은 분할에 의해 다수의 파티션(partition)들이 될 수 있다. 다수의 파티션들 또한 예측 또는 보상의 수행에 있어서의 기본 단위일 수 있다. 예측 유닛의 분할에 의해 생성된 파티션 또한 예측 유닛일 수 있다.Prediction Unit: A prediction unit may be a basic unit in performing prediction or compensation. A prediction unit may be divided into multiple partitions. A number of partitions may also be a basic unit in performing prediction or compensation. A partition generated by division of a prediction unit may also be a prediction unit.

복원된 이웃 유닛(Reconstructed Neighbor Unit) : 복원된 이웃 유닛은 부호화 대상 유닛 또는 복호화 대상 유닛의 주변에 이미 부호화 또는 복호화되어 복원된 유닛일 수 있다. 복원된 이웃 유닛은 대상 유닛에 대한 공간적(spatial) 인접 유닛 또는 시간적(temporal) 인접 유닛일 수 있다.Reconstructed Neighbor Unit: A reconstructed neighbor unit may be a unit that has already been encoded or decoded and reconstructed around a unit to be coded or a unit to be decoded. The reconstructed neighboring unit may be a spatial neighboring unit or a temporal neighboring unit of the target unit.

예측 유닛 파티션 : 예측 유닛 파티션은 예측 유닛이 분할된 형태를 의미할 수 있다.Prediction unit partition: A prediction unit partition may mean a form in which a prediction unit is divided.

파라미터 세트(Parameter Set): 파라미터 세트는 비트스트림 내의 구조(structure) 중 헤더(header) 정보에 해당할 수 있다. 예를 들면, 파라미터 세트는 시퀀스 파라미터 세트(sequence parameter set), 픽쳐 파라미터 세트(picture parameter set) 및 적응 파라미터 세트(adaptation parameter set) 등을 포함할 수 있다.Parameter Set: A parameter set may correspond to header information among structures in a bitstream. For example, the parameter set may include a sequence parameter set, a picture parameter set, an adaptation parameter set, and the like.

율왜곡 최적화(ratedistortion optimization): 부호화 장치는 코딩 유닛의 크기, 예측 모드, 예측 유닛의 크기, 움직임 정보 및, 변환 유닛의 크기 등의 조합을 이용해서 높은 부호화 효율을 제공하기 위해 율왜곡 최적화를 사용할 수 있다.Rate distortion optimization: An encoding device may use rate distortion optimization to provide high encoding efficiency by using a combination of coding unit size, prediction mode, prediction unit size, motion information, and conversion unit size. can

율왜곡 최적화 방식은 상기의 조합들 중에서 최적의 조합을 선택하기 위해 각 조합의 율왜곡 비용(ratedistortion cost)을 계산할 수 있다. 율왜곡 비용은 아래의 수학식 1을 이용하여 계산될 수 있다. 일반적으로 상기 율왜곡 비용이 최소가 되는 조합이 율왜곡 최적화 방식에 있어서의 최적의 조합으로 선택될 수 있다. The rate distortion optimization method may calculate a rate distortion cost of each combination in order to select an optimal combination among the above combinations. The rate distortion cost can be calculated using Equation 1 below. In general, a combination that minimizes the rate distortion cost can be selected as an optimal combination in the rate distortion optimization method.

D는 왜곡을 나타낼 수 있다. D는 변환 블록 내에서 원래의 변환 계수들 및 복원된 변환 계수들 간의 차이 값들의 제곱들의 평균(mean square error)일 수 있다.D can represent distortion. D may be a mean square error of difference values between original transform coefficients and reconstructed transform coefficients within a transform block.

R은 율을 나타낼 수 있다. R은 관련된 문맥 정보를 이용한 비트 율을 나타낼 수 있다.R can represent a rate. R may represent a bit rate using related context information.

λ는 라그랑지안 승수(Lagrangian multiplier)를 나타낼 수 있다. R은 예측 모드, 움직임 정보 및 부호화 블록 플래그(coded block flag) 등과 같은 부호화 파라미터 정보뿐만 아니라, 변환 계수의 부호화에 의해 발생하는 비트도 포함할 수 있다.λ may represent a Lagrangian multiplier. R may include not only encoding parameter information such as a prediction mode, motion information, and a coded block flag, but also bits generated by encoding transform coefficients.

부호화 장치는 정확한 D 및 R을 계산하기 위해 인터 예측 및/또는 인트라 예측, 변환, 양자화, 엔트로피 부호화, 역양자화, 역변환 등의 과정을 수행하는데, 이러한 과정은 부호화 장치에서의 복잡도를 크게 증가시킬 수 있다.The encoding device performs processes such as inter prediction and/or intra prediction, transformation, quantization, entropy encoding, inverse quantization, and inverse transformation to calculate accurate D and R. These processes can greatly increase complexity in the encoding device. have.

참조 픽쳐(reference picture) : 참조 픽쳐는 인터 예측 또는 움직임 보상에 사용되는 영상일 수 있다. 참조 픽쳐는 인터 예측 또는 움직임 보상을 위해 대상 유닛이 참조하는 참조 유닛을 포함하는 픽쳐일 수 있다. 픽쳐 및 영상의 의미들은 동일할 수 있다. 또한, 용어 "픽쳐" 및 "영상"은 서로 교체되어 사용될 수 있다.Reference picture: A reference picture may be an image used for inter prediction or motion compensation. A reference picture may be a picture including a reference unit referred to by a target unit for inter prediction or motion compensation. Meanings of a picture and a video may be the same. Also, the terms “picture” and “image” may be used interchangeably.

참조 픽쳐 리스트(reference picture list) : 참조 픽쳐 리스트는 인터 예측 또는 움직임 보상에 사용되는 참조 영상들을 포함하는 리스트일 수 있다. 참조 픽쳐 리스트의 종류는 리스트 조합(List Combined; LC), 리스트 0(List 0; L0) 및 리스트 1(List 1; L1) 등이 있을 수 있다.Reference picture list: The reference picture list may be a list including reference pictures used for inter prediction or motion compensation. Types of reference picture lists may include List Combined (LC), List 0 (L0), and List 1 (List 1; L1).

움직임 벡터(Motion Vector; MV) : 움직임 벡터는 인터 예측에서 사용되는 2차원의 벡터일 수 있다. 예를 들면, MV는 (mv_x, mv_y)와 같은 형태로 표현될 수 있다. mv_x는 수평(horizontal) 성분을 나타낼 수 있고, mv_y 는 수직(vertical) 성분을 나타낼 수 있다.Motion Vector (MV): A motion vector may be a two-dimensional vector used in inter prediction. For example, MV can be expressed as (mv _x , mv _y ). mv _x may represent a horizontal component, and mv _y may represent a vertical component.

MV는 대상 픽쳐 및 참조 픽쳐 간의 오프셋(offset)을 나타낼 수 있다. MV may represent an offset between a target picture and a reference picture.

탐색 영역(search range) : 탐색 영역은 인터 예측 중 MV에 대한 탐색이 이루어지는 2차원의 영역일 수 있다. 예를 들면, 탐색 영역의 크기는 MxN일 수 있다. M 및 N은 각각 양의 정수일 수 있다.Search range: The search range may be a two-dimensional area where MVs are searched during inter prediction. For example, the size of the search area may be MxN. M and N may each be a positive integer.

도 1은 본 발명이 적용되는 부호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.1 is a block diagram showing a configuration according to an embodiment of an encoding device to which the present invention is applied.

부호화 장치(100)는 비디오 부호화 장치 또는 영상 부호화 장치일 수 있다. 비디오는 하나 이상의 영상들을 포함할 수 있다. 부호화 장치(110)는 비디오의 하나 이상의 영상들을 시간에 따라 순차적으로 부호화할 수 있다.The encoding device 100 may be a video encoding device or an image encoding device. A video may include one or more images. The encoding device 110 may sequentially encode one or more images of a video according to time.

도 1을 참조하면, 부호화 장치(100)는 인터 예측부(110), 인트라 예측부(120), 스위치(115), 감산기(125), 변환부(130), 양자화부(140), 엔트로피 부호화부(150), 역양자화부(160), 역변환부(170), 가산기(175), 필터부(180) 및 참조 픽쳐 버퍼(190)를 포함할 수 있다.Referring to FIG. 1, an encoding apparatus 100 includes an inter prediction unit 110, an intra prediction unit 120, a switch 115, a subtractor 125, a transform unit 130, a quantization unit 140, and entropy encoding. It may include a unit 150, an inverse quantization unit 160, an inverse transform unit 170, an adder 175, a filter unit 180, and a reference picture buffer 190.

부호화 장치(100)는 입력 영상에 대해 인트라 모드 및/또는 인터 모드로 부호화를 수행할 수 있다. 입력 영상은 현재 부호화의 대상인 현재 영상으로 칭해질 수 있다.The encoding apparatus 100 may perform encoding on an input image in an intra mode and/or an inter mode. The input image may be referred to as a current image that is a current encoding target.

또한, 부호화 장치(100)는 입력 영상에 대한 부호화를 통해 부호화의 정보를 포함하는 비트스트림을 생성할 수 있고, 생성된 비트스트림을 출력할 수 있다.Also, the encoding device 100 may generate a bitstream including encoding information through encoding of an input image and output the generated bitstream.

인트라 모드가 사용되는 경우, 스위치(115)는 인트라로 전환될 수 있다. 인터 모드가 사용되는 경우, 스위치(115)는 인터로 전환될 수 있다.When intra mode is used, switch 115 can be switched to intra. When the inter mode is used, the switch 115 can be switched to inter.

부호화 장치(100)는 입력 영상의 입력 블록에 대한 예측 블록을 생성할 수 있다. 또한, 부호화 장치(100)는 예측 블록이 생성된 후, 입력 블록 및 예측 블록의 차분(residual)을 부호화할 수 있다. 입력 블록은 현재 부호화의 대상인 현재 블록으로 칭해질 수 있다.The encoding apparatus 100 may generate a prediction block for an input block of an input image. Also, the encoding apparatus 100 may encode a residual between the input block and the prediction block after the prediction block is generated. An input block may be referred to as a current block that is a target of current encoding.

예측 모드가 인트라 모드인 경우, 인트라 예측부(120)는 현재 블록의 주변에 있는, 이미 부호화된 블록의 픽셀 값을 참조 픽셀로서 이용할 수 있다. 인트라 예측부(120)는 참조 픽셀을 이용하여 현재 블록에 대한 공간적 예측을 수행할 수 있고, 공간적 예측을 통해 현재 블록에 대한 예측 샘플들을 생성할 수 있다.When the prediction mode is the intra mode, the intra predictor 120 may use a pixel value of an already encoded block adjacent to the current block as a reference pixel. The intra predictor 120 may perform spatial prediction on the current block using the reference pixel and generate prediction samples for the current block through the spatial prediction.

인터 예측부(110)는 움직임 예측부 및 움직임 보상부를 포함할 수 있다.The inter prediction unit 110 may include a motion estimation unit and a motion compensation unit.

예측 모드가 인터 모드인 경우, 움직임 예측부는, 움직임 예측 과정에서 참조 영상으로부터 현재 블록과 가장 매치가 잘 되는 영역을 검색할 수 있고, 현재 블록 및 검색된 영역에 대한 움직임 벡터를 도출할 수 있다. 참조 영상은 참조 픽쳐 버퍼(190)에 저장될 수 있으며, 참조 영상에 대한 부호화 및/또는 복호화가 처리될 때 참조 픽쳐 버퍼(190)에 저장될 수 있다.When the prediction mode is the inter mode, the motion prediction unit may search for an area that best matches the current block from the reference picture in the motion prediction process, and derive a motion vector for the current block and the searched area. The reference picture may be stored in the reference picture buffer 190, and may be stored in the reference picture buffer 190 when encoding and/or decoding of the reference picture is processed.

움직임 보상부는 움직임 벡터를 이용하는 움직임 보상을 수행함으로써 예측 블록을 생성할 수 있다. 여기에서, 움직임 벡터는 인터 예측에 사용되는 2차원 벡터일 수 있다. 또한 움직임 벡터는 현재 영상 및 참조 영상 간의 오프셋(offset)을 나타낼 수 있다.The motion compensation unit may generate a prediction block by performing motion compensation using a motion vector. Here, the motion vector may be a 2D vector used for inter prediction. Also, the motion vector may indicate an offset between the current image and the reference image.

감산기(125)는 입력 블록 및 예측 블록의 차분인 잔차 블록(residual block)을 생성할 수 있다. 잔차 블록은 잔차 신호로 칭해질 수도 있다.The subtractor 125 may generate a residual block that is a difference between the input block and the prediction block. A residual block may also be referred to as a residual signal.

변환부(130)는 잔차 블록에 대해 변환(transform)을 수행하여 변환 계수를 생성할 수 있고, 생성된 변환 계수(transform coefficient)를 출력할 수 있다. 여기서, 변환 계수는 잔차 블록에 대한 변환을 수행함으로써 생성된 계수 값일 수 있다. 변환 생략(transform skip) 모드가 적용되는 경우, 변환부(130)는 잔차 블록에 대한 변환을 생략할 수도 있다.The transform unit 130 may generate transform coefficients by performing transform on the residual block, and may output the generated transform coefficients. Here, the transform coefficient may be a coefficient value generated by performing transform on the residual block. When a transform skip mode is applied, the transform unit 130 may skip transform on the residual block.

변환 계수에 양자화를 적용함으로써 양자화된 변환 계수 레벨(transform coefficient level)이 생성될 수 있다. 이하, 실시예들에서는 양자화된 변환 계수 레벨도 변환 계수로 칭해질 수 있다.A quantized transform coefficient level may be generated by applying quantization to the transform coefficient. Hereinafter, in the embodiments, a quantized transform coefficient level may also be referred to as a transform coefficient.

양자화부(140)는 변환 계수를 양자화 파라미터에 맞춰 양자화함으로써 양자화된 변환 계수 레벨(quantized transform coefficient level)을 생성할 수 있다. 양자화부(140)는 생성된 양자화된 변환 계수 레벨을 출력할 수 있다. 이때, 양자화부(140)에서는 양자화 행렬을 사용하여 변환 계수를 양자화할 수 있다.The quantization unit 140 may generate a quantized transform coefficient level by quantizing a transform coefficient according to a quantization parameter. The quantization unit 140 may output the generated quantized transform coefficient level. At this time, the quantization unit 140 may quantize the transform coefficient using a quantization matrix.

엔트로피 부호화부(150)는, 양자화부(140)에서 산출된 값들 및/또는 부호화 과정에서 산출된 부호화 파라미터 값들 등에 기초하여 확률 분포에 따른 엔트로피 부호화를 수행함으로써 비트스트림(bitstream)을 생성할 수 있다. 엔트로피 부호화부(150)는 생성된 비트스트림을 출력할 수 있다.The entropy encoding unit 150 may generate a bitstream by performing entropy encoding according to a probability distribution based on the values calculated by the quantization unit 140 and/or encoding parameter values calculated in the encoding process. . The entropy encoding unit 150 may output the generated bitstream.

엔트로피 부호화부(150)는 영상의 픽셀의 정보 외에 영상의 복호화를 위한 정보에 대한 엔트로피 부호화를 수행할 수 있다. 예를 들면, 영상의 복호화를 위한 정보는 신택스 엘리먼트(syntax element) 등을 포함할 수 있다. The entropy encoding unit 150 may perform entropy encoding on information for decoding an image in addition to pixel information of the image. For example, information for decoding an image may include a syntax element and the like.

부호화 파라미터는 부호화 및/또는 복호화를 위해 요구되는 정보일 수 있다. 부호화 파라미터는 부호화 장치에서 부호화되어 복호화 장치로 전달되는 정보를 포함할 수 있고, 부호화 혹은 복호화 과정에서 유추될 수 있는 정보를 포함할 수 있다. 예를 들면, 복호화 장치로 전달되는 정보로서, 신택스 엘리먼트가 있다.The encoding parameter may be information required for encoding and/or decoding. The encoding parameter may include information that is encoded by the encoding device and transmitted to the decoding device, and may include information that can be inferred during an encoding or decoding process. For example, as information transmitted to the decoding device, there is a syntax element.

예를 들면, 부호화 파라미터는 예측 모드, 움직임 벡터, 참조 픽쳐 색인(index), 부호화 블록 패턴(pattern), 잔차 신호 유무, 변환 계수, 양자화된 변환 계수, 양자화 파라미터, 블록 크기, 블록 분할(partition) 정보 등의 값 또는 통계를 포함할 수 있다. 예측 모드는 인트라 예측 모드 또는 인터 예측 모드를 가리킬 수 있다.For example, coding parameters include prediction mode, motion vector, reference picture index, coding block pattern, presence or absence of residual signal, transform coefficient, quantized transform coefficient, quantization parameter, block size, block partition It may contain values or statistics such as information. The prediction mode may indicate an intra prediction mode or an inter prediction mode.

잔차 신호는 원 신호 및 예측 신호 간의 차이(difference)를 의미할 수 있다. 또는, 잔차 신호는 원신호 및 예측 신호 간의 차이를 변환(transform)함으로써 생성된 신호일 수 있다. 또는, 잔차 신호는 원 신호 및 예측 신호 간의 차이를 변환 및 양자화함으로써 생성된 신호일 수 있다. 잔차 블록은 블록 단위의 잔차 신호일 수 있다.The residual signal may mean a difference between the original signal and the predicted signal. Alternatively, the residual signal may be a signal generated by transforming a difference between the original signal and the prediction signal. Alternatively, the residual signal may be a signal generated by transforming and quantizing the difference between the original signal and the predicted signal. A residual block may be a residual signal in units of blocks.

엔트로피 부호화가 적용되는 경우, 높은 발생 확률을 갖는 심볼에 적은 수의 비트가 할당될 수 있고, 낮은 발생 확률을 갖는 심볼에 많은 수의 비트가 할당될 수 있다. 이러한 할당을 통해 심볼이 표현됨에 따라, 부호화의 대상인 심볼들에 대한 비트열(bitstring)의 크기가 감소될 수 있다. 따라서, 엔트로피 부호화를 통해서 영상 부호화의 압축 성능이 향상될 수 있다. When entropy encoding is applied, a small number of bits may be allocated to a symbol having a high probability of occurrence, and a large number of bits may be allocated to a symbol having a low probability of occurrence. As symbols are represented through such allocation, the size of bitstrings for symbols that are encoding targets can be reduced. Therefore, compression performance of image encoding can be improved through entropy encoding.

또한, 엔트로피 부호화를 위해 지수 골롬(exponential golomb), 문맥적응형 가변 길이 코딩(ContextAdaptive Variable Length Coding; CAVLC) 및 문맥적응형 이진 산술 코딩(ContextAdaptive Binary Arithmetic Coding; CABAC) 등과 같은 부호화 방법이 사용될 수 있다. 예를 들면, 엔트로피 부호화부(150)는 가변 길이 부호화(Variable Lenghth Coding/Code; VLC) 테이블을 이용하여 엔트로피 부호화를 수행할 수 있다. 예를 들면, 엔트로피 부호화부(150)는 대상 심볼에 대한 이진화(binarization) 방법을 도출할 수 있다. 또한, 엔트로프 부호화부(150)는 대상 심볼/빈(bin)의 확률 모델(probability model)을 도출할 수 있다. 엔트로피 부호화부(150)는 도출된 이진화 방법 또는 확률 모델을 사용하여 엔트로피 부호화를 수행할 수도 있다.In addition, encoding methods such as exponential golomb, context-adaptive variable length coding (CAVLC), and context-adaptive binary arithmetic coding (CABAC) may be used for entropy encoding. . For example, the entropy encoding unit 150 may perform entropy encoding using a variable length coding/code (VLC) table. For example, the entropy encoding unit 150 may derive a binarization method for a target symbol. Also, the entropy encoder 150 may derive a probability model of the target symbol/bin. The entropy encoding unit 150 may perform entropy encoding using the derived binarization method or a probability model.

부호화 장치(100)에 의해 인터 예측을 통한 부호화를 수행되기 때문에, 부호화된 현재 영상은 이후에 처리되는 다른 영상(들)에 대하여 참조 영상으로서 사용될 수 있다. 따라서, 부호화 장치(100)는 부호화된 현재 영상을 다시 복호화할 수 있고, 복호화된 영상을 참조 영상으로서 저장할 수 있다. 복호화를 위해 부호화된 현재 영상에 대한 역양자화 및 역변환이 처리될 수 있다.Since encoding is performed through inter prediction by the encoding apparatus 100, the encoded current image may be used as a reference image for other image(s) to be processed later. Accordingly, the encoding apparatus 100 may decode the encoded current image again and store the decoded image as a reference image. Inverse quantization and inverse transformation of the current image encoded for decoding may be processed.

양자화된 계수는 역양자화부(160)에서 역양자화될(inversely quantized) 수 있고. 역변환부(170)에서 역변환될(inversely transformed) 수 있다. 역양자화 및 역변환된 계수는 가산기(175)를 통해 예측 블록과 합해질 수 있다, 역양자화 및 역변환된 계수 및 예측 블록을 합함으로써 복원(reconstructed) 블록이 생성될 수 있다.The quantized coefficient may be inversely quantized in the inverse quantization unit 160. It may be inversely transformed in the inverse transformation unit 170. The inverse quantized and inverse transformed coefficients may be combined with the prediction block through the adder 175. A reconstructed block may be generated by summing the inverse quantized and inverse transformed coefficients and the prediction block.

복원 블록은 필터부(180)를 거칠 수 있다. 필터부(180)는 디블록킹 필터(deblocking filter), 에스에이오(Sample Adaptive Offset; SAO), 에이엘어프(Adaptive Loop Filter; ALF) 중 적어도 하나 이상을 복원 블록 또는 복원 픽쳐에 적용할 수 있다. 필터부(180)는 적응적(adaptive) 인루프(inloop) 필터로 칭해질 수도 있다.The reconstruction block may pass through the filter unit 180 . The filter unit 180 may apply at least one of a deblocking filter, a Sample Adaptive Offset (SAO), and an Adaptive Loop Filter (ALF) to a reconstructed block or a reconstructed picture. The filter unit 180 may also be referred to as an adaptive in-loop filter.

디블록킹 필터는 블록들 간의 경계에서 발생한 블록 왜곡을 제거할 수 있다. SAO는 코딩 에러에 대한 보상을 위해 픽셀 값에 적정 오프셋(offset) 값을 더할 수 있다. ALF는 복원 영상 및 원래의 영상을 비교한 값에 기반하여 필터링을 수행할 수 있다. 필터부(180)를 거친 복원 블록은 참조 픽쳐 버퍼(190)에 저장될 수 있다.
The deblocking filter may remove block distortion generated at a boundary between blocks. SAO may add an appropriate offset value to a pixel value to compensate for a coding error. ALF may perform filtering based on a value obtained by comparing the restored image and the original image. A reconstructed block that has passed through the filter unit 180 may be stored in the reference picture buffer 190 .

도 2는 본 발명이 적용되는 복호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.2 is a block diagram showing a configuration according to an embodiment of a decoding device to which the present invention is applied.

복호화 장치(200)는 비디오 복호화 장치 또는 영상 복호화 장치일 수 있다.The decoding device 200 may be a video decoding device or an image decoding device.

도 2를 참조하면, 복호화 장치(200)는 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 인터 예측부(250), 가산기(255), 필터부(260) 및 참조 픽쳐 버퍼(270)를 포함할 수 있다.Referring to FIG. 2 , the decoding apparatus 200 includes an entropy decoding unit 210, an inverse quantization unit 220, an inverse transform unit 230, an intra prediction unit 240, an inter prediction unit 250, and an adder 255. , a filter unit 260 and a reference picture buffer 270.

복호화 장치(200)는 부호화 장치(100)에서 출력된 비트스트림을 수신할 수 있다. 복호화 장치(200)는 비트스트림에 대하여 인트라 모드 및/또는 인터 모드의 복호화를 수행할 수 있다. 또한, 복호화 장치(200)는 복호화를 통해 복원 영상을 생성할 수 있고, 생성된 복원 영상을 출력할 수 있다.The decoding device 200 may receive the bitstream output from the encoding device 100. The decoding apparatus 200 may perform intra mode and/or inter mode decoding on a bitstream. Also, the decoding apparatus 200 may generate a restored image through decoding and output the generated restored image.

예를 들면, 복호화에 사용되는 예측 모드에 따른 인트라 모드 또는 인터 모드로의 전환은 스위치에 의해 이루어질 수 있다. 복호화에 사용되는 예측 모드가 인트라 모드인 경우 스위치가 인트라로 전환될 수 있다. 복호화에 사용되는 예측 모드가 인터 모드인 경우 스위치가 인터로 전환될 수 있다.For example, conversion to an intra mode or an inter mode according to a prediction mode used for decoding may be performed by a switch. When the prediction mode used for decoding is the intra mode, the switch may be switched to the intra mode. When the prediction mode used for decoding is the inter mode, the switch may be switched to inter mode.

복호화 장치(200)는 입력된 비트스트림으로부터 복원된 잔차 블록(reconstructed residual block)을 획득할 수 있고, 예측 블록을 생성할 수 있다. 복원된 잔차 블록 및 예측 블록이 획득되면, 복호화 장치(200)는 복원된 잔차 블록과 및 예측 블록을 더함으로써 복원 블록을 생성할 수 있다.The decoding apparatus 200 may obtain a reconstructed residual block from the input bitstream and generate a prediction block. When the reconstructed residual block and the prediction block are obtained, the decoding apparatus 200 may generate a reconstructed block by adding the reconstructed residual block and the prediction block.

엔트로피 복호화부(210)는 확률 분포에 기초하여 비트스트림에 대한 엔트로피 복호화를 수행함으로써 심볼들을 생성할 수 있다. 생성된 심볼들은 양자화된 계수(quantized coefficient) 형태의 심볼을 포함할 수 있다. 여기에서, 엔트로피 복호화 방법은 상술된 엔트로피 부호화 방법과 유사할 수 있다. 예를 들면, 엔트로피 복호화 방법은 상술된 엔트로피 부호화 방법의 역과정일 수 있다.The entropy decoding unit 210 may generate symbols by performing entropy decoding on a bitstream based on a probability distribution. The generated symbols may include symbols in the form of quantized coefficients. Here, the entropy decoding method may be similar to the above-described entropy encoding method. For example, the entropy decoding method may be a reverse process of the above-described entropy encoding method.

양자화된 계수는 역양자화부(220)에서 역양자화될 수 있다. 또한, 역양자화된 계수는 역변환부(230)에서 역변환될 수 있다. 양자화된 계수가 역양자화 및 역변환 된 결과로서, 복원된 잔차 블록이 생성될 수 있다. 이때, 역양자화부(220)는 양자화된 계수에 양자화 행렬을 적용할 수 있다.The quantized coefficient may be inversely quantized in the inverse quantization unit 220 . In addition, the inverse quantized coefficient may be inversely transformed in the inverse transform unit 230 . As a result of inverse quantization and inverse transformation of the quantized coefficients, a reconstructed residual block may be generated. At this time, the inverse quantization unit 220 may apply a quantization matrix to the quantized coefficients.

인트라 모드가 사용되는 경우, 인트라 예측부(240)는 현재 블록 주변의 이미 부호화된 블록의 픽셀 값을 이용하는 공간적 예측을 수행함으로써 예측 블록을 생성할 수 있다.When the intra mode is used, the intra predictor 240 may generate a prediction block by performing spatial prediction using pixel values of previously encoded blocks adjacent to the current block.

인터 예측부(250)는 움직임 보상부를 포함할 수 있다. 인터 모드가 사용되는 경우, 움직임 보상부는 움직임 벡터 및 참조 영상을 이용하는 움직임 보상을 수행함으로써 예측 블록을 생성할 수 있다. 참조 영상은 참조 픽쳐 버퍼(270)에 저장될 수 있다.The inter prediction unit 250 may include a motion compensation unit. When the inter mode is used, the motion compensation unit may generate a prediction block by performing motion compensation using a motion vector and a reference image. A reference image may be stored in the reference picture buffer 270 .

복원된 잔차 블록 및 예측 블록은 가산기(255)를 통해 더해질 수 있다. 가산기(255)는 복원된 잔차 블록 및 예측 블록을 더함으로써 복원 블록을 생성할 수 있다.The reconstructed residual block and the predicted block may be added through an adder 255. The adder 255 may generate a reconstructed block by adding the reconstructed residual block and the prediction block.

복원 블록은 필터부(260)를 거칠 수 있다. 필터부(260)는 디블록킹 필터, SAO 및 ALF 중 적어도 하나 이상을 복원 블록 또는 복원 픽쳐에 적용할 수 있다. 필터부(260)는 복원 영상을 출력할 수 있다. 복원 영상은 참조 픽쳐 버퍼(270)에 저장되어 인터 예측에 사용될 수 있다.
The reconstruction block may pass through the filter unit 260 . The filter unit 260 may apply at least one of a deblocking filter, SAO, and ALF to a reconstructed block or a reconstructed picture. The filter unit 260 may output a reconstructed image. The reconstructed image may be stored in the reference picture buffer 270 and used for inter prediction.

도 3은 영상을 부호화 및 복호화할 때의 영상의 분할 구조를 개략적으로 나타내는 도면이다.3 is a diagram schematically illustrating a division structure of an image when encoding and decoding an image.

영상을 효율적으로 분할하기 위해, 부호화 및 복호화에 있어서, 코딩 유닛(Coding Unit; CU)이 사용될 수 있다. 유닛은 1) 영상 샘플들을 포함하는 블록 및 2) 구문 요소(syntax element)을 합쳐서 지칭하는 용어일 수 있다. 예를 들면, "유닛의 분할"은 "유닛에 해당하는 블록의 분할"을 의미할 수 있다.In order to efficiently segment an image, a coding unit (CU) may be used in encoding and decoding. A unit may be a term that collectively refers to 1) a block including video samples and 2) a syntax element. For example, “division of a unit” may mean “division of a block corresponding to a unit”.

도 3을 참조하면, 영상(300)은 최대 코딩 유닛(Largest Coding Unit; LCU)의 단위로 순차적으로 분할될 수 있고, 영상(300)의 분할 구조는 LCU에 따라서 결정될 수 있다. 여기서, LCU는 코딩 트리 유닛(Coding Tree Unit; CTU)과 동일한 의미로 사용될 수 있다.Referring to FIG. 3 , an image 300 may be sequentially divided into units of a largest coding unit (LCU), and a division structure of the image 300 may be determined according to the LCU. Here, LCU may be used as the same meaning as Coding Tree Unit (CTU).

분할 구조는 LCU(310) 내에서의, 영상을 효율적으로 부호화하기 위한, 코딩 유닛(Coding Unit; CU)의 분포를 의미할 수 있다. 이러한 분포는 하나의 CU를 4개의 CU들로 분할할지 여부에 따라 결정될 수 있다. 분할에 의해 생성된 CU의 가로 크기 및 세로 크기는 각각 분할 전의 CU의 가로 크기의 절반 및 세로 크기의 절반일 수 있다. 분할된 CU는 동일한 방식으로 가로 크기 및 세로 크기가 절반으로 감소된 4개의 CU로 재귀적으로 분할될 수 있다.The division structure may refer to a distribution of Coding Units (CUs) in the LCU 310 to efficiently encode an image. This distribution may be determined according to whether one CU is divided into 4 CUs. The horizontal size and vertical size of the CU generated by division may be half of the horizontal size and half of the vertical size of the CU before division, respectively. The divided CU may be recursively divided into four CUs whose horizontal and vertical sizes are reduced by half in the same way.

이때, CU의 분할은 기정의된 깊이까지 재귀적으로 이루어질 수 있다. 깊이 정보는 CU의 크기를 나타내는 정보일 수 있다. 각 CU 별로 깊이 정보가 저장될 수 있다. 예컨대, LCU의 깊이는 0일 수 있고, 최소 코딩 유닛(Smallest Coding Unit; SCU)의 깊이는 기정의된 최대 깊이일 수 있다. 여기서, LCU는 상술된 것과 같이 최대의 코딩 유닛 크기를 가지는 CU일 수 있고, SCU는 최소의 코딩 유닛 크기를 가지는 CU일 수 있다. In this case, the division of the CU may be recursively performed up to a predefined depth. Depth information may be information indicating the size of a CU. Depth information may be stored for each CU. For example, the depth of the LCU may be 0, and the depth of the smallest coding unit (SCU) may be a predefined maximum depth. Here, the LCU may be a CU having the largest coding unit size as described above, and the SCU may be a CU having the smallest coding unit size.

LCU(310)로부터 분할이 시작될 수 있고, 분할에 의해 CU의 가로 크기 및 세로 크기가 절반으로 줄어들 때마다 CU의 깊이는 1씩 증가할 수 있다. 각각의 깊이 별로, 분할되지 않는 CU는 2Nx2N 크기를 가질 수 있다. 또한, 분할되는 CU의 경우, 2Nx2N 크기의 CU가 NxN 크기를 가지는 4개의 CU들로 분할될 수 있다. N의 크기는 깊이가 1씩 증가할 때마다 절반으로 감소할 수 있다. The division may start from the LCU 310, and the depth of the CU may increase by 1 whenever the horizontal and vertical sizes of the CU are reduced by half by division. For each depth, a CU that is not split may have a size of 2Nx2N. In addition, in the case of a CU to be divided, a CU of 2Nx2N size may be divided into 4 CUs having a size of NxN. The size of N can be halved every time the depth increases by 1.

도 3을 참조하면, 깊이가 0인 LCU는 64x64 픽셀들일 수 있다. 0은 최소 깊이일 수 있다. 깊이가 3인 SCU는 8x8 픽셀들일 수 있다. 3은 최대 깊이일 수 있다. 이때, LCU인 64x64 픽셀들의 CU는 깊이 0으로 표현될 수 있다. 32x32 픽셀들의 CU는 깊이 1로 표현될 수 있다. 16x16 픽셀들의 CU는 깊이 2로 표현될 수 있다. SCU인 8x8 픽셀들의 CU는 깊이 3으로 표현될 수 있다. Referring to FIG. 3 , an LCU having a depth of 0 may be 64×64 pixels. 0 may be the minimum depth. A SCU with a depth of 3 may be 8x8 pixels. 3 may be the maximum depth. In this case, a CU of 64x64 pixels, which is an LCU, may be expressed as a depth of 0. A CU of 32x32 pixels can be represented with a depth of 1. A CU of 16x16 pixels can be represented with a depth of 2. A CU of 8x8 pixels, which is an SCU, can be represented with a depth of 3.

또한, CU가 분할되는지 여부에 대한 정보는 CU의 분할 정보를 통해 표현될 수 있다. 분할 정보는 1비트의 정보일 수 있다. SCU를 제외한 모든 CU는 분할 정보를 포함할 수 있다. 예를 들면, CU가 분할되지 않는 경우 CU의 분할 정보의 값은 0일 수 있고, CU가 분할되는 경우 CU의 분할 정보의 값은 1일 수 있다.
Also, information on whether the CU is split may be expressed through split information of the CU. The division information may be 1 bit of information. All CUs except for the SCU may include partition information. For example, when the CU is not split, the value of the split information of the CU may be 0, and when the CU is split, the value of the split information of the CU may be 1.

도 4a 내지 도 4h는 코딩 유닛(CU)이 포함할 수 있는 예측 유닛(PU)의 형태를 도시한 도면이다.4A to 4H are diagrams illustrating a form of a prediction unit (PU) that may be included in a coding unit (CU).

LCU로부터 분할된 CU 중 더 이상 분할되지 않는 CU는 하나 이상의 예측 유닛(Prediction Unit; PU)들로 분할될 수 있다. 이러한 분할 또한, 파티션(partition)로 칭해질 수 있다.Among the CUs divided from the LCU, a CU that is not further divided may be divided into one or more prediction units (PUs). This division may also be referred to as a partition.

PU는 예측에 대한 기본 단위일 수 있다. PU는 스킵(skip) 모드, 인터 모드 및 인트라 모드 중 어느 하나로 부호화 및 복호화될 수 있다. PU는 각 모드에 따라서 다양한 형태로 분할될 수 있다.A PU may be a basic unit for prediction. A PU may be coded and decoded in any one of skip mode, inter mode, and intra mode. A PU may be divided into various types according to each mode.

도 4a에서 도시된 것과 같이, 스킵 모드에서는, CU 내에 분할이 존재하지 않을 수 있다. 스킵 모드에서는 분할 없이 PU 및 CU의 크기들이 동일한 2Nx2N 모드(410)가 지원될 수 있다.As shown in FIG. 4A , in skip mode, there may not be a split within a CU. In the skip mode, a 2Nx2N mode 410 in which sizes of PU and CU are the same may be supported without division.

인터 모드에서는, CU 내에서 8가지로 분할된 형태들이 지원될 수 있다. 예를 들면, 인터 모드에서는 2Nx2N 모드(410), 2NxN 모드(415), Nx2N 모드(420), NxN 모드(425), 2NxnU 모드(430), 2NxnD 모드(435), nLx2N 모드(440) 및 nRx2N 모드(445)가 지원될 수 있다.In inter mode, 8 divided types can be supported within the CU. For example, in inter mode, 2Nx2N mode 410, 2NxN mode 415, Nx2N mode 420, NxN mode 425, 2NxnU mode 430, 2NxnD mode 435, nLx2N mode 440 and nRx2N Mode 445 may be supported.

인트라 모드에서는, 2Nx2N 모드(410) 및 NxN 모드(425)가 지원될 수 있다.
In intra mode, 2Nx2N mode 410 and NxN mode 425 may be supported.

도 5는 코딩 유닛(CU)에 포함될 수 있는 변환 유닛(TU)의 형태를 도시한 도면이다.5 is a diagram illustrating a form of a transform unit (TU) that may be included in a coding unit (CU).

변환 유닛(Transform Unit; TU)은 CU 내에서 변환, 양자화, 역변환 및 역양자화의 과정을 위해 사용되는 기본 단위일 수 있다. TU는 정사각형 형태 또는 직사각형 형태를 가질 수 있다.A transform unit (TU) may be a basic unit used for transform, quantization, inverse transform, and inverse quantization processes within a CU. A TU may have a square shape or a rectangular shape.

LCU로부터 분할된 CU 중, 더 이상 CU들로 분할되지 않는 CU는 하나 이상의 TU들로 분할될 수 있다. 이때, TU의 분할 구조는 쿼드트리(quadtree) 구조일 수 있다. 예컨대, 도 5에서 도시된 것과 같이, 하나의 CU(510)가 쿼드트리 구조에 따라서 한 번 혹은 그 이상 분할될 수 있다. 분할을 통해, 하나의 CU(510)는 다양한 크기의 TU들로 구성될 수 있다.
Among the CUs split from the LCU, a CU that is no longer split into CUs may be split into one or more TUs. In this case, the division structure of the TU may be a quadtree structure. For example, as shown in FIG. 5 , one CU 510 may be divided one or more times according to a quadtree structure. Through division, one CU 510 may be composed of TUs of various sizes.

도 6은 인트라 예측 과정의 실시예를 설명하기 위한 도면이다.6 is a diagram for explaining an embodiment of an intra prediction process.

인트라 예측 모드의 개수는 예측 유닛의 크기에 관계없이 35개로 고정될 수 있다.The number of intra prediction modes may be fixed to 35 regardless of the size of a prediction unit.

예측 모드는 도 6에서 도시된 것과 같이 2개의 비방향성 모드들 및 33개의 방향성 모드들을 포함할 수 있다. 2개의 비방향성 모드들은 디씨(DC) 모드 및 플레너(Planar) 모드를 포함할 수 있다.The prediction mode may include 2 non-directional modes and 33 directional modes as shown in FIG. 6 . The two non-directional modes may include DC mode and Planar mode.

예측 모드의 개수는 색 성분(color component)의 타입에 따라 상이할 수 있다. 예를 들면, 색 성분이 휘도(luma) 신호인지 아니면 색차(chroma) 신호인지에 따라 예측 모드의 개수가 다를 수 있다.The number of prediction modes may be different according to the type of color component. For example, the number of prediction modes may be different depending on whether the color component is a luma signal or a chroma signal.

PU는 NxN의 크기 또는 2Nx2N의 크기를 갖는, 정사각형 형태를 가질 수 있다. NxN의 크기는 4x4, 8x8, 16x16, 32x32 및 64x64 등을 포함할 수 있다. PU의 단위는 CU, PU 및 TU 중 적어도 하나의 크기일 수 있다.A PU may have a square shape, with a size of NxN or a size of 2Nx2N. The size of NxN may include 4x4, 8x8, 16x16, 32x32, and 64x64. A unit of PU may be the size of at least one of CU, PU, and TU.

인트라 부호화 및/또는 복호화는 주변의 복원된 유닛에 포함되는 샘플 값 또는 부호화 파라미터를 이용하여 수행될 수 있다.
Intra encoding and/or decoding may be performed using sample values or encoding parameters included in neighboring reconstructed units.

도 7은 인터 예측 과정의 실시예를 설명하기 위한 도면이다.7 is a diagram for explaining an embodiment of an inter prediction process.

도 7에 도시된 사각형은 영상(또는, 픽쳐)를 나타낼 수 있다. 또한, 도 7에서 화살표는 예측 방향을 나타낼 수 있다. 즉, 영상은 예측 방향에 따라 부호화 및/또는 복호화될 수 있다.The rectangle shown in FIG. 7 may represent an image (or picture). Also, arrows in FIG. 7 may indicate prediction directions. That is, an image may be encoded and/or decoded according to a prediction direction.

각 영상(또는, 픽쳐)는 부호화 타입에 따라 I 픽쳐(Intra Picture), P 픽쳐(Uniprediction Picture), B 픽쳐(Biprediction Picture)로 분류될 수 있다. 각 픽쳐는 각 픽쳐의 부호화 타입에 따라 부호화될 수 있다.Each image (or picture) may be classified into an I picture (Intra Picture), a P picture (Uniprediction Picture), and a B picture (Biprediction Picture) according to an encoding type. Each picture may be coded according to the coding type of each picture.

부호화의 대상인 영상이 I 픽쳐인 경우, 영상은 인터 예측 없이 영상 자체에 대해 부호화될 수 있다. 부호화의 대상인 영상이 P 픽쳐인 경우, 영상은 순방향으로만 참조 픽쳐을 이용하는 인터 예측을 통해 부호화될 수 있다. 부호화의 대상인 영상이 B 픽쳐인 경우, 순방향 및 역방향의 양측으로 참조 픽쳐들을 이용하는 인터 예측을 통해 부호화될 수 있으며, 순방향 및 역방향 중 일 방향으로 참조 픽쳐를 이용하는 인터 예측을 통해 부호화될 수 있다.When an image to be encoded is an I picture, the image itself may be encoded without inter prediction. When an image to be coded is a P picture, the image may be coded through inter prediction using a reference picture only in the forward direction. When an image to be encoded is a B picture, it can be coded through inter prediction using reference pictures in both forward and backward directions, and can be coded through inter prediction using reference pictures in one of the forward and backward directions.

참조 픽쳐을 이용하여 부호화 및/또는 복호화되는 P 픽쳐 및 B 픽쳐는 인터 예측이 사용되는 영상으로 간주될 수 있다.A P picture and a B picture to be encoded and/or decoded using a reference picture may be regarded as an image using inter prediction.

아래에서, 실시예에 따른 인터 모드에서의 인터 예측에 대해 구체적으로 설명된다.Below, inter prediction in inter mode according to an embodiment is described in detail.

인터 모드에서, 부호화 장치(100) 및 복호화 장치(200)는 부호화 대상 유닛 및 복호화 대상 유닛에 대한 예측 및/또는 움직임 보상을 수행할 수 있다. 예를 들면, 부호화 장치(100) 또는 복호화 장치(200)는 복원된 주변 유닛의 움직임 정보를 부호화 대상 유닛 또는 복호화 대상 유닛의 움직임 정보로 사용함으로써 예측 및/또는 움직임 보상을 수행할 수 있다. 여기에서, 부호화 대상 유닛 또는 복호화 대상 유닛은 예측 유닛 및/또는 예측 유닛 파티션을 의미할 수 있다.In the inter mode, the encoding apparatus 100 and the decoding apparatus 200 may perform prediction and/or motion compensation on the encoding target unit and the decoding target unit. For example, the encoding apparatus 100 or the decoding apparatus 200 may perform prediction and/or motion compensation by using reconstructed motion information of neighboring units as motion information of the encoding target unit or decoding target unit. Here, the encoding target unit or the decoding target unit may mean a prediction unit and/or a prediction unit partition.

인터 예측은 참조 픽쳐 및 움직임 정보를 이용하여 수행될 수 있다. 또한, 인터 예측은 전술된 스킵 모드를 이용할 수도 있다.Inter prediction may be performed using a reference picture and motion information. Also, inter prediction may use the aforementioned skip mode.

참조 픽쳐(reference picture)는 현재 픽쳐의 이전 픽쳐 또는 현재 픽쳐의 이후 픽쳐 중 적어도 하나일 수 있다. 이때, 인터 예측은 참조 픽쳐에 기반하여 현재 픽쳐의 블록에 대한 예측을 수행할 수 있다. 여기에서, 참조 픽쳐는 블록의 예측에 이용되는 영상을 의미할 수 있다.A reference picture may be at least one of a picture before the current picture or a picture after the current picture. In this case, inter prediction may perform prediction on a block of the current picture based on the reference picture. Here, the reference picture may mean an image used for prediction of a block.

이때, 참조 픽쳐 내의 영역은 참조 픽쳐를 지시하는 참조 픽쳐 인덱스(refIdx) 및 후술될 움직임 벡터(motion vector) 등을 이용함으로써 특정될 수 있다.In this case, the region within the reference picture may be specified by using a reference picture index (refIdx) indicating the reference picture and a motion vector to be described later.

인터 예측은 참조 픽쳐 및 참조 픽쳐 내에서 현재 블록에 대응하는 참조 블록을 선택할 수 있고, 선택된 참조 블록을 사용하여 현재 블록에 대한 예측 블록을 생성할 수 있다. 현재 블록은 현재 픽쳐의 블록들 중 현재 부호화 또는 복호화의 대상인 블록일 수 있다.Inter prediction may select a reference picture and a reference block corresponding to a current block within the reference picture, and generate a prediction block for the current block using the selected reference block. The current block may be a block currently being encoded or decoded among blocks of the current picture.

움직임 정보는 부호화 장치(100) 및 복호화 장치(200)의 각각에 의해 인터 예측 중 도출될 수 있다. 또한, 도출된 움직임 정보는 인터 예측을 수행하는데 사용될 수 있다.Motion information may be derived during inter prediction by each of the encoding apparatus 100 and the decoding apparatus 200 . Also, the derived motion information may be used to perform inter prediction.

이때, 부호화 장치(100) 및 복호화 장치(200)는 복원된 주변 블록(neighboring block)의 움직임 정보 및/또는 콜 블록(collocated block; col block)의 움직임 정보를 이용함으로써 부호화 효율 및/또는 복호화 효율을 향상시킬 수 있다. 콜 블록은 이미 복원된 콜 픽쳐(collocated picture; col picture) 내의 현재 블록에 대응하는 블록일 수 있다.At this time, the encoding apparatus 100 and the decoding apparatus 200 use the motion information of the reconstructed neighboring block and/or the motion information of the collocated block (col block) to achieve encoding efficiency and/or decoding efficiency. can improve A collocated block may be a block corresponding to a current block in a previously reconstructed collocated picture (col picture).

복원된 주변 블록은 현재 픽쳐 내의 블록이면서, 이미 부호화 및/또는 복호화를 통해 복원된 블록일 수 있다. 복원 블록은 현재 블록에 인접한 이웃 블록 및/또는 현재 블록의 외부 코너에 위치한 블록일 수 있다. 여기에서, 현재 블록의 외부 코너에 위치한 블록이란, 현재 블록에 가로로 인접한 이웃 블록에 세로로 인접한 블록 또는 현재 블록에 세로로 인접한 이웃 블록에 가로로 인접한 블록일 수 있다.The reconstructed neighboring block may be a block in the current picture and may be a block already reconstructed through encoding and/or decoding. The reconstruction block may be a neighboring block adjacent to the current block and/or a block located at an outer corner of the current block. Here, the block located at the outer corner of the current block may be a block vertically adjacent to a neighboring block horizontally adjacent to the current block or a block horizontally adjacent to a neighboring block vertically adjacent to the current block.

예를 들면, 복원된 주변 유닛은 대상 유닛의 좌측에 위치한 유닛, 대상 유닛의 상단에 위치한 유닛, 대상 유닛의 좌측 하단 코너에 위치한 유닛, 대상 유닛의 우측 상단 코너에 위치한 유닛 또는 대상 유닛의 좌측 상단 코너에 위치한 유닛일 수 있다.For example, a restored peripheral unit may be a unit located at the left side of the target unit, a unit located at the top of the target unit, a unit located at the lower left corner of the target unit, a unit located at the upper right corner of the target unit, or a unit located at the upper left corner of the target unit. It may be a unit located in a corner.

부호화 장치(100) 및 복호화 장치(200)의 각각은 콜 픽쳐 내에서 공간적으로 현재 블록에 대응하는 위치에 존재하는 블록을 결정할 수 있고, 결정된 블록을 기준으로 기정의된 상대적인 위치를 결정할 수 있다. 기정의된 상대적인 위치는 공간적으로 현재 블록에 대응하는 위치에 존재하는 블록의 내부 및/또는 외부의 위치일 수 있다. 또한, 부호화 장치(100) 및 복호화 장치(200)의 각각은 결정된 기정의된 상대적인 위치에 기반하여 콜 블록을 도출할 수 있다. 여기서, 콜 픽쳐는 참조 픽쳐 리스트에 포함된 적어도 하나의 참조 픽쳐 중에서 하나의 픽쳐일 수 있다.Each of the encoding apparatus 100 and the decoding apparatus 200 may determine a block spatially located at a position corresponding to the current block in the collocated picture, and may determine a predefined relative position with respect to the determined block. The predefined relative position may be a position inside and/or outside a block spatially existing at a position corresponding to the current block. Also, each of the encoding apparatus 100 and the decoding apparatus 200 may derive a collocated block based on the determined relative position. Here, the collocated picture may be one picture among at least one reference picture included in the reference picture list.

참조 픽쳐 내의 블록은 복원된 참조 픽쳐 내에서 현재 블록의 위치에 공간적으로 대응되는 위치에 존재할 수 있다. 말하자면, 현재 픽쳐 내에서의 현재 블록의 위치 및 참조 픽쳐 내에서의 블록의 위치는 서로 대응할 수 있다. 이하, 참조 픽쳐에 포함된 블록의 움직임 정보는 시간적 움직임 정보로 불릴 수 있다.A block in the reference picture may exist at a position spatially corresponding to the position of the current block in the reconstructed reference picture. In other words, the position of the current block in the current picture and the position of the block in the reference picture may correspond to each other. Hereinafter, motion information of a block included in a reference picture may be referred to as temporal motion information.

움직임 정보의 도출 방식은 현재 블록의 예측 모드에 따라 변할 수 있다. 예를 들면, 인터 예측을 위해 적용되는 예측 모드로서, 향상된 움직임 벡터 예측자(Advanced Motion Vector Predictor; AMVP) 및 머지(merge) 등이 있을 수 있다.A method of deriving motion information may change according to a prediction mode of a current block. For example, as a prediction mode applied for inter prediction, there may be an Advanced Motion Vector Predictor (AMVP) and merge.

예를 들면, 예측 모드로서, AMVP가 적용되는 경우, 부호화 장치(100) 및 복호화 장치(200)의 각각은 복원된 주변 블록의 움직임 벡터 및/또는 콜 블록의 움직임 벡터를 이용하여 예측 움직임 벡터 후보 리스트를 생성할 수 있다. 복원된 주변 블록의 움직임 벡터 및/또는 콜 블록의 움직임 벡터는 예측 움직임 벡터 후보로 사용될 수 있다.For example, when AMVP is applied as a prediction mode, each of the encoding apparatus 100 and the decoding apparatus 200 uses a motion vector of a reconstructed neighboring block and/or a motion vector of a collocated block to predict motion vector candidates. You can create a list. A motion vector of a reconstructed neighboring block and/or a motion vector of a collocated block may be used as a predicted motion vector candidate.

부호화 장치(100)에 의해 생성된 비트스트림은 예측 움직임 벡터 인덱스를 포함할 수 있다. 예측 움직임 벡터 인덱스는 예측 움직임 벡터 후보 리스트에 포함된 예측 움직임 벡터 후보 중에서 선택된 최적의 예측 움직임 벡터를 지시할 수 있다. 비트스트림을 통해 예측 움직임 벡터 인덱스는 부호화 장치(100)로부터 복호화 장치(200)로 전송될 수 있다.A bitstream generated by the encoding apparatus 100 may include a predictive motion vector index. The predicted motion vector index may indicate an optimal predicted motion vector selected from motion vector predicted candidates included in the predicted motion vector candidate list. The predictive motion vector index may be transmitted from the encoding apparatus 100 to the decoding apparatus 200 through a bitstream.

복호화 장치(200)는 예측 움직임 벡터 인덱스를 이용하여 예측 움직임 벡터 후보 리스트에 포함된 예측 움직임 벡터 후보 중에서 현재 블록의 예측 움직임 벡터를 선택할 수 있다.The decoding apparatus 200 may use the predictive motion vector index to select a predictive motion vector of the current block from among the predictive motion vector candidates included in the predictive motion vector candidate list.

부호화 장치(100)는 현재 블록의 움직임 벡터 및 예측 움직임 벡터 간의 움직임 벡터 차분(MVD: Motion Vector Difference)을 계산할 수 있고, MVD를 부호화할 수 있다. 비트스트림은 부호화된 MVD를 포함할 수 있다. MVD는 비트스트림을 통해 부호화 장치(100)로부터 복호화 장치(200)로 전송될 수 있다. 이 때, 복호화 장치(200)는 수신된 MVD를 복호화할 수 있다. 복호화 장치(200)는 복호화된 MVD 및 예측 움직임 벡터의 합을 통해 현재 블록의 움직임 벡터를 도출할 수 있다.The encoding apparatus 100 may calculate a motion vector difference (MVD) between the motion vector of the current block and the predicted motion vector, and may encode the MVD. A bitstream may include an encoded MVD. The MVD may be transmitted from the encoding device 100 to the decoding device 200 through a bitstream. At this time, the decoding apparatus 200 may decode the received MVD. The decoding apparatus 200 may derive a motion vector of the current block through the sum of the decoded MVD and the predicted motion vector.

비트스트림은 참조 픽쳐를 지시하는 참조 픽쳐 인덱스 등을 포함할 수 있다. 참조 픽쳐 인덱스는 비트스트림을 통해 부호화 장치(100)로부터 복호화 장치(200)로 전송될 수 있다. 복호화 장치(200)는 주변 블록의 움직임 정보들을 이용하여 현재 블록의 움직임 벡터를 예측할 수 있고, 예측된 움직임 벡터 및 움직임 벡터 차분을 이용하여 현재 블록의 움직임 벡터를 유도할 수 있다. 복호화 장치(200)는 유도된 움직임 벡터와 참조 픽쳐 인덱스 정보에 기반하여 현재 블록에 대한 예측 블록을 생성할 수 있다.A bitstream may include a reference picture index indicating a reference picture. The reference picture index may be transmitted from the encoding apparatus 100 to the decoding apparatus 200 through a bitstream. The decoding apparatus 200 may predict a motion vector of the current block using motion information of neighboring blocks, and may derive a motion vector of the current block using the predicted motion vector and the motion vector difference. The decoding apparatus 200 may generate a prediction block for the current block based on the derived motion vector and reference picture index information.

부호화 대상 유닛 및 복호화 대상 유닛에 대해서 복원된 주변 유닛의 움직임 정보가 사용될 수 있기 때문에, 특정한 인터 예측 모드에서는, 부호화 장치(100)가 대상 유닛에 대한 움직임 정보를 별도로 부호화하지 않을 수도 있다. 대상 유닛의 움직임 정보가 부호화되지 않으면, 복호화 장치(200)로 전송되는 비트량이 감소될 수 있고, 부호화 효율이 향상될 수 있다. 예를 들면, 이러한 대상 유닛의 움직임 정보가 부호화되지 않는 인터 예측 모드로서, 스킵 모드(skip mode) 및/또는 머지 모드(merge mode) 등이 있을 수 있다. 이때, 부호화 장치(100) 및 복호화 장치(200)는 복원된 주변 유닛 중 어떤 유닛의 움직임 정보를 대상 유닛의 움직임 정보로서 사용되는지를 지시하는 식별자 및/또는 인덱스를 사용할 수 있다.Since motion information of neighboring units reconstructed for the encoding target unit and the decoding target unit can be used, the encoding apparatus 100 may not separately encode motion information for the target unit in a specific inter prediction mode. If the motion information of the target unit is not encoded, the amount of bits transmitted to the decoding apparatus 200 may be reduced and encoding efficiency may be improved. For example, as an inter prediction mode in which motion information of the target unit is not coded, there may be a skip mode and/or a merge mode. In this case, the encoding apparatus 100 and the decoding apparatus 200 may use an identifier and/or an index indicating which of the reconstructed neighboring units' motion information is used as the motion information of the target unit.

움직임 정보의 도출 방식의 다른 예로, 머지(merge)가 있다. 머지란 복수의 블록들에 대한 움직임의 병합을 의미할 수 있다. 머지는 하나의 블록의 움직임 정보를 다른 블록에도 함께 적용시키는 것을 의미할 수 있다. 머지가 적용되는 경우, 부호화 장치(100) 및 복호화 장치(200)의 각각은 복원된 주변 블록의 움직임 정보 및/또는 콜 블록의 움직임 정보를 이용하여 머지 후보 목록(merge candidate list)를 생성할 수 있다. 움직임 정보는 1) 움직임 벡터, 2) 참조 영상에 대한 인덱스, 및 3) 예측 방향 중 적어도 하나를 포함할 수 있다. 예측 방향은 단방향 또는 양방향일 수 있다.Another example of a motion information derivation method is merge. Merge may refer to merging of motions of a plurality of blocks. Merge may mean applying motion information of one block to another block as well. When merge is applied, each of the encoding device 100 and the decoding device 200 may generate a merge candidate list using motion information of a reconstructed neighboring block and/or motion information of a collocated block. have. The motion information may include at least one of 1) a motion vector, 2) an index for a reference picture, and 3) a prediction direction. The prediction direction can be unidirectional or bidirectional.

이때, 머지는 CU 단위 또는 PU 단위로 적용될 수 있다. CU 단위 또는 PU 단위로 머지가 수행되는 경우, 부호화 장치(100)는 비트스트림을 통해 기정의된 정보를 복호화 장치(200)로 전송할 수 있다. 비트스트림은 기정의된 정보를 포함할 수 있다. 기정의된 정보는, 1) 블록 파티션(partition)별로 머지를 수행할지 여부를 나타내는 정보, 2) 현재 블록에 인접한 주변 블록들 중 어떤 블록과 머지를 할 것인가에 대한 정보를 포함할 수 있다. 예를 들면, 현재 블록의 주변 블록들은 현재 블록의 좌측 인접 블록, 현재 블록의 상단 인접 블록 및 현재 블록의 시간적(temporal) 인접 블록 등을 포함할 수 있다.In this case, merge may be applied in units of CUs or units of PUs. When merging is performed in units of CUs or PUs, the encoding apparatus 100 may transmit predefined information to the decoding apparatus 200 through a bitstream. A bitstream may include predefined information. The predefined information may include 1) information indicating whether merging is to be performed for each block partition, and 2) information about which block to perform merging with among neighboring blocks adjacent to the current block. For example, neighboring blocks of the current block may include a left neighboring block of the current block, an upper neighboring block of the current block, and a temporal neighboring block of the current block.

머지 후보 목록은 움직임 정보들이 저장된 목록을 나타낼 수 있다. 또한, 머지 후보 목록은 머지가 수행되기 전에 생성될 수 있다. 머지 후보 목록에 저장되는 움직임 정보는, 1) 현재 블록에 인접한 주변 블록의 움직임 정보 또는 2) 참조 영상 에서 현재 블록에 대응하는 콜 블록(collocated block) 움직임 정보일 수 있다. 또한, 머지 후보 목록에 저장된 움직임 정보는 이미 머지 후보 목록에 존재하는 움직임 정보들의 조합에 의해 생성된 새로운 움직임 정보일 수 있다.The merge candidate list may indicate a list in which motion information is stored. Also, the merge candidate list may be generated before merging is performed. Motion information stored in the merge candidate list may be 1) motion information of neighboring blocks adjacent to the current block or 2) motion information of a collocated block corresponding to the current block in the reference image. Also, the motion information stored in the merge candidate list may be new motion information generated by a combination of motion information already existing in the merge candidate list.

스킵 모드는 주변 블록의 정보를 그대로 현재 블록에 적용하는 모드일 수 있다. 스킵 모드는 인터 예측에 이용되는 모드 중 하나일 수 있다. 스킵 모드가 사용되는 경우, 부호화 장치(100)는 어떤 블록의 움직임 정보를 현재 블록의 움직임 정보로서 이용할 것인지에 대한 정보만을 비트스트림을 통해 복호화 장치(200)에 전송할 수 있다. 부호화 장치(100)는 다른 정보는 복호화 장치(200)에 전송하지 않을 수 있다. 예를 들면, 다른 정보는 신택스(syntax) 정보일 수 있다. 신택스 정보는 움직임 벡터 차분 정보를 포함할 수 있다.
The skip mode may be a mode in which information of neighboring blocks is applied to the current block as it is. A skip mode may be one of modes used for inter prediction. When the skip mode is used, the encoding apparatus 100 may transmit only information on which block's motion information is to be used as the current block's motion information to the decoding apparatus 200 through a bitstream. The encoding device 100 may not transmit other information to the decoding device 200. For example, the other information may be syntax information. Syntax information may include motion vector difference information.

도 8은 일 실시예에 따른 인터 예측 방법의 흐름도이다.8 is a flowchart of an inter prediction method according to an embodiment.

아래에서는, 예시적으로 영상의 부호화 과정에 있어서 대상 유닛에 대한 인터 예측을 수행하는 방법이 설명된다. 예를 들면, 대상 유닛은 CU일 수 있다.Hereinafter, a method of performing inter prediction on a target unit in an image encoding process will be described as an example. For example, the target unit may be a CU.

단계(810)에서, 인터 예측부(110)는 대상 유닛의 영상 복잡도에 기반하여 대상 유닛의 타입을 결정할 수 있다.In step 810, the inter predictor 110 may determine the type of the target unit based on the image complexity of the target unit.

여기에서, 대상 유닛의 타입은 정적 영역 및 동적 영역을 포함할 수 있고, 대상 유닛은 정적 영역 및 동적 영역 중 하나로 분류될 수 있다.Here, the type of target unit may include a static region and a dynamic region, and the target unit may be classified into one of a static region and a dynamic region.

영상 복잡도는 대상 유닛 및 대응 유닛 간의 비교에 기반하여 결정될 수 있다. 말하자면, 대상 유닛을 나타내는 값 및 대응 유닛을 나타내는 값에 대한 기정의된 연산의 결과가 영상 복잡도로서 사용될 수 있다. 여기에서, 대상 유닛을 나타내는 값은 대상 유닛의 LCU일 수 있으며, 대응 유닛을 나타내는 값은 대응 유닛의 LCU일 수 있다.Image complexity may be determined based on comparison between the target unit and the corresponding unit. In other words, a result of a predefined operation on a value representing a target unit and a value representing a corresponding unit may be used as the image complexity. Here, the value representing the target unit may be the LCU of the target unit, and the value representing the corresponding unit may be the LCU of the corresponding unit.

영상 복잡도는 대상 유닛의 LCU 및 대응 유닛의 LCU의 절대 차분 값(Sum of Absolute difference; SAD)에 기반하여 결정될 수 있다. 예를 들면, 영상 복잡도는 대상 유닛의 LCU 및 대응 유닛 LCU의 SAD일 수 있으며, 대상 유닛의 LCU 및 대응 유닛 LCU의 SAD에 기반하여 생성되는 값일 수 있다.The image complexity may be determined based on a sum of absolute difference (SAD) between the LCU of the target unit and the LCU of the corresponding unit. For example, the image complexity may be the SAD of the LCU of the target unit and the LCU of the corresponding unit, and may be a value generated based on the SAD of the LCU of the target unit and the LCU of the corresponding unit.

또는, 영상 복잡도는 대상 유닛의 LCU 및 대응 유닛의 LCU의 제곱 에러 합(Sum of Squared Error; SSE)에 기반하여 결정될 수 있다. 예를 들면, 영상 복잡도는 대상 유닛의 LCU 및 대응 유닛 LCU의 SSE일 수 있으며, 대상 유닛의 LCU 및 대응 유닛 LCU의 SSE에 기반하여 생성되는 값일 수 있다.Alternatively, the image complexity may be determined based on a sum of squared error (SSE) of the LCU of the target unit and the LCU of the corresponding unit. For example, the image complexity may be the SSE of the LCU of the target unit and the LCU of the corresponding unit, and may be a value generated based on the SSE of the LCU of the target unit and the LCU of the corresponding unit.

대응 유닛은 대상 유닛에 대응하는 유닛일 수 있다. 예를 들면, 대응 유닛은 참조 픽쳐 내의 대상 유닛에 대응하는 콜 유닛일 수 있다. 현재 픽쳐 내에서의 대상 유닛의 위치 및 참조 픽쳐 내에서의 대응 유닛의 위치는 서로 대응할 수 있다.The corresponding unit may be a unit corresponding to the target unit. For example, the corresponding unit may be a call unit corresponding to a target unit in a reference picture. A position of the target unit in the current picture and a position of the corresponding unit in the reference picture may correspond to each other.

인터 예측부(110)는 대상 유닛의 영상 복잡도 및 기정의된 임계 값(Th) 간의 비교에 기반하여 대상 유닛의 타입을 결정할 수 있다. 기정의된 임계 값(Th)는 대상 유닛의 타입의 판단을 위해 최적으로 결정된 값일 수 있다. 예를 들면, 인터 예측부(110)는 대상 유닛의 영상 복잡도 또는 SAD가 기정의된 임계 값(Th)의 이상일 경우, 대상 유닛의 타입을 동적 유닛으로 결정할 수 있다. 인터 예측부(110)는 대상 유닛의 영상 복잡도 또는 SAD가 기정의된 임계 값(Th)보다 작을 경우, 대상 유닛의 타입을 정적 유닛으로 결정할 수 있다.The inter predictor 110 may determine the type of the target unit based on a comparison between the image complexity of the target unit and a predefined threshold value Th. The predefined threshold value Th may be a value optimally determined for determining the type of target unit. For example, the inter predictor 110 may determine the type of the target unit as a dynamic unit when the image complexity or SAD of the target unit is greater than or equal to a predefined threshold value Th. The inter predictor 110 may determine the type of the target unit as a static unit when the image complexity or SAD of the target unit is less than a predefined threshold value Th.

인터 예측부(110)는 대상 유닛의 영상 복잡도 및 기정의된 임계 값(Th) 간의 비교의 결과에 따라 플래그 "type flag"의 값을 설정할 수 있다.The inter predictor 110 may set the value of the flag “type flag” according to a result of comparison between the image complexity of the target unit and the predefined threshold value Th.

인터 예측부(110)는 대상 유닛의 영상 복잡도 또는 SAD가 기정의된 임계 값(Th)의 이상이면 "type flag"을 "참(true)", "논리 1" 또는 기정의된 제1 값으로 설정할 수 있다. 인터 예측부(110)는 인터 예측부(110)는 대상 유닛의 영상 복잡도 또는 SAD가 기정의된 임계 값(Th)보다 작으면 "type flag"을 "거짓(false)", "논리 0" 또는 기정의된 제2 값으로 설정할 수 있다. 예를 들면, 플래그 "type flag"는 대상 유닛의 타입을 나타낼 수 있다. "type flag"의 값이 기정의된 제1 값이면, 대상 유닛은 동적 유닛일 수 있다. "type flag"의 값이 기정의된 제2 값이면, 대상 유닛은 정적 유닛일 수 있다.The inter prediction unit 110 sets the "type flag" to "true", "logic 1" or a predefined first value if the image complexity or SAD of the target unit is greater than or equal to a predefined threshold value (Th). can be set The inter predictor 110 sets the "type flag" to "false", "logic 0", or "logic 0" if the image complexity or SAD of the target unit is less than a predefined threshold value Th. It may be set to a predefined second value. For example, the flag "type flag" may indicate the type of the target unit. If the value of “type flag” is a predefined first value, the target unit may be a dynamic unit. If the value of “type flag” is a predefined second value, the target unit may be a static unit.

대상 유닛의 슬라이스의 슬라이스 타입이 비슬라이스(Bslice)인 경우, 인터 예측부(110)는 대상 유닛의 타입을 결정함에 있어서 2개의 참조 픽쳐들을 사용할 수 있다. 대상 유닛의 슬라이스의 슬라이스 타입이 비슬라이스인 경우, 인터 예측부(110)는 대상 유닛의 LCU 및 참조 픽쳐 0의 제1 대응 유닛의 LCU의 제1 SAD를 계산할 수 있고, 대상 유닛의 LCU 및 참조 픽쳐 1의 제2 대응 유닛의 LCU의 제2 SAD를 계산할 수 있다. 인터 예측부(110)는 제1 SAD 및 제2 SAD의 평균을 대상 유닛에 대한 SAD로 사용할 수 있다.When the slice type of the slice of the target unit is Bslice, the inter prediction unit 110 may use two reference pictures in determining the type of the target unit. When the slice type of the slice of the target unit is non-slice, the inter predictor 110 may calculate the LCU of the target unit and the first SAD of the LCU of the first corresponding unit of the reference picture 0, and calculate the LCU and reference of the target unit. The second SAD of the LCU of the second corresponding unit of Picture 1 may be calculated. The inter predictor 110 may use the average of the first SAD and the second SAD as the SAD for the target unit.

대상 유닛의 타입은 초기 타입 및 최종 타입으로 분류될 수 있다. 초기 타입은 "type flag"가 나타내는 대상 유닛의 타입일 수 있다. 최종 타입은 대상 유닛의 "type flag" 및 인접 유닛의 "type flag"의 가중치가 부여된 합(weighted sum)에 기반하여 결정되는 타입일 수 있다. 인접 유닛은 대상 유닛에 대한 처리 이전에 이미 부호화 또는 복호화된, 대상 유닛에 인접한 유닛일 수 있다. 인접 유닛은 복수일 수 있다. 예를 들면, 복수의 인접한 유닛들은 대상 유닛에 좌측 상단으로 인접한 유닛, 대상 유닛에 상단으로 인접한 유닛, 대상 유닛에 우측 상단으로 인접한 유닛 및 대상 유닛에 좌측으로 인접한 유닛 중 적어도 하나를 포함할 수 있다.The type of target unit may be classified into an initial type and a final type. The initial type may be the type of the target unit indicated by the “type flag”. The final type may be a type determined based on a weighted sum of the “type flag” of the target unit and the “type flag” of adjacent units. The adjacent unit may be a unit adjacent to the target unit that has already been encoded or decoded prior to processing on the target unit. Adjacent units may be plural. For example, the plurality of adjacent units may include at least one of a unit that is upper left adjacent to the target unit, a unit that is uppermost adjacent to the target unit, a unit that is upper right adjacent to the target unit, and a unit that is left adjacent to the target unit. .

인터 예측부(110)는 대상 유닛 및 인접 유닛의 영상 복잡도들에 기반하여 대상 유닛의 최종의 타입을 결정할 수 있다. 예를 들면, 인터 예측부(110)는 대상 유닛의 "type flag" 및 인접 유닛의 "type flag"의 가중치가 부여된 합이 기정의된 제3 값의 이상이면 대상 유닛의 최종의 타입을 동적 유닛으로 결정할 수 있다. 예를 들면, 인터 예측부(110)는 대상 유닛의 "type flag" 및 인접 유닛의 "type flag"의 가중치가 부여된 합이 기정의된 제4 값보다 작으면 대상 유닛의 최종의 타입을 정적 유닛으로 결정할 수 있다.The inter predictor 110 may determine the final type of the target unit based on image complexities of the target unit and neighboring units. For example, if the weighted sum of the "type flag" of the target unit and the "type flag" of the adjacent unit is greater than or equal to a predefined third value, the inter predictor 110 dynamically determines the final type of the target unit. unit can be determined. For example, the inter prediction unit 110 statically determines the final type of the target unit when the weighted sum of the "type flag" of the target unit and the "type flag" of the adjacent unit is smaller than a predefined fourth value. unit can be determined.

단계(820)에서, 인터 예측부(110)는 결정된 타입에 따라서 대상 유닛에 대한 인터 예측을 수행할 수 있다.In step 820, the inter prediction unit 110 may perform inter prediction on the target unit according to the determined type.

인터 예측부(110)는 대상 유닛의 타입이 동적 유닛인 경우, 종래의 인터 예측을 사용하여 대상 유닛에 대한 예측을 수행할 수 있다. 여기에서, 종래의 인터 예측은 도 1 내지 도 7을 참조하여 전술된 인터 예측일 수 있다.When the type of the target unit is a dynamic unit, the inter prediction unit 110 may perform prediction on the target unit using conventional inter prediction. Here, the conventional inter prediction may be the inter prediction described above with reference to FIGS. 1 to 7 .

인터 예측부(110)는 대상 유닛의 타입이 정적 유닛인 경우, 간소화된 인터 예측을 사용하여 대상 유닛에 대한 예측을 수행할 수 있다.When the type of the target unit is a static unit, the inter predictor 110 may perform prediction on the target unit using simplified inter prediction.

인터 예측부(110)는 복수의 타입들에 대하여 서로 상이한 인터 예측들을 사용하여 대상 유닛에 대한 예측을 수행할 수 있다.The inter prediction unit 110 may perform prediction on a target unit using different inter predictions for a plurality of types.

서로 상이한 인터 예측들은, 예를 들면, 1) 대상 유닛에 대한 쿼드트리 분할의 횟수에 있어서 서로 상이하거나, 2) 참조 픽쳐의 개수에 있어서 서로 상이하거나, 3) 화면 간 예측 방향의 종류에 있어서 상이할 수 있다.Inter predictions that are different from each other are, for example, 1) different from each other in the number of quadtree splits for the target unit, 2) different from each other in the number of reference pictures, or 3) different from each other in the type of inter-prediction direction. can do.

예를 들면, 간소화된 인터 예측은 대상 유닛에 대한 쿼드트리 분할 횟수를 제한하는 인터 예측일 수 있다. 간소화된 인터 예측은 대상 유닛에 대한 쿼드트리 분할 횟수를 제한함으로써 더 큰 크기의 예측 유닛으로 부호화 및/또는 복호화를 수행하는 예측일 수 있다. 인터 예측부(110)는 정적 유닛에 대해서는 동적 유닛에 비해 더 작은 최대 깊이를 적용하여 대상 유닛에 대한 분할을 수행할 수 있다.For example, simplified inter prediction may be inter prediction that limits the number of quadtree splits for a target unit. Simplified inter prediction may be prediction in which encoding and/or decoding is performed with a prediction unit having a larger size by limiting the number of quad tree splits for the target unit. The inter predictor 110 may perform segmentation on the target unit by applying a smaller maximum depth to the static unit than to the dynamic unit.

또는, 간소화된 인터 예측은 인터 예측의 참조 픽쳐의 개수가 제한된 예측일 수 있다. 간소화된 인터 예측은 대상 유닛에 대한 참조 픽쳐의 개수를 제안하는 예측일 수 있다. 인터 예측부(110)는 정적 유닛에 대해서는 동적 유닛에 대해 더 적은 참조 픽쳐를 사용하여 대상 유닛에 대한 예측을 수행할 수 있다.Alternatively, simplified inter prediction may be prediction in which the number of reference pictures of inter prediction is limited. Simplified inter prediction may be prediction that proposes the number of reference pictures for a target unit. The inter prediction unit 110 may perform prediction on a target unit by using fewer reference pictures for a dynamic unit than for a static unit.

또는, 간소화된 인터 예측은 인터 예측의 인터 예측 방향이 제한된 예측일 수 있다. 간소화된 인터 예측은 대상 유닛에 대한 인터 예측 방향이 제한된 예측일 수 있다. 인터 예측부(110)는 정적 유닛에 대해서는 동적 유닛에 비해 더 제한된 인터 예측 방향으로 대상 유닛에 대한 예측을 수행할 수 있다.Alternatively, simplified inter prediction may be prediction in which the inter prediction direction of inter prediction is limited. Simplified inter prediction may be prediction in which an inter prediction direction for a target unit is restricted. The inter-prediction unit 110 may perform prediction on a target unit in a more limited inter-prediction direction for a static unit than for a dynamic unit.

인터 예측부(110)는 복수의 타입들에 대하여 서로 상이한 탐색 영역(search range)들을 사용하여 대상 유닛에 대한 예측을 수행할 수 있다.The inter prediction unit 110 may perform prediction on a target unit using different search ranges for a plurality of types.

예를 들면, 간소화된 인터 예측은 축소된 탐색 영역에서 탐색이 이루어지는 예측일 수 있다. 인터 예측부(110)는 정적 유닛에 대해서는 동적 유닛에 비해 더 작은 탐색 영역에서 대상 유닛에 대한 예측을 수행할 수 있다.For example, simplified inter prediction may be prediction in which a search is performed in a reduced search area. The inter predictor 110 may perform prediction on a target unit in a search area smaller than that of a dynamic unit for a static unit.

전술된 단계들(810 및 820)은 도 1을 참조하여 전술된 부호화 장치(110)의 다른 구성요소(component)에 의해 수행되는 동작들과 결합되어 수행될 수 있다.The above-described steps 810 and 820 may be performed in combination with operations performed by other components of the encoding device 110 described above with reference to FIG. 1 .

전술된 인터 예측부(110)에 의한 인터 예측부는 복호화 장치(200)의 인터 예측부(250)에 의해서도 수행될 수 있다. 전술된 단계들(810 및 820)은 인터 예측부(250)에 의해서도 수행될 수 있다. 또한, 전술된 단계들(810 및 820)은 도 2을 참조하여 전술된 복호화 장치(200)의 다른 구성요소에 의해 수행되는 동작들과 결합되어 수행될 수 있다.
The inter prediction by the aforementioned inter prediction unit 110 may also be performed by the inter prediction unit 250 of the decoding apparatus 200 . The aforementioned steps 810 and 820 may also be performed by the inter prediction unit 250 . In addition, the above-described steps 810 and 820 may be performed in combination with operations performed by other components of the decoding apparatus 200 described above with reference to FIG. 2 .

도 9는 일 예에 따른 현재 픽쳐 및 참조 픽쳐의 차분 영상을 나타낸다.9 illustrates differential images of a current picture and a reference picture according to an example.

도 9에서는, 현재 픽쳐 및 2개의 참조 픽쳐들이 도시되었다. 현재 픽쳐는 현재 부호화 또는 복호화의 대상인 대상 픽쳐일 수 있다. 현재 픽쳐는 현재 블록을 포함하는 픽쳐일 수 있다.In FIG. 9, a current picture and two reference pictures are shown. The current picture may be a target picture that is currently being encoded or decoded. The current picture may be a picture including the current block.

도 9에서, LCU_CUR은 대상 유닛을 나타낼 수 있다. LCU_AL, LCU_A, LCU_AR 및, LCU_L은 대상 유닛의 복원된 이웃 유닛들을 나타낼 수 있다.In FIG. 9 , LCU _CUR may indicate a target unit. LCU _AL , LCU _A , LCU _AR , and LCU _L may represent reconstructed neighboring units of the target unit.

현재 픽쳐에 대해 2개의 참조 픽쳐 리스트들인 L0 및 L1가 존재할 수 있다. 2개의 참조 픽쳐들 중 첫 번째 참조 픽쳐는 L0에서 선택된 픽쳐일 수 있고, 두 번째 참조 픽쳐는 L1에서 선택된 픽쳐일 수 있다.There may be two reference picture lists, L0 and L1, for the current picture. Among the two reference pictures, a first reference picture may be a picture selected in L0, and a second reference picture may be a picture selected in L1.

도 9에서, LCU^L0 _CUR은 첫 번째 참조 픽쳐의 대응 유닛을 나타낼 수 있다. LCU^L0 _AL, LCU ^L0 _A, LCU ^L0 _AR 및, LCU ^L0 _L은 첫 번째 참조 픽쳐의 대응 유닛의 복원된 이웃 유닛들을 나타낼 수 있다. LCU^L1 _CUR은 두 번째 참조 픽쳐의 대응 유닛을 나타낼 수 있다. LCU ^L1 _AL, LCU ^L1 _A, LCU ^L1 _AR 및, LCU ^L1 _L은 두 번째 참조 픽쳐의 대응 유닛의 복원된 이웃 유닛들을 나타낼 수 있다.In FIG. 9 , LCU ^L0 _CUR may indicate a corresponding unit of a first reference picture. LCU ^L0 _AL , LCU ^L0 _A , LCU ^L0 _AR and, LCU ^L0 _L may indicate reconstructed neighboring units of the corresponding unit of the first reference picture. LCU ^L1 _CUR may indicate a corresponding unit of the second reference picture. LCU ^L1 _AL , LCU ^L1A _, LCU ^L1 _AR and, LCU ^L1 _L may indicate reconstructed neighboring units of the corresponding unit of the second reference picture.

도 9에서는 2개의 차분 영상들이 도시되었다. 좌측의 차분 영상은 L0의 참조 픽쳐 및 현재 픽쳐 간의 차분 영상일 수 있다. 좌측의 차분 영상은 L0의 참조 픽쳐 및 현재 픽쳐의 SAD를 나타낼 수 있다. 우측의 차분 영상은 L1의 참조 픽쳐 및 현재 픽쳐 간의 차분 영상일 수 있다. 우측의 차분 영상은 L1의 참조 픽쳐 및 현재 픽쳐의 SAD를 나타낼 수 있다.In FIG. 9 , two difference images are shown. The left difference image may be a difference image between the reference picture of L0 and the current picture. The difference image on the left may indicate the SAD of the reference picture of L0 and the current picture. The difference image on the right side may be a difference image between the reference picture of L1 and the current picture. The difference image on the right may indicate SAD of the reference picture of L1 and the current picture.

도시된 것과 같이 현재 픽쳐는 복수 개의 CU들로 구성될 수 있으며, 부호화 또는 복호화는 래스터 스캔(raster scan)의 순서에 따라서 CU 별로 이루어질 수 있다.As shown, a current picture may be composed of a plurality of CUs, and encoding or decoding may be performed for each CU according to a raster scan order.

일반적으로, 영상의 특성에 따라 차분 영상의 CU는 아래와 같은 특성을 가질 수 있다.In general, the CU of a difference image may have the following characteristics according to characteristics of an image.

1) CU 내에 움직이는 물체(moving object)가 존재하는 경우, 시간 차에 따른 물체의 위상의 변화 및 물체 내의 세부적인 움직임의 변화에 기인하여 주로 물체의 물체 경계(object boundary) 및 에지(edge)의 영역들에서 (상대적으로 더 큰) 차분 값이 존재할 수 있다. 도 9에서, "A"로 표시된 부분은 움직이는 물체의 물체 경계 및 에지의 영역들에서 발생한 차분 값을 나타낼 수 있다.1) When there is a moving object in the CU, mainly the object boundary and edge of the object due to the change in the phase of the object and the change in detailed motion within the object with time difference There may be (relatively larger) difference values in regions. In FIG. 9 , a portion marked with “A” may represent a difference value generated in the object boundary and edge regions of the moving object.

2) CU가 정적 영역(static region) 내에 존재하는 경우, 시간 차에 따른 영상의 특성(예를 들면, 복잡도)의 변화가 없기 때문에, 대부분의 픽셀 값(또는, 차분 값)이 0에 가까운 값을 가질 수 있다. 도 9에서 "B"로 표시된 부분은, "A"로 표시된 부분 외의 영역을 나타낼 수 있고, 정적 영역을 나타낼 수 있다.2) When the CU exists in a static region, most of the pixel values (or difference values) are values close to 0 because there is no change in the characteristics (eg, complexity) of the image according to the time difference. can have A portion marked “B” in FIG. 9 may represent an area other than the portion marked “A” and may represent a static area.

전술된 영상의 일반 현상에 기초하여, 대상 유닛 내의 영상의 특성에 따라 대상 유닛이 정적 유닛 및 동적 유닛 중 하나로 분류될 수 있다.Based on the above-described general phenomena of images, a target unit may be classified into one of a static unit and a dynamic unit according to characteristics of images in the target unit.

대상 유닛의 분류를 위해, 인터 예측부(110)는 현재 픽쳐의 대상 유닛 및 참조 픽쳐의 대응 유닛 간의 픽셀 단위의 에러 값을 계산할 수 있다. 대응 유닛은 참조 픽쳐 내의 콜 블록일 수 있다.For classification of the target unit, the inter prediction unit 110 may calculate an error value in units of pixels between the target unit of the current picture and the corresponding unit of the reference picture. A corresponding unit may be a collocated block in a reference picture.

도 8을 참조하여 전술된 SAD는 픽셀 단위의 에러 값의 일 예일 수 있다. 예를 들면, 에러 값은 SAD 외에도, 제곱 에러 합(Sum of Squared Error; SSE) 등의 다른 에러 값일 수 있다. 도 8을 전술된 실시예에서, SAD는 예러 값 또는 SSE로 대체될 수 있다.The SAD described above with reference to FIG. 8 may be an example of an error value in units of pixels. For example, the error value may be other error values such as a sum of squared error (SSE) in addition to SAD. 8, SAD may be replaced with an example value or SSE.

인터 예측부(110)는 아래의 수학식 2에 기반하여 대상 블록의 SAD를 계산할 수 있다.The inter predictor 110 may calculate the SAD of the target block based on Equation 2 below.

여기에서, k는 L0의 참조 픽쳐 또는 L1의 참조 픽쳐를 나타낼 수 있다. i는 픽셀의 x 좌표를 나타낼 수 있다. j는 픽셀의 y 좌표를 나타낼 수 있다. N은 대상 유닛의 크기를 나타낼 수 있다.Here, k may indicate a reference picture of L0 or a reference picture of L1. i may represent the x-coordinate of a pixel. j may represent the y-coordinate of a pixel. N may represent the size of the target unit.

현재 픽쳐의 각 CU는 1비트의 "type_flag"를 가질 수 있다. CU에 대해 계산된 SAD의 값이 기정의된 SAD 임계 값(Th_SAD)의 이상인 경우, CU는 동적 유닛으로 결정될 수 있다. 이 경우, type flag"의 값은 "참(true)", "논리 1" 또는 기정의된 제1 값으로 설정될 수 있다. CU에 대해 계산된 SAD의 값이 기정의된 SAD 임계 값(Th_SAD)보다 작은 경우, CU는 정적 유닛으로 결정될 수 있다. 이 경우, type flag"의 값은 거짓(false)", "논리 0" 또는 기정의된 제2 값으로 설정될 수 있다.
Each CU of the current picture may have a 1-bit "type_flag". If the value of the SAD calculated for the CU is greater than or equal to a predefined SAD threshold value (Th _SAD ), the CU may be determined as a dynamic unit. In this case, the value of "type flag" may be set to "true", "logic 1" or a predefined first value. The SAD value calculated for the CU is a predefined SAD threshold value (Th _SAD ), the CU may be determined as a static unit. In this case, the value of "type flag" may be set to "false", "logic 0" or a predefined second value.

도 10은 일 예에 따른 대상 유닛 및 복원된 이웃 유닛들의 타입 플래그들을 나타낸다.10 illustrates type flags of a target unit and reconstructed neighboring units according to an example.

도 10에서, TF_CUR은 대상 유닛의 "type flag"를 나타낼 수 있다. TF_AL는 대상 유닛에 좌측 상단으로 인접한 인접 유닛의 "type flag"를 나타낼 수 있다. TF_A는 대상 유닛에 상단으로 인접한 인접 유닛의 "type flag"를 나타낼 수 있다. TF_AR는 대상 유닛에 우측 상단으로 인접한 유닛의 "type flag"를 나타낼 수 있다. TF_L는 대상 유닛에 좌측으로 인접한 유닛의 "type flag"를 나타낼 수 있다.In FIG. 10, TF _CUR may indicate a “type flag” of a target unit. TF _AL may indicate a “type flag” of an adjacent unit adjacent to the upper left of the target unit. TF _A may indicate a “type flag” of an adjacent unit uppermost adjacent to the target unit. TF _AR may indicate a “type flag” of a unit adjacent to the upper right of the target unit. TF _L may indicate a “type flag” of a unit adjacent to the left of the target unit.

전술된 상대적으로 단순한 대상 유닛의 타입의 결정 방식 외에도, 인터 예측부(110)는 인접 유닛들 및 대상 유닛 간의 상관도를 고려하여 보다 신뢰성 있는 대상 유닛의 타입의 결정 방식을 제공할 수 있다.In addition to the relatively simple method of determining the type of the target unit described above, the inter predictor 110 may provide a more reliable method of determining the type of the target unit by considering correlation between adjacent units and the target unit.

예를 들면, 일반적인 자연 영상에 있어서, 카메라의 움직임이 없음이 가정되었을 때, 배경에 존재하는 정적 영역은 넓은 범위에 걸쳐 나타나는 경향이 있다. 따라서, 대상 유닛의 타입의 결정에 있어서, 대상 유닛의 타입은 기 결정된 인접 유닛들의 타입들과 큰 상관도를 가질 수 있다.For example, in a general natural video, when it is assumed that there is no movement of a camera, a static area existing in the background tends to appear over a wide range. Accordingly, in determining the type of the target unit, the type of the target unit may have a high degree of correlation with the previously determined types of neighboring units.

이러한 상관도를 고려하기 위해, 인터 예측부(110)는 대상 유닛의 "type_flag" 및 인접 유닛들의 "type_flag"의 가중치가 부여된 합 "flag_sum"을 계산할 수 있다.In order to consider this correlation, the inter predictor 110 may calculate a weighted sum "flag_sum" of the "type_flag" of the target unit and the "type_flag" of adjacent units.

인터 예측부(110)는 아래의 수학식 3에 기반하여 대상 유닛의 가중치가 부여된 합 "flag_sum"을 계산할 수 있다.The inter prediction unit 110 may calculate the weighted sum of the target unit “flag_sum” based on Equation 3 below.

TF_i는 유닛 i의 "type_flag"를 나타낼 수 있다. i는 대상 유닛을 나타내는 CUR, 상단 좌측 인접(Above Left) 유닛을 나타내는 AL, 상단(Above) 인접 유닛을 나타내는 A, 상단 우측(Above Right) 인접 유닛을 나타내는 AR 및 좌측(Left) 인접 유닛을 나타내는 L 중 하나일 수 있다. K는 가용한 인접 유닛의 개수일 수 있다. w_i는 유닛 i의 가중치일 수 있다.TF _i may indicate "type_flag" of unit i. i represents CUR representing the target unit, AL representing the above left adjacent unit, A representing the above adjacent unit, AR representing the above right adjacent unit, and representing the left adjacent unit. may be one of L. K may be the number of available adjacent units. w _i may be the weight of unit i.

대상 유닛 및 인접 유닛의 가중치들은 서로 상이한 기정의된 값들일 수 있다. 예를 들면, 대상 유닛 및 좌측 인접 유닛 간의 상관도가 가장 높다는 가정하에, TF_i의 가중치는 w_cur=0.5, w_L=0.2 및 w_AL=w_A= w_AR=0.1 로 결정될 수 있다. 예를 들면, 이웃 유닛 중 대상 유닛과의 상관도가 더 높은 유닛이 더 높은 가중치를 가질 수 있다.The weights of the target unit and the neighboring unit may be different predefined values. For example, under the assumption that the correlation between the target unit and the left adjacent unit is the highest, the weight of TF _i may be determined as w _cur =0.5, w _L =0.2, and w _AL =w _A = w _AR =0.1. For example, a unit having a higher correlation with the target unit among neighboring units may have a higher weight.

인터 예측부(110)는 가중치가 부여된 합 "flag_sum"을 사용하여 대상 유닛의 최종의 타입을 결정할 수 있다.The inter predictor 110 may determine the final type of the target unit using the weighted sum "flag_sum".

인터 예측부(110)는 아래의 수학식 4에 기반하여 대상 유닛의 최종의 타입을 결정할 수 있다.The inter predictor 110 may determine the final type of the target unit based on Equation 4 below.

"type_flag"는 대상 유닛의 최종의 타입을 나타낼 수 있다. Th는 기정의된 임계 값일 수 있다."type_flag" may indicate the final type of the target unit. Th may be a predefined threshold value.

대상 유닛의 슬라이스의 슬라이스 타입이 비슬라이스(Bslice)인 경우, 인터 예측부(110)는 대상 유닛의 타입을 결정함에 있어서 좌측의 참조 픽쳐 및 우측의 참조 픽쳐를 사용할 수 있다. 인터 예측부(110)는 대상 유닛의 LCU 및 좌측 참조 픽쳐의 대응 유닛의 LCU 간의 SAD1를 계산할 수 있고, 대상 유닛의 LCU 및 우측 참조 픽쳐의 대응 유닛의 LCU의 SAD2를 계산할 수 있다. 인터 예측부(110)는 SAD1 및 SAD1의 평균을 대상 유닛에 대한 SAD로 사용할 수 있다. 인터 예측부(110)는 아래의 수학식 5와 같이 1비트의 우측 산술 쉬프트 연산을 사용하여 SAD를 계산할 수 있다.When the slice type of the slice of the target unit is Bslice, the inter prediction unit 110 may use the left reference picture and the right reference picture in determining the type of the target unit. The inter prediction unit 110 may calculate SAD1 between the LCU of the target unit and the LCU of the corresponding unit of the left reference picture, and calculate SAD2 of the LCU of the target unit and the LCU of the corresponding unit of the right reference picture. The inter predictor 110 may use SAD1 and the average of SAD1 as the SAD for the target unit. The inter predictor 110 may calculate SAD using a 1-bit right arithmetic shift operation as shown in Equation 5 below.

SAD가 계산되면, 인터 예측부(110)는 아래의 수학식 6에 기반하여 대상 유닛의 "type flae"인 TF를 결정할 수 있다.When the SAD is calculated, the inter predictor 110 may determine a TF that is a "type flae" of the target unit based on Equation 6 below.

도 3을 참조하여 전술된 것과 같이, 종래의 인터 예측 기술이 적용될 경우, 각 CU는 기정의된 깊이까지 재귀적으로 분할될 수 있다. 이때, 깊이 정보는 CU의 크기를 나타낼 수 있으며, CU 별로 저장될 수 있다. 예컨대, LCU의 깊이는 0일 수 있고, SCU의 깊이는 기정의된 최대 깊이일 수 있다. CU가 정적 유닛인 경우, CU가 나타내는 영상은 주로 평활한(homogeneous) 영역이거나 점진적으로 변화하는 (gradually changing) 특성을 가질 수 있다. 따라서, 정적 유닛인 CU들은 주로 큰 예측 유닛으로 부호화 되는 경향이 나타낼 수 있다. 종래의 인터 예측 기술이 사용될 경우, CU가 필요 이상으로 세분화된 다수의 예측 블록으로 부호화 될 가능성이 있다.As described above with reference to FIG. 3, when the conventional inter prediction technique is applied, each CU may be recursively divided to a predefined depth. In this case, the depth information may indicate the size of the CU and may be stored for each CU. For example, the depth of the LCU may be 0, and the depth of the SCU may be a predefined maximum depth. When the CU is a static unit, an image represented by the CU may have a mostly homogeneous area or a gradually changing characteristic. Therefore, CUs that are static units may tend to be encoded as large prediction units. When the conventional inter-prediction technique is used, there is a possibility that a CU is coded into a plurality of prediction blocks subdivided more than necessary.

CU가 필요 이상으로 분할될 경우, 각각의 하위 예측 유닛에 대하여 움직임 정보가 전송되어야 하기 때문에 부호화 효율이 감소할 수 있다. 또한, 정의된 최대 깊이까지 CU에 대한 재귀적 분할 및 예측이 수행되어야 하기 때문에 부호화 장치(100)의 복잡도가 증가할 수 있다.When a CU is divided more than necessary, encoding efficiency may decrease because motion information must be transmitted for each lower prediction unit. In addition, complexity of the encoding apparatus 100 may increase because recursive division and prediction of the CU must be performed up to the defined maximum depth.

인터 예측부(110)는 대상 유닛의 타입에 따라서, 복수의 타입들에 대해 서로 상이한 최대 깊이를 적용하여 대상 유닛을 분할할 수 있다. 여기에서, 인터 예측부(110)는 복수의 타입들 중 정적 유닛의 최대 깊이를 동적 유닛의 최대 깊이보다 더 작게 설정할 수 있다.The inter predictor 110 may divide the target unit by applying different maximum depths to a plurality of types according to the type of the target unit. Here, the inter prediction unit 110 may set a maximum depth of a static unit smaller than a maximum depth of a dynamic unit among a plurality of types.

말하자면, 인터 예측부(110)는 정적 유닛에 대해서는 분할을 위해 기정의된 최대 깊이의 정보를 감소시킴으로써 정적 유닛이 큰 크기의 예측 유닛으로만 부호화되도록 할 수 있다. 인터 예측부(110)는 정적 유닛을 큰 크기의 예측 유닛으로만 부호화되도록 함으로써 부호화 효율을 향상시킬 수 있고, 부호화 장치(100)의 복잡도를 감소시킬 수 있다. In other words, the inter-prediction unit 110 may reduce information of a predefined maximum depth for segmentation of the static unit so that the static unit is coded only as a prediction unit having a large size. The inter predictor 110 can improve encoding efficiency and reduce the complexity of the encoding apparatus 100 by encoding static units only with large prediction units.

예를 들면, 재귀적 분할을 위한 최대 깊이가 N일 때, 정적 유닛에 대한 최대 깊이는 NL일 수 있다. 여기에서, N은 1 이상의 정수일 수 있다. L은 0 이상, N1 이하의 정수일 수 있다. 또는, L은 1 및 N 사이의 값을 가질 수 있다.For example, when the maximum depth for recursive segmentation is N, the maximum depth for static units may be NL. Here, N may be an integer of 1 or greater. L may be an integer greater than or equal to 0 and less than or equal to N1. Alternatively, L may have a value between 1 and N.

인터 예측부(110)는 적응적으로 L의 값을 결정할 수 있다. 예를 들면, 인터 예측부(110)는 수학식 3의 flag_sum의 값에 기반하여 L의 값을 결정할 수 있다. 모든 가중치 w_i의 값이 1이라는 가정 하에서, L의 값은 N(Mflag_sum)와 같이 계산될 수 있다. 말하자면, flag_sum의 값이 5인 경우 L=N이 성립하고, flag_sum의 값이 4인 경우 L=N1이 성립하고, flag_sum의 값이 3인 경우 L=N2 등으로 결정될 수 있다. 여기에서, M은, flag_sum을 계산하기 위해 사용되는 유닛들의 개수, 또는 TF_i의 개수일 수 있다. 예를 들면, 인접한 유닛들이 상단 좌측 인접 유닛, 상단 인접 유닛, 우측 인접 유닛, 좌측 유닛일 경우, M의 값은 5일 수 있다.The inter predictor 110 may adaptively determine the value of L. For example, the inter predictor 110 may determine the value of L based on the value of flag_sum in Equation 3. Under the assumption that the values of all weights w _i are 1, the value of L can be calculated as N(Mflag_sum). In other words, when the value of flag_sum is 5, L=N holds, when the value of flag_sum is 4, L=N1 holds, and when the value of flag_sum is 3, L=N2, etc. may be determined. Here, M may be the number of units used to calculate flag_sum or the number of TF _i . For example, when the adjacent units are a top left adjacent unit, a top adjacent unit, a right adjacent unit, and a left unit, the value of M may be 5.

종래의 인터 예측 기술이 적용될 경우, 움직임 벡터의 값의 범위는 미리 정의된 탐색 영역(Search Range; SR)에 의해 결정될 수 있다. 또한, 부호화 장치(100)로부터 복호화 장치(200)로 전송되는 실제의 정보는 결정된 MV 및 인접한 블록의 MV 간의 움직임 벡터 차분 값(Motion Vector Difference; MVD)일 수 있다.When the conventional inter-prediction technique is applied, the motion vector value range may be determined by a predefined search range (SR). Also, actual information transmitted from the encoding apparatus 100 to the decoding apparatus 200 may be a motion vector difference (MVD) between the determined MV and an MV of an adjacent block.

정적 유닛의 경우, 인접 블록과의 상관도(correlation)이 높기 때문에 상대적으로 작은 값을 갖는 MV가 결정되는 경향이 있다. 따라서, 정적 부호화 유닛에 대해서는 정의된 SR 보다 줄어든 SR을 사용함으로써, 움직임 예측을 위한 부호화 장치(100)의 복잡도를 줄임과 동시에 움직임 정보의 전송량을 줄여 부호화 효율을 높일 수 있다.In the case of a static unit, since correlation with an adjacent block is high, an MV having a relatively small value tends to be determined. Therefore, by using a SR that is smaller than the defined SR for the static coding unit, the complexity of the encoding apparatus 100 for motion prediction can be reduced and the transmission amount of motion information can be reduced to increase coding efficiency.

예를 들면, 일반적인 유닛 또는 동적 유닛에 대해 기정의된 SR의 크기가 N일 때, 정적 유닛에 대한 SR의 크기는 NL일 수 있다. 여기에서, N은 2 이상의 정수일 수 있다. N은 1 이상 N 보다 작은 정수일 수 있다.For example, when the size of SR predefined for a normal unit or dynamic unit is N, the size of SR for a static unit may be NL. Here, N may be an integer of 2 or greater. N may be an integer greater than or equal to 1 and smaller than N.

인터 예측부(110)는 적응적으로 L의 값을 결정할 수 있다. 예를 들면, 인터 예측부(110)는 수학식 3의 flag_sum의 값에 기반하여 L의 값을 결정할 수 있다. 예를 들면, 인터 예측부(110)는 flag_sum의 값이 더 클수록 L의 값이 더 크도록 설정할 수 있다. 모든 가중치 w_i의 값이 1이라는 가정 하에서, L의 값은 N(Mflag_sum)와 같이 계산될 수 있다. 말하자면, flag_sum의 값이 5인 경우 L=의 값은 3*N/4일 수 있다. flag_sum의 값이 4인 경우 L의 값은 N/2일 수 있다. flag_sum의 값이 3의 이하인 경우 L의 값은 N/4일 수 있다.
The inter predictor 110 may adaptively determine the value of L. For example, the inter predictor 110 may determine the value of L based on the value of flag_sum in Equation 3. For example, the inter predictor 110 may set the value of L to increase as the value of flag_sum increases. Under the assumption that the values of all weights w _i are 1, the value of L can be calculated as N(Mflag_sum). In other words, when the value of flag_sum is 5, the value of L= may be 3*N/4. When the value of flag_sum is 4, the value of L may be N/2. When the value of flag_sum is 3 or less, the value of L may be N/4.

도 11은 일 실시예에 따른 부호화 장치 및/또는 복호화 장치를 구현하는 전자 장치의 구조도이다.11 is a structural diagram of an electronic device implementing an encoding device and/or a decoding device according to an embodiment.

일 실시예에 따르면, 부호화 장치(100)의 인터 예측부(110), 인트라 예측부(120), 스위치(115), 감산기(125), 변환부(130), 양자화부(140), 엔트로피 부호화부(150), 역양자화부(160), 역변환부(170), 가산기(175), 필터부(180) 및 참조 픽쳐 버퍼(190)의 적어도 일부는 프로그램 모듈들일 수 있으며, 외부의 장치 또는 시스템과 통신할 수 있다. 프로그램 모듈들은 운영 체제, 응용 프로그램 모듈 및 기타 프로그램 모듈의 형태로 부호화 장치(100)에 포함될 수 있다.According to an embodiment, the inter prediction unit 110, the intra prediction unit 120, the switch 115, the subtractor 125, the transform unit 130, the quantization unit 140, and the entropy encoding of the encoding apparatus 100 At least some of the unit 150, the inverse quantization unit 160, the inverse transform unit 170, the adder 175, the filter unit 180, and the reference picture buffer 190 may be program modules, and may be external devices or systems. can communicate with Program modules may be included in the encoding device 100 in the form of an operating system, application program modules, and other program modules.

또한 일 실시예에 따르면, 복호화 장치(200)의 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 인터 예측부(250), 가산기(255), 필터부(260) 및 참조 픽쳐 버퍼(270) 의 적어도 일부는 프로그램 모듈들일 수 있으며, 외부의 장치 또는 시스템과 통신할 수 있다. 프로그램 모듈들은 운영 체제, 응용 프로그램 모듈 및 기타 프로그램 모듈의 형태로 복호화 장치(200)에 포함될 수 있다.Also, according to an embodiment, the entropy decoding unit 210, the inverse quantization unit 220, the inverse transform unit 230, the intra prediction unit 240, the inter prediction unit 250, and the adder 255 of the decoding apparatus 200 ), the filter unit 260, and at least a part of the reference picture buffer 270 may be program modules, and may communicate with an external device or system. Program modules may be included in the decryption device 200 in the form of an operating system, application program modules, and other program modules.

프로그램 모듈들은 물리적으로는 여러 가지 공지의 기억 장치 상에 저장될 수 있다. 또한, 이러한 프로그램 모듈 중 적어도 일부는 부호화 장치(100)와 통신 가능한 원격 기억 장치 또는 복호화 장치(200)와 통신 가능한 원격 기억 장치 에 저장될 수도 있다.Program modules may be physically stored on various known storage devices. In addition, at least some of these program modules may be stored in a remote storage device capable of communicating with the encoding device 100 or a remote storage device capable of communicating with the decoding device 200.

프로그램 모듈들은 일 실시예에 따른 기능 또는 동작을 수행하거나, 일 실시예에 따른 추상 데이터 유형을 구현하는 루틴(routine), 서브루틴(subroutine), 프로그램, 오브젝트(object), 컴포넌트(component) 및 데이터 구조(data structure) 등을 포괄할 수 있지만, 이에 제한되지는 않는다.Program modules include routines, subroutines, programs, objects, components, and data that perform functions or operations according to an embodiment or implement abstract data types according to an embodiment. A data structure, etc. may be encompassed, but is not limited thereto.

프로그램 모듈들은 부호화 장치(100)의 적어도 하나의 프로세서(processor) 또는 복호화 장치(200)의 적어도 하나의 프로세서에 의해 수행되는 명령어(instruction) 또는 코드(code)로 구성될 수 있다.The program modules may include instructions or codes executed by at least one processor of the encoding device 100 or at least one processor of the decoding device 200 .

부호화 장치(100) 및/또는 복호화 장치(200)는 도 10에서 도시된 전자 장치(1100)로서 구현될 수 있다. 전자 장치(1100)는 부호화 장치(100) 및/또는 복호화 장치(200)로서 동작하는 범용의 컴퓨터 시스템일 수 있다.The encoding device 100 and/or the decoding device 200 may be implemented as the electronic device 1100 shown in FIG. 10 . The electronic device 1100 may be a general-purpose computer system that operates as the encoding device 100 and/or the decoding device 200.

도 10에서 도시된 바와 같이, 전자 장치(1100)는 버스(1190)를 통하여 서로 통신하는 적어도 하나의 처리부(processing unit)(1110), 메모리(1130), 사용자 인터페이스(User Interface; UI) 입력 디바이스(1150), UI 출력 디바이스(1160) 및 저장소(1140)를 포함할 수 있다. 또한, 전자 장치(1100)는 네트워크(1199)에 연결되는 통신부(1120)를 더 포함할 수 있다. 처리부(1110)는 중앙 처리 장치(Central Processing Unit; CPU), 메모리(1130) 또는 저장소(1140)에 저장된 프로세싱(processing) 명령어(instruction)들을 실행하는 반도체 장치일 수 있다. 메모리(1130) 및 저장소(1140)는 다양한 형태의 휘발성 또는 비휘발성 저장 매체일 수 있다. 예를 들어, 메모리는 롬(ROM)(1131) 및 램(RAM)(1132) 중 적어도 하나를 포함할 수 있다.As shown in FIG. 10 , the electronic device 1100 includes at least one processing unit 1110, a memory 1130, and a user interface (UI) input device communicating with each other through a bus 1190. 1150 , a UI output device 1160 and a storage 1140 . In addition, the electronic device 1100 may further include a communication unit 1120 connected to the network 1199 . The processing unit 1110 may be a semiconductor device that executes processing instructions stored in a central processing unit (CPU), memory 1130 or storage 1140 . Memory 1130 and storage 1140 may be various forms of volatile or non-volatile storage media. For example, the memory may include at least one of a ROM (ROM) 1131 and a RAM (RAM) 1132 .

부호화 장치(100) 및/또는 복호화 장치(200)는 컴퓨터에 의해 독출(read)될 수 있는 기록 매체를 포함하는 컴퓨터 시스템에서 구현될 수 있다.The encoding device 100 and/or the decoding device 200 may be implemented in a computer system including a recording medium readable by a computer.

기록 매체는 전자 장치(1100)가 부호화 장치(100) 및/또는 복호화 장치(200)로서 동작하기 위해 요구되는 적어도 하나의 모듈을 저장할 수 있다. 메모리(1130)는 적어도 하나의 모듈을 저장할 수 있고, 적어도 하나의 처리부(1110)에 의하여 실행되도록 구성될 수 있다.The recording medium may store at least one module required for the electronic device 1100 to operate as the encoding device 100 and/or the decoding device 200. The memory 1130 may store at least one module and may be configured to be executed by at least one processing unit 1110 .

부호화 장치(100) 및/또는 복호화 장치(200)의 데이터 또는 정보의 통신과 관련된 기능은 통신부(1120)를 통해 수행될 수 있다.
A function related to communication of data or information of the encoding device 100 and/or the decoding device 200 may be performed through the communication unit 1120 .

상술한 실시예들에서, 방법들은 일련의 단계 또는 유닛으로서 순서도를 기초로 설명되고 있으나, 본 발명은 단계들의 순서에 한정되는 것은 아니며, 어떤 단계는 상술한 바와 다른 단계와 다른 순서로 또는 동시에 발생할 수 있다. 또한, 당해 기술 분야에서 통상의 지식을 가진 자라면 순서도에 나타난 단계들이 배타적이지 않고, 다른 단계가 포함되거나, 순서도의 하나 또는 그 이상의 단계가 본 발명의 범위에 영향을 미치지 않고 삭제될 수 있음을 이해할 수 있을 것이다.In the foregoing embodiments, the methods are described on the basis of a flow chart as a series of steps or units, but the present invention is not limited to the order of steps, and some steps may occur in a different order or concurrently with other steps as described above. can In addition, those skilled in the art will understand that the steps shown in the flow chart are not exclusive, that other steps may be included, or that one or more steps of the flow chart may be deleted without affecting the scope of the present invention. You will understand.

이상 설명된 본 발명에 따른 실시예들은 다양한 컴퓨터 구성요소를 통하여 수행될 수 있는 프로그램 명령어의 형태로 구현되어 컴퓨터 판독 가능한 기록 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능한 기록 매체는 프로그램 명령어, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 컴퓨터 판독 가능한 기록 매체에 기록되는 프로그램 명령어는 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 분야의 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능한 기록 매체의 예에는, 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체, CDROM, DVD와 같은 광기록 매체, 플롭티컬 디스크(floptical disk)와 같은 자기광 매체(magnetooptical media), 및 ROM, RAM, 플래시 메모리 등과 같은 프로그램 명령어를 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령어의 예에는, 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드도 포함된다. 상기 하드웨어 장치는 본 발명에 따른 처리를 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.Embodiments according to the present invention described above may be implemented in the form of program instructions that can be executed through various computer components and recorded on a computer-readable recording medium. The computer readable recording medium may include program instructions, data files, data structures, etc. alone or in combination. Program instructions recorded on the computer-readable recording medium may be specially designed and configured for the present invention, or may be known and usable to those skilled in the art of computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks and magnetic tapes, optical recording media such as CDROMs and DVDs, magnetooptical media such as floptical disks, and ROMs. Hardware devices specially configured to store and execute program instructions, such as RAM, flash memory, and the like. Examples of program instructions include high-level language codes that can be executed by a computer using an interpreter or the like as well as machine language codes such as those produced by a compiler. The hardware device may be configured to act as one or more software modules to perform processing according to the present invention and vice versa.

이상에서 본 발명이 구체적인 구성요소 등과 같은 특정 사항들과 한정된 실시예 및 도면에 의해 설명되었으나, 이는 본 발명의 보다 전반적인 이해를 돕기 위해서 제공된 것일 뿐, 본 발명이 상기 실시예들에 한정되는 것은 아니며, 본 발명이 속하는 기술분야에서 통상적인 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형을 꾀할 수 있다.In the above, the present invention has been described by specific details such as specific components and limited embodiments and drawings, but these are provided to help a more general understanding of the present invention, and the present invention is not limited to the above embodiments. , Those skilled in the art to which the present invention pertains may seek various modifications and variations from these descriptions.

따라서, 본 발명의 사상은 상기 설명된 실시예에 국한되어 정해져서는 아니 되며, 후술하는 특허청구범위뿐만 아니라 이 특허청구범위와 균등하게 또는 등가적으로 변형된 모든 것들은 본 발명의 사상의 범주에 속한다고 할 것이다.Therefore, the spirit of the present invention should not be limited to the above-described embodiments, and not only the claims described later, but also all modifications equivalent or equivalent to these claims belong to the scope of the spirit of the present invention. will do it

Claims

determining a type of a target unit in inter prediction; and
performing inter prediction on the target unit according to the type;
including,
The inter prediction is performed based on a sum operation of absolute differences based on corresponding units corresponding to the target unit,
The type is designated in response to whether or not a result of the sum operation of the absolute difference is greater than or equal to a threshold value,
The type determines a search region for the target unit and affects segmentation of the target unit.

delete

determining a type of a target unit in inter prediction; and
performing inter prediction on the target unit according to the type;
including,
The decoding method of claim 1 , wherein the determination of the type is performed based on a sum operation of absolute differences based on a first corresponding unit in a first reference picture and a second corresponding unit in a second reference picture.

delete

A computer-readable recording medium storing a bitstream for decoding of a target unit,
The bitstream is
Encoded information about the target unit
including,
The decoding of the target unit is performed using the encoded information;
In the decoding, the type of the target unit is determined in inter prediction,
Inter prediction is performed on the target unit according to the type,
The inter prediction is performed based on a sum operation of absolute differences based on corresponding units corresponding to the target unit,
The type is designated in response to whether or not a result of the sum operation of the absolute difference is greater than or equal to a threshold value,
The type determines a search area for the target unit and affects segmentation for the target unit.

delete

A computer-readable recording medium storing a bitstream for decoding of a target unit,
The bitstream is
Encoded information about the target unit
including,
The decoding of the target unit is performed using the encoded information;
In the decoding, the type of the target unit is determined in inter prediction,
Inter prediction is performed on the target unit according to the type,
Wherein the determination of the type is performed based on a sum operation of absolute differences based on a first corresponding unit in a first reference picture and a second corresponding unit in a second reference picture.

generating a bitstream including encoding information through encoding of an input image; and
outputting the bitstream
including,
In performing decoding using inter prediction on a target unit using the bitstream, the inter prediction is performed based on a sum operation of absolute differences based on a corresponding unit corresponding to the target unit, and the type is the The bit, which is specified corresponding to whether a result of a sum operation of absolute differences is greater than or equal to a threshold value, the type determines a search region for the target unit, and the type affects division for the target unit. An encoding method that creates a stream.

generating a bitstream including encoding information through encoding of an input image; and
outputting the bitstream
including,
In performing decoding using inter prediction for a target unit using the bitstream, the determination of the type is an absolute difference based on a first corresponding unit in a first reference picture and a second corresponding unit in a second reference picture. An encoding method for generating the bitstream, which is performed based on a sum operation.