KR102400485B1

KR102400485B1 - Video decoding method and apparatus using the same

Info

Publication number: KR102400485B1
Application number: KR1020210126501A
Authority: KR
Inventors: 이진호; 강정원; 이하현; 최진수; 김진웅
Original assignee: 한국전자통신연구원
Priority date: 2013-07-09
Filing date: 2021-09-24
Publication date: 2022-05-20
Also published as: KR20210120948A; KR102222311B1; KR102307406B1; KR20150008337A; KR20210025553A

Abstract

본 발명의 일 실시예에 따른 영상 복호화 방법은 복호화 대상이 되는 제2 계층 픽쳐가 참조할 수 있는 제1 계층 픽쳐에 적용될 필터의 종류를 결정하는 필터 종류 결정 단계와; 상기 제1 계층 픽쳐에서 상기 필터가 적용될 적용 대상을 결정하는 적용 대상 결정 단계와; 결정된 필터 종류에 기초하여 필터 적용 대상에 필터링을 수행하는 단계와; 필터링 된 제1 계층 픽쳐를 제2 계층 참조 픽쳐 리스트에 추가하는 단계를 포함할 수 있다. 이로써, 상위 계층에서 예측 오차를 줄이고 부호화 효율을 향상시키는 영상의 복호화 방법 및 이를 이용하는 장치가 제공된다.An image decoding method according to an embodiment of the present invention includes: a filter type determining step of determining a type of filter to be applied to a first hierarchical picture that can be referenced by a second hierarchical picture to be decoded; an application target determining step of determining an application target to which the filter is to be applied in the first hierarchical picture; performing filtering on a filter application target based on the determined filter type; It may include adding the filtered first layer picture to the second layer reference picture list. Accordingly, an image decoding method and apparatus using the same are provided for reducing prediction errors in an upper layer and improving encoding efficiency.

Description

Video decoding method and apparatus using the same

본 발명은 영상의 부호화 및 복호화 처리에 관한 것으로서, 보다 상세하게는 비트스트림 내 복수의 레이어를 지원하는 영상의 부호화 및 복호화 방법 및 장치에 관한 것이다.The present invention relates to image encoding and decoding processing, and more particularly, to a method and apparatus for encoding and decoding an image supporting a plurality of layers in a bitstream.

최근 HD(High Definition) 해상도를 가지는 방송 서비스가 국내뿐만 아니라 세계적으로 확대되면서, 많은 사용자들이 고해상도, 고화질의 영상에 익숙해지고 있으며 이에 따라 많은 기관들이 차세대 영상기기에 대한 개발에 박차를 가하고 있다. 또한 HDTV와 더불어 HDTV의 4배 이상의 해상도를 갖는 UHD(Ultra High Definition)에 대한 관심이 증대되면서 보다 높은 해상도, 고화질의 영상에 대한 압축기술이 요구되고 있다.Recently, as broadcasting services with high definition (HD) resolution are expanding not only domestically but also worldwide, many users are getting used to high-resolution and high-definition images, and accordingly, many institutions are spurring the development of next-generation imaging devices. Also, along with HDTV, as interest in Ultra High Definition (UHD) having a resolution four times higher than that of HDTV is increasing, compression technology for higher resolution and high-definition images is required.

영상 압축을 위해, 시간적으로 이전 및/또는 이후의 픽쳐로부터 현재 픽쳐에 포함된 화소값을 예측하는 인터(inter) 예측 기술, 현재 픽쳐 내의 화소 정보를 이용하여 현재 픽쳐에 포함된 화소값을 예측하는 인트라(intra) 예측 기술, 출현 빈도가 높은 심볼(symbol)에 짧은 부호를 할당하고 출현 빈도가 낮은 심볼에 긴 부호를 할당하는 엔트로피 부호화 기술 등이 사용될 수 있다.For image compression, inter prediction technology that predicts pixel values included in the current picture from temporally previous and/or subsequent pictures, predicts pixel values included in the current picture using pixel information in the current picture An intra prediction technique, an entropy encoding technique in which a short code is assigned to a symbol having a high frequency of occurrence and a long code is assigned to a symbol having a low frequency of occurrence, etc. may be used.

영상 압축 기술에는 유동적인 네트워크 환경을 고려하지 않고 하드웨어의 제한적인 동작 환경하에서 일정한 네트워크 대역폭을 제공하는 기술이 있다. 그러나 수시로 대역폭이 변화하는 네트워크 환경에 적용되는 영상 데이터를 압축하기 위해서는 새로운 압축 기술이 요구되고, 이를 위해 스케일러블(scalable) 비디오 부호화/복호화 방법이 사용될 수 있다.In video compression technology, there is a technology that provides a constant network bandwidth under a limited operating environment of hardware without considering a flexible network environment. However, a new compression technique is required to compress image data applied to a network environment in which bandwidth is frequently changed, and for this purpose, a scalable video encoding/decoding method may be used.

본 발명은 계층간 필터링을 이용한 영상 부호화/복호화 방법 및 그 장치를 제공한다.The present invention provides a video encoding/decoding method and apparatus using inter-layer filtering.

또한 본 발명은 상위 계층의 참조 픽쳐 리스트에 하위 계층의 픽쳐를 추가함에 있어, 하위 계층의 픽쳐에 하나 이상의 필터를 적응적으로 적용한 후 추가함으로써, 상위 계층에서 예측 오차를 줄이고 부호화 효율을 향상시키는 영상의 복호화 방법 및 이를 이용하는 장치를 제공한다.In addition, in the present invention, when adding a picture of a lower layer to the reference picture list of the upper layer, one or more filters are adaptively applied to the picture of the lower layer and then added, thereby reducing prediction error in the upper layer and improving encoding efficiency. A decryption method and an apparatus using the same are provided.

이로써, 부호화 효율을 향상시키며 참조 픽쳐 리스트를 증가하지 않게 하는영상의 복호화 방법 및 이를 이용하는 장치를 제공한다.Accordingly, a method for decoding an image that improves encoding efficiency and does not increase a reference picture list, and an apparatus using the same are provided.

본 발명의 일 실시예인 복수의 계층을 지원하는 영상의 복호화 방법은 복호화 대상이 되는 제2 계층 픽쳐가 참조할 수 있는 제1 계층 픽쳐에 적용될 필터의 종류를 결정하는 필터 종류 결정 단계와; 상기 제1 계층 픽쳐에서 상기 필터가 적용될 적용 대상을 결정하는 적용 대상 결정 단계와; 결정된 필터 종류에 기초하여 필터 적용 대상에 필터링을 수행하는 단계와; 필터링 된 제1 계층 픽쳐를 제2 계층 참조 픽쳐 리스트에 추가하는 단계를 포함할 수 있다. An embodiment of the present invention provides a method for decoding an image supporting a plurality of layers, comprising: a filter type determining step of determining a type of filter to be applied to a first hierarchical picture that can be referenced by a second hierarchical picture to be decoded; an application target determining step of determining an application target to which the filter is to be applied in the first hierarchical picture; performing filtering on a filter application target based on the determined filter type; It may include adding the filtered first layer picture to the second layer reference picture list.

상기 필터 종류 결정 단계는, 소정의 고정된 필터를 상기 제1 계층 픽쳐에 적용하는 것으로 결정할 수 있다.The determining of the filter type may include applying a predetermined fixed filter to the first hierarchical picture.

상기 고정된 필터는 기설정된 필터 계수 세트를 갖는 기본 필터이고, 상기 기본 필터를 적용하는 경우, 정수 위치의 샘플에 대하여 필터링이 수행되지 않을 수 있다.The fixed filter is a basic filter having a preset filter coefficient set, and when the basic filter is applied, filtering may not be performed on samples at integer positions.

상기 고정된 필터는 기설정된 필터 계수 세트를 갖는 대체 필터이고, 상기 대체 필터를 적용하는 경우, 정수 위치의 샘플에 대하여 필터링이 수행될 수 있다. The fixed filter is a replacement filter having a preset filter coefficient set, and when the replacement filter is applied, filtering may be performed on samples at integer positions.

상기 대체 필터는 상기 기본 필터가 적용된 상기 제1 계층 픽쳐에 적용될 수있다. The replacement filter may be applied to the first hierarchical picture to which the basic filter is applied.

상기 필터 종류 결정 단계는, 상기 대체 필터 적용 여부에 대한 플래그 신호를 수신하여 복호화하는 단계를 더 포함할 수 있다. The determining of the filter type may further include receiving and decoding a flag signal indicating whether the replacement filter is applied.

상기 필터 종류 결정 단계는 하나 이상의 필터를 적응적으로 선택하여 제1 계층의 대응 픽쳐에 적용하는 것으로 결정할 수 있다. In the determining of the filter type, it may be determined that one or more filters are adaptively selected and applied to the corresponding picture of the first layer.

상기 필터 종류 결정 단계는, 기설정된 필터 계수 세트를 갖는 기본 필터를 상기 제1 계층 픽쳐에 적용하는 것으로 결정하고, 제1 계층 픽쳐의 정수 위치의 샘플에 대하여 상기 기본 필터와 상이한 대체 필터를 적용할지 여부를 결정하는 단계를 포함하고, 상기 대체 필터의 적용 여부를 결정하는 단계는 상기 대체 필터를 적용할 지 여부를 나타내는 플래그 신호를 수신하여 복호화할 수 있다. In the filter type determining step, it is determined that a basic filter having a preset filter coefficient set is applied to the first hierarchical picture, and a replacement filter different from the basic filter is applied to samples at integer positions of the first hierarchical picture. and determining whether to apply the replacement filter, wherein the determining whether to apply the replacement filter may receive and decode a flag signal indicating whether to apply the replacement filter.

상기 필터 종류 결정 단계는, 기설정된 필터 계수 세트를 갖는 기본 필터를 상기 제1 계층 픽쳐에 적용하는 것으로 결정하고, 제1 계층 픽쳐의 정수 위치의 샘플에 대하여 상기 기본 필터와 상이한 대체 필터를 적용할지 여부를 결정하는 단계를 포함하고, 상기 대체 필터를 적용할지 여부를 결정하는 단계는, 상기 제1 계층 픽쳐에 대하여 소정 크기의 블록 단위 별로 샘플 상관 관계를 계산 하는 단계와; 상기 상관 관계에 기초하여 상기 제1 계층 픽쳐에 상기 대체 필터를 적용할지 여부를 결정하는 단계를 포함할 수 있다. In the filter type determining step, it is determined that a basic filter having a preset filter coefficient set is applied to the first hierarchical picture, and a replacement filter different from the basic filter is applied to samples at integer positions of the first hierarchical picture. and determining whether to apply the replacement filter, wherein the determining of whether to apply the replacement filter includes: calculating a sample correlation for each block unit of a predetermined size with respect to the first hierarchical picture; and determining whether to apply the replacement filter to the first hierarchical picture based on the correlation.

상기 대체 필터를 적용할지 여부를 결정하는 단계는, 상기 블록 단위의 수평 방향 활동성 ?Ч硫痴? 방향 활동성에 기초한 블록 활동성이 소정 문턱값을 이상인 경우 상기 대체 필터를 적용하는 것으로 결정할 수 있다. The step of determining whether to apply the replacement filter may include: horizontal activity in the block unit ?Ч硫痴? When the block activity based on the directional activity exceeds a predetermined threshold, it may be determined to apply the replacement filter.

본 발명의 다른 실시예에 따른 복수의 계층을 지원하는 영상의 복호화 장치는 복호화 대상이 되는 제2 계층 픽쳐가 참조할 수 있는 제1 계층 픽쳐를 복호화하는 제1 계층 복호화부와; 상기 제1 계층 픽쳐에 적용될 필터의 종류를 결정하고, 상기 제1 계층 픽쳐에서 상기 필터가 적용될 적용 대상을 결정하고, 결정된 필터 종류에 기초하여 필터 적용 대상에 필터링을 수행하는 필터링부와; 필터링 된 제1 계층 픽쳐를 제2 계층 참조 픽쳐 리스트에 추가하는 예측부를 포함할 수 있다.An apparatus for decoding an image supporting a plurality of layers according to another embodiment of the present invention includes: a first layer decoder for decoding a first layer picture that can be referenced by a second layer picture to be decoded; a filtering unit that determines a type of filter to be applied to the first hierarchical picture, determines an application target to which the filter is to be applied in the first hierarchical picture, and performs filtering on a filter application target based on the determined filter type; It may include a prediction unit that adds the filtered first layer picture to the second layer reference picture list.

본 발명의 일 실시예에 따르면, 계층간 필터링을 이용한 영상 부호화/복호화 방법 및 그 장치가 제공된다.According to an embodiment of the present invention, a method and apparatus for encoding/decoding an image using inter-layer filtering are provided.

또한, 본 발명의 일 실시예에 따르면, 상위 계층의 참조 픽쳐 리스트에 하위 계층의 픽쳐를 추가함에 있어, 하위 계층의 픽쳐에 하나 이상의 필터를 적응적으로 적용한 후 추가함으로써, 상위 계층에서 예측 오차를 줄이고 부호화 효율을 향상시키는 영상의 복호화 방법 및 이를 이용하는 장치가 제공된다.In addition, according to an embodiment of the present invention, when adding a picture of a lower layer to the reference picture list of the upper layer, one or more filters are adaptively applied to the picture of the lower layer and then added, thereby reducing the prediction error in the upper layer. Provided are a video decoding method and apparatus using the same for reducing and improving encoding efficiency.

이로써, 부호화 효율을 향상시키며 참조 픽쳐 리스트를 증가하지 않게 하는영상의 복호화 방법 및 이를 이용하는 장치가 제공된다.Accordingly, a method for decoding an image that improves encoding efficiency and does not increase a reference picture list, and an apparatus using the same are provided.

도 1은 영상 부호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.
도 2는 영상 복호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.
도 3은 본 발명이 적용될 수 있는, 복수 계층을 이용한 스케일러블 비디오 코딩 구조의 일 실시예를 개략적으로 나타내는 개념도이다.
도 4는 참조 픽쳐 리스트를 구성하는 일 예를 도시한 도면이다.
도 5는 본 발명에 따른 영상의 처리 방법을 설명하기 위한 제어 흐름도이다.
도 6은 본 발명에 따른 다계층 픽쳐를 도시한 도면이다.
도 7은 본 발명에 따른 수평 방향 및 수직 방향 활동성을 연산하기 위한 블록 샘플을 도시한 도면이다.
도 8은 본 발명에 따른 제1 계층 및 제2 계층의 픽쳐의 POC를 도시한 도면이다.
도 9는 본 발명에 따른 참조 픽쳐 리스트를 도시한 도면이다.
도 10은 본 발명의 일 실시예에 따라 P 슬라이스에 대한 참조 픽쳐 리스트를 도시한 도면이다.
도 11은 본 발명의 일 실시예에 따라 B 슬라이스에 대한 참조 픽쳐 리스트를 도시한 도면이다.1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment.
2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment.
3 is a conceptual diagram schematically illustrating an embodiment of a scalable video coding structure using a plurality of layers to which the present invention can be applied.
4 is a diagram illustrating an example of configuring a reference picture list.
5 is a control flowchart illustrating an image processing method according to the present invention.
6 is a diagram illustrating a multi-layered picture according to the present invention.
7 is a diagram illustrating a block sample for calculating horizontal and vertical activity according to the present invention.
8 is a diagram illustrating a POC of a picture of a first layer and a second layer according to the present invention.
9 is a diagram illustrating a reference picture list according to the present invention.
10 is a diagram illustrating a reference picture list for a P slice according to an embodiment of the present invention.
11 is a diagram illustrating a reference picture list for a B slice according to an embodiment of the present invention.

이하, 도면을 참조하여 본 발명의 실시 형태에 대하여 구체적으로 설명한다. 본 명세서의 실시예를 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 명세서의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.EMBODIMENT OF THE INVENTION Hereinafter, embodiment of this invention is described concretely with reference to drawings. In describing the embodiments of the present specification, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present specification, the detailed description thereof will be omitted.

어떤 구성 요소가 다른 구성 요소에 “연결되어” 있다거나 “접속되어” 있다고 언급된 때에는, 그 다른 구성 요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있으나, 중간에 다른 구성 요소가 존재할 수도 있다고 이해되어야 할 것이다. 아울러, 본 발명에서 특정 구성을 “포함”한다고 기술하는 내용은 해당 구성 이외의 구성을 배제하는 것이 아니며, 추가적인 구성이 본 발명의 실시 또는 본 발명의 기술적 사상의 범위에 포함될 수 있음을 의미한다. When it is said that a component is “connected” or “connected to” another component, it may be directly connected or connected to the other component, but it is understood that other components may exist in between. it should be In addition, the description of “including” a specific configuration in the present invention does not exclude configurations other than the corresponding configuration, and it means that additional configurations may be included in the practice of the present invention or the scope of the technical spirit of the present invention.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다.Terms such as first, second, etc. may be used to describe various elements, but the elements should not be limited by the terms. The above terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, a first component may be referred to as a second component, and similarly, a second component may also be referred to as a first component.

또한 본 발명의 실시예에 나타나는 구성부들은 서로 다른 특징적인 기능들을 나타내기 위해 독립적으로 도시되는 것으로, 각 구성부들이 분리된 하드웨어나 하나의 소프트웨어 구성단위로 이루어짐을 의미하지 않는다. 즉, 각 구성부는 설명의 편의상 각각의 구성부로 나열하여 포함한 것으로 각 구성부 중 적어도 두 개의 구성부가 합쳐져 하나의 구성부로 이루어지거나, 하나의 구성부가 복수 개의 구성부로 나뉘어져 기능을 수행할 수 있고 이러한 각 구성부의 통합된 실시예 및 분리된 실시예도 본 발명의 본질에서 벗어나지 않는 한 본 발명의 권리범위에 포함된다.In addition, the components shown in the embodiment of the present invention are shown independently to represent different characteristic functions, and it does not mean that each component is made of separate hardware or a single software component. That is, each component is listed as each component for convenience of description, and at least two components of each component are combined to form one component, or one component can be divided into a plurality of components to perform a function, and each Integrated embodiments and separate embodiments of the components are also included in the scope of the present invention without departing from the essence of the present invention.

또한, 일부의 구성 요소는 본 발명에서 본질적인 기능을 수행하는 필수적인 구성 요소는 아니고 단지 성능을 향상시키기 위한 선택적 구성 요소일 수 있다. 본 발명은 단지 성능 향상을 위해 사용되는 구성 요소를 제외한 본 발명의 본질을 구현하는데 필수적인 구성부만을 포함하여 구현될 수 있고, 단지 성능 향상을 위해 사용되는 선택적 구성 요소를 제외한 필수 구성 요소만을 포함한 구조도 본 발명의 권리범위에 포함된다.In addition, some of the components are not essential components for performing essential functions in the present invention, but may be optional components for merely improving performance. The present invention can be implemented by including only essential components to implement the essence of the present invention, except for components used for performance improvement, and a structure including only essential components excluding optional components used for performance improvement Also included in the scope of the present invention.

도 1은 영상 부호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다. 스케일러블(scalable) 비디오 부호화/복호화 방법 또는 장치는 스케일러빌리티(scalability)를 제공하지 않는 일반적인 영상 부호화/복호화 방법 또는 장치의 확장(extension)에 의해 구현될 수 있으며, 도 1의 블록도는 스케일러블 비디오 부호화 장치의 기초가 될 수 있는 영상 부호화 장치의 일 실시예를 나타낸다.1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment. A scalable video encoding/decoding method or apparatus may be implemented by extension of a general image encoding/decoding method or apparatus that does not provide scalability, and the block diagram of FIG. 1 is scalable. An embodiment of an image encoding apparatus that can be the basis of a video encoding apparatus is shown.

도 1을 참조하면, 상기 영상 부호화 장치(100)는 움직임 예측부(111), 움직임 보상부(112), 인트라 예측부(120), 스위치(115), 감산기(125), 변환부(130), 양자화부(140), 엔트로피 부호화부(150), 역양자화부(160), 역변환부(170), 가산기(175), 필터부(180) 및 참조영상 버퍼(190)를 포함한다.Referring to FIG. 1 , the image encoding apparatus 100 includes a motion prediction unit 111 , a motion compensation unit 112 , an intra prediction unit 120 , a switch 115 , a subtractor 125 , and a transform unit 130 . , a quantizer 140 , an entropy encoder 150 , an inverse quantizer 160 , an inverse transform unit 170 , an adder 175 , a filter unit 180 , and a reference image buffer 190 .

영상 부호화 장치(100)는 입력 영상에 대해 인트라(intra) 모드 또는 인터(inter) 모드로 부호화를 수행하고 비트스트림(bitstream)을 출력할 수 있다. 인트라 예측은 화면 내 예측, 인터 예측은 화면 간 예측을 의미한다. 인트라 모드인 경우 스위치(115)가 인트라로 전환되고, 인터 모드인 경우 스위치(115)가 인터로 전환된다. 영상 부호화 장치(100)는 입력 영상의 입력 블록에 대한 예측 블록을 생성한 후, 입력 블록과 예측 블록의 차분을 부호화할 수 있다.The image encoding apparatus 100 may encode an input image in an intra mode or an inter mode and output a bitstream. Intra prediction refers to intra prediction, and inter prediction refers to inter prediction. In the intra mode, the switch 115 is switched to intra, and in the inter mode, the switch 115 is switched to inter. The image encoding apparatus 100 may generate a prediction block for an input block of an input image, and then encode a difference between the input block and the prediction block.

인트라 모드인 경우, 인트라 예측부(120)는 현재 블록 주변의 이미 부호화된 블록의 화소값을 이용하여 공간적 예측을 수행하여 예측 블록을 생성할 수 있다.In the intra mode, the intra prediction unit 120 may generate a prediction block by performing spatial prediction using pixel values of an already encoded block around the current block.

인터 모드인 경우, 움직임 예측부(111)는, 움직임 예측 과정에서 참조 영상 버퍼(190)에 저장되어 있는 참조 영상에서 입력 블록과 가장 매치가 잘 되는 영역을 찾아 움직임 벡터를 구할 수 있다. 움직임 보상부(112)는 움직임 벡터와 참조 영상 버퍼(190)에 저장되어 있는 참조 영상을 이용하여 움직임 보상을 수행함으로써 예측 블록을 생성할 수 있다. In the inter mode, the motion prediction unit 111 may obtain a motion vector by finding a region that best matches the input block in the reference image stored in the reference image buffer 190 during the motion prediction process. The motion compensator 112 may generate a prediction block by performing motion compensation using a motion vector and a reference image stored in the reference image buffer 190 .

감산기(125)는 입력 블록과 생성된 예측 블록의 차분에 의해 잔여 블록(residual block)을 생성할 수 있다. 변환부(130)는 잔여 블록에 대해 변환(transform)을 수행하여 변환 계수(transform coefficient)를 출력할 수 있다. 그리고 양자화부(140)는 입력된 변환 계수를 양자화 파라미터에 따라 양자화하여 양자화된 계수(quantized coefficient)를 출력할 수 있다. The subtractor 125 may generate a residual block by the difference between the input block and the generated prediction block. The transform unit 130 may perform transform on the residual block to output a transform coefficient. In addition, the quantization unit 140 may quantize the input transform coefficient according to the quantization parameter to output a quantized coefficient.

엔트로피 부호화부(150)는, 양자화부(140)에서 산출된 값들 또는 부호화 과정에서 산출된 부호화 파라미터 값 등을 기초로, 심볼(symbol)을 확률 분포에 따라 엔트로피 부호화하여 비트스트림(bitstream)을 출력할 수 있다. 엔트로피 부호화 방법은 다양한 값을 갖는 심볼을 입력 받아, 통계적 중복성을 제거하면서, 복호 가능한 2진수의 열로 표현하는 방법이다. The entropy encoding unit 150 entropy-encodes a symbol according to a probability distribution based on the values calculated by the quantization unit 140 or an encoding parameter value calculated during the encoding process, and outputs a bitstream. can do. The entropy encoding method is a method of receiving symbols having various values, removing statistical redundancy, and expressing them as a decodable binary sequence.

여기서, 심볼이란 부호화/복호화 대상 구문 요소(syntax element) 및 부호화 파라미터(coding parameter), 잔여 신호(residual signal)의 값 등을 의미한다. 부호화 파라미터는 부호화 및 복호화에 필요한 매개변수로서, 구문 요소와 같이 부호화기에서 부호화되어 복호화기로 전달되는 정보뿐만 아니라, 부호화 혹은 복호화 과정에서 유추될 수 있는 정보를 포함할 수 있으며 영상을 부호화하거나 복호화할 때 필요한 정보를 의미한다. 부호화 파라미터는 예를 들어 인트라/인터 예측모드, 이동/움직임 벡터, 참조 영상 색인, 부호화 블록 패턴, 잔여 신호 유무, 변환 계수, 양자화된 변환 계수, 양자화 파라미터, 블록 크기, 블록 분할 정보 등의 값 또는 통계를 포함할 수 있다. 또한 잔여 신호는 원신호와 예측 신호의 차이를 의미할 수 있고, 또한 원신호와 예측 신호의 차이가 변환(transform)된 형태의 신호 또는 원신호와 예측 신호의 차이가 변환되고 양자화된 형태의 신호를 의미할 수도 있다. 잔여 신호는 블록 단위에서는 잔여 블록이라 할 수 있다.Here, the symbol means a syntax element to be encoded/decoded, a coding parameter, and a value of a residual signal. Encoding parameters are parameters necessary for encoding and decoding, and may include information that is encoded in the encoder and transmitted to the decoder, such as syntax elements, as well as information that can be inferred during the encoding or decoding process. means necessary information. Coding parameters include, for example, intra/inter prediction modes, motion/motion vectors, reference image indexes, coding block patterns, presence or absence of residual signals, transform coefficients, quantized transform coefficients, quantization parameters, block size, block partition information, etc. or It may include statistics. In addition, the residual signal may mean a difference between the original signal and the predicted signal, and a signal in which the difference between the original signal and the predicted signal is transformed or a signal in which the difference between the original signal and the predicted signal is transformed and quantized may mean The residual signal may be referred to as a residual block in block units.

엔트로피 부호화가 적용되는 경우, 높은 발생 확률을 갖는 심볼에 적은 수의 비트가 할당되고 낮은 발생 확률을 갖는 심볼에 많은 수의 비트가 할당되어 심볼이 표현됨으로써, 부호화 대상 심볼들에 대한 비트열의 크기가 감소될 수 있다. 따라서 엔트로피 부호화를 통해서 영상 부호화의 압축 성능이 높아질 수 있다. When entropy encoding is applied, a small number of bits are allocated to a symbol having a high probability of occurrence and a large number of bits are allocated to a symbol having a low probability of occurrence to express the symbol, so that the size of the bit stream for the symbols to be encoded is increased. can be reduced. Accordingly, compression performance of image encoding may be improved through entropy encoding.

엔트로피 부호화를 위해 지수 골룸(exponential golomb), CAVLC(Context-Adaptive Variable Length Coding), CABAC(Context-Adaptive Binary Arithmetic Coding)과 같은 부호화 방법이 사용될 수 있다. 예를 들어, 엔트로피 부호화부(150)에는 가변 길이 부호화(VLC: Variable Lenghth Coding/Code) 테이블과 같은 엔트로피 부호화를 수행하기 위한 테이블이 저장될 수 있고, 엔트로피 부호화부(150)는 저장된 가변 길이 부호화(VLC) 테이블을 사용하여 엔트로피 부호화를 수행할 수 있다. 또한 엔트로피 부호화부(150)는 대상 심볼의 이진화(binarization) 방법 및 대상 심볼/빈(bin)의 확률 모델(probability model)을 도출한 후, 도출된 이진화 방법 또는 확률 모델을 사용하여 엔트로피 부호화를 수행할 수도 있다.For entropy encoding, an encoding method such as exponential golomb, Context-Adaptive Variable Length Coding (CAVLC), or Context-Adaptive Binary Arithmetic Coding (CABAC) may be used. For example, a table for performing entropy encoding, such as a variable length coding (VLC) table, may be stored in the entropy encoder 150, and the entropy encoder 150 may store the stored variable length encoding. Entropy encoding can be performed using a (VLC) table. In addition, the entropy encoder 150 derives a binarization method of a target symbol and a probability model of a target symbol/bin, and then performs entropy encoding using the derived binarization method or probability model. You may.

양자화된 계수는 역양자화부(160)에서 역양자화되고 역변환부(170)에서 역변환될 수 있다. 역양자화, 역변환된 계수는 가산기(175)를 통해 예측 블록과 더해지고 복원 블록이 생성될 수 있다. The quantized coefficient may be inverse quantized by the inverse quantization unit 160 and inversely transformed by the inverse transform unit 170 . The inverse-quantized and inverse-transformed coefficients may be added to the prediction block through the adder 175 and a reconstructed block may be generated.

복원 블록은 필터부(180)를 거치고, 필터부(180)는 디블록킹 필터(deblocking filter), SAO(Sample Adaptive Offset), ALF(Adaptive Loop Filter) 중 적어도 하나 이상을 복원 블록 또는 복원 픽쳐에 적용할 수 있다. 필터부(180)를 거친 복원 블록은 참조 영상 버퍼(190)에 저장될 수 있다.The reconstructed block passes through the filter unit 180, and the filter unit 180 applies at least one of a deblocking filter, a sample adaptive offset (SAO), and an adaptive loop filter (ALF) to the reconstructed block or the reconstructed picture. can do. The reconstructed block passing through the filter unit 180 may be stored in the reference image buffer 190 .

도 2는 영상 복호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다. 도 1에서 상술한 바와 같이 스케일러블 비디오 부호화/복호화 방법 또는 장치는 스케일러빌리티를 제공하지 않는 일반적인 영상 부호화/복호화 방법 또는 장치의 확장에 의해 구현될 수 있으며, 도 2의 블록도는 스케일러블 비디오 복호화 장치의 기초가 될 수 있는 영상 복호화 장치의 일 실시예를 나타낸다.2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment. As described above with reference to FIG. 1 , the scalable video encoding/decoding method or apparatus may be implemented by extension of a general image encoding/decoding method or apparatus that does not provide scalability, and the block diagram of FIG. 2 shows scalable video decoding An embodiment of an image decoding apparatus that can be the basis of the apparatus is shown.

도 2를 참조하면, 상기 영상 복호화 장치(200)는 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 움직임 보상부(250), 필터부(260) 및 참조 영상 버퍼(270)를 포함한다.Referring to FIG. 2 , the image decoding apparatus 200 includes an entropy decoding unit 210 , an inverse quantization unit 220 , an inverse transform unit 230 , an intra prediction unit 240 , a motion compensator 250 , and a filter unit. 260 and a reference image buffer 270 .

영상 복호화 장치(200)는 부호화기에서 출력된 비트스트림을 입력 받아 인트라 모드 또는 인터 모드로 복호화를 수행하고 재구성된 영상, 즉 복원 영상을 출력할 수 있다. 인트라 모드인 경우 스위치가 인트라로 전환되고, 인터 모드인 경우 스위치가 인터로 전환될 수 있다. 영상 복호화 장치(200)는 입력 받은 비트스트림으로부터 복원된 잔여 블록(residual block)을 얻고 예측 블록을 생성한 후 복원된 잔여 블록과 예측 블록을 더하여 재구성된 블록, 즉 복원 블록을 생성할 수 있다.The image decoding apparatus 200 may receive a bitstream output from the encoder, perform decoding in an intra mode or an inter mode, and output a reconstructed image, that is, a reconstructed image. In the case of the intra mode, the switch may be switched to the intra mode, and in the case of the inter mode, the switch may be switched to the inter mode. The image decoding apparatus 200 may generate a reconstructed block, that is, a reconstructed block by obtaining a residual block reconstructed from the received bitstream, generating a prediction block, and adding the reconstructed residual block and the prediction block.

엔트로피 복호화부(210)는, 입력된 비트스트림을 확률 분포에 따라 엔트로피 복호화하여, 양자화된 계수(quantized coefficient) 형태의 심볼을 포함한 심볼들을 생성할 수 있다. 엔트로피 복호화 방법은 2진수의 열을 입력 받아 각 심볼들을 생성하는 방법이다. 엔트로피 복호화 방법은 상술한 엔트로피 부호화 방법과 유사하다.The entropy decoding unit 210 may entropy-decode the input bitstream according to a probability distribution to generate symbols including symbols in the form of quantized coefficients. The entropy decoding method is a method of generating each symbol by receiving a binary string as an input. The entropy decoding method is similar to the entropy encoding method described above.

양자화된 계수는 역양자화부(220)에서 역양자화되고 역변환부(230)에서 역변환되며, 양자화된 계수가 역양자화/역변환 된 결과, 복원된 잔여 블록(residual block)이 생성될 수 있다. The quantized coefficient is inverse quantized by the inverse quantization unit 220 and inverse transformed by the inverse transformation unit 230 , and as a result of inverse quantization/inverse transformation of the quantized coefficient, a reconstructed residual block may be generated.

인트라 모드인 경우, 인트라 예측부(240)는 현재 블록 주변의 이미 부호화된 블록의 화소값을 이용하여 공간적 예측을 수행하여 예측 블록을 생성할 수 있다. 인터 모드인 경우, 움직임 보상부(250)는 움직임 벡터 및 참조 영상 버퍼(270)에 저장되어 있는 참조 영상을 이용하여 움직임 보상을 수행함으로써 예측 블록을 생성할 수 있다. In the intra mode, the intra prediction unit 240 may generate a prediction block by performing spatial prediction using pixel values of an already encoded block around the current block. In the inter mode, the motion compensator 250 may generate a prediction block by performing motion compensation using a motion vector and a reference image stored in the reference image buffer 270 .

복원된 잔여 블록과 예측 블록은 가산기(255)를 통해 더해지고, 더해진 블록은 필터부(260)를 거친다. 필터부(260)는 디블록킹 필터, SAO, ALF 중 적어도 하나 이상을 복원 블록 또는 복원 픽쳐에 적용할 수 있다. 필터부(260)는 재구성된 영상, 즉 복원 영상을 출력한다. 복원 영상은 참조 영상 버퍼(270)에 저장되어 화면 간 예측에 사용될 수 있다.The reconstructed residual block and the prediction block are added through the adder 255 , and the added block passes through the filter unit 260 . The filter unit 260 may apply at least one of a deblocking filter, SAO, and ALF to a reconstructed block or a reconstructed picture. The filter unit 260 outputs a reconstructed image, that is, a reconstructed image. The restored image may be stored in the reference image buffer 270 and used for inter prediction.

상기 영상 복호화 장치(200)에 포함되어 있는 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 움직임 보상부(250), 필터부(260) 및 참조 영상 버퍼(270) 중 영상의 복호화에 직접적으로 관련된 구성요소들, 예컨대, 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 움직임 보상부(250), 필터부(260) 등을 다른 구성요소와 구분하여 복호화부 또는 디코딩부로 표현할 수 있다. The entropy decoding unit 210 , the inverse quantization unit 220 , the inverse transform unit 230 , the intra prediction unit 240 , the motion compensator 250 , and the filter unit 260 included in the image decoding apparatus 200 . and components directly related to image decoding among the reference image buffer 270 , for example, the entropy decoding unit 210 , the inverse quantization unit 220 , the inverse transform unit 230 , the intra prediction unit 240 , and motion compensation. The unit 250 and the filter unit 260 may be distinguished from other components and expressed as a decoder or a decoder.

또한, 영상 복호화 장치(200)는 비트스트림에 포함되어 있는 인코딩된 영상에 관련된 정보를 파싱하는 도시하지 않은 파싱부를 더 포함할 수 있다. 파싱부는 엔트로피 복호화부(210)를 포함할 수도 있고, 엔트로피 복호화부(210)에 포함될 수도 있다. 이러한 파싱부는 또한 디코딩부의 하나의 구성요소로 구현될 수도 있다.Also, the image decoding apparatus 200 may further include a parsing unit (not shown) that parses information related to the encoded image included in the bitstream. The parsing unit may include the entropy decoding unit 210 or may be included in the entropy decoding unit 210 . This parsing unit may also be implemented as one component of the decoding unit.

도 3은 본 발명이 적용될 수 있는, 복수 계층을 이용한 스케일러블 비디오 코딩 구조의 일 실시예를 개략적으로 나타내는 개념도이다. 도 3에서 GOP(Group of Picture)는 픽쳐군 즉, 픽쳐의 그룹을 나타낸다.3 is a conceptual diagram schematically illustrating an embodiment of a scalable video coding structure using a plurality of layers to which the present invention can be applied. In FIG. 3 , a group of pictures (GOP) indicates a picture group, that is, a group of pictures.

영상 데이터를 전송하기 위해서는 전송 매체가 필요하며, 그 성능은 다양한 네트워크 환경에 따라 전송 매체별로 차이가 있다. 이러한 다양한 전송 매체 또는 네트워크 환경에의 적용을 위해 스케일러블 비디오 코딩 방법이 제공될 수 있다.A transmission medium is required to transmit image data, and its performance is different for each transmission medium according to various network environments. A scalable video coding method may be provided for application to such various transmission media or network environments.

스케일러블 비디오 코딩 방법은 계층(layer) 간의 텍스쳐 정보, 움직임 정보, 잔여 신호 등을 활용하여 계층 간 중복성을 제거하여 부호화/복호화 성능을 높이는 코딩 방법이다. 스케일러블 비디오 코딩 방법은, 전송 비트율, 전송 에러율, 시스템 자원 등의 주변 조건에 따라, 공간적, 시간적, 화질적 관점에서 다양한 스케일러빌리티를 제공할 수 있다.The scalable video coding method is a coding method that improves encoding/decoding performance by removing inter-layer redundancy by utilizing texture information, motion information, residual signals, etc. between layers. The scalable video coding method may provide various scalability in terms of spatial, temporal, and picture quality according to ambient conditions such as a transmission bit rate, a transmission error rate, and a system resource.

스케일러블 비디오 코딩은, 다양한 네트워크 상황에 적용 가능한 비트스트림을 제공할 수 있도록, 복수 계층(multiple layers) 구조를 사용하여 수행될 수 있다. 예를 들어 스케일러블 비디오 코딩 구조는, 일반적인 영상 부호화 방법을 이용하여 영상 데이터를 압축하여 처리하는 기본 계층을 포함할 수 있고, 기본 계층의 부호화 정보 및 일반적인 영상 부호화 방법을 함께 사용하여 영상 데이터를 압축 처리하는 향상 계층을 포함할 수 있다.Scalable video coding may be performed using a structure of multiple layers so as to provide a bitstream applicable to various network situations. For example, the scalable video coding structure may include a base layer that compresses and processes image data using a general image encoding method, and compresses image data using both encoding information of the base layer and a general image encoding method. It may include an enhancement layer that handles it.

여기서, 계층(layer)은 공간(spatial, 예를 들어, 영상 크기), 시간(temporal, 예를 들어, 부호화 순서, 영상 출력 순서, 프레임 레이트), 화질, 복잡도 등을 기준으로 구분되는 영상 및 비트스트림(bitstream)의 집합을 의미한다. 또한 기본 계층(Base layer)은 하위 계층 또는 참조 계층, 향상 계층(Enhancement layer)은 상위 계층을 의미할 수 있다. 또한 복수의 계층들은 서로 간에 종속성을 가질 수도 있다.Here, a layer is an image and bit classified based on spatial (eg, image size), temporal (eg, encoding order, image output order, frame rate), image quality, complexity, etc. It means a set of streams (bitstream). In addition, a base layer may mean a lower layer or a reference layer, and an enhancement layer may mean an upper layer. Also, the plurality of layers may have dependencies on each other.

도 3을 참조하면, 예를 들어 기본 계층은 SD(standard definition), 15Hz의 프레임율, 1Mbps 비트율로 정의될 수 있고, 제1 향상 계층은 HD(high definition), 30Hz의 프레임율, 3.9Mbps 비트율로 정의될 수 있으며, 제2 향상 계층은 4K-UHD(ultra high definition), 60Hz의 프레임율, 27.2Mbps 비트율로 정의될 수 있다. 상기 포맷(format), 프레임율, 비트율 등은 하나의 실시예로서, 필요에 따라 달리 정해질 수 있다. 또한 사용되는 계층의 수도 본 실시예에 한정되지 않고 상황에 따라 달리 정해질 수 있다. Referring to FIG. 3 , for example, the base layer may be defined as SD (standard definition), a frame rate of 15 Hz, and a bit rate of 1 Mbps, and the first enhancement layer is HD (high definition), a frame rate of 30 Hz, and a bit rate of 3.9 Mbps. may be defined, and the second enhancement layer may be defined as 4K-UHD (ultra high definition), a frame rate of 60 Hz, and a bit rate of 27.2 Mbps. The format, frame rate, bit rate, etc. are one embodiment and may be differently determined as needed. Also, the number of layers to be used is not limited to the present embodiment and may be determined differently according to circumstances.

예를 들어, 전송 대역폭이 4Mbps라면 상기 제1향상계층 HD의 프레임 레이트를 줄여서 15Hz이하로 전송할 수 있다. 스케일러블 비디오 코딩 방법은 상기 도 3의 실시예에서 상술한 방법에 의해 시간적, 공간적, 화질적 스케일러빌리티를 제공할 수 있다.For example, if the transmission bandwidth is 4 Mbps, the frame rate of the first enhancement layer HD may be reduced to transmit at 15 Hz or less. The scalable video coding method may provide temporal, spatial, and picture quality scalability by the method described above in the embodiment of FIG. 3 .

스케일러블 비디오 코딩은 이하 부호화 관점에서는 스케일러블 비디오 부호화, 복호화 관점에서는 스케일러블 비디오 복호화와 동일한 의미를 가진다.Scalable video coding has the same meaning as scalable video coding in terms of encoding and scalable video decoding in terms of decoding.

본 발명은 복수의 계층(layer) 또는 시점(view)을 포함하는 영상의 부, 복호화(en-/de-coding) 과정에 관한 것으로 상기 복수의 계층 또는 시점은 제1, 제2, 제3, 제n 계층 또는 시점으로 표현할 수 있다. 이하 설명에서는 제1 계층과 제2 계층이 존재하는 픽쳐를 예로 기술하며, 그 이상의 계층 또는 시점에도 동일한 방법으로 적용할 수 있다. 또한, 상기 제1 계층은 하위(base) 계층, 제2 계층은 상위(upper) 계층으로 표현할 수도 있다. 또한, 상기 제1 계층은 참조(reference) 계층, 제 2 계층은 향상(enhancement) 계층으로 표현될 수 있다. The present invention relates to a process of encoding and decoding an image including a plurality of layers or views, wherein the plurality of layers or views are first, second, third, It can be expressed as an n-th layer or a viewpoint. In the following description, a picture in which the first layer and the second layer exist is described as an example, and the same method can be applied to layers or viewpoints higher than that. Also, the first layer may be expressed as a base layer and the second layer may be expressed as an upper layer. Also, the first layer may be expressed as a reference layer, and the second layer may be expressed as an enhancement layer.

제2 계층의 픽쳐/블록에 대응하는 제1 계층의 픽쳐/블록은 상기 제2 계층 픽쳐/블록의 크기에 맞게 변경할 수 있다. 즉, 제1 계층의 픽쳐/블록의 크기가 제2 계층의 픽쳐/블록 보다 작은 경우 업샘플링(up-sampling), re-sampling 등의 방법을 이용하여 스케일링(scaling)할 수 있다.The picture/block of the first layer corresponding to the picture/block of the second layer may be changed according to the size of the picture/block of the second layer. That is, when the size of the picture/block of the first layer is smaller than the picture/block of the second layer, scaling may be performed using a method such as up-sampling or re-sampling.

또한, 제1 계층의 픽쳐는 상기 제2 계층의 참조 픽쳐 리스트(reference picture list)에 추가하여 제2 계층의 영상 부/복호화에 이용될 수 있다. 이때, 제2 계층은 통상의 화면간 예측처럼 참조 픽쳐 리스트에 있는 제1 계층 영상을 이용하여 예측 및 부/복호화를 수행할 수 있다.Also, the picture of the first layer may be added to the reference picture list of the second layer and used for image encoding/decoding of the second layer. In this case, the second layer may perform prediction and encoding/decoding by using the first layer image in the reference picture list like normal inter prediction.

부/복호화를 위한 블록의 크기는 4x4, 8x8, 16x16, 32x32, 64x64 등 NxN 형태의 정사각형 또는 4x8, 16x8, 8x32 등의 NxM 형태의 직사각형 일 수 있으며, 블록의 단위는 부호화 블록(CB: Coding Block), 예측 블록(PB: Prediction Block), 변환 블록(TB: Transform Block) 중 적어도 하나 일 수 있으며 각각 서로 다른 크기를 가질 수 있다.The size of the block for encoding/decoding may be an NxN-shaped square such as 4x4, 8x8, 16x16, 32x32, 64x64, or an NxM-shaped rectangle such as 4x8, 16x8, 8x32, etc., and the unit of the block is a Coding Block (CB). ), a prediction block (PB), and a transform block (TB), and each may have a different size.

이하에서, 스케일러블 비디오, 즉 다계층 구조를 사용하는 영상의 부호화 및 복호화 방법 중 상위 계층의 부호화 및 복호화 대상이 되는 블록(이하 현재 블록 또는 대상 블록이라고 칭함)의 예측 블록, 즉 예측 신호를 생성하는 방법에 대하여 살펴본다. 이하 발명의 내용(방법 또는 장치)은 통상적으로 부호화기와 복호화기에 동일하게 적용될 수 있다.Hereinafter, a prediction block of a block (hereinafter referred to as a current block or a target block) that is a target for encoding and decoding of a higher layer among scalable video, that is, an image encoding and decoding method using a multi-layer structure, that is, a prediction signal is generated Let's see how to do it. The following content (method or apparatus) of the present invention can be equally applied to an encoder and a decoder in general.

통상적으로 인터 예측은 현재 픽쳐의 이전 픽쳐 또는 이후 픽쳐 중 적어도 하나를 참조 픽쳐로 하고, 참조 픽쳐를 기반으로 현재 블록에 대한 예측을 수행할 수 있다. 현재 블록의 예측에 이용되는 픽쳐를 참조 픽쳐(reference picture) 또는 참조 프레임(reference frame)이라고 한다. In general, inter prediction may use at least one of a picture before or after a picture of the current picture as a reference picture, and may perform prediction on the current block based on the reference picture. A picture used for prediction of the current block is called a reference picture or a reference frame.

참조 픽쳐는 참조 픽쳐 인덱스(refIdx)에 의하여 특정되고, 움직임 벡터(motion vector)를 통하여 참조 픽쳐 내의 소정 영역이 참조 블록으로 특정된다. A reference picture is specified by a reference picture index refIdx, and a predetermined region in the reference picture is specified as a reference block through a motion vector.

인터 예측은 참조 픽쳐 및 참조 픽쳐 내에서 현재 블록에 대응하는 참조 블록을 선택해서, 현재 블록에 대한 예측 블록을 생성할 수 있다. Inter prediction may generate a prediction block for the current block by selecting a reference picture and a reference block corresponding to the current block in the reference picture.

화면 간 예측에서 부호화 장치 및 복호화 장치는 현재 블록의 움직임 정보를 도출한 후, 도출된 움직임 정보에 기반하여 화면 간 예측 및/또는 움직임 보상을 수행할 수 있다. 이 때, 부호화 장치 및 복호화 장치는 복원된 주변 블록(neighboring block) 및/또는 이미 복원된 콜(col) 픽쳐(collocated picture) 내에서 현재 블록에 대응되는 콜(col) 블록(collocated block)의 움직임 정보를 이용함으로써, 부호화/복호화 효율을 향상시킬 수 있다. In inter prediction, the encoding apparatus and the decoding apparatus may derive motion information of the current block and then perform inter prediction and/or motion compensation based on the derived motion information. In this case, the encoding apparatus and the decoding apparatus move a collocated block corresponding to the current block in a reconstructed neighboring block and/or an already reconstructed collocated picture. By using the information, encoding/decoding efficiency can be improved.

여기서, 복원된 주변 블록은 이미 부호화 및/또는 복호화되어 복원된 현재 픽쳐 내의 블록으로서, 현재 블록에 인접한 블록 및/또는 현재 블록의 외부 코너에 위치한 블록을 포함할 수 있다. 또한 부호화 장치 및 복호화 장치는, 콜 픽쳐 내에서 현재 블록과 공간적으로 대응되는 위치에 존재하는 블록을 기준으로 소정의 상대적인 위치를 결정할 수 있고, 상기 결정된 소정의 상대적인 위치(상기 현재 블록과 공간적으로 대응되는 위치에 존재하는 블록의 내부 및/또는 외부의 위치)를 기반으로 상기 콜 블록을 도출할 수 있다. 여기서, 일례로 콜 픽쳐는 참조 픽쳐 리스트에 포함된 참조 픽쳐 중에서 하나의 픽쳐에 해당될 수 있다.Here, the reconstructed neighboring block is a block in the reconstructed current picture that has already been encoded and/or decoded, and may include a block adjacent to the current block and/or a block located at an outer corner of the current block. Also, the encoding apparatus and the decoding apparatus may determine a predetermined relative position based on a block existing at a position spatially corresponding to the current block in the collocated picture, and the determined relative position (corresponding spatially to the current block) It is possible to derive the collocated block based on the position inside and/or outside the block existing at the position where the block is located. Here, as an example, the collocated picture may correspond to one picture among the reference pictures included in the reference picture list.

인터 예측은 현재 블록과의 레지듀얼(residual) 신호가 최소화되며 움직임 벡터 크기 역시 최소가 되도록 예측 블록을 생성할 수 있다.Inter prediction may generate a prediction block such that a residual signal with a current block is minimized and a motion vector size is also minimized.

한편, 움직임 정보 도출 방식은 현재 블록의 예측 모드에 따라 달라질 수 있다. 인터 예측을 위해 적용되는 예측 모드에는 AMVP(Advanced Motion Vector Predictor), 머지(merge) 등이 있을 수 있다.Meanwhile, the method of deriving motion information may vary depending on the prediction mode of the current block. Prediction modes applied for inter prediction may include Advanced Motion Vector Predictor (AMVP), merge, and the like.

일례로, AMVP(Advanced Motion Vector Predictor)가 적용되는 경우, 부호화 장치 및 복호화 장치는 복원된 주변 블록의 움직임 벡터 및/또는 콜 블록의 움직임 벡터를 이용하여, 움직임 벡터 후보 리스트를 생성할 수 있다. 즉, 복원된 주변 블록의 움직임 벡터 및/또는 콜 블록의 움직임 벡터는 움직임 벡터 후보로 사용될 수 있다. 부호화 장치는 상기 리스트에 포함된 움직임 벡터 후보 중에서 선택된 최적의 움직임 벡터를 지시하는 예측 움직임 벡터 인덱스를 복호화 장치로 전송할 수 있다. 이 때, 복호화 장치는 상기 움직임 벡터 인덱스를 이용하여, 움직임 벡터 후보 리스트에 포함된 움직임 벡터 후보 중에서, 현재 블록의 예측 움직임 벡터를 선택할 수 있다.For example, when Advanced Motion Vector Predictor (AMVP) is applied, the encoding apparatus and the decoding apparatus may generate a motion vector candidate list by using a motion vector of a reconstructed neighboring block and/or a motion vector of a collocated block. That is, the motion vector of the reconstructed neighboring block and/or the motion vector of the collocated block may be used as a motion vector candidate. The encoding apparatus may transmit a predicted motion vector index indicating an optimal motion vector selected from among the motion vector candidates included in the list to the decoding apparatus. In this case, the decoding apparatus may select a predicted motion vector of the current block from among motion vector candidates included in the motion vector candidate list by using the motion vector index.

부호화 장치는 현재 블록의 움직임 벡터와 예측 움직임 벡터 간의 움직임 벡터 차분(MVD: Motion Vector Difference)을 구할 수 있고, 이를 부호화하여 복호화 장치로 전송할 수 있다. 이 때, 복호화 장치는 수신된 움직임 벡터 차분을 복호화할 수 있고, 복호화된 움직임 벡터 차분과 예측 움직임 벡터의 합을 통해 현재 블록의 움직임 벡터를 도출할 수 있다.The encoding apparatus may obtain a motion vector difference (MVD) between the motion vector of the current block and the predicted motion vector, encode it, and transmit it to the decoding apparatus. In this case, the decoding apparatus may decode the received motion vector difference, and may derive the motion vector of the current block through the sum of the decoded motion vector difference and the predicted motion vector.

부호화 장치는 또한 참조 픽쳐를 지시하는 참조 픽쳐 인덱스 등을 복호화 장치에 전송할 수 있다. The encoding apparatus may also transmit a reference picture index indicating a reference picture to the decoding apparatus.

복호화 장치는 주변 블록의 움직임 정보들을 이용하여 현재 블록의 움직임 벡터를 예측하고, 부호화 장치로부터 수신한 레지듀얼을 이용하여 현재 블록에 대한 움직임 벡터를 유도할 수 있다. 복호화 장치는 유도한 움직임 벡터와 부호화 장치로부터 수신한 참조 픽쳐 인덱스 정보를 기반으로 현재 블록에 대한 예측 블록을 생성할 수 있다.The decoding apparatus may predict the motion vector of the current block by using motion information of the neighboring block, and may derive the motion vector of the current block by using the residual received from the encoding apparatus. The decoding apparatus may generate a prediction block for the current block based on the derived motion vector and reference picture index information received from the encoding apparatus.

다른 예로, 머지(merge)가 적용되는 경우, 부호화 장치 및 복호화 장치는 복원된 주변 블록의 움직임 정보 및/또는 콜 블록의 움직임 정보를 이용하여, 머지 후보 리스트를 생성할 수 있다. 즉, 부호화 장치 및 복호화 장치는 복원된 주변 블록 및/또는 콜 블록의 움직임 정보가 존재하는 경우, 이를 현재 블록에 대한 머지 후보로 사용할 수 있다. As another example, when merge is applied, the encoding apparatus and the decoding apparatus may generate a merge candidate list by using motion information of a reconstructed neighboring block and/or motion information of a collocated block. That is, when motion information of the reconstructed neighboring block and/or collocated block exists, the encoding apparatus and the decoding apparatus may use it as a merge candidate for the current block.

부호화 장치는 머지 후보 리스트에 포함된 머지 후보 중에서 최적의 부호화 효율을 제공할 수 있는 머지 후보를 현재 블록에 대한 움직임 정보로 선택할 수 있다. 이 때, 상기 선택된 머지 후보를 지시하는 머지 인덱스가 비트스트림에 포함되어 복호화 장치로 전송될 수 있다. 복호화 장치는 상기 전송된 머지 인덱스를 이용하여, 머지 후보 리스트에 포함된 머지 후보 중에서 하나를 선택할 수 있으며, 상기 선택된 머지 후보를 현재 블록의 움직임 정보로 결정할 수 있다. 따라서, 머지 모드가 적용되는 경우, 복원된 주변 블록 및/또는 콜 블록의 움직임 정보가 현재 블록의 움직임 정보로 그대로 사용될 수 있다. 복호화 장치는 예측 블록과 부호화 장치로부터 전송되는 레지듀얼을 더하여 현재 블록을 복원할 수 있다.The encoding apparatus may select a merge candidate capable of providing optimal encoding efficiency from among merge candidates included in the merge candidate list as motion information for the current block. In this case, the merge index indicating the selected merge candidate may be included in the bitstream and transmitted to the decoding apparatus. The decoding apparatus may select one of the merge candidates included in the merge candidate list by using the transmitted merge index, and may determine the selected merge candidate as motion information of the current block. Accordingly, when the merge mode is applied, the motion information of the reconstructed neighboring block and/or the collocated block may be used as it is as the motion information of the current block. The decoding apparatus may reconstruct the current block by adding the prediction block and the residual transmitted from the encoding apparatus.

상술한 AMVP 및 머지 모드에서는, 현재 블록의 움직임 정보를 도출하기 위해, 복원된 주변 블록의 움직임 정보 및/또는 콜 블록의 움직임 정보가 사용될 수 있다.In the aforementioned AMVP and merge modes, in order to derive the motion information of the current block, motion information of a reconstructed neighboring block and/or motion information of a collocated block may be used.

화면 간 예측에 이용되는 다른 모드 중 하나 인 스킵 모드의 경우에, 주변 블록의 정보를 그대로 현재 블록에 이용할 수 있다. 따라서 스킵 모드의 경우에, 부호화 장치는 현재 블록의 움직임 정보로서 어떤 블록의 움직임 정보를 이용할 것인지를 지시하는 정보 외에 레지듀얼 등과 같은 신택스 정보를 복호화 장치에 전송하지 않는다. In the case of the skip mode, which is one of the other modes used for inter prediction, information on neighboring blocks may be used for the current block as it is. Accordingly, in the skip mode, the encoding apparatus does not transmit syntax information such as residuals to the decoding apparatus other than information indicating which block to use as the motion information of the current block.

부호화 장치 및 복호화 장치는 상기 도출된 움직임 정보에 기반하여 현재 블록에 대한 움직임 보상을 수행함으로써, 현재 블록의 예측 블록을 생성할 수 있다. 여기서, 예측 블록은 현재 블록에 대한 움직임 보상 수행 결과 생성된, 움직임 보상된 블록을 의미할 수 있다. 또한, 복수의 움직임 보상된 블록은 하나의 움직임 보상된 픽쳐를 구성할 수 있다.The encoding apparatus and the decoding apparatus may generate a prediction block of the current block by performing motion compensation on the current block based on the derived motion information. Here, the prediction block may mean a motion-compensated block generated as a result of performing motion compensation on the current block. Also, a plurality of motion-compensated blocks may constitute one motion-compensated picture.

복호화 장치는 현재 블록의 인터 예측에 필요한 움직임 정보, 예컨대 움직임 벡터, 참조 픽쳐 인덱스 등에 관한 정보를 부호화 장치로부터 수신한 스킵 플래그, 머지 플래그 등을 확인하고 이에 대응하여 유도할 수 있다. The decoding apparatus may check a skip flag, a merge flag, etc. received from the encoding apparatus for motion information necessary for inter prediction of the current block, for example, a motion vector, a reference picture index, and the like, and derive it correspondingly.

예측이 수행되는 처리 단위와 예측 방법 및 구체적인 내용이 정해지는 처리 단위는 서로 다를 수 있다. 예컨대, 예측 블록 단위로 예측모드가 정해져서 변환 블록 단위로 예측이 수행될 수도 있고, 예측 블록 단위로 예측 모드가 정해지고 변환 블록 단위로 화면 내 예측이 수행될 수 도 있다.A processing unit in which prediction is performed and a processing unit in which a prediction method and specific content are determined may be different from each other. For example, a prediction mode may be determined in units of prediction blocks and prediction may be performed in units of transform blocks, or a prediction mode may be determined in units of prediction blocks and intra prediction may be performed in units of transform blocks.

현재 픽쳐보다 이전에 인코딩/디코딩된 픽쳐들은 메모리(예컨대, Decoded Picture Buffer: DPB)에 저장되어 현재 블록(현재 픽쳐)의 예측에 이용될 수 있다. 현재 블록의 인터 예측에 이용 가능한 픽쳐들의 리스트는 참조 픽쳐 리스트로 유지된다. Pictures encoded/decoded before the current picture may be stored in a memory (eg, Decoded Picture Buffer: DPB) and used for prediction of a current block (current picture). A list of pictures available for inter prediction of the current block is maintained as a reference picture list.

P 슬라이스는 인트라 예측 또는 최대 하나의 움직임 벡터와 하나의 참조 픽쳐를 이용하는 인터 예측을 통해 디코딩되는 슬라이스이다. B 슬라이스는 인트라 예측 또는 최대 두 개의 움직임 벡터와 두 개의 참조 픽쳐를 이용하는 인터 예측을 통해 디코딩되는 슬라이스이다. 이때, 참조 픽쳐는 단기 참조 픽쳐(short term reference picture)와 장기 참조 픽쳐(long term reference picture)를 포함한다. 픽쳐는 표시 순서를 나타내는 POC(Picture Order Count)로 특정될 수 있으며, 단기 참조 픽쳐들은 현재 픽쳐와 POC 차이가 크지 않은 픽쳐들 일 수 있고, 장기 참조 픽쳐들은 현재 픽쳐와 POC 차이가 큰 픽쳐들 일 수 있다.A P slice is a slice decoded through intra prediction or inter prediction using at most one motion vector and one reference picture. A B slice is a slice decoded through intra prediction or inter prediction using up to two motion vectors and two reference pictures. In this case, the reference picture includes a short term reference picture and a long term reference picture. A picture may be specified with a picture order count (POC) indicating a display order, short-term reference pictures may be pictures with a small POC difference from the current picture, and long-term reference pictures may be pictures with a large POC difference from the current picture. can

참조 픽쳐 리스트 0 (reference picture list 0, 이하 설명의 편의를 위해 ‘L0’이라 함)는 P 슬라이스 또는 B 슬라이스의 인터 예측에 이용되는 참조 픽쳐 리스트이다. 참조 픽쳐 리스트 1 (reference picture list 1, 이하 설명의 편의를 위해 ‘L1’이라 함)은 B 슬라이스의 인터 예측을 위해 이용된다. 따라서, 단방향 예측을 수행하는 P 슬라이스의 블록에 대한 인터 예측에는 L0이 이용되며, 양방향 예측을 수행하는 B 슬라이스의 블록에 대한 인터 예측에는 L0과 L1이 이용된다. Reference picture list 0 (reference picture list 0, hereinafter referred to as 'L0' for convenience of description) is a reference picture list used for inter prediction of a P slice or a B slice. Reference picture list 1 (reference picture list 1, hereinafter referred to as 'L1' for convenience of description) is used for inter prediction of the B slice. Accordingly, L0 is used for inter prediction of a block of a P slice in which unidirectional prediction is performed, and L0 and L1 are used in inter prediction of a block of a B slice in which bidirectional prediction is performed.

복호화 장치는 인터 예측을 통해 P 슬라이스와 B 슬라이스에 대한 디코딩을 수행하는 경우에, 참조 픽쳐 리스트를 구성(construct)한다. 인터 예측에 이용되는 참조 픽쳐는 참조 픽쳐 리스트를 통해 지정된다. 참조 픽쳐 인덱스는 참조 픽쳐 리스트 상의 참조 픽쳐를 지시하는 인덱스이다.The decoding apparatus constructs a reference picture list when decoding the P slice and the B slice through inter prediction. A reference picture used for inter prediction is designated through a reference picture list. The reference picture index is an index indicating a reference picture on the reference picture list.

참조 픽쳐 리스트는 부호화 장치로부터 전송되는 참조 픽쳐 세트(reference picture set)을 기반으로 구성될 수 있다. 참조 픽쳐 세트는 참조 픽쳐로 사용되는 픽쳐의 POC와 해당 픽쳐가 직접적으로 참조되는지 여부를 알려주는 플래그(used_by_curr_pic_s0_flag)로 구성될 수 있다. 참조 픽쳐 리스트를 구성하는 참조 픽쳐들은 메모리(예컨대, DPB)에 저장될 수 있다. 메모리에 저장되는 픽쳐들(현재 픽쳐 이전에 인코딩/디코딩된 픽쳐들)은 부호화 장치와 복호화 장치에 의해 관리된다.The reference picture list may be constructed based on a reference picture set transmitted from the encoding apparatus. The reference picture set may include a POC of a picture used as a reference picture and a flag (used_by_curr_pic_s0_flag) indicating whether the corresponding picture is directly referenced. Reference pictures constituting the reference picture list may be stored in a memory (eg, DPB). Pictures stored in the memory (pictures encoded/decoded before the current picture) are managed by the encoding apparatus and the decoding apparatus.

참조 픽쳐 세트는 단기 참조 픽쳐를 포함하는 단기 참조 픽쳐 세트와 장기 참조 픽쳐를 포함하는 장기 참조 픽쳐 세트를 포함할 수 있으며, 이러한 단기 참조 픽쳐 세트와 장기 참조 픽쳐 세트에 기초하여 초기 참조 픽쳐 리스트가 구성될 수있다. The reference picture set may include a short-term reference picture set including a short-term reference picture and a long-term reference picture set including a long-term reference picture, and an initial reference picture list is constructed based on the short-term reference picture set and the long-term reference picture set. can be

도 4는 참조 픽쳐 리스트를 구성하는 일 예를 도시한 도면이다. 4 is a diagram illustrating an example of configuring a reference picture list.

참조 픽쳐들은 현재 픽쳐를 기준으로 현재 픽쳐(Curr)의 POC보다 작은 참조 픽쳐들(Ref 1, Ref 2)로 구성되는 제 1 단기 참조 픽쳐 세트(RefPicSetStCurr0)와 현재 픽쳐의 POC보다 큰 참조 픽쳐들(Ref 3, Ref 4)로 구성되는 제2 단기 참조 픽쳐 세트(RefPicSetStCurr1) 및 장기 참조 픽쳐들(Ref LT1, Ref LT2)로 구성되는 장기 참조 픽쳐 세트(RefPicSetLtCurr)로 분류될 수 있다. The reference pictures are a first short-term reference picture set (RefPicSetStCurr0) consisting of reference pictures (Ref 1, Ref 2) smaller than the POC of the current picture (Curr) based on the current picture and reference pictures larger than the POC of the current picture (RefPicSetStCurr0) ( It may be classified into a second short-term reference picture set (RefPicSetStCurr1) composed of Ref 3 and Ref 4) and a long-term reference picture set (RefPicSetLtCurr) composed of long-term reference pictures (Ref LT1 and Ref LT2).

이 때, 제 1 단기 참조 픽쳐 세트(RefPicSetStCurr0)는 used_by_curr_pic_s0_flag 값이 1인 픽쳐들로 구성되고(delta_poc_s0 with used_by_curr_pic_s0_flag=1), 제 2 단기 참조 픽쳐 세트 (RefPicSetStCurr1) 역시 used_by_curr_pic_s1_flag 값이 1인 픽쳐들로 구성된다(delta_poc_s1 with used_by_curr_pic_s1_flag=1).At this time, the first short-term reference picture set (RefPicSetStCurr0) is composed of pictures having a used_by_curr_pic_s0_flag value of 1 (delta_poc_s0 with used_by_curr_pic_s0_flag=1), and the second short-term reference picture set (RefPicSetStCurr1) is also composed of pictures with a used_by_curr_pic_s0_flag value of 1 (RefPicSetStCurr1). (delta_poc_s1 with used_by_curr_pic_s1_flag=1).

이렇게 성질을 달리하는 참조 픽쳐 세트들의 집합으로 초기 참조 픽쳐 리스트가 구성될수 있다. An initial reference picture list may be constituted by a set of reference picture sets having different properties.

참조 픽쳐 리스트 0, 즉 L0은 도 4에 도시된 바와 같이, 제 1 단기 참조 픽쳐 세트(RefPicSetStCurr0), 제 2 단기 참조 픽쳐 세트(RefPicSetStCurr1) 및 장기 참조 픽쳐 세트(RefPicSetLtCurr) 순으로 구성된다. As shown in FIG. 4 , the reference picture list 0, that is, L0, consists of a first short-term reference picture set (RefPicSetStCurr0), a second short-term reference picture set (RefPicSetStCurr1), and a long-term reference picture set (RefPicSetLtCurr) in this order.

한편, 참조 픽쳐 리스트 1, 즉 L1은, 제 2 단기 참조 픽쳐 세트(RefPicSetStCurr1), 제 1 단기 참조 픽쳐 세트(RefPicSetStCurr0) 및 장기 참조 픽쳐 세트(RefPicSetLtCurr) 순으로 구성된다. On the other hand, reference picture list 1, that is, L1, consists of a second short-term reference picture set (RefPicSetStCurr1), a first short-term reference picture set (RefPicSetStCurr0), and a long-term reference picture set (RefPicSetLtCurr) in this order.

참조 픽쳐 리스트에 포함될 수 있는 참조 픽쳐의 개수는 부호화 장치로부터 전송되는 정보에 기반하여 결정될 수 있다. 예컨대, 부호화 장치는 참조 픽쳐 리스트를 구성한 후, 사용할 참조 픽쳐들의 개수를 결정하고, 사용할 참조 픽초들의 개수에 관한 정보(예컨대, num_ref_idx_lX_default_active_minus1, X=0 또는 1)를 시퀀스 파라미터 세트(Sequence Parameter Set: SPS)의 구문 요소로서 복호화 장치에 전송할 수 있다. 복호화 장치는 수신한 num_ref_idx_lX_default_active_minus1 에 1을 더한 값으로 특정되는 참조 픽쳐들의 개수를 현재 시퀀스 내에서 디폴트 값으로 사용될 수 있다.The number of reference pictures that may be included in the reference picture list may be determined based on information transmitted from the encoding apparatus. For example, after constructing the reference picture list, the encoding apparatus determines the number of reference pictures to be used, and provides information on the number of reference pictures to be used (eg, num_ref_idx_lX_default_active_minus1, X=0 or 1) as a sequence parameter set (SPS). ) as a syntax element and can be transmitted to the decoding device. The decoding apparatus may use the number of reference pictures specified by adding 1 to the received num_ref_idx_lX_default_active_minus1 as a default value in the current sequence.

또한, 부호화 장치는 픽쳐 또는 슬라이스별로 참조 픽쳐들의 개수를 지정하고자 하는 경우에 참조 픽쳐들의 개수를 지시하는 별도의 정보(예컨대, num_ref_idx_l1_active_minus1, X=0 또는 1)를 픽쳐 파라미터 세트(Picture Parameter Set: PPS) 또는 슬라이스 헤더 등을 통해서 전송할 수 있다. 복호화 장치는 수신한 num_ref_idx_l1_active_minus1 에 1을 더한 값으로 특정되는 값을 현재 픽쳐 또는 현재 슬라이스에 대한 참조 픽쳐들의 개수로서 적용할 수 있다.Also, when the encoding apparatus intends to designate the number of reference pictures for each picture or slice, separate information indicating the number of reference pictures (eg, num_ref_idx_11_active_minus1, X=0 or 1) is provided as a Picture Parameter Set (PPS). ) or through a slice header. The decoding apparatus may apply a value specified by adding 1 to the received num_ref_idx_11_active_minus1 as the number of reference pictures for the current picture or current slice.

인터 예측 수행 시, 상기와 같이 구성된 참조 픽쳐 리스트 내 특정된 참조 픽쳐를 이용하여 움직임 보상이 수행될 수 있다.When performing inter prediction, motion compensation may be performed using a reference picture specified in the reference picture list configured as described above.

공간적 스케일러빌러티 또는 다시점 스케일러빌러티를 제공하는 다계층 구조에서 상위 계층의 참조 픽쳐는 같은 계층에 대한 참조 픽쳐들과 계층간 참조 픽쳐로 구성될 수 있다. In a multi-layer structure that provides spatial scalability or multi-view scalability, a reference picture of a higher layer may include reference pictures of the same layer and an inter-layer reference picture.

이 경우, 계층간 참조 픽쳐에 대한 시그널링은 계층을 식별하는 정보와 참조 픽쳐를 식별하는 정보를 통하여 수행될 수 있다. 예컨대, i 번째 레이어의 현재 픽쳐와 동일한 억세스 유닛(access usnit) 내에 존재하고, NAL(Network Abstraction Layer) 유닛 헤더에 포함되어 전송되는 j 번째 레이어 식별자(i는 j보다 큼)인 nuh_layer_id 값이 현재 픽쳐에 대한 RefPiclayerId 값과 동일하면, 해당 픽쳐는 현재 픽쳐에 대한 참조 픽쳐로 사용되는 것으로 판단할 수 있다. 계층간 참조 픽쳐는 장기 참조 픽쳐로 표시할 수 있다. In this case, signaling of the inter-layer reference picture may be performed through information identifying the layer and information identifying the reference picture. For example, the nuh_layer_id value, which is the j-th layer identifier (i is greater than j), which exists in the same access unit as the current picture of the i-th layer and is transmitted by being included in a Network Abstraction Layer (NAL) unit header, is the current picture. If it is the same as the RefPiclayerId value for , it may be determined that the corresponding picture is used as a reference picture for the current picture. An inter-layer reference picture may be displayed as a long-term reference picture.

RefPicLayerId는 슬라이스 헤더에 포함되는 구문요소inter_layer_pred_layer_idc로 시그널링 될 수 있는 값으로 현재 계층이 계층간 예측에 참조하는 계층을 의미한다.RefPicLayerId is a value that can be signaled as a syntax element inter_layer_pred_layer_idc included in the slice header, and means a layer that the current layer refers to for inter-layer prediction.

한편, 상위 계층의 참조 픽쳐 리스트에 하위 계층의 픽쳐를 추가할 때, 하위 계층의 픽쳐에 하나의 고정된 필터를 적용한 후 상위 계층 참조 픽쳐 리스트에 추가할 수 있다. 다만, 이 경우 부호화 효율이 저하되는 문제점이 발생할 수 있다. Meanwhile, when adding a picture of a lower layer to the reference picture list of the upper layer, one fixed filter may be applied to the picture of the lower layer and then added to the reference picture list of the upper layer. However, in this case, there may be a problem in that encoding efficiency is lowered.

또한, 하위 계층의 픽쳐에 두 개의 필터를 적용하는 경우 하나 이상의 픽쳐가 참조 픽쳐 리스트에 추가될 수 있으므로 참조 픽쳐 리스트를 구성하거나 시그널링 하는 정보가 증가할 수 있다. In addition, when two filters are applied to a picture of a lower layer, since one or more pictures may be added to the reference picture list, information configuring or signaling the reference picture list may increase.

이에, 본 발명은 하위 계층의 픽쳐에 필터를 적용함에 있어, 하나 이상의 필터를 소정의 유닛 단위에 적응적으로 선택/적용하고, 하나의 하위 계층 픽쳐를 상위 계층 참조 픽쳐 리스트에 추가함으로써 부호화 효율을 향상시키며 참조 픽쳐 리스트를 구성하는 복잡도가 증가하지 않도록 하는 것을 특징으로 한다.Accordingly, the present invention improves encoding efficiency by adaptively selecting/applying one or more filters to a predetermined unit unit and adding one lower layer picture to the upper layer reference picture list in applying the filter to the lower layer picture. It is characterized in that the complexity of constructing the reference picture list is not increased.

도 5는 본 발명에 따른 영상의 처리 방법을 설명하기 위한 제어 흐름도이다. 보다 구체적으로 도 5는 제 2 계층의 픽쳐에 대한 부, 복호화 과정을 수행할 때 제1 계층의 픽쳐를 이용하는 다계층 영상의 부호화 및 복호화 방법을 설명하기 위한 도면이다. 따라서, 도 5에 대한 설명은 영상의 복호화 및 부호화 방법에 모두 적용될 수 있다. 5 is a control flowchart illustrating an image processing method according to the present invention. More specifically, FIG. 5 is a diagram for explaining a method of encoding and decoding a multi-layered image using a picture of a first layer when performing an encoding and decoding process on a picture of a second layer. Accordingly, the description of FIG. 5 may be applied to both video decoding and encoding methods.

제1 계층 픽쳐에 대한 필터링을 위하여 우선, 부호화기 및 복호화기는 제1 계층 픽쳐에 대한 필터 종류를 결정한다(S510). 필터의 종류가 결정되기 이전에 제1 계층 픽쳐의 복호화가 수행되며, 제1 계층 픽쳐 및 제2 계층 픽쳐는 각각 상이한 구성 또는 모듈에 의하여 복호화 될 수 있다. 부호화기 및 복호화기는 제1 계층 픽쳐의 복호화를 위하여 도 1 및 도 2의 구성을 갖는 복호화부와 제2 계층 픽쳐의 복호화를 위하여 도 1 및 도 2의 구성을 갖는 복호화부를 각각 포함할 수 있다. In order to filter the first hierarchical picture, first, the encoder and the decoder determine a filter type for the first hierarchical picture (S510). The first hierarchical picture is decoded before the filter type is determined, and the first hierarchical picture and the second hierarchical picture may be decoded by different configurations or modules, respectively. The encoder and decoder may include a decoder having the configuration of FIGS. 1 and 2 for decoding the first hierarchical picture, and a decoder having the configuration of FIGS. 1 and 2 for decoding the second hierarchical picture, respectively.

도 6은 본 발명에 따른 다계층 픽쳐를 도시한 도면이다. 도시된 바와 같이, 제 2 계층의 픽쳐가 부호화 및 복호화 대상이 되는 대상 픽쳐가 되고, 제1 계층의 픽쳐는 대상 픽쳐에 대응하는 대응 픽쳐로 제2 계층 픽쳐의 참조 픽쳐가 될 수 있다. 6 is a diagram illustrating a multi-layered picture according to the present invention. As shown, the picture of the second layer becomes a target picture to be encoded and decoded, and the picture of the first layer is a corresponding picture corresponding to the target picture and may be a reference picture of the picture of the second layer.

대상 픽쳐와 대응 픽쳐는 동일한 POC를 가지며, 예를 들어 대상 픽쳐와 대응 픽쳐의 POC는 4일 수 있다. 대상 픽쳐의 예측을 위하여 POC가 4인 대응 픽쳐와 POC가 0 또는 POC 8인 제2 계층의 픽쳐가 이용될 수 있다.The target picture and the corresponding picture have the same POC, for example, the POC of the target picture and the corresponding picture may be 4. For prediction of a target picture, a corresponding picture having a POC of 4 and a picture of a second layer having a POC of 0 or POC 8 may be used.

제2 계층의 부/복호화 대상 픽쳐와 제1 계층 대응 픽쳐의 크기는 같거나 다를 수 있다. 제2 계층의 대상 픽쳐와 제1 계층 대응 픽쳐의 크기가 같더라도 신호의 특성이 다를 수 있다. 따라서, 제1 계층의 복원된 픽쳐에 대하여 업 샘플링(up-sampling) 또는 리 샘플링(re-sampling)을 적용하여 두 계층의 픽쳐 크기를 같게 하거나 신호의 특성을 변화시킨 후 이용함으로써 예측 효율을 향상시킬 수 있다. The size of the sub/decoding target picture of the second layer and the picture corresponding to the first layer may be the same or different. Even if the size of the target picture of the second layer and the corresponding picture of the first layer are the same, characteristics of signals may be different. Therefore, the prediction efficiency is improved by applying up-sampling or re-sampling to the reconstructed picture of the first layer to make the picture sizes of the two layers the same or to use after changing the signal characteristics. can do it

이러한 샘플링을 위해 소정의 필터를 적용할 수 있으며, 필터는 휘도(Luma) 성분 또는 색차(Chroma) 성분에 따라 다를 수 있다.A predetermined filter may be applied for this sampling, and the filter may be different according to a luminance (Luma) component or a chroma (Chroma) component.

부호화기 및 복호화기는 제1 계층 픽쳐의 필터 종류를 하나의 고정된 필터로 결정할 수도 있고, 하나의 이상의 필터를 적응적으로 선택하여 제1 계층 픽쳐에 적용할 수도 있다. The encoder and decoder may determine the filter type of the first hierarchical picture as one fixed filter, or may adaptively select one or more filters and apply them to the first hierarchical picture.

부호화기 및 복호화기가 제1 계층 픽쳐에 고정 필터를 적용하는 경우, 제1 계층의 대응 픽쳐에 대해 기본(default) 내삽(interpolation) 필터를 적용할 수 있다. When the encoder and the decoder apply the fixed filter to the first layer picture, a default interpolation filter may be applied to the corresponding picture of the first layer.

기본 필터(Default filter)는 하기의 표 1 및 표 2와 같은 휘도/색차 신호에 대한 필터 세트(set)을 가질 수 있으며 계층간의 픽쳐 크기 비율에 따라 적용하는 위상(phase) 및 필터 계수는 다를 수 있다.The default filter may have a filter set for the luminance/color difference signal as shown in Tables 1 and 2 below, and the phase and filter coefficients applied according to the picture size ratio between layers may be different. there is.

표 1은 휘도 신호에 대한 16 위상 리샘플링 필터 계수를 나타내고 있으며, 표 2는 색차 신호에 대한 16 위상 리샘플링 필터 계수를 나타내고 있다. Table 1 shows the 16-phase resampling filter coefficients for the luminance signal, and Table 2 shows the 16-phase resampling filter coefficients for the chrominance signal.

이때, 위상 ‘0’ 은 정수(integer) 위치의 샘플에 대한 필터 계수를 의미할 수 있으며, 필터를 적용하지 않는 것과 같은 결과를 가져올 수 있다. In this case, the phase '0' may mean a filter coefficient for a sample at an integer position, and may have the same result as not applying a filter.

또는, 부호화기 및 복호화기는 제1 계층의 대응 픽쳐 또는 기본 필터를 적용한 픽쳐에 대해 소정의 고정된 필터로서 대체 필터(Alternative filter)를 적용할 수 있다. Alternatively, the encoder and the decoder may apply an alternative filter as a predetermined fixed filter to the corresponding picture of the first layer or the picture to which the basic filter is applied.

대체 필터는 상술된 기본 필터와 유사하게 하나 이상의 필터 계수를 포함하는 필터 세트을 가질 수 있다. 이 경우 필터 계수는 부호화기에서 정해진 후 복호화기로 시그널링 될 수 있다.An alternative filter may have a filter set comprising one or more filter coefficients, similar to the basic filter described above. In this case, the filter coefficients may be determined by the encoder and signaled to the decoder.

대체 필터는 기본 필터를 적용한 픽쳐에 대해 적용할 수 있으며, 기본 필터의 위상‘0’에 해당하는 정수(integer) 위치의 샘플에 대한 필터를 의미할 수 있다. 이때, 대체 필터는 [-1, 3, 12, 3, ?1]/16 과 같이 고정된 필터 계수를 가질 수 있다.The replacement filter may be applied to a picture to which the basic filter is applied, and may mean a filter for a sample at an integer position corresponding to the phase '0' of the basic filter. In this case, the replacement filter may have a fixed filter coefficient such as [-1, 3, 12, 3, ?1]/16.

화질 스케일러빌터티(SNR scalability)를 지원하는 비디오의 경우, 제2 계층과 제1 계층의 픽쳐의 크기가 같기 때문에 필터링을 수행하지 않을 수 있다. 하지만, 본 발명에 따를 경우 제1 계층의 신호의 특성을 변화시키기 위해 제1 계층 대응 픽쳐에 대체 필터를 적용할 수 있다.In the case of a video supporting picture quality scalability (SNR scalability), filtering may not be performed because the size of the picture of the second layer and the picture of the first layer is the same. However, according to the present invention, a replacement filter may be applied to the picture corresponding to the first layer in order to change the characteristics of the signal of the first layer.

이때, 대체 필터의 사용 여부를 나타내는 플래그는 비디오 파라미터 세트 (video parameter set), 시퀀스 파라미터 세트(sequence parameter set), 픽쳐 파라미터 세트(picture parameter set), 슬라이스 헤더(Slice header) 등 중 적어도 하나를 통하여 시그널링 될 수 있다.In this case, the flag indicating whether to use the alternative filter is through at least one of a video parameter set, a sequence parameter set, a picture parameter set, a slice header, etc. can be signaled.

또 다른 실시예에 따르면, 부호화기 및 복호화기는 필터 종류로 고정된 필터가 아닌 하나 이상의 필터를 적응적으로 선택하여 제1 계층의 대응 픽쳐에 적용할 수 있다. According to another embodiment, the encoder and the decoder may adaptively select one or more filters instead of a fixed filter as a filter type and apply them to the corresponding picture of the first layer.

예를 들어, 부호화기 및 복호화기는 공간적 스케일러빌러티(spatial scalability)가 지원되는 비디오에서, 제1 계층의 픽쳐에 대해 기본 필터를 적용한 후 정수 위치 샘플에 대해서는 대체 필터를 항상 적용할 수 있다. 이 때, 부호화기로부터 추가적인 시그널링이 없어도 복호화기는 기본 필터와 대체 필터 두 개를 각각 적응적으로 적용하여 픽쳐를 필터링 할 수 있다.For example, in a video in which spatial scalability is supported, the encoder and the decoder may always apply the replacement filter to the integer position samples after applying the basic filter to the picture of the first layer. In this case, even without additional signaling from the encoder, the decoder can filter the picture by adaptively applying two basic filters and two replacement filters, respectively.

또 다른, 예를 들면, 부호화기 및 복호화기는 공간적 스케일러빌러티가 지원되는 비디오에서, 제1 계층의 픽쳐에 대해 기본 필터를 적용한다. 그런 후, 복호화기는 부호화기로부터 시그널링 되는 플래그에 따라 대체 필터의 적용 여부 결정할 수 있다. Another, for example, encoder and decoder apply a basic filter to a picture of a first layer in a video in which spatial scalability is supported. Then, the decoder may determine whether to apply the replacement filter according to a flag signaled from the encoder.

부호화기는 제1 계층의 픽쳐에 대해 기본 필터를 적용한 후 정수 위치 샘플에 대해 대체 필터를 적용할 수 있다. 정수 위치 샘플에 대체 필터를 적용한 후, RDO(Rate Distortion Optimization), SATD(Sum of Absolute Hadamard Transformed Differences), SAD(Sum of Absolute Difference) 등 중 하나 이상을 연산하여 대체 필터의 적용 여부를 결정할 수 있다. 연산의 결과에 따라 대체 필터를 적용하는 것이 적용하지 않을 때 보다 효과가 좋은 경우 부호화기는 대체 필터를 정수 위치 샘플에 적용하는 것으로 결정할 수 있다. 대체 필터를 적용할 지 여부에 대한 신호, 즉 대체 샘플의 적용 여부를 나타내는 플래그는 비디오 파라미터 세트 (video parameter set), 시퀀스 파라미터 세트(sequence parameter set), 픽쳐 파라미터 세트(picture parameter set), 슬라이스 헤더(Slice header) 등 중 적어도 하나를 통하여 시그널링 될 수 있다.After applying the basic filter to the picture of the first layer, the encoder may apply the replacement filter to the integer position sample. After applying the replacement filter to the integer position sample, it is possible to determine whether to apply the replacement filter by calculating one or more of Rate Distortion Optimization (RDO), Sum of Absolute Hadamard Transformed Differences (SATD), and Sum of Absolute Difference (SAD). . According to the result of the operation, if applying the replacement filter is more effective than not applying it, the encoder may decide to apply the replacement filter to integer position samples. A signal indicating whether or not to apply the replacement filter, that is, a flag indicating whether to apply the replacement sample, is a video parameter set, a sequence parameter set, a picture parameter set, and a slice header. (Slice header) may be signaled through at least one of.

또 다른 실시예에 따르면, 부호화기 및 복호화기는 공간적 스케일러빌러티가 지원되는 비디오에 포함된 제1 계층의 픽쳐에 대해 기본 필터를 적용하고, 화질적 스케일러빌러티가 적용되는 비디오에 포함된 제1 계층의 픽쳐에 대해 대체 필터를 적용할 수 있다. According to another embodiment, the encoder and the decoder apply a basic filter to a picture of a first layer included in a video to which spatial scalability is supported, and a first layer included in a video to which picture quality scalability is applied. A replacement filter can be applied to a picture of .

또, 다른 실시예에 따르면, 부호화기 및 복호화기는 샘플들 간의 상관 관계를 기반으로 적응적으로 필터를 선택하여 적용할 수 있다. 즉, 4x4, 8x8, 16x16, 32x32, 64x64 등의 특정 블록 단위로 샘플간 상관 관계를 계산한 후 특정 문턱값(threshold)과 비교하여 대체 필터를 적응적으로 적용할 수 있다. 이때, 샘플들 간의 상관 관계는 제1 계층 대응 픽쳐(복원된 픽쳐) 또는 기본 필터를 적용한 픽쳐를 기반으로 산출할 수 있으며, 문턱값은 스케일러빌러티가 공간적 인지 화질적인지에 따라 상이하게 설정될 수 있다.Also, according to another embodiment, the encoder and the decoder may adaptively select and apply a filter based on the correlation between samples. That is, after calculating the correlation between samples in units of specific blocks, such as 4x4, 8x8, 16x16, 32x32, 64x64, the replacement filter can be adaptively applied by comparing it with a specific threshold. In this case, the correlation between the samples can be calculated based on the first layer corresponding picture (reconstructed picture) or the picture to which the basic filter is applied, and the threshold value can be set differently depending on whether the scalability is spatial or image quality. there is.

예를 들어, 부호화기 및 복호화기는 도 7과 같이 블록의 크기가 4x4인 경우, 수직 방향의 활동성(Vertical Activity, VA))과 수평 방향의 활동성(Horizontal Activity, HA)을 수식 1을 이용하여 구할 수 있다.For example, the encoder and the decoder can obtain the vertical activity (VA) and the horizontal activity (HA) using Equation 1 when the block size is 4x4 as shown in FIG. 7 . there is.

<수식 1><Formula 1>

이때, i와 j는 0과 2이다. In this case, i and j are 0 and 2.

수식 1에 기초하여 수직 방향 및 수평 방향 활동성이 연산되면, 수식 2를 이용하여 블록 활동성(Block activity)이 연산될 수 있다. When vertical and horizontal activity is calculated based on Equation 1, block activity may be calculated using Equation 2.

<수식 2><Formula 2>

블록 활동성 = (VA + HA) >> 2Block Activity = (VA + HA) >> 2

부호화기 및 복호화기는 블록 활동성이 문턱값 보다 작은 경우 대상 블록에 대체 필터를 적용하고, 그렇지 않은 경우, 즉 블록 활동성이 문턱값 이상이면 대체 필터를 적용하지 않을 수 있다. The encoder and the decoder may apply the replacement filter to the target block when the block activity is less than the threshold, and may not apply the replacement filter when the block activity is greater than or equal to the threshold.

수직 방향 활동성, 수평 방향 활동성 및 블록 활동성을 연산하여 대체 필터를 적용하는 과정은 8x8 이상의 블록에도 동일하게 적용될 수 있다.The process of applying the replacement filter by calculating the vertical vitality, horizontal vitality, and block vitality can be equally applied to blocks of 8x8 or more.

또 다른 실시예에 따르면 부호화기 및 복호화기는 블록의 크기가 NxN인 경우, 수직 방향 및 수평 방향의 샘플이 아래 수식 3을 만족하는 경우 대상 블록에 대체 필터를 적용하고 만족하지 않는 경우 대체 필터를 적용하지 않을 수 있다. According to another embodiment, the encoder and the decoder apply the replacement filter to the target block when the size of the block is NxN, when samples in the vertical and horizontal directions satisfy Equation 3 below, and do not apply the replacement filter when not satisfied. it may not be

<수식 3><Formula 3>

Abs(p[?1][N>>1-1] + p[N-1][N>>1-1] ? 2*p[N>>1-1][N>>1-1]) < (1<<( BitDepthY ? 5)) Abs(p[?1][N>>1-1] + p[N-1][N>>1-1] ? 2*p[N>>1-1][N>>1-1] ) < (1<<( BitDepthY ? 5))

Abs(p[N>>1-1][?1] + p[N>>1-1][N-1] ? 2*p[N>>1-1][N>>1-1]) < (1<<( BitDepthY ? 5))Abs(p[N>>1-1][?1] + p[N>>1-1][N-1] ? 2*p[N>>1-1][N>>1-1] ) < (1<<( BitDepthY ? 5))

이때, p[x][y]는 (x, y) 위치의 샘플값, Abs는 절대값을 나타내며, BitDepthY는 휘도 신호의 비트 뎁스(bit depth)를 의미할 수 있다.In this case, p[x][y] may indicate a sample value at the (x, y) position, Abs may indicate an absolute value, and BitDepthY may indicate a bit depth of a luminance signal.

이와 같이, 본 발명에 따르면, 제1 계층 픽쳐에 대한 필터링 시 기본 필터 또는 대체 필터 중 어느 하나의 고정된 필터를 적용할 수도 있고, 기본 필터 및 대체 필터를 둘 중에 하나 또는 모두를 선택적으로 적용할 수 있다. As such, according to the present invention, when filtering the first hierarchical picture, either a fixed filter of the basic filter or the replacement filter may be applied, and one or both of the basic filter and the replacement filter may be selectively applied. can

제1 계층의 픽쳐에 대한 필터 종류가 결정되면, 부호화기 및 복호화기는 필터를 적용하는 대상을 결정할 수 있다(S520).When the filter type for the picture of the first layer is determined, the encoder and the decoder may determine a target to which the filter is applied ( S520 ).

부호화기 및 복호화기는 상기의 고정된 필터 또는 적응적 필터를 제1 계층 영상의 대응 픽쳐, 슬라이스, 부호화 유닛(CU), 예측 유닛(PU), 변환 유닛(TU), 소정의 유닛(NxN) 별로 적용할 수 있다. The encoder and the decoder apply the fixed filter or the adaptive filter for each corresponding picture, slice, encoding unit (CU), prediction unit (PU), transform unit (TU), and predetermined unit (NxN) of the first hierarchical image. can do.

예를 들어, 부호화기에서 픽쳐 단위로 대체 필터의 사용 여부를 결정하고 PPS를 통하여 사용 여부를 나타내는 플래그를 시그널링 할 수 있다. 복호화기는 시그널링 되는 플래그에 따라 대체 필터의 사용 여부를 판단하고 픽쳐 단위로 필터링을 수행할 수 있다.For example, the encoder may determine whether to use the replacement filter in units of pictures and may signal a flag indicating whether to use the replacement filter through the PPS. The decoder may determine whether to use an alternative filter according to a signaled flag, and may perform filtering in units of pictures.

또 다른 예에 따르면, 부호화기에서 슬라이스 단위로 대체 필터의 사용 여부를 결정하고 슬라이스 헤더를 통하여 사용 여부를 나타내는 플래그를 시그널링 할 수 있다. 복호화기는 시그널링 되는 플래그에 따라 대체 필터의 사용 여부를 판단하고 슬라이스 단위로 필터링을 수행할 수 있다.According to another example, the encoder may determine whether to use the replacement filter on a slice-by-slice basis, and may signal a flag indicating whether to use the replacement filter through a slice header. The decoder may determine whether to use an alternative filter according to a signaled flag, and may perform filtering in units of slices.

또는, 부호화기와 복호화기는 예측 유닛 마다 샘플 간의 상관 관계를 분석한 후 기본 필터 또는 대체 필터를 적응적으로 적용할 수 있다. 이 경우, 부호화기와 복호화기에서 개별적으로 샘플 간의 상관 관계가 분석되어 적응적으로 필터링이 수행될 수 있으므로 필터링 여부에 대한 추가 시그널링은 필요하지 않을 수 있다.Alternatively, the encoder and the decoder may adaptively apply the basic filter or the replacement filter after analyzing the correlation between samples for each prediction unit. In this case, since the correlation between samples is individually analyzed by the encoder and the decoder to perform adaptive filtering, additional signaling on whether to filter may not be required.

예를 들어, 적응적 필터링을 위한 소정의 유닛을 결정하고 소정의 유닛 단위로 기본 필터 또는 대체 필터를 적용할 수 있다. 이때, 대체 필터는 정수 위치의 샘플에 대하여 필터링을 수행하는 필터일 수 있다. For example, a predetermined unit for adaptive filtering may be determined and a basic filter or a replacement filter may be applied in a predetermined unit unit. In this case, the replacement filter may be a filter that performs filtering on samples in integer positions.

상기 소정의 유닛을 8x8로 결정한 경우, 제1 계층 픽쳐에 대하여 8x8 단위로 샘플간 상관 관계를 산출한 후 적응적으로 필터를 적용할 수 있다.When the predetermined unit is determined to be 8x8, a filter may be adaptively applied after calculating a correlation between samples in an 8x8 unit with respect to the first hierarchical picture.

단계 S510 및 단계 S520을 통하여 필터 종류와 필터 적용 대상이 결정되면, 부호화기 및 복호화기는 하나 이상의 필터 또는 적용 대상의 조합을 통하여 제1 계층 픽쳐에 대한 필터링을 수행한다(S530). 즉, 부호화기 및 복호화기는 결정된 필터 종류에 기초하여 필터 적용 대상에 필터링을 수행한다.When the filter type and filter application target are determined through steps S510 and S520, the encoder and the decoder perform filtering on the first layer picture through one or more filters or a combination of application targets (S530). That is, the encoder and the decoder perform filtering on the filter application target based on the determined filter type.

그런 후, 부호화기 및 복호화기는 필터링 된 제1 계층 픽쳐를 제2 계층 참조 픽쳐 리스트에 추가할 수 있다(S540). 제2 계층 픽쳐를 복호화 하기 위한 예측부에 의하여 제1 계층 픽쳐가 제2 계층 참조 픽쳐 리스트에 추가될 수 있다. Then, the encoder and the decoder may add the filtered first layer picture to the second layer reference picture list (S540). The first layer picture may be added to the second layer reference picture list by the prediction unit for decoding the second layer picture.

제2 계층 대상 픽쳐를 부호화 및 복호화함에 있어 필터링을 통하여 얻어진 하나 이상의 제1 계층 픽쳐를 제2 계층 참조 픽쳐 리스트 0 또는 참조 픽쳐 리스트 1의 임의의 위치에 추가하여 이용할 수 있다. In encoding and decoding the second layer target picture, one or more first layer pictures obtained through filtering may be added and used at an arbitrary position in the second layer reference picture list 0 or reference picture list 1.

이 경우, 부호화기 및 복호화기는 제1 계층 픽쳐를 참조 픽쳐 리스트 내의 임의의 인덱스에 할당할 수 있다. In this case, the encoder and the decoder may allocate the first hierarchical picture to an arbitrary index in the reference picture list.

도 8은 본 발명에 따른 제1 계층 및 제2 계층의 픽쳐의 POC를 도시한 도면이다. 도 8을 참조하면, 부호화 및 복호화 대상이 되는 픽쳐는 POC가 2인 제2 계층 픽쳐일 수 있고, 대상 픽쳐는 제1 계층의 대응 픽쳐, 자신 보다 POC 가 작은 픽쳐 (POC 0, POC 1) 및 자신 보다 POC 가 큰 픽쳐(POC 3, POC 4)를 단기 참조 픽쳐로 참조할 수 있다. 또한, 도시하지 않은 장기 참조 픽쳐를 참조할 수도 있다. 8 is a diagram illustrating a POC of a picture of a first layer and a second layer according to the present invention. Referring to FIG. 8 , a picture to be encoded and decoded may be a second layer picture having a POC of 2, and the target picture is a corresponding picture of the first layer, a picture having a smaller POC than itself (POC 0, POC 1) and Pictures (POC 3, POC 4) having a larger POC than themselves may be referred to as short-term reference pictures. Also, a long-term reference picture (not shown) may be referred to.

도 9는 본 발명에 따른 참조 픽쳐 리스트를 도시한 도면이다. 부호화기 및 복호화기는 이러한 참조 픽쳐를 참조 픽쳐 리스트에 추가할 때, 도 9와 같이 임의 위치에 제1 계층 픽쳐(F0)를 추가할 수 있다. 9 is a diagram illustrating a reference picture list according to the present invention. When adding the reference picture to the reference picture list, the encoder and the decoder may add the first layer picture F0 to an arbitrary position as shown in FIG. 9 .

예를 들어, 도 9의 [a]와 같이, 부호화기 및 복호화기는 참조 픽쳐 리스트 0(List 0)를 구성할 때, 현재 대상 픽쳐의 POC보다 POC가 작은 단기 참조 픽쳐(POC 0, POC 1), 현재 대상 픽쳐의 POC보다 POC가 큰 단기 참조 픽쳐(POC 3, POC 4), 제1 계층 픽쳐(F0)를 순차적으로 참조 픽쳐 리스트 0에 추가하고, 마지막으로 장기 참조 픽쳐(lt)를 추가할 수 있다. For example, as shown in [a] of FIG. 9 , when the encoder and the decoder construct the reference picture list 0 (List 0), the POC of the current target picture is smaller than the POC of the short-term reference picture (POC 0, POC 1), Short-term reference pictures (POC 3, POC 4), the first layer picture (F0), which have a larger POC than the POC of the current target picture, are sequentially added to the reference picture list 0, and finally, a long-term reference picture (lt) can be added. there is.

유사하게 참조 픽쳐 리스트 1(List 1)은 현재 대상 픽쳐의 POC보다 POC가 큰 단기 참조 픽쳐(POC 3, POC 4), 현재 대상 픽쳐의 POC보다 POC가 작은 단기 참조 픽쳐(POC 0, POC 1), 제1 계층 픽쳐(F0), 장기 참조 픽쳐(lt) 순으로 구성될 수 있다. Similarly, reference picture list 1 (List 1) is a short-term reference picture (POC 3, POC 4) having a larger POC than the POC of the current target picture, and a short-term reference picture having a smaller POC than the POC of the current target picture (POC 0, POC 1) , a first layer picture (F0), and a long-term reference picture (lt) may be configured in the order.

또 다른 실시예에 따르면, 부호화기 및 복호화기는 [b]와 같이 참조 픽쳐 리스트를 구성할 수도 있다. 도 9의 [b]를 참조하면, 참조 픽쳐 리스트 0(List 0)은 현재 대상 픽쳐의 POC보다 POC가 작은 단기 참조 픽쳐(POC 0, POC 1), 제1 계층 픽쳐(F0), 현재 대상 픽쳐의 POC보다 POC가 큰 단기 참조 픽쳐(POC 3, POC 4), 장기 참조 픽쳐(lt) 순으로 구성될 수 있다. According to another embodiment, the encoder and the decoder may construct a reference picture list as in [b]. Referring to [b] of FIG. 9 , the reference picture list 0 (List 0) is a short-term reference picture (POC 0, POC 1) having a smaller POC than the POC of the current target picture, the first layer picture (F0), the current target picture The short-term reference pictures (POC 3, POC 4) having a larger POC than the POC of may be configured in the order of a long-term reference picture (lt).

유사하게 참조 픽쳐 리스트 1(List 1)는 현재 대상 픽쳐의 POC보다 POC가 큰 단기 참조 픽쳐(POC 3, POC 4), 현재 대상 픽쳐의 POC보다 POC가 작은 단기 참조 픽쳐(POC 0, POC 1), 제1 계층 픽쳐(F0), 장기 참조 픽쳐(lt) 순으로 구성될 수 있다. Similarly, reference picture list 1 (List 1) is a short-term reference picture (POC 3, POC 4) having a larger POC than the POC of the current target picture, and a short-term reference picture having a smaller POC than the POC of the current target picture (POC 0, POC 1) , a first layer picture (F0), and a long-term reference picture (lt) may be configured in the order.

또 다른 실시예에 따르며, 부호화기 및 복호화기는 각 리스트의 처음 또는 마지막에 추가할 수 있다. 즉, 제1 계층 픽쳐(F0)를 List 0 에 1, 0, 3, 4, lt 에 해당하는 제2 계층의 참조 픽쳐를 추가하고, 가장 마지막에 필터링을 통하여 얻어진 제1 계층 픽쳐(F0)를 추가할 수 있다. List 1 역시, 3, 4, 1, 0, lt 다음으로 제1 계층 픽쳐(F0)를 추가할 수 있다. According to another embodiment, the encoder and the decoder may be added to the beginning or the end of each list. That is, the first layer picture (F0) is added to the reference picture of the second layer corresponding to 1, 0, 3, 4, lt to List 0, and finally the first layer picture (F0) obtained through filtering can be added List 1 may also add a first hierarchical picture F0 following 3, 4, 1, 0, and lt.

한편, 본 발명의 다른 실시예에 따르면, 추가되는 제1 계층 픽쳐 개수는 리스트 마다 다를 수 있으며, 각 리스트에 추가되는 픽쳐에 적용한 필터는 특정 필터로 고정될 수도 있고, 부호화기에서 결정하여 복호화기로 시그널링 할 수도 있다.Meanwhile, according to another embodiment of the present invention, the number of first layer pictures to be added may be different for each list, and a filter applied to a picture added to each list may be fixed to a specific filter, or may be determined by an encoder and signaled to a decoder You may.

도 10은 본 발명의 일 실시예에 따라 P 슬라이스에 대한 참조 픽쳐 리스트를 도시한 도면이다. 10 is a diagram illustrating a reference picture list for a P slice according to an embodiment of the present invention.

P 슬라이스의 경우, 도 10의 [a]와 같이 제1 계층 픽쳐(F0)에 대하여 필터링 방법 적용한 하나의 픽쳐를 참조 픽쳐 리스트 0의 마지막에 추가할 수 있다. 이때, 픽쳐 1, 0, 3, 4는 제2 계층의 참조 픽쳐일 수 있고, F0 는 상기 필터링 방법 중 하나의 방법을 이용하여 제1 계층 대응 픽쳐를 필터링한 픽쳐를 나타낼 수 있다.In the case of P slice, one picture to which the filtering method is applied to the first hierarchical picture F0 may be added to the end of the reference picture list 0 as shown in [a] of FIG. 10 . In this case, pictures 1, 0, 3, and 4 may be reference pictures of the second layer, and F0 may represent a picture obtained by filtering the corresponding picture of the first layer using one of the filtering methods.

한편, 부호화기 및 복호화기는 P 슬라이스의 경우, 도 10의 [b]와 같이, 제2 계층의 참조 픽쳐 다음으로 제1 계층 픽쳐에 대하여 필터링을 수행한 두 개의 픽쳐(F0, F1)를 참조 픽쳐 리스트 0에 추가할 수 있다. 이때, F0 와 F1 은 각각 도 5 내지 도 7을 참조하여 설명된 필터링 방법 중 하나를 이용하여 제1 계층 대응 픽쳐를 필터링한 픽쳐를 나타낼 수 있다.On the other hand, in the case of a P slice, the encoder and the decoder refer to two pictures (F0, F1) on which filtering is performed on the first layer picture after the second layer reference picture as shown in [b] of FIG. 10 in the case of a P slice reference picture list 0 can be added. In this case, F0 and F1 may represent pictures obtained by filtering the first layer corresponding picture using one of the filtering methods described with reference to FIGS. 5 to 7 , respectively.

도 11은 본 발명의 일 실시예에 따라 B 슬라이스에 대한 참조 픽쳐 리스트를 도시한 도면이다. 11 is a diagram illustrating a reference picture list for a B slice according to an embodiment of the present invention.

부호화기 및 복호화기는 B 슬라이스의 경우, 도 11의 [a]와 같이 제1 계층 픽쳐에 대하여 필터링을 적용한 하나의 픽쳐(F0)를 참조 픽쳐 리스트 0 또는 참조 픽쳐 리스트 1에 추가할 수 있다. 이때, 제1 계층 픽쳐(F0)는 도 5 내지 도 7을 참조하여 설명된 필터링 방법 중 하나를 이용하여 제1 계층 대응 픽쳐를 필터링한 픽쳐를 나타낼 수 있다.In the case of a B slice, the encoder and decoder may add one picture F0 to which filtering is applied to the first hierarchical picture as shown in [a] of FIG. 11 to the reference picture list 0 or the reference picture list 1. In this case, the first layer picture F0 may represent a picture obtained by filtering the first layer corresponding picture using one of the filtering methods described with reference to FIGS. 5 to 7 .

또한, 부호화기 및 복호화기는 B 슬라이스의 경우, 도 11의 [b]와 같이 제1 계층 픽쳐에 대하여 필터링을 적용한 두 개의 픽쳐를 참조 픽쳐 리스트 0(List 0) 와 참조 픽쳐 리스트 1(List 1)에 추가할 수 있다. 이때, F0 와 F1 은 도 5 내지 도 7을 참조하여 설명된 필터링 방법 중 하나를 이용하여 제1 계층 대응 픽쳐를 필터링한 픽쳐를 나타낼 수 있으며, F0와 F1은 같을 수도 있다. In addition, in the case of a B slice, the encoder and the decoder add the two pictures to which filtering is applied to the first hierarchical picture as shown in [b] of FIG. 11 in the reference picture list 0 (List 0) and the reference picture list 1 (List 1). can be added In this case, F0 and F1 may represent a picture obtained by filtering the first layer-corresponding picture using one of the filtering methods described with reference to FIGS. 5 to 7 , and F0 and F1 may be the same.

예를 들어, 부호화기에서 SATD 등을 통하여 대체 필터를 사용하는 것으로 결정된 경우, F0는 기본 필터를 적용한 픽쳐, F1은 기본 필터를 적용한 후 대체 필터를 적용한 픽쳐일 수 있다. For example, when the encoder determines to use the replacement filter through SATD, etc., F0 may be a picture to which the basic filter is applied, and F1 may be a picture to which the replacement filter is applied after applying the basic filter.

이때, 화질적 스케일러빌러티의 경우에는 F0는 필터를 적용하지 않은 대응 픽쳐, F1은 대체 필터를 적용한 픽쳐일 수 있다.In this case, in the case of picture quality scalability, F0 may be a corresponding picture to which no filter is applied, and F1 may be a picture to which a replacement filter is applied.

또 다른 실시예에 따르면, 부호화기 및 복호화기는 B 슬라이스에 대한 참조 픽쳐 리스트를 구성할 때 도 11의 [c]와 같이 제1 계층 픽쳐에 대하여 필터링을 적용한 4개의 픽쳐(F0, F1, F2, F3)를 참조 픽쳐 리스트 0와 참조 픽쳐 리스트 1 에 추가할 수 있다. 이때, F0, F1, F2, F3 는 상술된 필터링 방법 중 하나를 이용하여 제1 계층 대응 픽쳐를 필터링한 픽쳐를 나타내며 F0, F1, F2, F3 중 하나 이상은 같을 수도 있다.According to another embodiment, when the encoder and the decoder construct the reference picture list for the B slice, four pictures (F0, F1, F2, F3) to which filtering is applied to the first hierarchical picture as shown in [c] of FIG. 11 . ) may be added to the reference picture list 0 and the reference picture list 1. In this case, F0, F1, F2, and F3 represent a picture obtained by filtering the first layer-corresponding picture using one of the above-described filtering methods, and at least one of F0, F1, F2, and F3 may be the same.

상술한 바와 같이, 본 발명에 따를 경우 상위 계층의 참조 픽쳐 리스트에 하위 계층의 픽쳐를 추가함에 있어, 하위 계층의 픽쳐에 하나 이상의 필터를 적응적으로 적용한 후 추가함으로써, 상위 계층에서 예측 오차를 줄이고 부호화 효율을 향상시키는 영상의 복호화 방법 및 이를 이용하는 장치가 제공된다.As described above, according to the present invention, when adding a picture of a lower layer to the reference picture list of an upper layer, one or more filters are adaptively applied to the picture of the lower layer and then added, thereby reducing the prediction error in the upper layer and A video decoding method for improving encoding efficiency and an apparatus using the same are provided.

이로써, 부호화 효율을 향상시키며 참조 픽쳐 리스트를 증가하지 않게 하는영상의 복호화 방법 및 이를 이용하는 장치가 제공될 수 있다. Accordingly, it is possible to provide a method for decoding an image that improves encoding efficiency and does not increase the reference picture list, and an apparatus using the same.

상술한 실시예에서, 방법들은 일련의 단계 또는 블록으로서 순서도를 기초로 설명되고 있으나, 본 발명은 단계들의 순서에 한정되는 것은 아니며, 어떤 단계는 상술한 바와 다른 단계와 다른 순서로 또는 동시에 발생할 수 있다. 또한, 당해 기술 분야에서 통상의 지식을 가진 자라면 순서도에 나타난 단계들이 배타적이지 않고, 다른 단계가 포함되거나, 순서도의 하나 또는 그 이상의 단계가 본 발명의 범위에 영향을 미치지 않고 삭제될 수 있음을 이해할 수 있을 것이다.In the above embodiment, the methods are described on the basis of a flowchart as a series of steps or blocks, but the present invention is not limited to the order of the steps, and some steps may occur in a different order or concurrently with other steps as described above. there is. In addition, those of ordinary skill in the art will recognize that the steps shown in the flowchart are not exclusive, other steps may be included, or that one or more steps in the flowchart may be deleted without affecting the scope of the present invention. You will understand.

상술한 실시예는 다양한 양태의 예시들을 포함한다. 다양한 양태들을 나타내기 위한 모든 가능한 조합을 기술할 수는 없지만, 해당 기술 분야의 통상의 지식을 가진 자는 다른 조합이 가능함을 인식할 수 있을 것이다. 따라서, 본 발명은 이하의 특허청구범위 내에 속하는 모든 다른 교체, 수정 및 변경을 포함한다고 할 것이다.The above-described embodiments include examples of various aspects. It is not possible to describe every possible combination for representing the various aspects, but one of ordinary skill in the art will recognize that other combinations are possible. Accordingly, it is intended that the present invention cover all other substitutions, modifications and variations falling within the scope of the following claims.

*100 : 영상 부호화 장치 111: 움직임 예측부
112: 움직임 보상부 120 : 인트라 예측부
115 : 스위치 125 : 감산기
130 : 변환부 140 : 양자화부
150 : 엔트로피 부호화부 160 : 역양자화부
170 : 역변환부 180 : 필터부
*100: video encoding device 111: motion prediction unit
112: motion compensation unit 120: intra prediction unit
115: switch 125: subtractor
130: transform unit 140: quantization unit
150: entropy encoding unit 160: inverse quantization unit
170: inverse transform unit 180: filter unit

Claims

In the video decoding method supporting a plurality of layers, the method is performed by an image decoding apparatus,
generating a reference picture list of a current slice of the current layer including an inter-layer reference picture of a reference layer of the current layer;
generating a prediction block of a current block of the current slice by referring to at least one of a plurality of reference pictures included in the reference picture list; and
generating a reconstructed block of the current block of the current slice by using the prediction block,
The inter-layer reference picture is marked only as a long-term reference picture, and is added to the reference picture list for inter-layer prediction,
The reference picture list of the current block includes an inter-layer reference picture at an arbitrary position of the reference picture list according to reference picture list information indicating the position of the inter-layer reference picture.

The method of claim 1,
The step of generating a prediction block of the current block comprises:
and generating the prediction block of the current block by using motion information derived using motion information of at least one of neighboring blocks and collocated blocks of the current block.

3. The method of claim 2,
The call block is
The video decoding method, characterized in that the block in the reference picture included in the reference picture list.

3. The method of claim 2,
The call block is
The image decoding method according to claim 1, wherein the external block is located outside a block existing at a position spatially corresponding to the current block in the reference picture included in the reference picture list.

In the video encoding method supporting a plurality of layers, the method is performed by a video encoding apparatus,
generating a reference picture list of a current slice of the current layer including an inter-layer reference picture of a reference layer of the current layer; and
Encoding information of the reference picture list,
The inter-layer reference picture is displayed only as a long-term reference picture, and is added to the reference picture list for inter-layer prediction,
The reference picture list of the current block includes an inter-layer reference picture at an arbitrary position in the reference picture list according to reference picture list information indicating the position of the inter-layer reference picture.

A computer-readable recording medium storing a bitstream generated by a video encoding method, the video encoding method comprising:
generating a reference picture list of a current slice of the current layer including an inter-layer reference picture of a reference layer of the current layer; and
Encoding information of the reference picture list,
The inter-layer reference picture is displayed only as a long-term reference picture, and is added to the reference picture list for inter-layer prediction,
The reference picture list of the current block includes an inter-layer reference picture at an arbitrary position in the reference picture list according to reference picture list information indicating the position of the inter-layer reference picture.