KR102271878B1

KR102271878B1 - Video encoding and decoding method and apparatus using the same

Info

Publication number: KR102271878B1
Application number: KR1020130162757A
Authority: KR
Inventors: 강정원; 이하현; 이진호; 최진수; 김진웅
Original assignee: 한국전자통신연구원
Priority date: 2012-12-26
Filing date: 2013-12-24
Publication date: 2021-07-02
Also published as: KR20140088015A; KR20140087972A

Abstract

본 발명은 적어도 한 개 이상의 계층(layer) (화질(quality), 공간(spatial), 시점(view))을 포함하는 구조의 영상 부호화 및 복호화에 관한 기술로, 상위계층을 부호화, 복호화 함에 있어서 한 개 이상의 참조 계층을 이용하여 상위 계층 신호를 예측하는 방법에 관한 기술이다. 보다 상세하게는 상위 계층의 부호화, 복호화 영상을 부, 복호화 함에 있어서 대상 계층과 동일 시점내에서 참조할 공간 및 화질 계층으로 구성된 공간 화질 참조 계층 리스트와 대상 계층과 동일한 공간 및 화질 계층들로 이루어진 시점 참조 계층 리스트를 분리하여 각 계층의 특성을 고려하여 계층간 예측을 수행할 수 있도록 하며, 이로 인해 부호화 및 복호화 효율을 향상시킬 수 있다.The present invention relates to a technique for encoding and decoding an image having a structure including at least one layer (quality, spatial, and view), and is a technique for encoding and decoding an upper layer. A technique for predicting a higher layer signal using two or more reference layers. In more detail, a spatial image quality reference layer list composed of spatial and image quality layers to be referenced within the same view as the target layer and a view composed of the same spatial and image quality layers as the target layer in encoding and decoding the encoding and decoding images of the higher layer By separating the reference layer list, inter-layer prediction can be performed in consideration of the characteristics of each layer, thereby improving encoding and decoding efficiency.

Description

Video encoding/decoding method and apparatus using the same

본 발명은 영상의 부호화 및 복호화 처리에 관한 것으로서, 보다 상세하게는 계층적 비디오 부호화에서 다중 참조 계층을 적용한 계층간 영상 부/복호화 방법 및 그 장치에 관한 것이다.The present invention relates to image encoding and decoding processing, and more particularly, to an inter-layer image encoding/decoding method and apparatus to which multiple reference layers are applied in hierarchical video encoding.

최근 HD(High Definition) 해상도를 가지는 방송 서비스가 국내뿐만 아니라 세계적으로 확대되면서, 많은 사용자들이 고해상도, 고화질의 영상에 익숙해지고 있으며 이에 따라 많은 기관들이 차세대 영상기기에 대한 개발에 박차를 가하고 있다. 또한 HDTV와 더불어 HDTV의 4배 이상의 해상도를 갖는 UHD(Ultra High Definition)에 대한 관심이 증대되면서 보다 높은 해상도, 고화질의 영상에 대한 압축기술이 요구되고 있다.Recently, as broadcasting services having high definition (HD) resolution are expanding not only in Korea but also worldwide, many users are getting used to high-resolution and high-definition images, and accordingly, many organizations are spurring the development of next-generation imaging devices. In addition, as interest in UHD (Ultra High Definition) having a resolution four times higher than that of HDTV increases along with HDTV, compression technology for higher resolution and high-definition images is required.

영상 압축을 위해, 시간적으로 이전 및/또는 이후의 픽쳐로부터 현재 픽쳐에 포함된 화소값을 예측하는 인터(inter) 예측 기술, 현재 픽쳐 내의 화소 정보를 이용하여 현재 픽쳐에 포함된 화소값을 예측하는 인트라(intra) 예측 기술, 출현 빈도가 높은 심볼(symbol)에 짧은 부호를 할당하고 출현 빈도가 낮은 심볼에 긴 부호를 할당하는 엔트로피 부호화 기술 등이 사용될 수 있다.For image compression, inter prediction technology that predicts pixel values included in the current picture from temporally previous and/or later pictures, predicting pixel values included in the current picture using pixel information in the current picture An intra prediction technique, an entropy encoding technique in which a short code is assigned to a symbol having a high frequency of occurrence and a long code is assigned to a symbol having a low frequency of occurrence, may be used.

영상 압축 기술에는 유동적인 네트워크 환경을 고려하지 않고 하드웨어의 제한적인 동작 환경하에서 일정한 네트워크 대역폭을 제공하는 기술이 있다. 그러나 수시로 대역폭이 변화하는 네트워크 환경에 적용되는 영상 데이터를 압축하기 위해서는 새로운 압축 기술이 요구되고, 이를 위해 스케일러블(scalable) 비디오 부호화/복호화 방법이 사용될 수 있다. In video compression technology, there is a technology that provides a constant network bandwidth under a limited operating environment of hardware without considering a flexible network environment. However, a new compression technique is required to compress image data applied to a network environment in which bandwidth is frequently changed, and for this purpose, a scalable video encoding/decoding method may be used.

본 발명은 상위 계층의 부호화, 복호화 영상을 부, 복호화 함에 있어서 대상계층과 동일시점내에서 참조할 공간 및 화질 계층으로 구성된 공간화질 참조계층 리스트와 대상계층과 동일한 공간 및 화질 계층들로 이루어진 시점 참조계층 리스트를 분리하여 각 계층의 특성을 고려하여 계층간 예측을 수행할 수 있는 영상의 부호화 및 복호화 방법 및 이를 이용하는 장치를 제공한다. The present invention provides a spatial image quality reference layer list composed of spatial and image quality layers to be referenced within the same time point as the target layer in encoding and decoding the encoding and decoding images of the upper layer, and the view point composed of the same spatial and image quality layers as the target layer. Provided are a method for encoding and decoding an image capable of performing inter-layer prediction in consideration of characteristics of each layer by separating a layer list, and an apparatus using the same.

이로써, 부호화 효율을 향상시킬 수 있다.Thereby, the encoding efficiency can be improved.

본 발명의 일 실시예인 복수의 계층을 지원하는 영상의 복호화 방법은, 현재 복호화 대상이 되는 대상 계층의 영상이 참조할 수 있는 참조 계층 리스트를 생성하는 단계와; 상기 대상 계층의 영상의 화면 간 예측을 위하여 시점(view) 참조 계층의 복호화 영상을 포함하는 참조픽쳐 리스트를 생성하는 단계와; 상기 참조 픽쳐 리스트를 참조하여 상기 대상 계층의 영상을 블록 단위로 예측 및 복호화 하는 단계를 포함할 수 있다.According to an embodiment of the present invention, a method for decoding an image supporting a plurality of layers includes: generating a reference layer list that can be referenced by an image of a target layer that is a current decoding target; generating a reference picture list including a decoded image of a view reference layer for inter prediction of the image of the target layer; The method may include predicting and decoding the image of the target layer in block units with reference to the reference picture list.

상기 참조 계층 리스트를 생성하는 단계는, 전제 비트스트림에서 상기 대상 계층과 동일한 계층이 참조할 수 있는 공간 화질 참조 계층 리스트 및 상기 시점 참조 리스트를 생성하는 단계를 포함할 수 있다.The generating of the reference layer list may include generating a spatial quality reference layer list and the view reference list that can be referenced by the same layer as the target layer in the entire bitstream.

상기 공간 화질 참조 계층 리스트는 상기 대상 계층과 동일한 시점을 갖는 계층으로 구성될 수 있다.The spatial quality reference layer list may be composed of layers having the same viewpoint as the target layer.

한편 상기 시점 참조 계층 리스트는 상기 대상 계층과 동일한 공간 및 화질을 갖는 계층으로 구성될 수 있다. Meanwhile, the view reference layer list may be composed of layers having the same space and image quality as the target layer.

상기 참조픽쳐 리스트를 생성하는 단계는, 상기 시점 참조 계층의 복호화 영상을 포함하는 제1 집합을 구성하는 단계와; 상기 대상 계층의 영상과 동일한 계층의 영상으로 이루어진 제2 집합을 구성하는 단계와; 상기 제1 집합 및 상기 제2 집합을 조합하는 단계를 포함할 수 있다.The generating of the reference picture list may include: constructing a first set including the decoded image of the view reference layer; constructing a second set comprising an image of the same layer as the image of the target layer; It may include combining the first set and the second set.

상기 제1 집합은 장기 참조 픽쳐로 표시될 수 있다. The first set may be represented as a long-term reference picture.

상기 제1 집합에 포함되어 있는 영상들은 상기 참조픽쳐 리스트의 첫 번째, 두 번째 및 마지막 중 어느 하나에 추가될 수 있다. Images included in the first set may be added to any one of the first, second, and last of the reference picture list.

상기 영상을 블록 단위로 예측 및 복호화 하는 단계는 상기 공간 화질 참조 계층을 참조하며, 상기 영상을 블록 단위로 예측 및 복호화 하는 단계는, 상기 공간 화질 참조 계층 리스트에서 현재 복호화 대상 블록이 복호화시 사용하는 참조 계층을 결정하는 단계와; 상기 결정된 공간 화질 참조 계층에서 상기 대상 블록에 대응되는 참조 블록을 결정하는 단계와; 상기 참조 블록의 복원샘플, 참조 블록의 레지듀얼, 참조 블록의 부호화 파라미터 중 적어도 하나를 이용하여 대상 블록을 복호화하는 단계를 포함할 수 있다.The step of predicting and decoding the image in units of blocks refers to the spatial quality reference layer, and the step of predicting and decoding the image in units of blocks refers to the step of the current decoding object block in the spatial image quality reference layer list used when decoding. determining a reference layer; determining a reference block corresponding to the target block in the determined spatial quality reference layer; The method may include decoding the target block by using at least one of a reconstructed sample of the reference block, a residual of the reference block, and an encoding parameter of the reference block.

상기 영상을 블록 단위로 예측 및 복호화 하는 단계는, 상기 참조 픽쳐 리스트 내의 참조 픽쳐를 이용하여 현재 복호화 대상 블록에 대한 화면 간 예측을 수행할 수 있다.The predicting and decoding of the image in block units may include performing inter prediction on the current decoding object block by using a reference picture in the reference picture list.

본 발명의 다른 실시예에 따른 복수의 계층을 지원하는 영상의 복호화 장치는 비스트스림을 통하여 수신된 영상의 예측 및 복호화를 위한 정보를 복호화하는 엔트로피 복호화부와; 현재 복호화 대상이 되는 대상 계층의 영상이 참조할 수 있는 참조 계층 리스트를 생성하고, 상기 대상 계층의 영상의 화면 간 예측을 위하여 시점(view) 참조 계층의 복호화 영상을 포함하는 참조픽쳐 리스트를 생성하고, 상기 참조 픽쳐 리스트를 참조하여 상기 대상 계층의 영상을 블록 단위로 예측 및 복호화 하는 예측부를 포함할 수 있다.An apparatus for decoding an image supporting a plurality of layers according to another embodiment of the present invention includes: an entropy decoding unit for decoding information for prediction and decoding of an image received through a non-stream; generating a reference layer list that can be referenced by an image of a target layer currently being decoded, and generating a reference picture list including a decoded image of a view reference layer for inter-screen prediction of an image of the target layer, and , a prediction unit that predicts and decodes the image of the target layer in block units with reference to the reference picture list.

본 발명의 일 실시예에 따르면, 상위 계층의 부호화, 복호화 영상을 부, 복호화 함에 있어서 대상계층과 동일시점내에서 참조할 공간 및 화질 계층으로 구성된 공간화질 참조계층 리스트와 대상계층과 동일한 공간 및 화질 계층들로 이루어진 시점 참조계층 리스트를 분리하여 각 계층의 특성을 고려하여 계층간 예측을 수행할 수 있는 영상의 부호화 및 복호화 방법 및 이를 이용하는 장치가 제공된다. According to an embodiment of the present invention, a list of spatial quality reference layers composed of spatial and picture quality layers to be referenced within the same time point as the target layer and the same spatial and picture quality as the target layer in encoding and decoding the encoded and decoded image of the higher layer A method for encoding and decoding an image capable of performing inter-layer prediction in consideration of characteristics of each layer by separating a view reference layer list consisting of layers, and an apparatus using the same are provided.

이로써, 영상의 부, 복호화 효율을 향상시킬 수 있는 영상의 부호화/복호화 방법 및 장치가 제공된다.Accordingly, an image encoding/decoding method and apparatus capable of improving image encoding/decoding efficiency are provided.

도 1은 영상 부호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.
도 2는 영상 복호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다.
도 3은 본 발명이 적용될 수 있는, 복수 계층을 이용한 스케일러블 비디오 코딩 구조의 일 실시예를 개략적으로 나타내는 개념도이다.
도 4는 본 발명의 일 실시예에 따라 공간 화질 계층 및 시점 계층을 개략적으로 나타내는 개념도이다
도 5는 본 발명의 일 실시예에 따른 부호화 장치에서 상위 계층의 부호화를 수행하는 방법을 설명하기 위한 제어흐름도이다.
도 6은 본 발명의 일 실시예예 따라 부호화 장치에서 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성하는 방법을 설명하기 위한 제어 흐름도이다.
도 7은 본 발명의 일 실시예에 따른 복호화 장치에서 상위 계층의 복호화를 수행하는 방법을 설명하기 위한 제어흐름도이다.
도 8은 본 발명의 일 실시예예 따라 복호화 장치에서 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성하는 방법을 설명하기 위한 제어 흐름도이다.1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment.
2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment.
3 is a conceptual diagram schematically illustrating an embodiment of a scalable video coding structure using a plurality of layers to which the present invention can be applied.
4 is a conceptual diagram schematically illustrating a spatial image quality layer and a view layer according to an embodiment of the present invention.
5 is a control flow diagram illustrating a method of performing encoding of a higher layer in an encoding apparatus according to an embodiment of the present invention.
6 is a control flowchart illustrating a method of constructing a spatial quality reference layer list and a view reference layer list in an encoding apparatus according to an embodiment of the present invention.
7 is a control flowchart illustrating a method of performing upper layer decoding in the decoding apparatus according to an embodiment of the present invention.
8 is a control flowchart illustrating a method of constructing a spatial quality reference layer list and a view reference layer list in a decoding apparatus according to an embodiment of the present invention.

이하, 도면을 참조하여 본 발명의 실시 형태에 대하여 구체적으로 설명한다. 본 명세서의 실시예를 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 명세서의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.EMBODIMENT OF THE INVENTION Hereinafter, embodiment of this invention is described concretely with reference to drawings. In describing the embodiments of the present specification, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present specification, the detailed description thereof will be omitted.

어떤 구성 요소가 다른 구성 요소에 “연결되어” 있다거나 “접속되어” 있다고 언급된 때에는, 그 다른 구성 요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있으나, 중간에 다른 구성 요소가 존재할 수도 있다고 이해되어야 할 것이다. 아울러, 본 발명에서 특정 구성을 “포함”한다고 기술하는 내용은 해당 구성 이외의 구성을 배제하는 것이 아니며, 추가적인 구성이 본 발명의 실시 또는 본 발명의 기술적 사상의 범위에 포함될 수 있음을 의미한다. When it is said that a component is “connected” or “connected” to another component, it may be directly connected or connected to the other component, but it is understood that other components may exist in between. it should be In addition, the content described as “including” a specific configuration in the present invention does not exclude configurations other than the corresponding configuration, and means that additional configurations may be included in the practice of the present invention or the scope of the technical spirit of the present invention.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다.Terms such as first, second, etc. may be used to describe various elements, but the elements should not be limited by the terms. The above terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, a first component may be referred to as a second component, and similarly, a second component may also be referred to as a first component.

또한 본 발명의 실시예에 나타나는 구성부들은 서로 다른 특징적인 기능들을 나타내기 위해 독립적으로 도시되는 것으로, 각 구성부들이 분리된 하드웨어나 하나의 소프트웨어 구성단위로 이루어짐을 의미하지 않는다. 즉, 각 구성부는 설명의 편의상 각각의 구성부로 나열하여 포함한 것으로 각 구성부 중 적어도 두 개의 구성부가 합쳐져 하나의 구성부로 이루어지거나, 하나의 구성부가 복수 개의 구성부로 나뉘어져 기능을 수행할 수 있고 이러한 각 구성부의 통합된 실시예 및 분리된 실시예도 본 발명의 본질에서 벗어나지 않는 한 본 발명의 권리범위에 포함된다.In addition, the components shown in the embodiment of the present invention are shown independently to represent different characteristic functions, and it does not mean that each component is composed of separate hardware or a single software component. That is, each component is listed as each component for convenience of description, and at least two components of each component are combined to form one component, or one component can be divided into a plurality of components to perform a function, and each of these components can be divided into a plurality of components. Integrated embodiments and separate embodiments of components are also included in the scope of the present invention without departing from the essence of the present invention.

또한, 일부의 구성 요소는 본 발명에서 본질적인 기능을 수행하는 필수적인 구성 요소는 아니고 단지 성능을 향상시키기 위한 선택적 구성 요소일 수 있다. 본 발명은 단지 성능 향상을 위해 사용되는 구성 요소를 제외한 본 발명의 본질을 구현하는데 필수적인 구성부만을 포함하여 구현될 수 있고, 단지 성능 향상을 위해 사용되는 선택적 구성 요소를 제외한 필수 구성 요소만을 포함한 구조도 본 발명의 권리범위에 포함된다.
In addition, some of the components are not essential components for performing essential functions in the present invention, but may be optional components for merely improving performance. The present invention can be implemented by including only essential components to implement the essence of the present invention, except for components used for performance improvement, and a structure including only essential components excluding optional components used for performance improvement Also included in the scope of the present invention.

도 1은 영상 부호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다. 스케일러블(scalable) 비디오 부호화/복호화 방법 또는 장치는 스케일러빌리티(scalability)를 제공하지 않는 일반적인 영상 부호화/복호화 방법 또는 장치의 확장(extension)에 의해 구현될 수 있으며, 도 1의 블록도는 스케일러블 비디오 부호화 장치의 기초가 될 수 있는 영상 부호화 장치의 일 실시예를 나타낸다.1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment. A scalable video encoding/decoding method or apparatus may be implemented by an extension of a general image encoding/decoding method or apparatus that does not provide scalability, and the block diagram of FIG. 1 is scalable. An embodiment of an image encoding apparatus that can be the basis of a video encoding apparatus is shown.

도 1을 참조하면, 상기 영상 부호화 장치(100)는 움직임 예측부(111), 움직임 보상부(112), 인트라 예측부(120), 스위치(115), 감산기(125), 변환부(130), 양자화부(140), 엔트로피 부호화부(150), 역양자화부(160), 역변환부(170), 가산기(175), 필터부(180) 및 참조영상 버퍼(190)를 포함한다.Referring to FIG. 1 , the image encoding apparatus 100 includes a motion prediction unit 111 , a motion compensation unit 112 , an intra prediction unit 120 , a switch 115 , a subtractor 125 , and a transform unit 130 . , a quantizer 140 , an entropy encoder 150 , an inverse quantizer 160 , an inverse transform unit 170 , an adder 175 , a filter unit 180 , and a reference image buffer 190 .

영상 부호화 장치(100)는 입력 영상에 대해 인트라(intra) 모드 또는 인터(inter) 모드로 부호화를 수행하고 비트스트림(bit stream)을 출력할 수 있다. 인트라 예측은 화면 내 예측, 인터 예측은 화면 간 예측을 의미한다. 인트라 모드인 경우 스위치(115)가 인트라로 전환되고, 인터 모드인 경우 스위치(115)가 인터로 전환된다. 영상 부호화 장치(100)는 입력 영상의 입력 블록에 대한 예측 블록을 생성한 후, 입력 블록과 예측 블록의 차분을 부호화할 수 있다.The image encoding apparatus 100 may perform encoding on an input image in an intra mode or an inter mode and output a bit stream. Intra prediction refers to intra prediction, and inter prediction refers to inter prediction. In the intra mode, the switch 115 is switched to intra, and in the inter mode, the switch 115 is switched to inter. The image encoding apparatus 100 may generate a prediction block for an input block of an input image, and then encode a difference between the input block and the prediction block.

인트라 모드인 경우, 인트라 예측부(120)는 현재 블록 주변의 이미 부호화된 블록의 화소값을 이용하여 공간적 예측을 수행하여 예측 블록을 생성할 수 있다.In the intra mode, the intra prediction unit 120 may generate a prediction block by performing spatial prediction using pixel values of an already coded block around the current block.

인터 모드인 경우, 움직임 예측부(111)는, 움직임 예측 과정에서 참조 영상 버퍼(190)에 저장되어 있는 참조 영상에서 입력 블록과 가장 매치가 잘 되는 영역을 찾아 움직임 벡터를 구할 수 있다. 움직임 보상부(112)는 움직임 벡터와 참조 영상 버퍼(190)에 저장되어 있는 참조 영상을 이용하여 움직임 보상을 수행함으로써 예측 블록을 생성할 수 있다. In the inter mode, the motion predictor 111 may obtain a motion vector by finding a region that best matches the input block in the reference image stored in the reference image buffer 190 during the motion prediction process. The motion compensator 112 may generate a prediction block by performing motion compensation using a motion vector and a reference image stored in the reference image buffer 190 .

감산기(125)는 입력 블록과 생성된 예측 블록의 차분에 의해 잔여 블록(residual block)을 생성할 수 있다. 변환부(130)는 잔여 블록에 대해 변환(transform)을 수행하여 변환 계수(transform coefficient)를 출력할 수 있다. 그리고 양자화부(140)는 입력된 변환 계수를 양자화 파라미터에 따라 양자화하여 양자화된 계수(quantized coefficient)를 출력할 수 있다. The subtractor 125 may generate a residual block by the difference between the input block and the generated prediction block. The transform unit 130 may perform transform on the residual block to output a transform coefficient. In addition, the quantization unit 140 may quantize the input transform coefficient according to the quantization parameter to output a quantized coefficient.

엔트로피 부호화부(150)는, 양자화부(140)에서 산출된 값들 또는 부호화 과정에서 산출된 부호화 파라미터 값 등을 기초로, 심볼(symbol)을 확률 분포에 따라 엔트로피 부호화하여 비트스트림(bit stream)을 출력할 수 있다. 엔트로피 부호화 방법은 다양한 값을 갖는 심볼을 입력 받아, 통계적 중복성을 제거하면서, 복호 가능한 2진수의 열로 표현하는 방법이다. The entropy encoding unit 150 entropy-encodes a symbol according to a probability distribution based on the values calculated by the quantization unit 140 or an encoding parameter value calculated during the encoding process to generate a bit stream. can be printed out. The entropy encoding method is a method of receiving symbols having various values, removing statistical redundancy, and expressing them as a decodable binary number sequence.

여기서, 심볼이란 부호화/복호화 대상 구문 요소(syntax element) 및 부호화 파라미터(coding parameter), 잔여 신호(residual signal)의 값 등을 의미한다. 부호화 파라미터는 부호화 및 복호화에 필요한 매개변수로서, 구문 요소와 같이 부호화 장치에서 부호화되어 복호화 장치로 전달되는 정보뿐만 아니라, 부호화 혹은 복호화 과정에서 유추될 수 있는 정보를 포함할 수 있으며 영상을 부호화하거나 복호화할 때 필요한 정보를 의미한다. 부호화 파라미터는 예를 들어 인트라/인터 예측모드, 이동/움직임 벡터, 참조 영상 색인, 부호화 블록 패턴, 잔여 신호 유무, 변환 계수, 양자화된 변환 계수, 양자화 파라미터, 블록 크기, 블록 분할 정보 등의 값 또는 통계를 포함할 수 있다. 또한 잔여 신호는 원신호와 예측 신호의 차이를 의미할 수 있고, 또한 원신호와 예측 신호의 차이가 변환(transform)된 형태의 신호 또는 원신호와 예측 신호의 차이가 변환되고 양자화된 형태의 신호를 의미할 수도 있다. 잔여 신호는 블록 단위에서는 잔여 블록이라 할 수 있다.Here, the symbol means a syntax element to be encoded/decoded, a coding parameter, and a value of a residual signal. Encoding parameters are parameters necessary for encoding and decoding, and may include information that is encoded in an encoding device and transmitted to a decoding device, such as syntax elements, as well as information that can be inferred during encoding or decoding. information needed to do so. Coding parameters include, for example, intra/inter prediction modes, motion/motion vectors, reference image indexes, coding block patterns, residual signal presence, transform coefficients, quantized transform coefficients, quantization parameters, block size, block partition information, etc. It may include statistics. In addition, the residual signal may mean a difference between the original signal and the predicted signal, and a signal in which the difference between the original signal and the predicted signal is transformed or a signal in which the difference between the original signal and the predicted signal is transformed and quantized. may mean The residual signal may be referred to as a residual block in block units.

엔트로피 부호화가 적용되는 경우, 높은 발생 확률을 갖는 심볼에 적은 수의 비트가 할당되고 낮은 발생 확률을 갖는 심볼에 많은 수의 비트가 할당되어 심볼이 표현됨으로써, 부호화 대상 심볼들에 대한 비트열의 크기가 감소될 수 있다. 따라서 엔트로피 부호화를 통해서 영상 부호화의 압축 성능이 높아질 수 있다. When entropy encoding is applied, a small number of bits are allocated to a symbol having a high probability of occurrence and a large number of bits are allocated to a symbol having a low probability of occurrence to express the symbol, so that the size of the bit string for the symbols to be encoded is increased. can be reduced. Accordingly, compression performance of image encoding may be improved through entropy encoding.

엔트로피 부호화를 위해 지수 골룸(exponential golomb), CAVLC(Context-Adaptive Variable Length Coding), CABAC(Context-Adaptive Binary Arithmetic Coding)과 같은 부호화 방법이 사용될 수 있다. 예를 들어, 엔트로피 부호화부(150)에는 가변 길이 부호화(VLC: Variable Lenghth Coding/Code) 테이블과 같은 엔트로피 부호화를 수행하기 위한 테이블이 저장될 수 있고, 엔트로피 부호화부(150)는 저장된 가변 길이 부호화(VLC) 테이블을 사용하여 엔트로피 부호화를 수행할 수 있다. 또한 엔트로피 부호화부(150)는 대상 심볼의 이진화(binarization) 방법 및 대상 심볼/빈(bin)의 확률 모델(probability model)을 도출한 후, 도출된 이진화 방법 또는 확률 모델을 사용하여 엔트로피 부호화를 수행할 수도 있다.For entropy encoding, an encoding method such as exponential golomb, Context-Adaptive Variable Length Coding (CAVLC), or Context-Adaptive Binary Arithmetic Coding (CABAC) may be used. For example, the entropy encoding unit 150 may store a table for performing entropy encoding, such as a variable length coding (VLC) table, and the entropy encoding unit 150 may store the stored variable length encoding. Entropy encoding can be performed using a (VLC) table. In addition, the entropy encoder 150 derives a binarization method of a target symbol and a probability model of a target symbol/bin, and then performs entropy encoding using the derived binarization method or probability model. You may.

양자화된 계수는 역양자화부(160)에서 역양자화되고 역변환부(170)에서 역변환될 수 있다. 역양자화, 역변환된 계수는 가산기(175)를 통해 예측 블록과 더해지고 복원 블록이 생성될 수 있다. The quantized coefficient may be inverse quantized by the inverse quantizer 160 and inversely transformed by the inverse transform unit 170 . The inverse-quantized and inverse-transformed coefficients may be added to the prediction block through the adder 175 and a reconstructed block may be generated.

복원 블록은 필터부(180)를 거치고, 필터부(180)는 디블록킹 필터(deblocking filter), SAO(Sample Adaptive Offset), ALF(Adaptive Loop Filter) 중 적어도 하나 이상을 복원 블록 또는 복원 픽쳐에 적용할 수 있다. 필터부(180)를 거친 복원 블록은 참조 영상 버퍼(190)에 저장될 수 있다.
The reconstructed block passes through the filter unit 180, and the filter unit 180 applies at least one of a deblocking filter, a sample adaptive offset (SAO), and an adaptive loop filter (ALF) to the reconstructed block or the reconstructed picture. can do. The reconstructed block passing through the filter unit 180 may be stored in the reference image buffer 190 .

도 2는 영상 복호화 장치의 일 실시예에 따른 구성을 나타내는 블록도이다. 도 1에서 상술한 바와 같이 스케일러블 비디오 부호화/복호화 방법 또는 장치는 스케일러빌리티를 제공하지 않는 일반적인 영상 부호화/복호화 방법 또는 장치의 확장에 의해 구현될 수 있으며, 도 2의 블록도는 스케일러블 비디오 복호화 장치의 기초가 될 수 있는 영상 복호화 장치의 일 실시예를 나타낸다.2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment. As described above with reference to FIG. 1 , the scalable video encoding/decoding method or apparatus may be implemented by extension of a general image encoding/decoding method or apparatus that does not provide scalability, and the block diagram of FIG. 2 shows scalable video decoding An embodiment of an image decoding apparatus that can be the basis of the apparatus is shown.

도 2를 참조하면, 상기 영상 복호화 장치(200)는 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 움직임 보상부(250), 필터부(260) 및 참조 영상 버퍼(270)를 포함한다.Referring to FIG. 2 , the image decoding apparatus 200 includes an entropy decoding unit 210 , an inverse quantization unit 220 , an inverse transform unit 230 , an intra prediction unit 240 , a motion compensator 250 , and a filter unit. 260 and a reference image buffer 270 .

영상 복호화 장치(200)는 부호화 장치에서 출력된 비트스트림을 입력 받아 인트라 모드 또는 인터 모드로 복호화를 수행하고 재구성된 영상, 즉 복원 영상을 출력할 수 있다. 인트라 모드인 경우 스위치가 인트라로 전환되고, 인터 모드인 경우 스위치가 인터로 전환될 수 있다. 영상 복호화 장치(200)는 입력 받은 비트스트림으로부터 복원된 잔여 블록(residual block)을 얻고 예측 블록을 생성한 후 복원된 잔여 블록과 예측 블록을 더하여 재구성된 블록, 즉 복원 블록을 생성할 수 있다.The image decoding apparatus 200 may receive a bitstream output from the encoding apparatus, perform decoding in an intra mode or an inter mode, and output a reconstructed image, that is, a reconstructed image. In the case of the intra mode, the switch may be switched to the intra mode, and in the inter mode, the switch may be switched to the inter mode. The image decoding apparatus 200 may generate a reconstructed block, that is, a reconstructed block by obtaining a reconstructed residual block from the received bitstream, generating a prediction block, and adding the reconstructed residual block and the prediction block.

엔트로피 복호화부(210)는, 입력된 비트스트림을 확률 분포에 따라 엔트로피 복호화하여, 양자화된 계수(quantized coefficient) 형태의 심볼을 포함한 심볼들을 생성할 수 있다. 엔트로피 복호화 방법은 2진수의 열을 입력 받아 각 심볼들을 생성하는 방법이다. 엔트로피 복호화 방법은 상술한 엔트로피 부호화 방법과 유사하다.The entropy decoding unit 210 may entropy-decode the input bitstream according to a probability distribution to generate symbols including symbols in the form of quantized coefficients. The entropy decoding method is a method of generating each symbol by receiving a binary string as an input. The entropy decoding method is similar to the entropy encoding method described above.

양자화된 계수는 역양자화부(220)에서 역양자화되고 역변환부(230)에서 역변환되며, 양자화된 계수가 역양자화/역변환 된 결과, 복원된 잔여 블록(residual block)이 생성될 수 있다. The quantized coefficient is inverse quantized by the inverse quantization unit 220 and inverse transformed by the inverse transformation unit 230 , and as a result of inverse quantization/inverse transformation of the quantized coefficient, a reconstructed residual block may be generated.

인트라 모드인 경우, 인트라 예측부(240)는 현재 블록 주변의 이미 부호화된 블록의 화소값을 이용하여 공간적 예측을 수행하여 예측 블록을 생성할 수 있다. 인터 모드인 경우, 움직임 보상부(250)는 움직임 벡터 및 참조 영상 버퍼(270)에 저장되어 있는 참조 영상을 이용하여 움직임 보상을 수행함으로써 예측 블록을 생성할 수 있다. In the intra mode, the intra prediction unit 240 may generate a prediction block by performing spatial prediction using pixel values of an already encoded block around the current block. In the inter mode, the motion compensator 250 may generate a prediction block by performing motion compensation using a motion vector and a reference image stored in the reference image buffer 270 .

복원된 잔여 블록과 예측 블록은 가산기(255)를 통해 더해지고, 더해진 블록은 필터부(260)를 거친다. 필터부(260)는 디블록킹 필터, SAO, ALF 중 적어도 하나 이상을 복원 블록 또는 복원 픽쳐에 적용할 수 있다. 필터부(260)는 재구성된 영상, 즉 복원 영상을 출력한다. 복원 영상은 참조 영상 버퍼(270)에 저장되어 화면 간 예측에 사용될 수 있다.The reconstructed residual block and the prediction block are added through the adder 255 , and the added block passes through the filter unit 260 . The filter unit 260 may apply at least one of a deblocking filter, SAO, and ALF to a reconstructed block or a reconstructed picture. The filter unit 260 outputs a reconstructed image, that is, a reconstructed image. The restored image may be stored in the reference image buffer 270 and used for inter prediction.

상기 영상 복호화 장치(200)에 포함되어 있는 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 움직임 보상부(250), 필터부(260) 및 참조 영상 버퍼(270) 중 영상의 복호화에 직접적으로 관련된 구성요소들, 예컨대, 엔트로피 복호화부(210), 역양자화부(220), 역변환부(230), 인트라 예측부(240), 움직임 보상부(250), 필터부(260) 등을 다른 구성요소와 구분하여 복호화부 또는 디코딩부로 표현할 수 있다. The entropy decoding unit 210 , the inverse quantization unit 220 , the inverse transform unit 230 , the intra prediction unit 240 , the motion compensator 250 , and the filter unit 260 included in the image decoding apparatus 200 . and components directly related to image decoding among the reference image buffer 270 , for example, the entropy decoding unit 210 , the inverse quantization unit 220 , the inverse transform unit 230 , the intra prediction unit 240 , and motion compensation. The unit 250 and the filter unit 260 may be distinguished from other components and expressed as a decoder or a decoder.

또한, 영상 복호화 장치(200)는 비트스트림에 포함되어 있는 인코딩된 영상에 관련된 정보를 파싱하는 도시하지 않은 파싱부를 더 포함할 수 있다. 파싱부는 엔트로피 복호화부(210)를 포함할 수도 있고, 엔트로피 복호화부(210)에 포함될 수도 있다. 이러한 파싱부는 또한 디코딩부의 하나의 구성요소로 구현될 수도 있다.
Also, the image decoding apparatus 200 may further include a parsing unit (not shown) that parses information related to the encoded image included in the bitstream. The parsing unit may include the entropy decoding unit 210 or may be included in the entropy decoding unit 210 . This parsing unit may also be implemented as one component of the decoding unit.

도 3은 본 발명이 적용될 수 있는, 복수 계층을 이용한 스케일러블 비디오 코딩 구조의 일 실시예를 개략적으로 나타내는 개념도이다. 도 3에서 GOP(Group of Picture)는 픽쳐군 즉, 픽쳐의 그룹을 나타낸다.3 is a conceptual diagram schematically illustrating an embodiment of a scalable video coding structure using a plurality of layers to which the present invention can be applied. In FIG. 3 , a group of pictures (GOP) indicates a picture group, that is, a group of pictures.

영상 데이터를 전송하기 위해서는 전송 매체가 필요하며, 그 성능은 다양한 네트워크 환경에 따라 전송 매체별로 차이가 있다. 이러한 다양한 전송 매체 또는 네트워크 환경에의 적용을 위해 스케일러블 비디오 코딩 방법이 제공될 수 있다.In order to transmit image data, a transmission medium is required, and its performance is different for each transmission medium according to various network environments. A scalable video coding method may be provided for application to such various transmission media or network environments.

스케일러블 비디오 코딩 방법은 계층(layer) 간의 텍스쳐 정보, 움직임 정보, 잔여 신호 등을 활용하여 계층간 중복성을 제거하여 부호화/복호화 성능을 높이는 코딩 방법이다. 스케일러블 비디오 코딩 방법은, 전송 비트율, 전송 에러율, 시스템 자원 등의 주변 조건에 따라, 공간적, 시간적, 화질적 관점에서 다양한 스케일러빌리티를 제공할 수 있다.The scalable video coding method is a coding method that improves encoding/decoding performance by removing inter-layer redundancy by utilizing inter-layer texture information, motion information, residual signals, and the like. The scalable video coding method may provide various scalability in terms of spatial, temporal, and picture quality according to ambient conditions such as a transmission bit rate, a transmission error rate, and a system resource.

스케일러블 비디오 코딩은, 다양한 네트워크 상황에 적용 가능한 비트스트림을 제공할 수 있도록, 복수 계층(multiple layers) 구조를 사용하여 수행될 수 있다. 예를 들어 스케일러블 비디오 코딩 구조는, 일반적인 영상 부호화 방법을 이용하여 영상 데이터를 압축하여 처리하는 기본 계층을 포함할 수 있고, 기본 계층의 부호화 정보 및 일반적인 영상 부호화 방법을 함께 사용하여 영상 데이터를 압축 처리하는 향상 계층을 포함할 수 있다.Scalable video coding may be performed using a structure of multiple layers so as to provide a bitstream applicable to various network situations. For example, the scalable video coding structure may include a base layer that compresses and processes image data using a general image encoding method, and compresses the image data using both encoding information of the base layer and a general image encoding method. It may include an enhancement layer to process.

여기서, 계층(layer)은 공간(spatial, 예를 들어, 영상 크기), 시간(temporal, 예를 들어, 부호화 순서, 영상 출력 순서, 프레임 레이트), 화질, 복잡도 등을 기준으로 구분되는 영상 및 비트스트림(bitstream)의 집합을 의미한다. 또한 기본 계층은 하위 계층, 참조 계층 또는 Base layer, 향상 계층은 상위 계층, Enhancement layer를 의미할 수 있다. 또한 복수의 계층들은 서로 간에 종속성을 가질 수도 있다.Here, a layer is an image and bit classified based on spatial (eg, image size), temporal (eg, encoding order, image output order, frame rate), image quality, complexity, etc. It means a set of streams (bitstream). In addition, the base layer may mean a lower layer, a reference layer or a base layer, and the enhancement layer may mean an upper layer, and an enhancement layer. Also, a plurality of layers may have dependencies on each other.

도 3을 참조하면, 예를 들어 기본 계층은 SD(standard definition), 15Hz의 프레임율, 1Mbps 비트율로 정의될 수 있고, 제1 향상 계층은 HD(high definition), 30Hz의 프레임율, 3.9Mbps 비트율로 정의될 수 있으며, 제2 향상 계층은 4K-UHE(ultra high definition), 60Hz의 프레임율, 27.2Mbps 비트율로 정의될 수 있다. 상기 포맷(format), 프레임율, 비트율 등은 하나의 실시예로서, 필요에 따라 달리 정해질 수 있다. 또한 사용되는 계층의 수도 본 실시예에 한정되지 않고 상황에 따라 달리 정해질 수 있다. Referring to FIG. 3 , for example, the base layer may be defined as SD (standard definition), a frame rate of 15 Hz, and a bit rate of 1 Mbps, and the first enhancement layer is HD (high definition), a frame rate of 30 Hz, and a bit rate of 3.9 Mbps. may be defined, and the second enhancement layer may be defined as 4K-UHE (ultra high definition), a frame rate of 60 Hz, and a bit rate of 27.2 Mbps. The format, frame rate, bit rate, etc. are an example and may be differently determined as needed. Also, the number of layers to be used is not limited to this embodiment and may be determined differently depending on circumstances.

예를 들어, 전송 대역폭이 4Mbps라면 상기 제1향상계층 HD의 프레임 레이트를 줄여서 15Hz이하로 전송할 수 있다. 스케일러블 비디오 코딩 방법은 상기 도 3의 실시예에서 상술한 방법에 의해 시간적, 공간적, 화질적 스케일러빌리티를 제공할 수 있다.
For example, if the transmission bandwidth is 4 Mbps, the frame rate of the first enhancement layer HD may be reduced to transmit at 15 Hz or less. The scalable video coding method may provide temporal, spatial, and picture quality scalability by the method described above in the embodiment of FIG. 3 .

비트스트림 내 복수의 계층을 지원하는 비디오의 부호화 및 복호화, 즉 스케일러블 코딩(scalable coding)의 경우, 복수의 계층간에는 강한 연관성(correlation)이 존재하기 때문에 이런 연관성을 이용하여 예측을 수행하면 데이터의 중복 요소를 제거할 수 있고 영상의 부호화 성능을 향상시킬 수 있다. 다른 계층의 정보를 이용하여 예측의 대상이 되는 현재 레이어의 예측을 수행하는 것을 이하에서는 계층간 예측(inter-layer prediction)이라고 표현한다. 스케일러블 비디오 코딩은 이하 부호화 관점에서는 스케일러블 비디오 부호화, 복호화 관점에서는 스케일러블 비디오 복호화와 동일한 의미를 가진다. In the case of encoding and decoding of video supporting a plurality of layers in a bitstream, that is, scalable coding, there is a strong correlation between a plurality of layers. Duplicate elements can be removed and the encoding performance of an image can be improved. Prediction of a current layer, which is a prediction target, by using information from another layer is hereinafter referred to as inter-layer prediction. Scalable video coding has the same meaning as scalable video coding in terms of encoding and scalable video decoding in terms of decoding.

복수의 계층들은 해상도, 프레임 레이트, 컬러 포맷 중 적어도 하나가 서로 다를 수 있으며, 계층간 예측 시 해상도의 조절을 위하여 레이어의 업샘플링 또는 다운샘플링이 수행될 수 있다.
At least one of a resolution, a frame rate, and a color format of the plurality of layers may be different from each other, and layer upsampling or downsampling may be performed to adjust resolution during inter-layer prediction.

도 4는 본 발명의 일 실시예에 따라 공간 화질 계층 및 시점 계층을 개략적으로 나타내는 개념도이다. 4 is a conceptual diagram schematically illustrating a spatial image quality layer and a view layer according to an embodiment of the present invention.

도시된 바와 같이, 비트스트림은 복수의 계층을 포함할 수 있다. As shown, the bitstream may include a plurality of layers.

비트스트림은 공간(spatial) 및 화질(quality)은 동일하지만 상이한 시점(view)에 대한 복수의 시점 계층(시점 1, 시점 2, 시점 3)을 포함할 수 있다.The bitstream may include a plurality of view layers (view 1, view 2, and 3) for different views although spatial and quality are the same.

또한, 비트스트림은 시점은 동일하지만 공간 및 화질이 상이한 계층으로 구성될 수도 있다. 공간 화질 계층은 SD 급 계층과 HD 급 계층으로 분류될 수 있으며, SD 급 계층과 HD 급 계층은 또 다시 기본 계층과 향상 계층으로 구성될 수 있다. In addition, the bitstream may be composed of layers having the same view but different spatial and image quality. The spatial image quality layer may be classified into an SD class layer and an HD class layer, and the SD class layer and the HD class layer may be composed of a base layer and an enhancement layer again.

도시된 바와 같이 공간, 화질 및 시점이 혼재되어 있는 계층을 식별하기 위하여 각 계층은 식별자(layer_id)로 서로 구분할 수 있다. 각 식별자가 어떠한 계층(예컨대, 시점 계층, 공간 및 화질 계층)인지, 계층 내에서 상위 계층인지 하위 계층인지에 대한 정보는 VPS(video parameter set) 또는 SPS(sequence parameter set), NAL 유닛 헤더(nal unit header) 등에 포함되어 시그널링 될 수 있다. As shown, in order to identify a layer in which space, picture quality, and viewpoint are mixed, each layer may be distinguished from each other by an identifier (layer_id). Information on which layer (eg, view layer, spatial and picture quality layer) each identifier is, and whether it is an upper layer or a lower layer within the layer is a video parameter set (VPS) or a sequence parameter set (SPS), a NAL unit header (nal). unit header) may be included and signaled.

상술한 바와 같이, 계층간의 연관성을 이용하여 계층간 예측을 수행하는 경우, 적어도 하나 이상의 하위 계층을 참조하여 상위 계층을 예측하게 된다. 이하, 설명의 편의를 위하여 예측이 수행되는 계층을 대상 계층이라고 명명하고, 대상 계층의 예측에 이용되는 또는 참조되는 계층을 참조 계층이라고 표현한다. As described above, when inter-layer prediction is performed using inter-layer correlation, the higher layer is predicted with reference to at least one lower layer. Hereinafter, for convenience of description, a layer on which prediction is performed is called a target layer, and a layer used or referenced for prediction of the target layer is expressed as a reference layer.

본 발명은 동일한 슬라이스 내의 블록들을 부호화하는 경우에 한 개 이상의 참조 계층을 이용하여 부호화함에 있어서 공간, 화질, 시점 스케일러빌리티의 부호화 효율을 고려한 효율적인 참조 계층 리스트 생성 및 관리를 위한 발명이다. The present invention is an invention for efficient reference layer list generation and management in consideration of encoding efficiency of spatial, picture quality, and view scalability in encoding using one or more reference layers when encoding blocks within the same slice.

이를 위하여 대상 계층과 동일 시점내에서 참조할 공간 화질 참조 계층 리스트와 대상 계층과 동일 공간 및 화질 계층을 가지는 시점 참조 계층 리스트를 분리 생성하여, 각 계층의 특성에 맞는 부호화 및 복호화 방식을 적용하도록 함으로써 부호화 효율을 향상시키는 것을 목적으로 한다.
To this end, the spatial quality reference layer list to be referenced within the same view as the target layer and the view reference layer list having the same spatial and quality layer as the target layer are separately generated, and encoding and decoding methods suitable for the characteristics of each layer are applied. It aims to improve encoding efficiency.

통상적으로 화면간 예측은 현재 픽쳐의 이전 픽쳐 또는 이후 픽쳐 중 적어도 하나를 참조 픽쳐로 하고, 참조 픽쳐를 기반으로 현재 블록에 대한 예측을 수행할 수 있다. 현재 블록의 예측에 이용되는 영상을 참조 픽쳐(reference picture) 또는 참조 프레임(reference frame)이라고 한다. In general, inter prediction may use at least one of a picture before or after a picture of the current picture as a reference picture, and prediction of the current block may be performed based on the reference picture. An image used for prediction of the current block is referred to as a reference picture or a reference frame.

참조 픽쳐 내의 영역은 참조 픽쳐를 지시하는 참조 픽쳐 인덱스(refIdx) 및 움직임 벡터(motion vector) 등을 이용하여 나타낼 수 있다. The region in the reference picture may be indicated using a reference picture index refIdx indicating the reference picture, a motion vector, and the like.

화면간 예측은 참조 픽쳐 및 참조 픽쳐 내에서 현재 블록에 대응하는 참조 블록을 선택해서, 현재 블록에 대한 예측 블록을 생성할 수 있다. Inter prediction may generate a prediction block for the current block by selecting a reference picture and a reference block corresponding to the current block in the reference picture.

화면 간 예측에서 부호화 장치 및 복호화 장치는 현재 블록의 움직임 정보를 도출한 후, 도출된 움직임 정보에 기반하여 화면 간 예측 및/또는 움직임 보상을 수행할 수 있다. 이 때, 부호화 장치 및 복호화 장치는 복원된 주변 블록(neighboring block) 및/또는 이미 복원된 콜(col) 픽쳐(collocated picture) 내에서 현재 블록에 대응되는 콜(col) 블록(collocated block)의 움직임 정보를 이용함으로써, 부호화/복호화 효율을 향상시킬 수 있다. In inter prediction, the encoding apparatus and the decoding apparatus may derive motion information of the current block and then perform inter prediction and/or motion compensation based on the derived motion information. In this case, the encoding apparatus and the decoding apparatus move a collocated block corresponding to the current block in a reconstructed neighboring block and/or an already reconstructed collocated picture. By using the information, encoding/decoding efficiency can be improved.

여기서, 복원된 주변 블록은 이미 부호화 및/또는 복호화되어 복원된 현재 픽쳐 내의 블록으로서, 현재 블록에 인접한 블록 및/또는 현재 블록의 외부 코너에 위치한 블록을 포함할 수 있다. 또한 부호화 장치 및 복호화 장치는, 콜 픽쳐 내에서 현재 블록과 공간적으로 대응되는 위치에 존재하는 블록을 기준으로 소정의 상대적인 위치를 결정할 수 있고, 상기 결정된 소정의 상대적인 위치(상기 현재 블록과 공간적으로 대응되는 위치에 존재하는 블록의 내부 및/또는 외부의 위치)를 기반으로 상기 콜 블록을 도출할 수 있다. 여기서, 일례로 콜 픽쳐는 참조 픽쳐 리스트에 포함된 참조 픽쳐 중에서 하나의 픽쳐에 해당될 수 있다.Here, the reconstructed neighboring block is a block in the reconstructed current picture that has already been encoded and/or decoded, and may include a block adjacent to the current block and/or a block located at an outer corner of the current block. In addition, the encoding apparatus and the decoding apparatus may determine a predetermined relative position based on a block existing at a position spatially corresponding to the current block in the collocated picture, and the determined relative position (corresponding spatially to the current block). It is possible to derive the collocated block based on the position inside and/or outside the block existing at the position where it is located. Here, as an example, the collocated picture may correspond to one of the reference pictures included in the reference picture list.

화면간 예측은 현재 블록과의 레지듀얼(residual) 신호가 최소화되며 움직임 벡터 크기 역시 최소가 되도록 예측 블록을 생성할 수 있다.In the inter prediction, a prediction block may be generated such that a residual signal with a current block is minimized and a motion vector size is also minimized.

한편, 움직임 정보 도출 방식은 현재 블록의 예측 모드에 따라 달라질 수 있다. 인터 예측을 위해 적용되는 예측 모드에는 AMVP(Advanced Motion Vector Predictor), 머지(merge) 등이 있을 수 있다.Meanwhile, the method of deriving motion information may vary depending on the prediction mode of the current block. Prediction modes applied for inter prediction may include Advanced Motion Vector Predictor (AMVP), merge, and the like.

일례로, AMVP(Advanced Motion Vector Predictor)가 적용되는 경우, 부호화 장치 및 복호화 장치는 복원된 주변 블록의 움직임 벡터 및/또는 콜 블록의 움직임 벡터를 이용하여, 예측 움직임 벡터 후보 리스트를 생성할 수 있다. 즉, 복원된 주변 블록의 움직임 벡터 및/또는 콜 블록의 움직임 벡터는 예측 움직임 벡터 후보로 사용될 수 있다. 부호화 장치는 상기 리스트에 포함된 예측 움직임 벡터 후보 중에서 선택된 최적의 예측 움직임 벡터를 지시하는 예측 움직임 벡터 인덱스를 복호화 장치로 전송할 수 있다. 이 때, 복호화 장치는 상기 예측 움직임 벡터 인덱스를 이용하여, 예측 움직임 벡터 후보 리스트에 포함된 예측 움직임 벡터 후보 중에서, 현재 블록의 예측 움직임 벡터를 선택할 수 있다.For example, when Advanced Motion Vector Predictor (AMVP) is applied, the encoding apparatus and the decoding apparatus may generate a predictive motion vector candidate list by using a motion vector of a reconstructed neighboring block and/or a motion vector of a collocated block. . That is, the motion vector of the reconstructed neighboring block and/or the motion vector of the collocated block may be used as a prediction motion vector candidate. The encoding apparatus may transmit a prediction motion vector index indicating an optimal prediction motion vector selected from among the prediction motion vector candidates included in the list to the decoding apparatus. In this case, the decoding apparatus may select the predicted motion vector of the current block from among the predicted motion vector candidates included in the predicted motion vector candidate list by using the predicted motion vector index.

부호화 장치는 현재 블록의 움직임 벡터와 예측 움직임 벡터 간의 움직임 벡터 차분(MVD: Motion Vector Difference)을 구할 수 있고, 이를 부호화하여 복호화 장치로 전송할 수 있다. 이 때, 복호화 장치는 수신된 움직임 벡터 차분을 복호화할 수 있고, 복호화된 움직임 벡터 차분과 예측 움직임 벡터의 합을 통해 현재 블록의 움직임 벡터를 도출할 수 있다.The encoding apparatus may obtain a motion vector difference (MVD) between the motion vector of the current block and the predicted motion vector, encode it, and transmit it to the decoding apparatus. In this case, the decoding apparatus may decode the received motion vector difference, and may derive the motion vector of the current block through the sum of the decoded motion vector difference and the predicted motion vector.

부호화 장치는 또한 참조 픽쳐를 지시하는 참조 픽쳐 인덱스 등을 복호화 장치에 전송할 수 있다. The encoding apparatus may also transmit a reference picture index indicating a reference picture to the decoding apparatus.

복호화 장치는 주변 블록의 움직임 정보들을 이용하여 현재 블록의 움직임 벡터를 예측하고, 부호화 장치로부터 수신한 움직임 벡터에 대한 차분값을 이용하여 현재 블록에 대한 움직임 벡터를 유도할 수 있다. 복호화 장치는 유도한 움직임 벡터와 부호화 장치로부터 수신한 참조 픽쳐 인덱스 정보를 기반으로 현재 블록에 대한 예측 블록을 생성할 수 있다.The decoding apparatus may predict the motion vector of the current block by using motion information of the neighboring block, and may derive the motion vector of the current block by using a difference value with respect to the motion vector received from the encoding apparatus. The decoding apparatus may generate a prediction block for the current block based on the derived motion vector and reference picture index information received from the encoding apparatus.

다른 예로, 머지(merge)가 적용되는 경우, 부호화 장치 및 복호화 장치는 복원된 주변 블록의 움직임 정보 및/또는 콜 블록의 움직임 정보를 이용하여, 머지 후보 리스트를 생성할 수 있다. 즉, 부호화 장치 및 복호화 장치는 복원된 주변 블록 및/또는 콜 블록의 움직임 정보가 존재하는 경우, 이를 현재 블록에 대한 머지 후보로 사용할 수 있다. As another example, when merge is applied, the encoding apparatus and the decoding apparatus may generate a merge candidate list by using motion information of a reconstructed neighboring block and/or motion information of a collocated block. That is, when motion information of the reconstructed neighboring block and/or collocated block exists, the encoding apparatus and the decoding apparatus may use it as a merge candidate for the current block.

부호화 장치는 머지 후보 리스트에 포함된 머지 후보 중에서 최적의 부호화 효율을 제공할 수 있는 머지 후보를 현재 블록에 대한 움직임 정보로 선택할 수 있다. 이 때, 상기 선택된 머지 후보를 지시하는 머지 인덱스가 비트스트림에 포함되어 복호화 장치로 전송될 수 있다. 복호화 장치는 상기 전송된 머지 인덱스를 이용하여, 머지 후보 리스트에 포함된 머지 후보 중에서 하나를 선택할 수 있으며, 상기 선택된 머지 후보를 현재 블록의 움직임 정보로 결정할 수 있다. 따라서, 머지 모드가 적용되는 경우, 복원된 주변 블록 및/또는 콜 블록의 움직임 정보가 현재 블록의 움직임 정보로 그대로 사용될 수 있다. 복호화 장치는 예측 블록과 부호화 장치로부터 전송되는 레 지듀얼을 더하여 현재 블록을 복원할 수 있다.The encoding apparatus may select a merge candidate capable of providing optimal encoding efficiency from among merge candidates included in the merge candidate list as motion information for the current block. In this case, the merge index indicating the selected merge candidate may be included in the bitstream and transmitted to the decoding apparatus. The decoding apparatus may select one of the merge candidates included in the merge candidate list by using the transmitted merge index, and may determine the selected merge candidate as motion information of the current block. Accordingly, when the merge mode is applied, the motion information of the reconstructed neighboring block and/or the collocated block may be used as the motion information of the current block as it is. The decoding apparatus may reconstruct the current block by adding the prediction block and the residual transmitted from the encoding apparatus.

상술한 AMVP 및 머지 모드에서는, 현재 블록의 움직임 정보를 도출하기 위해, 복원된 주변 블록의 움직임 정보 및/또는 콜 블록의 움직임 정보가 사용될 수 있다.In the aforementioned AMVP and merge modes, in order to derive the motion information of the current block, motion information of a reconstructed neighboring block and/or motion information of a collocated block may be used.

화면 간 예측에 이용되는 다른 모드 중 하나 인 스킵 모드의 경우에, 주변 블록의 정보를 그대로 현재 블록에 이용할 수 있다. 따라서 스킵 모드의 경우에, 부호화 장치는 현재 블록의 움직임 정보로서 어떤 블록의 움직임 정보를 이용할 것인지를 지시하는 정보 외에 레지듀얼 등과 같은 신택스 정보를 복호화 장치에 전송하지 않는다. In the case of the skip mode, which is one of the other modes used for inter prediction, information on neighboring blocks may be used for the current block as it is. Accordingly, in the skip mode, the encoding apparatus does not transmit syntax information such as residuals to the decoding apparatus other than information indicating which block to use as the motion information of the current block.

부호화 장치 및 복호화 장치는 상기 도출된 움직임 정보에 기반하여 현재 블록에 대한 움직임 보상을 수행함으로써, 현재 블록의 예측 블록을 생성할 수 있다. 여기서, 예측 블록은 현재 블록에 대한 움직임 보상 수행 결과 생성된, 움직임 보상된 블록을 의미할 수 있다. 또한, 복수의 움직임 보상된 블록은 하나의 움직임 보상된 영상을 구성할 수 있다.The encoding apparatus and the decoding apparatus may generate a prediction block of the current block by performing motion compensation on the current block based on the derived motion information. Here, the prediction block may mean a motion-compensated block generated as a result of performing motion compensation on the current block. Also, a plurality of motion-compensated blocks may constitute one motion-compensated image.

복호화 장치는 현재 블록의 인터 예측에 필요한 움직임 정보, 예컨대 움직임 벡터, 참조 픽쳐 인덱스 등에 관한 정보를 부호화 장치로부터 수신한 스킵 플래그, 머지 플래그 등을 확인하고 이에 대응하여 유도할 수 있다. The decoding apparatus may check a skip flag, a merge flag, etc. received from the encoding apparatus for motion information necessary for inter prediction of the current block, for example, a motion vector, a reference picture index, and the like, and derive it correspondingly.

예측이 수행되는 처리 단위와 예측 방법 및 구체적인 내용이 정해지는 처리 단위는 서로 다를 수 있다. 예컨대, 예측 블록 단위로 예측모드가 정해져서 변환 블록 단위로 예측이 수행될 수도 있고, 예측 블록 단위로 예측 모드가 정해지고 변환 블록 단위로 화면 내 예측이 수행될 수 도 있다.
A processing unit in which prediction is performed and a processing unit in which a prediction method and specific content are determined may be different from each other. For example, a prediction mode may be determined in units of prediction blocks and prediction may be performed in units of transform blocks, or prediction modes may be determined in units of prediction blocks and intra prediction may be performed in units of transform blocks.

도 5는 본 발명의 일 실시예에 따른 부호화 장치에서 상위 계층의 부호화를 수행하는 방법을 설명하기 위한 제어흐름도이다. 5 is a control flow diagram illustrating a method of performing encoding of a higher layer in an encoding apparatus according to an embodiment of the present invention.

이하에서는 도 5를 참조하여 적어도 한가지 이상의 스케일러빌리티 (예를 들어, 공간, 화질, 시점 스케일러빌리티)를 지원하며 다 계층 구조를 사용하는 비디오 부호화 방법에 있어서 상위 계층의 부호화를 수행하는 방법, 보다 구체적으로 대상 계층이 참조할 수 있는 참조 계층 리스트를 구성하는 방법에 대하여 살펴본다. Hereinafter, with reference to FIG. 5 , in a video encoding method that supports at least one scalability (eg, spatial, picture quality, and view scalability) and uses a multi-layered structure, a method of performing higher layer encoding, more specifically Let's look at how to construct a reference layer list that the target layer can refer to.

우선, 부호화 장치는 현재 부호화 대상 계층의 영상이 참조할 수 있는 계층의 리스트를 구성한다(S510).First, the encoding apparatus constructs a list of layers that an image of the current encoding target layer can refer to ( S510 ).

부호화 장치는 현재 부호화 대상 계층의 하위계층들 가운데 현재 부호화 대상 계층이 부호화시 동일 시점 내에서 참조할 수 있는 적어도 한 개 이상의 공간 혹은 화질 계층들을 포함하는 공간 화질 참조 계층 리스트를 구성하고, 동일 공간 및 화질을 가지는 계층들 중에서 대상 계층이 참조할 수 있는 시점 계층들을 포함하는 시점 참조 계층 리스트를 구성할 수 있다. 참조 계층 리스트는 하기 설명된 방법 중 적어도 하나에 따라 구성될 수 있다.
The encoding apparatus constructs a spatial quality reference layer list including at least one spatial or picture quality layer that the current encoding target layer can refer to within the same view when encoding, among lower layers of the current encoding target layer, and A view reference layer list including view layers that a target layer can refer to may be configured from among the layers having image quality. The reference layer list may be constructed according to at least one of the methods described below.

도 6은 본 발명의 일 실시예예 따라 부호화 장치에서 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성하는 방법을 설명하기 위한 제어 흐름도이다. 6 is a control flowchart illustrating a method of constructing a spatial image quality reference layer list and a view reference layer list in an encoding apparatus according to an embodiment of the present invention.

도 6에 도시되어 있는 첫 번째 실시예에 따르면, 부호화 장치는 우선, 전체 비트스트림에서 현재 부호화 대상 계층과 동일한 계층들이 동일한 시점 내에서 참조할 수 있는 공간 화질 참조 계층 리스트를 구성할 수 있다(S610).According to the first embodiment shown in FIG. 6 , the encoding apparatus may first construct a spatial image quality reference layer list that can be referred to within the same view by the same layers as the current encoding target layer in the entire bitstream (S610). ).

부호화 장치는 대상 계층과 동일 시점을 가지는 공간 및 화질 참조 계층들을 임의의 순서로 구성하여 현재 부호화 대상 계층과 동일 시점을 갖는 참조 가능한 공간 화질 참조 계층 리스트를 생성할 수 있다.The encoding apparatus may configure the spatial and picture quality reference layers having the same view as the target layer in an arbitrary order to generate a referenceable spatial picture quality reference layer list having the same view as the current encoding target layer.

또는 현재 부호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 대상 계층과 동일 시점을 가지는 공간 및 화질 참조 계층들 가운데 layer_id 값이 대상 계층의 layer_id 값과 차이가 적은 계층 (즉, 가까운 계층)부터 차이가 큰 계층 순서로 구성될 수 있다.Alternatively, the list of referenceable spatial quality reference layers having the same view as the current encoding target layer is a layer having a small difference from the layer_id value of the target layer (that is, a layer close to the layer_id value among the spatial and picture quality reference layers having the same view as the target layer). ) can be configured in a hierarchical order with a large difference.

또는 현재 부호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 대상 계층과 동일 시점을 가지는 공간 및 화질 참조 계층들 가운데 우선 순위(priority)가 높은 계층부터 낮은 계층의 순서로 구성될 수 있다.Alternatively, the referenceable spatial image quality reference layer list having the same view as the current encoding target layer may be configured in the order of a layer having a higher priority to a lower layer among spatial and image quality reference layers having the same view as the target layer. .

우선 순위와 관련된 정보는 NAL 유닛 헤더(NALU header) 혹은 비디오 파라미터 세트(video parameter set) 등에 포함되어 시그널링될 수 있다.Priority related information may be signaled by being included in a NAL unit header (NALU header) or a video parameter set.

또는 현재 부호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 현재 부호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 현재 부호화 대상 계층과 공간 해상도 차이가 적은 계층부터 큰 계층 순으로 구성될 수 있다. 이때, 동일한 공간 해상도에서의 화질 참조 계층 순서는 현재 부호화 대상 계층의 layer_id 값과 차이가 작은 계층(즉, 가까운 계층)부터 큰 계층 순서일 수 있다.Alternatively, the referenceable spatial quality reference layer list having the same view as the current encoding target layer is a layer having a small spatial resolution difference from the current encoding target layer among spatial and quality reference layers having the same view as the current encoding target layer. It can be configured in the order of the largest hierarchy from In this case, the order of the picture quality reference layers in the same spatial resolution may be from a layer having a small difference from a layer_id value of the current encoding target layer (ie, a layer close to it) to a larger layer order.

예를 들어, 도 4와 같은 비트스트림 구조에서 layer_id 가 n인 계층의 참조 계층 리스트는 layer_id 가 n-1, n-2, n-3 인 계층의 순서로 구성될 수 있다.For example, in the bitstream structure shown in FIG. 4 , the reference layer list of the layer having the layer_id of n may be configured in the order of the layers having the layer_id of n-1, n-2, and n-3.

또는 현재 부호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 현재 부호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 현재 부호화 대상 계층과 공간 해상도 차이가 작은 계층부터 큰 계층 순으로 구성될 수 있다. 이때, 동일한 공간 해상도에서의 화질 참조 계층 순서는 부호화하고자 하는 양자화 파라미터(quantization parameter)값이 낮은 값에서 높은 값의 순서(즉, 복호시 화질이 좋은 계층부터 낮은 계층순서) 일 수 있다. Alternatively, the referenceable spatial quality reference layer list having the same view as the current encoding target layer is a layer having a small spatial resolution difference from the current encoding target layer among spatial and quality reference layers having the same view as the current encoding target layer. It can be configured in the order of the largest hierarchy from In this case, the order of the picture quality reference layers in the same spatial resolution may be an order of a quantization parameter value to be encoded from a low value to a high value (ie, from a layer having a good quality during decoding to a low layer order).

대상 계층과 동일한 계층들이 참조할 수 있는 공간 화질 참조 계층 리스트가 생성되면, 부호화 장치는 현재 부호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 참조 가능한 시점 참조 계층 리스트를 다음의 방법 가운데 하나를 적용하여 구성할 수 있다(S620).When the spatial quality reference layer list that can be referenced by the same layers as the target layer is generated, the encoding apparatus applies one of the following methods to the referenceable view reference layer list composed of the same spatial and same picture quality layers as the current encoding target layer. and can be configured (S620).

부호화 장치는 현재 부호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 시점 참조 계층들이 임의의 순서로 구성된 시점 참조 계층 리스트를 생성할 수 있다.The encoding apparatus may generate a view reference layer list in which view reference layers formed in the same space and the same image quality layers as the current encoding target layer are arranged in an arbitrary order.

또는 부호화 장치는 현재 부호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 시점 참조 계층들이 현재 부호화 대상 시점과 가까운 시점부터 먼 시점순으로 구성된 시점 참조 계층 리스트를 생성할 수 있다.Alternatively, the encoding apparatus may generate a view reference layer list in which view reference layers composed of the same image quality layers and the same space as the current encoding target layer are configured in the order of a view closer to the current encoding target view to a distant view.

상기와 같이 구성된 공간화질 참조 계층 리스트들과 시점 참조 계층 리스트들은 현재 부호화 대상 영상이 속한 계층과 동일한 계층에 속한 영상들을 부호화하기 위해 사용될 수 있다.The spatial quality reference layer lists and the view reference layer lists configured as described above may be used to encode images belonging to the same layer as the layer to which the current encoding target image belongs.

현재 부호화 대상 계층과 동일한 계층들(즉, 동일한 layer_id 값을 가지는 계층들)이 참조 가능한 공간 화질 참조 계층 리스트와 시점 참조 계층 리스트는 효율적인 시그널링을 위하여 통합되어 하나의 참조 가능한 참조 계층 리스트로 기술될 수 있다.The spatial quality reference layer list and the view reference layer list in which the same layers as the current encoding target layer (that is, layers having the same layer_id value) can be referenced may be integrated for efficient signaling and described as a single referenceable reference layer list. have.

표 1 및 표 2는 참조 계층 리스트와 시점 참조 계층 리스트가 통합되어 시그널링되는 예를 도시한 것이다. Tables 1 and 2 show examples in which the reference layer list and the view reference layer list are integrated and signaled.

표 1을 참조하면, num_direct_ref_layers[i]는 i번째 계층 (즉, nuh_layer_id[i]의 layer_id를 가지는 계층)이 직접적으로 참조하는 참조계층의 수를 나타낸다.Referring to Table 1, num_direct_ref_layers[i] indicates the number of reference layers directly referred to by the i-th layer (ie, a layer having a layer_id of nuh_layer_id[i]).

ref_layer_id[i][j]는 i번째 계층이 참조하는 j번째 참조계층의 layer_id를 나타낸다.ref_layer_id[i][j] represents the layer_id of the j-th reference layer referenced by the i-th layer.

표 1과 같이, 공간 화질 참조 계층 리스트와 시점 참조 계층 리스트는 비디오 파라미터 세트에서 layer_id_in_nuh[i]의 값을 가지는 계층의 참조 계층들(ref_layer_id)이 기술(descprition)됨으로써 시그널링될 수 있다.As shown in Table 1, the spatial quality reference layer list and the view reference layer list may be signaled by description of reference layers (ref_layer_id) of a layer having a value of layer_id_in_nuh[i] in the video parameter set.

표 2를 참조하면, direct_dependency_flag[i][j]는 i번째 계층 (즉, nuh_layer_id[i]의 layer_id를 가지는 계층) 이 j번째 참조계층((즉, nuh_layer_id[j]의 layer_id를 가지는 계층)을 참조하는지 여부를 나타내고, direct_dependency_flag[i][j]이 “1”의 값을 가지는 경우, i번째 계층이 j번째 참조계층을 작접적으로 참조함을 의미한다.
Referring to Table 2, direct_dependency_flag[i][j] is the i-th layer (that is, the layer having the layer_id of nuh_layer_id[i]) the j-th reference layer (that is, the layer having the layer_id of nuh_layer_id[j]) Indicates whether the reference layer is referenced, and when direct_dependency_flag[i][j] has a value of “1”, it means that the i-th layer directly refers to the j-th reference layer.

통합된 참조 계층 리스트의 순서는 임의의 순서대로 시그널링 되거나, layer_id값이 큰 값부터 작은 값 순서로 시그널링될 수도 있고, 공간 화질 참조 계층 리스트 다음에 시점 참조 계층 리스트가 기술될 수도 있으며, 시점 참조 계층 리스트 다음에 공간 화질 참조 계층 리스트가 기술될 수도 있다.
The order of the integrated reference layer list may be signaled in an arbitrary order, or may be signaled from a large value to a small value of the layer_id value, and the view reference layer list may be described after the spatial quality reference layer list, and the view reference layer Following the list, a spatial quality reference layer list may be described.

현재 부호화 대상 계층의 영상이 참조할 수 있는 계층의 리스트를 구성하는 두 번째 실시예에 따르면, 부호화 장치는 현재 부호화하고자 하는 영상의 현재 부호화 대상 계층 (혹은 해당 슬라이스)이 참조 가능한 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성할 수 있다. According to the second embodiment of configuring the list of layers that the image of the current encoding target layer can refer to, the encoding apparatus lists the spatial quality reference layer that the current encoding target layer (or the corresponding slice) of the image to be currently encoded can refer to. and a view reference layer list can be configured.

부호화 장치는 우선, 현재 부호화하고자 하는 영상의 현재 부호화 대상 계층이 참조 가능한 공간 화질 참조 계층 리스트를 다음의 방법 가운데 하나로 구성할 수 있다.First, the encoding apparatus may configure a spatial quality reference layer list that can be referenced by a current encoding target layer of an image to be currently encoded using one of the following methods.

부호화 장치는 대상 계층과 동일 시점을 가지는 공간 및 화질 참조 계층들을 임의의 순서로 구성하여 현재 부호화 대상 계층이 참조 가능한 공간 화질 참조 계층 리스트를 생성할 수 있다. The encoding apparatus may configure spatial and picture quality reference layers having the same view as the target layer in an arbitrary order to generate a list of spatial picture quality reference layers that the current encoding target layer can refer to.

또는 부호화 장치는 대상 계층과 동일 시점을 가지는 공간 및 화질 참조 계층들 가운데 layer_id 값이 대상 계층의 layer_id 값과 차이가 적은 계층 (즉, 가까운 계층)부터 큰 계층 순서로 구성하여 공간 화질 참조 계층 리스트를 생성할 수 있다. Alternatively, the encoding device configures the spatial quality reference layer list in the order of the largest layer from the layer having a smaller layer_id value than the layer_id value of the target layer (ie, the closest layer) among the spatial and picture quality reference layers having the same view as the target layer. can create

또는 공간 화질 참조 계층 리스트는 대상 계층과 동일 시점을 가지는 공간 및 화질 참조 계층들 가운데 우선 순위(priority)가 높은 계층부터 낮은 계층의 순서로 구성될 수 있다.Alternatively, the spatial quality reference layer list may be configured in order from a layer having a high priority to a layer having a low priority among spatial and picture quality reference layers having the same view as the target layer.

이때, 우선 순위와 관련된 정보는 NAL 유닛 헤더(NALU header) 혹은 비디오 파라미터 세트(video parameter set) 등에 포함되어 시그널링될 수 있다.In this case, the priority-related information may be signaled by being included in a NAL unit header (NALU header) or a video parameter set.

또는 현재 부호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 현재 부호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 현재 부호화 대상 계층과 공간 해상도 차이가 적은 계층부터 큰 계층 순으로 구성될 수 있다. 이때, 동일한 공간 해상도를 갖는 화질 참조 계층의 순서는 현재 부호화 대상 계층의 layer_id 값과 차이가 작은 계층(즉, 가까운 계층)부터 큰 계층 순서일 수 있다.Alternatively, the referenceable spatial quality reference layer list having the same view as the current encoding target layer is a layer having a small spatial resolution difference from the current encoding target layer among spatial and quality reference layers having the same view as the current encoding target layer. It can be configured in the order of the largest hierarchy from In this case, the order of the picture quality reference layers having the same spatial resolution may be from a layer having a small difference from a layer_id value of the current encoding target layer (ie, a layer close to it) to a larger layer order.

대상 계층이 참조할 수 있는 공간 화질 참조 계층 리스트가 생성되면, 부호화 장치는 현재 부호화 대상 계층과 동일한 공간과 동일한 화질계층들로 이루어진 참조 가능한 시점 참조 계층 리스트를 다음의 방법 가운데 하나를 적용하여 구성할 수 있다.When the spatial image quality reference layer list that the target layer can refer to is generated, the encoding apparatus constructs a referenceable view reference layer list comprising the same space and the same picture quality layers as the current encoding target layer by applying one of the following methods. can

부호화 장치는 현재 부호화 대상 계층과 동일한 공간과 동일한 화질계층들로 이루어진 시점 참조 계층들이 임의의 순서로 구성된 시점 참조 계층 리스트를 생성할 수 있다.The encoding apparatus may generate a view reference layer list in which view reference layers formed in the same space and the same picture quality layers as the current encoding target layer are configured in an arbitrary order.

상기와 같이 구성된 공간 화질 참조 계층 리스트들과 시점 참조 계층 리스트들은 현재 부호화 대상 영상의 부호화 대상 계층 또는 해당 슬라이스를 부호화하기 위해 사용될 수 있다.The spatial quality reference layer lists and view reference layer lists configured as described above may be used to encode an encoding target layer or a corresponding slice of the current encoding target image.

현재 부호화 대상 계층과 동일한 계층들(즉, 동일한 layer_id 값을 가지는 계층들)이 참조 가능한 공간 화질 참조 계층 리스트와 시점 참조 계층 리스트는 효율적인 시그널링을 위하여 통합되어 하나의 참조 가능한 참조 계층 리스트로 기술될 수 있다. The spatial quality reference layer list and the view reference layer list in which the same layers as the current encoding target layer (that is, layers having the same layer_id value) can be referenced may be integrated for efficient signaling and described as a single referenceable reference layer list. have.

표 3 내지 표 12는 공간 화질 참조 계층 리스트와 시점 참조 계층 리스트가 통합되어 시그널링되는 예를 도시한 것이다. Tables 3 to 12 show examples in which the spatial quality reference layer list and the view reference layer list are integrated and signaled.

예를 들어, 부호화기는 표 3에서 표 12까지의 신택스 요소 중 하나를 슬라이스 헤더에 포함킬 수 있고, 이를 통해 참조계층들에 대한 기술(discription)을 시그널링할 수 있다.For example, the encoder may include one of the syntax elements from Tables 3 to 12 in the slice header, and through this, may signal a description of reference layers.

이때, 기술되는 해당 계층이 부호화시 참조할 수 있는 계층들은 전체 비트스트림에서 현재 부호화 대상 계층과 동일한 계층들이 참조할 수 있는 참조계층들의 서브 세트(sub-set), 즉 슬라이스 헤더에서 시그널링되는 참조 계층들은 전체 비트스트림에서 현재 부호화 대상 계층과 동일한 계층이 참조할 수 있는 참조 계층들 중 일부로 구성될 수 있다.In this case, the layers that the described corresponding layer can refer to during encoding are a subset of reference layers that can be referenced by the same layers as the current encoding target layer in the entire bitstream, that is, the reference layer signaled in the slice header. may be composed of some of reference layers that can be referenced by the same layer as the current encoding target layer in the entire bitstream.

예컨대, 슬라이스 헤더에 시그널링 되는 참조계층들은 비디오 파라미터 세트에서 시그널링되는 현재 부호화 대상 계층과 동일한 계층들이 참조할 수 있는 참조계층 리스트의 서브 세트일 수 있다.For example, the reference layers signaled in the slice header may be a subset of the reference layer list that can be referenced by the same layers as the current encoding target layer signaled in the video parameter set.

표 3을 참조하면 slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다. Referring to Table 3, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. slice_num_direct_ref_layers must be less than or equal to the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set).

ref_layer_id[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 layer_id를 나타낸다.ref_layer_id[j] indicates the layer_id of the j-th reference layer directly referenced by the corresponding image.

표 4를 참조하면, slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다.Referring to Table 4, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set.

ref_layer_id_delta[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 layer_id와 j-1번째 참조계층의 layer_id의 차이를 나타낸다. 이때, 레이어의 인덱스가 “0”에 가까울수록 현재 영상이 해당하는 계층과 가까운 layer_id를 가질 수 있다. ref_layer_id_delta[0]는 0번째 참조계층의 layer_id와 현재 영상이 해당하는 계층의 layer_id와의 차이를 나타낼 수 있다. ref_layer_id_delta[j] represents the difference between the layer_id of the j-th reference layer directly referenced by the corresponding image and the layer_id of the j-1th reference layer. In this case, as the index of the layer is closer to “0”, the current image may have a layer_id closer to the corresponding layer. ref_layer_id_delta[0] may indicate a difference between the layer_id of the 0th reference layer and the layer_id of the layer corresponding to the current image.

표 5를 참조하면, slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다.Referring to Table 5, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set.

ref_layer_idx_delta[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 인덱스(vps에 기술된 인덱스 기준)와 j-1번째 참조계층의 인덱스(vps에 기술된 인덱스 기준)의 차이를 나타내고, ref_layer_idx_delta[0]는 0번째 참조계층의 인덱스를 나타낼 수 있다.ref_layer_idx_delta[j] represents the difference between the index of the j-th reference layer directly referenced by the image (based on the index described in vps) and the index of the j-1th reference layer (based on the index described in vps), ref_layer_idx_delta[ 0] may indicate the index of the 0th reference layer.

표 6을 참조하면, slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다. Referring to Table 6, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set.

ref_layer_idx[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 인덱스(vps에 기술된 인덱스 기준) 를 나타낼 수 있다.ref_layer_idx[j] may indicate the index (based on the index described in vps) of the j-th reference layer directly referenced by the corresponding image.

표 7을 참조하면, slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id 를 가지는 계층들이 참조하는 참조계층의 수 (즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) 와 같거나 작아야 한다. slice_num_direct_ref_layers 가 “0”인 경우는 비디오 파라미터 세트에서 시그널링 되는 해당 영상에 해당하는 참조계층을 현재 영상의 참조계층으로 사용할 수 있다. Referring to Table 7, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. At this time, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers referenced by layers having nuh_layer_id (that is, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) with the same layer_id as the corresponding image signaled in the video parameter set. ”, the reference layer corresponding to the corresponding image signaled in the video parameter set may be used as the reference layer of the current image.

ref_layer_id_delta[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 layer_id와 j-1번째 참조계층의 layer_id의 차이를 나타낸다. 이때, 레이어 인덱스가 “0”에 가까울수록 현재 영상이 해당하는 계층과 가까운 layer_id를 가질 수 있다. ref_layer_id_delta[0]는 0번째 참조계층의 layer_id와 현재 영상이 해당하는 계층의 layer_id와의 차이를 나타낼 수 있다.ref_layer_id_delta[j] represents the difference between the layer_id of the j-th reference layer directly referenced by the corresponding image and the layer_id of the j-1th reference layer. In this case, as the layer index is closer to “0”, the current image may have a layer_id closer to the corresponding layer. ref_layer_id_delta[0] may indicate a difference between the layer_id of the 0th reference layer and the layer_id of the layer corresponding to the current image.

표 8을 참조하면, slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다. slice_num_direct_ref_layers 가 “0”인 경우는 video parameter set에서 시그널링 되는 해당 영상에 해당하는 참조계층을 현재 영상의 참조계층으로 사용할 수 있다.Referring to Table 8, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set. When slice_num_direct_ref_layers is “0”, the reference layer corresponding to the image signaled in the video parameter set can be used as the reference layer of the current image.

ref_layer_idx_delta[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 인덱스(vps에 기술된 인덱스 기준)와 j-1번째 참조계층의 인덱스(vps에 기술된 인덱스 기준)의 차이를 나타낸다. ref_layer_idx_delta[0]는 0번째 참조계층의 인덱스를 나타낼 수 있다.ref_layer_idx_delta[j] represents the difference between the index of the j-th reference layer directly referenced by the corresponding image (based on the index described in vps) and the index of the j-1th reference layer (based on the index described in vps). ref_layer_idx_delta[0] may indicate the index of the 0th reference layer.

표 9를 참조하면, slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다. slice_num_direct_ref_layers 가 “0”인 경우는 video parameter set에서 시그널링 되는 해당 영상에 해당하는 참조계층을 현재 영상의 참조계층으로 사용할 수 있다.Referring to Table 9, slice_num_direct_ref_layers indicates the number of reference layers directly referenced by a corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set. When slice_num_direct_ref_layers is “0”, the reference layer corresponding to the image signaled in the video parameter set can be used as the reference layer of the current image.

ref_layer_idx는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 인덱스(vps에 기술된 인덱스 기준) 를 나타낼 수 있다.ref_layer_idx may indicate the index (based on the index described in vps) of the j-th reference layer directly referenced by the corresponding image.

표 10을 참조하면, layer_dependency_sps_flag는 참조계층 정보를 슬라이스 헤더 (슬라이스 시그먼트 헤더)에서 시그널링 하는지 여부를 나타낸다. layer_dependency_sps_flag이 “0”인 경우 시그널링 된다. Referring to Table 10, layer_dependency_sps_flag indicates whether reference layer information is signaled in a slice header (slice segment header). Signaled when layer_dependency_sps_flag is “0”.

slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다.slice_num_direct_ref_layers indicates the number of reference layers directly referenced by the corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set.

ref_layer_id_delta[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 layer_id와 j-1번째 참조계층의 layer_id의 차이를 나타낸다. 이때, 레이어의 인덱스가 “0”에 가까울수록 현재 영상이 해당하는 계층과 가까운 layer_id를 가질 수 있다. ref_layer_id_delta[0]는 ref_layer_id[0]와 현재 영상의 layer_id와의 차이를 나타낼 수 있다. ref_layer_id_delta[j] represents the difference between the layer_id of the j-th reference layer directly referenced by the corresponding image and the layer_id of the j-1th reference layer. In this case, as the index of the layer is closer to “0”, the current image may have a layer_id closer to the corresponding layer. ref_layer_id_delta[0] may indicate a difference between ref_layer_id[0] and layer_id of the current image.

표 11을 참조하면, layer_dependency_sps_flag는 참조계층 정보를 슬라이스 헤더 (슬라이스 시그먼트 헤더)에서 시그널링 하는지 여부를 나타낸다. layer_dependency_sps_flag이 “0”인 경우 시그널링 된다.Referring to Table 11, layer_dependency_sps_flag indicates whether reference layer information is signaled in a slice header (slice segment header). Signaled when layer_dependency_sps_flag is “0”.

slice_num_direct_ref_layers는 해당 영상이 직접적으로 참조하는 참조계층의 수를 나타낸다. 이 때, slice_num_direct_ref_layers는 비디오 파라미터 세트에서 시그널링되는 해당 영상과 동일한 layer_id (즉, nuh_layer_id)를 가지는 계층들이 참조하는 참조계층의 수(즉, NumDirectRefLayers[LayerIdInVps[nuh_layer_id])와 같거나 작아야 한다. slice_num_direct_ref_layers indicates the number of reference layers directly referenced by the corresponding image. In this case, slice_num_direct_ref_layers must be equal to or smaller than the number of reference layers (ie, NumDirectRefLayers[LayerIdInVps[nuh_layer_id]) referenced by layers having the same layer_id (ie, nuh_layer_id) as the corresponding image signaled in the video parameter set.

표 12를 참조하면, layer_dependency_sps_flag는 참조계층 정보를 슬라이스 헤더 (슬라이스 시그먼트 헤더)에서 시그널링 하는지 여부를 나타낸다. layer_dependency_sps_flag이 “0”인 경우 시그널링 된다.Referring to Table 12, layer_dependency_sps_flag indicates whether reference layer information is signaled in a slice header (slice segment header). Signaled when layer_dependency_sps_flag is “0”.

ref_layer_idx[j]는 해당 영상이 직접적으로 참조하는 j번째 참조계층의 인덱스(vps에 기술된 인덱스 기준) 를 나타낸다.
ref_layer_idx[j] indicates the index (based on the index described in vps) of the j-th reference layer directly referenced by the corresponding image.

다시 도 5로 돌아가서, 현재 부호화 대상 계층의 영상이 참조할 수 있는 계층의 리스트를 구성한 부호화 장치는 대상 계층이 참조할 수 있는 시점 참조 계층의 복호화 영상이 포함된 현재 부호화 대상 영상의 화면간 예측을 위한 참조 픽쳐 리스트를 생성한다(S520).Returning to FIG. 5 , the encoding apparatus configuring a list of layers that can be referenced by the image of the current encoding target layer performs inter prediction of the current encoding target image including the decoded image of the view reference layer that the target layer can refer to. A reference picture list is generated for ( S520 ).

부호화 장치는 시점 참조 계층의 복호화 영상을 포함하여 현재 부호화 대상 영상의 화면간 예측을 위한 참조픽쳐 집합(reference picture set)을 구성하고 참조픽쳐 형태표시(reference picture marking)를 수행할 수 있다.The encoding apparatus may configure a reference picture set for inter prediction of the current encoding target image including the decoded image of the view reference layer, and perform reference picture marking.

이 때, 부호화 장치는 시점 참조 계층 리스트에 포함된 영상이 복원된 영상으로 가용한지(available)를 확인하고 가용한 경우 해당 복원영상을 참조픽쳐 집합에 포함시키고, 가용하지 않은 경우 해당 복원영상을 “no reference picture”로 표시할 수 있다.At this time, the encoding device checks whether the image included in the view reference layer list is available as a reconstructed image, and if available, includes the reconstructed image in the reference picture set, and if not available, the reconstructed image is “ No reference picture”.

시점 참조 계층 리스트에 포함된 영상들로 구성된 참조픽쳐 집합(제1 집합)은 “used for long term reference”로 표시되어 현재 부호화 대상 영상의 화면간 예측시 장기 참조 픽쳐(long-term reference picture) 로 취급될 수 있다.A reference picture set (first set) composed of images included in the view reference layer list is marked as “used for long term reference” and is used as a long-term reference picture for inter prediction of the current encoding target image. can be treated.

제1 집합, 즉 시점 참조 계층 리스트에 포함된 영상들로 구성된 참조픽쳐 집합 이외에 현재 부호화 대상 계층과 동일한 계층의 영상으로 이루어진 화면간 예측을 위한 참조픽쳐 집합은 다양한 형태로 존재할 수 있다.In addition to the first set, that is, a reference picture set composed of images included in the view reference layer list, a reference picture set for inter prediction composed of an image of the same layer as the current encoding target layer may exist in various forms.

부호화 대상 계층과 동일한 계층의 영상으로 이루어진 화면간 예측을 위한 참조픽쳐 집합은 현재 부호화 대상 영상의 화면간 예측에 사용되며, 디스플레이 순서상 현재 부호화 대상 영상을 기준으로 이전인 단기 참조 픽쳐(short-term reference picture) (제2 집합), 현재 부호화 대상 영상의 화면간 예측에 사용되며, 디스플레이 순서상 현재 부호화 대상 영상을 기준으로 이후인 단기 참조 픽쳐 (제3 집합), 현재 부호화 대상 영상의 화면간 예측을 위한 장기 참조 픽쳐 (제4 집합), 현재 부호화 대상 영상 이후에 부호화할 수 있는 영상을 위한 단기 참조 픽쳐 (제5 집합), 현재 부호화 대상 영상 이후에 부호화할 수 있는 영상을 위한 장기 참조 픽쳐 (제6 집합)으로 구성될 수 있다. The reference picture set for inter prediction, which consists of images of the same layer as the encoding target layer, is used for inter prediction of the current encoding target image, and is a short-term reference picture that is previous to the current encoding target image in the display order. reference picture) (second set), a short-term reference picture that is used for inter prediction of the current encoding target image, and is later based on the current encoding target image in display order (third set), inter prediction of the current encoding target image A long-term reference picture (fourth set) for , a short-term reference picture (fifth set) for an image that can be encoded after the current encoding target image, and a long-term reference picture for an image that can be encoded after the current encoding target image ( 6th set).

또한 부호화 장치는 상기와 같은 여러 참조 픽쳐 집합에 기초하여 참조픽쳐 집합의 특성과 참조픽쳐 형태에 따라 현재 부호화 대상 영상의 참조픽쳐 리스트를 생성할 수 있다.Also, the encoding apparatus may generate the reference picture list of the current encoding target image according to the characteristics of the reference picture set and the reference picture type based on the various reference picture sets as described above.

일 예로, 부호화 장치는 현재 부호화 대상 영상과 동일한 계층의 영상들로 이루어진 참조픽쳐 집합들로 구성된 화면간 참조영상 리스트 L0 및 L1에 제1 집합에 포함된 시점 참조 계층 리스트로 구성된 참조픽쳐 집합을 다음과 같이 추가하여 최종 참조픽쳐 리스트를 생성할 수 있다.As an example, the encoding apparatus adds a reference picture set including a view reference layer list included in the first set to inter-picture reference picture lists L0 and L1 including reference picture sets including images of the same layer as the current encoding target image. It is possible to create a final reference picture list by adding

이 경우, 부호화 장치는 참조픽쳐 리스트 생성시마다 고정된 위치에 시점 참조 계층의 복호화 영상을 추가할 수도 있고, 효율적인 부호화를 위해 참조픽쳐 리스트 생성한 후 추가로 시점 참조 계층의 복호화 영상의 위치를 변경할 수도 있다.In this case, the encoding apparatus may add the decoded image of the view reference layer to a fixed position each time the reference picture list is generated, or may additionally change the position of the decoded image of the view reference layer after generating the reference picture list for efficient encoding. have.

참조픽쳐 리스트 생성시마다 고정된 위치에 시점 참조 계층의 복호화 영상을 추가하는 경우, L0 리스트 생성 시 제1 집합을 마지막 혹은 첫 번째(ref_idx=0) 혹은 두 번째(ref_idx=1) 위치에부터 추가할 수 있다.When the decoded image of the view reference layer is added at a fixed position each time the reference picture list is generated, the first set is added from the last or first (ref_idx=0) or second (ref_idx=1) position when the L0 list is generated. can

시점 참조 계층을 L0 리스트의 중간 위치에 추가하는 경우, 해당위치 이후에 있는 영상들의 리스트 내의 인덱스는 추가된 시점 참조 계층의 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.When the view reference layer is added to the middle position of the L0 list, the index in the list of images after the corresponding position may be increased by the number of the added view reference layers (the number of reference picture sets composed of the reference layer list). .

또는 부호화 장치는 L0 리스트 생성 시 제1 집합으로 첫 번째 (ref_idx=0) 혹은 두 번째 (ref_idx=1) 위치에서부터 시점 참조 계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다.Alternatively, the encoding apparatus may replace as many reference images as the number of reference picture sets including the view reference layer list from the first (ref_idx=0) or second (ref_idx=1) position as the first set when generating the L0 list.

부호화 장치는 L0 리스트 생성 시 제1 집합을 임의의 시그널링된 위치에서부터 추가할 수 있다. 제1 집합을 리스트의 중간 위치에 추가하는 경우, 해당 위치 및 그 이후에 있던 영상들의 리스트 내의 인덱스는 추가된 시점참조계층 수(시점참조계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가시될 수 있다.The encoding apparatus may add the first set from an arbitrary signaled position when generating the L0 list. When the first set is added to the middle position of the list, the index in the list of images at the position and thereafter is increased by the number of added view reference layers (the number of reference picture sets composed of the view reference layer list). can

또는 부호화 장치는 L0 리스트 생성 시 제1 집합으로 임의의 시그널링된 위치에서부터 시점 참조 계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다.Alternatively, the encoding apparatus may replace as many reference pictures as the number of reference picture sets composed of the view reference layer list from an arbitrary signaled position as the first set when generating the L0 list.

또는 부호화 장치는 L0 리스트 생성 시 제1 집합의 시점 참조계층 리스트에 포함된 각각의 픽쳐들을 임의의 서로 다른 위치에 추가할 수 있다. 추가된 영상들의 해당 위치 및 이후에 있던 영상들의 리스트 내의 인덱스는 추가된 시점참조계층 수(시점참조계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.Alternatively, the encoding apparatus may add each picture included in the view reference layer list of the first set at different positions when generating the L0 list. The corresponding position of the added images and the index in the list of subsequent images may be increased by the number of added view reference layers (the number of reference picture sets composed of the view reference layer list).

또는 부호화 장치는 L0 리스트 생성 시 제1 집합의 시점 참조계층 리스트에 포함된 각각의 픽쳐들로 임의의 서로 다른 위치에 있는 참조영상들을 대체할 수 있다.Alternatively, when generating the L0 list, the encoding apparatus may replace reference images located at different positions with respective pictures included in the first set of view reference layer lists.

또는 부호화 장치는 L1 리스트 생성 시 제1 집합을 마지막 혹은 첫 번째 (ref_idx=0) 혹은 두 번째 (ref_idx=1) 위치에 추가할 수 있다.Alternatively, the encoding apparatus may add the first set to the last or first (ref_idx=0) or second (ref_idx=1) position when generating the L1 list.

제1 집합을 L1 리스트의 중간 위치에 추가하는 경우, 해당위치 및 그후에 있던 영상들의 리스트 내의 인덱스는 추가한 참조 계층의 수(시점 참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가시킬 수 있다.When the first set is added to the middle position of the L1 list, the index in the list of images at the position and thereafter may be increased by the number of the added reference layers (the number of reference picture sets composed of the view reference layer list). .

또는 부호화 장치는 L1 리스트 생성 시 제1 집합으로 첫 번째 (ref_idx=0) 혹은 두 번째 (ref_idx=1) 위치에서부터 시점 참조 계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다. Alternatively, the encoding apparatus may replace as many reference images as the number of reference picture sets including the view reference layer list from the first (ref_idx=0) or second (ref_idx=1) position as the first set when generating the L1 list.

부호화 장치는 L1 리스트 생성 시 제1 집합을 임의의 시그널링된 위치에서부터 추가할 수 있다. 제1 집합을 리스트의 중간 위치에 추가하는 경우, 해당위치 및 그 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점참조계층 수(시점참조계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.The encoding apparatus may add the first set from an arbitrary signaled position when generating the L1 list. When the first set is added to the middle position of the list, the index in the list of images at the position and thereafter may be increased by the number of added view reference layers (the number of reference picture sets composed of the view reference layer list). have.

또는 부호화 장치는 L1 리스트 생성 시 제1 집합으로 임의의 시그널링된 위치에서부터 시점참조계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다.Alternatively, the encoding apparatus may replace as many reference images as the number of reference picture sets composed of the view reference layer list from an arbitrary signaled position as the first set when generating the L1 list.

또는 부호화 장치는 L1 리스트 생성 시 제1 집합의 시점참조계층 리스트에 포함된 각각의 픽쳐들을 임의의 서로 다른 위치에 추가할 수 있다. 추가된 영상들의 해당위치 및 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점참조계층 수(시점참조계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.Alternatively, when generating the L1 list, the encoding apparatus may add each picture included in the view reference layer list of the first set to arbitrary different positions. Corresponding positions of the added images and indexes in the list of subsequent images may be increased by the number of added view reference layers (the number of reference picture sets composed of the view reference layer list).

또는 부호화 장치는 L1 리스트 생성 시 제1 집합의 시점참조계층 리스트에 포함된 각각의 픽쳐들로 임의의 서로 다른 위치에 있는 참조영상들을 대체할 수 있다.Alternatively, when generating the L1 list, the encoding apparatus may replace reference images in arbitrary different positions with each picture included in the view reference layer list of the first set.

한편, 참조픽쳐 리스트 생성한 후, 추가로 효율적인 부호화를 위해 시점 참조 계층의 복호화 영상의 위치를 변경하는 경우, 슬라이스 헤더 혹은 픽쳐 파라미터 세트에 포함될 수 있는 부호화 파라미터를 이용하여 시점 참조 계층의 복호화 영상의 위치를 참조픽쳐 리스트의 어떠한 위치로든지 변경시킬 수 있다
On the other hand, when the position of the decoded image of the view reference layer is changed for additional efficient encoding after the reference picture list is generated, the decoded image of the view reference layer is changed using encoding parameters that may be included in the slice header or the picture parameter set. The position can be changed to any position in the reference picture list.

참조 계층 리스트가 생성되면, 부호화 장치는 현재 계층의 영상을 블록 단위로 부호화할 수 있다(S530).When the reference layer list is generated, the encoding apparatus may encode the image of the current layer in units of blocks ( S530 ).

부호화 장치는 현재계층의 부호화 대상 블록이 참조 가능한 공간 화질 참조 계층 혹은 시점 참조 계층의 영상을 포함한 화면간 예측을 이용하여 대상 블록을 부호화 할 수 있다. The encoding apparatus may encode the target block by using inter prediction including the image of the spatial quality reference layer or the view reference layer to which the encoding target block of the current layer can be referenced.

일 예로, 부호화 장치는 참조 가능한 공간 화질 참조 계층의 참조 블록에 대한 복수의 정보들 중 적어도 하나를 이용하여 부호화를 수행할 수 있다. 이때, 참조 계층의 참조 블록은 현재 계층의 현재 부호화 대상 블록에 대응되는 참조 계층의 블록으로 예를 들어, 현재 부호화 대상 블록과 동일 위치의 블록을 의미할 수 있다.For example, the encoding apparatus may perform encoding by using at least one of a plurality of pieces of information on a reference block of a referenceable spatial image quality reference layer. In this case, the reference block of the reference layer is a block of the reference layer corresponding to the current encoding object block of the current layer, and may mean, for example, a block at the same position as the current encoding object block.

부호화 장치는 현재계층의 부호화 대상 블록이 참조 가능한 공간 화질 참조 계층 리스트 가운데 하나의 참조 계층을 선택할 수 있으며, 현재계층의 부호화 대상 블록은 참조 계층의 참조 블록의 정보들 중 참조 블록의 복원샘플, 참조 블록의 레지듀얼 및 참조 블록의 부호화 파라미터, 예를 들어, 참조 프레임, 움직임 벡터 예측 모드, 블록 파티셔닝 정보 중 어느 하나 또는 적어도 하나를 이용하여 대상 블록을 부호화를 수행할 수 있다.The encoding apparatus may select one reference layer from a list of spatial quality reference layers that can be referenced by the encoding target block of the current layer, and the encoding target block of the current layer is a reference block reconstruction sample and reference among information of the reference block of the reference layer. Encoding of the target block may be performed using any one or at least one of residual of the block and encoding parameters of the reference block, for example, a reference frame, a motion vector prediction mode, and block partitioning information.

부호화 대상 블록의 부호화시 참조 계층 리스트에 포함되어 있는 참조 계층의 정보를 이용한 경우, 부호화 장치는 사용한 참조 계층을 나타내는 인덱스를 부호화할 수 있다.When the encoding object block is encoded using information on the reference layer included in the reference layer list, the encoding apparatus may encode an index indicating the reference layer used.

예를 들어, 부호화 장치는 도 4의 layer_id 가 n 인 계층이 참조하는 공간 화질 참조 계층 리스트에 포함된 계층의 layer_id 가 n-1, n-2이고, layer_id 가 n-1인 계층이 참조 계층 리스트의 0에 인덱스 되고, layer_id 가 n-2인 계층이 참조 계층 리스트의 1에 인덱스 된 경우, 현재 부호화 대상 블록이 layer_id 가 n-2인 참조 계층을 참조하는 경우 공간 화질 참조 계층 리스트의 인덱스 “1”을 부호화하여 시그널링할 수 있다.For example, in the encoding apparatus, layer_id of a layer included in the spatial quality reference layer list referenced by a layer having layer_id of n in FIG. 4 is n-1 and n-2, and a layer having layer_id of n-1 is a reference layer list When the layer indexed to 0 and layer_id of n-2 is indexed to 1 of the reference layer list, when the current encoding target block refers to the reference layer having layer_id of n-2, index “1 of the spatial quality reference layer list” ” can be encoded to signal.

이때 사용되는 공간화질 참조계층 리스트는 슬라이스 헤더에서 시그널링되는 현재 부호화 대상 계층이 참조하는 참조계층 리스트로부터 구성될 수 있다. 만약에 슬라이스 헤더에서 참조계층 리스트를 시그널링하지 않는다면, 비디오 파라미터 세트에서 시그널링되는 전체 비트스트림에서 현재 부호화 대상계층과 동일한 계층들이 참조할 수 있는 참조계층들로부터 구성될 수 있다.In this case, the spatial quality reference layer list used may be configured from the reference layer list referenced by the current encoding target layer signaled in the slice header. If the reference layer list is not signaled in the slice header, the same layers as the current encoding target layer in the entire bitstream signaled in the video parameter set may be configured from reference layers that can be referenced.

다른 예에 따르면, 부호화 장치는 현재계층의 현재 부호화 대상 블록이 화면간 예측을 하는 경우, 참조픽쳐 리스트내의 참조픽쳐를 이용해서 현재 부호화 대상 블록에 대한 움직임 예측 및 움직임 보상을 수행할 수 있다.According to another example, when the current encoding object block of the current layer performs inter prediction, the encoding apparatus may perform motion prediction and motion compensation on the current encoding object block by using a reference picture in the reference picture list.

본 실시예에 따를 경우, 부호화 장치는 단계 S520에서 생성된 시점참조 계층의 복호화 영상을 포함하는 참조픽쳐 리스트내의 참조픽쳐를 이용하여 통상적인 화면간 예측방법으로 현재 부호화 대상 영상에 대한 움직임 예측 및 움직임 보상을 수행할 수 있다.According to the present embodiment, the encoding apparatus uses a reference picture in the reference picture list including the decoded image of the view reference layer generated in step S520 to predict motion and motion of the current encoding target image using a conventional inter prediction method. compensation can be performed.

도 5를 참조한 본 발명에 따르면, 상위 계층의 영상을 부호화 함에 있어서 대상 계층과 동일 시점내에서 참조할 공간 및 화질 계층으로 구성된 공간 화질 참조 계층 리스트와 대상 계층과 동일한 공간 및 화질 계층들로 이루어진 시점 참조 계층 리스트를 분리하여 각 계층의 특성을 고려하여 계층간 예측을 수행할 수 있도록 함으로써 부호화 효율을 향상시키는 이점이 있다.
According to the present invention with reference to FIG. 5, when encoding an image of a higher layer, a spatial quality reference layer list composed of spatial and picture quality layers to be referenced within the same view as the target layer and a view composed of the same spatial and picture quality layers as the target layer There is an advantage of improving encoding efficiency by separating the reference layer list and performing inter-layer prediction in consideration of the characteristics of each layer.

도 7은 본 발명의 일 실시예에 따른 복호화 장치에서 상위 계층의 복호화를 수행하는 방법을 설명하기 위한 제어흐름도이다. 본 발명에 따른 복호화 장치는 적어도 한가지 이상의 스케일러빌리티 (예를 들어, 공간, 화질, 시점 스케일러빌리티)를 지원하며 다 계층 구조를 지원하는 비디오 구조에서 상위 계층의 복호화를 수행한다.7 is a control flow diagram illustrating a method of performing upper layer decoding in a decoding apparatus according to an embodiment of the present invention. The decoding apparatus according to the present invention supports at least one scalability (eg, spatial, picture quality, and view scalability) and performs upper layer decoding in a video structure supporting a multi-layered structure.

도 7을 참조하면 우선, 복호화 장치는 현재 복호화 대상 계층의 영상이 참조할 수 있는 계층들의 리스트를 구성한다(S710). 현재 복호화 대상 계층의 영상이 참조할 수 있는 계층들의 리스트는 전체 비트스트림에서 현재 복호화 대상 계층과 동일한 계층들이 참조할 수 있는 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트 또는 현재 복호화 대상 계층의 영상이 참조하고 있는 계층의 리스트를 유도함으로써 생성될 수 있다. Referring to FIG. 7 , first, the decoding apparatus constructs a list of layers that can be referenced by an image of the current decoding target layer ( S710 ). For the list of layers that the image of the current decoding target layer can refer to, the spatial quality reference layer list and the view reference layer list or the image of the current decoding target layer that can be referenced by the same layers as the current decoding target layer in the entire bitstream are referenced. It can be created by deriving a list of the hierarchies it is doing.

본 발명의 일 실시예에 따라 복호화 장치는 전체 비트스트림에서 현재 복호화 대상 계층과 동일한 계층들이 참조할 수 있는 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성할 수 있으며, 이렇게 구성된 참조 계층 리스트들은 현재 복호화 대상 영상이 속한 계층과 동일한 계층에 속한 영상들을 복호화하기 위해 사용될 수 있다.
According to an embodiment of the present invention, the decoding apparatus may construct a spatial quality reference layer list and a view reference layer list that can be referenced by the same layers as the current decoding target layer in the entire bitstream, and the thus constructed reference layer lists are currently It may be used to decode images belonging to the same layer as the layer to which the decoding target image belongs.

도 8은 본 발명의 일 실시예예 따라 복호화 장치에서 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성하는 방법을 설명하기 위한 제어 흐름도이다.8 is a control flowchart illustrating a method of constructing a spatial quality reference layer list and a view reference layer list in a decoding apparatus according to an embodiment of the present invention.

우선, 복호화 장치는 비디오 파라미터 세트에 포함되어 시그널링되는 현재 복호화 대상 계층의 참조 계층 정보를 이용하여 공간 화질 참조 계층 리스트를 구성할 수 있다(S810).First, the decoding apparatus may construct a spatial quality reference layer list by using reference layer information of a current decoding target layer that is signaled and included in a video parameter set ( S810 ).

예컨대, 복호화 장치는 표 1에 도시되어 있는 바와 같이, layer_id_in_nuh[i]의 값을 가지는 계층의 참조 계층들(ref_layer_id) 가운데 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들로 현재 복호화 대상과 동일 시점을 가지는 공간 화질 참조 계층 리스트를 구성할 수 있다.For example, as shown in Table 1, in the decoding apparatus, among reference layers (ref_layer_id) of a layer having a value of layer_id_in_nuh[i], a spatial and quality reference layer having the same view as the current decoding target. A list of spatial image quality reference layers having the same viewpoint as the current decoding target may be configured with the .

또 다른 예에 따르면, 복호화 장치는 표 2와 같이 시그널링 되는 nuh_layer_id의 값을 가지는 계층의 참조계층들 가운데, 현재 복호화 대상과 동일시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들로 공간 화질 참조 계층 리스트를 구성할 수 있다.According to another example, the decoding apparatus uses spatial and quality reference layers having the same view point as the current decoding target among reference layers of a layer having a value of nuh_layer_id signaled as shown in Table 2 A reference hierarchy list can be constructed.

공간 화질 참조 계층 리스트를 구성함에 있어서 계층들의 순서는 다양하게 설정될 수 있다.In constructing the spatial image quality reference layer list, the order of the layers may be variously set.

예를 들어 복호화 장치는 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 layer_id 값이 복호화 대상 계층의 layer_id 값과 차이가 작은 계층 (즉, 가까운 계층)부터 큰 계층 순서로 공간 화질 참조 계층 리스트를 구성할 수 있다.For example, in the decoding apparatus, among spatial and quality reference layers having the same view as the current decoding target, the layer_id value from the layer having a small difference from the layer_id value of the decoding target layer (that is, the closest layer) to the largest layer The spatial quality reference layer list can be constructed in order.

또는 복호화 장치는 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 우선 순위(priority)가 높은 계층부터 낮은 계층의 순서로 공간 화질 참조 계층 리스트를 구성할 수 있다.Alternatively, the decoding apparatus may configure the spatial quality reference layer list in the order of a layer having a high priority to a layer having a low priority among spatial and quality reference layers having the same view as the current decoding target.

이 때, 우선 순위에 대한 정보는 NAL 유닛 헤더 혹은 비디오 파라미터 세트등에서 시그널링될 수 있다.In this case, information on priority may be signaled in a NAL unit header or a video parameter set.

또는 복호화 장치는 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 현재 복호화 대상 계층과 공간 해상도 차이가 작은 계층부터 큰 계층 순으로 공간 화질 참조 계층 리스트를 구성할 수 있다.Alternatively, the decoding apparatus may configure the spatial quality reference layer list in order from the smallest to the largest spatial resolution difference from the current decoding target layer among spatial and quality reference layers having the same view as the current decoding target. have.

이때, 동일한 공간 해상도에서의 화질 참조 계층 순서는 현재 복호화 대상 계층의 layer_id값과 차이가 작은 계층(즉, 가까운 계층)부터 큰 계층 순서로 구성할 수 있다. In this case, the image quality reference layer order in the same spatial resolution may be configured from a layer having a small difference from the layer_id value of the current decoding target layer (that is, a layer close to it) to a larger layer order.

또는 현재 복호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 현재 복호화 대상 계층과 공간 해상도 차이가 작은 계층부터 큰 계층 순으로 구성될 수 있다. 이때, 동일한 공간 해상도에서의 화질 참조 계층 순서는 복호화하고자 하는 양자화 파라미터(quantization parameter)값이 낮은 값에서 높은 값의 순서(즉, 복호시 화질이 좋은 계층부터 낮은 계층순서) 일 수 있다. Alternatively, the referenceable spatial quality reference layer list having the same view as the current decoding target layer is a layer having a small spatial resolution difference from the current decoding target layer among spatial and quality reference layers having the same view as the current decoding target layer. It can be configured in the order of the largest hierarchy from In this case, the order of the picture quality reference layers in the same spatial resolution may be an order of a quantization parameter to be decoded from a low value to a high value (ie, from a layer having a good picture quality to a low layer during decoding).

대상 계층과 동일한 계층들이 참조할 수 있는 공간 화질 참조 계층 리스트가 생성되면, 복호화 장치는 비디오 파라미터 세트에 포함되어 시그널링되는 현재 복호화 대상 계층의 참조 계층 정보를 이용하여 현재 복호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 참조 가능한 시점 참조 계층 리스트를 구성할 수 있다(S820).When the spatial quality reference layer list that the same layers as the target layer can refer to is generated, the decoding apparatus uses the reference layer information of the current decoding target layer signaled by being included in the video parameter set to have the same space as the current decoding target layer. A referenceable view reference layer list composed of picture quality layers may be configured (S820).

예를 들어, 복호화 장치는 표 1과 같이 시그널링되는 layer_id_in_nuh[i]의 값을 가지는 계층의 참조 계층들 (ref_layer_id) 중 현재 복호화 대상과 동일한 공간(spatial)과 동일한 화질(quality)을 가지는 계층들 가운데 현재 부호화 대상 계층의 시점이 다른 계층들로 참조 계층 리스트를 구성할 수 있다. For example, among the reference layers (ref_layer_id) of the layer having the value of layer_id_in_nuh[i] signaled as shown in Table 1, the decoding apparatus is configured among the layers having the same spatial and same quality as the current decoding target. The reference layer list may be composed of layers having different views of the current encoding target layer.

또 다른 예에 따르면, 복호화 장치는 표 2와 같이 시그널링 되는 nuh_layer_id의 값을 가지는 계층의 참조계층들 가운데, 현재 복호화 대상과 동일한 공간 (spatial) 및 동일한 화질(quality)을 가지는 계층들 가운데 현재 부호화 대상 계층의 시점이 다른 계층들로 시점 참조 계층 리스트를 구성할 수 있다.According to another example, the decoding apparatus currently encodes a current encoding target among reference layers of a layer having a value of nuh_layer_id signaled as shown in Table 2, among layers having the same spatial and same quality as a current decoding target. A view reference layer list may be configured with layers having different views of the layer.

복호화 장치는 현재 부호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 시점 참조 계층들이 시그널링 되는 순서대로 시점 참조 계층 리스트를 구성할 수 있다.The decoding apparatus may configure the view reference layer list in the order in which view reference layers including the same space and the same picture quality layers as the current encoding target layer are signaled.

또는 복호화 장치는 현재 복호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 시점 참조 계층들이 현재 복호화 대상 시점과 가까운 시점부터 먼 시점순으로 구성된 시점 참조 계층 리스트를 생성할 수 있다.
Alternatively, the decoding apparatus may generate a view reference layer list in which view reference layers composed of the same space and the same picture quality layers as the current decoding target layer are in the order of a view closer to the current decoding target view to a distant view.

본 발명의 다른 실시예에 따라 복호화 장치는 현재 부호화하고자 하는 영상의 현재 부호화 대상 계층 (혹은 해당 슬라이스)이 참조 가능한 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성할 수 있으며, 구성된 참조 계층 리스트들은 현재 복호화 대상 영상을 복호화하기 위해 사용될 수 있다.According to another embodiment of the present invention, the decoding apparatus may construct a spatial quality reference layer list and a view reference layer list that can be referenced by a current encoding target layer (or a corresponding slice) of an image to be currently encoded, and the configured reference layer lists are It may be used to decode a current decoding target image.

복호화 장치는 현재 복호화 대상 계층의 슬라이스 헤더에서 시그널링되는 참조 계층 정보를 이용하여 공간 화질 참조 계층 리스트 및 시점 참조 계층 리스트를 구성할 수 있다.The decoding apparatus may configure the spatial quality reference layer list and the view reference layer list by using reference layer information signaled in the slice header of the current decoding target layer.

복호화 장치는 현재 복호화대상 영상이 한 개 이상의 슬라이스로 나누어져 있는 경우에도 슬라이스 헤더에서 시그널링되는 참조계층 정보는 동일할 수 있다.The decoding apparatus may have the same reference layer information signaled in the slice header even when the current decoding target image is divided into one or more slices.

복호화 장치는 우선, 현재 부호화하고자 하는 영상의 현재 부호화 대상 계층이 참조 가능한 공간 화질 참조 계층 리스트를 다음의 방법 가운데 하나로 구성할 수 있다.First, the decoding apparatus may configure a spatial quality reference layer list that can be referenced by a current encoding target layer of an image to be currently encoded using one of the following methods.

예를 들어, 표 3에서 표 12까지의 방식 가운데 하나를 사용해서, 슬라이스 헤더에 시그널링 되는 참조계층들 가운데 현재 복호화 대상과 동일시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들로 공간 화질 참조계층 리스트를 구성할 수 있다.For example, using one of the schemes from Tables 3 to 12, spatial image quality as spatial and quality reference layers having the same view point as the current decoding target among reference layers signaled to the slice header A reference layer list can be constructed.

이때, 슬라이스 헤더에 시그널링되는 참조계층들은 전체 비트스트림에서 현재 복호화 대상 계층과 동일한 계층들이 참조할 수 있는 참조계층들의 서브 세트일 수 있다.In this case, the reference layers signaled in the slice header may be a subset of reference layers that can be referenced by the same layers as the current decoding target layer in the entire bitstream.

예를들어, 슬라이스 헤더에 시그널링 되는 참조계층들은 비디오 파라미터 세트에서 시그널링되는 현재 복호화 대상 계층과 동일한 계층들이 참조할 수 있는 참조계층 리스트의 서브 세트일 수 있다.For example, the reference layers signaled in the slice header may be a subset of the reference layer list that can be referenced by the same layers as the current decoding target layer signaled in the video parameter set.

이 때 복호화 장치는 공간 화질 참조 계층 리스트를 구성함에 있어서 계층들의 순서를 다양하게 구성할 수 있다.In this case, the decoding apparatus may configure the order of the layers in various ways in configuring the spatial image quality reference layer list.

일 예로 복호화 장치는 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 layer_id 값이 복호화 대상 계층의 layer_id 값과 차이가 적은 계층 (즉, 가까운 계층)부터 큰 계층 순서로 구성하여 공간 화질 참조 계층 리스트를 생성할 수 있다. For example, in the decoding apparatus, among spatial and quality reference layers having the same view as the current decoding target, the layer_id value is the layer with a small difference from the layer_id value of the decoding target layer (ie, the closest layer) to the largest layer order It is possible to create a spatial quality reference layer list by configuring .

이때, 우선 순위와 관련된 정보는 NAL 유닛 헤더(NALU header) 혹은 비디오 파라미터 세트(video parameter set) 등에 포함되어 시그널링될 수 있다In this case, priority-related information may be signaled by being included in a NAL unit header (NALU header) or a video parameter set.

또는 현재 복호화 대상 계층과 동일 시점을 가지는 참조 가능한 공간 화질 참조 계층 리스트는 현재 복호화 대상과 동일 시점을 가지는 공간 (spatial) 및 화질(quality) 참조 계층들 가운데 현재 부호화 대상 계층과 공간 해상도 차이가 적은 계층부터 큰 계층 순으로 구성될 수 있다. 이때, 동일한 공간 해상도를 갖는 화질 참조 계층의 순서는 현재 복호화 대상 계층의 layer_id 값과 차이가 작은 계층(즉, 가까운 계층)부터 큰 계층 순서일 수 있다.Alternatively, the referenceable spatial quality reference layer list having the same view as the current decoding target layer is a layer having a small spatial resolution difference from the current encoding target layer among spatial and quality reference layers having the same view as the current decoding target layer. It can be configured in the order of the largest hierarchy from In this case, the order of the picture quality reference layers having the same spatial resolution may be from a layer having a small difference from the layer_id value of the current decoding target layer (ie, a layer close to it) to a larger layer order.

대상 계층과 동일한 계층들이 참조할 수 있는 공간 화질 참조 계층 리스트가 생성되면, 복호화 장치는 현재 복호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 참조 가능한 시점 참조 계층 리스트를 구성할 수 있다.When the spatial quality reference layer list that can be referenced by the same layers as the target layer is generated, the decoding apparatus may construct a referenceable view reference layer list including the same spatial and same picture quality layers as the current decoding target layer.

예를 들어, 복호화 장치는 표 3에서 표 12까지의 방식 가운데 하나를 사용해서, 슬라이스 헤더에 시그널링 되는 참조계층들 가운데 현재 복호화 대상과 동일한 공간(spatial)과 동일한 화질(quality)을 가지는 계층들 가운데 현재 부호화 대상 계층의 시점이 다른 계층들로 시점 참조 계층 리스트를 구성할 수 있다. For example, the decoding apparatus uses one of the methods of Tables 3 to 12, among reference layers signaled to the slice header, among the layers having the same spatial and same quality as the current decoding target. The view reference layer list may be composed of layers having different views of the current encoding target layer.

또는 복호화 장치는 현재 복호화 대상 계층과 동일 공간과 동일 화질계층들로 이루어진 시점 참조 계층들이 현재 복호화 대상 시점과 가까운 시점부터 먼 시점순으로 구성된 시점 참조 계층 리스트를 생성할 수 있다.Alternatively, the decoding apparatus may generate a view reference layer list in which view reference layers composed of the same space and the same picture quality layers as the current decoding target layer are in the order of a view closer to the current decoding target view to a distant view.

최대한 참조할 수 있는 계층의 수는 비트스트림 전체에 대해 제한될 수 있으며, 이는 비디오 파라미터 세트, 시퀀스 파라미터 세트 혹은 슬라이스 헤더 등에서 시그널링 될 수도 있고, 프로파일 및 레벨에 따라 제약을 둘 수도 있다.The maximum number of layers that can be referenced may be limited for the entire bitstream, which may be signaled in a video parameter set, a sequence parameter set, or a slice header, or may be restricted according to a profile and a level.

복호화 장치는 구성된 참조 계층 리스트에 대하여 추가 시그널링(예를 들어, 슬라이스 헤더와 같은 상위 레벨의 시그널링)이 있을 경우 시그널링에서 표현하는 내용에 따라 리스트 내에서의 순서를 변경할 수 있다.
When there is additional signaling (eg, higher-level signaling such as a slice header) with respect to the configured reference layer list, the decoding apparatus may change the order in the list according to content expressed in the signaling.

다음으로, 복호화 장치는 시점참조 계층의 복호화 영상을 포함하여 현재 복호화 대상 영상의 화면 간 예측을 위한 참조픽쳐 리스트를 생성한다(S720).Next, the decoding apparatus generates a reference picture list for inter prediction of the current decoding target image including the decoded image of the view reference layer (S720).

복호화 장치는 시점 참조 계층의 복호화 영상을 포함하여 현재 복호화 대상 영상의 화면간 예측을 위한 참조픽쳐 집합(reference picture set) 구성 및 참조픽쳐 형태표시(reference picture marking)를 수행할 수 있다.The decoding apparatus may configure a reference picture set and perform reference picture marking for inter prediction of the current decoding target image including the decoded image of the view reference layer.

즉, 복호화 장치는 시점 참조 계층 리스트에 포함된 영상들로 구성된 참조픽쳐 집합(제1 집합)을 구성한다. 이 때, 시점 참조 계층 리스트에 포함된 영상이 복원된 영상으로 가용한지(available)를 확인하고 가용한 경우 해당 복원영상을 참조픽쳐 집합에 포함시키고, 가용하지 않은 경우 해당 복원영상을 “no reference picture”로 표시할 수 있다.That is, the decoding apparatus configures a reference picture set (a first set) including images included in the view reference layer list. At this time, it is checked whether the image included in the viewpoint reference layer list is available as a reconstructed image, and if available, the reconstructed image is included in the reference picture set, and if not available, the reconstructed image is set as “no reference picture”. ” can be indicated.

시점 참조 계층 리스트에 포함된 영상들로 구성된 참조픽쳐 집합은 “used for long term reference”로 표시하여 현재 복호화 대상 영상의 화면간 예측시 장기 참조 픽쳐로 취급할 수 있다.A reference picture set composed of images included in the view reference layer list may be marked as “used for long term reference” and treated as a long-term reference picture when inter prediction of a current decoding target image is performed.

제1 집합, 즉 시점 참조 계층 리스트에 포함된 영상들로 구성된 참조픽쳐 집합 이외에 복호화 장치는 현재 복호화 대상 계층과 동일한 계층의 영상으로 이루어진 화면간 예측을 위한 하기와 같이 다양한 참조픽쳐 집합들을 구성할 수 있다. In addition to the first set, that is, the reference picture set composed of images included in the view reference layer list, the decoding apparatus may configure various reference picture sets as follows for inter prediction composed of images of the same layer as the current decoding target layer. have.

참조픽쳐 집합들은 현재 복호화 대상 영상의 화면간 예측에 사용되며, 디스플레이 순서상 현재 복호화 대상 영상을 기준으로 이전인 단기 참조 픽쳐 (제2 집합), 현재 복호화 대상 영상의 화면간 예측에 사용되며, 디스플레이 순서상 현재 복호화 대상 영상을 기준으로 이후인 단기 참조 픽쳐 (제3 집합), 현재 복호화 대상 영상의 화면간 예측을 위한 장기 참조 픽쳐 (제4 집합), 현재 복호화 대상 영상 이후에 복호화할 수 있는 영상을 위한 단기 참조 픽쳐 (제5 집합), 현재 복호화 대상 영상 이후에 복호화할 수 있는 영상을 위한 장기 참조 픽쳐 (제6 집합) 중 적어도 하나 일 수 있다.The reference picture sets are used for inter prediction of the current decoding target image, and are used for inter prediction of the short-term reference picture (the second set) that is previous based on the current decoding target image in the display order, and the current decoding target image. Short-term reference pictures (third set) that are later in order based on the current decoding target image, long-term reference pictures for inter-screen prediction of the current decoding target image (fourth set), and images that can be decoded after the current decoding target image It may be at least one of a short-term reference picture (a fifth set) for , and a long-term reference picture (a sixth set) for an image that can be decoded after the current decoding target image.

복호화 장치는 참조픽쳐 집합과 참조픽쳐 형태에 따라 현재 복호화 대상 영상의 참조픽쳐 리스트를 생성할 수 있다. 즉 복호화 장치는 제1 집합과 상기 제2 집합 내지 제4 집합을 조합하여 참조픽쳐 리스트를 생성할 수 있다.The decoding apparatus may generate a reference picture list of a current decoding target image according to the reference picture set and the reference picture type. That is, the decoding apparatus may generate a reference picture list by combining the first set and the second to fourth sets.

예컨대, 복호화 장치는 현재 복호화 대상 영상의 참조픽쳐 리스트를 생성함에 있어서, 현재 복호화 대상 영상과 동일한 계층의 영상들로 이루어진 참조픽쳐 집합들로 구성된 화면간 참조영상 리스트 L0 및 L1에, 제1 집합에 포함된 시점 참조 계층 리스트로 구성된 참조픽쳐 집합을 추가하여 최종 참조픽쳐 리스트를 생성할 수 있다.For example, when the decoding apparatus generates the reference picture list of the current decoding target image, the inter-screen reference picture lists L0 and L1 including reference picture sets including images of the same layer as the current decoding target image, L0 and L1, and the first set A final reference picture list can be generated by adding a reference picture set composed of the included view reference layer list.

이 경우, 참조픽쳐 리스트 생성시마다 고정된 위치에 시점 참조 계층의 복호화 영상을 추가할 수도 있고, 효율적인 부호화를 위해 시점 참조 계층의 복호화 영상의 위치를 변경할 수도 있다.In this case, the decoded image of the view reference layer may be added to a fixed position whenever the reference picture list is generated, or the position of the decoded image of the view reference layer may be changed for efficient encoding.

참조픽쳐 리스트 생성시마다 고정된 위치에 시점 참조 계층의 복호화 영상을 추가하는 경우, L0 리스트 생성 시 제1 집합을 마지막 혹은 첫 번째(ref_idx=0) 혹은 두 번째(ref_idx=1) 위치에부터 추가할 수 있다. When the decoded image of the view reference layer is added at a fixed position each time the reference picture list is generated, the first set is added from the last or first (ref_idx=0) or second (ref_idx=1) position when the L0 list is generated. can

제1 집합을 L0 리스트의 중간 위치에 추가하는 경우, 해당위치 및 그 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점 참조 계층 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.When the first set is added to the middle position of the L0 list, the index in the list of images at the position and thereafter may be increased by the number of the added view reference layers (the number of reference picture sets composed of the reference layer list). have.

또는 복호화 장치는 L0 리스트 생성 시 제1 집합으로 첫 번째 (ref_idx=0) 혹은 두 번째 (ref_idx=1) 위치에서부터 시점 참조 계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다.Alternatively, the decoding apparatus may replace as many reference images as the number of reference picture sets including the view reference layer list from the first (ref_idx=0) or second (ref_idx=1) position as the first set when generating the L0 list.

또는 복호화 장치는 L0 리스트 생성 시 제1 집합을 임의의 시그널링된 위치에서부터 추가할 수 있다. 제1 집합을 리스트의 중간 위치에 추가하는 경우, 해당위치 및 그 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점 참조 계층 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.Alternatively, the decoding apparatus may add the first set from an arbitrary signaled position when generating the L0 list. When the first set is added to the middle position of the list, the index in the list of images at the position and thereafter may be increased by the number of added view reference layers (the number of reference picture sets composed of the reference layer list). .

또는 복호화 장치는 L0 리스트 생성 시 제1 집합으로 임의의 시그널링된 위치에서부터 시점참조계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다.Alternatively, the decoding apparatus may replace as many reference pictures as the number of reference picture sets composed of the view reference layer list from an arbitrary signaled position as the first set when generating the L0 list.

또는 복호화 장치는 L0 리스트 생성 시 제1 집합의 시점참조계층 리스트에 포함된 각각의 픽쳐들을 임의의 서로 다른 위치에 추가할 수 있다. 추가된 영상들의 해당 위치 및 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점 참조 계층 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.Alternatively, the decoding apparatus may add each picture included in the view reference layer list of the first set to arbitrary different positions when generating the L0 list. Corresponding positions of the added images and indexes in the list of subsequent images may be increased by the number of added view reference layers (the number of reference picture sets composed of the reference layer list).

또는 복호화 장치는 L0 리스트 생성 시 제1 집합의 시점참조계층 리스트에 포함된 각각의 픽쳐들로 임의의 서로 다른 위치에 있는 참조영상들을 대체할 수 있다.Alternatively, when the L0 list is generated, the decoding apparatus may replace reference images in arbitrary different positions with each picture included in the view reference layer list of the first set.

또는 복호화 장치는 L1 리스트 생성 시 제1 집합을 마지막 혹은 첫 번째 (ref_idx=0) 혹은 두 번째 (ref_idx=1) 위치에 추가할 수 있다. Alternatively, the decoding apparatus may add the first set to the last or first (ref_idx=0) or second (ref_idx=1) position when generating the L1 list.

시점 참조 계층을 L1 리스트의 중간 위치에 추가하는 경우, 해당위치 및 그 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점 참조 계층 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.When a view reference layer is added to the middle position of the L1 list, the index in the list of images at the position and thereafter may be increased by the number of the added view reference layers (the number of reference picture sets composed of the reference layer list) have.

또는 복호화 장치는 L1 리스트 생성 시 제1 집합로 첫 번째 (ref_idx=0) 혹은 두 번째 (ref_idx=1) 위치에서부터 시점 참조 계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다. Alternatively, the decoding apparatus may replace as many reference pictures as the number of reference picture sets including the view reference layer list from the first (ref_idx=0) or second (ref_idx=1) position as the first set when generating the L1 list.

또는 복호화 장치는 L1 리스트 생성 시 제1 집합을 임의의 시그널링된 위치에서부터 추가할 수 있다. 제1 집합을 리스트의 중간 위치에 추가하는 경우, 해당위치 및 그 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점 참조 계층 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.Alternatively, the decoding apparatus may add the first set from an arbitrary signaled position when generating the L1 list. When the first set is added to the middle position of the list, the index in the list of images at the position and thereafter may be increased by the number of added view reference layers (the number of reference picture sets composed of the reference layer list). .

또는 복호화 장치는 L1 리스트 생성 시 제1 집합으로 임의의 시그널링된 위치에서부터 시점참조계층 리스트로 구성된 참조픽쳐 집합의 수만큼의 참조영상들을 대체할 수 있다.Alternatively, the decoding apparatus may replace as many reference pictures as the number of reference picture sets composed of the view reference layer list from an arbitrary signaled position as the first set when generating the L1 list.

또는 복호화 장치는 L1 리스트 생성 시 제1 집합의 시점참조계층 리스트에 포함된 각각의 픽쳐들을 임의의 서로 다른 위치에 추가할 수 있다. 추가된 영상들의 해당위치 및 이후에 있던 영상들의 리스트 내의 인덱스는 추가한 시점 참조 계층 수(참조 계층 리스트로 구성된 참조픽쳐 집합의 수) 만큼씩 증가될 수 있다.Alternatively, the decoding apparatus may add each picture included in the view reference layer list of the first set to arbitrary different positions when generating the L1 list. Corresponding positions of the added images and indexes in the list of subsequent images may be increased by the number of added view reference layers (the number of reference picture sets composed of the reference layer list).

또는 복호화 장치는 L1 리스트 생성 시 제1 집합의 시점참조계층 리스트에 포함된 각각의 픽쳐들로 임의의 서로 다른 위치에 있는 참조영상들을 대체할 수 있다.Alternatively, when generating the L1 list, the decoding apparatus may replace reference images located at different positions with respective pictures included in the view reference layer list of the first set.

한편, 참조픽쳐 리스트 생성한 후, 추가로 효율적인 부호화를 위해 시점 참조 계층의 복호화 영상의 위치를 변경하는 경우, 슬라이스 헤더 혹은 픽쳐 파라미터 세트에 포함될 수 있는 부호화 파라미터를 이용하여 시점 참조 계층의 복호화 영상의 위치를 참조픽쳐 리스트의 어떠한 위치로든지 변경할 수 있다.
On the other hand, when the position of the decoded image of the view reference layer is changed for additional efficient encoding after the reference picture list is generated, the decoded image of the view reference layer is changed using encoding parameters that may be included in the slice header or the picture parameter set. The position can be changed to any position in the reference picture list.

참조 계층 리스트가 생성되면, 복호화 장치는 현재계층의 영상을 블록단위로 복호화할 수 있다(S730).When the reference layer list is generated, the decoding apparatus may decode the image of the current layer in block units (S730).

현재 계층의 현재 복호화 대상 블록이 공간 화질 참조 계층을 참조하는 경우 다음과 같이 복호화 할 수 있다.When the current decoding target block of the current layer refers to the spatial quality reference layer, decoding can be performed as follows.

일 예로, 복호화 장치는 현재 복호화 대상 영상에서 사용되는 공간 화질 참조 계층 리스트에서 현재 복호화 대상 블록이 복호화시 사용하는 참조 계층을 결정하고 해당 참조 계층의 참조 블록을 결정할 수 있다.For example, the decoding apparatus may determine a reference layer used for decoding by the current decoding object block from the spatial quality reference layer list used in the current decoding object image, and determine the reference block of the reference layer.

이때 사용되는 공간화질 참조계층 리스트는 슬라이스 헤더에서 시그널링되는 현재 복호화 대상 계층이 참조하는 참조계층 리스트로부터 구성될 수 있다. 만약에 슬라이스 헤더에서 참조계층 리스트를 시그널링하지 않는다면, 비디오 파라미터 세트에서 시그널링되는 전체 비트스트림에서 현재 복호화 대상계층과 동일한 계층들이 참조할 수 있는 참조계층들로부터 구성될 수 있다.In this case, the spatial quality reference layer list used may be configured from the reference layer list referenced by the current decoding target layer signaled in the slice header. If the reference layer list is not signaled in the slice header, the same layers as the current decoding target layer in the entire bitstream signaled in the video parameter set may be configured from reference layers that can be referenced.

복호화 장치는 복호화 대상 블록단위로 시그널링되는 공간 화질 참조 계층을 나타내는 인덱스에 따라 공간 화질 참조 계층을 결정할 수 있다.The decoding apparatus may determine the spatial quality reference layer according to an index indicating the spatial quality reference layer signaled in units of a decoding object block.

공간 화질 참조 계층이 결정되면, 복호화 장치는 결정된 공간 화질 참조 계층에서 현재 복호화 대상 블록에 대응되는 참조 블록을 결정할 수 있다.When the spatial quality reference layer is determined, the decoding apparatus may determine a reference block corresponding to the current decoding object block from the determined spatial quality reference layer.

참조 계층의 참조 블록은 현재 계층의 현재 복호화 대상 블록에 대응되는 참조 계층의 블록을 의미하며, 예컨대, 참조 계층에서 현재 복호화 대상 블록과 동일 위치에 존재하는 블록을 의미할 수 있다.The reference block of the reference layer means a block of the reference layer corresponding to the current decoding object block of the current layer, and may mean, for example, a block existing in the same position as the current decoding object block in the reference layer.

예를 들어, 도 4에서 layer_id가 n 인 계층의 복호화 대상 블록에 대응되는 공간 화질 참조 계층을 결정함에 있어서, 시점 1에서 layer_id가 n 인 계층의 공간 화질 참조 계층 리스트에 layer_id가 n-1 및 layer_id가 n-2인 영상이 포함되어 있고, 현재 복호화 대상 블록의 공간 화질 참조 계층 인덱스가 “1”인 경우 현재 복호화 대상 블록은 layer_id가 n-2인 계층을 공간 화질 참조 계층으로 하고, 공간 화질 참조 계층에서 현재 복호화 대상 블록에 대응되는 참조 블록을 결정할 수 있다.For example, in determining the spatial quality reference layer corresponding to the decoding target block of the layer having the layer_id of n in FIG. 4 , layer_id is n-1 and layer_id in the spatial quality reference layer list of the layer having the layer_id of n at time 1 If an image with n-2 is included and the spatial quality reference layer index of the current decoding target block is “1”, the current decoding target block uses the layer whose layer_id is n-2 as the spatial quality reference layer, and refers to the spatial quality The layer may determine a reference block corresponding to the current decoding object block.

그런 다음, 복호화 장치는 선택된 공간 화질 참조 계층의 참조 블록의 정보들 중 참조 블록의 복원샘플, 참조 블록의 레지듀얼, 참조 블록의 부호화 파라미터(예를 들어, 참조 프레임, 움직임 벡터, 예측 모드, 블록 파티셔닝 정보 등) 중 적어도 하나를 이용하여 대상 블록에 대한 복호화를 수행할 수 있다.Then, the decoding apparatus determines a reconstructed sample of a reference block, a residual of the reference block, and encoding parameters (eg, reference frame, motion vector, prediction mode, block) of the reference block among the information of the reference block of the selected spatial quality reference layer. Partitioning information, etc.) may be used to decode the target block.

한편, 현재 계층의 현재 복호화 대상 블록이 화면간 예측을 하는 경우 복호화 장치는 참조픽쳐 리스트내의 참조픽쳐를 이용해서 현재 복호화 대상 블록에 대한 움직임 보상을 수행할 수 있다.Meanwhile, when the current decoding object block of the current layer performs inter prediction, the decoding apparatus may perform motion compensation on the current decoding object block by using the reference picture in the reference picture list.

이 경우, 복호화 장치는 단계 S720에서 생성된 시점참조 계층의 복호화 영상을 포함한 참조픽쳐 리스트내의 참조픽쳐를 이용하여 통상적인 화면간 예측방법으로 현재 복호화 대상 영상에 대한 움직임 보상을 수행할 수 있다.In this case, the decoding apparatus may perform motion compensation on the current image to be decoded by using the reference picture in the reference picture list including the decoded image of the view reference layer generated in step S720 by a typical inter prediction method.

상술한 실시예에서, 방법들은 일련의 단계 또는 블록으로서 순서도를 기초로 설명되고 있으나, 본 발명은 단계들의 순서에 한정되는 것은 아니며, 어떤 단계는 상술한 바와 다른 단계와 다른 순서로 또는 동시에 발생할 수 있다. 또한, 당해 기술 분야에서 통상의 지식을 가진 자라면 순서도에 나타난 단계들이 배타적이지 않고, 다른 단계가 포함되거나, 순서도의 하나 또는 그 이상의 단계가 본 발명의 범위에 영향을 미치지 않고 삭제될 수 있음을 이해할 수 있을 것이다.In the embodiments described above, the methods are described on the basis of a flowchart as a series of steps or blocks, but the present invention is not limited to the order of steps, and some steps may occur in a different order or concurrently with other steps as described above. have. In addition, those of ordinary skill in the art will recognize that the steps shown in the flowchart are not exclusive, other steps may be included, or that one or more steps in the flowchart may be deleted without affecting the scope of the present invention. You will understand.

상술한 실시예는 다양한 양태의 예시들을 포함한다. 다양한 양태들을 나타내기 위한 모든 가능한 조합을 기술할 수는 없지만, 해당 기술 분야의 통상의 지식을 가진 자는 다른 조합이 가능함을 인식할 수 있을 것이다. 따라서, 본 발명은 이하의 특허청구범위 내에 속하는 모든 다른 교체, 수정 및 변경을 포함한다고 할 것이다.The above-described embodiments include examples of various aspects. It is not possible to describe every possible combination for representing the various aspects, but one of ordinary skill in the art will recognize that other combinations are possible. Accordingly, it is intended that the present invention cover all other substitutions, modifications and variations falling within the scope of the following claims.

100 : 영상 부호화 장치 111: 움직임 예측부
112: 움직임 보상부 120 : 인트라 예측부
115 : 스위치 125 : 감산기
130 : 변환부 140 : 양자화부
150 : 엔트로피 부호화부 160 : 역양자화부
170 : 역변환부 180 : 필터부100: video encoding device 111: motion prediction unit
112: motion compensation unit 120: intra prediction unit
115: switch 125: subtractor
130: transform unit 140: quantization unit
150: entropy encoding unit 160: inverse quantization unit
170: inverse transform unit 180: filter unit

Claims

In the video decoding method supporting a plurality of layers,
generating a reference layer list that can be referenced by an image of a target layer that is a current decoding target;
generating a reference picture list including at least one of a decoded image of a view reference layer and a decoded image of a spatial quality reference layer for inter prediction of the image of the target layer;
Predicting and decoding the image of the target layer in block units with reference to the reference picture list,
The step of generating the reference picture list comprises:
constructing a first set including the decoded image of the view reference layer;
constructing a second set comprising an image of the same layer as the image of the target layer;
combining the first set and the second set;
The picture included in the first set is displayed as a long-term reference picture.

According to claim 1,
The step of generating the reference layer list comprises:
and generating a spatial quality reference layer list and a view reference layer list that can be referenced by the same layer as the target layer in the entire bitstream.

3. The method of claim 2,
The image decoding method according to claim 1, wherein the spatial quality reference layer list includes layers having the same viewpoint as the target layer.

3. The method of claim 2,
The view reference layer list is an image decoding method, characterized in that it is composed of a layer having the same space and quality as the target layer.

delete

According to claim 1,
The images included in the first set are added to any one of the first, second, and last of the reference picture list.

According to claim 1,
Predicting and decoding the image in block units refers to the spatial quality reference layer,
Predicting and decoding the image in block units includes:
determining a reference layer used for decoding by a current decoding object block from the spatial quality reference layer list;
determining a reference block corresponding to the target block in the determined spatial quality reference layer;
and decoding the target block by using at least one of a reconstructed sample of the reference block, a residual of the reference block, and an encoding parameter of the reference block.

According to claim 1,
Predicting and decoding the image in block units includes:
The image decoding method according to claim 1, wherein inter prediction is performed on the current decoding object block by using the reference picture in the reference picture list.

delete

In the encoding method of an image supporting a plurality of layers,
generating a reference layer list that can be referenced by an image of a target layer that is a current encoding target;
generating a reference picture list including at least one of an image of a view reference layer and an image of a spatial quality reference layer for inter prediction of the image of the target layer;
Predicting and encoding the image of the target layer in block units with reference to the reference picture list,
The step of generating the reference picture list comprises:
constructing a first set including the decoded image of the view reference layer;
constructing a second set comprising an image of the same layer as the image of the target layer;
combining the first set and the second set;
The picture included in the first set is displayed as a long-term reference picture.

A recording medium for storing a bitstream generated by an image encoding method supporting a plurality of layers, the recording medium comprising:
generating a reference layer list that can be referenced by an image of a target layer that is a current encoding target;
generating a reference picture list including at least one of an image of a view reference layer and an image of a spatial quality reference layer for inter prediction of the image of the target layer;
Predicting and encoding the image of the target layer in block units with reference to the reference picture list,
The step of generating the reference picture list comprises:
constructing a first set including the decoded image of the view reference layer;
constructing a second set comprising an image of the same layer as the image of the target layer;
combining the first set and the second set;
A picture included in the first set is displayed as a long-term reference picture.