KR100728222B1

KR100728222B1 - Hierarchical Video Encoding/Decoding Method for Complete Spatial Scalability and Apparatus thereof

Info

Publication number: KR100728222B1
Application number: KR1020060026899A
Authority: KR
Inventors: 강정원; 최해철; 김재곤; 홍진우; 노용만; 배태면; 정용주; 청꽁탕
Original assignee: 한국전자통신연구원; 한국정보통신대학교 산학협력단
Priority date: 2005-03-25
Filing date: 2006-03-24
Publication date: 2007-06-13
Also published as: KR20060103226A; WO2006112620A1; EP1862010A1; EP1862010A4

Abstract

본 발명은 스케일러블 비디오 부호화/복호화 방법 및 장치에 관한 것으로, 더욱 상세하게는 공간영역에서의 완전한 스케일러빌리티를 제공하기 위하여 서로 다른 공간해상도와 다양한 위치를 가지는 다중 영역(multi regeion)들을 해상도에 따라 계층적으로 부호화/복호화하는 방법 및 장치에 관한 것이다.The present invention relates to a scalable video encoding / decoding method and apparatus. More particularly, in order to provide complete scalability in a spatial domain, multi regeions having different spatial resolutions and various positions according to resolution may be used. A method and apparatus for hierarchically encoding / decoding.

본 발명은 상기 목적을 달성하기 위한 본 발명은 비디오 영상을 부호화하는 부호화기로서, 상기 비디오 영상 내에서 부호화하고자 하는 복수개의 영역(ROI)에 대한 부호화 영역 정보를 입력받아, 상기 ROI 사이의 중복영역(OR)을 검출하는 OR 검출부; 상기 비디오 영상, ROI, 및 검출된 OR을 부호화하고자 하는 해상도에 따라 복수의 계층(Layer)에 배치하는 영역 배치부; 및 상기 비디오 영상, ROI, 및 검출된 OR을 상기 영역 배치부에서 배치된 계층의 해상도에 따라 부호화하는 영역 부호화부를 포함하여 이루어진다. 상기 부호화 영역 정보는 상기 ROI의 비디오 영상 내 위치 정보와 상기 ROI의 부호화 해상도 정보를 포함한다. According to an aspect of the present invention, there is provided an encoder for encoding a video image, and receives encoding region information about a plurality of regions (ROIs) to be encoded in the video image, and overlaps the region between the ROIs. OR detection unit for detecting OR); A region arranging unit which arranges the video image, the ROI, and the detected OR in a plurality of layers according to a resolution to be encoded; And an area encoder which encodes the video image, the ROI, and the detected OR according to the resolution of the layer arranged in the area arrangement unit. The encoding region information includes position information in a video image of the ROI and encoding resolution information of the ROI.

본 발명은, 영상 내 ROI를 정의함으로써 해상도 뿐만 아니라 공간적으로도 완전한 스케일러빌리티를 가능케 하는 비디오 부호화/복호화 장치를 제공할 수 있는 효과가 있다. 또한 본 발명은, 복수의 ROI 영역 사이의 계층간 중복성을 고려함으로써 부호화 효율을 높일 수 있는 효과가 있다.The present invention has the effect of providing a video encoding / decoding apparatus that enables full scalability not only in resolution but also spatially by defining an ROI in an image. In addition, the present invention has the effect of increasing the coding efficiency by considering the inter-layer redundancy between the plurality of ROI regions.

SVC(Scalable Video Coding), MPEG-4, 해상도, 계층, ROI, inter-layer coding, intra-layer coding Scalable Video Coding (SVC), MPEG-4, Resolution, Layer, ROI, inter-layer coding, intra-layer coding

Description

Hierarchical Video Encoding / Decoding Method for Complete Spatial Scalability and Apparatus

도 1은 본 발명에 따른 공간적 스케일러빌리티를 제공하는 영상 처리 시스템의 일실시예 구성도,1 is a block diagram of an embodiment of an image processing system for providing spatial scalability according to the present invention;

도 2는 본 발명에 따른 도 1의 부호화기(110)의 일실시예 구성도,2 is a block diagram of an embodiment of the encoder 110 of FIG. 1 according to the present invention;

도 3은 본 발명에 따른 중복영역 검출부(210)의 동작을 설명하기 위한 일실시예 흐름도,3 is a flowchart illustrating an operation of the overlapped area detector 210 according to the present invention;

도 4는 본 발명에 따른 중복영역 검출 과정을 설명하기 위한 일실시예 흐름도,4 is a flowchart illustrating an example of a process for detecting overlapping areas according to the present invention;

도 5는 본 발명의 일실시예에 따라 다중 계층 기반 부호화 방식이 적용되는 경우의 영역(Region 및 Overlapped Region)들에 대한 계층(layer)별 배치도,FIG. 5 is a layout diagram according to layers for regions and overlapped regions when a multi-layer based coding scheme is applied according to an embodiment of the present invention; FIG.

도 6은 본 발명의 일실시예에 따라 일 계층 기반 부호화 방식이 적용되는 경우의 영역(Region 및 Overlapped Region)들에 대한 계층(layer)별 배치도,6 is a layout diagram according to layers for regions and overlapped regions when a layer-based coding scheme is applied according to an embodiment of the present invention;

도 7은 다중 계층 기반 부호화 방식인 경우의 각 영역에 대한 계층별 배치 방법을 설명하기 위한 일실시예 흐름도,7 is a flowchart illustrating a method of arranging layers according to layers in a multi-layer based coding scheme;

도 8은 일 계층 기반 부호화 방식인 경우의 각 영역에 대한 계층별 배치 방법을 설명하기 위한 일실시예 흐름도,8 is a flowchart illustrating an example of a layer-based arrangement method for each region in the case of a layer-based encoding scheme;

도 9는 본 발명에 따라 다중 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부의 일실시예 구성도,9 is a diagram illustrating an embodiment of a region encoding unit when a multi-layer based encoding scheme is applied according to the present invention;

도 10은 본 발명에 따라 일 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부의 일실시예 구성도,10 is a diagram illustrating an embodiment of a region encoding unit in the case where a one-layer based encoding scheme is applied according to the present invention;

도 11은 본 발명에 따라 다중 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부의 동작을 설명하기 위한 일실시예 흐름도,11 is a flowchart illustrating an operation of an area encoder in a case where a multi-layer based coding scheme is applied according to the present invention;

도 12는 본 발명에 따라 다중 계층 기반 부호화 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대하여 출력되는 스트림의 부호화 모드를 결정하는 방법을 설명하기 위한 일실시예 흐름도,12 is a flowchart illustrating a method of determining an encoding mode of a stream output for a block of a selection region when a multi-layer based encoding scheme is applied according to the present invention;

도 13은 본 발명에 따라 일 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부의 동작을 설명하기 위한 일실시예 흐름도,FIG. 13 is a flowchart illustrating an operation of an area coder when one layer-based encoding is applied according to the present invention; FIG.

도 14는 본 발명에 따라 일 계층 기반 부호화 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대하여 출력되는 스트림의 부호화 모드를 결정하는 방법을 설명하기 위한 일실시예 흐름도,14 is a flowchart illustrating a method of determining an encoding mode of a stream output for a block of a selected region when a hierarchical coding scheme is applied according to the present invention;

도 15는 본 발명의 일실시예에 따른 움직임 예측/보상 모드(motion prediction/compensation mode)와 인터 레이어 코딩 모드(inter-layer coding mode)에서 사용되는 내삽(interpolation) 방법을 설명하기 위한 도면,FIG. 15 illustrates an interpolation method used in a motion prediction / compensation mode and an inter-layer coding mode according to an embodiment of the present invention; FIG.

도 16은 본 발명의 제1 실시예에 따른 영상 내 ROI가 존재함을 나타내는 플 래그(flag)가 추가된 SVC 비트스트림을 보여주는 도면,FIG. 16 is a view showing an SVC bitstream added with a flag indicating that there is an ROI in an image according to a first embodiment of the present invention; FIG.

도 17은 본 발명의 제2 실시예에 따른 영상 내 ROI가 존재함을 나타내는 플래그(flag)가 추가된 SVC 비트스트림을 보여주는 도면,FIG. 17 is a view showing an SVC bitstream to which a flag indicating that there is an ROI in an image according to a second embodiment of the present invention; FIG.

도 18은 본 발명의 제3 실시예에 따른 영상 내 ROI가 존재함을 나타내는 플래그(flag)가 추가된 SVC 비트스트림을 보여주는 도면,18 is a view showing an SVC bitstream to which a flag indicating that there is an ROI in an image according to a third embodiment of the present invention;

도 19는 다중 계층 부호화 방식이 적용되는 경우 인터 레이어 코딩(inter-layer coding)에 있어서의 영역 간의 부호화 종속성을 보여주는 도면,19 is a diagram illustrating encoding dependencies between regions in inter-layer coding when a multi-layer coding scheme is applied;

도 20은 일 계층 부호화 방식이 적용되는 경우 인터 레이어 코딩(inter-layer coding)에 있어서의 영역 간의 부호화 종속성을 보여주는 도면,20 is a diagram illustrating encoding dependencies between regions in inter-layer coding when one layer coding scheme is applied;

도 21은 본 발명의 일실시예에 따른 관련된 하위 계층(Layer) 영역(Region)에 대한 정보를 포함하는 부호화 스트림의 신택스 구조를 보여주는 도면,21 is a diagram illustrating a syntax structure of an encoded stream including information about an associated lower layer region according to an embodiment of the present invention;

도 22는 본 발명에 따른 도 1의 복호화기(120)의 일실시예 구성도,22 is a block diagram of an embodiment of the decoder 120 of FIG. 1 according to the present invention;

도 23은 본 발명에 따라 다중 계층 기반 방식이 적용되는 경우의 영역 복호화부의 동작을 설명하기 위한 일실시예 흐름도,23 is a flowchart illustrating an operation of an area decoding unit when a multi-layer based method is applied according to the present invention;

도 24는 본 발명에 따라 다중 계층 기반 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대한 부호화 스트림의 복호화 방법을 설명하기 위한 일실시예 흐름도,24 is a flowchart illustrating a method of decoding an encoded stream for a block of a selected region when a multi-layer based scheme is applied according to the present invention;

도 25는 본 발명에 따라 일 계층 기반 방식이 적용되는 경우의 영역 복호화부의 동작을 설명하기 위한 일실시예 흐름도,25 is a flowchart illustrating an operation of an area decoder in a case where a one-layer based scheme is applied according to the present invention;

도 26은 본 발명에 따라 일 계층 기반 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대한 부호화 스트림의 복호화 방법을 설명하기 위한 일실시예 흐름도이다.FIG. 26 is a flowchart illustrating a method of decoding an encoded stream for a block of a selected region when a one-layer based scheme is applied according to the present invention.

* 도면의 주요 부분에 대한 부호의 설명* Explanation of symbols for the main parts of the drawings

110: 부호화기 120: 복호화기110: encoder 120: decoder

130: 사용자 인터페이스부 140: 표시부130: user interface unit 140: display unit

210: 중복영역(OR) 검출부 220: 영역(Region) 배치부210: overlapped area (OR) detection unit 220: region arrangement unit

230: 영역(Region) 부호화부 2210: 복호 영역(Region) 추출부230: Region encoding unit 2210: Decoding region extraction unit

2220: 영역(Region) 복호화부2220: region decoder

JPEG 2000 표준[ISO/IEC JTC1 15444-2: JPEG 2000 Image Coding System; Extensions, 2004]은 영상의 서로 다른 부분을 다양한 비트율(bit rate)로 부호화할 수 있도록 지원한다. 즉 영상 내 특정 영역이 상기 영상의 다른 부분보다 더 높 은 비트율로 부호화될 수 있다. 이러한 영상 내 특정 영역에 대한 부호화는 지난 몇 년 동안 핵심 논점이 되고 있다. 이와 관련하여 상기 JPEG 2000에서는 영상 내 특정 영역을 ROI(Region of Interest)로 정의하고 이를 스케일링 기반 방법(scaling based method)를 이용하여 영상의 다른 부분보다 먼저 부호화함으로써, 복호화기에서 상기 ROI가 공간적 스케일러빌리티를 가지고 독립적으로 복호화할 수 있도록 하고 있다. 그러나 상기와 같은 JPEG 2000의 부호화 방법은 기본적으로 웨이블릿 기반(wavelet-based)의 정지영상 부호화 방법이기 때문에, MPEG을 기반으로 하는 비디오 영상 부호화에는 적용할 수 없다는 문제점이 있다.JPEG 2000 standard [ISO / IEC JTC1 15444-2: JPEG 2000 Image Coding System; Extensions, 2004] supports encoding of different parts of an image at various bit rates. That is, a specific region in the image may be encoded at a higher bit rate than other portions of the image. Coding of specific regions in these images has been a key issue in the last few years. In relation to this, in JPEG 2000, a specific region in an image is defined as a region of interest (ROI) and encoded before the other part of the image by using a scaling based method, so that the ROI is spatially scaled in the decoder. It has the ability to independently decode. However, since the encoding method of JPEG 2000 is basically a wavelet-based still image encoding method, it cannot be applied to MPEG-based video image encoding.

MPEG 기반의 부호화 방법은 비디오 영상을 부호화하기 위한 일반적인 방법으로서, 특히 그 중 MPEG-4[ISO/IEC JTC1 14496-2: Coding of Audio-Visual Objects, Part 2, 1998]의 객체기반 부호화 방법(object based encoding)은 하나 이상의 영역을 포함하는 영상에 대하여 각 영역들이 독립적으로 복호화될 수 있는 조건을 만족시킬 수 있는 부호화 방법을 제공하고 있다. 상기 영상 내 포함되는 복수개의 영역은 임의의 형태를 가지는 2차원 영역으로서, 각각 독립적으로 부호화되거나, 움직임 예측(motion estimation)과 잔여부호화(residual coding) 및 SADCT(Shape-Adaptive DCT)를 통해 부호화된다. 그러나 상기와 같은 MPEG-4의 객체기반 부호화 방법은 영상 내 포함되는 복수개의 영역이 상기 영상과 다른 공간 해상도를 가지는 경우에 대한 부호화 방법을 제공할 수 없다는 문제점이 있다.The MPEG-based encoding method is a general method for encoding a video image, and in particular, the object-based encoding method of MPEG-4 [ISO / IEC JTC1 14496-2: Coding of Audio-Visual Objects, Part 2, 1998] based encoding) provides an encoding method capable of satisfying a condition that each region may be independently decoded for an image including one or more regions. The plurality of regions included in the image are two-dimensional regions having an arbitrary shape, and are independently encoded or encoded through motion estimation, residual coding, and SADCT (Shape-Adaptive DCT). . However, there is a problem in that the object-based encoding method of MPEG-4 cannot provide an encoding method for a case where a plurality of regions included in an image have a different spatial resolution than the image.

현재 부호화된 비디오 신호로부터 해상도가 낮은 비디오 신호나 프레임율(frame rate)이 다른 부호화 신호를 추출함으로써 전제되는 부호화 신호 없이 다양 한 해상도와 프레임율을 가지는 비디오 신호를 복호화하기 위한 스케일러블 비디오 부호화(scalable video coding)에 대한 연구가 활발하게 진행되고 있다. H.262/MPEG-2 Visual[ISO/IEC JTC1 13818-2: Generic Coding of Moving Pictures and Associated Audio Information, Part 2; Video, 1994], H.263[ITU-T: Video Coding for Low Bitrate Communication, 1995(version 1), 1998(version 2)], MPEG-4 Visual[ISO/IEC JTC1 14496-2: Coding of Audio-Visual Objects, Part 2, 1998] 등에서는 공간 해상도 가변 부호화를 위하여 계층(layer) 기반의 부호화 방법을 사용함으로써 동일한 영상이 서로 다른 공간 해상도로 복호화될 수 있도록 지원한다. 이러한 계층(layer) 기반의 부호화 방법은 원래의 영상보다 낮은 해상도로 만든 영상을 기초 계층(base layer)으로 부호화하고, 상기 부호화된 영상의 정보를 이용하여 생성되는 더 높은 공간 해상도의 영상을 다음 계층(next layer)으로 부호화하는 방법이다. 최근 PDA, 핸드폰, DMB폰, SDTV, HDTV 등의 다양한 비디오 단말기의 등장과 함께 다양한 동영상 서비스에 대한 필요성이 증가하고 있으며, 이에 따라 다양한 해상도는 물론 다양한 위치에서의 영상을 복호화할 수 있도록 지원하는 비디오 부호화 방법이 필요한 상황이다. 하지만, 상기와 같은 종래의 계층(layer) 기반 부호화 방법들은 영상 내 다른 공간 해상도를 가지는 영역들이 존재하거나 영상 내에서 다양한 위치를 가지는 영역들이 존재하는 경우에 대하여는 적용이 어려운 문제점이 있다. 또한, 상기와 같은 종래의 계층(layer) 기반 부호화 방법을 그대로 영상 내 복수개의 영역에 적용하는 경우 영역 간의 중복영역이 반복적으로 부호화되기 때문에, 스케일러블 비디오 부호화 체계(scalable video coding scheme)에 있어서 부호화 효율이 크게 저하된다는 문제점이 있다. Scalable video coding for decoding video signals having various resolutions and frame rates without encoding signals that are presumed by extracting low resolution video signals or encoding signals having different frame rates from the currently encoded video signal Video coding is being actively researched. H.262 / MPEG-2 Visual [ISO / IEC JTC1 13818-2: Generic Coding of Moving Pictures and Associated Audio Information, Part 2; Video, 1994], H.263 [ITU-T: Video Coding for Low Bitrate Communication, 1995 (version 1), 1998 (version 2)], MPEG-4 Visual [ISO / IEC JTC1 14496-2: Coding of Audio- Visual Objects, Part 2, 1998] supports the same image to be decoded at different spatial resolutions by using a layer-based encoding method for spatial resolution variable encoding. This layer-based encoding method encodes an image made with a lower resolution than the original image into a base layer, and encodes a higher spatial resolution image generated using information of the encoded image into a next layer. (next layer). Recently, with the advent of various video terminals such as PDAs, mobile phones, DMB phones, SDTVs, and HDTVs, the need for various video services is increasing. Accordingly, video that supports decoding of images at various locations as well as various resolutions There is a need for an encoding method. However, the above-described conventional layer-based coding methods have a problem in that it is difficult to apply to the case where regions having different spatial resolutions in the image or regions having various positions in the image exist. In addition, when the above-described conventional layer-based coding method is applied to a plurality of regions in an image as it is, overlapping regions between regions are repeatedly encoded, and thus, coding in a scalable video coding scheme is possible. There is a problem that the efficiency is greatly reduced.

본 발명은 상기 문제점을 해결하기 위하여 제안된 것으로, 영상 내 다양한 위치와 다양한 해상도를 가지는 영역들이 독립적으로 복호화될 수 있도록 하는 공간 영역에서 완전한 스케일러빌리티를 제공하는 부호화/복호화 장치 및 방법을 제공하는데 그 목적이 있다.The present invention has been proposed to solve the above problems, and provides an encoding / decoding apparatus and method for providing full scalability in a spatial domain so that regions having various positions and various resolutions within an image can be independently decoded. There is a purpose.

또한, 본 발명은 상기와 같은 영역들의 부호화에 있어 계층 사이의 중복성을 고려한 효율적인 계층적 부호화/복호화 장치 및 방법을 제공하는데 그 목적이 있다.Another object of the present invention is to provide an efficient hierarchical encoding / decoding apparatus and method considering the redundancy between layers in encoding of the above areas.

또한, 본 발명은 사용자가 원하는 임의의 영역을 선택하여 원하는 해상도로 시청할 수 있도록 하기 위한 스케일러블 부호화/복호화 장치를 제공하는데 그 목적이 있다.Another object of the present invention is to provide a scalable encoding / decoding apparatus for selecting a user's desired region and viewing the image at a desired resolution.

발명의 다른 목적 및 장점들은 하기의 설명에 의해서 이해될 수 있으며, 본 발명의 실시예에 의해 보다 분명하게 알게 될 것이다. 또한, 본 발명의 목적 및 장점들은 특허 청구 범위에 나타낸 수단 및 그 조합에 의해 실현될 수 있음을 쉽게 알 수 있을 것이다.Other objects and advantages of the invention can be understood by the following description, and become apparent with reference to the embodiments of the present invention. Also, it will be readily appreciated that the objects and advantages of the present invention may be realized by the means and combinations thereof indicated in the claims.

상기 목적을 달성하기 위한 본 발명은 비디오 영상을 부호화하는 부호화기로 서, 상기 비디오 영상 내에서 부호화하고자 하는 복수개의 영역(ROI)에 대한 부호화 영역 정보를 입력받아, 상기 ROI 사이의 중복영역(OR)을 검출하는 OR 검출부; 상기 비디오 영상, ROI, 및 검출된 OR을 부호화하고자 하는 해상도에 따라 복수의 계층(Layer)에 배치하는 영역 배치부; 및 상기 비디오 영상, ROI, 및 검출된 OR을 상기 영역 배치부에서 배치된 계층의 해상도에 따라 부호화하는 영역 부호화부를 포함한다. 상기 부호화 영역 정보는, 상기 ROI의 비디오 영상 내 위치 정보와 상기 ROI의 부호화 해상도 정보를 포함한다. 상기 영역 부호화부는, 하위 계층에 배치된 영역을 이용하여 블록 단위로 부호화하는 인터 레이어 코딩(inter-layer coding)을 수행한다. 또한, 상기 인터 레이어 코딩(inter-layer coding)에서 ROI 영역 밖의 화소를 이용하여 내삽(interpolation)을 수행하여야 하는 경우 외삽(extrapolation)을 통해 상기 ROI 영역 밖의 화소값을 결정한 후 내삽을 수행한다. 상기 부호화 스트림에는 상기 비디오 영상 내 ROI가 존재함을 나타내는 플래그가 추가된다. 상기 인터 레이어 코딩(inter-layer coding)에서 ROI 영역 밖의 화소를 이용하여 내삽(interpolation)을 수행하여야 하는 경우에는 상기 내삽(interpolation)을 수행하지 않고 인트라 레이어 코딩(intra-layer coding)을 수행할 수 있다.According to an aspect of the present invention, an encoder for encoding a video image receives input encoding region information about a plurality of regions (ROIs) to be encoded in the video image, and overlaps the region between the ROIs. OR detection unit for detecting the; A region arranging unit which arranges the video image, the ROI, and the detected OR in a plurality of layers according to a resolution to be encoded; And an area encoder which encodes the video image, the ROI, and the detected OR according to the resolution of the layer arranged in the area arrangement unit. The encoding region information includes position information in a video image of the ROI and encoding resolution information of the ROI. The area encoder performs inter-layer coding for encoding in block units by using an area disposed in a lower layer. In the inter-layer coding, when interpolation is performed using pixels outside the ROI region, interpolation is performed after determining pixel values outside the ROI region through extrapolation. A flag indicating that there is an ROI in the video image is added to the encoded stream. In the inter-layer coding, when interpolation is to be performed using pixels outside the ROI region, intra-layer coding may be performed without performing interpolation. have.

또한, 상기 목적을 달성하기 위한 본 발명의 복호화기는, 복호화하고자 하는 영역에 대한 선택 정보를 입력받아 복수의 ROI에 대한 부호화 정보를 포함하는 부호화 스트림으로부터 상기 선택 영역에 대응되는 ROI의 복호화를 위해 필요한 영역 정보를 추출하는 복호영역 추출부; 및 상기 추출된 영역 정보를 입력받아 복호화를 수행함으로써 비디오 전체 영상 내 ROI에 대한 영상 신호를 복원하는 영역 복호화부를 포함한다. 상기 선택 정보는, 상기 복호화하고자 하는 영역의 위치 정보와 해상도 정보를 포함한다. 상기 복호영역 추출부는,상기 부호화스트림으로부터 상기 선택 영역에 대응되는 ROI의 부호화 스트림을 추출하여, 상기 ROI를 복호화하는데 필요한 하위 계층의 관련 영역 정보를 추출한다. 또한, 상기 복호영역 추출부는, 낮은 해상도의 영역이 배치되는 하위 계층의 영역을 이용하여 블록 단위로 인터 레이어 디코딩(inter-layer decoding)을 수행한다. 상기 인터 레이어 디코딩(inter-layer decoding)에서 ROI 경계에 위치하는 블록은 인트라 레이어 디코딩(intra-layer decoding)을 수행한다. 한편, 상기 복수의 ROI는 인터랙티브 복호화를 지원할 수 있도록 작은 직사각형 영역으로 구성될 수 있다. In addition, the decoder of the present invention for achieving the above object is required for decoding the ROI corresponding to the selection region from the encoding stream that receives the selection information for the region to be decoded and includes encoding information for a plurality of ROI. A decoding area extraction unit for extracting area information; And an area decoder configured to recover the image signal of the ROI in the entire video by decoding the received area information. The selection information includes position information and resolution information of the region to be decoded. The decoding region extractor extracts an encoded stream of an ROI corresponding to the selected region from the encoded stream, and extracts relevant region information of a lower layer necessary to decode the ROI. The decoding region extractor performs inter-layer decoding on a block-by-block basis using a region of a lower layer in which a region of low resolution is disposed. In the inter-layer decoding, a block located at an ROI boundary performs intra-layer decoding. On the other hand, the plurality of ROI may be composed of a small rectangular area to support interactive decoding.

한편 본 발명의 방법은 공간 스케일러빌리티를 제공하는 부호화 방법으로서, 입력되는 비디오 영상에 대하여 부호화를 수행할 복수의 ROI에 대한 위치 정보와 해상도 정보를 입력받는 입력 단계; 상기 ROI를 해상도에 따라 계층별로 배치하는 단계; 및 상기 배치된 ROI에 대하여 블록 단위의 부호화를 수행하는 부호화 단계를 포함한다. 상기 부호화 단계는, 하위 계층의 ROI 정보를 이용하는 인터 레이어 코딩을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록에 대하여 인트라 레이어 코딩을 수행한다. 또한, 시간적 상관 관계를 가지는 동일 계층의 ROI 정보를 이용하는 움직임 예측/보상 코딩(motion prediction/compensation coding)을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록의 움직임 정보에 대하여 정수 화소(integer pel) 단위 예측을 수행한다. 또한, 하위 계층의 ROI 정보를 이용하는 인터 레이어 코딩을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록에 대하여 외삽(extrapolaton)을 통해 외부 화소값을 결정한 후 내삽을 수행한다. 또한, 시간적 상관 관계를 가지는 동일 계층의 ROI 정보를 이용하는 움직임 예측/보상 코딩(motion prediction/compensation coding)을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록에 대하여 외삽(extrapolaton)을 통해 외부 화소값을 결정한 후 내삽을 수행한다. 상기 ROI의 위치 정보는 비디오 영상에 대한 매크로 블록 스캔 번호로 표현된다. The method of the present invention provides an encoding method for providing spatial scalability, comprising: an input step of receiving position information and resolution information of a plurality of ROIs to be encoded with respect to an input video image; Disposing the ROI for each layer according to a resolution; And an encoding step of performing encoding on a block basis with respect to the arranged ROI. In the encoding step, when inter-layer coding using ROI information of a lower layer is performed, intra-layer coding is performed on a block requiring interpolation using pixels outside the ROI. In addition, when performing motion prediction / compensation coding using ROI information of the same layer having a temporal correlation, the motion information of a block requiring interpolaton using pixels outside the ROI is included. Integer pel unit prediction is performed with respect to each other. In addition, when inter-layer coding using ROI information of a lower layer is performed, an external pixel value is determined through extrapolation for an block requiring interpolaton using pixels outside the ROI and then interpolated. . In addition, when performing motion prediction / compensation coding using ROI information of the same layer having a temporal correlation, extrapolation (block) that requires interpolaton using pixels outside the ROI is performed. After determining the external pixel value through extrapolaton), interpolation is performed. The location information of the ROI is represented by a macro block scan number for the video image.

또한, 본 발명의 방법은 공간 스케일러빌리티를 제공하는 복호화 방법으로서, 입력되는 비디오 영상에서 복호화하고자 하는 영역에 대한 위치 정보와 복호화하고자 하는 해상도 정보를 입력받는 입력 단계; 전송된 부호화 스트림으로부터 상기 복호화하고자 하는 영역에 대응되는 ROI 정보를 추출하는 단계; 및 상기 추출된 정보를 이용하여 상기 ROI에 대하여 복호화를 수행하는 복호화 단계를 포함한다. 상기 복호화 단계는, 하위 계층의 ROI 정보를 이용하는 인터 레이어 디코딩을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록에 대하여 인트라 레이어 디코딩을 수행한다. 또한, 시간적 상관 관계를 가지는 동일 계층의 ROI 정보를 이용하는 움직임 예측/보상 디코딩(motion prediction/compensation decoding)을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록의 움직임 정보에 대하여 정수 화소(integer pel) 단위 예측을 수행한다. 또한, 하위 계층의 ROI 정보를 이용하는 인터 레이어 디코딩을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록에 대하여 외삽(extrapolaton)을 통해 외부 화소값을 결정한 후 내삽을 수행한다. 또한, 시간적 상관 관계를 가지는 동일 계층의 ROI 정보를 이용하는 움직임 예측/보상 디코딩(motion prediction/compensation decoding)을 수행하는 경우, ROI 외부의 화소를 이용한 내삽(interpolaton)을 필요로 하는 블록에 대하여 외삽(extrapolaton)을 통해 외부 화소값을 결정한 후 내삽을 수행한다. 상기 복호화 단계에서는는, 상기 부호화 스트림에 포함된 나타내기 위한 플래그를 통해 부호화 스트림에 비디오 영상 내 ROI 영역이 존재하는지 여부를 확인한다. 상기 ROI의 위치 정보는 비디오 영상에 대한 매크로 블록 스캔 번호로 표현된다.The present invention also provides a decoding method for providing spatial scalability, comprising: an input step of receiving position information of a region to be decoded and resolution information to be decoded in an input video image; Extracting ROI information corresponding to the region to be decoded from the transmitted encoded stream; And a decoding step of performing decoding on the ROI using the extracted information. In the decoding step, when interlayer decoding using ROI information of a lower layer is performed, intra layer decoding is performed on a block requiring interpolation using pixels outside the ROI. In addition, when performing motion prediction / compensation decoding using ROI information of the same layer having a temporal correlation, the motion information of a block requiring interpolaton using pixels outside the ROI is included. Integer pel unit prediction is performed with respect to each other. In addition, when interlayer decoding using ROI information of a lower layer is performed, an external pixel value is determined through extrapolation for an block requiring interpolaton using pixels outside the ROI and then interpolated. . In addition, when performing motion prediction / compensation decoding using ROI information of the same layer having a temporal correlation, extrapolation (block) that requires interpolaton using pixels outside the ROI is performed. After determining the external pixel value through extrapolaton), interpolation is performed. In the decoding step, it is determined whether a ROI region in a video image exists in the encoded stream through a flag included in the encoded stream. The location information of the ROI is represented by a macro block scan number for the video image.

상술한 본 발명의 내용은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해 질 것이며, 그에 따라 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자가 본 발명의 기술적 사상을 용이하게 실시할 수 있을 것이다. 또한, 본 발명을 설명함에 있어서 본 발명과 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우에 그 상세한 설명을 생략하기로 한다. 이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 일실시예를 상세히 설명하기로 한다.The above-described contents of the present invention will become more apparent through the following detailed description with reference to the accompanying drawings, and thus, those skilled in the art to which the present invention pertains may easily implement the technical idea of the present invention. will be. In addition, in describing the present invention, when it is determined that the detailed description of the known technology related to the present invention may unnecessarily obscure the gist of the present invention, the detailed description thereof will be omitted. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명에 따라 입력 비디오 영상과 상기 영상에 포함된 영역들에 대하여 공간적 스케일러빌리티(spatial scalability)를 제공하는 영상 처리 시스템의 일실시예 구성도이다. 본 발명의 영상 처리 시스템은, 도 1에서 보는 바와 같이, 부호화기(110), 복호화기(120), 사용자 인터페이스(130), 및 표시부(140)를 포함하여 이루어진다.1 is a diagram illustrating an embodiment of an image processing system that provides spatial scalability for an input video image and regions included in the image according to the present invention. As shown in FIG. 1, the image processing system of the present invention includes an encoder 110, a decoder 120, a user interface 130, and a display unit 140.

상기 부호화기(110)는 비디오 영상과 상기 영상 내 부호화 영역(Region of Interest; ROI)에 대한 정보를 입력받아 본 발명에 따른 부호화를 수행함으로써 부호화된 비트 스트림을 출력한다. 상기 부호화 영역에 대한 정보는 영상 내 부호화할 ROI(Region of Interest) 영역의 위치 정보와 부호화 해상도(Resolution)에 대한 정보를 포함하며, 상기 ROI의 영역 인덱스(Region Index)를 추가적으로 포함할 수 있다. 상기 부호화기(110)에서 출력되는 부호화 스트림은 채널(channel)을 통해 복호화기(120)로 전송되며, 상기 복호화기(120)는 사용자에 의해 선택되는 복호화 영역 정보를 사용자 인터페이스부(130)로부터 입력받아 본 발명에 따른 복호화를 수행함으로써 사용자가 선택하는 영역에 대한 영상 신호를 복원한다. 상기 복호화 영역 정보는 사용자가 복호화하고자 하는 영역의 위치 정보를 포함하며, 복호화하고자 하는 해상도에 관한 정보를 추가적으로 포함할 수 있다. 또한, 표시부(140)는 상기 복호화기(120)에서 복원된 영상 신호를 받아 사용자가 선택한 영역의 영상을 디스플레이에 표시한다.The encoder 110 receives a video image and information about a region of interest (ROI) in the image and outputs the encoded bit stream by performing encoding according to the present invention. The information on the encoding region may include location information of a region of interest (ROI) to be encoded in the image and information on encoding resolution, and may further include a region index of the ROI. The encoded stream output from the encoder 110 is transmitted to the decoder 120 through a channel, and the decoder 120 inputs decoding region information selected by the user from the user interface 130. Receiving the decoding according to the present invention restores the video signal for the area selected by the user. The decryption area information may include position information of an area to be decoded by a user, and may further include information regarding a resolution to be decoded. In addition, the display unit 140 receives the image signal reconstructed by the decoder 120 and displays an image of the area selected by the user on the display.

도 2는 본 발명에 따라 입력 비디오 영상과 영상 내 영역들을 부호화하기 위 한 도 1의 부호화기(110)의 일실시예 구성도이다. 도 2에서 보는 바와 같이 본 발명에 따른 부호화기(110)는 비디오 영상과 영상 내 부호화 영역(Region)에 대한 정보를 입력 받아 상기 영상 및 영역들 사이의 중복영역(Overlapped Region; OR)을 검출하는 중복영역(OR) 검출부(210), 상기 검출된 중복영역(OR)을 포함하여 부호화할 영상 내 영역들을 계층(Layer)별로 배치하는 영역(Region) 배치부(220), 및 상기 입력 비디오 영상과 영역들을 부호화함으로써 부호화 스트림을 생성하는 영역(Region) 부호화부(230)를 포함하여 이루어진다. 이하, 상기 부호화기(110)의 중복영역 검출부(210), 영역 배치부(220), 및 영상/영역 부호화부(230)의 구성 및 동작 내용을 도면을 참조하여 설명한다.2 is a diagram illustrating an embodiment of the encoder 110 of FIG. 1 for encoding an input video image and regions within an image according to the present invention. As shown in FIG. 2, the encoder 110 according to the present invention receives information about a video image and an encoding region in the image, and detects an overlapped region (OR) between the image and the regions. A region detector 210, a region arrangement unit 220 for arranging regions in an image to be encoded, including the detected overlap region OR, by layer, and the input video image and region. And a region encoder 230 for generating an encoded stream by encoding the signals. Hereinafter, the configuration and operation of the overlapping region detector 210, the region arrangement unit 220, and the image / region encoder 230 of the encoder 110 will be described with reference to the drawings.

중복영역(OR) 검출부(210)는 입력되는 부호화 영역의 위치 정보를 이용하여 영역(Region)간에 중복되는 영역(Overlapped Region; OR)을 검출하고 상기 검출된 중복 영역에 대하여 새로운 영역 인덱스(Region Index)를 정의한다. 도 5를 참조하여 설명하면, 영역 1(Region 1)과 영역 2(Region 2) 사이에 검출된 중복 영역은 OR 1로 정의되며, 영역 2(Region 2)와 영역 3(Region 3) 사이에 검출된 중복 영역은 OR 2로 정의된다. 한편, 영역 3(Region 3)과 영역 4(Region 4) 사이의 중복 영역은 영역 2(Region 2)와도 중복되는 영역을 포함하고 있다. 이러한 경우, 먼저 동일한 공간 해상도를 가지는 영역 2(Region 2)와 영역 3(Region 3) 사이의 중복 영역을 검출하여 OR 2로 정의한 후, 서로 다른 공간 해상도를 가지는 영역 3(Region 3)과 영역 4(Region 4) 사이에 검출되는 중복 영역 가운데 상기 OR 2를 제외한 영역을 OR 3으로 정의한다. 결과적으로, 영역 3(Region 3)과 영역 4(Region 4) 사이에 검출되는 중복 영역은 OR 2와 OR 3으로 구성된다. The overlapped region (OR) detector 210 detects an overlapped region (OR) between regions by using the input position information of the encoded region, and generates a new region index for the detected overlapped region. ). Referring to FIG. 5, a duplicate region detected between Region 1 and Region 2 is defined as OR 1, and is detected between Region 2 and Region 3. Overlapping regions are defined as OR 2. On the other hand, the overlapping region between the region 3 (Region 3) and the region 4 (Region 4) includes a region also overlapping with the region 2 (Region 2). In this case, first, a duplicate region between the region 2 and the region 3 having the same spatial resolution is detected and defined as OR 2, and then the region 3 and the region 4 having different spatial resolutions are defined. Among the overlapping regions detected between (Region 4), the region except for the OR 2 is defined as OR 3. As a result, the overlapping region detected between the region 3 (Region 3) and the region 4 (Region 4) is composed of OR 2 and OR 3.

도 3은 본 발명에 따른 중복영역 검출부(210)의 동작을 설명하기 위한 일실시예 흐름도이다. 도면에서 보듯이, 중복영역 검출부(210)는 부호화 영역 정보를 이용하여 영상 내 모든 영역(Region)들의 중복 영역을 검출(S310)하고, 상기 검출되는 중복 영역 사이의 중복 영역도 모두 검출한다(S320). 그리고 검출된 중복 영역에 대하여 새로운 영역 인덱스(Region Index)를 정의한다(S330). 상기 OR 3에 대한 인덱스 정의 방법에서 설명한 바와 같이 단계 S330에서 새로운 영역 인덱스를 정의함에 있어서 서로 다른 공간 해상도를 가지는 영역 사이에 검출되는 중복 영역이 동일한 공간 해상도를 가지는 영역 사이에 검출되는 중복 영역(OR 2)을 포함하는 경우에는, 상기 서로 다른 공간 해상도를 가지는 영역 사이에 검출되는 중복 영역 가운데 동일한 공간 해상도를 가지는 영역 사이에 검출되는 중복 영역(OR 2)을 제외한 영역에 대하여 새로운 영역 인덱스(OR 3)를 정의한다.3 is a flowchart illustrating an operation of the overlapped area detector 210 according to the present invention. As shown in the drawing, the overlapped area detector 210 detects overlapped areas of all regions in the image using the encoded area information (S310), and also detects all overlapped areas between the detected overlapped areas (S320). ). A new region index is defined for the detected overlapping region (S330). As described in the index definition method for OR 3, in defining a new region index in step S330, an overlapping region (OR) detected between regions having different spatial resolutions is detected between regions having the same spatial resolution (OR). 2), a new region index (OR 3) for the region except for the overlapping region (OR 2) detected between the regions having the same spatial resolution among the overlapping regions detected between the regions having different spatial resolutions. ).

도 4는 상기 단계 S310에서 중복 영역을 검출하는 과정을 설명하기 위한 일실시예 흐름도로서, 단계 S320의 중복 영역 검출에도 동일하게 적용된다. 우선, 입력된 부호화 영역 정보(ROI 영역 정보)를 이용하여 임의의 두 영역(Region)이 선택(S410)되면, 선택된 두 영역 사이에 중복 영역(OR)이 존재하는지를 판단한다(S420). 중복 영역이 존재하는 경우에는 두 영역의 공간 해상도가 동일한지를 판단(S430)하며, 동일한 경우에는 중복되는 영역을 중복 영역(OR)으로 구성한다(S460). 반면 두 영역의 공간 해상도가 서로 다른 경우에는 낮은 공간 해상도를 가지는 영 역이 높은 공간 해상도를 가지는 영역에 완전히 포함되는지를 판단(S440)하고, 포함되지 않는 경우에는 낮은 공간 해상도를 가지는 영역 가운데 높은 공간 해상도를 가지는 영역과 중복되는 영역을 중복 영역(OR)로 구성한다(S450). 그리고, 선택되지 않았던 두 영역(Region)의 쌍(pair)을 선택하여 상기 과정을 반복한다(S470, S480).FIG. 4 is a flowchart illustrating an example of detecting a duplicate area in step S310. The same applies to detection of the duplicate area in step S320. First, when any two regions (Region) are selected using the input encoding region information (ROI region information) (S410), it is determined whether a duplicate region (OR) exists between the two selected regions (S420). If the overlapping region exists, it is determined whether the spatial resolutions of the two regions are the same (S430). If the overlapping region is the same, the overlapping region is configured as the overlapping region OR (S460). On the other hand, if the spatial resolutions of the two areas are different from each other, it is determined whether the region having the low spatial resolution is completely included in the region having the high spatial resolution (S440). A region overlapping with a region having a resolution is configured as a redundant region OR (S450). Then, the process is repeated by selecting a pair of two regions that are not selected (S470 and S480).

영역(Region) 배치부(220)는 영역(Region 및 Overlapped Region)들을 해상도에 따라 계층(Layer)별로 배치한다. 입력 비디오 영상은 상기 계층별 배치에 있어서 가장 높은 해상도를 가지는 하나의 영역으로 취급된다. The region placement unit 220 arranges regions and overlapped regions for each layer according to the resolution. The input video image is treated as one area having the highest resolution in the hierarchical arrangement.

본 발명은 중복 영역(OR)을 처리하는 방식에 따라 두 가지로 나뉘어지며, 도 1의 동영상 처리 시스템에 있어서 부호화기(110), 복호화기(120), 및 표시부(140)의 동작 내용은 상기 중복 영역(OR)의 처리 방식에 따라 상이하게 이루어진다. 상기 두 가지 방식 가운데 첫 번째 방식은 중복 영역이 존재하는 경우 새로운 계층(Layer)을 구성하여 상기 중복 영역을 배치하고, 하나의 계층을 부호화함에 있어 복수개의 관련된 하위 계층의 정보를 이용하는 다중 계층 기반 부호화 방식(multi-layer based coding method)이다. 그리고 두 번째 방식은 중복 영역을 위하여 별도의 중복 계층을 구성하지 않으며, 하나의 계층을 부호화함에 있어 한 단계 아래의 하위 계층 정보만을 이용하는 일 계층 기반 부호화 방식(one-layer based coding method)이다. The present invention is divided into two types according to a method of processing an overlapped area OR. In the video processing system of FIG. 1, operations of the encoder 110, the decoder 120, and the display 140 are duplicated. It is made different according to the processing method of the area OR. In the first of the two methods, when there is a duplicate area, a multi-layer based encoding using a plurality of related lower layers in configuring a new layer and arranging the duplicate area and encoding one layer is performed. Multi-layer based coding method. The second method is a one-layer based coding method that does not form a separate redundant layer for the overlapping region and uses only lower layer information one step below in encoding one layer.

도 5는 본 발명의 일실시예에 따라 다중 계층 기반 부호화 방식이 적용되는 경우의 영역(Video Image, Region 및 Overlapped Region)들에 대한 계층(Layer)별 배치도며, 도 6은 본 발명의 일실시예에 따라 일 계층 기반 부호화 방식이 적용되는 경우의 영역(Video Image, Region 및 Overlapped Region)들에 대한 계층(Layer)별 배치도이다. 도 5와 도 6에서 영역 2(Region 2)와 영역 3(Region 3)은 동일한 해상도를 가지며, 영역 4(Region 4)는 상기 두 영역(Region 2, Region 3)보다 높은 해상도를 가진다. 또한, 영역 1(Region 1)은 영역 4(Region 4)보다 더 높은 해상도를 가지며, 입력 비디오 영상은 가장 높은 해상도를 가지는 영역으로 취급된다. 또한 제1 계층(Layer 1), 제2 계층(Layer 2), 제3 계층(Layer 3), 제4 계층(Layer 4), 및 제5 계층(Layer 5)은 계층 인덱스(Layer Index)가 각각 1 ~ 5 라는 것을 의미하며, 상기 계층 인덱스(Layer Index)가 높은 계층일수록 공간 해상도가 높은 상위 계층에 해당한다.FIG. 5 is a layout diagram according to layers for regions (video image, regions, and overlapped regions) when a multi-layer based coding scheme is applied according to an embodiment of the present invention. FIG. 6 is an embodiment of the present invention. According to an example, this is a layout diagram for each layer of regions (Video Image, Region, and Overlapped Regions) when one layer-based encoding is applied. 5 and 6, region 2 and region 3 have the same resolution, and region 4 has a higher resolution than the two regions 2 and 3. Also, region 1 has a higher resolution than region 4, and the input video image is treated as the region having the highest resolution. In addition, the first layer (Layer 1), the second layer (Layer 2), the third layer (Layer 3), the fourth layer (Layer 4), and the fifth layer (Layer 5) has a layer index, respectively 1 to 5, and a higher layer index corresponds to a higher layer having a higher spatial resolution.

영역(Video Image, Region 및 Overlapped Region)들을 계층(Layer)별로 배치하는 방법을 도 5 및 도 6를 참조하여 설명하면, 높은 해상도를 가지는 영역이 상위 계층(Layer)에 배치되며 낮은 해상도를 가지는 영역은 하위 계층(Layer)에 배치된다. 또한, 동일한 해상도를 가지는 영역들은 동일한 계층(Layer)에 배치되며, 가장 높은 해상도를 가지는 입력 비디오 영상은 하나의 영역으로 취급되어 최상위 계층(Layer)에 배치된다.A method of arranging regions (Video Image, Region, and Overlapped Regions) by layer will be described with reference to FIGS. 5 and 6, where a region having a high resolution is disposed in a higher layer and has a lower resolution. Is placed in the lower layer. In addition, regions having the same resolution are arranged in the same layer, and an input video image having the highest resolution is treated as one region and arranged in the uppermost layer.

한편, 중복 영역이 존재하는 경우에 상기 중복 영역을 배치하는 방법은 다중 계층 기반 부호화 방식(도 5 참조)인지 일 계층 기반 부호화 방식(도 6 참조)인지에 따라 상이하다. 우선, 다중 계층 기반 부호화 방식의 경우에는 도 5에서 보는 바와 같이, 중복되는 영역의 해상도와 동일한 해상도를 가지는 영역이 배치된 계층(Layer)과 한 단계 아래의 계층(Layer) 사이에 중복 영역(OR)을 배치하기 위한 중복영역 레이어(OR Layer)를 새롭게 정의하고, 상기 중복영역 레이어(OR Layer)에 중복 영역(OR)을 배치한다. 반면, 일 계층 기반 부호화 방식의 경우에는 도 6에서 보는 바와 같이, 중복되는 영역의 해상도와 동일한 해상도를 가지는 영역이 배치된 계층(Layer)에 중복 영역(OR)을 배치한다.On the other hand, when there is a duplicate region, the method of disposing the duplicate region differs depending on whether a multi-layer based encoding scheme (see FIG. 5) or a one-layer based encoding scheme (see FIG. 6) is used. First, in the case of a multi-layer based coding scheme, as shown in FIG. 5, an overlapped region OR is formed between a layer in which a region having the same resolution as that of the overlapping region is disposed and a layer below one level. ) Is newly defined, and an overlapping area (OR) is disposed in the overlapping layer (OR layer). On the other hand, in the case of the hierarchical coding scheme, as shown in FIG. 6, the overlapping region OR is disposed in a layer in which a region having the same resolution as that of the overlapping region is disposed.

도 7은 다중 계층 기반 부호화 방식인 경우에 영역의 계층별 배치 방법을 설명하기 위한 일실시예 흐름도이다. 먼저, 부호화할 영역(입력비디오 영상과 중복 영역을 포함) 가운데 가장 공간 해상도가 낮은 영역(Region)을 선택(S710)하고 현재 계층(Layer)의 인덱스(Index)를 1로 초기화한다(S720). 그리고 현재 선택된 영역과 동일한 공간 해상도를 가지는 중복 영역(OR)이 존재하는지를 확인(S730)한다. 확인 결과 동일한 공간 해상도를 가지는 중복 영역이 존재하는 경우, 상기 중복 영역(OR)을 현재 계층에 배치한 후 중복 영역이 아닌 영역을 한 단계 상위 계층(Layer)에 배치한다(S750, S760). 반면, 동일 해상도를 가지는 중복 영역(OR)이 존재하지 않는 경우에는 선택된 영역과 동일한 해상도를 가지는 영역(Region)들을 현재 계층(Layer)에 배치한다(S760). 이어서, 현재 선택된 영역보다 높은 공간 해상도를 가지는 영역이 존재하는지를 확인(S770)하고, 존재하는 경우에는 배치되지 않은 영역 가운데 가장 해상도가 낮은 영역(Region)을 선택(S780)하여 계층 인덱스(Layer Index)를 1 증가(S790)시킨 후 상기 영역 배치 과정을 반복한다.FIG. 7 is a flowchart illustrating a method of arranging regions according to layers in a multi-layer based encoding scheme. First, a region having the lowest spatial resolution among a region to be encoded (including an input video image and a duplicate region) is selected (S710), and an index of the current layer is initialized to 1 (S720). In operation S730, it is determined whether a redundant region OR having the same spatial resolution as the currently selected region exists. As a result of the check, when there are overlapping regions having the same spatial resolution, the overlapping region OR is disposed in the current layer and then an area which is not the overlapping region is disposed in an upper layer (S750, S760). On the other hand, if there is no overlapping area OR having the same resolution, regions having the same resolution as the selected area are arranged in the current layer (S760). Subsequently, it is checked whether there is an area having a higher spatial resolution than the currently selected area (S770), and if present, a region having the lowest resolution among the unplaced areas is selected (S780), and a layer index is selected. After increasing by 1 (S790), the area arrangement process is repeated.

도 8은 일 계층 기반 부호화 방식인 경우에 영역의 계층별 배치 방법을 설명하기 위한 일실시예 흐름도이다. 먼저, 부호화할 영역(입력비디오 영상과 중복 영역을 포함) 가운데 가장 공간 해상도가 낮은 영역(Region)을 선택(S810)하고 현재 계층(Layer)의 인덱스(Index)를 1로 초기화한다(S820). 그리고 동일한 공간 해상도를 가지는 영역(Region)들을 현재 계층에 배치한다(S830). 이어서, 계층 인덱스(Layer Index)를 1 증가(S840)시킨 후 현재 선택된 영역보다 높은 공간 해상도를 가지는 영역이 존재하는지를 확인(S850)하고, 존재하는 경우에는 배치되지 않은 영역 가운데 가장 해상도가 낮은 영역(Region)을 선택(S8600)하여 상기 영역 배치 과정을 반복한다.8 is a flowchart illustrating a method of arranging regions according to layers in the case of a layer-based encoding scheme. First, a region having the lowest spatial resolution among a region to be encoded (including an input video image and a duplicate region) is selected (S810), and the index of the current layer is initialized to 1 (S820). Regions having the same spatial resolution are placed in the current layer (S830). Subsequently, after increasing the layer index by 1 (S840), it is checked whether there is an area having a higher spatial resolution than the currently selected area (S850), and if present, the area having the lowest resolution among the unplaced areas ( Region (S8600) and repeats the region arrangement process.

영역 부호화부(230)는 입력 비디오 영상과 영역들을 부호화하여 부호화 스트림을 생성한다. 상기 영역 부호화부(230)의 구성 및 동작은 중복 영역(OR)을 처리하는 방식에 따라 상이하게 이루어지므로, 이하 다중 계층 기반 부호화 방식이 적용되는 경우와 일 계층 기반 부호화 방식이 적용되는 경우로 나누어 설명한다.The region encoder 230 generates an encoded stream by encoding the input video image and the regions. Since the configuration and operation of the region encoder 230 are made differently according to the method of processing the overlapped region OR, it is divided into a case where the multi-layer based coding scheme is applied and a case where the one-layer based coding scheme is applied. Explain.

도 9는 본 발명에 따라 다중 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부(230)의 일실시예 구성도이다. 영역 배치부(220)로부터 비디오 영상과 영역 정보가 입력되면, 영역 부호화부(230)는 도 11 및 도 12에 도시된 절차에 따라 계층별 영역에 대한 부호화를 수행한다(비디오 영상도 하나의 영역으로 취급하여 부호화가 수행된다). 상기 입력되는 영역 정보에는 부호화를 수행할 영역(중복 영역을 포함)에 대한 정보 및 상기 영역의 계층(Layer)별 배치 정보가 포함된다. 각 계층의 영역을 부호화하는 엔코더(920, 940, 950)는 상기 영역 정보를 입력받아 도 12에 도시된 절차에 따라 선택 영역의 블록(block)에 대한 부호화 모드(coding mode)를 결정하고, 상기 결정된 모드에 따른 부호화 스트림을 스트림 다중화기(Stream MUX)로 출력한다. 도 9에서 계층 1에 대한 엔코더(Layer 1 encoder)는 부호화할 블록이 하위 계층의 영역들과 중복되는 부분이 없으므로 부호화 모드 선택에 있어서 인터 레이어 코딩 모드(inter-layer coding mode)는 제외한다. 따라서, 계층 1에 대한 엔코더(Layer 1 encoder) 다운 샘플링부(910)에서 계층(Layer 1)에 적합한 해상도로 다운샘플링(downsampling)된 비디오 영상을 입력받아 선택 영역의 블록에 대하여 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode) 가운데 하나의 모드로 부호화된 부호화 스트림을 출력한다. 한편, 다른 계층에 대한 엔코더(Layer 2 encoder, Layer 3 encoder)는 각 계층의 해상도에 해당하는 비디오 영상을 입력받아 선택 영역의 블록에 대하여 인터 레이어 코딩 모드(inter-layer coding mode), 움직임 예측/보상 모드(motion prediction/compensation mode), 및 인트라 모드(intra mode) 가운데 하나의 모드로 부호화된 부호화 스트림을 출력한다. 엔코더가 인터 레이어 코딩(inter-layer coding)을 수행하는 경우에는 하위 계층(Layer)의 영역들 가운데 부호화될 블록(block)과 중복되는 영역(Region)의 움직임 정보(motion information), 텍스쳐 정보(texture information), 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 업샘플링(upsampliing)하여 사용한다. 한편, 스트림 다중화기(Stream MUX)(960)는 각각의 계층에 대한 엔코더로부터 출력되는 부호화 스트림을 입력받아 다중화한다.FIG. 9 is a diagram illustrating an embodiment of the region encoder 230 when a multi-layer based encoding scheme is applied according to the present invention. When the video image and the region information are input from the region arranging unit 220, the region encoder 230 performs encoding on a layer-by-layer region according to the procedures illustrated in FIGS. 11 and 12. Coding is performed). The input region information includes information on a region (including a duplicate region) to be encoded and layout information for each layer of the region. The encoders 920, 940, and 950 encoding regions of each layer receive the region information and determine a coding mode for a block of a selected region according to the procedure illustrated in FIG. 12. The encoded stream according to the determined mode is output to a stream multiplexer (Stream MUX). In FIG. 9, the encoder for Layer 1 does not have a portion where a block to be encoded overlaps with regions of a lower layer, and thus, an inter-layer coding mode is excluded in selecting a coding mode. Accordingly, the encoder 1 downsampler 910 receives a downsampled video image at a resolution suitable for the layer 1, and the motion prediction / compensation mode is applied to the block of the selected region. A coded stream encoded in one of a motion prediction / compensation mode and an intra mode is output. On the other hand, the encoder (Layer 2 encoder, Layer 3 encoder) for the other layer receives a video image corresponding to the resolution of each layer, the inter-layer coding mode, motion prediction / A coded stream encoded in one of a compensation mode (motion prediction / compensation mode) and an intra mode is output. When the encoder performs inter-layer coding, motion information and texture information of a region overlapping a block to be encoded among regions of a lower layer. information, and motion compensated residual information are upsampled and used. On the other hand, the stream multiplexer (Stream MUX) 960 receives and multiplexes an encoded stream output from an encoder for each layer.

도 10은 본 발명에 따라 일 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부(230)의 일실시예 구성도이다. 영역 배치부(220)로부터 비디오 영상과 영역 정보가 입력되면, 영상 및 영역 부호화부(230)는 도 13 및 도 14에 도시된 절차에 따라 계층별 영역에 대한 부호화를 수행한다. 상기 입력되는 영역 정보에는 부호화를 수행할 영역(중복 영역을 포함)에 대한 정보 및 상기 영역의 계층(Layer)별 배치 정보가 포함된다. 각 계층의 영역을 부호화하는 엔코더(1020, 1040, 1050)는 상기 영역 정보를 입력받아 도 14에 도시된 절차에 따라 선택 영역의 블록에 대한 부호화 모드를 결정하고, 상기 결정된 모드에 따른 부호화 스트림을 스트림 다중화기(Stream MUX)로 출력한다. 도 10에서 계층 1에 대한 엔코더(Layer 1 encoder)의 동작 내용은 도 9에서 설명한 바와 같다. 한편, 다른 계층에 대한 엔코더(Layer 2 encoder, Layer 3 encoder)가 인터 레이어 코딩(inter-layer coding)을 수행하는 경우, 복수 개의 하위 계층에 존재하는 영역의 정보를 이용할 수 있는 도 9와는 달리, 한 단계 아래 계층에 존재하는 영역(Region)의 움직임 정보(motion information), 텍스쳐 정보(texture information), 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 업샘플링(upsampliing)하여 사용한다. 이 때 하위 계층(Layer)에 중복되는 영역이 존재하지 않는 경우에는 중개 영역(intermediate region)을 이용하여 인터 레이어 코딩(inter-layer coding)을 수행한다. 한편, 스트림 다중화기(Stream MUX)(1060)는 각각의 계층에 대한 엔코더로부터 출력되는 부호화 스트림을 입력받아 다중화한다.FIG. 10 is a diagram illustrating an embodiment of the region encoder 230 when a hierarchical coding scheme is applied according to the present invention. When the video image and the region information are input from the region arranging unit 220, the image and region encoding unit 230 performs encoding on the layered region according to the procedures illustrated in FIGS. 13 and 14. The input region information includes information on a region (including a duplicate region) to be encoded and layout information for each layer of the region. The encoders 1020, 1040, and 1050 encoding regions of each layer receive the region information, determine an encoding mode for a block of a selected region according to the procedure illustrated in FIG. 14, and determine an encoding stream according to the determined mode. Output to the stream multiplexer. Operation of the encoder (Layer 1 encoder) for the layer 1 in FIG. 10 is as described with reference to FIG. On the other hand, when an encoder for another layer (Layer 2 encoder, Layer 3 encoder) performs inter-layer coding (inter-layer coding), unlike FIG. 9 that can use the information of the region present in a plurality of lower layers, The motion information, the texture information, and the motion compensated residual information of a region existing in a hierarchy below one level are upsampled and used. In this case, when there is no overlapping region in the lower layer, inter-layer coding is performed using an intermediate region. On the other hand, the stream multiplexer (Stream MUX) 1060 receives an encoded stream output from an encoder for each layer and multiplexes it.

도 11은 본 발명에 따라 다중 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부(230)의 동작을 설명하기 위한 일실시예 흐름도이다. 상기 영역 배치부(220)로부터 영역(비디오 영상과 중복 영역을 포함)의 계층별 배치 정보가 입력되면 최하위 계층(Layer)의 영역이 선택되어 부호화된다(S1110, S1120). 부호화가 끝나면 동일한 계층(Layer) 내에 다른 영역이 존재하는지를 확인(S1130)하고, 존재하는 경우 그 영역을 선택하여 부호화한다(S1140). 동일한 계층의 영역에 대한 부호화를 마치면 상위 계층(Layer)에 존재하는 영역을 선택하여 부호화한다(S1150, S1160). 11 is a flowchart illustrating an operation of the region encoder 230 when a multi-layer based coding scheme is applied according to the present invention. When the layout information for each layer of the region (including the video image and the overlapped region) is input from the region arrangement unit 220, the region of the lowest layer is selected and encoded (S1110 and S1120). After the encoding is finished, it is checked whether another region exists in the same layer (S1130), and if present, the region is selected and encoded (S1140). After encoding the region of the same layer, the region existing in the upper layer is selected and encoded (S1150, S1160).

단계 S1120에서 이루어지는 부호화는 블록 단위로 이루어지는 블록 기반 비디오 부호화 체계(block-based video coding scheme)를 따르며, 하위 계층(Layer)의 영역 정보를 이용하는 인터 레이어 코딩(inter-layer coding)과 동일한 계층(Layer)의 영역 정보를 이용하는 인트라 레이어 코딩(intra-layer coding) 가운데 하나의 부호화 방법을 사용한다. 인터 레이어 코딩(inter-layer coding)은 MPEG-4 표준 [ISO/IEC 14496-2 (1998)]에서도 제안하고 있는 부호화 방법으로서, 하위 계층(Layer)의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 업샘플링(upsampliing)하여 사용하는 부호화 방법이다. 또한, 인트라 레이어 코딩(intra-layer coding)은 MPEG-4 AVC [ISO/IEC 14496-10: Advanced Video Coding, 2003]에서 제안하고 있는 부호화 방법으로서, 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode)로 나뉘어진다. 결과적 으로 영역 부호화부(230)는 각각의 선택 영역의 블록에 대하여 인터 레이어 코딩 모드(inter-layer coding mode), 움직임 예측/보상 모드(motion prediction/compensation mode), 및 인트라 모드(intra mode) 가운데 하나의 모드로 부호화된 부호화 스트림을 출력한다.The encoding in step S1120 follows a block-based video coding scheme in blocks, and is the same layer as inter-layer coding using area information of a lower layer. A coding method of one of intra-layer coding using region information of the C-based apparatus is used. Inter-layer coding is a coding method that is also proposed in the MPEG-4 standard [ISO / IEC 14496-2 (1998)] and includes motion information and texture information of a lower layer. A coding method using upsampling of information and motion compensated residual information. In addition, intra-layer coding is a coding method proposed by MPEG-4 AVC [ISO / IEC 14496-10: Advanced Video Coding, 2003], and is a motion prediction / compensation mode. It is divided into and intra mode. As a result, the region encoder 230 may select one of an inter-layer coding mode, a motion prediction / compensation mode, and an intra mode for each block of the selected region. Outputs an encoded stream encoded in one mode.

도 12는 본 발명에 따라 다중 계층 기반 부호화 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대하여 출력되는 부호화 스트림의 부호화 모드를 결정하는 방법을 설명하기 위한 일실시예 흐름도다. 상기에서도 설명한 바와 같이 단계 S1120에서 수행하는 선택 영역(Region)에 대한 부호화는 블록 기반 비디오 코딩 체계(block-based video coding scheme)를 따르며, 각 블록(block)에 대하여는 세 가지 모드(inter-layer coding mode, motion prediction/compensation mode, intra mode) 가운데 하나의 모드로 부호화된 부호화 스트림이 출력된다. 우선 인터 레이어 코딩 모드(inter-layer coding mode)를 적용하기 위해서는 각 블록이 하위 계층(Layer)의 어떠한 영역(Region)을 이용하여야 하는지를 알아야 한다. 따라서, 영역 배치부(220)로부터 전달되는 계층별 영역 배치 정보를 이용하여 부호화할 선택 영역의 블록(block)이 하위 계층(Layer)들의 영역(Region)들과 중복되는 영역 부분이 존재하는지를 확인(S1210)하고, 존재하는 경우 중복되는 부분을 가지는 영역들 가운데 가장 계층 인덱스(Layer Index)가 큰 계층(Layer)에 존재하는 영역을 선택한다(S1220). 이어서, 상기 영역에 대하여 중복 영역(OR)이 정의되어 있는지를 확인(1230)하고, 정의된 경우에는 정의된 중복 영역(OR)을 이용하여 인터 레이어 코딩(inter-later coding)을 수행한다(S1240). 반대로 중복 영역(OR)이 정의되지 않은 경우에는 중복되는 부분을 가지는 영역들 가운데 가장 인덱스가 큰 계층에 존재하는 영역을 이용하여 인터 레이어 코딩(inter-later coding)을 수행한다(S1250). 한편, 단계 S1260에서는 부호화할 선택 영역의 블록(block)이 하위 계층(Layer)들의 영역(Region)들과 중복되는 영역 부분이 있는지와 무관하게 상기 블록(block)에 대하여 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode)의 인트라 레이어 코딩(intra-layer coding)을 적용한다. 마지막으로 단계 S1270에서는 상기 세가지 모드의 부호화 수행 결과를 이용하여 비트율(bit rate)을 최소화하는 한가지 모드를 선택한다. 이 때, 단계 S1210에서 부호화할 블록(block)과 하위 레이어(Layer)들의 영역들 사이에 중복되는 영역 부분이 없다고 판단되는 경우에는 모드 선택에 있어서 인터 레이어 코딩 모드(inter-layer coding mode)를 제외시킨다.12 is a flowchart illustrating a method of determining an encoding mode of an encoded stream output for a block of a selected region when a multi-layer based encoding scheme is applied according to the present invention. As described above, the encoding of the selection region performed in step S1120 follows a block-based video coding scheme, and three blocks (inter-layer coding) for each block are performed. A coded stream encoded in one of modes (motion mode, motion prediction / compensation mode, intra mode) is output. First, in order to apply the inter-layer coding mode, it is necessary to know which region of each layer should be used for each block. Therefore, it is checked whether there is an area portion in which a block of a selection area to be encoded overlaps with regions of lower layers using the layer arrangement information for each layer transmitted from the area arrangement unit 220. In operation S1210, an area having a layer index having the largest layer index among the areas having overlapping portions is selected (S1220). Subsequently, it is checked whether the overlapped region OR is defined for the region (1230), and if defined, inter-later coding is performed using the defined overlapped region OR (S1240). ). On the contrary, when the overlapped region OR is not defined, inter-later coding is performed using a region existing in the layer having the largest index among regions having overlapping portions (S1250). Meanwhile, in step S1260, the motion prediction / compensation mode (motion) is performed on the block regardless of whether there is a region portion in which the block of the selection region to be encoded overlaps with the regions of the lower layers. Intra-layer coding of prediction / compensation mode and intra mode is applied. Finally, in step S1270, one mode of minimizing bit rate is selected by using the encoding results of the three modes. In this case, when it is determined in step S1210 that there is no overlapping area between the blocks to be encoded and the areas of the lower layers, the inter-layer coding mode is excluded in mode selection. Let's do it.

도 13은 본 발명에 따라 일 계층 기반 부호화 방식이 적용되는 경우의 영역 부호화부의 동작을 설명하기 위한 일실시예 흐름도이다. 상기 영역 배치부(220)로부터 영역(비디오 영상과 중복 영역을 포함)의 계층별 배치 정보가 입력되면 최하위 계층(Layer)의 영역이 선택되어 부호화된다(S1310, S1320). 부호화가 끝나면 동일한 계층(Layer) 내에 다른 영역이 존재하는지를 확인(S1330)하고, 존재하는 경우 그 영역을 선택하여 부호화한다(S1340). 동일한 계층(Layer) 내에 다른 영역이 존재하지 않는 경우에는 한 단계 아래의 계층에 존재하는 영역들 가운데 현재 계층의 영역과 중복되지 않는 영역이 존재하는지를 확인하며(S1350), 존재하는 경우 상기 한 단계 아래의 계층의 영역들 가운데 현재 계층 계층의 영역과 공간적으로 정합되지 않는 영역에 대하여 중개 영역(Intermediate Region)을 구성한다(S1360). 상기 중개 영역(Intermediate Region)은 상위 계층(Layer)에서 인터 레이어 코딩(inter-layer coding) 부호화를 수행할 때 참조되는 영역이다. 한편, 단계 S1350에서 한 단계 아래의 계층에 존재하는 영역들 가운데 현재 계층의 영역과 중복되지 않는 영역이 존재하지 않는 것으로 확인되면, 상위 계층(Layer)에 존재하는 영역을 선택하여 부호화한다(S1370, S1380). FIG. 13 is a flowchart illustrating an example of an operation of an area encoder when a hierarchical coding scheme is applied according to the present invention. When the layout information for each layer of the region (including the video image and the overlapped region) is input from the region arrangement unit 220, the region of the lowest layer is selected and encoded (S1310 and S1320). After the encoding is finished, it is checked whether another region exists in the same layer (S1330), and if present, the region is selected and encoded (S1340). If no other area exists in the same layer, it is checked whether there exists an area that does not overlap with the area of the current layer among the areas existing in the hierarchy below one step (S1350). Intermediate regions are configured for regions that are not spatially matched with regions of the current hierarchy among the regions of the hierarchy (S1360). The intermediate region is a region that is referred to when inter-layer coding is performed in an upper layer. On the other hand, if it is determined in step S1350 that there is no area that does not overlap with the area of the current layer among areas existing in the layer below one step, the area existing in the upper layer (Layer) is selected and encoded (S1370, S1380).

단계 S1360에서 구성되는 중개 영역(Intermediate Region)은 한 단계 아래의 하위 계층(Layer)에 존재하는 영역(Region)의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 현재 계층(Layer)의 공간해상도에 맞게 구성한 영역으로서, 상기 중개 영역(Intermediate Region)에 대하여 부호화는 수행되지 않는다. 상기와 같이 구성되는 중개 영역(Intermediate Region)은 상위 계층(Layer)에서 인터 레이어 코딩(inter-layer coding) 부호화를 수행할 때 사용된다. 본원 발명의 부호화는 블록 기반 비디오 부호화 체계(block-based video coding scheme)를 따르므로, 중개 영역(Intermediate Region)의 구성에 있어서도 블록 단위의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보가 필요하다. 내삽(interpolation)을 통해 현재 계층(Layer)의 공간 해상도에 맞게 업샘플링(upsampling)된 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보 (motion compensated residual) 정보는, 상위 계층(Layer)에 중개 영역과 정합되는 영역이 존재하는 않는 경우에는 상위 계층(Layer)에서 동일한 방법의 내삽(interpolation)을 통해 새로운 중개영역을 구성하는데 사용될 수 있다. 한편, 상기 움직임 정보(motion information)에는 움직임 벡터(motion vector)와 해당 블록의 부호화 모드(coding mode)(inter-layer coding mode, motion prediction/compensation mode, intra mode)에 대한 정보가 포함된다.The intermediate region configured in step S1360 includes motion information, texture information, and motion compensation residual information of a region existing in a lower layer below one step. Residual) is a region configured with the spatial resolution of the current layer, and encoding is not performed on the intermediate region. The intermediate region configured as described above is used when performing inter-layer coding encoding in an upper layer. Since the encoding according to the present invention follows a block-based video coding scheme, even in the configuration of an intermediate region, motion information, texture information, and motion in units of blocks are used. Motion compensated residual information is needed. Motion information, texture information, and motion compensated residual information upsampled up to the spatial resolution of the current layer through interpolation are higher layers. If there is no area matching the mediation area in the layer, it may be used to construct a new mediation area through interpolation of the same method in the upper layer. On the other hand, the motion information includes a motion vector and information about a coding mode (inter-layer coding mode, motion prediction / compensation mode, intra mode) of the corresponding block.

단계 S1320에서 이루어지는 부호화는, 다중 계층 기반 부호화 방식이 적용되는 경우에서 설명한 바와 동일하게, 블록 단위로 이루어지는 블록 기반 비디오 부호화 체계(block-based video coding scheme)를 따르며, 하위 계층(Layer)의 영역 정보를 이용하는 인터 레이어 코딩(inter-layer coding)과 동일한 계층(Layer)의 영역 정보를 이용하는 인트라 레이어 코딩(intra-layer coding) 가운데 하나의 부호화 방법을 사용한다. 인터 레이어 코딩(inter-layer coding)은 MPEG-4 표준 [ISO/IEC 14496-2 (1998)]에서도 제안하고 있는 부호화 방법으로서, 하위 계층(Layer)의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 업샘플링(upsampliing)하여 사용하는 부호화 방법이다. 또한, 인트라 레이어 코딩(intra-layer coding)은 MPEG-4 AVC [ISO/IEC 14496-10: Advanced Video Coding, 2003]에서 제안하고 있는 부호화 방법으로서, 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode)로 나뉘어진다. 결과적으로 영상 및 영역 부호화부(230)는 각각의 선택 영역의 블록에 대하여 인터 레이 어 코딩 모드(inter-layer coding mode), 움직임 예측/보상 모드(motion prediction/compensation mode), 및 인트라 모드(intra mode) 가운데 하나의 모드로 부호화된 부호화 스트림을 출력한다.The encoding performed in step S1320 follows a block-based video coding scheme made in units of blocks, as described in the case where the multi-layer based coding scheme is applied, and the region information of the lower layer. One coding method of inter-layer coding using intra-layer coding and intra-layer coding using region information of the same layer is used. Inter-layer coding is a coding method that is also proposed in the MPEG-4 standard [ISO / IEC 14496-2 (1998)] and includes motion information and texture information of a lower layer. A coding method using upsampling of information and motion compensated residual information. In addition, intra-layer coding is a coding method proposed by MPEG-4 AVC [ISO / IEC 14496-10: Advanced Video Coding, 2003], and is a motion prediction / compensation mode. It is divided into and intra mode. As a result, the image and region encoder 230 performs an inter-layer coding mode, a motion prediction / compensation mode, and an intra mode for each block of the selected region. mode) outputs an encoded stream encoded in one mode.

도 14는 본 발명에 따라 일 계층 기반 부호화 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대하여 출력되는 부호화 스트림의 부호화 모드를 결정하는 방법을 설명하기 위한 일실시예 흐름도다. 상기에서도 설명한 바와 같이 단계 S1320에서 수행하는 선택 영역(Region)에 대한 부호화는 블록 기반 비디오 코딩 체계(block-based video coding scheme)를 따르며, 각 블록(block)에 대하여는 세 가지 모드(inter-layer coding mode, motion prediction/compensation mode, intra mode) 가운데 하나의 모드로 부호화된 부호화 스트림이 출력된다. 우선 인터 레이어 코딩 모드(inter-layer coding mode)를 적용하기 위해서는 각 블록이 하위 계층(Layer)의 어떠한 영역(Region)을 이용하여야 하는지를 알아야 한다. 따라서, 영역 배치부(220)로부터 전달되는 계층별 영역 배치 정보를 이용하여 부호화할 선택 영역의 블록(block)이 한 단계 아래 계층(Layer)들의 영역(Region)들과 중복되는 영역(Region)이 존재하는지를 확인(S1410)하고, 존재하는 경우 한 단계 아래 계층(Layer)의 중복되는 영역(Region)을 이용하여 인터 레이어 코딩(inter-later coding)을 수행한다(S1420). 상기 중복되는 영역(Region)이 존재하지 않는 경우에는 상기 선택 영역의 블록(block)이 상기 한 단계 아래 계층의 중개 영역(intermediate region)과 중복되는지를 확인하며, 중복되는 경우 상기 중개 영역(intermediate region)을 이용하여 인터 레이어 코딩(inter-later coding)을 수행 한다(S1450). 한편, 단계 S1460에서는 부호화할 선택 영역의 블록(block)이 하위 계층(Layer)들의 영역(Region)들과 중복되는 영역 부분이 있는지와 무관하게 상기 블록(block)에 대하여 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode)의 인트라 레이어 코딩(intra-layer coding)을 적용한다. 마지막으로 단계 S1270에서는 상기 세가지 모드의 부호화 수행 결과를 이용하여 비트율(bit rate)을 최소화하는 모드를 선택한다. 이 때, 단계 S1440에서 부호화할 블록(block)과 하위 레이어(Layer)들의 영역들 사이에 중복되는 영역 부분이 없다고 판단되는 경우에는 단계 S1430에서 모드를 선택함에 있어 인터 레이어 코딩 모드(inter-layer coding mode)를 제외시킨다.FIG. 14 is a flowchart illustrating a method of determining an encoding mode of an encoded stream output for a block of a selection region when a hierarchical encoding scheme is applied according to the present invention. As described above, the encoding of the selection region performed in step S1320 follows a block-based video coding scheme, and three blocks (inter-layer coding) for each block are performed. A coded stream encoded in one of modes (motion mode, motion prediction / compensation mode, intra mode) is output. First, in order to apply the inter-layer coding mode, it is necessary to know which region of each layer should be used for each block. Accordingly, a region in which a block of a selection region to be encoded by using the region arrangement information for each layer transmitted from the region arrangement unit 220 overlaps with regions of layers in one step below. In operation S1410, inter-later coding is performed by using an overlapping region of a layer below one step, in operation S1420. If the overlapping region does not exist, it is checked whether a block of the selection region overlaps with an intermediate region of the hierarchy below one step, and if the region overlaps, the intermediate region In operation S1450, inter-later coding is performed. Meanwhile, in step S1460, the motion prediction / compensation mode (motion) is performed on the block regardless of whether there is a region part in which the block of the selection area to be encoded overlaps with the regions of the lower layers. Intra-layer coding of prediction / compensation mode and intra mode is applied. Finally, in step S1270, a mode for minimizing bit rate is selected using the encoding results of the three modes. In this case, when it is determined in step S1440 that there is no overlapping region between the blocks to be encoded and the areas of the lower layers, the inter-layer coding mode is selected in selecting the mode in step S1430. mode).

도 15는 본 발명의 일실시예에 따라 단계 S1260와 S1460에서 수행되는 움직임 예측/보상 모드(motion prediction/compensation mode), 및 단계 S1240, S1250, S1420, S1450에서 수행되는 인터 레이어 코딩 모드(inter-layer coding mode)에서 사용되는 내삽(interpolation) 방법을 설명하기 위한 도면이다. 회색 처리된 작은 직사각형 영역은 원 영상의 화소(pixel)를 나타내며, 원 영상의 해상도를 2배씩 확장하기 위하여는 상기 화소 사이의 중간 화소(half-pixel) 값을 내삽(interpolation)을 통해 생성하여야 한다. MPEG-4 AVC(Advanced Video Coding)과 SVC(Scalable Video Coding)에서는 중간 화소 움직임 예측(half pel motion estimation)을 위하여 6 탭(tap)을 가지는 FIR 필터를 사용하는데, 동일한 필터가 본원 발명의 내삽(interpolation)에도 적용될 수 있다. [수학식 1]은 본 발명의 움 직임 예측/보상 모드(motion prediction/compensation mode)와 인터 레이어 코딩 모드(inter-layer coding mode)의 내삽(interpolation)에 사용되는 필터 수식의 일례다. 내삽(interpolation) 방법은 기본적으로 중간 화소 내삽(half-pel interpolation)을 통해 영상의 해상도를 2배로 확장하는 방법으로서, [수학식 1]은 연산량과 성능을 고려했을 때 상기와 같은 내삽에 적용 가능한 필터 수식 가운데 하나이다.FIG. 15 illustrates a motion prediction / compensation mode performed in steps S1260 and S1460 and an inter-layer coding mode performed in steps S1240, S1250, S1420, and S1450 according to an embodiment of the present invention. A diagram for describing an interpolation method used in a layer coding mode. The grayed out small rectangular area represents the pixels of the original image, and in order to double the resolution of the original image, a half-pixel value between the pixels must be generated by interpolation. . MPEG-4 Advanced Video Coding (AVC) and Scalable Video Coding (SVC) use an FIR filter with six taps for half pel motion estimation, and the same filter uses the interpolation of the present invention. interpolation). Equation 1 is an example of a filter equation used for interpolation of a motion prediction / compensation mode and an inter-layer coding mode of the present invention. The interpolation method is basically a method of doubling the resolution of an image through half-pel interpolation. [Equation 1] is applicable to the above interpolation considering the amount of computation and performance. One of the filter expressions.

상기 [수학식 1]에서 I(x, y)는 입력 영상에서 (x, y)좌표의 화소값, O(x, y)는 내삽이 수행된 출력 영상에서 (x, y)좌표의 화소값을 의미한다.In Equation 1, I (x, y) is the pixel value of the (x, y) coordinate in the input image, and O (x, y) is the pixel value of the (x, y) coordinate in the interpolated output image. Means.

하지만 상기와 같은 내삽(interpolation)을 수행함에 있어서, 부호화가 수행되는 영역(Regon of Interest; 이하 ROI라 함)은 복호화시 상기 ROI 이외의 영역 정보를 이용하여 복호화되어서는 안된다. 따라서 영역 부호화부(230)에서 ROI 경계 구역에 대한 내삽(interpolation)을 수행함에 있어서 ROI 영역 밖의 화소(pixel)값을 이용하지 않도록 할 필요가 있다. 이를 위해 본원 발명은 두 가지 방법 중 하나 를 사용한다.However, in performing the interpolation as described above, a region in which encoding is performed (hereinafter referred to as ROI) should not be decoded using region information other than the ROI when decoding. Therefore, when interpolation of the ROI boundary region is performed by the region encoder 230, it is necessary not to use a pixel value outside the ROI region. To this end, the present invention uses one of two methods.

첫 번째 방법은 ROI 경계에서 내삽(interplation)을 수행하기 전에 우선 외삽(extrapolation)을 통해 경계 영역 외부의 화소(pixel) 값을 결정한 후 중간 화소 내삽(half-pel interpolation)을 수행하는 것이다. 상기 외삽(extrapolation)에는 영차(zero order) 외삽법 또는 특정 상수값 대입법 등이 사용될 수 있다. 아래의 [수학식 2]는 영차(zero order) 외삽법이 적용하여 ROI 경계면을 고려한 필터의 수식이다.The first method is to determine the pixel value outside the boundary region by extrapolation before performing interpolation at the ROI boundary and then perform half-pel interpolation. In the extrapolation, a zero order extrapolation method or a specific constant value substitution method may be used. Equation 2 below is a filter formula considering the ROI interface by applying zero order extrapolation.

상기와 같이 ROI 경계에서 외삽(extrapolation)을 통해 경계 영역 외부의 화소(pixel) 값을 결정한 후 중간 화소 내삽(half-pel interpolation)을 수행하는 경우, 복호화기에서 채널을 통해 전송되는 영상 내에 ROI가 존재한다는 것을 알 수 있어야 한다. 따라서 영상 및 영역 부호화부로부터 출력되는 부호화 스트림에는 roi_flag(또는 roi_enable 또는 boundary_handling 또는 multiple_roi_flag)와 같은 전송되는 비트 스트림에 독립적으로 복호화가 가능한 ROI 영역이 존재함을 나타내기 위한 플래그(flag)를 추가한다. 상기와 같은 플래그(flag)가 추가된 비트 스트림을 전송받는 복호화기는 상기 플래그(flag) 값을 통해 ROI가 정의되어 있는지를 확인하고, 정의되어 있는 경우에는 상기와 같이 ROI 경계 영역을 고려하여 부호화된 비트 스트림을 복호화할 수 있는 기능을 활성화(enable)한다. 도 16 ~ 도 18은 상기와 같은 영상 내 ROI가 존재함을 나타내기 위한 플래그(flag)가 부호화 스트림에 추가되는 예를 설명하기 위한 도면이다. 우선, 도 16은 ROI가 슬라이스 그룹 맵 타입 2(slice_group_map_type 2)로 생성되는 경우에 있어서 roi_flag(또는 roi_enable 또는 boundary_handling 또는 multiple_roi_flag)가 추가(1610)된 SVC 비트 스트림의 예를 보여준다. 도 17은 ROI를 위한 새로운 슬라이스 그룹 맵 타입(slice_group_map_type)을 생성하여 상기 슬라이스 그룹 맵 타입에서 roi_flag(또는 roi_enable 또는 boundary_handling 또는 multiple_roi_flag)를 활성화(enable)(1720)하는 예를 보여준다. 즉, 현재 존재하는 슬라이스 그룹 맵 타입(slice_group_map_type)을 확장하여 슬라이스 그룹 맵 타입을 6보다 큰 정수로 하여 생성하며, 슬라이스 그룹 맵 타입을 판단하기 전에 상기 roi_flag(또는 roi_enable 또는 boundary_handling 또는 multiple_roi_flag)를 비활성화(disable)(1710)하고, 상기 ROI를 위한 새로운 슬라이스 그룹 맵 타입에서 상기 플래그를 활성화(enable)(1720)한다. 이 때 슬라이스 그룹 맵의 생성은 종래 슬라이스 그룹 맵 타입(slice_group_map_type)이 2인 경우와 동일하게 한다. 도 18은 도 16과 마찬가지로 ROI가 슬라이스 그룹 맵 타입 2(slice_group_map_type 2)(1820)로 생성되는 경우이다. 도 18에서는 roi_flag(또는 roi_enable 또는 boundary_handling 또는 multiple_roi_flag)가 활성화(enable)되는 경우, num_rois_minus1, roi_top_left, 및 roi_bottom_right 변수를 통해 ROI의 개수와 좌표 등의 기하학적 정보를 읽고, ROI 경계(boundary)와 일치하는 슬라이스 그룹 경계(slice group boundary)에 대하여 ROI 경계면을 고려한 내삽(interpolation)(즉, 업샘플링)을 수행한다. 상기 ROI의 위치를 표현함에 있어서는 좌표를 사용하지 않고, 전체 영상의 매크로 블록(macro block)에 대하여 레스터 스캔(rester scan) 순서로 번호를 부여한 후 ROI영역이 시작되는 매크로 블록(macro block) 번호와 ROI 영역이 끝나는 매크로 블록(macro block) 번호를 사용하여 ROI의 위치를 표현할 수 있다.As described above, when the pixel value outside the boundary area is determined through extrapolation at the ROI boundary and then half-pel interpolation is performed, the ROI in the image transmitted through the channel is decoded by the decoder. You must know that it exists. Therefore, a flag is added to the encoded stream output from the image and region encoder to indicate that there is an ROI region that can be independently decoded in the transmitted bit stream such as roi_flag (or roi_enable or boundary_handling or multiple_roi_flag). The decoder receiving the bit stream to which the above flag is added checks whether the ROI is defined through the flag value, and if so, the decoder is encoded in consideration of the ROI boundary region as described above. Enables the ability to decode bit streams. 16 to 18 are diagrams for describing an example in which a flag for indicating that an ROI in an image exists as described above is added to an encoded stream. First, FIG. 16 shows an example of an SVC bit stream in which roi_flag (or roi_enable or boundary_handling or multiple_roi_flag) is added 1610 when the ROI is generated as slice_group_map_type 2. 17 shows an example of generating a new slice group map type (slice_group_map_type) for ROI and enabling 1720 roi_flag (or roi_enable or boundary_handling or multiple_roi_flag) in the slice group map type. That is, the slice group map type (slice_group_map_type) is expanded to generate a slice group map type as an integer greater than 6, and the roi_flag (or roi_enable or boundary_handling or multiple_roi_flag) is disabled before determining the slice group map type. 1710 and enable 1720 the flag in the new slice group map type for the ROI. At this time, the generation of the slice group map is the same as the case of the conventional slice group map type (slice_group_map_type). FIG. 18 illustrates a case in which an ROI is generated as a slice group map type 2 1820 as in FIG. 16. In FIG. 18, when roi_flag (or roi_enable or boundary_handling or multiple_roi_flag) is enabled, the num_rois_minus1, roi_top_left, and roi_bottom_right variables read geometric information such as the number and coordinates of the ROI, and slices matching the ROI boundary. Interpolation (i.e., upsampling) taking into account the ROI boundary is performed on the slice group boundary. In order to express the position of the ROI, the macro block number where the ROI region starts after the numbers of macro blocks of the entire image are assigned in a raster scan order without using coordinates. The position of the ROI may be expressed by using a macro block number at which the ROI region ends.

한편, 복호화기에서 상기 ROI 외부 영역의 정보를 이용하여 복호화하지 않도록 하기 위한 두 번째 방법은 움직임 예측/보상 모드(motion prediction/compensation mode) 및 인터 레이어 코딩 모드(inter-layer coding mode)에 의한 부호화시 제약 조건을 부가하는 방법이다. 더 구체적으로 설명하면, 단계 S1260와 S1460에서 수행되는 움직임 예측/보상 모드(motion prediction/compensation mode)에서 ROI 경계 밖의 외부 화소를 참조하여 내삽(interpolation)을 수행하여야 하는 경우에는 내삽(interpolation)을 수행하지 않는다. 따라서, ROI 경계를 참조하는 움직임 정보(motion information)은 정수 화소(integer pel) 단위로 예측된다. 또한, 단계 S1240, S1250, S1420, S1450에서 수행되는 인터 레이어 코딩 모드(inter-layer coding mode)에서 ROI 경계에 위치하는 블록(block)에 대하여는 인터 레이어 예측(inter-layer prediction)을 수행하지 않는다. 따라서, ROI 경계에 위치하는 블록(block)은 하위 계층이 아닌 현재 해당하는 계층(Layer)의 정보만을 이용하여 부호화할 수 있는 모드, 즉 움직임 예측/보상 모드(motion prediction/compensation mode) 또는 인트라 모드(intra mode)로 부호화된다. On the other hand, a second method for the decoder not to decode by using the information of the ROI outer region is encoding by a motion prediction / compensation mode and an inter-layer coding mode This method adds a time constraint. More specifically, interpolation is performed when interpolation should be performed by referring to external pixels outside the ROI boundary in the motion prediction / compensation mode performed in steps S1260 and S1460. I never do that. Therefore, motion information referring to the ROI boundary is predicted in integer pixels. In addition, inter-layer prediction is not performed on blocks located at ROI boundaries in the inter-layer coding mode performed in steps S1240, S1250, S1420, and S1450. Therefore, a block located at the ROI boundary can be encoded using only information of the current layer, not a lower layer, that is, a motion prediction / compensation mode or an intra mode. encoded in (intra mode).

도 19와 도 20은 각각 다중 계층 부호화 방식이 적용되는 경우와 일 계층 부호화 방식이 적용되는 경우의 인터 레이어 코딩(inter-layer coding)에 있어서의 영역 간의 부호화 종속성(coding dependency)을 보여주는 도면이다. 도 19과 도 20에서 화살표가 가리키는 영역(Region)을 부호화하기 위해서는 화살표가 시작된 영역(Region)이 필요하다. 19 and 20 are diagrams illustrating coding dependencies between regions in inter-layer coding when the multi-layer coding scheme is applied and when the one-layer coding scheme is applied, respectively. In FIG. 19 and FIG. 20, a region where an arrow starts is needed to encode a region indicated by an arrow.

우선 다중 계층 부호화 방식이 적용되는 경우를 도 19과 도 12를 참조하여 설명하면, 영역 1(Region 1)의 부호화에 있어서 하위 계층의 영역(Region)과 중복 영역이 존재하지 않는 영역(1910)에 속하는 블록(block)들은 움직임 예측/보상 모 드(motion prediction/compensation mode)와 인트라 모드(intra mode) 가운데 비트율을 최소화하는 모드로 부호화된다. 한편, 하위 계층의 영역(Region)과 중복 영역이 존재하는 영역(1920)에 속하는 블록(block)들은 인터 레이어 코딩 모드(inter-layer coding mode), 움직임 예측/보상 모드(motion prediction/compensation mode), 및 인트라 모드(intra mode) 가운데 비트율을 최소화하는 모드로 부호화된다. 이 때, 인터 레이어 코딩 모드(inter-layer coding mode)가 적용되는 경우 영역 1(Region 1)은 영역 2(Region 2)와 중복되는 영역 부분이 존재하며, 상기 중복되는 부분에 대하여 중복 영역 OR 1이 정의되어 있으므로, 상기 하위 계층의 영역(Region)과 중복 영역이 존재하는 영역(1920)에 속하는 블록(block)들은 OR 1을 이용하여 부호화된다.First, a case in which the multi-layer coding scheme is applied will be described with reference to FIGS. 19 and 12. In encoding of region 1, regions 1910 where regions of the lower layer and regions that do not overlap exist do not exist. The belonging blocks are encoded in a mode that minimizes the bit rate among a motion prediction / compensation mode and an intra mode. On the other hand, blocks belonging to the region 1920 where the region of the lower layer and the overlapped region exist are inter-layer coding mode, motion prediction / compensation mode. Are encoded in a mode that minimizes a bit rate among, and intra modes. In this case, when the inter-layer coding mode is applied, region 1 has a region portion overlapping with region 2, and overlap region OR 1 with respect to the overlapping portion. Since is defined, blocks belonging to the region 1920 in which the region of the lower layer and the overlapped region exist are encoded using OR 1.

다음으로, 일 계층 부호화 방식이 적용되는 경우를 도 20과 도 14를 참조하여 설명하면, 영역 1(Region 1)의 부호화에 있어서 하위 계층의 영역(Region)과 중복 영역이 존재하지 않는 영역(2010)에 속하는 블록(block)들은 다중 계층 부호화 방식이 적용되는 경우와 마찬가지로 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode) 가운데 비트율을 최소화하는 모드로 부호화된다. 한편, 하위 계층의 영역(Region)과 중복 영역이 존재하는 영역(2020)에 속하는 블록(block)들은 인터 레이어 코딩 모드(inter-layer coding mode), 움직임 예측/보상 모드(motion prediction/compensation mode), 및 인트라 모드(intra mode) 가운데 비트율을 최소화하는 모드로 부호화된다. 이 때, 인터 레이어 코딩 모드(inter-layer coding mode)가 적용되는 경우 영역 1(Region 1)은 영역 2(Region 2)와 중복되는 영역 부분이 존재하며, 상기 중복되는 부분은 두 단계 아래에 존재하는 계층(Layer 1)에서 중복 영역 OR 1으로 정의되어 있다. 하지만 일 계층 부호화 방식은 한 단계 아래 계층이 영역을 사용하여 인터 레이어 코딩을 수행하므로, 우선 계층 2(Layer 2)에서 OR 1의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 잔여 정보(motion compensated information)을 업샘플링(upsampling)함으로써 중개 영역(Intermediate Region)(2030)을 구성한다. 그리고 계층 3(Layer 3)에서 하위 계층의 영역(Region)과 중복 영역이 존재하는 영역(2020)에 속하는 블록(block)들은 상기 중개 영역(Intermediate Region)(2030)의 정보를 이용하여 부호화된다.Next, a case in which the one-layer coding scheme is applied will be described with reference to FIGS. 20 and 14. In the encoding of region 1, a region where a region of a lower layer does not exist and a duplicate region do not exist (2010). Blocks belonging to the Rx are encoded in a mode that minimizes bit rate among a motion prediction / compensation mode and an intra mode, as in the case where a multi-layer coding scheme is applied. Meanwhile, blocks belonging to the region 2020 in which the region of the lower layer and the overlapped region exist may be inter-layer coding mode or motion prediction / compensation mode. Are encoded in a mode that minimizes a bit rate among, and intra modes. In this case, when an inter-layer coding mode is applied, region 1 has a region portion overlapping with region 2, and the overlapping portion exists under two steps. It is defined as a duplicate area OR 1 in the layer 1. However, in the one-layer coding scheme, since a layer below one step performs inter-layer coding using a region, first, motion information, texture information, and motion residual information of OR 1 in Layer 2 are performed. The intermediate region 2030 is configured by upsampling motion compensated information. In Layer 3, blocks belonging to a region 2020 in which a region of a lower layer and a duplicate region exist are encoded by using the information of the intermediate region 2030.

도 21은 본 발명의 일실시예에 따른 관련된 하위 계층(Layer) 영역(Region)에 대한 정보를 포함하는 부호화 스트림의 신택스 구조를 보여주는 도면이다. 도 11의 단계 S1120, 및 도 13의 단계 S1320에서는 선택된 영역(Region)의 부호화에 사용되는 하위 계층(Layer)의 영역(Region)들에 대한 정보가 부호화 스트림에 추가된다. 도 21을 참조하여 설명하면, 선택 영역(Region)에 대한 부호화 스트림은 상기 선택 영역의 인덱스(Region index)(2110), 상기 선택 영역이 속한 계층의 인덱스(Layer index)(2120), 상기 선택 영역의 부호화에 사용되는 영역의 개수(Number of related Regions)(2130), 상기 부호화에 사용되는 영역들에 대한 정보(related Region 정보)(2140), 및 상기 선택 영역에 대하여 부호화된 비디오 신호(부호화된 Region 비디오 신호)(2150)를 포함한다. 상기 선택 영역의 부호화에 사용되는 영역 들에 대한 정보(related Region 정보)(2140)에는 상기 선택 영역의 부호화에 사용되는 각각의 영역에 대한 정보(2160, 2170, 2180)가 상기 사용되는 영역의 개수(Number of related Regions)(2130)만큼 기록된다. 또한, 상기 부호화에 사용되는 영역에 대한 정보는 상기 부호화에 사용되는 영역의 인덱스(Region index)(2171), 상기 부호화에 사용되는 영역이 속한 계층의 인덱스(Layer index)(2172), 선택 영역에 대한 상기 부호화에 사용되는 영역의 수평축 위치(h_org)(2173), 선택 영역에 대한 상기 부호화에 사용되는 영역의 수직축 위치(v_org)(2174), 상기 부호화에 사용되는 영역의 수평 길이(width)(2175), 및 상기 부호화에 사용되는 영역의 수직 길이(height)(2176)을 포함한다. 상기 영역의 위치를 표현함에 있어서는 좌표값이 아닌 매크로 블록(macro block) 번호가 사용될 수 있다. 즉, 전체 영상의 매크로 블록(macro block)에 대하여 레스터 스캔(rester scan) 순서로 번호를 부여한 후 상기 영역이 시작되는 매크로 블록(macro block) 번호와 상기 영역이 끝나는 매크로 블록(macro block) 번호를 사용하여 상기 영역의 위치를 표현할 수 있다.FIG. 21 is a diagram illustrating a syntax structure of an encoded stream including information about an associated lower layer region, according to an embodiment of the present invention. FIG. In step S1120 of FIG. 11 and step S1320 of FIG. 13, information about regions of a lower layer used for encoding a selected region is added to an encoding stream. Referring to FIG. 21, an encoding stream for a selection region includes a region index 2110 of the selection region, a layer index 2120 of the layer to which the selection region belongs, and the selection region. Number of related Regions 2130 used for the encoding of the information, information on the regions used for the encoding (related Region information) 2140, and a video signal encoded for the selected region (encoded). Region video signal) 2150. In the information (region related information) 2140 of regions used for encoding the selected region, information 2160, 2170, and 2180 for each region used for encoding the selected region is included in the number of regions used. (Number of related Regions) 2130. Further, the information on the region used for encoding is included in the region index 2171 of the region used for encoding, the layer index 2172 of the layer to which the region used for encoding belongs, and the selection region. The horizontal axis position (h_org) 2173 of the region used for the encoding for the selected region, the vertical axis position (v_org) 2174 of the region used for the encoding for the selected region, and the horizontal width (width) of the region used for the encoding ( 2175, and a vertical length 2176 of the region used for the encoding. In representing the location of the region, macro block numbers, not coordinates, may be used. That is, the macro blocks of the entire image are numbered in the order of the raster scan, and then the macro block number at which the region starts and the macro block number at which the region ends. Can be used to represent the location of the region.

한편, 입력 비디오 영상에서 동일한 위치를 가지는 영역(Region)들은 상기 영역들의 공간 해상도가 서로 다른 경우에도 동일한 영역 인덱스(Region index)를 가지도록 한다. 이를 통해 복호화기에서는 특정 영역(Region)의 복호화를 위해 각 계층(Layer)에서 동일한 영역 인덱스(Region index)를 가진 영역의 부호화 스트림을 추출함으로써 상기 특정 영역(Region)을 다양한 공간 해상도로 복호화할 수 있다.Meanwhile, regions having the same position in the input video image have the same region index even when the spatial resolutions of the regions are different. Through this, the decoder can decode the specific region at various spatial resolutions by extracting an encoding stream of a region having the same region index in each layer to decode the specific region. have.

도 22는 본 발명에 따른 부호화 스트림을 복호화하기 위한 도 1의 복호화기(120)의 일실시예 구성도이다. 도 22에서 보는 바와 같이 복호 영역(Region) 추출부(2210)와 영역(Region) 복호화부(2220)로 구성되는 복호화기(120)는, 사용자에 의해 선택되는 복호화 영역 정보를 사용자 인터페이스부(130)로부터 입력받아 부호화기(110)로부터 전송되는 부호화 스트림에 대하여 본 발명에 따른 복호화를 수행함으로써 영상 신호를 복원한다. 상기 복호화 영역 정보는 사용자가 복호화를 위해 선택한 영상 내 영역의 위치 정보를 포함한다. 22 is a block diagram of an embodiment of the decoder 120 of FIG. 1 for decoding an encoded stream according to the present invention. As illustrated in FIG. 22, the decoder 120 including the decoding area extraction unit 2210 and the region decoding unit 2220 may receive the decoding area information selected by the user from the user interface unit 130. Video signal is recovered by performing decoding according to the present invention on the encoded stream received from the encoder 110 and transmitted from the encoder 110. The decoding area information includes location information of an area within an image selected by the user for decoding.

상기 복호 영역(Region) 추출부(2210)는 사용자 인터페이스부(130)로부터 복호화 영역 정보를 입력받아, 채널을 통해 전송되는 부호화 스트림으로부터 상기 복호화 영역을 복호화하기 위하여 필요한 정보를 추출 및 구성한다. 즉, 상기 복호 영역 추출부(2210)는 부호화 스트림으로부터 사용자가 선택한 영역을 포함하는 ROI를 복호화하는데 필요한 하위 계층 영역의 개수(Number of related Regions)(2130)와 각각의 하위 계층 영역의 정보(related Region 정보)(2140)를 읽어들여 상기 ROI 복호화에 필요한 영역(Region)의 목록을 구성하며, 상기 부호화 스트림으로부터 ROI 복호화를 위해 필요한 영역 부분을 추출하여 각 영역에 대한 정보를 구성한다. The decoding region extraction unit 2210 receives decoding region information from the user interface 130, and extracts and configures information necessary for decoding the decoding region from an encoded stream transmitted through a channel. That is, the decoding region extracting unit 2210 may include a number of related regions 2130 and information about each lower layer region required for decoding an ROI including a region selected by a user from an encoded stream. Region information 2140 is read to form a list of regions necessary for the ROI decoding, and a region portion necessary for ROI decoding is extracted from the encoded stream to configure information about each region.

이 때 부호화기(110)에서 전체 영상을 중첩되지 않는 다수의 작은 영역으로 나누어 부호화하는 경우, 복호화기(120)에서 임의의 영역을 선택하여 복호화할 수 있는 인터랙티브(interactive) 복호화가 구현 가능하다. 상기와 같은 부호화는 복 수의 계층(Layer)에서 전체 영상에 대하여 이루어질 수도 있다. 더 구체적으로, 현재 스케일러블 비디오 코딩(Scalable Video Coding; SVC) 체계에서 슬라이스 그룹 맵 타입(slice group map type 2)를 통해 영역(Region)을 구성함에 있어서 아주 작은 사각형 영역으로 구성토록 하고, 복호화시 사용자가 선택한 영역과 대응되는 영역들을 복호화함으로써 인터랙티브(interactive) 복호화를 가능케 할 수 있다. 이 경우, 사용자가 사용자 인터페이스부(130)를 통해 복호화하기를 원하는 임의의 영역을 선택하면 복호 영역 추출부(2210)는 사용자가 선택한 임의 영역과 대응하는 영역들로 ROI 영역을 구성한다. 그리고 채널을 통해 전송된 부호화 스트림으로부터 상기 ROI 영역을 복호화하는데 필요한 하위 계층 영역의 개수(Number of related Regions)(2130)와 각각의 하위 계층 영역의 정보(related Region 정보)(2140)를 읽어들여 복호화에 필요한 영역(Region)의 목록을 구성하며, 상기 부호화 스트림으로부터 상기 ROI 영역을 복호화하는데 필요한 영역 정보를 추출한다.In this case, when the encoder 110 divides and encodes an entire image into a plurality of non-overlapping small regions, interactive decoding may be implemented in which the decoder 120 may select and decode an arbitrary region. Such encoding may be performed on the entire image in a plurality of layers. More specifically, in the current Scalable Video Coding (SVC) scheme, when configuring a region through a slice group map type 2, the region is configured to be a very small rectangular region and decoded. It is possible to enable interactive decoding by decoding regions corresponding to the region selected by the user. In this case, when the user selects an arbitrary area that the user wants to decode through the user interface 130, the decoding area extractor 2210 configures the ROI area into areas corresponding to the selected area selected by the user. The number of related regions 2130 and the related region information 2140 of the lower layer regions required for decoding the ROI region are read and decoded from the encoded stream transmitted through the channel. A list of regions required for the configuration is constructed, and region information necessary for decoding the ROI region is extracted from the encoded stream.

영역(Region) 복호화부(2220)는 상기 복호 영역 추출부(2210)로부터 ROI 영역 복호화에 필요한 영역(Region) 정보를 입력받아 복호화를 수행함으로써 영상 신호를 복원한다. 이하 상기 영역 복호화부(2220)의 동작을 다중 계층 기반 방식과 일 계층 기반 방식이 적용되는 경우로 설명한다.The region decoder 2220 receives region information necessary for decoding the ROI region from the decoding region extractor 2210 and restores the image signal by performing decoding. Hereinafter, an operation of the region decoder 2220 will be described as a case where a multi-layer based method and a one-layer based method are applied.

도 23은 본 발명에 따라 다중 계층 기반 방식이 적용되는 경우의 영역 복호화부의 동작을 설명하기 위한 일실시예 흐름도이다. 상기 복호 영역 추출부(2210) 로부터 복호화에 필요한 영역(Region) 정보가 입력되면 복호화와 관련된 영역 가운데 최하위 계층(Layer)의 영역이 선택되어 복호화된다(S2310, S2320). 복호화가 끝나면 동일한 계층(Layer) 내에 다른 영역이 존재하는지를 확인(S2330)하고, 존재하는 경우 그 영역을 선택하여 복호화한다(S2340). 동일한 계층의 영역에 대한 복호화를 마치면 상위 계층(Layer)에 존재하는 영역을 선택하여 복호화한다(S2350, S2360, S2320). FIG. 23 is a flowchart illustrating an operation of an area decoder when a multi-layer based method is applied according to the present invention. When region information necessary for decoding is input from the decoding region extracting unit 2210, the region of the lowest layer among the regions related to decoding is selected and decoded (S2310 and S2320). After decoding, it is checked whether another region exists in the same layer (S2330), and if present, the region is selected and decoded (S2340). After decoding the region of the same layer, the region existing in the upper layer is selected and decoded (S2350, S2360, and S2320).

단계 S2320에서 이루어지는 복호화는 블록 단위로 이루어지는 블록 기반 비디오 부호화 체계(block-based video coding scheme)를 따르며, 하위 계층(Layer)의 영역 정보를 이용하는 인터 레이어 코딩(inter-layer coding)과 동일한 계층(Layer)의 영역 정보를 이용하는 인트라 레이어 코딩(intra-layer coding) 가운데 하나의 부호화 방식에 대응하는 복호화 방법을 사용한다. 인트라 레이어 코딩(intra-layer coding)은 움직임 예측/보상 모드(motion prediction/compensation mode)와 인트라 모드(intra mode)로 나뉘어지며, 결과적으로 영역 복호화부(2220)는 선택 영역의 각 블록에 대하여 인터 레이어 디코딩(inter-layer decoding), 움직임 예측/보상 디코딩(motion prediction/compensation decoding), 및 인트라 모드 디코딩(intra mode decoding) 가운데 하나의 방법으로 복호화한다. 상기 움직임 예측/보상 디코딩(motion prediction/compensation decoding), 및 인트라 모드 디코딩(intra mode decoding) 방법은 MPEG-4 AVC [ISO/IEC 14496-10: Advanced Video Coding, 2003]에 제시되어 있다. 상기 인터 레이어 디코딩(inter-layer decoding)방법은 MPEG-4 표준 [ISO/IEC 14496-2 (1998)]에 제시되어 있는 복호화 방법으로 서, 하위 계층(Layer)의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 업샘플링(upsampliing)하여 사용하는 복호화 방법이다. Decoding in step S2320 follows a block-based video coding scheme in blocks, and is the same layer as inter-layer coding using area information of a lower layer. A decoding method corresponding to one of the intra-layer coding methods using intra-layer coding is used. Intra-layer coding is divided into a motion prediction / compensation mode and an intra mode. As a result, the region decoder 2220 may interleave each block of the selected region. Decoding is performed by one of inter-layer decoding, motion prediction / compensation decoding, and intra mode decoding. The motion prediction / compensation decoding and intra mode decoding methods are presented in MPEG-4 AVC [ISO / IEC 14496-10: Advanced Video Coding, 2003]. The inter-layer decoding method is a decoding method proposed in the MPEG-4 standard [ISO / IEC 14496-2 (1998)] and includes motion information and texture of a lower layer. It is a decoding method using upsampling of texture information and motion compensated residual information.

도 24는 본 발명에 따라 다중 계층 기반 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대한 부호화 스트림의 복호화 방법을 설명하기 위한 일실시예 흐름도이다. 상기에서도 설명한 바와 같이 단계 S2320에서 수행하는 선택 영역(Region)에 대한 복호화는 블록을 기반으로 하는 체계를 따르며, 각 블록(block)의 부호화 스트림은 세 가지 방법(inter-layer decoding, motion prediction/compensation decoding, intra mode decoding) 가운데 하나의 방법으로 복호화된다. 도 24를 참조하여 본 발명에 따른 선택 영역의 블록에 대한 복호화 방법을 설명하면, 우선 복호화할 블록의 부호화 모드를 확인한다(S2410, S2440). 그리고 복호화를 수행할 블록의 부호화 모드가 인터 레이어 코딩 모드(inter-layer coding mode)인 경우, 현재 복호화할 블록과 중복되는 하위 계층(Layer)의 영역들 가운데 가장 계층 인덱스(Layer Index)가 큰 계층(Layer)에 존재하는 영역을 선택(S2420)하여 인터 레이어 디코딩(inter-layer decoding)을 수행한다(S2430). 복호화를 수행할 선택 블록의 부호화 모드가 움직임 예측 보상 모드(motion prediction/compensation mode)인 경우에는 상기 블록에 대하여 움직임 예측 보상 디코딩(motion prediction/compensation decoding)을 수행하며, 상기 블록의 부호화 모드가 움직임 예측 보상 모드(motion prediction/compensation mode)와 인터 레이어 코딩 모드(inter-layer coding mode) 모두 아닌 경우에는 상기 블록에 대하 여 인트라 모드 디코딩(intra mode decoding)을 수행한다.24 is a flowchart illustrating a method of decoding an encoded stream for a block of a selected region when a multi-layer based scheme is applied according to the present invention. As described above, the decoding of the selection region performed in step S2320 follows a block-based scheme, and the encoding stream of each block has three methods (inter-layer decoding, motion prediction / compensation). decoding, intra mode decoding). Referring to FIG. 24, a decoding method for a block of a selection area according to the present invention will be described. First, the encoding modes of a block to be decoded are checked (S2410 and S2440). When the coding mode of the block to be decoded is the inter-layer coding mode, the layer having the largest layer index among the regions of the lower layer overlapping the current block to be decoded. An area present in the layer is selected (S2420) and inter-layer decoding is performed (S2430). If the encoding mode of the selection block to be decoded is a motion prediction / compensation mode, motion prediction / compensation decoding is performed on the block, and the encoding mode of the block is motion. In the case of neither the motion prediction / compensation mode nor the inter-layer coding mode, intra mode decoding is performed on the block.

도 25는 본 발명에 따라 일 계층 기반 방식이 적용되는 경우의 영역 복호화부의 동작을 설명하기 위한 일실시예 흐름도이다. 상기 복호 영역 추출부(2210)로부터 복호화에 필요한 영역(Region) 정보가 입력되면 복호화와 관련된 영역 가운데 최하위 계층(Layer)의 영역이 선택(S2510)되고, 상기 선택된 영역이 최상위 계층(Layer)에 존재하는 경우에는 바로 상기 선택된 영역에 대하여 복호화를 수행한다(S2550). 상기 선택된 영역이 최상위 계층(Layer)에 존재하지 않는 경우에는 한 단계 아래의 계층에 존재하는 영역들 가운데 현재 계층의 영역과 중복되지 않는 영역이 존재하는지를 확인(S2330)하여, 존재하지 않는 경우에는 바로 상기 선택 영역에 대한 복호화를 수행하며(S2550), 존재하는 경우에는 상기 한 단계 아래의 계층의 영역들 가운데 현재 계층 계층의 영역과 공간적으로 정합되지 않는 영역에 대하여 중개 영역(Intermediate Region)을 구성하여 복호화를 수행한다(S2540, S2550). 복호화가 끝나면 동일한 계층(Layer) 내에 다른 영역이 존재하는지를 확인(S2560)하고, 존재하는 경우 그 영역을 선택하여 복호화한다(S2570). 동일한 계층의 영역에 대한 복호화를 마치면 상위 계층(Layer)에 존재하는 영역을 선택하여 복호화한다(S2580, S2590, S2520). 25 is a flowchart illustrating an operation of an area decoder in the case where a one-layer based scheme is applied according to the present invention. When region information necessary for decoding is input from the decoding region extracting unit 2210, a region of a lower layer among the regions related to decoding is selected (S2510), and the selected region exists in the uppermost layer. In this case, decoding is performed on the selected region (S2550). If the selected area does not exist in the top layer, it is checked whether there exists an area that does not overlap with the area of the current layer among areas existing in the hierarchy below one step (S2330). Decode the selected region (S2550). If present, an intermediate region is formed for a region that is not spatially matched with the region of the current layer among the regions of the layer below the one step. Decryption is performed (S2540, S2550). After decoding, it is checked whether another region exists in the same layer (S2560), and if present, the region is selected and decoded (S2570). After decoding the region of the same layer, the region existing in the upper layer is selected and decoded (S2580, S2590, and S2520).

단계 S2540에서 구성되는 중개 영역(Intermediate Region)은 한 단계 아래의 하위 계층(Layer)에 존재하는 영역(Region)의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 내삽(intepolation)하여 구성되는 영역으로서, 상위 계층(Layer)에서 인터 레이어 디코딩(inter-layer decoding)을 수행할 때 사용된다. 본원 발명의 복호화는 블록을 기반으로 하는 체계를 따르므로, 중개 영역(Intermediate Region)의 구성도 블록 단위로 이루어진다. 상기 움직임 정보(motion information)에는 움직임 벡터(motion vector)와 해당 블록의 부호화 모드(coding mode)(inter-layer coding mode, motion prediction/compensation mode, intra mode)에 대한 정보가 포함된다.The intermediate region configured in step S2540 includes motion information, texture information, and motion compensation residual information of a region existing in a lower layer below one step. A region formed by interpolating residual information and used when inter-layer decoding is performed in an upper layer. Since the decoding of the present invention follows a block-based scheme, the structure of an intermediate region is also formed in units of blocks. The motion information includes a motion vector and information about a coding mode (inter-layer coding mode, motion prediction / compensation mode, intra mode) of the corresponding block.

단계 S2550에서 이루어지는 복호화는 다중 계층 기반 방식이 적용되는 경우에서 설명한 바와 동일하게 블록 단위로 이루어지며, 선택 영역의 각 블록은 인터 레이어 디코딩(inter-layer decoding), 움직임 예측/보상 디코딩(motion prediction/compensation decoding), 및 인트라 모드 디코딩(intra mode decoding) 가운데 하나의 방법으로 복호화된다. 상기 움직임 예측/보상 디코딩(motion prediction/compensation decoding), 및 인트라 모드 디코딩(intra mode decoding) 방법은 MPEG-4 AVC [ISO/IEC 14496-10: Advanced Video Coding, 2003]에 제시되어 있으며, 상기 인터 레이어 디코딩(inter-layer decoding)방법은 MPEG-4 표준 [ISO/IEC 14496-2 (1998)]에 제시되어 있는 복호화 방법으로서 하위 계층(Layer)의 움직임 정보(motion information)와 텍스쳐 정보(texture information) 및 움직임 보상 잔여 정보(motion compensated residual) 정보를 업샘플링(upsampliing)하여 사용하는 방법이다. Decoding in step S2550 is performed in units of blocks as described in the case where the multi-layer-based scheme is applied, and each block of the selection region is inter-layer decoding, motion prediction / compensation decoding. decoding in one of compensation compensation and intra mode decoding. The motion prediction / compensation decoding and intra mode decoding methods are presented in MPEG-4 AVC [ISO / IEC 14496-10: Advanced Video Coding, 2003]. The inter-layer decoding method is a decoding method proposed in the MPEG-4 standard [ISO / IEC 14496-2 (1998)], which is motion information and texture information of a lower layer. And up-sampling of motion compensated residual information.

도 26은 본 발명에 따라 일 계층 기반 방식이 적용되는 경우에 있어서 선택 영역의 블록(block)에 대한 부호화 스트림의 복호화 방법을 설명하기 위한 일실시예 흐름도이다. 일 계층 기반 방식이 적용되는 경우의 복호화 체계는 기본적으로 다중 계층 기반 방식이 적용되는 경우와 동일하지만 인터 레이어 디코딩(inter-layer decoding) 수행시 한 단계 아래의 계층(Layer)만을 참조하는 것이 다르다. 일 계층 기반 방식에서는 한 단계 아래의 계층(Layer)에 중복되는 영역 부분이 존재하지 않는 경우에 한 단계 아래의 계층(Layer)만을 참조하기 위하여 단계 S2540에서 보는 바와 같이 중개 영역(intermediate)을 구성하여 복호화를 수행한다. 도 26를 참조하여 본 발명에 따른 선택 영역의 블록에 대한 복호화 방법을 설명하면, 우선 복호화할 블록의 부호화 모드를 확인한다(S2610, S2640). 그리고 복호화를 수행할 블록의 부호화 모드가 인터 레이어 코딩 모드(inter-layer coding mode)인 경우, 현재 복호화할 블록과 중복되는 영역 부분을 가지는 한 단계 아래 계층의 영역(Region) 또는 단계 S2540에서 구성되는 중개 영역(Intermediate Region)을 이용하여 인터 레이어 디코딩(inter-layer decoding)을 수행한다(S2620, S2630). 복호화를 수행할 선택 블록의 부호화 모드가 움직임 예측 보상 모드(motion prediction/compensation mode)인 경우에는 상기 블록에 대하여 움직임 예측 보상 디코딩(motion prediction/compensation decoding)을 수행하며, 상기 블록의 부호화 모드가 움직임 예측 보상 모드(motion prediction/compensation mode)와 인터 레이어 코딩 모드(inter-layer coding mode) 모두 아닌 경우에는 상기 블록에 대하여 인트라 모드 디코딩(intra mode decoding)을 수행한다.FIG. 26 is a flowchart illustrating a method of decoding an encoded stream for a block of a selected region when a one-layer based scheme is applied according to the present invention. The decoding scheme when the one-layer based scheme is applied is basically the same as the case where the multi-layer based scheme is applied, except that only a layer below one layer is referred to when performing inter-layer decoding. In one layer-based method, when there is no overlapping region part in a layer below one step, an intermediate area is configured as shown in step S2540 to refer only to the layer below one step. Perform decryption. Referring to FIG. 26, a decoding method for a block of a selection area according to the present invention will be described. First, the encoding modes of a block to be decoded are checked (S2610 and S2640). When the coding mode of the block to be decoded is an inter-layer coding mode, the region of one layer below or one region having a region portion overlapping with the block to be decoded is configured in step S2540. Inter-layer decoding is performed using an intermediate region (S2620 and S2630). If the encoding mode of the selection block to be decoded is a motion prediction / compensation mode, motion prediction / compensation decoding is performed on the block, and the encoding mode of the block is motion. In the case of neither the motion prediction / compensation mode nor the inter-layer coding mode, intra mode decoding is performed on the block.

단계 S2450과 S2650의 수행되는 움직임 예측/보상 디코딩(moton predicton decoding)에 있어서 중간 화소 내삽(half pel interpolation)을 수행하는 방법과, 단계 S2430과 S2630의 인터 레이어 디코딩(inter-layer decoding)에 있어서 업샘플링(upsampling)을 수행하는 방법은, 앞서 단계 S1260, S1460, S1240, S1250, S1420, 및 S1450에서 도15를 참조하여 설명한 바와 같다. 상기 단계들(S1260, S1460, S1240, S1250, S1420, 및 S1450)에서 설명한 바와 같이 부호화기(110)에서는 ROI 경계 영역에 대하여 두 가지 처리 방법 가운데 하나의 방법을 사용하며, 상기 부호화기(110)에서 사용되는 방법에 따라 복호화기(120)의 동작도 다르게 이루어진다. A method of performing half pel interpolation in the motion prediction / compensation decoding performed in steps S2450 and S2650, and in the inter-layer decoding in steps S2430 and S2630, The method of performing upsampling is as described above with reference to FIG. 15 in steps S1260, S1460, S1240, S1250, S1420, and S1450. As described in the steps S1260, S1460, S1240, S1250, S1420, and S1450, the encoder 110 uses one of two processing methods for the ROI boundary region and is used by the encoder 110. The operation of the decoder 120 is made differently according to the method.

우선, 부호화기(110)에서 ROI 경계에 대하여 내삽(interplation)을 수행하기 전에 우선 외삽(extrapolation)을 통해 경계 영역 외부의 화소(pixel) 값을 결정한 후 중간 화소 내삽(half-pel interpolation) 또는 업샘플링(upsampling)을 수행하는 경우에는, 복호화기(120)는 채널을 통해 전송되는 부호화 스트림에 포함된 roi_flag(또는 roi_enable 또는 boundary_handling 또는 multiple_roi_flag)와 같은 플래그(flag)를 통해 비트 스트림에 독립적으로 복호화가 가능한 ROI 영역이 정의되어 있는지 여부를 확인한다. 확인 결과 독립적으로 복호화 가능한 ROI 영역이 정의되어 있는 경우, 영역 복호화부(2220)는 상기와 같이 ROI 경계 영역을 고려하여 부호화된 비트 스트림을 복호화할 수 있는 기능(예를 들어, [수학식 2]에 해당하는 필터)을 활성화(enable)한다.First, before performing interpolation on the ROI boundary in the encoder 110, the pixel value outside the boundary area is first determined by extrapolation, and then half-pel interpolation or upsampling is performed. In the case of performing upsampling, the decoder 120 can independently decode the bit stream through a flag such as roi_flag (or roi_enable or boundary_handling or multiple_roi_flag) included in the encoded stream transmitted through the channel. Check whether ROI area is defined. As a result of the check, when an independently decodable ROI region is defined, the region decoder 2220 may decode the encoded bit stream considering the ROI boundary region as described above (for example, [Equation 2]). Enable filter).

한편, 부호화기(110)는 ROI 경계 영역 밖의 화소를 참조하지 않도록 하기 위 하여 ROI 경계 밖의 외부 화소를 참조하여야 하는 경계 영역에 대하여 내삽(interpolation)을 수행하지 않고 ROI 경계에 위치하는 블록(block)에 대하여는 현재 해당하는 계층(Layer)의 정보만을 이용하여 부호화할 수 있는 모드(motion prediction/compensation mode 또는 intra mode)로 부호화하는 제약 조건을 부가할 수 있다. 부호화기(110)에서 상기와 같은 제약 조건을 부가하는 방법이 사용된 경우, 영역 복호화부(2220)는 움직임 예측/보상 디코딩(motion prediction/compensation decoding)시 ROI 경계를 참조하는 움직임 정보(motion information)에 대하여 정수 화소(integer pel) 단위로 예측하며, ROI 경계에 위치하는 블록(block)에 대하여는 인터 레이어 디코딩(inter-layer decoding)이 아닌 현재 해당하는 계층(Layer)의 정보만을 이용하여 복호화할 수 있는 방법(motion prediction/compensation decoding 또는 intra mode decoding)을 사용하여 복호화를 수행한다.On the other hand, the encoder 110 does not interpolate a boundary region that should refer to an external pixel outside the ROI boundary so as not to refer to a pixel outside the ROI boundary region. For this, constraints for encoding in a mode (motion prediction / compensation mode or intra mode) that can be encoded using only information on the current layer may be added. When the method of adding the above constraints in the encoder 110 is used, the region decoder 2220 may motion information referring to the ROI boundary during motion prediction / compensation decoding. Is predicted in integer pixel units, and the block located at the ROI boundary can be decoded using only the information of the current layer, not inter-layer decoding. Decoding is performed using a method (motion prediction / compensation decoding or intra mode decoding).

상술한 바와 같은 본 발명의 방법은 프로그램으로 구현되어 컴퓨터로 읽을 수 있는 형태로 기록매체(씨디롬, 램, 롬, 플로피 디스크, 하드 디스크, 광자기 디스크 등)에 저장될 수 있다. 이러한 과정은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있으므로 더 이상 상세히 설명하지 않기로 한다.As described above, the method of the present invention may be implemented as a program and stored in a recording medium (CD-ROM, RAM, ROM, floppy disk, hard disk, magneto-optical disk, etc.) in a computer-readable form. Since this process can be easily implemented by those skilled in the art will not be described in more detail.

이상에서 설명한 본 발명은, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 있어 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 여러 가지 치환, 변형 및 변경이 가능하므로 전술한 실시예 및 첨부된 도면에 의해 한정되는 것이 아니다.The present invention described above is capable of various substitutions, modifications, and changes without departing from the technical spirit of the present invention for those skilled in the art to which the present invention pertains. It is not limited by the drawings.

상기와 같은 본 발명은, 비디오 영상 내 ROI를 정의함으로써 해상도 뿐만 아니라 공간적으로도 완전한 스케일러블 비디오 부호화/복호화 장치를 제공할 수 있는 효과가 있다. 또한 본 발명은, 복수의 ROI 영역 사이의 계층간 중복성을 고려함으로써 부호화 효율을 높일 수 있는 효과가 있다.As described above, the present invention has the effect of providing a scalable video encoding / decoding apparatus that is not only in resolution but also in space by defining ROI in a video image. In addition, the present invention has the effect of increasing the coding efficiency by considering the inter-layer redundancy between the plurality of ROI regions.

또한, 본 발명은 상기와 같은 복수의 ROI 영역에 관한 정보를 효율적으로 전송할 수 있는 신택스 구조를 가지는 부호화 스트림을 통해, 복호화기에서 손쉽게 해당 영역의 복호화에 필요한 영역 정보를 획득할 수 있도록 한다. 뿐만 아니라, 본 발명은 ROI 플래그를 신택스 구조에 추가하는 방법을 제시함으로써 복호화기에서 독립적으로 복호화가 가능한 ROI 영역이 존재함을 알 수 있도록 하고 관련된 복호화 기능을 활성화할 수 있도록 한다.In addition, the present invention enables a decoder to easily obtain region information necessary for decoding a corresponding region through an encoded stream having a syntax structure capable of efficiently transmitting the information on the plurality of ROI regions as described above. In addition, the present invention provides a method of adding an ROI flag to a syntax structure so that a decoder can recognize that there is an ROI region that can be independently decoded and activate an associated decoding function.

또한, 본 발명은 다른 영역의 정보를 참조해야 하는 영역 부호화시 ROI 경계 영역에 처리 방법을 제시함으로써, 복호화기에서 ROI를 독립적으로 복호화할 있도록 하는 효과가 있다. 특히, 본 발명은 부호화시 ROI 외부 영역의 화소를 이용하여 내삽을 수행해야 하는 경우 내삽을 수행하지 않도록 하는 제약 조건을 부가함으로써, 복호화기에서 별다른 추가적인 구성없이 손쉽게 ROI를 복호화할 수 있도록 하는 효과가 있다.In addition, the present invention has the effect of allowing the decoder to independently decode the ROI by suggesting a processing method in the ROI boundary region during region encoding that should refer to information of another region. In particular, the present invention has an effect of easily decoding the ROI without any additional configuration by adding a constraint that the interpolation is not performed when the interpolation is to be performed using a pixel of the ROI outer region during encoding. have.

Claims

An encoder for encoding a video image,

An OR detector which receives encoding region information on a plurality of regions (ROIs) to be encoded in the video image and detects overlapping regions (ORs) between the ROIs;

A region arranging unit which arranges the video image, the ROI, and the detected OR in a plurality of layers according to a resolution to be encoded; And

And an area encoder which encodes the video image, the ROI, and the detected OR according to the resolution of the layer arranged in the area arranging unit.

The method of claim 1,

The coding region information,

And location information in the video image of the ROI and encoding resolution information of the ROI.

The method of claim 1,

The OR detection unit,

And if the low resolution ROI is not spatially included in the high resolution ROI, detect an OR in the low resolution ROI.

The method of claim 3, wherein

The OR detection unit,

And if the low resolution ROI is spatially included in the high resolution ROI, the encoder is not detected.

The method of claim 3, wherein

The OR detection unit,

When the overlapping part between ROIs having different resolutions includes the overlapping part between ROIs having the same resolution, the part except the overlapping part between ROIs having the same resolution in the overlapping part between ROIs having different resolutions Is an OR.

The method of claim 3, wherein

The area arrangement unit,

And a region having a higher resolution among the video image, the ROI, and the OR treated as one region, and a region having a lower resolution in a lower layer.

The method of claim 6,

The area arrangement unit,

And generating an OR layer under a layer having an ROI having the same resolution as the OR and arranging an OR that is spatially overlapped with the ROI in the generated OR layer.

The method of claim 6,

The area arrangement unit,

And an OR that is spatially overlapped with the ROI in a layer having an ROI having the same resolution as the OR.

The method of claim 6,

The area encoder,

And an inter-layer coding for encoding in units of blocks by using a region arranged in a lower layer.

The method of claim 6,

The area encoder,

And an intra-layer coding for performing motion prediction / compensation on a block basis using a region of the same layer.

The method of claim 7, wherein

The area encoder,

And performing inter-layer coding on the block by using a region disposed in a higher layer among regions of a lower layer having overlapping blocks of a region to be encoded.

The method of claim 11,

The area encoder,

And if an OR is detected with respect to the overlapped part, inter-layer coding of the block is performed using the detected OR.

The method of claim 11,

The area encoder,

And a coded stream encoded in a mode that minimizes a bit rate among inter-layer coding mode, motion prediction / compensation mode, and intra mode for a block of a region to be encoded.

The method of claim 8,

The area encoder,

And performing inter-layer coding on the block by using an area of a first-level lower layer having an overlapping part with a block of an area to be encoded.

The method of claim 14,

The area encoder,

And an intermediary region for reference when performing inter-layer coding in a higher layer by upsampling motion information and texture information of a lower layer region.

The method of claim 15,

The area encoder,

The method of claim 9,

The area encoder,

In the inter-layer coding, when interpolation is to be performed using pixels outside the ROI region, interpolation is performed after determining pixel values outside the ROI region through extrapolation. Encoder.

The method of claim 10,

The area encoder,

In case of performing interpolation using pixels outside the ROI region in the intra-layer coding for performing the motion prediction / compensation, after determining the pixel value outside the ROI region through extrapolation An encoder that performs interpolation.

The method of claim 17 or 18,

The extrapolation is an encoder using a zero-order extrapolation or a constant constant assignment.

The method of claim 17 or 18,

The area encoder,

And a flag indicating that there is an ROI in the video image to an encoded stream.

The method of claim 20,

And the flag is one of roi_flag, roi_enable, boundary_handling, and multiple_roi_flag.

The method of claim 20,

The ROI is generated as a slice group map type 2 (slice_group_map_type 2).

The method of claim 20,

And the ROI is generated as an extended slice group map type (slice_group_map_type) having a type identifier of 6 or more.

The method of claim 20,

The area encoder,

And adding geometric information of the ROI in the video image to an encoded stream.

The method of claim 24,

The geometric information is represented by the num_rois_minus1, roi_top_left, and roi_bottom_right variables.

The method of claim 24,

And the geometric information of the ROI is expressed through a scan number of a macroblock constituting the ROI.

The method of claim 26,

Wherein the scan number is a number generated through a raster scan of a macroblock of the video image.

The method of claim 9,

The area encoder,

And the interpolation is not performed when interpolation is to be performed using pixels outside the ROI region in the inter-layer coding.

The method of claim 9,

And a block located at a ROI boundary in the inter-layer coding performs intra-layer coding.

The method of claim 10,

The area encoder,

And the interpolation is not performed when interpolation is performed using pixels outside the ROI region in the intra-layer coding for performing the motion prediction / compensation.

The method of claim 10,

In case of performing interpolation using pixels outside the ROI region in the intra-layer coding for performing motion prediction / compensation, motion information is predicted in units of integer pels. Encoder.

The method of claim 9,

The area encoder,

And an encoder for outputting an encoding stream including information on lower layer regions required for region encoding.

The method of claim 32,

The encoded stream,

And an information about a region index of an encoding region, a hierarchical index of the encoding region, the number of related regions required for encoding, and a related region required for encoding.

The method of claim 33, wherein

The information about the relevant region required for the encoding is

And an area index, a layer index, and position information of the related region.

The method of claim 34, wherein

The location information of the related area is

And a horizontal axis position, a vertical axis position, a horizontal length, and a vertical length with respect to the encoding region.

The method of claim 34, wherein

The location information of the related area is

And an encoder is represented by a raster scan number of a macroblock in a video image.

A decoding region extracting unit which receives selection information about a region to be decoded and extracts region information necessary for decoding the ROI corresponding to the selection region from an encoding stream including encoding information about a plurality of ROIs; And

And a region decoder configured to receive the extracted region information and to decode the image signal for the ROI in the entire video.

The method of claim 37,

The selection information,

And a position information in the video image of the region to be decoded and resolution information to be decoded.

The method of claim 37,

The decoding region extraction unit,

And extracting the encoded stream of the ROI corresponding to the selected region from the encoded stream, and extracting relevant region information of a lower layer necessary to decode the ROI.

The method of claim 39,

The extracted encoded stream,

And information about the region index of the ROI, the layer index of the ROI, the number of the related regions, and the related regions.

The method of claim 39,

The relevant area information,

And a region index, a layer index, and position information of the related region.

42. The method of claim 41 wherein

The location information of the related area is

And a horizontal axis position, a vertical axis position, a horizontal length, and a vertical length with respect to the ROI region to be decoded.

42. The method of claim 41 wherein

The location information of the related area is

A decoder as represented by a raster scan number for a macroblock in a video image.

The method of claim 39,

The area decoder,

And performing inter-layer decoding on a block-by-block basis using an area of a lower layer in which a region of low resolution is disposed.

The method of claim 39,

The area decoder,

And an intra-layer decoding for performing motion prediction / compensation on a block-by-block basis using an area of the same layer in which an area of the same resolution is disposed.

The method of claim 44,

The area decoder,

And the interpolation is not performed when interpolation is to be performed using pixels outside the ROI region in the inter-layer decoding.

The method of claim 44,

And a block located at an ROI boundary in the inter-layer decoding performs intra-layer decoding.

The method of claim 45,

The area decoder,

The decoder does not perform interpolation when it is necessary to perform interpolation using pixels outside the ROI region in intra-layer decoding for performing motion prediction / compensation. .

The method of claim 45,

In the case of intra-layer decoding for performing motion prediction / compensation, when interpolation is to be performed using a pixel outside the ROI region, motion information is predicted in units of integer pels. Decoder.

46. The method of claim 44 or 45,

The area decoder,

When the flag indicating that there is an ROI in the encoded stream is activated, the geometric information of the fraudulent ROI included in the encoded stream is determined, and the pixel value outside the ROI region is determined through extrapolation and then interpolated. Decoder characterized by.

The method of claim 39,

The area decoder,

If the encoding mode of the block in the region to be decoded is the inter-layer coding mode, the inter-layer of the block is made by using a region disposed at the highest layer among the related regions of the lower layer having an overlapping portion with the block of the region to be decoded. And a decoder for performing inter-layer decoding.

The method of claim 39,

The area decoder,

And a mediation region for reference when performing inter-layer decoding in a higher layer by upsampling motion information and texture information of a lower layer region.

The method of claim 52, wherein

The area decoder,

If the encoding mode of a block in a region to be decoded is an interlayer coding mode, inter-layer decoding of the block using an area of a first-level lower layer having an overlapping portion with a block of the region to be decoded. And a decoder.

The method of claim 37,

And the plurality of ROIs is configured of a small rectangular area to support interactive decoding.

Input means for receiving encoding region information about a plurality of ROIs to encode the input video image; And

Encoding means for performing encoding on the ROI,

The interpolation is not performed when the interpolation is to be performed with reference to the pixels outside the ROI in encoding the ROI.

The method of claim 55,

The encoding means,

Performing motion prediction / compensation coding on the ROI using temporal correlation

Video encoding apparatus characterized in that.

The method of claim 56, wherein

The encoding means,

In the case of performing the motion prediction / compensation coding, when interpolation is to be performed with reference to a pixel outside the ROI, motion information is predicted in an integer pixel unit.

Video encoding apparatus characterized in that.

The method of claim 55,

The coding region information,

And position information in the video image of the ROI and encoding resolution information of the ROI.

The method of claim 55 or 58,

And means for arranging the ROI for each layer according to the resolution to be encoded.

The method of claim 59,

The encoding means,

And an inter-layer coding for encoding an ROI disposed in a higher resolution upper layer by using an area disposed in a lower layer having a lower resolution.

The method of claim 60,

When the encoding means performs interlayer coding on the ROI in units of blocks, a block that needs to be interpolated using pixels outside the ROI region is encoded using intra-layer coding. And a video encoding apparatus.

The method of claim 55,

And the plurality of ROIs comprises a small rectangular area to support interactive decoding.

The method of claim 55,

And the ROI position information included in the encoded stream output from the encoding means is represented by a coordinate value.

The method of claim 55,

ROI position information included in the encoded stream output from the encoding means is represented by a macroblock scan number in the entire video image.

The method of claim 60,

The inter-layer coding is a video encoding apparatus characterized by using an overlapping region in the lower layer and the ROI to be encoded.

66. The method of claim 65,

The overlapping region is configured as OR in a newly generated layer below the lower layer.

66. The method of claim 65,

The overlapping area is configured as OR in the lower layer.

The method of claim 67 wherein

The encoding means,

If there is no spatially overlapping ROI in the upper layer of the ROI or OR, upsampling the ROI or OR texture information of the lower layer to form a mediation area suitable for the resolution of the upper layer.

Video encoding apparatus characterized in that.

The method of claim 68, wherein

The encoding means,

A video encoding apparatus characterized by performing inter-layer coding using an area of a layer below one step.

Input means for receiving selection information on an area to be decoded in the input video image;

Decoding means for performing decoding on the ROI corresponding to the region to be decoded,

The interpolation is not performed when the interpolation is to be performed with reference to a pixel outside the ROI in the decoding of the ROI.

The method of claim 70,

The decoding means,

Performing motion prediction / compensation decoding using temporal correlation on the ROI

Video decoding apparatus characterized in that.

The method of claim 71 wherein

The decoding means,

In the case of performing the motion prediction / compensation decoding, when interpolation is to be performed by referring to a pixel outside the ROI, motion information is predicted in an integer pixel unit.

Video decoding apparatus characterized in that.

The method of claim 70,

The selection information,

And position information in the video image of the region to be encoded and resolution information to encode the region.

The method of claim 70,

The decoding means,

And an inter-layer decoding for decoding an ROI disposed in a higher resolution upper layer by using an area disposed in a lower layer having a lower resolution.

The method of claim 74, wherein

When the decoding means performs interlayer decoding on a block-by-block basis with respect to the ROI, intra-layer coding is performed on a block that needs to be interpolated using pixels outside the ROI region. A video decoding apparatus.

The method of claim 70,

And a plurality of small rectangular ROI regions for decoding the region to be decoded.

The method of claim 70,

And the ROI location information is represented by a macroblock scan number in the entire video image.

The method of claim 74, wherein

The inter-layer decoding is a video decoding apparatus, characterized in that using the ROI to perform decoding and the region (OR) overlapping in the lower layer.

The method of claim 78,

The decoding means,

Video decoding apparatus characterized in that.

80. The method of claim 79 wherein

The decoding means,

And an inter-layer decoding using an area of a layer below one step.

An input step of receiving position information and resolution information of a plurality of ROIs to be encoded with respect to an input video image;

Disposing the ROI for each layer according to a resolution; And

Encoding step of performing block unit encoding on the arranged ROI

An encoding method for providing spatial scalability comprising a.

82. The method of claim 81 wherein

The encoding step,

In case of performing inter-layer coding using ROI information of lower layer, performing intra-layer coding on a block requiring interpolaton using pixels outside the ROI.

An encoding method for providing spatial scalability, characterized in that.

82. The method of claim 81 wherein

The encoding step,

In case of performing motion prediction / compensation coding using ROI information of the same layer having temporal correlation, an integer is obtained for motion information of a block requiring interpolaton using pixels outside the ROI. Performing pixel-by-pixel prediction

An encoding method for providing spatial scalability, characterized in that.

82. The method of claim 81 wherein

The encoding step,

In case of performing inter-layer coding using ROI information of a lower layer, performing interpolation after determining an external pixel value through extrapolation for a block requiring interpolaton using pixels outside the ROI.

An encoding method for providing spatial scalability, characterized in that.

82. The method of claim 81 wherein

The encoding step,

When performing motion prediction / compensation coding using ROI information of the same layer having temporal correlation, extrapolation is performed on blocks requiring interpolaton using pixels outside the ROI. To interpolate after determining external pixel values

An encoding method for providing spatial scalability, characterized in that.

86. The method of claim 84 or 85,

The encoding step,

Adding a flag to the encoded stream to indicate that there is an ROI region in the video image;

A coding method that provides spatial scalability.

82. The method of claim 81 wherein

The location information of the ROI may be represented by a macroblock scan number for the video image.

A coding method that provides spatial scalability.

82. The method of claim 81 wherein

Detecting a duplicate area OR between ROIs;

The encoding step

And inter-layer coding on the ROI using the OR.

89. The method of claim 88 wherein

Configuring an intermediary region through upsampling with respect to a region that is not spatially overlapped with an upper layer of the ROI or the OR;

The encoding step

A coding method for providing spatial scalability, wherein the inter-layer coding is performed by using information of a region disposed in a layer below one step.

An input step of receiving position information on a region to be decoded and resolution information to be decoded in an input video image;

Extracting ROI information corresponding to the region to be decoded from the transmitted encoded stream; And

And a decoding step of performing decoding on the ROI using the extracted information.

92. The method of claim 90,

The decoding step,

When performing interlayer decoding using lower layer ROI information, performing intra layer decoding on a block requiring interpolaton using pixels outside the ROI.

A decoding method for providing spatial scalability, characterized in that.

92. The method of claim 90,

The decoding step,

When performing motion prediction / compensation decoding using ROI information of the same layer having a temporal correlation, an integer is obtained for motion information of a block requiring interpolaton using pixels outside the ROI. Performing pixel-by-pixel prediction

A decoding method for providing spatial scalability, characterized in that.

92. The method of claim 90,

The decoding step,

In the case of performing inter-layer decoding using ROI information of a lower layer, performing interpolation after determining an external pixel value through extrapolation for a block requiring interpolaton using pixels outside the ROI.

A decoding method for providing spatial scalability, characterized in that.

92. The method of claim 90,

The decoding step,

In case of performing motion prediction / compensation decoding using ROI information of the same layer having temporal correlation, extrapolation is performed on blocks requiring interpolaton using pixels outside the ROI. To interpolate after determining external pixel values

A decoding method for providing spatial scalability, characterized in that.

95. The method of claim 93 or 94,

The decoding step,

Determining whether a ROI region in a video image exists in the encoded stream through a flag indicating that there is a ROI in the video image included in the encoded stream.

Decoding Method Provides Spatial Scalability

92. The method of claim 90,

Decoding method that provides spatial scalability.

92. The method of claim 90,

The decryption step

And performing interlayer decoding on the ROI using the overlapped region OR between the ROIs.

97. The method of claim 97,

Configuring an intermediate region through upsampling with respect to an area that is not spatially overlapped with an upper layer of the ROI or the OR;

The decryption step

A decoding method for providing spatial scalability, wherein the inter-layer decoding is performed using information of a region disposed in a layer below one step.